Voice Driven Animation System
|
|
|
- Samuel Boyd
- 10 years ago
- Views:
Transcription
1 Voice Driven Animation System Zhijin Wang Department of Computer Science University of British Columbia Abstract The goal of this term project is to develop a voice driven animation system that could take human voice as commands to generate the desired character animation, based on motion capture data. In this report, the idea of our system is first introduced, followed by a review of background which is related to the project. Then we will talk about the Microsoft Speech API, which is used as the voice recognition engine in our system. Some details of our implementation are then explained and results are given at the end. 1. Introduction In a traditional animation system, the animator must use mouse and keyboard to specify the path along which the character will move and the action that the character will be doing during the movements. This kind of interaction is not very effective indeed because either clicking on some buttons or typing on the keyboards will be distracting for the animator who is trying to focus on creating the animation. To improve the interaction, we can borrow the idea from the filmmaking scenario, where the director uses his voice to tell the actor what to do before the shooting of the scene, and then the actor will perform the action exactly as he was told to. This is how we come up the idea of using human voice as a media to make a better interface for the animation system. 2. Background In 1986, Dr. Jacob Nielsen asked a group of 57 IT professionals to predict what would be the greatest changes in user interfaces by the year The top-five answers were Table 1: User Interfaces Prediction Table [1]
2 While Graphical User Interfaces (GUIs) have clearly been the winner since that time, Voice User Interfaces (VUIs) certainly failed to reach the demand that IT professionals expected. The key issue in interaction design and the main determinant of usability is what the user says to the interface. Whether you say it by speaking or by typing at the keyboard matters less to most users. Thus, having voice interfaces will not necessarily free us from the most substantial part of user interface design: determining the structure of the dialogue, what commands or features are available, how the users are to specify what they want, and how the computer is to communicate the feedback. All that voice does is to allow the commands and feedback to be spoken rather than written. [2] Voice interfaces have their greatest potential in the following cases where it is problematic to rely on the traditional keyboard-mouse-monitor combination: [1] Users with various disabilities that prevent them from using a mouse and/or keyboard or that prevent them from seeing the pictures on the screen. All users, with or without disabilities, whose hands and eyes are occupied with other tasks. For example, while driving a car or while repairing a complex piece of equipment. Users who do not have access to a keyboard and/or a monitor. For instance, users accessing a system through a payphone. So it's not that voice is useless. It's just that it is often a secondary interaction mode if additional media are available. Just as in our system, in addition to using voice commands to select different actions of the character, the user still has to use mouse clicks to specify the location at which the action is taking place. The combination of multiple computer medias proves to provide better interactions for most users. As for the voice recognition system, there are two mainstream products available at the current market. One is IBM ViaVoice Dictation SDK, which is based on many years of development by IBM. The other one is Microsoft Speech API, also known as SAPI. As an SDK toolkit, each of them has their unique features, and it s hard to compare which one is better. However, considering the functionality we re going to use and the relative cost of them, we have decided to use Microsoft Speech API as our voice recognition system in this project. 3. Microsoft Speech API 3.1. API Overview The Microsoft Speech API (SAPI) provides a high-level interface between an application and speech engines. SAPI implements all the low-level details needed to control and manage the real-time operations of various speech engines, thus dramatically reduces the code overhead required for an application to use speech recognition and text-to-speech, making speech technology more accessible and robust for a wide range of applications. The two basic types of SAPI engines are text-to-speech (TTS) systems and speech recognizers. TTS systems synthesize text strings and files into spoken audio using synthetic voices. Speech recognizers convert human spoken audio into readable text strings and files.
3 Figure 1: SAPI Engines Layout [3] For this project, we will only use Speech Recognition Engine to retrieve the voice commands from the user. There are two types of utterances to be recognized by the Speech Engine. The first one is dictation, which means the Speech Engine will try to recognize whatever the user is saying to the microphone. The recognition rate of dictation is usually very low, because the computer doesn't know what to expect from the user's speech without any given context. The other type of recognizable utterances is called command and control grammar, which means the user can tell the engine in advance what kinds of voice commands he will probably say to it, then the speech engine will try to match his speech with one of those commands during run time of the application. This is the way we're doing it because it has a much higher recognition rate than dictation Context-Free Grammar The command and control features of SAPI 5 are implemented as context-free grammars (CFGs). A CFG is a structure that defines a specific set of words, and the combinations of these words that can be used. In basic terms, a CFG defines the sentences that are valid, and in SAPI 5, defines the sentences that are valid for recognition by a speech recognition (SR) engine. The CFG format in SAPI 5 defines the structure of grammars and grammar rules using Extensible Markup Language (XML). The XML format is an "expert only readable" declaration of a grammar that a speech application uses to accomplish the following: Improve recognition accuracy by restricting and indicating to an engine what words it should expect. Improve maintainability of textual grammars, by providing constructs for reusable text components (internal and external rule references), phrase lists, and string and numeric identifiers. Improve translation of recognized speech into application actions. This is made easier by providing "semantic tags," (property name, and value associations) to words/phrases declared inside the grammar. The CFG/Grammar compiler transforms the XML tags defining the grammar elements into a binary format used by SAPI 5-compliant SR engines. This compiling process can be performed either before or during application run time. Since our system does not need to modify the grammar at run time, the compiled binary format is loaded statically before the application run time.
4 4. Development of the Voice Driven Animation System 4.1. Design of Grammar Rules Phase one of the project is designing grammar rules for the voice commands. Here is an example of one of the grammar rules used in our system. <RULE ID="VID_TurnCommand" TOPLEVEL="ACTIVE"> <P>turn</P> <RULEREF REFID="VID_Direction" PROPID="VID_Direction"/> <O>by</O> <O><RULEREF REFID="VID_Degree" PROPID="VID_Degree"/></O> <O>degrees</O> </RULE> <RULE ID="VID_Direction" > <L PROPID="VID_Direction" > <P VAL="VID_Left">left</P> <P VAL="VID_Right">right</P> <P VAL="VID_Around">around</P> </L> </RULE> <RULE ID="VID_Degree" > <L PROPID="VID_Degree" > <P VAL="VID_Ten">ten</P> <P VAL="VID_Twenty">twenty</P> <P VAL="VID_Thirty">thirty</P> <P VAL="VID_Forty">forty</P> <P VAL="VID_Fifty">fifty</P> <P VAL="VID_Sixty">sixty</P> <P VAL="VID_Seventy">seventy</P> <P VAL="VID_Eighty">eighty</P> <P VAL="VID_Ninety">ninety</P> </L> </RULE> According to this grammar rule, if the user says "turn right by 70 degrees", then the speech engine will indicate the application that rule name "VID_TurnCommand" has been recognized, with the property of child rule VID_Direction being "right", and the property of child rule VID_Degree being "seventy". We have also performed basic testing on the grammar rules we have written, using a grammar compiler and tester provided by the SDK toolkit. All of the grammar rules can be recognized from the user s speech very well.
5 4.2. Integration of the Speech Engine Phase 2 of the project is integrating the speech engine into the system, so that we can use the recognized information to generate the desired animation. Before the system gets more complicated, we first tested with simple object and movements, i.e., using voice commands to drive a ball move from one place to another. Here are some snapshots of the running application: Figure 2: A Simple Voice Driven Application The blue ball represents the subject, and the red ball represents the destinations that the subject must pass through in the same order as they were created. The locations of the red balls are specified by the mouse clicks of the user. If the user says move to here, to here, to here while he s doing the mouse clicks, the application will recognize the voice command and start to move the blue ball towards those red balls, once the mouse clicking has ended. When the subject passes through a destination, the red ball will disappear showing that it has been reached, and then the subject will head straight to the next destination again until all the red balls have been reached, as shown in the above images. Although this application may seem simple enough, it demonstrates that the speech recognition engine has been successfully integrated into the windows program and they can work seamlessly together. This makes sure that we can build more complex system on top of the speech engine.
6 4.3. Combination with Motion Capture Data Now we have got the speech engine working properly, we can combine voice recognition with motion capture data to generate the animation of a character driven by voice commands. Here is a snapshot of the interface of our system: Figure 3: Voice Driven Animation System The user can speak a limited set of voice commands to make the character walk in different style, such as walk fast, slow backwards ; and he can use faster or slower commands to control the speed of the walking motion. Besides these, the system also supports directional control, so if the user says turn left (by) sixty (degrees), the character will make a left turn by sixty degrees. The brackets enclosing by and degrees mean that these two words are optional, i.e., the system will recognize user s voice command either with or without those two words being said. When the system is started, a single walk cycle motion of the character is loaded from the motion capture data, and the system basically replays this walk cycle at different speed using different translation and orientation of the character, according to the user s voice commands. As for the change of walking direction, in order to make the rotation smoother, we apply a linear interpolation to the rotation angle, so that the orientation of the character is changed by 10 degrees in each successive frame, until the desired rotation is achieved. In this way the turning action looks more natural than a straight cut to the new walking direction.
7 5. Conclusion and Future Work In this project we have successfully developed a voice driven animation system that allows the user to control the movements of a simple object or a character by using voice commands. It is a new and more efficient type of interaction since the user could just use natural human voice as input media, rather than typing on the keyboard or clicking on some buttons of the interface. Since much effort of this project has been done in exploring the Microsoft Speech API and integrating the speech engine into the animation system, only a few features are provided and a few motions can be generated by the system at the moment of this time. However, this system can be easily expanded to support more kinds of operation, such as blending and transition of different motion clips using motion graph technique, and obstacle avoidance using motion planning algorithms. There are still a lot of work to be done in this area. References [1] Nielsen, J., Will Voice Interfaces Replace Screens, In IBM DeveloperWorks, 1999 [2] Apaydin, O., Networked Humanoid Animation Driven by Human Voice Using Extensible 3D (X3D), H-Anim and JAVA Speech Open Standards. In Thesis of Naval Postgraduate School, 2002 [3] Microsoft Speech SDK 5.1 Documentation, 2004
Design Grammars for High-performance Speech Recognition
Design Grammars for High-performance Speech Recognition Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks
Develop Software that Speaks and Listens
Develop Software that Speaks and Listens Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks or registered
The preliminary design of a wearable computer for supporting Construction Progress Monitoring
The preliminary design of a wearable computer for supporting Construction Progress Monitoring 1 Introduction Jan Reinhardt, TU - Dresden Prof. James H. Garrett,Jr., Carnegie Mellon University Prof. Raimar
Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System
Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Oana NICOLAE Faculty of Mathematics and Computer Science, Department of Computer Science, University of Craiova, Romania [email protected]
A secure face tracking system
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 10 (2014), pp. 959-964 International Research Publications House http://www. irphouse.com A secure face tracking
Standard Languages for Developing Multimodal Applications
Standard Languages for Developing Multimodal Applications James A. Larson Intel Corporation 16055 SW Walker Rd, #402, Beaverton, OR 97006 USA [email protected] Abstract The World Wide Web Consortium
interactive product brochure :: Nina: The Virtual Assistant for Mobile Customer Service Apps
interactive product brochure :: Nina: The Virtual Assistant for Mobile Customer Service Apps This PDF contains embedded interactive features. Make sure to download and save the file to your computer to
Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications
Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications Oxford University Press 2007. All rights reserved. 1 XML An extensible language The
Interactive product brochure :: Nina TM Mobile: The Virtual Assistant for Mobile Customer Service Apps
TM Interactive product brochure :: Nina TM Mobile: The Virtual Assistant for Mobile Customer Service Apps This PDF contains embedded interactive features. Make sure to download and save the file to your
VoiceXML-Based Dialogue Systems
VoiceXML-Based Dialogue Systems Pavel Cenek Laboratory of Speech and Dialogue Faculty of Informatics Masaryk University Brno Agenda Dialogue system (DS) VoiceXML Frame-based DS in general 2 Computer based
Video, film, and animation are all moving images that are recorded onto videotape,
See also Data Display (Part 3) Document Design (Part 3) Instructions (Part 2) Specifications (Part 2) Visual Communication (Part 3) Video and Animation Video, film, and animation are all moving images
Chapter 13: Program Development and Programming Languages
Understanding Computers Today and Tomorrow 12 th Edition Chapter 13: Program Development and Programming Languages Learning Objectives Understand the differences between structured programming, object-oriented
Objects in Alice: Positioning and Moving Them
Objects in Alice: Positioning and Moving Them Download the Alice World that goes along with this tutorial. You will be learning about the objects in Alice, how they are positioned in the space of the Alice
Working With Animation: Introduction to Flash
Working With Animation: Introduction to Flash With Adobe Flash, you can create artwork and animations that add motion and visual interest to your Web pages. Flash movies can be interactive users can click
Maya 2014 Basic Animation & The Graph Editor
Maya 2014 Basic Animation & The Graph Editor When you set a Keyframe (or Key), you assign a value to an object s attribute (for example, translate, rotate, scale, color) at a specific time. Most animation
Real-time Device Monitoring Using AWS
Real-time Device Monitoring Using AWS 1 Document History Version Date Initials Change Description 1.0 3/13/08 JZW Initial entry 1.1 3/14/08 JZW Continue initial input 1.2 3/14/08 JZW Added headers and
Tutorial: Biped Character in 3D Studio Max 7, Easy Animation
Tutorial: Biped Character in 3D Studio Max 7, Easy Animation Written by: Ricardo Tangali 1. Introduction:... 3 2. Basic control in 3D Studio Max... 3 2.1. Navigating a scene:... 3 2.2. Hide and Unhide
The Keyboard One of the first peripherals to be used with a computer and is still the primary input device for text and numbers.
Standard Methods of Input Keyboard Mouse Input device enables you to input information and commands into the computer. The Keyboard One of the first peripherals to be used with a computer and is still
A Web Based Voice Recognition System for Visually Challenged People
A Web Based Voice Recognition System for Visually Challenged People V.Balaji, Dr. S.Kanaga Suba Raja, M.Vivekanandan Assistant Professor, Department of IT, Easwari Engineering College, Chennai, India Associate
set in Options). Returns the cursor to its position prior to the Correct command.
Dragon NaturallySpeaking Commands Summary Dragon Productivity Commands Relative to Dragon NaturallySpeaking v11-12 or higher Dragon Medical Practice Edition and Practice Edition 2 or higher Dictation success
Flash. Using Flash to Teach Mathematics. The competition
Using Flash to Teach Mathematics G. Donald Allen Department of Mathematics Texas A&M University College Station, TX 77843-3368 [email protected] Flash Flash is the ultra-hot animation tool from Macromedia
Mobile Game and App Development the Easy Way
Mobile Game and App Development the Easy Way Developed and maintained by Pocketeers Limited (http://www.pocketeers.co.uk). For support please visit http://www.appeasymobile.com This document is protected
Movement Animset Pro v.1.5
Movement Animset Pro v.1.5 Animations description and usage Idle TurnRt90_Loop TurnLt90_Loop TurnRt180 TurnLt180 WalkFwdLoop WalkFwdStart WalkFwdStart180_R WalkFwdStart180_L WalkFwdStart90_L WalkFwdStart90_R
Microsoft PowerPoint Exercises 4
Microsoft PowerPoint Exercises 4 In these exercises, you will be working with your Music Presentation file used in part 1 and 2. Open that file if you haven t already done so. Exercise 1. Slide Sorter
9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV
Université de Technologie de Compiègne UTC +(8',$6
Specialty Answering Service. All rights reserved.
0 Contents 1 Introduction... 2 1.1 Types of Dialog Systems... 2 2 Dialog Systems in Contact Centers... 4 2.1 Automated Call Centers... 4 3 History... 3 4 Designing Interactive Dialogs with Structured Data...
Enterprise Contact Center
Enterprise Contact Center multi-media platform and applications for contact center and enterprise 1 global switching platform The NobelBiz Enterprise solution is a global switching platform for any media
Avaya Aura Orchestration Designer
Avaya Aura Orchestration Designer Avaya Aura Orchestration Designer is a unified service creation environment for faster, lower cost design and deployment of voice and multimedia applications and agent
CS 4204 Computer Graphics
CS 4204 Computer Graphics Computer Animation Adapted from notes by Yong Cao Virginia Tech 1 Outline Principles of Animation Keyframe Animation Additional challenges in animation 2 Classic animation Luxo
A BRIEF INTRODUCTION TO COMPUTER PROGRAMMING Scratch, LEGO MINDSTORMS, and Visual Basic. July 24, 2015
A BRIEF INTRODUCTION TO COMPUTER PROGRAMMING Scratch, LEGO MINDSTORMS, and Visual Basic July 24, 2015 ABSTRACT This project was created to provide a better understanding of computer programs and their
Book Builder Training Materials Using Book Builder September 2014
Book Builder Training Materials Using Book Builder September 2014 Prepared by WDI, Inc. Table of Contents Introduction --------------------------------------------------------------------------------------------------------------------
Character Animation Tutorial
Character Animation Tutorial 1.Overview 2.Modelling 3.Texturing 5.Skeleton and IKs 4.Keys 5.Export the character and its animations 6.Load the character in Virtools 7.Material & texture tuning 8.Merge
Chapter 12 Programming Concepts and Languages
Chapter 12 Programming Concepts and Languages Chapter 12 Programming Concepts and Languages Paradigm Publishing, Inc. 12-1 Presentation Overview Programming Concepts Problem-Solving Techniques The Evolution
Dragon Solutions. Using A Digital Voice Recorder
Dragon Solutions Using A Digital Voice Recorder COMPLETE REPORTS ON THE GO USING A DIGITAL VOICE RECORDER Professionals across a wide range of industries spend their days in the field traveling from location
GAZETRACKERrM: SOFTWARE DESIGNED TO FACILITATE EYE MOVEMENT ANALYSIS
GAZETRACKERrM: SOFTWARE DESIGNED TO FACILITATE EYE MOVEMENT ANALYSIS Chris kankford Dept. of Systems Engineering Olsson Hall, University of Virginia Charlottesville, VA 22903 804-296-3846 [email protected]
Blender Notes. Introduction to Digital Modelling and Animation in Design Blender Tutorial - week 9 The Game Engine
Blender Notes Introduction to Digital Modelling and Animation in Design Blender Tutorial - week 9 The Game Engine The Blender Game Engine This week we will have an introduction to the Game Engine build
Chapter 12. Introduction. Introduction. User Documentation and Online Help
Chapter 12 User Documentation and Online Help Introduction When it comes to learning about computer systems many people experience anxiety, frustration, and disappointment Even though increasing attention
CHAPTER 6 TEXTURE ANIMATION
CHAPTER 6 TEXTURE ANIMATION 6.1. INTRODUCTION Animation is the creating of a timed sequence or series of graphic images or frames together to give the appearance of continuous movement. A collection of
A Comparison of Programming Languages for Graphical User Interface Programming
University of Tennessee, Knoxville Trace: Tennessee Research and Creative Exchange University of Tennessee Honors Thesis Projects University of Tennessee Honors Program 4-2002 A Comparison of Programming
Quick Start Guide. Microsoft OneNote 2013 looks different from previous versions, so we created this guide to help you minimize the learning curve.
Quick Start Guide Microsoft OneNote 2013 looks different from previous versions, so we created this guide to help you minimize the learning curve. Switch between touch and mouse If you re using OneNote
Go to contents 18 3D Visualization of Building Services in Virtual Environment
3D Visualization of Building Services in Virtual Environment GRÖHN, Matti Gröhn; MANTERE, Markku; SAVIOJA, Lauri; TAKALA, Tapio Telecommunications Software and Multimedia Laboratory Department of Computer
Chapter 10: Multimedia and the Web
Understanding Computers Today and Tomorrow 12 th Edition Chapter 10: Multimedia and the Web Learning Objectives Define Web-based multimedia and list some advantages and disadvantages of using multimedia.
Dragon speech recognition Nuance Dragon NaturallySpeaking 13 comparison by product. Feature matrix. Professional Premium Home.
matrix Recognition accuracy Recognition speed System configuration Turns your voice into text with up to 99% accuracy New - Up to a 15% improvement to out-of-the-box accuracy compared to Dragon version
SignalDraw: GUI Tool For Generating Pulse Sequences
SignalDraw: GUI Tool For Generating Pulse Sequences Konstantin Berlin Department of Computer Science University of Maryland College Park, MD 20742 [email protected] December 9, 2005 Abstract Generating
CATIA V5 Tutorials. Mechanism Design & Animation. Release 18. Nader G. Zamani. University of Windsor. Jonathan M. Weaver. University of Detroit Mercy
CATIA V5 Tutorials Mechanism Design & Animation Release 18 Nader G. Zamani University of Windsor Jonathan M. Weaver University of Detroit Mercy SDC PUBLICATIONS Schroff Development Corporation www.schroff.com
Quick Start Guide. Microsoft OneNote 2013 looks different from previous versions, so we created this guide to help you minimize the learning curve.
Quick Start Guide Microsoft OneNote 2013 looks different from previous versions, so we created this guide to help you minimize the learning curve. Switch between touch and mouse If you re using OneNote
How to Develop Accessible Linux Applications
Sharon Snider Copyright 2002 by IBM Corporation v1.1, 2002 05 03 Revision History Revision v1.1 2002 05 03 Revised by: sds Converted to DocBook XML and updated broken links. Revision v1.0 2002 01 28 Revised
VoiceXML Data Logging Overview
Data Logging Overview - Draft 0.3-20 August 2007 Page 1 Data Logging Overview Forum Tools Committee Draft 0.3-20 August 2007 Data Logging Overview - Draft 0.3-20 August 2007 Page 1 About the Forum: Founded
Creating a 2D Game Engine for Android OS. Introduction
Creating a 2D Game Engine for Android OS Introduction This tutorial will lead you through the foundations of creating a 2D animated game for the Android Operating System. The goal here is not to create
White Paper. Guidelines for Developing Windows Applications Compatible with Dragon NaturallySpeaking and Dragon Medical
White Paper Guidelines for Developing Windows Applications Compatible with Dragon NaturallySpeaking and Dragon Medical White Paper Guidelines for Developing Windows Applications Compatible with Dragon
TATJA: A Test Automation Tool for Java Applets
TATJA: A Test Automation Tool for Java Applets Matthew Xuereb 19, Sanctuary Street, San Ġwann [email protected] Abstract Although there are some very good tools to test Web Applications, such tools neglect
App Development with Talkamatic Dialogue Manager
App Development with Talkamatic Dialogue Manager Dialogue Systems II September 7, 2015 Alex Berman [email protected] Staffan Larsson Outline! Introduction to TDM! Technical architecture! App development
How do non-expert users exploit simultaneous inputs in multimodal interaction?
How do non-expert users exploit simultaneous inputs in multimodal interaction? Knut Kvale, John Rugelbak and Ingunn Amdal 1 Telenor R&D, Norway [email protected], [email protected], [email protected]
Focus X2 (Mac version) Quick Start Guide
Focus X2 (Mac version) Quick Start Guide Welcome to Focus X2 Focus X2 is now used by coaches and educators in more than 60 countries and in over 30 different sports. The purpose of this document is to
Adding emphasis to a presentation in PowerPoint 2010 and 2013 for Windows
Adding emphasis to a presentation in PowerPoint 2010 and 2013 for Windows This document introduces technique to add interest to your presentation using graphics, audio and video. More detailed training
Human-Computer Interaction: Input Devices
Human-Computer Interaction: Input Devices Robert J.K. Jacob Department of Electrical Engineering and Computer Science Tufts University Medford, Mass. All aspects of human-computer interaction, from the
VRSPATIAL: DESIGNING SPATIAL MECHANISMS USING VIRTUAL REALITY
Proceedings of DETC 02 ASME 2002 Design Technical Conferences and Computers and Information in Conference Montreal, Canada, September 29-October 2, 2002 DETC2002/ MECH-34377 VRSPATIAL: DESIGNING SPATIAL
Adobe Dreamweaver Exam Objectives
Adobe Dreamweaver audience needs for a website. 1.2 Identify webpage content that is relevant to the website purpose and appropriate for the target audience. 1.3 Demonstrate knowledge of standard copyright
Dragon Solutions Using A Digital Voice Recorder
Dragon Solutions Using A Digital Voice Recorder COMPLETE REPORTS ON THE GO USING A DIGITAL VOICE RECORDER Professionals across a wide range of industries spend their days in the field traveling from location
White Paper. Guidelines For Speech-Accessible HTML for Dragon NaturallySpeaking and Dragon Medical
White Paper Guidelines For Speech-Accessible HTML for Dragon NaturallySpeaking and Dragon Medical Contents Overview...3 General Requirements...3 Dictation...4 Navigation (Voice Commands)...5 General Recommendations
DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION
1 Recognition Accuracy Turns your voice into text with up to 99% accuracy NEW - Up to a 20% improvement to out-of-the-box accuracy compared to Dragon version 11 Recognition Speed Words appear on the screen
The first program: Little Crab
CHAPTER 2 The first program: Little Crab topics: concepts: writing code: movement, turning, reacting to the screen edges source code, method call, parameter, sequence, if-statement In the previous chapter,
In: Proceedings of RECPAD 2002-12th Portuguese Conference on Pattern Recognition June 27th- 28th, 2002 Aveiro, Portugal
Paper Title: Generic Framework for Video Analysis Authors: Luís Filipe Tavares INESC Porto [email protected] Luís Teixeira INESC Porto, Universidade Católica Portuguesa [email protected] Luís Corte-Real
elearning Guide: Instructional Design
elearning Guide: Instructional Design Produced by NHS Education for Scotland, 2013 Introduction This e-learning Guide provides the standards to be followed when developing content Web-based Training (WBT)
Adobe Conversion Settings in Word. Section 508: Why comply?
It s the right thing to do: Adobe Conversion Settings in Word Section 508: Why comply? 11,400,000 people have visual conditions not correctible by glasses. 6,400,000 new cases of eye disease occur each
CS 378: Computer Game Technology
CS 378: Computer Game Technology http://www.cs.utexas.edu/~fussell/courses/cs378/ Spring 2013 University of Texas at Austin CS 378 Game Technology Don Fussell Instructor and TAs! Instructor: Don Fussell!
Speech Recognition Software Review
Contents 1 Abstract... 2 2 About Recognition Software... 3 3 How to Choose Recognition Software... 4 3.1 Standard Features of Recognition Software... 4 3.2 Definitions... 4 3.3 Models... 5 3.3.1 VoxForge...
WELCOME TO THE WORLD OF TELL ME MORE KIDS!... 3 TEACHER TOOLS... 4 IDENTIFICATION SCREEN... 5 HOMEPAGE... 6
User Manual Contents WELCOME TO THE WORLD OF TELL ME MORE KIDS!... 3 TEACHER TOOLS... 4 IDENTIFICATION SCREEN... 5 HOMEPAGE... 6 NAVIGATION... 6 EXITING TELL ME MORE KIDS... 7 PLACES... 7 VOCABULARY...
4 Understanding. Web Applications IN THIS CHAPTER. 4.1 Understand Web page development. 4.2 Understand Microsoft ASP.NET Web application development
4 Understanding Web Applications IN THIS CHAPTER 4.1 Understand Web page development 4.2 Understand Microsoft ASP.NET Web application development 4.3 Understand Web hosting 4.4 Understand Web services
An Instructional Aid System for Driving Schools Based on Visual Simulation
An Instructional Aid System for Driving Schools Based on Visual Simulation Salvador Bayarri, Rafael Garcia, Pedro Valero, Ignacio Pareja, Institute of Traffic and Road Safety (INTRAS), Marcos Fernandez
Creating Animations with istopmotion
Creating Animations with istopmotion Animation clips can bring excitement to a multimedia project for children and adults alike. The other great benefit of introducing pupils to animation is that they
PowerPoint 2013 Basics of Creating a PowerPoint Presentation
Revision 4 (01-31-2014) PowerPoint 2013 Basics of Creating a PowerPoint Presentation MICROSOFT POWERPOINT PowerPoint is software that lets you create visual presentations. PowerPoint presentations are
Flash Tutorial Part I
Flash Tutorial Part I This tutorial is intended to give you a basic overview of how you can use Flash for web-based projects; it doesn t contain extensive step-by-step instructions and is therefore not
Dialog planning in VoiceXML
Dialog planning in VoiceXML Csapó Tamás Gábor 4 January 2011 2. VoiceXML Programming Guide VoiceXML is an XML format programming language, describing the interactions between human
2 The first program: Little Crab
2 The first program: Little Crab topics: concepts: writing code: movement, turning, reacting to the screen edges source code, method call, parameter, sequence, if statement In the previous chapter, we
Videoconferencing Operation Troubleshooting Procedures
Videoconferencing Operation Troubleshooting Procedures Starting the Vtel Unit: 1. Power up the computer (If you have a Compaq unit, the power switch is located on the front in the middle of the computer.
elearning Journey Multimedia Courseware Design Proposal
elearning Journey Multimedia Courseware Design Proposal Multimedia Courseware Design Proposal Name of Developer: Tracy Lloyd Date: _March 31, 2010 1. Overview Program Title: elearning Journey Target Learner:
Computer Animation. CS 445/645 Fall 2001
Computer Animation CS 445/645 Fall 2001 Let s talk about computer animation Must generate 30 frames per second of animation (24 fps for film) Issues to consider: Is the goal to replace or augment the artist?
Public Speaking Booklet
Public Speaking Booklet Name: Introduction Any public speech is a type of acting. Your written speech is your script. In a speech you research, order and present your ideas and thoughts to an audience
4.2 Understand Microsoft ASP.NET Web Application Development
L E S S O N 4 4.1 Understand Web Page Development 4.2 Understand Microsoft ASP.NET Web Application Development 4.3 Understand Web Hosting 4.4 Understand Web Services MTA Software Fundamentals 4 Test L
MMGD0203 Multimedia Design MMGD0203 MULTIMEDIA DESIGN. Chapter 3 Graphics and Animations
MMGD0203 MULTIMEDIA DESIGN Chapter 3 Graphics and Animations 1 Topics: Definition of Graphics Why use Graphics? Graphics Categories Graphics Qualities File Formats Types of Graphics Graphic File Size Introduction
Voice-Recognition Software An Introduction
Voice-Recognition Software An Introduction What is Voice Recognition? Voice recognition is an alternative to typing on a keyboard. Put simply, you talk to the computer and your words appear on the screen.
Programming in Access VBA
PART I Programming in Access VBA In this part, you will learn all about how Visual Basic for Applications (VBA) works for Access 2010. A number of new VBA features have been incorporated into the 2010
IM 2701 Multimedia Design and Web Development BSc in Business Information Systems. Chapter 01 Introduction to Multimedia Design
IM 2701 Multimedia Design and Web Development BSc in Business Information Systems Chapter 01 Introduction to Multimedia Design Topics Definition of Multimedia Multimedia Elements Web Design Principles
Hulbee Desktop guide. Version 2.1.16.139
Hulbee Desktop guide Version 2.1.16.139 CONTENTS Page 2 of 61 1. INTRODUCTION... 4 1.1 About Hulbee Desktop... 4 1.1.1 The purpose of Hulbee Desktop... 4 2. INSTALLATION AND LAUNCHING... 5 2.1 Installing
Short notes on webpage programming languages
Short notes on webpage programming languages What is HTML? HTML is a language for describing web pages. HTML stands for Hyper Text Markup Language HTML is a markup language A markup language is a set of
CDVS-7000 Series Remote Software Users Guide
CDVS-7000 Series Remote Software Users Guide Crest Electronics, Inc. Version 12.1005 Remote Surveillance Program The remote surveillance program enables users of the Crest 7000 series DVRs to connect remotely
ESKIBS2 Bespoke or specialist software
Overview This is the ability to select and use a suitable specialist or bespoke software application to carry out an appropriate data processing task. It includes understanding the capabilities of the
Contents. Launching FrontPage... 3. Working with the FrontPage Interface... 3 View Options... 4 The Folders List... 5 The Page View Frame...
Using Microsoft Office 2003 Introduction to FrontPage Handout INFORMATION TECHNOLOGY SERVICES California State University, Los Angeles Version 1.0 Fall 2005 Contents Launching FrontPage... 3 Working with
How to Choose the Right Data Storage Format for Your Measurement System
1 How to Choose the Right Data Storage Format for Your Measurement System Overview For many new measurement systems, choosing the right data storage approach is an afterthought. Engineers often end up
Project 2: Character Animation Due Date: Friday, March 10th, 11:59 PM
1 Introduction Project 2: Character Animation Due Date: Friday, March 10th, 11:59 PM The technique of motion capture, or using the recorded movements of a live actor to drive a virtual character, has recently
