VoiceXML and Next-Generation Voice Services


Adam Hocek
XML 2002 Proceedings by deepx

Abstract

This is the abstract for my paper.

Table of Contents

1. Voice services
2. VoiceXML framework
2.1. VoiceXML dialogs
2.2. SSML
2.3. GRXML
2.4. CCXML
2.5. VoiceXML IP services
2.6. VoiceXML Events
2.7. Mixed initiative
3. Introduction to enhanced voice services
3.1. XHTML + Voice
3.2. SALT
3.3. XForms
3.4. XSLT
3.5. NLSML
3.6. Natural dialogs
3.7. Multimodal (multi-interface)
4. Conclusion
Bibliography
Glossary

1. Voice services

Applications that can interact with users through more than one interface have advantages. Consider a city directory service: saying the name of a restaurant is a lot easier than pulling down dropdown menus and making sure the spelling is correct. Conversely, a web browser is better for presenting directions than a sequential, vocalized list of directions. Each user interface has its own criteria for how best to represent information. This implies a certain level of design customization for each interface type supported. On the backend, however, there can be a common data model representing the information presented to and collected from users.

Voice services are one form of user interface. What makes them interesting is the naturalness (or sometimes unnaturalness) with which we interact through speech. However, providing voice services requires listening and speaking capabilities, which are both complex technologies.

The first part, VoiceXML framework, starts with an introduction to VoiceXML and its underlying components. VoiceXML by nature is also complex. The language allows for the scripting of voice dialogs, with speech recognition and speech synthesis capabilities, as well as integration with other processes through events and IP connectivity. Covering VoiceXML therefore covers the basics of voice services.

The second part, Introduction to enhanced voice services, looks at technologies to consider for modeling and leveraging existing web browser technology, and for improving voice dialog systems so that they interact more naturally. The section ends with a look at multiple user interfaces and how they might change the way users interact with applications.

2. VoiceXML framework

One of the motivations behind VoiceXML was to address Interactive Voice Response (IVR) applications and to standardize the different voice processing components they require. IVR applications typically present telephone users with questions and, based on the responses, eventually direct users to the correct information or to an agent. To accomplish this, VoiceXML must provide some basic telephony control, speech synthesis control, and control of voice recognition capabilities through grammars.

Figure 1 shows a typical IVR system. These traditional IVRs had limitations in performance and integration. The problems with traditional IVR systems can be summarized as follows:

- All components (text-to-speech, voice recognition, telephony connectivity) are non-standard.
- Voice menus have little flexibility in handling user variations.
- Integration with other IP services (database, web, etc.) is difficult.
- Application development is costly because it relies on proprietary vendor APIs.
- There is no dynamic content for personalized menus.

Figure 1. Architecture of a traditional IVR

Next we look at how VoiceXML addresses the shortcomings of traditional IVR systems. Figure 2 shows a VoiceXML-based IVR. Functionality is separated into components for Text-To-Speech (TTS), Automatic Speech Recognition (ASR), and telephony control. VoiceXML 2.0 [VoiceXML 2.0] has deferred the functionality of these three components by referencing Speech Synthesis Markup Language (SSML) [SSML] for TTS control, Grammar Markup Language (GRXML) [SRGS] for controlling ASR, and Call Control Markup Language (CCXML) [CCXML] for providing telephony control.
This architecture allows each of the subcomponents (ASR, TTS, and telephony control) to have a specific XML language associated with it. Complete and sophisticated voice dialogs can be authored with VoiceXML and the underlying markups: GRXML, SSML, and CCXML. These XML documents allow for dynamic content creation, improved dialog handling, and simplified component integration. The remainder of this section looks briefly at these markups and in more depth at VoiceXML's dialog, IP integration, and event-handling capabilities.

Figure 2. Architecture of a VoiceXML-based IVR

2.1. VoiceXML dialogs

One of the important benefits of VoiceXML is the ability to author dialogs. Inherent to the VoiceXML specification is the Form Interpretation Algorithm (FIA), the logic for processing forms. A form is a collection of form items. The FIA is responsible for entry into a form, selecting the next form item to visit, managing prompts, activating and deactivating grammars, and exiting the form.

VoiceXML defines a "form item" as an element that is a child of form and is one of the following element types: field, block, initial, subdialog, object, record, transfer.

The FIA uses the form item variables to determine dialog flow. The form item variables are:

- result variable: An ECMAScript variable whose name is defined by the form item's name attribute, and whose scope is that of the containing dialog. When a form item has been successfully visited, its result variable contains the result of that visit. For example, a field's result variable contains the answer collected from the caller.
- guard condition: An ECMAScript expression specified by the form item's cond attribute, which must evaluate to true for the form item to be visited.
- count variable: An internal variable that keeps track of how many times the VoiceXML interpreter has attempted to fill a form item in a given invocation of the dialog.
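The way the three form item variables steer a dialog can be sketched in a few lines of Python. This is only an illustration of the selection step described above, not the full algorithm from the VoiceXML specification: prompts, grammars, and event handling are left out, and the names FormItem, select_next, run_form, and max_attempts are assumptions made for this sketch.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Variables = Dict[str, Optional[str]]


def always(variables: Variables) -> bool:
    """Default guard condition: the item may always be visited."""
    return True


@dataclass
class FormItem:
    name: str                                   # names the result variable
    prompt: str
    cond: Callable[[Variables], bool] = always  # guard condition
    count: int = 0                              # count variable (visits so far)


def select_next(items: List[FormItem], variables: Variables) -> Optional[FormItem]:
    """Return the first item whose result variable is unset and whose guard holds."""
    for item in items:
        if variables[item.name] is None and item.cond(variables):
            return item
    return None


def run_form(
    items: List[FormItem],
    replies: Dict[str, List[Optional[str]]],
    max_attempts: int = 3,  # hypothetical cut-off, not part of VoiceXML
) -> Tuple[Variables, List[str]]:
    """Visit items until none is selectable; `replies` simulates the caller.

    A reply of None stands for "no usable input", so the item is revisited.
    """
    queues = {name: list(values) for name, values in replies.items()}
    variables: Variables = {item.name: None for item in items}
    transcript: List[str] = []
    while True:
        item = select_next(items, variables)
        if item is None:
            break  # every selectable item is filled: exit the form
        item.count += 1
        transcript.append(f"{item.name} (attempt {item.count}): {item.prompt}")
        queue = queues.get(item.name, [])
        reply = queue.pop(0) if queue else None
        if reply is not None:
            variables[item.name] = reply  # fill the result variable
        elif item.count >= max_attempts:
            break  # give up rather than loop forever in this simulation
    return variables, transcript
```

Calling run_form with a color item, a size item, and a third item guarded by a condition on size shows the behavior: an unanswered item is revisited and its count grows, while an item whose guard condition is false is never visited.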

A simple example shows a form that collects two field values and posts them to a CGI (Common Gateway Interface) script.

    <form>
      <block>
        To complete your order we need the following information.
      </block>
      <field name="color">
        <prompt>Select from one of the following colors: <enumerate/></prompt>
        <option>red</option>
        <option>blue</option>
        <option>green</option>
      </field>
      <field name="size">
        <prompt>Select a size. You can select <enumerate/></prompt>
        <option>small</option>
        <option>medium</option>
        <option>large</option>
      </field>
      <filled>
        Thank you. Your order is being processed.
        <submit next="/cgi/details.cgi" namelist="color size"/>
      </filled>
    </form>

2.2. SSML

Developers use SSML to specify how speech should be rendered. The language has elements for controlling the pronunciation, tone, inflection, and other characteristics of spoken words. VoiceXML has two elements for producing speech: prompt and audio. SSML elements can occur within either of them. SSML elements include the following:

- emphasis: text spoken with emphasis
- prosody: allows for control of pitch, rate, duration, and volume
- sentence: identifies a sentence
- paragraph: identifies a paragraph
- say-as: uses a type construct to render text
- phoneme: specifies a phonetic pronunciation
- voice: specifies a voice characteristic
- mark: used for asynchronous notification
- break: a pause

The following example uses SSML elements to control speech and audio output:

    <?xml version="1.0" encoding="iso "?>
    <vxml version="2.0">
      <form id="audiotest">
        <block>
          Your <emphasis>total</emphasis> is
          <say-as class="currency">$299.95</say-as>
          <audio src="...">
            I'm sorry. The audio stream is not available today.
          </audio>
        </block>
      </form>
    </vxml>

2.3. GRXML

Grammars define the domain of active spoken words or DTMF (Dual Tone Multi-Frequency) tones that are listened for. Grammars use rules and weights to specify the recognition logic. For describing grammars, VoiceXML accepts any of the following formats: GRXML, ABNF, and JSGF [SRGS]. Though the grammar languages have a similar purpose, GRXML is the only one that is an XML format. VoiceXML uses the grammar element to specify a grammar. Its type attribute states which grammar format is being used. The essential elements used to define GRXML grammars are:

- rule: a rule expansion declaration
- ruleref: a local or external rule reference
- item: defines an entity
- one-of: a set of alternatives
- tag: a string associated with a rule expansion
- grammar: the root element

Grammars can be defined externally using GRXML, or they can use the built-in grammar types: digits, boolean, currency, date, number, phone, time.

How recognition results get processed

Grammars take a user response as input and return a string value that represents the match. Using grammars, complex word patterns can be defined and tested for. In the example, the field named "favcolor" defines an inline grammar of acceptable answers to a prompt.

    <?xml version="1.0" encoding="iso "?>
    <vxml version="2.0">
      <form id="test">
        <field name="favcolor">
          <prompt>What is your favorite color?</prompt>
          <grammar xml:lang="en-us" version="1.0" root="example1">
            <rule id="example1" scope="public">
              <one-of>
                <item><tag>'red'</tag>red</item>
                <item><tag>'green'</tag>green</item>
                <item><tag>'blue'</tag>blue</item>
                <item><tag>'red'</tag>burgundy</item>
                <item><tag>'blue'</tag>indigo</item>
              </one-of>
            </rule>
          </grammar>
          <filled>
            <prompt>
              You said your favorite color is <value expr="favcolor"/>.
            </prompt>
          </filled>
        </field>
      </form>
    </vxml>

This grammar, for the sake of simplicity, defines five color words to choose from with the item element. The interpreter is responsible for setting the ASR to recognize the five words. Each item in the grammar has a tag element whose value is returned from the ASR to the VoiceXML interpreter when one of the five words is recognized, so "burgundy" returns 'red' and "indigo" returns 'blue'. The returned value is then bound to the field "favcolor". When the field's filled element is activated, the prompt says the user's selected color.

2.4. CCXML

Using CCXML, applications can manage all inbound and outbound call connectivity and control audio mixing and splitting. VoiceXML provides very little in terms of call control; in fact, it provides only two elements, disconnect and transfer. On its own, VoiceXML has no ability to place outbound calls and none of the conferencing features offered by CCXML. Another important shortcoming of VoiceXML that CCXML addresses is the lack of a framework for managing multiple instances of VoiceXML interpreters and the events between interpreter instances. The main features of CCXML are:

- Allows for outbound calls
- Support for multi-party calls
- Selective inbound call routing
- Asynchronous "external" event handling
- Conference objects for joining and unjoining participants
- Audio objects for splitting and mixing audio resources
- Control of and connectivity to one or more VoiceXML interpreter instances
- VoiceXML control to start, kill, or suspend a process
- Support for multiple CCXML programs and their interconnection through events
- Can be used to provide coaching, flooring, and delegation control
- Web server connectivity
- Whisper transfer
- Supervised transfer

2.5. VoiceXML IP services

Prior to VoiceXML, IP (Internet Protocol) connectivity was virtually non-existent in IVRs. Those vendors that did implement it used proprietary calls.
IP connectivity can be thought of as two distinct issues. One is the ability to access documents by providing a URI, similar to accessing web pages over the Internet. The second is the ability to post values to an Internet service. VoiceXML provides two elements for passing variables and values: submit and subdialog.

A submit uses an HTTP GET or POST to pass key/value pairs to a specified URL. The variables named in the namelist attribute are passed along with their values, as shown in the code excerpt:

    <if cond="selection=='menu'">
      <submit next="..." method="post"
              namelist="userlevel orderstatus status" fetchtimeout="180s"/>
    </if>

A subdialog element can be used to call another form as if it were a subroutine. The form that is called as a subdialog must end with a return element. A subdialog form can also accept input parameters. Here we use a param element to pass a variable to the subdialog, which is referenced by the fragment URI #getuseraccesslevel, and use the results returned through the return element's namelist (result.username and result.status) to play a prompt.

    <form>
      <subdialog name="result" src="#getuseraccesslevel">
        <param name="userid" expr="'null'"/>
        <filled>
          <prompt>
            The subdialog returned the name <value expr="result.username"/>
            and <value expr="result.status"/>
          </prompt>
          <submit namelist="result" next="..."/>
        </filled>
      </subdialog>
    </form>

    <!-- subdialog to get user access level -->
    <form id="getuseraccesslevel">
      <var name="userid"/>
      <field name="username">
        <grammar src="..." type="application/grammar+xml"/>
        <prompt>Please say your name.</prompt>
        <filled>
          <if cond="validuseraccess(username,userid)">
            <var name="status" expr="true"/>
          <else/>
            <var name="status" expr="false"/>
          </if>
          <return namelist="username status"/>
        </filled>
      </field>
    </form>

2.6. VoiceXML Events

As specified, VoiceXML interpreters have limited event-handling capabilities. An event is thrown if the interpreter encounters a semantic document error or a throw element. The inherent event handlers of VoiceXML are the elements:

- noinput
- nomatch
- catch
- error
- help

One drawback of VoiceXML's event handling is that it is single-threaded. Consequently, only events that are explicitly handled by the VoiceXML application will be handled. Another point to consider is that many asynchronous events occur in a real-time telephony environment. As an application evolves, it may need to handle some of these events. VoiceXML is limited in this respect; a CCXML interpreter is a good alternative for providing a multi-threaded event framework.

2.7. Mixed initiative

The term mixed initiative refers to the ability of either the computer or the user to drive the conversation. This is an important feature for making better voice interfaces. If a form contains an initial element, it is visited before the other form items. After visiting an initial, the interpreter waits for a form-level grammar to be satisfied. Once an answer is given that satisfies the form-level grammar, the interpreter attempts to fill any fields that remain unfilled using the standard Form Interpretation Algorithm.

The trick to making a form mixed initiative is to provide a grammar that can answer any of the questions represented by the fields of the form. A single user utterance can then return results for multiple fields.

The example below is a mixed-initiative form with a GRXML grammar for scheduling a flight. The user can say "I'd like to fly from city A to city B" or "I'd like to fly to city B from city A". Both responses are acceptable. The fields depart and arrive are both filled when the user responds with either utterance.

    <vxml version="2.0">
      <form id="airlines">
        <initial name="itinerary">
          Where would you like to fly?
          <catch event="nomatch noinput">
            <prompt>I didn't get that.</prompt>
            <assign name="itinerary" expr="undefined"/>
            <reprompt/>
          </catch>
        </initial>
        <grammar xml:lang="en-us" version="1.0" root="flight">
          <rule id="flight" scope="public">
            <one-of>
              <item>
                <item repeat="0-1">I'd like to fly</item>
                from <ruleref uri="#city"> <tag>depart=city.returnvalue;</tag> </ruleref>
                to <ruleref uri="#city"> <tag>arrive=city.returnvalue;</tag> </ruleref>
              </item>
              <item>
                <item repeat="0-1">I'd like to fly</item>
                to <ruleref uri="#city"> <tag>arrive=city.returnvalue;</tag> </ruleref>
                from <ruleref uri="#city"> <tag>depart=city.returnvalue;</tag> </ruleref>
              </item>
            </one-of>
          </rule>
          <rule id="city" scope="public">
            <one-of>
              <item> <tag>returnvalue='New York';</tag>New York</item>
              <item> <tag>returnvalue='Los Angeles';</tag>Los Angeles</item>
              <item> <tag>returnvalue='Los Angeles';</tag>L A</item>
              <item> <tag>returnvalue='Toronto';</tag>Toronto</item>
              <item> <tag>returnvalue='London';</tag>London</item>
              <item> <tag>returnvalue='Paris';</tag>Paris</item>
            </one-of>
          </rule>
        </grammar>
        <field name="depart"> </field>
        <field name="arrive"> </field>
        <filled>
          <prompt>
            I have you flying from <value expr="depart"/> to <value expr="arrive"/>.
          </prompt>
        </filled>
      </form>
    </vxml>

3. Introduction to enhanced voice services

Voice browsers offer a viable alternative to HTML browsers, not only where device real estate is limited, as in ever smaller mobile phones and PDAs, but also in the application of natural dialogs, where users are no longer confined to directed menus or forms and can describe their objectives naturally through speech. Here we look at some of the other technologies that can be combined with VoiceXML, or that offer alternatives to VoiceXML, for providing enhanced voice services.

3.1. XHTML + Voice

XHTML+Voice [XHTML+Voice] is a current technology that is ready for developing voice-enabled applications. By adapting XHTML for voice input and output and leveraging its event model, XHTML+Voice offers a good transitional technology. It greatly simplifies extending existing XHTML applications to support voice.

XHTML 1.0 is a reformulation of HTML 4.0 as an XML language. Presentation is deferred to style sheets. XHTML 1.1 structured the language into modular components. Combining this with the DOM2 Event Model is what allows voice dialogs to be added to XHTML. Event handlers that implement VoiceXML actions can process events received by the event listener. The result is an event syntax for XHTML-based languages that enables an interoperable way of associating behaviors with document-level markup.
The XML event types supported by the XHTML+Voice profile include all intrinsic event types defined for HTML 4.01, plus the VoiceXML 2.0 events (noinput, nomatch, error, and help), as well as an additional filled event for field-level or form-level filled elements. An XHTML element associates one of the event types with an ID attribute reference to the VoiceXML form that will handle the event.

To include voice dialogs with XHTML+Voice, the voice handlers are placed within the XHTML header. Within the XHTML body, an input element, for example, would listen for an "onfocus" event and then pass control to the handler. The result of the handler then needs to be assigned to the XHTML form variable. This can be accomplished with a script in the body that defines the assignment.
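The wiring just described (a handler declared in the header, an element in the body that listens for an event, and an assignment of the handler's result to the form variable) can be modeled with a small Python sketch. It is a conceptual illustration only and does not reproduce the XHTML+Voice or DOM2 Events APIs; the class Page and its methods add_handler, listen, and dispatch are hypothetical names, and the voice dialog is simulated by a plain function that returns the recognized value.

```python
from typing import Callable, Dict, Tuple


class Page:
    """Toy model of a page that associates events with voice handlers."""

    def __init__(self) -> None:
        # Voice handlers declared in the document head, keyed by their id.
        self.handlers: Dict[str, Callable[[], str]] = {}
        # (element id, event type) -> id of the handler that processes the event.
        self.listeners: Dict[Tuple[str, str], str] = {}
        # Current values of the visual form fields in the body.
        self.fields: Dict[str, str] = {}

    def add_handler(self, handler_id: str, dialog: Callable[[], str]) -> None:
        """Register a voice dialog; calling it stands in for running the dialog."""
        self.handlers[handler_id] = dialog

    def listen(self, element_id: str, event_type: str, handler_id: str) -> None:
        """Associate an event on an element with a handler id."""
        self.listeners[(element_id, event_type)] = handler_id

    def dispatch(self, element_id: str, event_type: str) -> bool:
        """Run the handler for the event, if any, and assign its result to the field."""
        handler_id = self.listeners.get((element_id, event_type))
        if handler_id is None:
            return False  # nobody listens for this event on this element
        result = self.handlers[handler_id]()
        self.fields[element_id] = result  # the assignment step done by the body script
        return True
```

For instance, after registering a handler "say_city" and calling listen("city", "focus", "say_city"), dispatching a focus event on "city" stores the handler's result in the city field, while any other event on that element is ignored.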

3.2. SALT

SALT [SALT] is probably more an alternative to VoiceXML than an enhancing technology that would coexist with it. Started in October 2001, the SALT (Speech Application Language Tags) specification was submitted to the W3C for review in August. SALT is similar to the XHTML+Voice specification; it also leverages the event-based DOM execution model to integrate with specific interfaces.

SALT defines "speech tags" that can be treated as extensions to HTML, enabling developers to add a spoken dialog interface to Web applications. Speech tags are a set of XML elements, not unlike VoiceXML, which provide dialog, speech interface, and call control services. In general there are fewer elements to contend with than in VoiceXML. There is also no inherent FIA; writing this flow logic is left to the application developer. SALT uses five main element types:

- prompt: configures the speech synthesizer and plays out prompts.
- reco: configures the speech recognizer, executes recognition, and handles recognition events.
- grammar: specifies input grammar resources.
- bind: processes recognition results into the page.
- dtmf: configures and controls DTMF.

3.3. XForms

XForms [XForms] is an XML application that represents the next generation of forms for the Web. By splitting traditional XHTML forms into three parts (XForms model, instance data, and user interface), XForms separates presentation from content. This allows for reuse and provides strong typing, which in turn reduces the number of calls to a server. With XForms, device-independent modeling can be accomplished. XForms is intended to be integrated into other markup languages, such as XHTML or SVG, and is not intended to be a freestanding document type. As such, it may take time before XForms is ready for implementation.

There are two issues to consider when combining XForms with VoiceXML. The first is the data model: XForms provides a data model with specifiable data types.
This model is consistent across different browsers and facilitates the sharing of data. The other consideration is the user interface and control. The control portion can be rendered as an integral part of another markup. This could be difficult for already existing interpreters. Another possibility is to transform the control, with an XSLT, into a native, device-specific markup, in this case VoiceXML.

3.4. XSLT

Figure 3 shows a flexible approach to generating dynamic VoiceXML, HTML, or other markup languages. In this approach the data model and the presentation of data are separated. The data model referred to here is not the same as the data model in XForms. Instead, the XForms user interface portion, which describes the purpose of the interface, forms the data model of the transformation.
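To make the idea of one source model with several device-specific renderings concrete, the following Python sketch produces both a VoiceXML field and an HTML select from the same small XML model. It is a stand-in for XSL templates written with Python's standard xml.etree.ElementTree module, not XSLT itself, and the source vocabulary (choice, caption, option) as well as the function names to_voicexml and to_html are assumptions made for this illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical device-neutral model: it states the purpose of the control only.
MODEL = """
<choice ref="payment">
  <caption>Select payment method</caption>
  <option>cash</option>
  <option>credit</option>
</choice>
"""


def to_voicexml(model_xml: str) -> str:
    """Render the model as a VoiceXML field with a prompt and spoken options."""
    choice = ET.fromstring(model_xml)
    field = ET.Element("field", name=choice.get("ref", ""))
    prompt = ET.SubElement(field, "prompt")
    prompt.text = choice.findtext("caption")
    for option in choice.findall("option"):
        ET.SubElement(field, "option").text = option.text
    return ET.tostring(field, encoding="unicode")


def to_html(model_xml: str) -> str:
    """Render the same model as an HTML label wrapping a select element."""
    choice = ET.fromstring(model_xml)
    label = ET.Element("label")
    label.text = choice.findtext("caption")
    select = ET.SubElement(label, "select", name=choice.get("ref", ""))
    for option in choice.findall("option"):
        item = ET.SubElement(select, "option", value=option.text or "")
        item.text = option.text
    return ET.tostring(label, encoding="unicode")
```

Both functions read the same model and differ only in the target vocabulary, which is the role the separate XSL templates play in the approach described above.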

Figure 3. Generating VoiceXML and other markup languages with XSLT

The XSLT [XSLT] takes the source XML document and, using different XSL templates, generates HTML, VoiceXML, WML, or other document types. Once generated, these documents are interpreted by an appropriate browser. In the case of HTML, the document is served through a Web server to the client's browser, where the actual rendering of the Web page takes place. VoiceXML documents are rendered to the client's telephone by a VoiceXML interpreter.

The XForms user interface representing the data model might look like the following:

    <selectone ref="as">
      <caption>Select Payment Method</caption>
      <choices>
        <item>
          <caption>Cash</caption>
          <value>cash</value>
        </item>
        <item>
          <caption>Credit</caption>
          <value>credit</value>
        </item>
      </choices>
    </selectone>
    <input ref="cc">
      <caption>Credit Card Number</caption>
    </input>
    <input ref="exp">
      <caption>Expiration Date</caption>
    </input>
    <submit submitinfo="submit">
      <caption>Submit</caption>
    </submit>

Notice that the user interface does not dictate how the interface should look; only its purpose is stated. The transformation is responsible for producing the document that renders the device-specific interface.

3.5. NLSML

The W3C working draft on Natural Language Semantic Markup Language [NLSML] attempts to formalize the results of semantic interpreters. The intent is that semantic interpreters generate NLSML documents from the user's utterances and the machine-determined meaning. These documents provide results on the interpreted meaning and its mapping to a data model. The first-order results of meaning represented by an NLSML document can be used to further direct the dialog into context categories. Because certain words or phrase fragments are more likely to occur and have a greater expectation of occurring, these expectations can be used for classifying phrase fragments into context categories [DESP]. The next section, on natural dialogs, explores this process further. NLSML also has support for managing multiple input devices (multimodality).

The true benefit of NLSML is that intermediate results can be integrated with further processing logic to provide natural dialog systems. Another value of NLSML is in testing and evaluating such highly interdependent and adaptive systems. Using intermediate results of meaning is beneficial for isolating system factors and for performing comparative vendor tests.

3.6. Natural dialogs

Here we look at systems that understand meaning based on context. Instead of using word grammars to define the domain of acceptable phrases, we use task models to describe the acceptable tasks that can be performed. The acceptable tasks change depending on the state the dialog has reached.

To date, language processing has successfully provided understanding, though often constrained to the grammars defined. As a result, most current systems use a loosely coupled, unidirectional interface, such as grammars within VoiceXML or n-best words, with natural language constraints applied as a post-process for filtering the recognizer output.

Context provides a level of discourse that places significant constraints on what people can talk about and how things can be referred to. In other words, knowing the context narrows down what the speaker is trying to say. Dialog systems, as shown in Figure 4, use the current context, the user input, and the task model to determine the system response and the new context.

Figure 4. Dialog management

To achieve reasonable coverage of meaning, language-processing research has developed techniques based on "partial analysis": the ability to find meaning-bearing phrases in the input and to construct meaning representations out of them without requiring a complete analysis of the entire string [USS].

The first phase in building a dialog management system is an analysis of the different ways in which individuals can express the finite tasks of the system. To accomplish this, a system needs to record, analyze, and categorize acceptable user utterances. One outcome of this analysis is a requirement for the recognition grammars needed. A second outcome is the identification of the partial categories that can be represented with some meaning. NLSML is a good candidate language for representing them. The system interactively uses these partial categories to narrow the scope further until a final task is reached. The process is not as daunting as it may seem at first, especially considering that most real-world customer support systems ultimately have a finite number of performable tasks, usually on the order of a few dozen.

3.7. Multimodal (multi-interface)

Multimodal access enables users to interact with an application in a variety of ways: they can input data using speech, a keyboard, keypad, mouse, and/or stylus, and receive data as synthesized speech, audio, plain text, motion video, and/or graphics. Each of these modes can be used independently or concurrently.

Multimodality involves more than one input and output mode. The interaction can be sequential or synchronous for both inputs and outputs.

sequential input and output - By sequential multimodality we mean that only one of the different modalities is active at a given time. This applies to both input and output modes. For example, the user can have a bi-modal device with a visual and a voice interface. At any time the device can accept, as input, either a button click or a spoken response. However, only the first input is used for processing. For sequential output, again, only one mode is used for output at any time. One scenario is a directory service that the user dials into from a mobile phone:

    Mobile user dials directory service.
    Directory service: How may I help you?
    Mobile phone: I need the phone number and directions for Ray's Pizza.
    Directory service: Which one? Is that the one located on ...rd Street, or at 202 Lexington Avenue, or...? [user barge-in]
    Mobile phone: The one on 33rd Street.
    Directory service: How would you like the phone number and directions, via voice, visual, or ...?
    Mobile phone: Visual.
    The directions and number are sent as an SVG document.

Another scenario is web and voice access to the same shopping transaction. Here there are two actors: one who does the shopping and one who holds the credit card. The shopper goes onto the web and selects several items. When asked for credit card information, the shopper chooses to defer the transaction until later and is given a transaction ID and a security code. The shopper can now ask the credit card holder to call the company with the transaction ID and security code. The credit card holder can review the contents of the shopping basket and, if they choose, proceed with the transaction by supplying their credit card information.

synchronous input - Here, more than one input mode is accepted as input simultaneously. There is, of course, a finite window of time in which the application will be "listening" to the user's input. Disambiguation of inputs needs to be provided.
If the user clicks on one item and verbally selects another, there must be a way to resolve this conflict, whether through markup or as part of the application logic. An example is a mobile user who has displayed a map on their phone and is engaged in a voice dialog with the directory service. They can say, "Give me all the bookstores in this area," while simultaneously moving their stylus onto the map and encircling a region.

synchronous output - Multimodal output refers to more than one modality being used as output simultaneously in a coordinated manner. There needs to be a mechanism to synchronize output. The W3C Working Draft "Multimodal Requirements for Voice Markup Language" specifies SMIL [SMIL] for synchronization. An example is again the mobile user, who views a simple slide presentation on their mobile device while listening to a synchronized narration through their earpiece. Here synchronization of the two streams needs to be managed locally at the device. It is also conceivable that the slide presentation and the audio are delivered to two separate devices, a phone and a desktop browser. In this case, synchronization can be accomplished by letting the audio playback trigger events that control the visual slide presentation.

4. Conclusion

For complex architectures that integrate multiple interpreter types and connect to external real-time events, several of the considerations covered here should be kept in mind for voice and multimodal applications.

As new and more specialized markup languages emerge, there will likely be some overlap with existing markup languages. We saw this, for example, with CCXML and VoiceXML, which both provide call management functionality. The preferred approach is to use the richer language for the task. In this case CCXML should be used for an application that requires outbound calls or sophisticated call handling.

Another point to consider when integrating multiple markup languages and real-time events is how the events are managed. CCXML offers an improved framework for managing multiple asynchronous events. Also to be considered is how each interpreter context can pass variables to another context. Again CCXML is worth considering, because CCXML events allow variables to be passed.

For multimodal applications, the XForms data model is valuable for abstracting data elements from different interfaces into a common data representation. For now it is still necessary to transform the user interface control logic for each browser type. An XForms control needs to be transformed with an XSLT into the markup of the appropriate browser, e.g. VoiceXML or HTML.

Lastly, when designing natural dialog systems, a language like NLSML will simplify integration with other decision-making logic and provide a manageable mechanism for monitoring and testing results.

Bibliography

[SSML] Speech Synthesis Markup Language Specification. Available at
[CCXML] Call Control Markup Language. Available at
[VoiceXML 2.0] VoiceXML Specification. Available at
[SRGS] Speech Recognition Grammar Specification. Available at
[XHTML+Voice] XHTML+Voice Specification. Available at
[SALT] Speech Application Language Tags. Available at
[XForms] XForms Specification. Available at
[XSLT] XSL Transformations. Available at
[NLSML] Natural Language Semantic Markup Language. Available at
[DESP] S. Abdou, M. Scordilis. Improved Speech Understanding Using Dialogue Expectations in Sentence Parsing. Proceedings of ICSLP 2000.
[USS] W. Ward. Understanding Spontaneous Speech: The Phoenix System. Proceedings of ICASSP 1991.
[SMIL] Synchronized Multimedia Integration Language. Available at

Glossary

ASR     Automatic Speech Recognition
CCXML   Call Control Markup Language
GRXML   Grammar Markup Language
IVR     Interactive Voice Response
SSML    Speech Synthesis Markup Language
TTS     Text-To-Speech

Biography

Adam Hocek
Broadstrokes, Inc.
New York
United States of America

Adam is President/CTO of a New York-based startup company developing XML-based device technologies. He is also co-author of "Definitive VoiceXML".

VoiceXML-Based Dialogue Systems

VoiceXML-Based Dialogue Systems VoiceXML-Based Dialogue Systems Pavel Cenek Laboratory of Speech and Dialogue Faculty of Informatics Masaryk University Brno Agenda Dialogue system (DS) VoiceXML Frame-based DS in general 2 Computer based

More information

Standard Languages for Developing Multimodal Applications

Standard Languages for Developing Multimodal Applications Standard Languages for Developing Multimodal Applications James A. Larson Intel Corporation 16055 SW Walker Rd, #402, Beaverton, OR 97006 USA jim@larson-tech.com Abstract The World Wide Web Consortium

More information

Dialog planning in VoiceXML

Dialog planning in VoiceXML Dialog planning in VoiceXML Csapó Tamás Gábor 4 January 2011 2. VoiceXML Programming Guide VoiceXML is an XML format programming language, describing the interactions between human

More information

VoiceXML versus SALT: selecting a voice

VoiceXML versus SALT: selecting a voice VoiceXML versus SALT: selecting a voice application standard When it comes to speech application standards, it seems we've been asking all the wrong questions. The VXML versus SALT debate is currently

More information

VoiceXML and VoIP. Architectural Elements of Next-Generation Telephone Services. RJ Auburn

VoiceXML and VoIP. Architectural Elements of Next-Generation Telephone Services. RJ Auburn VoiceXML and VoIP Architectural Elements of Next-Generation Telephone Services RJ Auburn Chief Network Architect, Voxeo Corporation Editor, CCXML Version 1.0, W3C Ken Rehor Software Architect, Nuance Communications

More information

An Introduction to VoiceXML

An Introduction to VoiceXML An Introduction to VoiceXML ART on Dialogue Models and Dialogue Systems François Mairesse University of Sheffield F.Mairesse@sheffield.ac.uk http://www.dcs.shef.ac.uk/~francois Outline What is it? Why

More information

! <?xml version="1.0">! <vxml version="2.0">!! <form>!!! <block>!!! <prompt>hello World!</prompt>!!! </block>!! </form>! </vxml>

! <?xml version=1.0>! <vxml version=2.0>!! <form>!!! <block>!!! <prompt>hello World!</prompt>!!! </block>!! </form>! </vxml> Using VoiceXML! Language spec 2.0! Includes support for VUI and for telephony applications (call forward, transfers, etc) " Has tags specific to voice application! Simple (and classic) example! !

More information

XML based Interactive Voice Response System

XML based Interactive Voice Response System XML based Interactive Voice Response System Sharad Kumar Singh PT PureTesting Software P Ltd. Noida, India ABSTRACT The paper presents the architecture of a web based interactive voice response system

More information

VoiceXML. Erik Harborg SINTEF IKT. Presentasjon, 4. årskurs, NTNU, 2007-04-17 ICT

VoiceXML. Erik Harborg SINTEF IKT. Presentasjon, 4. årskurs, NTNU, 2007-04-17 ICT VoiceXML Erik Harborg SINTEF IKT Presentasjon, 4. årskurs, NTNU, 2007-04-17 1 Content Voice as the user interface What is VoiceXML? What type of applications can be implemented? Example applications VoiceXML

More information

VoiceXML Overview. James A. Larson Intel Corporation jim@larson-tech.com. (c) 2007 Larson Technical Services 1

VoiceXML Overview. James A. Larson Intel Corporation jim@larson-tech.com. (c) 2007 Larson Technical Services 1 VoiceXML Overview James A. Larson Intel Corporation jim@larson-tech.com (c) 2007 Larson Technical Services 1 Outline Motivation for VoiceXML W3C Speech Interface Framework Languages Dialog VoiceXML 2.0

More information

A Development Tool for VoiceXML-Based Interactive Voice Response Systems

A Development Tool for VoiceXML-Based Interactive Voice Response Systems A Development Tool for VoiceXML-Based Interactive Voice Response Systems Cheng-Hsiung Chen Nai-Wei Lin Department of Computer Science and Information Engineering National Chung Cheng University Chiayi,

More information

Avaya Aura Orchestration Designer

Avaya Aura Orchestration Designer Avaya Aura Orchestration Designer Avaya Aura Orchestration Designer is a unified service creation environment for faster, lower cost design and deployment of voice and multimedia applications and agent

More information

Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications

Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications Oxford University Press 2007. All rights reserved. 1 XML An extensible language The

More information

A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents

A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents JIEUN KIM, JIEUN PARK, JUNSUK PARK, DONGWON HAN Computer & Software Technology Lab, Electronics and Telecommunications

More information

VoiceXML Programmer s Guide

VoiceXML Programmer s Guide VoiceXML Programmer s Guide VOICEXML PROGRAMMER S GUIDE 1 BeVocal, Inc. 685 Clyde Avenue Mountain View, CA 94043 Part No. 520-0001-02 Copyright 2005. BeVocal, Inc. All rights reserved. 2 VOICEXML PROGRAMMER

More information

Version 2.6. Virtual Receptionist Stepping Through the Basics

Version 2.6. Virtual Receptionist Stepping Through the Basics Version 2.6 Virtual Receptionist Stepping Through the Basics Contents What is a Virtual Receptionist?...3 About the Documentation...3 Ifbyphone on the Web...3 Setting Up a Virtual Receptionist...4 Logging

More information

Voice User Interface Design

Voice User Interface Design Voice User Interface Design Part III: Technology Support for Voice Application Dr. Dirk Schnelle-Walka Copyrighted material; for TUD student use only W3C Speech Interface Framework VoiceXML Speech Synthesis

More information

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals Thin Client Development and Wireless Markup Languages cont. David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh tipper@tele.pitt.edu http://www.sis.pitt.edu/~dtipper/2727.html

More information

Support and Compatibility

Support and Compatibility Version 1.0 Frequently Asked Questions General What is Voiyager? Voiyager is a productivity platform for VoiceXML applications with Version 1.0 of Voiyager focusing on the complete development and testing

More information

VoiceXML. For: Professor Gerald Q. Maguire Jr. By: Andreas Ångström, it00_aan@it.kth.se and Johan Sverin, it00_jsv@it.kth.se Date: 2004-05-24

VoiceXML. For: Professor Gerald Q. Maguire Jr. By: Andreas Ångström, it00_aan@it.kth.se and Johan Sverin, it00_jsv@it.kth.se Date: 2004-05-24 Royal Institute of Technology, KTH IMIT Practical Voice over IP 2G1325 VoiceXML For: Professor Gerald Q. Maguire Jr. By: Andreas Ångström, it00_aan@it.kth.se and Johan Sverin, it00_jsv@it.kth.se Date:

More information

Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications

Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications Lerato Lerato, Maletšabisa Molapo and Lehlohonolo Khoase Dept. of Maths and Computer Science, National University of Lesotho Roma

More information

VoiceXML Tutorial. Part 1: VoiceXML Basics and Simple Forms

VoiceXML Tutorial. Part 1: VoiceXML Basics and Simple Forms VoiceXML Tutorial Part 1: VoiceXML Basics and Simple Forms What is VoiceXML? XML Application W3C Standard Integration of Multiple Speech and Telephony Related Technologies Automated Speech Recognition

More information

Combining VoiceXML with CCXML

Combining VoiceXML with CCXML Combining VoiceXML with CCXML A Comparative Study Daniel Amyot and Renato Simoes School of Information Technology and Engineering University of Ottawa Ottawa, Canada damyot@site.uottawa.ca, renatops@yahoo.com

More information

Traitement de la Parole

Traitement de la Parole Traitement de la Parole Cours 11: Systèmes à dialogues VoiceXML partie 1 06/06/2005 Traitement de la Parole SE 2005 1 jean.hennebert@unifr.ch, University of Fribourg Date Cours Exerc. Contenu 1 14/03/2005

More information

VoiceXML Discussion. http://www.w3.org/tr/voicexml20/

VoiceXML Discussion. http://www.w3.org/tr/voicexml20/ VoiceXML Discussion http://www.w3.org/tr/voicexml20/ Voice Extensible Markup Language (VoiceXML) o is a markup-based, declarative, programming language for creating speechbased telephony applications o

More information

How To Write A Powerpoint Powerpoint Gsl In A Html Document In A Wordpress 3.5.2 (Html) Or A Microsoft Powerpoint (Html5) (Html3) (Powerpoint) (Web) (Www

How To Write A Powerpoint Powerpoint Gsl In A Html Document In A Wordpress 3.5.2 (Html) Or A Microsoft Powerpoint (Html5) (Html3) (Powerpoint) (Web) (Www VoiceXML Tutorial BeVocal, Inc. 685 Clyde Avenue Mountain View, CA 94043 Part No. 520-0002-02 Copyright 2005. BeVocal, Inc. All rights reserved. 2 VOICEXML TUTORIAL Table of Contents Preface...............................................................1

More information

How To Use Voicexml On A Computer Or Phone (Windows)

How To Use Voicexml On A Computer Or Phone (Windows) Workshop Spoken Language Dialog Systems VoiceXML Rolf Schwitter schwitt@ics.mq.edu.au Macquarie University 2004 1 PhD Scholarship at Macquarie University A Natural Language Interface to a Logic Teaching

More information

VXI* IVR / IVVR. VON.x 2008 OpenSER Summit. Ivan Sixto CEO / Business Dev. Manager. San Jose CA-US, March 17th, 2008

VXI* IVR / IVVR. VON.x 2008 OpenSER Summit. Ivan Sixto CEO / Business Dev. Manager. San Jose CA-US, March 17th, 2008 VXI* IVR / IVVR San Jose CA-US, March 17th, 2008 Ivan Sixto CEO / Business Dev. Manager VON.x 2008 OpenSER Summit Index 1 About INET 2 What is VoiceXML? 3 VXI* Platforms for IVR / IVVR 4 Customer's Business

More information

VOICEXML TUTORIAL AN INTRODUCTION TO VOICEXML

VOICEXML TUTORIAL AN INTRODUCTION TO VOICEXML VOICEXML TUTORIAL AN INTRODUCTION TO VOICEXML Contents Chapter 1 - Introduction... 3 Voice Access to the Web... 3 Developing an Application... 4 Basics of VoiceXML... 4 Conclusion... 7 Chapter 2 - A Basic

More information

BeVocal VoiceXML Tutorial

BeVocal VoiceXML Tutorial BeVocal VoiceXML Tutorial Version 1.0 December 2000 BeVocal, Inc. 1380 Bordeaux Drive Sunnyvale, CA 94089 Copyright 2000. BeVocal, Inc. All rights reserved. 2 VXML QUICK REFERENCE Table of Contents Preface

More information

Interfaces de voz avanzadas con VoiceXML

Interfaces de voz avanzadas con VoiceXML Interfaces de voz avanzadas con VoiceXML Digital Revolution is coming Self driving cars Self voice services Autopilot for CAR Speaker Automatic Speech Recognition ASR DTMF keypad SIP / VoIP or TDM Micro

More information

Phone Routing Stepping Through the Basics

Phone Routing Stepping Through the Basics Ng is Phone Routing Stepping Through the Basics Version 2.6 Contents What is Phone Routing?...3 Logging in to your Ifbyphone Account...3 Configuring Different Phone Routing Functions...4 How do I purchase

More information

Cisco IOS VoiceXML Browser

Cisco IOS VoiceXML Browser Cisco IOS VoiceXML Browser Q. What is VoiceXML? A. Voice Extensible Markup Language (VoiceXML) is an XML-based creation environment for voice applications including user interfaces for use with automatic-speech-recognition

More information

Hosted Fax Mail. Hosted Fax Mail. User Guide

Hosted Fax Mail. Hosted Fax Mail. User Guide Hosted Fax Mail Hosted Fax Mail User Guide Contents 1 About this Guide... 2 2 Hosted Fax Mail... 3 3 Getting Started... 4 3.1 Logging On to the Web Portal... 4 4 Web Portal Mailbox... 6 4.1 Checking Messages

More information

Voicemail. Advanced User s Guide. Version 2.0

Voicemail. Advanced User s Guide. Version 2.0 Advanced User s Guide Version 2.0 Contents Introduction to the Documentation... 3 About the Documentation... 3 Ifbyphone on the Web... 3 Logging in to your ifbyphone Account... 3 Setting Up a Voice Mailbox...

More information

Develop Software that Speaks and Listens

Develop Software that Speaks and Listens Develop Software that Speaks and Listens Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks or registered

More information

Design Grammars for High-performance Speech Recognition

Design Grammars for High-performance Speech Recognition Design Grammars for High-performance Speech Recognition Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks

More information

IVR CRM Integration. Migrating the Call Center from Cost Center to Profit. Definitions. Rod Arends Cheryl Yaeger BenchMark Consulting International

IVR CRM Integration. Migrating the Call Center from Cost Center to Profit. Definitions. Rod Arends Cheryl Yaeger BenchMark Consulting International IVR CRM Integration Migrating the Call Center from Cost Center to Profit Rod Arends Cheryl Yaeger BenchMark Consulting International Today, more institutions are seeking ways to change their call center

More information

CCXML & the Power of Standards-Based Call Control E X E C U T I V E B R I E F I N G M A R C H 2 0 1 0

CCXML & the Power of Standards-Based Call Control E X E C U T I V E B R I E F I N G M A R C H 2 0 1 0 CCXML & the Power of Standards-Based Call Control E X E C U T I V E B R I E F I N G M A R C H 2 0 1 0 The Call Control Challenge Advanced call control functionality enables companies to more efficiently

More information

The Future of VoiceXML: VoiceXML 3 Overview. Dan Burnett, Ph.D. Dir. of Speech Technologies, Voxeo Developer Jam Session May 20, 2010

The Future of VoiceXML: VoiceXML 3 Overview. Dan Burnett, Ph.D. Dir. of Speech Technologies, Voxeo Developer Jam Session May 20, 2010 The Future of VoiceXML: VoiceXML 3 Overview Dan Burnett, Ph.D. Dir. of Speech Technologies, Voxeo Developer Jam Session May 20, 2010 Todayʼs topics V3 Motivations MVC, or DFP V3 is a presentation language

More information

Materials Software Systems Inc (MSSI). Enabling Speech on Touch Tone IVR White Paper

Materials Software Systems Inc (MSSI). Enabling Speech on Touch Tone IVR White Paper Materials Software Systems Inc (MSSI). Enabling Speech on Touch Tone IVR White Paper Reliable Customer Service and Automation is the key for Success in Hosted Interactive Voice Response Speech Enabled

More information

Abstract. Avaya Solution & Interoperability Test Lab

Abstract. Avaya Solution & Interoperability Test Lab Avaya Solution & Interoperability Test Lab Application Notes for LumenVox Automated Speech Recognizer, LumenVox Text-to-Speech Server and Call Progress Analysis with Avaya Aura Experience Portal Issue

More information

Personal Voice Call Assistant: VoiceXML and SIP in a Distributed Environment

Personal Voice Call Assistant: VoiceXML and SIP in a Distributed Environment Personal Voice Call Assistant: VoiceXML and SIP in a Distributed Environment Michael Pucher +43/1/5052830-98 pucher@ftw.at Julia Tertyshnaya +43/1/5052830-45 tertyshnaya@ftw.at Florian Wegscheider +43/1/5052830-45

More information

VoiceXML Data Logging Overview

VoiceXML Data Logging Overview Data Logging Overview - Draft 0.3-20 August 2007 Page 1 Data Logging Overview Forum Tools Committee Draft 0.3-20 August 2007 Data Logging Overview - Draft 0.3-20 August 2007 Page 1 About the Forum: Founded

More information

Specialty Answering Service. All rights reserved.

Specialty Answering Service. All rights reserved. 0 Contents 1 Introduction... 3 2 Technology... 5 2.1 VoiceXML Architecture... 6 2.2 Related Standards... 7 2.2.1 SRGS and SISR... 7 2.2.2 SSML... 7 2.2.3 PLS... 7 2.2.4 CCXML... 7 2.2.5 MSML, MSCML, MediaCTRL...

More information

Migrating Legacy IVR Applications to VoiceXML with Voxeo The advantages of a 100% VoiceXML compliant platform

Migrating Legacy IVR Applications to VoiceXML with Voxeo The advantages of a 100% VoiceXML compliant platform Migrating Legacy IVR Applications to VoiceXML with Voxeo The advantages of a 100% VoiceXML compliant platform V O I C E O B J E C T S I S V O X E O N O W C X P TABLE OF CONTENTS 1 Introduction... 2 2 About...

More information

Form. Settings, page 2 Element Data, page 7 Exit States, page 8 Audio Groups, page 9 Folder and Class Information, page 9 Events, page 10

Form. Settings, page 2 Element Data, page 7 Exit States, page 8 Audio Groups, page 9 Folder and Class Information, page 9 Events, page 10 The voice element is used to capture any input from the caller, based on application designer-specified grammars. The valid caller inputs can be specified either directly in the voice element settings

More information

10CS73:Web Programming

10CS73:Web Programming 10CS73:Web Programming Question Bank Fundamentals of Web: 1.What is WWW? 2. What are domain names? Explain domain name conversion with diagram 3.What are the difference between web browser and web server

More information

Email Signatures. Advanced User s Guide. Version 2.0

Email Signatures. Advanced User s Guide. Version 2.0 Advanced User s Guide Version 2.0 Contents Email Signatures... 3 About the Documentation... 3 Ifbyphone on the Web... 3 Copying Click-to-XyZ Code... 4 Logging in to your ifbyphone Account... 4 Web-Based

More information

AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION

AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION APPROVED BY SUPERVISORY COMMITTEE: Supervisor: AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai, B.E. in CSE THESIS

More information

Grammar Reference GRAMMAR REFERENCE 1

Grammar Reference GRAMMAR REFERENCE 1 Grammar Reference GRAMMAR REFERENCE 1 BeVocal, Inc. 685 Clyde Avenue Mountain View, CA 94043 Part No. 520-0004-02 Copyright 2004. BeVocal, Inc. All rights reserved. 2 GRAMMAR REFERENCE Table of Contents

More information

How To Develop A Voice Portal For A Business

How To Develop A Voice Portal For A Business VoiceMan Universal Voice Dialog Platform VoiceMan The Voice Portal with many purposes www.sikom.de Seite 2 Voice Computers manage to do ever more Modern voice portals can... extract key words from long

More information

Building Applications with Vision Media Servers

Building Applications with Vision Media Servers Building Applications with Vision Media Servers Getting Your Ideas to Market Fast David Asher Director, Product Management, Platform Solutions NMS at a Glance Founded in 1983, publicly traded since 1994

More information

Dialogos Voice Platform

Dialogos Voice Platform Dialogos Voice Platform Product Datasheet D i a l o g o s S p e e c h C o m m u n i c a t i o n S y s t e m s S. A. September 2007 Contents 1 Dialogos Voice Platform... 3 1.1 DVP features... 3 1.1.1 Standards-based

More information

Dialogic PowerMedia XMS VoiceXML

Dialogic PowerMedia XMS VoiceXML Dialogic PowerMedia XMS VoiceXML Reference Guide February 2013 05-2710-001 www.dialogic.com Copyright and Legal Notice Copyright 2012-2013 Dialogic Inc. All Rights Reserved. You may not reproduce this

More information

NeoIVR. Flexible & high performance IVR platform

NeoIVR. Flexible & high performance IVR platform NeoIVR Flexible & high performance IVR platform Neotiq 2 rue d Estienne d Orves 94230 CACHAN France RCS Créteil491 583 316 info@neotiq.com www.neotiq.com NeoIVR v 1.2 «A unique Interactive Voice Response

More information

Presentation / Interface 1.3

Presentation / Interface 1.3 W3C Recommendations Mobile Web Best Practices 1.0 Canonical XML Version 1.1 Cascading Style Sheets, level 2 (CSS2) SPARQL Query Results XML Format SPARQL Protocol for RDF SPARQL Query Language for RDF

More information

Speech Interface Implementation for XML Browser

Speech Interface Implementation for XML Browser Speech Interface Implementation for XML Browser Aki Teppo & Petri Vuorimaa Telecommunications Software and Multimedia Laboratory Petri.Vuorimaa@hut.fi http://www.tml.hut.fi/~pv/ Agenda Introduction X-Smiles

More information

Multimodality: The Next Wave of Mobile Interaction

Multimodality: The Next Wave of Mobile Interaction Multimodality: The Next Wave of Mobile Interaction White Paper Multimodality is exciting new technology that promises to dramatically enhance the mobile user experience by enabling network operators to

More information

Christian Leibold CMU Communicator 12.07.2005. CMU Communicator. Overview. Vorlesung Spracherkennung und Dialogsysteme. LMU Institut für Informatik

Christian Leibold CMU Communicator 12.07.2005. CMU Communicator. Overview. Vorlesung Spracherkennung und Dialogsysteme. LMU Institut für Informatik CMU Communicator Overview Content Gentner/Gentner Emulator Sphinx/Listener Phoenix Helios Dialog Manager Datetime ABE Profile Rosetta Festival Gentner/Gentner Emulator Assistive Listening Systems (ALS)

More information

Application (service logic) Core server (protocol stack) Figure 1: Division of conference server functions. Table 1: Core server signaling functions

Application (service logic) Core server (protocol stack) Figure 1: Division of conference server functions. Table 1: Core server signaling functions Programmable Conference Server Henning Schulzrinne, Kundan Singh and Xiaotao Wu Department of Computer Science, Columbia University {hgs,kns10,xiaotaow}@cs.columbia.edu Abstract Conferencing services for

More information

Application Notes for Speech Technology Center Voice Navigator 8 with Avaya Aura Experience Portal 7.0.1 - Issue 1.0

Application Notes for Speech Technology Center Voice Navigator 8 with Avaya Aura Experience Portal 7.0.1 - Issue 1.0 Avaya Solution & Interoperability Test Lab Application Notes for Speech Technology Center Voice Navigator 8 with Avaya Aura Experience Portal 7.0.1 - Issue 1.0 Abstract These application notes describe

More information

Voice Driven Animation System

Voice Driven Animation System Voice Driven Animation System Zhijin Wang Department of Computer Science University of British Columbia Abstract The goal of this term project is to develop a voice driven animation system that could take

More information

Advanced Internet Telephony Conferencing Services

Advanced Internet Telephony Conferencing Services Advanced Internet Telephony Conferencing Services Henning Schulzrinne, Kundan Singh and Xiaotao Wu Department of Computer Science, Columbia University {hgs,kns10,xiaotaow}@cs.columbia.edu Abstract Conferencing

More information

Welcome to ScopServ. ScopTEL ACD Module

Welcome to ScopServ. ScopTEL ACD Module Welcome to ScopServ ScopTEL ACD Module Table of contents 1. Introduction... 2 2. ACD manager... 3 2.1. Queues... 3 2.1.1. General parameters... 3 2.1.2. Managing notifications (queue)... 3 2.1.3. Managing

More information

Moving Enterprise Applications into VoiceXML. May 2002

Moving Enterprise Applications into VoiceXML. May 2002 Moving Enterprise Applications into VoiceXML May 2002 ViaFone Overview ViaFone connects mobile employees to to enterprise systems to to improve overall business performance. Enterprise Application Focus;

More information

Vocalité Version 2.4 Feature Overview

Vocalité Version 2.4 Feature Overview Vocalité Version 2.4 Feature Overview 1 Copyright and Trademark Information 1994 2005 Interactive Intelligence Inc./ Vonexus Inc. All rights reserved. Vonexus is a wholly-owned subsidiary of Interactive

More information

9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV

9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV Université de Technologie de Compiègne UTC +(8',$6

More information

Description: Objective: Upon completing this course, the learner will be able to meet these overall objectives:

Description: Objective: Upon completing this course, the learner will be able to meet these overall objectives: Course: Deploying Cisco Unified Contact Center Express Software v9.0 Duration: 5 Day Hands-On Lab & Lecture Course Price: $ 3,695.00 Learning Credits: 37 Description: Deploying Cisco Unified Contact Center

More information

USER GUIDE SHORETEL NETSUITE CLIENT. ShoreTel Professional Services

USER GUIDE SHORETEL NETSUITE CLIENT. ShoreTel Professional Services USER GUIDE SHORETEL NETSUITE CLIENT ShoreTel Professional Services Introduction The ShoreTel NetSuite Client application provides integration between calls made and received on a user's ShoreTel phone

More information

Telephony Fundamentals

Telephony Fundamentals An Introduction to Basic Telephony Concepts Executive Summary As telecommunications has moved from proprietary to open, standardsbased systems, advanced voice solutions have grown richer and more cost

More information

new voice technologies deliver

new voice technologies deliver new voice technologies deliver to your web environment Solution Brief Development Tools A roadmap for Web-based voice applications You understand the Web and how to develop applications for it. But do

More information

1Building Communications Solutions with Microsoft Lync Server 2010

1Building Communications Solutions with Microsoft Lync Server 2010 1Building Communications Solutions with Microsoft Lync Server 2010 WHAT S IN THIS CHAPTER? What Is Lync? Using the Lync Controls to Integrate Lync Functionality into Your Applications Building Custom Communications

More information

The Program. The Program. ALTA2004 Introduction to VoiceXML. Recommended Literature. Rolf Schwitter schwitt@ics.mq.edu.au

The Program. The Program. ALTA2004 Introduction to VoiceXML. Recommended Literature. Rolf Schwitter schwitt@ics.mq.edu.au The Program ALTA2004 Introduction to VoiceXML Rolf Schwitter schwitt@ics.mq.edu.au Saturday, 4th December 2004 1. Spoken Language Dialog Systems 2. VoiceXML and W3C Speech Interface Framework 3. VoiceXML:

More information

Wave IP 2.0 SP1. Wave ViewPoint User Guide

Wave IP 2.0 SP1. Wave ViewPoint User Guide Wave IP 2.0 SP1 Wave ViewPoint User Guide 2011 by Vertical Communications, Inc. All rights reserved. Vertical Communications and the Vertical Communications logo and combinations thereof and Vertical ViewPoint

More information

Whitepaper: Microsoft Office Communications Server 2007 R2 and Cisco Unified Communications Manager Integration Options

Whitepaper: Microsoft Office Communications Server 2007 R2 and Cisco Unified Communications Manager Integration Options Whitepaper: Microsoft Office Communications Server 2007 R2 and Cisco Unified Communications Manager Integration Options Document Summary This document provides information on several integration scenarios

More information

Realising the Potential of VoiceXML

Realising the Potential of VoiceXML Realising the Potential of VoiceXML mobilkom austria Agenda Mobilkom Austria Group VoiceXML services at Mobilkom Austria Advantages of the VoiceXML approach Realising the potential of VoiceXML Conclusion

More information

4 Understanding. Web Applications IN THIS CHAPTER. 4.1 Understand Web page development. 4.2 Understand Microsoft ASP.NET Web application development

4 Understanding. Web Applications IN THIS CHAPTER. 4.1 Understand Web page development. 4.2 Understand Microsoft ASP.NET Web application development 4 Understanding Web Applications IN THIS CHAPTER 4.1 Understand Web page development 4.2 Understand Microsoft ASP.NET Web application development 4.3 Understand Web hosting 4.4 Understand Web services

More information

Model based development of speech recognition grammar for VoiceXML. Jaspreet Singh

Model based development of speech recognition grammar for VoiceXML. Jaspreet Singh Model based development of speech recognition grammar for VoiceXML Jaspreet Singh University of Tampere School of Information Sciences Computer Science M.Sc Thesis Supervisor: Zheying Zhang December 2011

More information

Deploying Cisco Unified Contact Center Express Volume 1

Deploying Cisco Unified Contact Center Express Volume 1 Deploying Cisco Unified Contact Center Express Volume 1 Course Introduction Learner Skills and Knowledge Course Goal and Course Flow Additional References Your Training Curriculum General Administration

More information

Voice Processing Standards. Mukesh Sundaram Vice President, Engineering Genesys (an Alcatel company)

Voice Processing Standards. Mukesh Sundaram Vice President, Engineering Genesys (an Alcatel company) Voice Processing Standards Mukesh Sundaram Vice President, Engineering Genesys (an Alcatel company) Agenda Interactive Voice Response Speech Processing Computer Telephony Integration IP Telephony Standards

More information

Specialty Answering Service. All rights reserved.

Specialty Answering Service. All rights reserved. 0 Contents 1 Introduction... 2 1.1 Types of Dialog Systems... 2 2 Dialog Systems in Contact Centers... 4 2.1 Automated Call Centers... 4 3 History... 3 4 Designing Interactive Dialogs with Structured Data...

More information

Deploying Cisco Unified Contact Center Express - Digital

Deploying Cisco Unified Contact Center Express - Digital Course Code: CUCCX Vendor: Cisco Course Overview Duration: 5 RRP: 2,396 Deploying Cisco Unified Contact Center Express - Digital Overview This course provides you with hands-on experience and knowledge

More information

Dialogic PowerMedia XMS VoiceXML

Dialogic PowerMedia XMS VoiceXML Dialogic PowerMedia MS VoiceML Reference Guide May 2015 05-2710-006 www.dialogic.com Copyright and Legal Notice Copyright 2012-2015 Dialogic Corporation. All Rights Reserved. You may not reproduce this

More information

WebSphere Voice Server for Multiplatforms. VoiceXML Programmer s Guide

WebSphere Voice Server for Multiplatforms. VoiceXML Programmer s Guide WebSphere Voice Server for Multiplatforms VoiceXML Programmer s Guide WebSphere Voice Server for Multiplatforms VoiceXML Programmer s Guide Note Before using this information and the product it supports,

More information

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Oana NICOLAE Faculty of Mathematics and Computer Science, Department of Computer Science, University of Craiova, Romania oananicolae1981@yahoo.com

More information

Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall.

Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall. Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall. 5401 Butler Street, Suite 200 Pittsburgh, PA 15201 +1 (412) 408 3167 www.metronomelabs.com

More information

Developing XML Solutions with JavaServer Pages Technology

Developing XML Solutions with JavaServer Pages Technology Developing XML Solutions with JavaServer Pages Technology XML (extensible Markup Language) is a set of syntax rules and guidelines for defining text-based markup languages. XML languages have a number

More information
