Language considerations for developing VoiceXML in German




This section contains information that is specific to German. If you are developing German voice applications, use the information in this section instead of the equivalent US English sections.

- Built-in field types and grammars
- Predefined events
- Built-in commands
- Specifying character encoding
- Testing built-in field types

Built-in field types and grammars

Table 1 shows the built-in field types and grammars for German.

Table 1. German built-in types. Brackets [] around a keyword mean that the keyword is optional. The vertical-bar symbol | indicates a choice between two or more keywords.

boolean

Users can say one of the positive responses ja, jawohl, stimmt, klar, [das ist] richtig, positiv, OK, natürlich, sicher, or auf jeden Fall, or one of the negative responses nein, falsch, negativ, or auf [gar] keinen Fall. Variations on ja (joh) and nein (nee, nö) may also be used.

Users can also provide DTMF input: 1 is yes, and 2 is no.

The return value sent is a boolean true for a positive response or false for a negative response. If the field name is subsequently used in a value attribute within a prompt, the TTS engine will speak ja or nein.

currency

Users can say currency values as a combination of a Euro component and a Cent component, such as siebenundzwanzig Euro and fünfzig Cent. Users can also say one of these components without the other. If both components are present, they can be either not separated or separated by the word und, as in zwanzig Euro siebzig and zwanzig Euro und siebzig Cent.

A Euro component may be in any of the formats: null Euro, ein Euro, m Euro(s) (where 2 <= m <= 999,999,999) and m (where 1 <= m <= 999,999,999).

A Cent component may be in any of the formats: null Cent, ein Cent, n Cent(s) (where 2 <= n <= 99). If both components are present, the Cent component can also have the format n (where 0 <= n <= 99).

If users say just an integer value m (where 1 <= m <= 999,999,999), it is interpreted as a number of Euros. Users can also speak two numerical values, where the first is between 0 and 999,999,999 and the second is between 0 and 99. This is interpreted as a number of Euros followed by a number of Cents.

Franken and Rappen may be used as currency types in place of Euro, Euros, Cent, and Cents in the above specification. Similarly, Dollar, Dollars, Cent, and Cents may be used. Both Euro and Euros can be used as the plural of Euro.

Users can also provide DTMF input using the numbers 0 through 9 and optionally the * key (to indicate a decimal point), and must terminate DTMF entry using the # key.

The return value sent is a string in the format UUUddddddddd.cc, where UUU is a currency indicator (CHF, EUR, USD). If the field name is subsequently used in a value attribute within a prompt, the IBM TTS engine will speak the currency value. For example, the IBM TTS engine speaks EUR9,30 as neun Komma drei null Euro.

Note: The above example only applies to currency, time, and number built-in grammars. For boolean, date, phone, and digits built-in grammars, the example is only correct if the field value is subsequently used in a <say-as> element with an interpret-as attribute value.
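An application that receives a currency field value usually needs the currency indicator and the amount separately. The following is a minimal Python sketch of splitting the UUUddddddddd.cc format described above. The function name parse_currency is hypothetical, and the sketch assumes that the fractional part, when present, always has exactly two digits; the source does not say whether the fraction can be omitted, so the pattern treats it as optional.

```python
from __future__ import annotations

import re
from decimal import Decimal

# Assumed shape of the documented format UUUddddddddd.cc:
# a currency indicator, up to nine digits, and an optional two-digit fraction.
_CURRENCY_VALUE = re.compile(r"(CHF|EUR|USD)([0-9]{1,9})(?:\.([0-9]{2}))?")


def parse_currency(value: str) -> tuple[str, Decimal]:
    """Split a currency field value into (currency indicator, amount)."""
    match = _CURRENCY_VALUE.fullmatch(value)
    if match is None:
        raise ValueError(f"unexpected currency value: {value!r}")
    code, units, cents = match.groups()
    # Decimal avoids the rounding surprises of binary floats for money.
    return code, Decimal(f"{units}.{cents or '00'}")
```

For instance, parse_currency("EUR9.30") yields the indicator EUR together with a Decimal amount, while a string that does not fit the pattern raises ValueError so that the caller can re-prompt.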

date

Users can say a date using months, days and years, as well as the words gestern, heute, and morgen. The year must be between 1900 and 2099.

Users can speak years from 1901 to 1999 as neunzehn <1- or 2-digit number> or neunzehn hundert [und] <1- or 2-digit number>, such as neunzehn vierundsiebzig and neunzehn hundert [und] vierundsiebzig. Users can speak years from 2001 to 2099 as zweitausend [und] <1- or 2-digit number>, such as zweitausend [und] vier and zweitausend [und] fünfundachtzig. Users can also specify a 2-digit year (such as einundneunzig). A 2-digit year will be interpreted as being between 1911 and 2010.

The following common constructs for dates are supported: dritter März zweitausend [und] eins, der dritte März zweitausend [und] eins, and (am | den) dritten März zweitausend [und] eins. Users can say Januar or Jänner for January and Juni or Juno for June.

Users can also say the day and month without specifying a year. This will be interpreted as a date in the current year.

Any of these constructs can be preceded by a day of the week, such as Donnerstag dritter März zweitausend [und] eins, Donnerstag der dritte März zweitausend [und] eins, and [am] Donnerstag den dritten März zweitausend [und] eins. However, the day of the week is ignored, so the specified date is accepted whether or not it falls on that day.

Users can also provide DTMF input in the form yyyymmdd.

Note: The date grammar does not perform leap year calculations. February 29th is accepted as a valid date regardless of the year. If desired, your application or servlet can perform the required calculations.

The return value sent is a string in the format yyyymmdd, with the VoiceXML browser returning a ? in any positions omitted in spoken input. If the value is subsequently spoken in <say-as> with the interpret-as value vxml:date, then it is spoken as a date appropriate to this language.

digits

Users can say non-negative integer values as strings of individual digits (0 through 9). For example, a user could say eins zwo drei vier fünf sechs. Users can say 2 by saying zwei or zwo, and 5 by saying fünf or fünef.

Users can also provide DTMF input using the numbers 0 through 9, and must terminate DTMF entry using the # key.

The return value sent is a string of one or more digits. If the result is subsequently used in <say-as> with the interpret-as value vxml:digits, it will be spoken as a sequence of digits appropriate to the current language. For example, the TTS engine speaks 123456 as eins zwei drei vier fünf sechs.

number

Users can say numbers (that is, positive and negative integers, 0, and decimals) from -999,999,999.9999 to 999,999,999.9999. Users can say the word Komma (to indicate a decimal point) and plus or minus (to indicate a positive or negative number). Users can say 2 by saying zwei or zwo, 5 by saying fünf or fünef, and 50 by saying fünfzig or fuffzig.

Users can also provide DTMF input using the numbers 0 through 9 and optionally the * key (to indicate a comma), and must terminate DTMF entry using the # key. Only positive numbers can be entered using DTMF.

The return value sent is a string of one or more digits, 0 through 9, with a decimal point and a + or - sign as applicable. If the field is subsequently spoken in <say-as> with the interpret-as value vxml:type, where type is the number type you want to specify, then it is spoken as that type of number appropriate to this language. For example, the TTS engine speaks "123456" as einhundert dreiundzwanzig tausend vierhundert sechsundfünfzig. Use <say-as interpret-as="vxml:digits"> to have the number said as a string of digits.
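Because the date grammar accepts February 29th in any year, the leap-year check mentioned in the note on the date type is left to the application. The following is a minimal Python sketch of such a check on the yyyymmdd return value. The function name to_calendar_date is hypothetical, and treating any ? position as "incomplete, ask again" is an assumption of this sketch, not documented browser behavior.

```python
from __future__ import annotations

import datetime
import re


def to_calendar_date(value: str) -> datetime.date:
    """Convert a yyyymmdd field value into a real calendar date.

    Raises ValueError if the value is malformed, still contains '?'
    placeholders, or names a day that does not exist (such as
    February 29th in a non-leap year).
    """
    if re.fullmatch(r"[0-9?]{8}", value) is None:
        raise ValueError(f"not a yyyymmdd value: {value!r}")
    if "?" in value:
        # Assumption: the application re-prompts for the omitted parts.
        raise ValueError(f"date is incomplete: {value!r}")
    year, month, day = int(value[:4]), int(value[4:6]), int(value[6:])
    # datetime.date performs the leap-year and month-length checks.
    return datetime.date(year, month, day)
```

With this check, 20240229 is accepted, whereas 20230229 raises ValueError even though the grammar itself returned it.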

phone

Users can say a telephone number, including the optional words Durchwahl, Klappe, or Nebenstelle to indicate an extension. In addition to digits (1 to 9), users can use double digits (such as 49) and the phrases zwei mal [die]... and drei mal [die]... before a digit. Users can also use the word hundert, which is interpreted as 00.

Users can also provide DTMF input using the numbers 0 through 9 and optionally the * key (to represent the word extension), and must terminate DTMF entry using the # key.

The return value sent is a string of digits which includes an x if an extension was specified. If the field is subsequently spoken in <say-as> with the interpret-as value vxml:phone, then it is spoken as a phone number appropriate to this language.

time

Users can say a time of day using hours and minutes in either 12- or 24-hour format, as well as the words and phrases jetzt, [jetzt] sofort, and [jetzt] gleich. Times can be stated in any of the following formats:

    [morgens | am Morgen | vormittags | am Vormittag | mittags | nachmittags | am Nachmittag | abends | am Abend | nachts | in der Nacht | in der Früh] ...
    ... [eine Minute (nach | vor)] n [Uhr] ...
    ... [um] [m (nach | vor)] n [Uhr] ...
    ... [um] [m Minuten (nach | vor)] n [Uhr] ...
    ... [um] [viertel (nach | vor)] n ...
    ... [um] [viertel | dreiviertel] n ...
    ... [um] [halb] n ...
    ... [um] N Uhr (M) ...
    ... [morgens | am Morgen | vormittags | am Vormittag | mittags | nachmittags | am Nachmittag | abends | am Abend | nachts | in der Nacht | in der Früh | früh]
    [um] Mitternacht
    [am] Mittag
    12 Uhr Mittag

where 1 <= m <= 20, 1 <= M <= 59, 1 <= n <= 12, and 0 <= N <= 24.

Users can also provide DTMF input using the numbers 0 through 9.

The return value sent is a string in the format hhmmx, where x is a for AM, p for PM, h for 24-hour format, or ? if unspecified or ambiguous. For DTMF input, the return value will always be h or ?, since there is no mechanism for specifying AM or PM. If the field is subsequently spoken in <say-as> with the interpret-as value vxml:time, then it is spoken as a time appropriate to this language.

Predefined events

Table 2 shows the German default event-handler messages for the error, help, and nomatch events.

Table 2. German predefined events and event-handler messages

Event: error.badfetch, error.noauthorization, error.semantic, error.unsupported.element
Default event-handler message: Verarbeitungsfehler - Das Programm wird beendet

Event: help
Default event-handler message: Keine weitere Hilfe verfügbar

Event: nomatch
Default event-handler message: Entschuldigung, ich habe Sie nicht verstanden
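Returning to the time type in Table 1: the hhmmx return value has to be decoded before an application can compare or store it. The following is a minimal Python sketch of one way to do that. The function name parse_time is hypothetical, returning None for the ? marker is an assumption about how an application might signal "ask the caller to clarify", and hour values that Python cannot represent (such as 24) are simply rejected.

```python
from __future__ import annotations

import datetime
import re


def parse_time(value: str) -> datetime.time | None:
    """Decode an hhmmx field value; return None if AM/PM is ambiguous."""
    match = re.fullmatch(r"([0-9]{2})([0-9]{2})([aph?])", value)
    if match is None:
        raise ValueError(f"not an hhmmx value: {value!r}")
    hour, minute, marker = int(match[1]), int(match[2]), match[3]
    if marker == "?":
        return None  # unspecified or ambiguous: the caller must decide
    if marker in "ap":
        if hour > 12:
            raise ValueError(f"hour out of range for 12-hour time: {value!r}")
        # 12 AM is 00:xx and 12 PM is 12:xx on the 24-hour clock.
        hour = hour % 12 + (12 if marker == "p" else 0)
    # datetime.time raises ValueError for hours above 23 or minutes above 59.
    return datetime.time(hour, minute)
```

Under these assumptions, 0930p decodes to 21:30 and 1430h to 14:30, while 0300? yields None so that the dialog can ask whether morning or afternoon was meant.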

Built-in commands

Table 3 shows the built-in VoiceXML browser commands for German:

Table 3. German built-in VoiceXML browser commands

Grammar name: Quiet/Cancel
Valid user utterances: [bitte] Ruhe [bitte] sei [bitte] still abbrechen
VoiceXML browser response: When barge-in is enabled, stops the current spoken output and waits for further instructions from the user.

Grammar name: Help
Valid user utterances: Hilfe zu Hilfe wie bitte
VoiceXML browser response: Plays the help event-handler message.

Specifying character encoding

The default character encoding for XML documents is UTF-8, which has 7-bit ASCII as a proper subset. Character codes in VoiceXML documents that are greater than 127 (such as the special characters è or ä) will therefore be interpreted as the first byte of a multi-byte UTF-8 sequence. You can force the XML parser to use another code page by specifying the desired encoding as the very first thing in the file:

    <?xml version="1.0" encoding="iso-8859-1"?>
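The effect of the encoding declaration can be reproduced outside a VoiceXML browser. The following minimal Python sketch uses the standard library's xml.etree.ElementTree parser as a stand-in for the browser's XML parser, which is an assumption; the helper name prompt_text and the sample word are illustrative only. It parses the same ISO-8859-1 bytes once with and once without the declaration.

```python
import xml.etree.ElementTree as ET

# In ISO-8859-1 the letter "ä" is the single byte 0xE4, which is not a
# complete character when the bytes are read as UTF-8.
body = "<prompt>Bestätigung</prompt>".encode("iso-8859-1")

declared = b'<?xml version="1.0" encoding="iso-8859-1"?>' + body
undeclared = b'<?xml version="1.0"?>' + body  # parser falls back to UTF-8


def prompt_text(document: bytes):
    """Return the prompt text, or None if the bytes are not well-formed XML."""
    try:
        return ET.fromstring(document).text
    except ET.ParseError:
        return None


print(prompt_text(declared))
print(prompt_text(undeclared))
```

The declared document parses and yields the German text, while the undeclared one is rejected because the byte for ä is not valid UTF-8. Saving the file as UTF-8 instead would make the declaration unnecessary.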

Testing built-in field types

Table 4 provides examples of the types of input you might specify when testing a German voice application that uses the built-in field types.

Table 4. Sample input for German built-in field types

boolean: ja, ok, natürlich, richtig, positiv, sicher, nein, falsch, negativ, auf keinen Fall

currency: siebenundzwanzig neunzig drei Euro fünfzig sieben Franken und neunundneunzig Rappen zehn Millionen Euro

date: fünfter Mai März der siebzehnte Dezember neunundneunzig gestern heute morgen

digits: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, zwo

number: zehn Millionen fünf hundert tausend und dreiundfünfzig minus eins komma fünf plus eins komma fünf

phone: sieben drei fünf acht vier neun null null vier neun siebenundachtzig sechsundvierzig drei drei sechzehn vierzig sieben zweimal die null dreizehn Durchwahl zwölf Nebenstelle acht sieben fünf drei null null achthundert vier vier sechs vier

time: ein Uhr ein Uhr sieben halb drei viertel vor zehn Uhr abends zehn vor sechs jetzt