1. Introduction to Spoken Dialogue Systems



Similar documents
MULTIMODAL VIRTUAL ASSISTANTS FOR CONSUMER AND ENTERPRISE

Mobile Commerce. Contents

Speech Signal Processing: An Overview

Specialty Answering Service. All rights reserved.

Christian Leibold CMU Communicator CMU Communicator. Overview. Vorlesung Spracherkennung und Dialogsysteme. LMU Institut für Informatik

INF5820, Obligatory Assignment 3: Development of a Spoken Dialogue System

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Voice User Interfaces (CS4390/5390)

Materials Software Systems Inc (MSSI). Enabling Speech on Touch Tone IVR White Paper

VOICE INFORMATION RETRIEVAL FOR DOCUMENTS. Except where reference is made to the work of others, the work described in this thesis is.

VoiceXML Tutorial. Part 1: VoiceXML Basics and Simple Forms

NTT DOCOMO Technical Journal. Shabette-Concier for Raku-Raku Smartphone Improvements to Voice Agent Service for Senior Users. 1.

Industry Guidelines on Captioning Television Programs 1 Introduction

Owner's Manual for Voice Control. The Convenient Alternative to Manual Control.

VoiceXML-Based Dialogue Systems

interactive product brochure :: Nina: The Virtual Assistant for Mobile Customer Service Apps

A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents

For Quartus II Software. This Quick Start Guide will show you. how to set up a Quartus. enter timing requirements, and

How Can Virtual Assistants Increase the Productivity of Mobile Business People?

Robustness of a Spoken Dialogue Interface for a Personal Assistant

Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications

System Requirements Specification for Content Management Service Authors: Till Halbach Røssvoll Valentin Alonso Gracia

and Passenger Information

Agenda What can we do now? And 5 years from now we will still be current!

Computer/IT Project LIST. Contact:

ABSTRACT 2. SYSTEM OVERVIEW 1. INTRODUCTION. 2.1 Speech Recognition

customer care solutions

Næringslivsperspektivet Språkbankens konferanse

What People Do with Voice Search Services

Why MCL-Bridge for SAP

Linked Transportation Services for Connected Travelers

How To Develop A Voice Portal For A Business

Develop Software that Speaks and Listens

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals

Deployment Guide A10 Networks/Infoblox Joint DNS64 and NAT64 Solution

Technologies for Voice Portal Platform

Updating your Network Infrastructure and Active Directory Technology Skills to Windows Server 2008

Australian Standard. Interactive voice response systems user interface Speech recognition AS AS

Market Intelligence and Search Results

Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications

Creating a Virtual Neighbor

This article presents an overview of these four applications. 2. DOCOMO Phone Book

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser

Speech-Enabled Interactive Voice Response Systems

Frequently Asked Questions

Leave A Speech Act After The Beep : Using the Telephone to Teach Pragmatics Douglas A. Demo, Georgetown University, United States

«e-management» Partie 3 : Management du e-commerce et du m-commerce Part 3: Management of e-commerce and m-commerce

Natural Language Interfaces (NLI s)

Effective Call Center Automation and Agent Support

An OSGi based HMI for networked vehicles. Telefónica I+D Miguel García Longarón

Text-To-Speech Technologies for Mobile Telephony Services

EnglishBusiness CompactMINIs Fast and to the point. Cheap and effective.

Mobile Search Deployments. Mike Wehrs Vice President, Nuance Mobile & Consumer Services

Internet Infrastructure

Lecture 26 Enterprise Internet Computing 1. Enterprise computing 2. Enterprise Internet computing 3. Natures of enterprise computing 4.

An Introduction to VoiceXML

VoiceXML. Erik Harborg SINTEF IKT. Presentasjon, 4. årskurs, NTNU, ICT

ERA6 OBJECTIVE QUSESTION

Unified Communications Solution for Retail Industry

Call Center Multimodal Voice Search

Requirements Definition of Communication Platform

ARMA HOUSTON TRAVEL EXPENSE GUIDELINES

9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV

Updating your Network Infrastructure and Active Directory Technology Skills to Windows Server 2008

How To Understand The Power Of Decision Science In Insurance

Unified Messaging. Definition. Overview. Topics

Visit your Google Play or App Store and search for Right Move Perth. The App is FREE to download.

Contact Center Automation

How do non-expert users exploit simultaneous inputs in multimodal interaction?

Paving the Way to Next Generation Media and Signaling VoIP Gateways

CASE STUDY. Uniphore Software Systems Contact: Website: 1

How To Sell A 5 Star Villa For A Bargain Price

IVR CRM Integration. Migrating the Call Center from Cost Center to Profit. Definitions. Rod Arends Cheryl Yaeger BenchMark Consulting International

Owner s Manual FULL PREPARATION PACKAGE MOBILE PHONE

CLASS 705 DATA PROCESSING: FINANCIAL, BUSINESS PRACTICE, MANAGE MENT, OR COST/PRICE DETERMINATION. 3..Patient record management

Microsoft SharePoint 2007: for End Users and Site Owner

A General Evaluation Framework to Assess Spoken Language Dialogue Systems: Experience with Call Center Agent Systems

2014/02/13 Sphinx Lunch

A REGIONAL ITS ARCHITECTURE FOR PUBLIC TRANSPORT IN GREATER MONTRÉAL. Vincent Morency, Senior Manager - Planning & ITS, AMT

Hicom 300 optiset E standard optiset E advance / advance plus optiset E memory

Custom Software Development Portfolio. Innovation Collaboration Evolution Results

Speech Analytics Data Reliability: Accuracy and Completeness

Deploying Cisco Unified Contact Center Express Volume 1

Intelligent Contact Management with. Flexible cloud services from VoltDelta. Contact centre Self service Customer insight

Predicting Flight Delays

Craig McWilliams Craig Burrell. Bringing Smarter, Safer Transport to NZ

Unified Communications

One Stop Shop For Educators

Classroom Training Schedule

Information Systems: Definitions and Components

Integration of Voice over Internet Protocol Experiment in Computer Engineering Technology Curriculum

Standard Languages for Developing Multimodal Applications

Closed Captioning Quality

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

face2face Upper Intermediate: Common European Framework (CEF) B2 Skills Maps

Mobile Operating Systems Lesson 07 Symbian OS

Consumer Opinions and Habits A XIRRUS STUDY

Prescriptive Analytics. A business guide

Giuseppe Riccardi, Marco Ronchetti. University of Trento

Transcription:

SoSe 2006 Projekt Sprachdialogsysteme 1. Introduction to Spoken Dialogue Systems Walther v. Hahn, Cristina Vertan {vhahn,vertan}@informatik.uni-hamburg.de Content What are Spoken dialogue systems? Types of spoken dialogue systems Examples of spoken dialogue systems What is Dialogue in spoken dialogue systems Architecture of spoken dialogue systems 1

What are Spoken Dialogue Systems (SLDS)? An SLDS is a software tool allowing communication via voice in order to perform a certain task According to the goal of the task one can identify: Transaction-based systems which allow the user to perform a transaction (e.g. reserving a seat in a train, buying or selling stocks) Information-provision systems which provide information as answer to a query (e.g. weather information, timetables for a train) The output can be in form of synthesised voice or some data Why Voice Input/Output? Most part of these systems are designed for telephones or wireless devices which: have quite small screen if at all are used for communication in real time therefore typing the input is not a solution sometimes have to be used in hands-free mode (e.g. in the car) Such systems can be used for people with certain disabilities where typing or screen reading is not possible Speech is the most natural way of communication at least at telephone 2

Dialogue systems and the market Voice portals are becoming extremely used in ecommerce Call centres are replacing at least part of their humanservices through such systems According to a statistic from 2004 there were only 250 million computers with Internet access but 1.3 billion telephones!!! in the world Recent trend: more and more services are integrated on the mobile phone chip. Areas of current usage of SLDS Remote banking Travel reservations Information enquiry Stock transactions Taxi bookings Route planning 3

2 Examples Call centre http://www.clarity-ag.net/demo+m54a708de802.html Weather forecast http://groups.csail.mit.edu/sls//about/jupiter.wav How looks dialogue with a machine? Communicator: How can I help you? User: I'd like to travel to San Francisco next Tuesday. Communicator: Leaving on Tuesday, April 24th, arriving in San Francisco. Leaving what city?user: New York [the system misunderstands, and thinks the user said Newark] Communicator: Leaving Newark Airport. Leaving at what time? User: Um, no, change Newark to New York -- J F K Airport Communicator: Leaving New York J F K. Leaving at what time? User: Anytime is fine Communicator: What airline would you prefer? User: It doesn't matter to me. Communicator: I currently have you leaving New York J F K on Tuesday, April 24th anytime during the day, arriving in San Francisco. Is this correct? User: Sure, that's right. From Bell Labs Communicator : A mixed-initiative spoken dialog system operating in the air travel-planning domain. The system acts as a travel agent, and is able to handle multi-leg trips, hotel and car reservations. http://www.bell-labs.com/project/communicator/tutorial.htm 4

Human factors involved in the design and deployment of high quality speech applications Business process analyst Speech scientist Dialogue System Sociology expert Dialog designer A little bit of History Command Line Interfaces GUI Multimodal UI VUI 1960 1980 1990 2000 5

Dialogue It is important for developers of dialogues systems to have a sound understanding of nature of human dialogue: How it is structured How people engage in dialogue Even when conversational modelling is not the main aim, understanding the complexity of human dialogue is important for knowing how to constrain systems in the interests of performance and avoidance of errors Dialogue vs. Conversation Dialogue = a process of exchanging views, sometimes with the purpose of finding a solution to a problem or to solve differences Conversation = informal spoken interaction with main purpose: developing and maintenance of social relationships 6

Key aspects of dialogue Here only those relevant for SLDS: Dialogue as discourse : the analysis of the use of words, phrases and utterances in the context of extended discourse Dialogue as purposeful activity: examination of the purposes for which people engage in dialogue, the actions that they perform and the meanings that they convey Dialogue as collaborative activity: how dialogue is best understood as joint activity in which people work together to engage in conversational turn-tacking and to achieve mutual understanding Utterances in dialogue: the nature and form of utterances produced in naturally occurring dialogues Dialogue as discourse A dialogue consists of 2 turns, one by each speaker A coherent dialogue will produce elements whose interpretation depends on the dialogue context. To participate in the dialogue it is necessary to keep track of all these elements Usually a record of entities introduced in the dialogue model is maintained. This is used for anaphora resolution Sometimes to solve such references world knowledge is necessary 7

Examples of Anaphora resolution Speaker 1: You have to obtain first the residence permit Speaker 2: What do I have to do with it? Speaker 1: You have to obtain first the residence permit in the foreigners department Speaker 2: What do I have to do with it? Record of the focus Examples of World knowledge involved in dialogue Speaker 1: You have to obtain first the residence permit Speaker 2: But my supervisor did not mentioned this. my supervior was mentioned probably in a previous turn. 8

How look dialogues in simple SLDS System: You can ask about restaurants, traffic, news, sport or weather User: Traffic <traffic dialogue> System: What next? You can ask about restaurants, traffic, news, sport or weather User: sport <sport dialogue> The system offers a choice of topic to the user and the dialogue follows the topic selected, then another choice of topic is offered How look dialogues in advanced systems <event dialogue in progress> System: What sort of event are you interested in User: A concert System: there is a concert in the university grounds at 9 this evening User: What s the weather forecast? In the middle of one dialogue the user changes the topic and the system should realise this 9

Architecture of an SLDS -1- Speech Recognition Speech Synthesis Language Understanding Language Generation Dialog Management Database Architecture of an SLDS -2- Speech Recognizer - analyses the audio speech input signal to extract linguistic units (words, phonemes etc.) Language Understanding Module - determines the meaning of the input. In most case of the existent SLDS this is reduced to the mapping on stored patterns Dialogue Management Module: manages the flow of the conversation, maintains history and context and accesses the Database Database - stores the information required by dialogues. Language Generator: formulates answers Speech Synthesiser produces the audio speech output signal 10