INDEX. List of Figures...XII List of Tables...XV 1. INTRODUCTION TO RECOGNITION OF FOR TEXT TO SPEECH CONVERSION

Size: px
Start display at page:

Download "INDEX. List of Figures...XII List of Tables...XV 1. INTRODUCTION TO RECOGNITION OF FOR TEXT TO SPEECH CONVERSION"

Transcription

1 INDEX Page No. List of Figures...XII List of Tables...XV 1. INTRODUCTION TO RECOGNITION OF FOR TEXT TO SPEECH CONVERSION 1.1 Introduction Statement of the problem Objective of the study Rational of the study Scope of the study Limitations of the study Literature review for the study A glance over related literature Some empirical study Some generic TTS frameworks MBROLA SYNTHESIZER FESTIVAL FLITE Speech Synthesis Markup Languages: An Overview Spoken Text Mark up Language (STML) Java Speech API Mark up Language (JSML) SABLE W3C Speech Synthesis Markup Language (SSML) Apple Speech Synthesis Manager Microsoft Speech API (SAPI) Microsoft Speech Application Software Development Kit (SASDK) VoiceXML (VXML) SML in VHML Sort summary of some speech Engine Linguistic studies in India...23 VI

2 C-DAC BANGALORE MATRUBHASHA API IIT MUMBAI VANI FRAMEWORK HP LABS HINDI TTS IIT KHARAGPUR- SHRUTI TTS SIMPUTER TRUST DHVANI TTS OTHER INSTITUTIONS IIT MADRAS IIIT HYDERABAD HYDERABAD CENTRAL UNIVERSITY (HCU) VAANI IISC BANGALORE -THIRUKKURAL & VAACHAKA UTKAL UNIVERSITY, ORISSA TATA INSTITUTE OF FUNDAMENTAL RESEARCH (TIFR), MUMBAI C-DAC, NOIDA COLLEGE OF ENGINEERING, GUINDY, CHENNAI Salient features of the present study Glossary of terms Organization of the Thesis...33 REFERENCES TEXT TO SPEECH CONVERSION TECHNOLOGY 2.1 Introduction Text to speech conversion - Basic methodology Naturalness Intelligibility Issues and approaches in text-to-speech synthesis Natural Language Processing (NLP) Module Text Analysis Text Normalization Phonetic Analysis Prosodic Analysis Meaning of Prosody Types of prosodic structures...45 VII

3 Rule based prediction Data-driven or stochastic methods ARCHITECTURE FOR PROSODY GENERATION Digital Signal Processing (DSP) module Human Speech Production Mechanism Types of modern synthesis Technologies Articulatory Synthesis Formant Synthesis Formant Synthesis methodology Challenges in Formant Synthesis Concatenative synthesis Approach Unit selection synthesis Diphone synthesis Domain-specific synthesis Database preparation Text to Speech Projects and Products...66 REFERENCE DESIGNING & DEVELOPMENT OF TEXT TO SPEECH CONVERSION MODEL 3.1 Introduction Concatenate Synthesis Technique Gujarati character feature The Basics Framework of a Gujarati symbol Gujarati Consonants / Vowels Concatenative Synthesis Model Base Tables and Master Database preparation Model Creating base tables Making master database empty Mater Table creation Phoneme corpus recording Model VIII

4 Phoneme Selection Phoneme Recording Silence Removal Testing Correctness Saving audio file Synthesis Engine Creation Model Text Editor Phoneme separation and searching Concatenation Playing converted audio file TTS Testing Model REFERENCE PROTOTYPE AND COMPONENTS DEVELOPMENT FOR THE TEXT TO SPEECH CONVERSION MODEL 4.1 Introduction Gujarati Text-to-speech Architecture Text Normalization Text Segmentation Wav Concatenation Software and hardware requirement Hardware requirement Software requirement Microsoft visual studio SQL Database C #.NET Free Audio Editor NAudio mansi.ttf Font True Type Font (TTF) Font development programs and its utility The Font Creator Program Need to create Gujarati font IX

5 Character list of developed Gujarati font named mansi.ttf List of Consonants List of vowels List of Digits List of special characters Database and sound file preparation Base table and master table management module Entry Empty Merge Add half consonants Add General consonants Barakhadi Add digits and special single consonants Add special consonants Barakhadi Sound recording Pre-recording process Speaker Selection Sound file format Sound files naming and storage criteria Recording process Text to speech conversion Logical Development (Algorithm) Text to speech synthesis Engine module Text area Button panel Related Microsoft Visual C# code / programs used in Text to Speech Synthesizer development / testing process Class creation for database connection Base Table data entry and master table creation Base Table data entry sub module Sound recording module Text to speech engine module Listening test module X

6 4.7 Annexure I References RESULTS, DISCUSSION, CONCLUSION AND FUTURE SCOPE FOR EXTENSION OF THE RESEARCH WORK 5.1 Introduction Performance analysis criteria for Text to speech engine model for Gujarati text recognition Performance analysis of Categorical Rating Test Clearness Speed Sound Quality Pronunciation Concentration Intonation Stress Pronunciation mistakes Performance analysis of listening test Results and discussion Conclusion Future scope Reference Publications by the candidate XI

TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE

TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE Sangam P. Borkar M.E. (Electronics)Dissertation Guided by Prof. S. P. Patil Head of Electronics Department Rajarambapu Institute of Technology Sakharale,

More information

Develop Software that Speaks and Listens

Develop Software that Speaks and Listens Develop Software that Speaks and Listens Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks or registered

More information

Text-To-Speech Technologies for Mobile Telephony Services

Text-To-Speech Technologies for Mobile Telephony Services Text-To-Speech Technologies for Mobile Telephony Services Paulseph-John Farrugia Department of Computer Science and AI, University of Malta Abstract. Text-To-Speech (TTS) systems aim to transform arbitrary

More information

Thirukkural - A Text-to-Speech Synthesis System

Thirukkural - A Text-to-Speech Synthesis System Thirukkural - A Text-to-Speech Synthesis System G. L. Jayavardhana Rama, A. G. Ramakrishnan, M Vijay Venkatesh, R. Murali Shankar Department of Electrical Engg, Indian Institute of Science, Bangalore 560012,

More information

Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System Arun Soman, Sachin Kumar S., Hemanth V. K., M. Sabarimalai Manikandan, K. P. Soman Centre for Excellence in Computational

More information

Schneps, Leila; Colmez, Coralie. Math on Trial : How Numbers Get Used and Abused in the Courtroom. New York, NY, USA: Basic Books, 2013. p i.

Schneps, Leila; Colmez, Coralie. Math on Trial : How Numbers Get Used and Abused in the Courtroom. New York, NY, USA: Basic Books, 2013. p i. New York, NY, USA: Basic Books, 2013. p i. http://site.ebrary.com/lib/mcgill/doc?id=10665296&ppg=2 New York, NY, USA: Basic Books, 2013. p ii. http://site.ebrary.com/lib/mcgill/doc?id=10665296&ppg=3 New

More information

Design Grammars for High-performance Speech Recognition

Design Grammars for High-performance Speech Recognition Design Grammars for High-performance Speech Recognition Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks

More information

9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV

9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV Université de Technologie de Compiègne UTC +(8',$6

More information

Design and Implementation of Text To Speech Conversion for Visually Impaired People

Design and Implementation of Text To Speech Conversion for Visually Impaired People Design and Implementation of Text To Speech Conversion for Visually Impaired People Itunuoluwa Isewon* Department of Computer and Information Sciences Covenant University PMB 1023, Ota, Nigeria * Corresponding

More information

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Oana NICOLAE Faculty of Mathematics and Computer Science, Department of Computer Science, University of Craiova, Romania oananicolae1981@yahoo.com

More information

An Arabic Text-To-Speech System Based on Artificial Neural Networks

An Arabic Text-To-Speech System Based on Artificial Neural Networks Journal of Computer Science 5 (3): 207-213, 2009 ISSN 1549-3636 2009 Science Publications An Arabic Text-To-Speech System Based on Artificial Neural Networks Ghadeer Al-Said and Moussa Abdallah Department

More information

Creating voices for the Festival speech synthesis system.

Creating voices for the Festival speech synthesis system. M. Hood Supervised by A. Lobb and S. Bangay G01H0708 Creating voices for the Festival speech synthesis system. Abstract This project focuses primarily on the process of creating a voice for a concatenative

More information

Pronunciation in English

Pronunciation in English The Electronic Journal for English as a Second Language Pronunciation in English March 2013 Volume 16, Number 4 Title Level Publisher Type of product Minimum Hardware Requirements Software Requirements

More information

NATURAL SOUNDING TEXT-TO-SPEECH SYNTHESIS BASED ON SYLLABLE-LIKE UNITS SAMUEL THOMAS MASTER OF SCIENCE

NATURAL SOUNDING TEXT-TO-SPEECH SYNTHESIS BASED ON SYLLABLE-LIKE UNITS SAMUEL THOMAS MASTER OF SCIENCE NATURAL SOUNDING TEXT-TO-SPEECH SYNTHESIS BASED ON SYLLABLE-LIKE UNITS A THESIS submitted by SAMUEL THOMAS for the award of the degree of MASTER OF SCIENCE (by Research) DEPARTMENT OF COMPUTER SCIENCE

More information

Web Based Maltese Language Text to Speech Synthesiser

Web Based Maltese Language Text to Speech Synthesiser Web Based Maltese Language Text to Speech Synthesiser Buhagiar Ian & Micallef Paul Faculty of ICT, Department of Computer & Communications Engineering mail@ian-b.net, pjmica@eng.um.edu.mt Abstract An important

More information

Standard Languages for Developing Multimodal Applications

Standard Languages for Developing Multimodal Applications Standard Languages for Developing Multimodal Applications James A. Larson Intel Corporation 16055 SW Walker Rd, #402, Beaverton, OR 97006 USA jim@larson-tech.com Abstract The World Wide Web Consortium

More information

A Prototype of an Arabic Diphone Speech Synthesizer in Festival

A Prototype of an Arabic Diphone Speech Synthesizer in Festival Department of Linguistics and Philology Språkteknologiprogrammet (Language Technology Programme) Master s thesis in Computational Linguistics A Prototype of an Arabic Diphone Speech Synthesizer in Festival

More information

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser

Efficient diphone database creation for MBROLA, a multilingual speech synthesiser Efficient diphone database creation for, a multilingual speech synthesiser Institute of Linguistics Adam Mickiewicz University Poznań OWD 2010 Wisła-Kopydło, Poland Why? useful for testing speech models

More information

VXI* IVR / IVVR. VON.x 2008 OpenSER Summit. Ivan Sixto CEO / Business Dev. Manager. San Jose CA-US, March 17th, 2008

VXI* IVR / IVVR. VON.x 2008 OpenSER Summit. Ivan Sixto CEO / Business Dev. Manager. San Jose CA-US, March 17th, 2008 VXI* IVR / IVVR San Jose CA-US, March 17th, 2008 Ivan Sixto CEO / Business Dev. Manager VON.x 2008 OpenSER Summit Index 1 About INET 2 What is VoiceXML? 3 VXI* Platforms for IVR / IVVR 4 Customer's Business

More information

VoiceXML-Based Dialogue Systems

VoiceXML-Based Dialogue Systems VoiceXML-Based Dialogue Systems Pavel Cenek Laboratory of Speech and Dialogue Faculty of Informatics Masaryk University Brno Agenda Dialogue system (DS) VoiceXML Frame-based DS in general 2 Computer based

More information

Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications

Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications Oxford University Press 2007. All rights reserved. 1 XML An extensible language The

More information

TABLE OF CONTENTS ABSTRACT ACKNOWLEDGEMENT LIST OF FIGURES LIST OF TABLES

TABLE OF CONTENTS ABSTRACT ACKNOWLEDGEMENT LIST OF FIGURES LIST OF TABLES TABLE OF CONTENTS ABSTRACT ACKNOWLEDGEMENT LIST OF FIGURES LIST OF TABLES ii iii x xiv CHAPTER 1: INTRODUCTION 1 1.0 Background 1 1.1 Research Motivation 4 1.2 Research Objectives 5 1.3 Project Scope 6

More information

A secure face tracking system

A secure face tracking system International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 10 (2014), pp. 959-964 International Research Publications House http://www. irphouse.com A secure face tracking

More information

From The Little SAS Book, Fifth Edition. Full book available for purchase here.

From The Little SAS Book, Fifth Edition. Full book available for purchase here. From The Little SAS Book, Fifth Edition. Full book available for purchase here. Acknowledgments ix Introducing SAS Software About This Book xi What s New xiv x Chapter 1 Getting Started Using SAS Software

More information

Support and Compatibility

Support and Compatibility Version 1.0 Frequently Asked Questions General What is Voiyager? Voiyager is a productivity platform for VoiceXML applications with Version 1.0 of Voiyager focusing on the complete development and testing

More information

CONATION: English Command Input/Output System for Computers

CONATION: English Command Input/Output System for Computers CONATION: English Command Input/Output System for Computers Kamlesh Sharma* and Dr. T. V. Prasad** * Research Scholar, ** Professor & Head Dept. of Comp. Sc. & Engg., Lingaya s University, Faridabad, India

More information

Multimodal Unit Selection for 2D Audiovisual Text-to-speech Synthesis

Multimodal Unit Selection for 2D Audiovisual Text-to-speech Synthesis Multimodal Unit Selection for 2D Audiovisual Text-to-speech Synthesis Wesley Mattheyses, Lukas Latacz, Werner Verhelst and Hichem Sahli Vrije Universiteit Brussel, Dept. ETRO, Pleinlaan 2, B-1050 Brussels,

More information

Version 2.6. Virtual Receptionist Stepping Through the Basics

Version 2.6. Virtual Receptionist Stepping Through the Basics Version 2.6 Virtual Receptionist Stepping Through the Basics Contents What is a Virtual Receptionist?...3 About the Documentation...3 Ifbyphone on the Web...3 Setting Up a Virtual Receptionist...4 Logging

More information

An Approach towards text messaging to voice message for Smart Android phone

An Approach towards text messaging to voice message for Smart Android phone An Approach towards text messaging to voice message for Smart Android phone 1 Mohammed Waseem Ashfaque, 2 Sumegh Tharewal, 3 Abdul Samad Shaikh, 4 Sayyada Sara Banu, 5 Shaikh Abdul Hannan 1 Department

More information

Problems and Prospects in Collection of Spoken Language Data

Problems and Prospects in Collection of Spoken Language Data Problems and Prospects in Collection of Spoken Language Data Kishore Prahallad+*, Suryakanth V Gangashetty*, B. Yegnanarayana*, D. Raj Reddy+ *Language Technologies Research Center (LTRC) International

More information

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points Analytic 360 Our Raison d'être Identify major choice decision points Leverage Analytical Tools and Techniques to solve problems hindering these decision points Empowerment through Intelligence Our Suite

More information

ACM Survey on PhD Production in India in Computer Science

ACM Survey on PhD Production in India in Computer Science ACM Survey on PhD Production in India in Computer Science Pankaj Jalote Director and Professor, IIIT Delhi PhD production in India in computer science has been an issue of concern over the last many years.

More information

Dialog planning in VoiceXML

Dialog planning in VoiceXML Dialog planning in VoiceXML Csapó Tamás Gábor 4 January 2011 2. VoiceXML Programming Guide VoiceXML is an XML format programming language, describing the interactions between human

More information

Enabling Speech Based Access to Information Management Systems over Wireless Network

Enabling Speech Based Access to Information Management Systems over Wireless Network Enabling Speech Based Access to Information Management Systems over Wireless Network M. Bagein, O. Pietquin, C. Ris and G. Wilfart 1 Faculté Polytechnique de Mons - TCTS Lab. Parc Initialis - Av. Copernic,

More information

31 Case Studies: Java Natural Language Tools Available on the Web

31 Case Studies: Java Natural Language Tools Available on the Web 31 Case Studies: Java Natural Language Tools Available on the Web Chapter Objectives Chapter Contents This chapter provides a number of sources for open source and free atural language understanding software

More information

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages 8-10. CHARTE NIVEAU B2 Pages 11-14

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages 8-10. CHARTE NIVEAU B2 Pages 11-14 CHARTES D'ANGLAIS SOMMAIRE CHARTE NIVEAU A1 Pages 2-4 CHARTE NIVEAU A2 Pages 5-7 CHARTE NIVEAU B1 Pages 8-10 CHARTE NIVEAU B2 Pages 11-14 CHARTE NIVEAU C1 Pages 15-17 MAJ, le 11 juin 2014 A1 Skills-based

More information

Online Recruitment - An Intelligent Approach

Online Recruitment - An Intelligent Approach Online Recruitment - An Intelligent Approach Samah Rifai and Ramzi A. Haraty Department of Computer Science and Mathematics Lebanese American University Beirut, Lebanon Email: {samah.rifai, rharaty@lau.edu.lb}

More information

Voice Driven Animation System

Voice Driven Animation System Voice Driven Animation System Zhijin Wang Department of Computer Science University of British Columbia Abstract The goal of this term project is to develop a voice driven animation system that could take

More information

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN PAGE 30 Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN Sung-Joon Park, Kyung-Ae Jang, Jae-In Kim, Myoung-Wan Koo, Chu-Shik Jhon Service Development Laboratory, KT,

More information

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE. CISY 105 Foundations of Computer Science

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE. CISY 105 Foundations of Computer Science I. Basic Course Information RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE CISY 105 Foundations of Computer Science A. Course Number and Title: CISY-105, Foundations of Computer Science B. New

More information

Building Text-To-Speech Voices in the Cloud

Building Text-To-Speech Voices in the Cloud Building Text-To-Speech Voices in the Cloud Alistair Conkie, Thomas Okken, Yeon-Jun Kim, Giuseppe Di Fabbrizio AT&T Labs Research 8 Park Avenue, Florham Park, NJ - USA {adc,tokken,yjkim,pino}@research.att.com

More information

Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications

Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications Lerato Lerato, Maletšabisa Molapo and Lehlohonolo Khoase Dept. of Maths and Computer Science, National University of Lesotho Roma

More information

ARTIFICIALLY INTELLIGENT COLLEGE ORIENTED VIRTUAL ASSISTANT

ARTIFICIALLY INTELLIGENT COLLEGE ORIENTED VIRTUAL ASSISTANT ARTIFICIALLY INTELLIGENT COLLEGE ORIENTED VIRTUAL ASSISTANT Vishmita Yashwant Shetty, Nikhil Uday Polekar, Sandipan Utpal Das, Prof. Suvarna Pansambal Department of Computer Engineering, Atharva College

More information

Robust Methods for Automatic Transcription and Alignment of Speech Signals

Robust Methods for Automatic Transcription and Alignment of Speech Signals Robust Methods for Automatic Transcription and Alignment of Speech Signals Leif Grönqvist (lgr@msi.vxu.se) Course in Speech Recognition January 2. 2004 Contents Contents 1 1 Introduction 2 2 Background

More information

Generating natural narrative speech for the Virtual Storyteller

Generating natural narrative speech for the Virtual Storyteller Generating natural narrative speech for the Virtual Storyteller M.Sc. Thesis, March 2004 Human Media Interaction Group Department of Electrical Engineering, Mathematics and Computer Science University

More information

APPLICATION NOTE. Enhance Your Outbound Voice Campaign Success Rates with AudioCodes Call Progress Detectors and Answering Machine Detectors

APPLICATION NOTE. Enhance Your Outbound Voice Campaign Success Rates with AudioCodes Call Progress Detectors and Answering Machine Detectors Enhance Your Outbound Voice Campaign Success Rates with AudioCodes Call Progress Detectors and Answering Machine Detectors Introduction Outbound and blended campaigns enable enterprises to proactively

More information

Adapting espeak for converting text into speech in Albanian

Adapting espeak for converting text into speech in Albanian www.ijcsi.org 21 Adapting espeak for converting text into speech in Albanian Mentor Hamiti 1, Ramiz Kastrati 2 1 South East European University Tetova, 1200,Macedonia 2 College Universum Ferizaj, 70000,

More information

Christian Leibold CMU Communicator 12.07.2005. CMU Communicator. Overview. Vorlesung Spracherkennung und Dialogsysteme. LMU Institut für Informatik

Christian Leibold CMU Communicator 12.07.2005. CMU Communicator. Overview. Vorlesung Spracherkennung und Dialogsysteme. LMU Institut für Informatik CMU Communicator Overview Content Gentner/Gentner Emulator Sphinx/Listener Phoenix Helios Dialog Manager Datetime ABE Profile Rosetta Festival Gentner/Gentner Emulator Assistive Listening Systems (ALS)

More information

TTP User Guide. MLLP Research Group. http://www.mllp.upv.es. Wednesday 2 nd September, 2015

TTP User Guide. MLLP Research Group. http://www.mllp.upv.es. Wednesday 2 nd September, 2015 TTP User Guide MLLP Research Group http://www.mllp.upv.es Wednesday 2 nd September, 2015 Contents 1 Introduction 3 2 Uploading media files 4 3 Reviewing transcriptions and translations 8 TTP Player 9 Help

More information

A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents

A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents A design of the transcoder to convert the VoiceXML documents into the XHTML+Voice documents JIEUN KIM, JIEUN PARK, JUNSUK PARK, DONGWON HAN Computer & Software Technology Lab, Electronics and Telecommunications

More information

How To Use Voicexml On A Computer Or Phone (Windows)

How To Use Voicexml On A Computer Or Phone (Windows) Workshop Spoken Language Dialog Systems VoiceXML Rolf Schwitter schwitt@ics.mq.edu.au Macquarie University 2004 1 PhD Scholarship at Macquarie University A Natural Language Interface to a Logic Teaching

More information

How To Develop A Voice Portal For A Business

How To Develop A Voice Portal For A Business VoiceMan Universal Voice Dialog Platform VoiceMan The Voice Portal with many purposes www.sikom.de Seite 2 Voice Computers manage to do ever more Modern voice portals can... extract key words from long

More information

TExES Texas Examinations of Educator Standards. Preparation Manual. 191 Generalist EC 6

TExES Texas Examinations of Educator Standards. Preparation Manual. 191 Generalist EC 6 TExES Texas Examinations of Educator Standards Preparation Manual 191 Generalist EC 6 Copyright 2011 by Texas Education Agency (TEA). All rights reserved. The Texas Education Agency logo and TEA are registered

More information

VoiceXML. Erik Harborg SINTEF IKT. Presentasjon, 4. årskurs, NTNU, 2007-04-17 ICT

VoiceXML. Erik Harborg SINTEF IKT. Presentasjon, 4. årskurs, NTNU, 2007-04-17 ICT VoiceXML Erik Harborg SINTEF IKT Presentasjon, 4. årskurs, NTNU, 2007-04-17 1 Content Voice as the user interface What is VoiceXML? What type of applications can be implemented? Example applications VoiceXML

More information

Voice User Interfaces (CS4390/5390)

Voice User Interfaces (CS4390/5390) Revised Syllabus February 17, 2015 Voice User Interfaces (CS4390/5390) Spring 2015 Tuesday & Thursday 3:00 4:20, CCS Room 1.0204 Instructor: Nigel Ward Office: CCS 3.0408 Phone: 747-6827 E-mail nigel@cs.utep.edu

More information

Carla Simões, t-carlas@microsoft.com. Speech Analysis and Transcription Software

Carla Simões, t-carlas@microsoft.com. Speech Analysis and Transcription Software Carla Simões, t-carlas@microsoft.com Speech Analysis and Transcription Software 1 Overview Methods for Speech Acoustic Analysis Why Speech Acoustic Analysis? Annotation Segmentation Alignment Speech Analysis

More information

Avaya Aura Orchestration Designer

Avaya Aura Orchestration Designer Avaya Aura Orchestration Designer Avaya Aura Orchestration Designer is a unified service creation environment for faster, lower cost design and deployment of voice and multimedia applications and agent

More information

WinPitch LTL II, a Multimodal Pronunciation Software

WinPitch LTL II, a Multimodal Pronunciation Software WinPitch LTL II, a Multimodal Pronunciation Software Philippe MARTIN UFRL Université Paris 7 92, Ave. de France 75013 Paris, France philippe.martin@linguist.jussieu.fr Abstract We introduce a new version

More information

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that

More information

Transcription Format

Transcription Format Representing Discourse Du Bois Transcription Format 1. Objective The purpose of this document is to describe the format to be used for producing and checking transcriptions in this course. 2. Conventions

More information

Model based development of speech recognition grammar for VoiceXML. Jaspreet Singh

Model based development of speech recognition grammar for VoiceXML. Jaspreet Singh Model based development of speech recognition grammar for VoiceXML Jaspreet Singh University of Tampere School of Information Sciences Computer Science M.Sc Thesis Supervisor: Zheying Zhang December 2011

More information

Advanced Windows Store App Development Using C#

Advanced Windows Store App Development Using C# 20485C - Version: 1 07 July 2016 Advanced Windows Store App Development Using C# Advanced Windows Store App Development Using C# 20485C - Version: 1 5 days Course Description: This course you will learn

More information

Indiana Department of Education

Indiana Department of Education GRADE 1 READING Guiding Principle: Students read a wide range of fiction, nonfiction, classic, and contemporary works, to build an understanding of texts, of themselves, and of the cultures of the United

More information

Speech Applications - Accessing information by voice. Li Haizhou, PhD Vice President, InfoTalk haizhou.li@infotalkcorp.com

Speech Applications - Accessing information by voice. Li Haizhou, PhD Vice President, InfoTalk haizhou.li@infotalkcorp.com Speech Applications - Accessing information by voice Li Haizhou, PhD Vice President, InfoTalk haizhou.li@infotalkcorp.com nabling technology for voice portal Voice portal Speech recognition & spoken dialogue

More information

Interfaces de voz avanzadas con VoiceXML

Interfaces de voz avanzadas con VoiceXML Interfaces de voz avanzadas con VoiceXML Digital Revolution is coming Self driving cars Self voice services Autopilot for CAR Speaker Automatic Speech Recognition ASR DTMF keypad SIP / VoIP or TDM Micro

More information

VoiceXML Discussion. http://www.w3.org/tr/voicexml20/

VoiceXML Discussion. http://www.w3.org/tr/voicexml20/ VoiceXML Discussion http://www.w3.org/tr/voicexml20/ Voice Extensible Markup Language (VoiceXML) o is a markup-based, declarative, programming language for creating speechbased telephony applications o

More information

Text To Speech for Bangla Language using Festival

Text To Speech for Bangla Language using Festival Text To Speech for Bangla Language using Festival Firoj Alam, Promila Kanti Nath and Mumit Khan BRAC University, Bangladesh firojalam04@yahoo.com, bappinath@hotmail.com, mumit@bracuniversity.net Abstract

More information

From Portuguese to Mirandese: Fast Porting of a Letter-to-Sound Module Using FSTs

From Portuguese to Mirandese: Fast Porting of a Letter-to-Sound Module Using FSTs From Portuguese to Mirandese: Fast Porting of a Letter-to-Sound Module Using FSTs Isabel Trancoso 1,Céu Viana 2, Manuela Barros 2, Diamantino Caseiro 1, and Sérgio Paulo 1 1 L 2 F - Spoken Language Systems

More information

Interavtive Voice Response System

Interavtive Voice Response System Interavtive Voice Response System Ms.Rashmi Janbandhu Rajiv Gandhi College Of Engineering & Reasearch rashmi.janbandhu@gmail.com M s.divya Jawle Rajiv Gandhi College Of Engineering & Reasearch djawl3e@gmail.com

More information

Zeenov Agora High Level Architecture

Zeenov Agora High Level Architecture Zeenov Agora High Level Architecture 1 Major Components i) Zeenov Agora Signaling Server Zeenov Agora Signaling Server is a web server capable of handling HTTP/HTTPS requests from Zeenov Agora web clients

More information

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning Course Outline: Course: Implementing a Data with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning Duration: 5.00 Day(s)/ 40 hrs Overview: This 5-day instructor-led course describes

More information

Development of TTS for Marathi Speech Signal Based on Prosody and Concatenation Approach

Development of TTS for Marathi Speech Signal Based on Prosody and Concatenation Approach Development of TTS for Marathi Speech Signal Based on Prosody and Concatenation Approach 1 Surendra P. Ramteke, 2 Gunjal Oza, 3 Nilima P. Patil 1,2 Department of E&TC Engineering, 3 Department of Computer

More information

SWING: A tool for modelling intonational varieties of Swedish Beskow, Jonas; Bruce, Gösta; Enflo, Laura; Granström, Björn; Schötz, Susanne

SWING: A tool for modelling intonational varieties of Swedish Beskow, Jonas; Bruce, Gösta; Enflo, Laura; Granström, Björn; Schötz, Susanne SWING: A tool for modelling intonational varieties of Swedish Beskow, Jonas; Bruce, Gösta; Enflo, Laura; Granström, Björn; Schötz, Susanne Published in: Proceedings of Fonetik 2008 Published: 2008-01-01

More information

DIXI A Generic Text-to-Speech System for European Portuguese

DIXI A Generic Text-to-Speech System for European Portuguese DIXI A Generic Text-to-Speech System for European Portuguese Sérgio Paulo, Luís C. Oliveira, Carlos Mendes, Luís Figueira, Renato Cassaca, Céu Viana 1 and Helena Moniz 1,2 L 2 F INESC-ID/IST, 1 CLUL/FLUL,

More information

! " # # $ %"&! '() *+,

!  # # $ %&! '() *+, !! " # # $ %"&! '() *+, - %(/ # 0& 1 23 245 6 7!8 7 95 29: 7 8 8 ; : : - 6 7 8 #" 3?@ABAACD8 #" #?@ABAABA E +FG,FE +FHIJE +F K *LLMM,FMN GK *LLMMH+,MN @ A BA A BA M ! 1 (3) (email), (calendar)

More information

SPEECH SYNTHESIZER BASED ON THE PROJECT MBROLA

SPEECH SYNTHESIZER BASED ON THE PROJECT MBROLA Rajs Arkadiusz, Banaszak-Piechowska Agnieszka, Drzycimski Paweł. Speech synthesizer based on the project MBROLA. Journal of Education, Health and Sport. 2015;5(12):160-164. ISSN 2391-8306. DOI http://dx.doi.org/10.5281/zenodo.35266

More information

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777 Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777 Course Outline Module 1: Introduction to Data Warehousing This module provides an introduction to the key components of a data warehousing

More information

Voice Tools Project (VTP) Creation Review

Voice Tools Project (VTP) Creation Review Voice Tools Project (VTP) Creation Review Tuesday, February 15, 2005 1 What is VTP? VTP is an Eclipse technology project that focuses on voice application tools based on W3C standards, to help these standards

More information

COMPUTER TECHNOLOGY IN TEACHING READING

COMPUTER TECHNOLOGY IN TEACHING READING Лю Пэн COMPUTER TECHNOLOGY IN TEACHING READING Effective Elementary Reading Program Effective approach must contain the following five components: 1. Phonemic awareness instruction to help children learn

More information

15.496 Data Technologies for Quantitative Finance

15.496 Data Technologies for Quantitative Finance Paul F. Mende MIT Sloan School of Management Fall 2014 Course Syllabus 15.496 Data Technologies for Quantitative Finance Course Description. This course introduces students to financial market data and

More information

The ROI. of Speech Tuning

The ROI. of Speech Tuning The ROI of Speech Tuning Executive Summary: Speech tuning is a process of improving speech applications after they have been deployed by reviewing how users interact with the system and testing changes.

More information

Blue&Me. Live life while you drive. What you can do: Introduction. What it consists of:

Blue&Me. Live life while you drive. What you can do: Introduction. What it consists of: Blue&Me Live life while you drive Introduction Blue&Me is an innovative in-car system that allows you to use your Bluetooth mobile phone and to listen to your music while you drive. Blue&Me can be controlled

More information

Emerging technologies - AJAX, VXML SOA in the travel industry

Emerging technologies - AJAX, VXML SOA in the travel industry Emerging technologies - AJAX, VXML SOA in the travel industry Siva Kantamneni Executive Architect IBM s SOA Center Of Excellence email: kantamne@us.ibm.com Tel: 813-356-4113 Contents Emerging technologies

More information

50465 - PerformancePoint 2010 Designing and Implementing Scorecards and Dashboards

50465 - PerformancePoint 2010 Designing and Implementing Scorecards and Dashboards 50465 - PerformancePoint 2010 Designing and Implementing Scorecards and Dashboards Introduction Audience At Completion Prerequisites Microsoft Certified Professional Exams Student Materials Outline Introduction

More information

Evaluation of a Segmental Durations Model for TTS

Evaluation of a Segmental Durations Model for TTS Speech NLP Session Evaluation of a Segmental Durations Model for TTS João Paulo Teixeira, Diamantino Freitas* Instituto Politécnico de Bragança *Faculdade de Engenharia da Universidade do Porto Overview

More information

Visual Studio 2008: Windows Presentation Foundation

Visual Studio 2008: Windows Presentation Foundation Visual Studio 2008: Windows Presentation Foundation Course 6460A: Three days; Instructor-Led Introduction This three-day instructor-led course provides students with the knowledge and skills to build and

More information

Program curriculum for graduate studies in Speech and Music Communication

Program curriculum for graduate studies in Speech and Music Communication Program curriculum for graduate studies in Speech and Music Communication School of Computer Science and Communication, KTH (Translated version, November 2009) Common guidelines for graduate-level studies

More information

Mother Tongue Influence on Spoken English

Mother Tongue Influence on Spoken English Mother Tongue Influence on Spoken English Shruti Pal Central Institute of Education (India) palshruti27@gmail.com Abstract Pronunciation is not a major problem in a language classroom until it hinders

More information

Project Plan Dealer Improvement Recommender System

Project Plan Dealer Improvement Recommender System Project Plan Dealer Improvement Recommender System The Capstone Experience Team Urban Science Ty Jones Ben Mastay Collin Myers Department of Computer Science and Engineering Michigan State University Spring

More information

An Introduction to VoiceXML

An Introduction to VoiceXML An Introduction to VoiceXML ART on Dialogue Models and Dialogue Systems François Mairesse University of Sheffield F.Mairesse@sheffield.ac.uk http://www.dcs.shef.ac.uk/~francois Outline What is it? Why

More information

Faculty of Telecommunications and Space Technology

Faculty of Telecommunications and Space Technology Faculty of Telecommunications and Space Technology Master of Science in Communication Network Engineering Page 1 Faculty of Telecommunications and Space Technology Master of Science in Communication Network

More information

"Charting the Course... MOC 20465 C Designing a Data Solution with Microsoft SQL Server Course Summary

Charting the Course... MOC 20465 C Designing a Data Solution with Microsoft SQL Server Course Summary Course Summary Description The focus of this five-day instructor-led course is on planning and implementing enterprise database infrastructure solutions by using SQL and other Microsoft technologies. It

More information

Longman English Interactive

Longman English Interactive Longman English Interactive Level 2 Orientation (English version) Quick Start 2 Microphone for Speaking Activities 2 Translation Setting 3 Goals and Course Organization 4 What is Longman English Interactive?

More information

Contemporary Linguistics

Contemporary Linguistics Contemporary Linguistics An Introduction Editedby WILLIAM O'GRADY MICHAEL DOBROVOLSKY FRANCIS KATAMBA LONGMAN London and New York Table of contents Dedication Epigraph Series list Acknowledgements Preface

More information

CISCO UNIFIED CUSTOMER VOICE PORTAL IMPLEMENTATION (CVPI)

CISCO UNIFIED CUSTOMER VOICE PORTAL IMPLEMENTATION (CVPI) CISCO UNIFIED CUSTOMER VOICE PORTAL IMPLEMENTATION (CVPI) Temario Learn to install, operate, and manage Cisco Unified CVP. In this course, you will learn to operate, administer, manage, and provision Cisco

More information

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction : A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction Urmila Shrawankar Dept. of Information Technology Govt. Polytechnic, Nagpur Institute Sadar, Nagpur 440001 (INDIA)

More information

CHANWOO KIM (BIRTH: APR. 9, 1976) Language Technologies Institute School of Computer Science Aug. 8, 2005 present

CHANWOO KIM (BIRTH: APR. 9, 1976) Language Technologies Institute School of Computer Science Aug. 8, 2005 present CHANWOO KIM (BIRTH: APR. 9, 1976) 2602E NSH Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, PA 15213 Phone: +1-412-726-3996 Email: chanwook@cs.cmu.edu RESEARCH INTERESTS Speech recognition system,

More information

CAMBRIDGE FIRST CERTIFICATE Listening and Speaking NEW EDITION. Sue O Connell with Louise Hashemi

CAMBRIDGE FIRST CERTIFICATE Listening and Speaking NEW EDITION. Sue O Connell with Louise Hashemi CAMBRIDGE FIRST CERTIFICATE SKILLS Series Editor: Sue O Connell CAMBRIDGE FIRST CERTIFICATE Listening and Speaking NEW EDITION Sue O Connell with Louise Hashemi PUBLISHED BY THE PRESS SYNDICATE OF THE

More information

Things to remember when transcribing speech

Things to remember when transcribing speech Notes and discussion Things to remember when transcribing speech David Crystal University of Reading Until the day comes when this journal is available in an audio or video format, we shall have to rely

More information

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals Thin Client Development and Wireless Markup Languages cont. David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh tipper@tele.pitt.edu http://www.sis.pitt.edu/~dtipper/2727.html

More information