Practical Applications of Speech Signal Processing

Save this PDF as:
 WORD  PNG  TXT  JPG

Size: px
Start display at page:

Download "Practical Applications of Speech Signal Processing"

Transcription

1 Practical Applications of Speech Signal Processing Vishu R Viswanathan TI Fellow, Director, Speech Technologies Lab DSP Solutions R&D Center Texas Instruments, Dallas, Texas March 2004 Vishu Viswanathan 1

2 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 2

3 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 3

4 Goals of the Lecture Introduce and discuss each of a number of speech signal processing areas List examples of practical applications Discuss some selected topics in each area High level presentation only March 2004 Vishu Viswanathan 4

5 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 5

6 Goal Speech Coding Reduce speech signal data rate Maintain high speech quality General Principle: Take advantage of Redundancies in the speech signal Properties of speech production and perception Applications Digital cellular telephony, voice over IP, IP phone, audio/video conferencing, PSTN trunking, secure voice communication, digital answering machines, voice mail, voice response systems, talking products March 2004 Vishu Viswanathan 6

7 Components of a Speech Coding System Sampled Speech s(n) Analyzer Channel or Encoder x(n) y(n) Medium y (n) Decoder x (n) Synthesizer s (n) Goal: Minimize data rate of y(n) while maximizing speech quality of s (n) March 2004 Vishu Viswanathan 7

8 Waveform Coders Types of Speech Coders Goal: Reproduce speech on a sample-by-sample basis High data rates, high speech quality Examples: 64 kb/s PCM (G.711), 32 kb/s ADPCM (G.726) Parametric Coders Speech production characterized by parametric models Low data rates, good speech intelligibility, communications/synthetic speech quality Examples: 2.4 kb/s LPC (FS 1015), 2.4 kb/s MELP (recent NATO standard) Analysis-by-Synthesis Coders Hybrid between waveform and parametric coders, with medium data rates Parametric models used, with excitation signal computed by minimizing error between synthesized speech and input speech Examples: 16 kb/s G.728, 8 kb/s G.729 March 2004 Vishu Viswanathan 8

9 Speech Quality Terms Used Toll quality: High-grade wireline telephone High quality Good quality Communications quality Transparent quality Formal Subjective Testing Methods Expensive, time consuming Mean opinion score (MOS): Used in all industry standards bodies Diagnostic acceptability measure (DAM): Used by US Dep t of Defense Informal and Semi-Formal Subjective Tests Pairwise or A/B comparisons Rating tests Objective Methods Signal-to-Noise Ratio, ITU P.802 (PESQ) Automatic, repeatable, useful in coder development and optimization March 2004 Vishu Viswanathan 9

10 Speech Coder Attributes Low bit rate Low quality Clean Speech Low delay Low Complexity Human Speech Bits/Second Handheld Mean Opinion Score Hands-free Milliseconds MIPS, Memory Sound Effects High bit rate High quality Noisy Speech High delay High Complexity Music March 2004 Vishu Viswanathan 10

11 Speech Coding Standards ITU Standards coder rate (kb/s) approach G Mu/A-law G ADPCM G LD-CELP G CS-ACELP G /6.3 MP/ACELP ITU standards are targeted for telephone network applications Also used in Voice over IP applications All produce toll quality speech March 2004 Vishu Viswanathan 11

12 Europe North America Japan Speech Coding Standards Digital Cellular Standards coder rate (kb/s) chan rate approach date GSM FR RPE-LTP 1987 GSM HR VSELP 1994 GSM EFR ACELP 1995 GSM AMR ACELP 1998 TIA IS VSELP 1989 TIA IS QCELP 1993 TIA Q QCELP 1995 TIA IS ACELP 1996 TIA EVRC R-ACELP 1996 TIA SMV R-ACELP 2001 PDC FR VSELP 1990 PDC HR PSI-CELP 1993 PDC EFR ACELP 1999 PDC EFR ACELP 2000 March 2004 Vishu Viswanathan 12

13 Speech Coding Standards Wideband Standards coder rate (kb/s) approach G ,56,64 SB-ADPCM G ,32 Transform ITU WB 16,24 ACELP AMR WB ACELP VMR WB ACELP Wideband: 50 Hz 7 khz (versus narrowband telephone, Hz) March 2004 Vishu Viswanathan 13

14 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 14

15 Speech Synthesis Human Speech Based Systems Suitable for known material Speech coding based Talking toys, talking books, voice prompts, voice response systems Concatenation of pre-recorded voice data Information retrieval (stock quotes, airline schedules, banking) Text-to-Speech Systems Suitable for unknown or arbitrary text Applications: /fax reading, phone access to web based services, spoken telephone directory, car navigation, locationbased services, customer service, help desk, reading machines for the blind March 2004 Vishu Viswanathan 15

16 Components of a TTS System Dictionary and Rules Text Text Analysis Letter-to- Sound Synthesizer Speech - Numerical expansion (dates, times, money) - abbreviations, acronyms -proper name id Dr. Smith lives at 23 Lakeshore Dr. Courtesy of Larry Rabiner - Phonemes -Pitch - Duration -Pauses - loudness/amplitude choice of units words, phones, diphones, dyad, syllables choice of parameters LPC, formants, waveform templates, articulatory parameters, sinusoidal parameters method of computation rules, concatenation March 2004 Vishu Viswanathan 16

17 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 17

18 Speech Recognition & Understanding Problem Recognition: Automatic recognition of human speech by machine Understanding: Interpret the meaning of recognized speech and map them to actions to be taken Applications Voice dialing (name or number dialing) in telephone, cellphone, PDA, smartphone (Safety laws against handheld cellphone use while driving) Voice command & control in telematics, cellphone, PDA, smartphone, PC, toys Voice-enabled web browsing, information retrieval (stock quotes, weather forecast, airline flight information, banking), navigation, , SMS, dictation Automated customer service and help desks Benefits: hands-free, eyes-free use; not using keypad; faster task completion; ease of use; part of multi-modal interface; cost savings March 2004 Vishu Viswanathan 18

19 March 2004 Vishu Viswanathan 19

20 Components of a Speech Recognizer speech signal word string Feature Extraction Acoustic Scoring Decoding Acoustic Models Language Models Front end Back end March 2004 Vishu Viswanathan 20

21 Speaker Dependent Small Vocabulary Isolated Words Recognition Speech Recognizer Attributes Speaker Adaptive Words Continuous Speech Syntax Semantics Speaker Independent Large Vocabulary Conversational Speech Understanding Clean Speech Handheld Hands-free Noisy Speech Low Complexity MIPS, Memory High Complexity Server Based Distributed Client Based March 2004 Vishu Viswanathan 21

22 Performance & Robustness Performance Recognition Accuracy: Word error rate (WER) or task completion rate High enough performance required for user acceptance Robustness Issues Training versus operational condition differences Background noise: extent of noise, its variability (Usually additive) Channel variability: different microphones, different telephone circuits, handheld, handsfree, handheld-handsfree (Usually convolutive) Recognizer must have means to compensate for noise and channel variabilities Out-of-vocabulary rejection capability Speaker dialect and accent variability (handled by speaker adaptation) User Interface: Very important for the success of an application March 2004 Vishu Viswanathan 22

23 Recognition in Multiple Languages Speaker-Dependent Recognition Language independent (User can enroll names for voice dialing in multiple languages!) Some Observations for Speaker-Independent Recognition Same recognition engine but different data (models, dictionary) needed Recognition grammar to handle language-specific usage differences (e.g., French speak telephone numbers in pairs; natural number dialing needed) Training requires speech databases and dictionary in the new language Automatic training tools to minimize time to develop recognition in a new language March 2004 Vishu Viswanathan 23

24 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 24

25 Speaker Recognition Speaker Verification / Authentication Problem: Use voice input to verify the user s claimed identity Applications: Secure access to premises, information (banking), services (voice dialing), etc. Issues True user acceptance traded off with impostor acceptance Total voice verification Fixed text versus free text Speaker Identification Problem: Use voice to identify speaker from a closed or open set of speakers Applications: Legal and forensic use, intelligence, security Issues: Uncooperative user, often relatively short-duration speech, noisy and/or distorted speech. March 2004 Vishu Viswanathan 25

26 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 26

27 Speech Enhancement Noise Suppression Playback Enhancement Acoustic Echo Cancellation March 2004 Vishu Viswanathan 27

28 Noise Suppression Problem Remove acoustic noise from noisy speech signal for better listenability or for improved performance of speech processing devices Requirements: No speech signal distortion, no loss of speech intelligibility, no artifacts like musical noises, natural sounding residual noise Methods Single microphone approach: spectral subtraction family of methods Multi-microphone approach: adaptive noise cancellation, microphone array based fixed or adaptive beamforming, blind signal separation March 2004 Vishu Viswanathan 28

29 Playback Enhancement Problem Enhanced playback of speech to the listener Methods Spectrally shape the speech signal prior to playback, for improved intelligibility when the listener is in a noisy environment (PA system in aircraft, airports, sports arenas) Active noise cancellation to cancel noise acoustically in listener s ears (ANC headsets) Narrowband to wideband speech extension to provide wideband speech perception March 2004 Vishu Viswanathan 29

30 Acoustic Echo Cancellation rn ( ) Downlink Signal s( n) Far End Signal loudspeaker Error Signal A E C ˆ ( ) H z H(z) channel x( n) en ( ) - yn ˆ( ) vn ( ) = un ( ) + yn ( ) + n( n) 0 microphone Uplink Signal + Near End Signal Goal: Cancel feedback from loudspeaker into microphone using adaptive linear filter March 2004 Vishu Viswanathan 30

31 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 31

32 Speech Modification Voice Conversion Convert one voice to sound like another A female voice converted to sound like a low-pitched male voice (security) Time-Scale or Rate Modification Speed up or slow down speech, while preserving naturalness Applications: talking books, pre-recorded lectures, language learning March 2004 Vishu Viswanathan 32

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification (Revision 1.0, May 2012) General VCP information Voice Communication

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

Signal Processing Technologies in Voice over IP Applications

Signal Processing Technologies in Voice over IP Applications Signal Processing Technologies in Voice over IP Applications Eli Shoval, Oren Klimker, Guy Shterlich AudioCodes Ltd. elish@audiocodes.com ; orenk@audiocodes.com ; guys@audiocodes.com Abstract In this paper,

More information

Speech Processing. Introduction to Digital. Speech Processing. The Speech Stack. Speech Coding. Speech Applications

Speech Processing. Introduction to Digital. Speech Processing. The Speech Stack. Speech Coding. Speech Applications Speech Processing Digital Speech Processing Lecture 1 Introduction to Digital Speech Processing Speech is the most natural form of human-human communications. Speech is related to language; linguistics

More information

Distributed Speech Recognition Where is 358 Madison Avenue

Distributed Speech Recognition Where is 358 Madison Avenue Distributed Speech Recognition Where is 358 Madison Avenue David Pearce Motorola Labs bdp003@motorola.com Voice & Multimodal Multimodal-enabled Voice-enabled Services User enters commands via: SPEECH KEYPAD

More information

Euterpe. Digital Voice Processor

Euterpe. Digital Voice Processor Euterpe Digital Voice Processor An Integrated System-on-Chip Solution for Speech Recognition, Text-To-Speech, Speaker Verification, Noise Suppression, Echo Cancellation and InteliMic STMicroelectronics

More information

Availability of Artificial Voice for Measuring Objective QoS of CELP CODECs and Acoustic Echo Cancellers

Availability of Artificial Voice for Measuring Objective QoS of CELP CODECs and Acoustic Echo Cancellers Availability of Artificial Voice for Measuring Objective QoS of CELP CODECs and Acoustic Echo Cancellers Nobuhiko Kitawaki*, Feng Wei*, Takeshi Yamada*, and Futoshi Asano** University of Tsukuba* and AIST**,

More information

Simple Voice over IP (VoIP) Implementation

Simple Voice over IP (VoIP) Implementation Simple Voice over IP (VoIP) Implementation ECE Department, University of Florida Abstract Voice over IP (VoIP) technology has many advantages over the traditional Public Switched Telephone Networks. In

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

ETSI TS 101 329-2 V1.1.1 (2000-07)

ETSI TS 101 329-2 V1.1.1 (2000-07) TS 101 329-2 V1.1.1 (2000-07) Technical Specification Telecommunications and Internet Protocol Harmonization Over Networks (TIPHON); End to End Quality of Service in TIPHON Systems; Part 2: Definition

More information

Objective Quality Assessment of Wideband Speech Coding using W-PESQ Measure and Artificial Voice

Objective Quality Assessment of Wideband Speech Coding using W-PESQ Measure and Artificial Voice Objective Quality Assessment of Wideband Speech Coding using W-PESQ Measure and Artificial Voice Nobuhiko Kitawaki, Kou Nagai, and Takeshi Yamada University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573

More information

SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING

SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING CPC - G10L - 2015.09 G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING This subclass covers: processing of speech or voice signals in

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information

Voice Encoding Methods for Digital Wireless Communications Systems

Voice Encoding Methods for Digital Wireless Communications Systems SOUTHERN METHODIST UNIVERSITY Voice Encoding Methods for Digital Wireless Communications Systems BY Bryan Douglas Street address city state, zip e-mail address Student ID xxx-xx-xxxx EE6302 Section 324,

More information

Thirukkural - A Text-to-Speech Synthesis System

Thirukkural - A Text-to-Speech Synthesis System Thirukkural - A Text-to-Speech Synthesis System G. L. Jayavardhana Rama, A. G. Ramakrishnan, M Vijay Venkatesh, R. Murali Shankar Department of Electrical Engg, Indian Institute of Science, Bangalore 560012,

More information

Develop Software that Speaks and Listens

Develop Software that Speaks and Listens Develop Software that Speaks and Listens Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks or registered

More information

Analog-to-Digital Voice Encoding

Analog-to-Digital Voice Encoding Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample

More information

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN PAGE 30 Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN Sung-Joon Park, Kyung-Ae Jang, Jae-In Kim, Myoung-Wan Koo, Chu-Shik Jhon Service Development Laboratory, KT,

More information

HD VoIP Sounds Better. Brief Introduction. March 2009

HD VoIP Sounds Better. Brief Introduction. March 2009 HD VoIP Sounds Better Brief Introduction March 2009 Table of Contents 1. Introduction 3 2. Technology Overview 4 3. Business Environment 5 4. Wideband Applications for Diverse Industries 6 5. AudioCodes

More information

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Oana NICOLAE Faculty of Mathematics and Computer Science, Department of Computer Science, University of Craiova, Romania oananicolae1981@yahoo.com

More information

Application of Vocoders to Wireless Communications

Application of Vocoders to Wireless Communications 1 of 5 6/21/2006 6:37 AM DSP Chips Vocoder Boards Voice Codec Units Application Notes Papers Application of Vocoders to Wireless Communications The need for increased utilization of available wireless

More information

VoIP on the iphone. Dr. Jan Linden, VP of Engineering. iphone Developer Summit

VoIP on the iphone. Dr. Jan Linden, VP of Engineering. iphone Developer Summit VoIP on the iphone Imagine the Possibilities! Dr. Jan Linden, VP of Engineering iphone Developer Summit Evolution of IP Voice and Video Processing Advanced Mobile and Multimedia Capabilities Solutions

More information

LPC ANALYSIS AND SYNTHESIS

LPC ANALYSIS AND SYNTHESIS 33 Chapter 3 LPC ANALYSIS AND SYNTHESIS 3.1 INTRODUCTION Analysis of speech signals is made to obtain the spectral information of the speech signal. Analysis of speech signal is employed in variety of

More information

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral GSM speech coding Forelesning INF 5080 Vårsemester 2004 Sources This part contains material from: Web pages Universität Bremen, Arbeitsbereich Nachrichtentechnik (ANT): Prof.K.D. Kammeyer, Jörg Bitzer,

More information

Automated Dialing of Cellular Telephones Using Speech Recognition

Automated Dialing of Cellular Telephones Using Speech Recognition Automated Dialing of Cellular Telephones Using Speech Recognition Application Report Frank Henry Dearden III Voice Control Systems, Incorporated SPRA144 October 1994 Printed on Recycled Paper IMPORTANT

More information

Technology Finds Its Voice. February 2010

Technology Finds Its Voice. February 2010 Technology Finds Its Voice February 2010 Technology Finds Its Voice Overview Voice recognition technology has been around since the early 1970s, but until recently the promise of new advances has always

More information

Speech Compression. 2.1 Introduction

Speech Compression. 2.1 Introduction Speech Compression 2 This chapter presents an introduction to speech compression techniques, together with a detailed description of speech/audio compression standards including narrowband, wideband and

More information

An Arabic Text-To-Speech System Based on Artificial Neural Networks

An Arabic Text-To-Speech System Based on Artificial Neural Networks Journal of Computer Science 5 (3): 207-213, 2009 ISSN 1549-3636 2009 Science Publications An Arabic Text-To-Speech System Based on Artificial Neural Networks Ghadeer Al-Said and Moussa Abdallah Department

More information

Voice over IP Protocols And Compression Algorithms

Voice over IP Protocols And Compression Algorithms University of Tehran Electrical and Computer Engineering School SI Lab. Weekly Presentations Voice over IP Protocols And Compression Algorithms Presented by: Neda Kazemian Amiri Agenda Introduction to

More information

Artificial Intelligence for Speech Recognition

Artificial Intelligence for Speech Recognition Artificial Intelligence for Speech Recognition Kapil Kumar 1, Neeraj Dhoundiyal 2 and Ashish Kasyap 3 1,2,3 Department of Computer Science Engineering, IIMT College of Engineering, GreaterNoida, UttarPradesh,

More information

Adjusting Voice Quality

Adjusting Voice Quality Adjusting Voice Quality Electrical Characteristics This topic describes the electrical characteristics of analog voice and the factors affecting voice quality. Factors That Affect Voice Quality The following

More information

Application Notes. Contents. Overview. Introduction. Echo in Voice over IP Systems VoIP Performance Management

Application Notes. Contents. Overview. Introduction. Echo in Voice over IP Systems VoIP Performance Management Application Notes Title Series Echo in Voice over IP Systems VoIP Performance Management Date January 2006 Overview This application note describes why echo occurs, what effects it has on voice quality,

More information

Philips Speech Recognition Systems. SpeechMagic Glossary. philips

Philips Speech Recognition Systems. SpeechMagic Glossary. philips Philips Speech Recognition Systems SpeechMagic Glossary philips A accession number Unique number assigned to an individual test or procedure by either the Hospital Information System (HIS) or Radiology

More information

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper Understanding the Transition From PESQ to POLQA An Ascom Network Testing White Paper By Dr. Irina Cotanis Prepared by: Date: Document: Dr. Irina Cotanis 6 December 2011 NT11-22759, Rev. 1.0 Ascom (2011)

More information

Tech Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement Understanding VoIP Performance. Title Series.

Tech Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement Understanding VoIP Performance. Title Series. Tech Note Title Series Voice Quality Measurement Understanding VoIP Performance Date January 2005 Overview This tech note describes commonly-used call quality measurement methods, explains the metrics

More information

EVALUATION OF KANNADA TEXT-TO-SPEECH [KTTS] SYSTEM

EVALUATION OF KANNADA TEXT-TO-SPEECH [KTTS] SYSTEM Volume 2, Issue 1, January 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: EVALUATION OF KANNADA TEXT-TO-SPEECH

More information

VOICE RECOGNITION KIT USING HM2007. Speech Recognition System. Features. Specification. Applications

VOICE RECOGNITION KIT USING HM2007. Speech Recognition System. Features. Specification. Applications VOICE RECOGNITION KIT USING HM2007 Introduction Speech Recognition System The speech recognition system is a completely assembled and easy to use programmable speech recognition circuit. Programmable,

More information

SIP Trunking and Voice over IP

SIP Trunking and Voice over IP SIP Trunking and Voice over IP Agenda What is SIP Trunking? SIP Signaling How is Voice encoded and transported? What are the Voice over IP Impairments? How is Voice Quality measured? VoIP Technology Confidential

More information

Course 4: IP Telephony and VoIP

Course 4: IP Telephony and VoIP Course 4: IP Telephony and VoIP Telecommunications Technical Curriculum Program 3: Voice Knowledge 6/9/2009 1 Telecommunications Technical Curriculum Program 1: General Industry Knowledge Course 1: General

More information

ilbc Designed For The Future

ilbc Designed For The Future ilbc White Paper -- October 15, 2004 ilbc Designed For The Future Introduction Since the introduction of VoIP, there have been concerns regarding the current low bit-rate codec standards. The main issues

More information

The BBC s Virtual Voice-over tool ALTO: Technology for Video Translation

The BBC s Virtual Voice-over tool ALTO: Technology for Video Translation The BBC s Virtual Voice-over tool ALTO: Technology for Video Translation Susanne Weber Language Technology Producer, BBC News Labs In this presentation. - Overview over the ALTO Pilot project - Machine

More information

Conference Phone Buyer s Guide

Conference Phone Buyer s Guide Conference Phone Buyer s Guide Conference Phones are essential in most organizations. Almost every business, large or small, uses their conference phone regularly. Such regular use means choosing one is

More information

Subjective test method for quantifying speaker identification accuracy of bandwidth-limited speech

Subjective test method for quantifying speaker identification accuracy of bandwidth-limited speech This article has been accepted and published on J-STAGE in advance of copyediting. Content is final as presented. IEICE Communications Express, Vol.1, 1 6 Subjective test method for quantifying speaker

More information

How to Choose an Acoustic Echo Canceller

How to Choose an Acoustic Echo Canceller How to Choose an Acoustic Echo Canceller Application Note Polycom Installed Voice Business Group September 2004 Introduction Acoustic echo cancellers (AEC) greatly enhance the audio quality of a multipoint

More information

VoIP and IP Telephony

VoIP and IP Telephony VoIP and IP Telephony Reach Out and Ping Someone ISAC Spring School 2006 21 March 2006 Anthony Kava, Sr. Network Admin Pottawattamie County IT Definition VoIP Voice over Internet Protocol Voice Transport

More information

IP PBX using SIP. Voice over Internet Protocol

IP PBX using SIP. Voice over Internet Protocol IP PBX using SIP Voice over Internet Protocol Key Components for an IP PBX setup Wireless/Fiber IP Networks (Point to point/multi point, LAN/WAN/Internet) Central or Multicast SIP Proxy/Server based Virtual

More information

VoIP Conferencing. The latest in IP technologies deliver the next level of service innovation for better meetings. Global Collaboration Services

VoIP Conferencing. The latest in IP technologies deliver the next level of service innovation for better meetings. Global Collaboration Services Global Collaboration Services VoIP Conferencing The latest in IP technologies deliver the next level of service innovation for better meetings. ENERGIZE YOUR CONNECTIONS Table of Contents > > Contents...

More information

1. Public Switched Telephone Networks vs. Internet Protocol Networks

1. Public Switched Telephone Networks vs. Internet Protocol Networks Internet Protocol (IP)/Intelligent Network (IN) Integration Tutorial Definition Internet telephony switches enable voice calls between the public switched telephone network (PSTN) and Internet protocol

More information

Speech recognition technology for mobile phones

Speech recognition technology for mobile phones Speech recognition technology for mobile phones Stefan Dobler Following the introduction of mobile phones using voice commands, speech recognition is becoming standard on mobile handsets. Features such

More information

A MATLAB Software Tool for the Introduction of Speech Coding Fundamentals in a DSP Course

A MATLAB Software Tool for the Introduction of Speech Coding Fundamentals in a DSP Course A MATLAB Software Tool for the Introduction of Speech Coding Fundamentals in a DSP Course Edward Painter, and Andreas Spanias Department of Electrical Engineering, Telecommunications Research Center Arizona

More information

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction : A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction Urmila Shrawankar Dept. of Information Technology Govt. Polytechnic, Nagpur Institute Sadar, Nagpur 440001 (INDIA)

More information

Unified Communications Group. Designing for Adoption: Real-time Audio in the Real World

Unified Communications Group. Designing for Adoption: Real-time Audio in the Real World Unified Communications Group Designing for Adoption: Real-time Audio in the Real World Information in this document, including URL and other Internet Web site references, is subject to change without notice.

More information

Optimizing Converged Cisco Networks (ONT)

Optimizing Converged Cisco Networks (ONT) Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations (Deploy) Calculating Bandwidth Requirements for VoIP Objectives Describe factors influencing encapsulation overhead and bandwidth

More information

Feature and Technical

Feature and Technical BlackBerry Mobile Voice System for SIP Gateways and the Avaya Aura Session Manager Version: 5.3 Feature and Technical Overview Published: 2013-06-19 SWD-20130619135120555 Contents 1 Overview...4 2 Features...5

More information

1. Introduction to Spoken Dialogue Systems

1. Introduction to Spoken Dialogue Systems SoSe 2006 Projekt Sprachdialogsysteme 1. Introduction to Spoken Dialogue Systems Walther v. Hahn, Cristina Vertan {vhahn,vertan}@informatik.uni-hamburg.de Content What are Spoken dialogue systems? Types

More information

230622 - DSAP - Digital Speech and Audio Processing

230622 - DSAP - Digital Speech and Audio Processing Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 230 - ETSETB - Barcelona School of Telecommunications Engineering 739 - TSC - Department of Signal Theory and Communications

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN PESQ: An Introduction White Paper Prepared by: Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800 f: +44 (0) 1473 261 880 e: info@psytechnics.com September

More information

Comparing Dual Microphone System with Different Algorithms and Distances between Microphones.

Comparing Dual Microphone System with Different Algorithms and Distances between Microphones. Master Thesis Electrical Engineering May 2013 Comparing Dual Microphone System with Different Algorithms and Distances between Microphones. Ariful Islam Shafinaz Shahjahan Nitu This thesis is presented

More information

THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION PERFORMANCE

THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION PERFORMANCE THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION PERFORMANCE L. Besacier, C. Bergamini, D. Vaufreydaz, E. Castelli Laboratoire CLIPS-IMAG, équipe GEOD, Université Joseph Fourier, B.P. 53,

More information

David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2.

David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2. VoIP QoS Factors David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2 VoIP QoS Internet Telephone Quality of Service factors Voice

More information

OPERATOR ASSISTANCE (*0) - Immediate operator support is available by pressing *0 on your telephone keypad*.

OPERATOR ASSISTANCE (*0) - Immediate operator support is available by pressing *0 on your telephone keypad*. In Short: How to Conduct a Conference Call 1. Dial in to the system using either the toll or toll-free domestic phone number or the international phone number that was supplied to you. 2. Enter your HOST

More information

Developing an Isolated Word Recognition System in MATLAB

Developing an Isolated Word Recognition System in MATLAB MATLAB Digest Developing an Isolated Word Recognition System in MATLAB By Daryl Ning Speech-recognition technology is embedded in voice-activated routing systems at customer call centres, voice dialling

More information

Lecture 1-8: Audio Recording Systems

Lecture 1-8: Audio Recording Systems Lecture 1-8: Audio Recording Systems Overview 1. Why do we need to record speech? We need audio recordings of speech for a number of reasons: for off-line analysis, so that we can listen to and transcribe

More information

Images Scientific SR-07 Speech Recognition Circuit Article

Images Scientific SR-07 Speech Recognition Circuit Article Images Scientific SR-07 Speech Recognition Circuit Article This article details the construction and building of a stand alone trainable speech recognition circuit that may be interfaced to control just

More information

Forum 500 Forum 5000 Voice Portal Planning System Forum 500(0) Auto Attendant

Forum 500 Forum 5000 Voice Portal Planning System Forum 500(0) Auto Attendant Forum 500 Forum 5000 Voice Portal Planning System Forum 500(0) Auto Attendant User Guide Welcome to Proximus Thank you for choosing a Proximus product that stands for the best in quality matched with high

More information

MP3 Based Digital Audio Announcement For Mass Transit Systems

MP3 Based Digital Audio Announcement For Mass Transit Systems MP3 Based Digital Audio Announcement For Mass Transit Systems Tobias Maisch INIT Innovations in Transportation, Inc. Chesapeake, VA Technology ABSTRACT One measure to make Mass Transit Systems more userfriendly

More information

Voice Extreme Module Speech Recognition Module

Voice Extreme Module Speech Recognition Module Voice Extreme Module Speech Recognition Module Data sheet Description The Voice Extreme (VE) Module simplifies the design of speech recognition products by integrating all key components onto a single

More information

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

To help manage calls:

To help manage calls: Mobile Phone Feature Definitions To help manage calls: Call waiting and call hold Allows you to accept a second incoming call with out losing the original call, then switch back and forth between them.

More information

A Microphone Array for Hearing Aids

A Microphone Array for Hearing Aids A Microphone Array for Hearing Aids Bernard Widrow Professor of Electrical Engineering Stanford University widrow@isl.stanford.edu Abstract A directional acoustic receiving system is constructed in the

More information

Enterprise Voice Technology Solutions: A Primer

Enterprise Voice Technology Solutions: A Primer Cognizant 20-20 Insights Enterprise Voice Technology Solutions: A Primer A successful enterprise voice journey starts with clearly understanding the range of technology components and options, and often

More information

CCITT G.728. CODING OF SPEECH AT 16 kbit/s USING LOW-DELAY CODE EXCITED LINEAR PREDICTION

CCITT G.728. CODING OF SPEECH AT 16 kbit/s USING LOW-DELAY CODE EXCITED LINEAR PREDICTION INTERNATIONAL TELECOMMUNICATION UNION CCITT G.728 THE INTERNATIONAL (09/92) TELEGRAPH AND TELEPHONE CONSULTATIVE COMMITTEE GENERAL ASPECTS OF DIGITAL TRANSMISSION SYSTEMS; TERMINAL EQUIPMENTS CODING OF

More information

HANDS FREE COMMUNICATION (UConnect ) IF EQUIPPED

HANDS FREE COMMUNICATION (UConnect ) IF EQUIPPED UConnect Hands Free Communications- Complete Instructions HANDS FREE COMMUNICATION (UConnect ) IF EQUIPPED UConnect is a voice-activated, hands-free, in- vehicle communications system. UConnect allows

More information

High Definition Wideband

High Definition Wideband Polaris Communications Whitepaper High Definition Wideband By extending telephone bandwidth to 7 khz and beyond, it is clear that one can markedly reduce fatigue, improve concentration and increase intelligibility

More information

Active Monitoring of Voice over IP Services with Malden

Active Monitoring of Voice over IP Services with Malden Active Monitoring of Voice over IP Services with Malden Introduction Active Monitoring describes the process of evaluating telecommunications system performance with intrusive tests. It differs from passive

More information

Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31

Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31 Disclaimer: This document was part of the First European DSP Education and Research Conference. It may have been written by someone whose native language is not English. TI assumes no liability for the

More information

On the move: technology Great technology makes for great sound

On the move: technology Great technology makes for great sound On the move: technology AcuSpeak (Pulsar 260) Corded headsets featuring AcuSpeak technology are ideal for people who work in noisy environments, because the intelligent microphone selects only the human

More information

T Digital Signal Processing and Filtering. GSM Codec. Kristo Lehtonen 55788E

T Digital Signal Processing and Filtering. GSM Codec. Kristo Lehtonen 55788E T-61.246 Digital Signal Processing and Filtering GSM Codec T-61.246 Digital Signal Processing and Filtering 2(14) 1. Table of contents 1. Table of contents...2 2. Introduction...3 3. Speech creation...4

More information

User Manual. Please read this manual carefully before using the Phoenix Octopus

User Manual. Please read this manual carefully before using the Phoenix Octopus User Manual Please read this manual carefully before using the Phoenix Octopus For additional help and updates, refer to our website To contact Phoenix Audio for support, please send a detailed e-mail

More information

BRINGING VOIP TO THE CONFERENCE ROOM: HOW IT MANAGERS CAN ENHANCE THE USER EXPERIENCE

BRINGING VOIP TO THE CONFERENCE ROOM: HOW IT MANAGERS CAN ENHANCE THE USER EXPERIENCE BRINGING VOIP TO THE CONFERENCE ROOM: HOW IT MANAGERS CAN ENHANCE THE USER EXPERIENCE EXECUTIVE SUMMARY: Voice over IP is one of the fastest growing technologies and in just a few years will be in 80%

More information

Speech-Enabled Interactive Voice Response Systems

Speech-Enabled Interactive Voice Response Systems Speech-Enabled Interactive Voice Response Systems Definition Serving as a bridge between people and computer databases, interactive voice response systems (IVRs) connect telephone users with the information

More information

Nuance Audio Input Specification

Nuance Audio Input Specification c Nuance Audio Input Specification NUANCE MOBILE DIVISION 2015-2016 Nuance Communications, Inc. All rights reserved. ABOUT THIS DOCUMENT This document describes requirements and best practices for the

More information

MPEG-H Audio System for Broadcasting

MPEG-H Audio System for Broadcasting MPEG-H Audio System for Broadcasting ITU-R Workshop Topics on the Future of Audio in Broadcasting Jan Plogsties Challenges of a Changing Landscape Immersion Compelling sound experience through sound that

More information

C E D A T 8 5. Innovating services and technologies for speech content management

C E D A T 8 5. Innovating services and technologies for speech content management C E D A T 8 5 Innovating services and technologies for speech content management Company profile 25 years experience in the market of transcription/reporting services; Cedat 85 Group: Cedat 85 srl Subtitle

More information

Can active noise cancelling improve speech intelligibility even in a single-ear headset? By Horst Gether Product Manager, ams AG

Can active noise cancelling improve speech intelligibility even in a single-ear headset? By Horst Gether Product Manager, ams AG Technical article from ams AG Can active noise cancelling improve speech intelligibility even in a single-ear headset? By Horst Gether Product Manager, ams AG www.ams.com/anc Mobile phones can work amazingly

More information

CA Unified Communications Monitor

CA Unified Communications Monitor CA Unified Communications Monitor Addressing Echo in VoIP Systems Understanding and Monitoring Echo Cancellation for Optimal VoIP Performance This Documentation, which includes embedded help systems and

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

Echo Cancellation. Definition. Overview. Topics

Echo Cancellation. Definition. Overview. Topics Echo Cancellation Definition Wireless phones are increasingly being regarded as essential communications tools, dramatically impacting how people approach day-to-day personal and business communications.

More information

User Guide. BlackBerry 8120 Smartphone

User Guide. BlackBerry 8120 Smartphone User Guide BlackBerry 8120 Smartphone SWDT215510-215510-1001044648-001 Contents BlackBerry basics... 11 About typing input methods... 11 Type text using SureType technology... 11 Switch typing input methods...

More information

IP- PBX. Functionality Options

IP- PBX. Functionality Options IP- PBX Functionality Options With the powerful features integrated in the AtomOS system from AtomAmpd, installing & configuring a cost- effective and extensible VoIP solution is easily possible. 4/26/10

More information

Voyager Legend. User Guide

Voyager Legend. User Guide Voyager Legend User Guide Contents What's in the Box 3 Accessories 4 Headset Overview 5 Pairing 6 Get Paired 6 Pair another phone 6 Charge 7 Fit 8 Change the eartip 8 Wear on the left or right 8 The Basics

More information

Using Basic Call Management Functions

Using Basic Call Management Functions Using Basic Call Management Functions This chapter provides instructions on using your basic phone functions. It includes the following sections: Managing the Audio Source and Volume, page 28 Answering

More information

Internet Telephony Terminology

Internet Telephony Terminology Internet Telephony Terminology Understanding the business phone system world can be a daunting task to a lay person who just wants a system that serves his or her business needs. The purpose of this paper

More information

Application Notes for Konftel 300 Conference Unit with Avaya IP Office - Issue 1.0

Application Notes for Konftel 300 Conference Unit with Avaya IP Office - Issue 1.0 Avaya Solution & Interoperability Test Lab Application Notes for Konftel 300 Conference Unit with Avaya IP Office - Issue 1.0 Abstract These Application Notes document the steps for configuring the Konftel

More information

VoIP Voice and Fax Signal Processing

VoIP Voice and Fax Signal Processing Brochure More information from http://www.researchandmarkets.com/reports/2325511/ VoIP Voice and Fax Signal Processing Description: A complete and systematic treatment of signal processing for VoIP voice

More information

A Smart Telephone Answering Machine with Voice Message Forwarding Capability

A Smart Telephone Answering Machine with Voice Message Forwarding Capability A Smart Telephone Answering Machine with Voice Message Forwarding Capability Chih-Hung Huang 1 Cheng Wen 2 Kuang-Chiung Chang 3 1 Department of Information Management, Lunghwa University of Science and

More information

Linear Predictive Coding

Linear Predictive Coding Linear Predictive Coding Jeremy Bradbury December 5, 2000 0 Outline I. Proposal II. Introduction A. Speech Coding B. Voice Coders C. LPC Overview III. Historical Perspective of Linear Predictive Coding

More information