Practical Applications of Speech Signal Processing

Size: px
Start display at page:

Download "Practical Applications of Speech Signal Processing"

Transcription

1 Practical Applications of Speech Signal Processing Vishu R Viswanathan TI Fellow, Director, Speech Technologies Lab DSP Solutions R&D Center Texas Instruments, Dallas, Texas v-viswanathan@ti.com March 2004 Vishu Viswanathan 1

2 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 2

3 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 3

4 Goals of the Lecture Introduce and discuss each of a number of speech signal processing areas List examples of practical applications Discuss some selected topics in each area High level presentation only March 2004 Vishu Viswanathan 4

5 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 5

6 Goal Speech Coding Reduce speech signal data rate Maintain high speech quality General Principle: Take advantage of Redundancies in the speech signal Properties of speech production and perception Applications Digital cellular telephony, voice over IP, IP phone, audio/video conferencing, PSTN trunking, secure voice communication, digital answering machines, voice mail, voice response systems, talking products March 2004 Vishu Viswanathan 6

7 Components of a Speech Coding System Sampled Speech s(n) Analyzer Channel or Encoder x(n) y(n) Medium y (n) Decoder x (n) Synthesizer s (n) Goal: Minimize data rate of y(n) while maximizing speech quality of s (n) March 2004 Vishu Viswanathan 7

8 Waveform Coders Types of Speech Coders Goal: Reproduce speech on a sample-by-sample basis High data rates, high speech quality Examples: 64 kb/s PCM (G.711), 32 kb/s ADPCM (G.726) Parametric Coders Speech production characterized by parametric models Low data rates, good speech intelligibility, communications/synthetic speech quality Examples: 2.4 kb/s LPC (FS 1015), 2.4 kb/s MELP (recent NATO standard) Analysis-by-Synthesis Coders Hybrid between waveform and parametric coders, with medium data rates Parametric models used, with excitation signal computed by minimizing error between synthesized speech and input speech Examples: 16 kb/s G.728, 8 kb/s G.729 March 2004 Vishu Viswanathan 8

9 Speech Quality Terms Used Toll quality: High-grade wireline telephone High quality Good quality Communications quality Transparent quality Formal Subjective Testing Methods Expensive, time consuming Mean opinion score (MOS): Used in all industry standards bodies Diagnostic acceptability measure (DAM): Used by US Dep t of Defense Informal and Semi-Formal Subjective Tests Pairwise or A/B comparisons Rating tests Objective Methods Signal-to-Noise Ratio, ITU P.802 (PESQ) Automatic, repeatable, useful in coder development and optimization March 2004 Vishu Viswanathan 9

10 Speech Coder Attributes Low bit rate Low quality Clean Speech Low delay Low Complexity Human Speech Bits/Second Handheld Mean Opinion Score Hands-free Milliseconds MIPS, Memory Sound Effects High bit rate High quality Noisy Speech High delay High Complexity Music March 2004 Vishu Viswanathan 10

11 Speech Coding Standards ITU Standards coder rate (kb/s) approach G Mu/A-law G ADPCM G LD-CELP G CS-ACELP G /6.3 MP/ACELP ITU standards are targeted for telephone network applications Also used in Voice over IP applications All produce toll quality speech March 2004 Vishu Viswanathan 11

12 Europe North America Japan Speech Coding Standards Digital Cellular Standards coder rate (kb/s) chan rate approach date GSM FR RPE-LTP 1987 GSM HR VSELP 1994 GSM EFR ACELP 1995 GSM AMR ACELP 1998 TIA IS VSELP 1989 TIA IS QCELP 1993 TIA Q QCELP 1995 TIA IS ACELP 1996 TIA EVRC R-ACELP 1996 TIA SMV R-ACELP 2001 PDC FR VSELP 1990 PDC HR PSI-CELP 1993 PDC EFR ACELP 1999 PDC EFR ACELP 2000 March 2004 Vishu Viswanathan 12

13 Speech Coding Standards Wideband Standards coder rate (kb/s) approach G ,56,64 SB-ADPCM G ,32 Transform ITU WB 16,24 ACELP AMR WB ACELP VMR WB ACELP Wideband: 50 Hz 7 khz (versus narrowband telephone, Hz) March 2004 Vishu Viswanathan 13

14 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 14

15 Speech Synthesis Human Speech Based Systems Suitable for known material Speech coding based Talking toys, talking books, voice prompts, voice response systems Concatenation of pre-recorded voice data Information retrieval (stock quotes, airline schedules, banking) Text-to-Speech Systems Suitable for unknown or arbitrary text Applications: /fax reading, phone access to web based services, spoken telephone directory, car navigation, locationbased services, customer service, help desk, reading machines for the blind March 2004 Vishu Viswanathan 15

16 Components of a TTS System Dictionary and Rules Text Text Analysis Letter-to- Sound Synthesizer Speech - Numerical expansion (dates, times, money) - abbreviations, acronyms -proper name id Dr. Smith lives at 23 Lakeshore Dr. Courtesy of Larry Rabiner - Phonemes -Pitch - Duration -Pauses - loudness/amplitude choice of units words, phones, diphones, dyad, syllables choice of parameters LPC, formants, waveform templates, articulatory parameters, sinusoidal parameters method of computation rules, concatenation March 2004 Vishu Viswanathan 16

17 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 17

18 Speech Recognition & Understanding Problem Recognition: Automatic recognition of human speech by machine Understanding: Interpret the meaning of recognized speech and map them to actions to be taken Applications Voice dialing (name or number dialing) in telephone, cellphone, PDA, smartphone (Safety laws against handheld cellphone use while driving) Voice command & control in telematics, cellphone, PDA, smartphone, PC, toys Voice-enabled web browsing, information retrieval (stock quotes, weather forecast, airline flight information, banking), navigation, , SMS, dictation Automated customer service and help desks Benefits: hands-free, eyes-free use; not using keypad; faster task completion; ease of use; part of multi-modal interface; cost savings March 2004 Vishu Viswanathan 18

19 March 2004 Vishu Viswanathan 19

20 Components of a Speech Recognizer speech signal word string Feature Extraction Acoustic Scoring Decoding Acoustic Models Language Models Front end Back end March 2004 Vishu Viswanathan 20

21 Speaker Dependent Small Vocabulary Isolated Words Recognition Speech Recognizer Attributes Speaker Adaptive Words Continuous Speech Syntax Semantics Speaker Independent Large Vocabulary Conversational Speech Understanding Clean Speech Handheld Hands-free Noisy Speech Low Complexity MIPS, Memory High Complexity Server Based Distributed Client Based March 2004 Vishu Viswanathan 21

22 Performance & Robustness Performance Recognition Accuracy: Word error rate (WER) or task completion rate High enough performance required for user acceptance Robustness Issues Training versus operational condition differences Background noise: extent of noise, its variability (Usually additive) Channel variability: different microphones, different telephone circuits, handheld, handsfree, handheld-handsfree (Usually convolutive) Recognizer must have means to compensate for noise and channel variabilities Out-of-vocabulary rejection capability Speaker dialect and accent variability (handled by speaker adaptation) User Interface: Very important for the success of an application March 2004 Vishu Viswanathan 22

23 Recognition in Multiple Languages Speaker-Dependent Recognition Language independent (User can enroll names for voice dialing in multiple languages!) Some Observations for Speaker-Independent Recognition Same recognition engine but different data (models, dictionary) needed Recognition grammar to handle language-specific usage differences (e.g., French speak telephone numbers in pairs; natural number dialing needed) Training requires speech databases and dictionary in the new language Automatic training tools to minimize time to develop recognition in a new language March 2004 Vishu Viswanathan 23

24 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 24

25 Speaker Recognition Speaker Verification / Authentication Problem: Use voice input to verify the user s claimed identity Applications: Secure access to premises, information (banking), services (voice dialing), etc. Issues True user acceptance traded off with impostor acceptance Total voice verification Fixed text versus free text Speaker Identification Problem: Use voice to identify speaker from a closed or open set of speakers Applications: Legal and forensic use, intelligence, security Issues: Uncooperative user, often relatively short-duration speech, noisy and/or distorted speech. March 2004 Vishu Viswanathan 25

26 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 26

27 Speech Enhancement Noise Suppression Playback Enhancement Acoustic Echo Cancellation March 2004 Vishu Viswanathan 27

28 Noise Suppression Problem Remove acoustic noise from noisy speech signal for better listenability or for improved performance of speech processing devices Requirements: No speech signal distortion, no loss of speech intelligibility, no artifacts like musical noises, natural sounding residual noise Methods Single microphone approach: spectral subtraction family of methods Multi-microphone approach: adaptive noise cancellation, microphone array based fixed or adaptive beamforming, blind signal separation March 2004 Vishu Viswanathan 28

29 Playback Enhancement Problem Enhanced playback of speech to the listener Methods Spectrally shape the speech signal prior to playback, for improved intelligibility when the listener is in a noisy environment (PA system in aircraft, airports, sports arenas) Active noise cancellation to cancel noise acoustically in listener s ears (ANC headsets) Narrowband to wideband speech extension to provide wideband speech perception March 2004 Vishu Viswanathan 29

30 Acoustic Echo Cancellation rn ( ) Downlink Signal s( n) Far End Signal loudspeaker Error Signal A E C ˆ ( ) H z H(z) channel x( n) en ( ) - yn ˆ( ) vn ( ) = un ( ) + yn ( ) + n( n) 0 microphone Uplink Signal + Near End Signal Goal: Cancel feedback from loudspeaker into microphone using adaptive linear filter March 2004 Vishu Viswanathan 30

31 Lecture Outline Goals of the Lecture Speech Coding Speech Synthesis Speech Recognition & Understanding Speaker Recognition Speech Enhancement Speech Modification March 2004 Vishu Viswanathan 31

32 Speech Modification Voice Conversion Convert one voice to sound like another A female voice converted to sound like a low-pitched male voice (security) Time-Scale or Rate Modification Speed up or slow down speech, while preserving naturalness Applications: talking books, pre-recorded lectures, language learning March 2004 Vishu Viswanathan 32

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification (Revision 1.0, May 2012) General VCP information Voice Communication

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information

Simple Voice over IP (VoIP) Implementation

Simple Voice over IP (VoIP) Implementation Simple Voice over IP (VoIP) Implementation ECE Department, University of Florida Abstract Voice over IP (VoIP) technology has many advantages over the traditional Public Switched Telephone Networks. In

More information

ETSI TS 101 329-2 V1.1.1 (2000-07)

ETSI TS 101 329-2 V1.1.1 (2000-07) TS 101 329-2 V1.1.1 (2000-07) Technical Specification Telecommunications and Internet Protocol Harmonization Over Networks (TIPHON); End to End Quality of Service in TIPHON Systems; Part 2: Definition

More information

Voice Encoding Methods for Digital Wireless Communications Systems

Voice Encoding Methods for Digital Wireless Communications Systems SOUTHERN METHODIST UNIVERSITY Voice Encoding Methods for Digital Wireless Communications Systems BY Bryan Douglas Street address city state, zip e-mail address Student ID xxx-xx-xxxx EE6302 Section 324,

More information

Thirukkural - A Text-to-Speech Synthesis System

Thirukkural - A Text-to-Speech Synthesis System Thirukkural - A Text-to-Speech Synthesis System G. L. Jayavardhana Rama, A. G. Ramakrishnan, M Vijay Venkatesh, R. Murali Shankar Department of Electrical Engg, Indian Institute of Science, Bangalore 560012,

More information

Develop Software that Speaks and Listens

Develop Software that Speaks and Listens Develop Software that Speaks and Listens Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks or registered

More information

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN PAGE 30 Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN Sung-Joon Park, Kyung-Ae Jang, Jae-In Kim, Myoung-Wan Koo, Chu-Shik Jhon Service Development Laboratory, KT,

More information

HD VoIP Sounds Better. Brief Introduction. March 2009

HD VoIP Sounds Better. Brief Introduction. March 2009 HD VoIP Sounds Better Brief Introduction March 2009 Table of Contents 1. Introduction 3 2. Technology Overview 4 3. Business Environment 5 4. Wideband Applications for Diverse Industries 6 5. AudioCodes

More information

Analog-to-Digital Voice Encoding

Analog-to-Digital Voice Encoding Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample

More information

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System Oana NICOLAE Faculty of Mathematics and Computer Science, Department of Computer Science, University of Craiova, Romania oananicolae1981@yahoo.com

More information

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral GSM speech coding Forelesning INF 5080 Vårsemester 2004 Sources This part contains material from: Web pages Universität Bremen, Arbeitsbereich Nachrichtentechnik (ANT): Prof.K.D. Kammeyer, Jörg Bitzer,

More information

An Arabic Text-To-Speech System Based on Artificial Neural Networks

An Arabic Text-To-Speech System Based on Artificial Neural Networks Journal of Computer Science 5 (3): 207-213, 2009 ISSN 1549-3636 2009 Science Publications An Arabic Text-To-Speech System Based on Artificial Neural Networks Ghadeer Al-Said and Moussa Abdallah Department

More information

Automated Dialing of Cellular Telephones Using Speech Recognition

Automated Dialing of Cellular Telephones Using Speech Recognition Automated Dialing of Cellular Telephones Using Speech Recognition Application Report Frank Henry Dearden III Voice Control Systems, Incorporated SPRA144 October 1994 Printed on Recycled Paper IMPORTANT

More information

Speech Compression. 2.1 Introduction

Speech Compression. 2.1 Introduction Speech Compression 2 This chapter presents an introduction to speech compression techniques, together with a detailed description of speech/audio compression standards including narrowband, wideband and

More information

Voice over IP Protocols And Compression Algorithms

Voice over IP Protocols And Compression Algorithms University of Tehran Electrical and Computer Engineering School SI Lab. Weekly Presentations Voice over IP Protocols And Compression Algorithms Presented by: Neda Kazemian Amiri Agenda Introduction to

More information

Tech Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement Understanding VoIP Performance. Title Series.

Tech Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement Understanding VoIP Performance. Title Series. Tech Note Title Series Voice Quality Measurement Understanding VoIP Performance Date January 2005 Overview This tech note describes commonly-used call quality measurement methods, explains the metrics

More information

Technology Finds Its Voice. February 2010

Technology Finds Its Voice. February 2010 Technology Finds Its Voice February 2010 Technology Finds Its Voice Overview Voice recognition technology has been around since the early 1970s, but until recently the promise of new advances has always

More information

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper Understanding the Transition From PESQ to POLQA An Ascom Network Testing White Paper By Dr. Irina Cotanis Prepared by: Date: Document: Dr. Irina Cotanis 6 December 2011 NT11-22759, Rev. 1.0 Ascom (2011)

More information

VOICE RECOGNITION KIT USING HM2007. Speech Recognition System. Features. Specification. Applications

VOICE RECOGNITION KIT USING HM2007. Speech Recognition System. Features. Specification. Applications VOICE RECOGNITION KIT USING HM2007 Introduction Speech Recognition System The speech recognition system is a completely assembled and easy to use programmable speech recognition circuit. Programmable,

More information

Adjusting Voice Quality

Adjusting Voice Quality Adjusting Voice Quality Electrical Characteristics This topic describes the electrical characteristics of analog voice and the factors affecting voice quality. Factors That Affect Voice Quality The following

More information

Conference Phone Buyer s Guide

Conference Phone Buyer s Guide Conference Phone Buyer s Guide Conference Phones are essential in most organizations. Almost every business, large or small, uses their conference phone regularly. Such regular use means choosing one is

More information

Application Notes. Contents. Overview. Introduction. Echo in Voice over IP Systems VoIP Performance Management

Application Notes. Contents. Overview. Introduction. Echo in Voice over IP Systems VoIP Performance Management Application Notes Title Series Echo in Voice over IP Systems VoIP Performance Management Date January 2006 Overview This application note describes why echo occurs, what effects it has on voice quality,

More information

SIP Trunking and Voice over IP

SIP Trunking and Voice over IP SIP Trunking and Voice over IP Agenda What is SIP Trunking? SIP Signaling How is Voice encoded and transported? What are the Voice over IP Impairments? How is Voice Quality measured? VoIP Technology Confidential

More information

VoIP Conferencing. The latest in IP technologies deliver the next level of service innovation for better meetings. Global Collaboration Services

VoIP Conferencing. The latest in IP technologies deliver the next level of service innovation for better meetings. Global Collaboration Services Global Collaboration Services VoIP Conferencing The latest in IP technologies deliver the next level of service innovation for better meetings. ENERGIZE YOUR CONNECTIONS Table of Contents > > Contents...

More information

Course 4: IP Telephony and VoIP

Course 4: IP Telephony and VoIP Course 4: IP Telephony and VoIP Telecommunications Technical Curriculum Program 3: Voice Knowledge 6/9/2009 1 Telecommunications Technical Curriculum Program 1: General Industry Knowledge Course 1: General

More information

Feature and Technical

Feature and Technical BlackBerry Mobile Voice System for SIP Gateways and the Avaya Aura Session Manager Version: 5.3 Feature and Technical Overview Published: 2013-06-19 SWD-20130619135120555 Contents 1 Overview...4 2 Features...5

More information

IP PBX using SIP. Voice over Internet Protocol

IP PBX using SIP. Voice over Internet Protocol IP PBX using SIP Voice over Internet Protocol Key Components for an IP PBX setup Wireless/Fiber IP Networks (Point to point/multi point, LAN/WAN/Internet) Central or Multicast SIP Proxy/Server based Virtual

More information

Developing an Isolated Word Recognition System in MATLAB

Developing an Isolated Word Recognition System in MATLAB MATLAB Digest Developing an Isolated Word Recognition System in MATLAB By Daryl Ning Speech-recognition technology is embedded in voice-activated routing systems at customer call centres, voice dialling

More information

230622 - DSAP - Digital Speech and Audio Processing

230622 - DSAP - Digital Speech and Audio Processing Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 230 - ETSETB - Barcelona School of Telecommunications Engineering 739 - TSC - Department of Signal Theory and Communications

More information

Enterprise Voice Technology Solutions: A Primer

Enterprise Voice Technology Solutions: A Primer Cognizant 20-20 Insights Enterprise Voice Technology Solutions: A Primer A successful enterprise voice journey starts with clearly understanding the range of technology components and options, and often

More information

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

VoIP and IP Telephony

VoIP and IP Telephony VoIP and IP Telephony Reach Out and Ping Someone ISAC Spring School 2006 21 March 2006 Anthony Kava, Sr. Network Admin Pottawattamie County IT Definition VoIP Voice over Internet Protocol Voice Transport

More information

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction : A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction Urmila Shrawankar Dept. of Information Technology Govt. Polytechnic, Nagpur Institute Sadar, Nagpur 440001 (INDIA)

More information

David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2.

David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2. VoIP QoS Factors David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2 VoIP QoS Internet Telephone Quality of Service factors Voice

More information

Speech recognition technology for mobile phones

Speech recognition technology for mobile phones Speech recognition technology for mobile phones Stefan Dobler Following the introduction of mobile phones using voice commands, speech recognition is becoming standard on mobile handsets. Features such

More information

1. Public Switched Telephone Networks vs. Internet Protocol Networks

1. Public Switched Telephone Networks vs. Internet Protocol Networks Internet Protocol (IP)/Intelligent Network (IN) Integration Tutorial Definition Internet telephony switches enable voice calls between the public switched telephone network (PSTN) and Internet protocol

More information

OPERATOR ASSISTANCE (*0) - Immediate operator support is available by pressing *0 on your telephone keypad*.

OPERATOR ASSISTANCE (*0) - Immediate operator support is available by pressing *0 on your telephone keypad*. In Short: How to Conduct a Conference Call 1. Dial in to the system using either the toll or toll-free domestic phone number or the international phone number that was supplied to you. 2. Enter your HOST

More information

MPEG-H Audio System for Broadcasting

MPEG-H Audio System for Broadcasting MPEG-H Audio System for Broadcasting ITU-R Workshop Topics on the Future of Audio in Broadcasting Jan Plogsties Challenges of a Changing Landscape Immersion Compelling sound experience through sound that

More information

How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3

How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3 Recognizing Voice Over IP: A Robust Front-End for Speech Recognition on the World Wide Web. By C.Moreno, A. Antolin and F.Diaz-de-Maria. Summary By Maheshwar Jayaraman 1 1. Introduction Voice Over IP is

More information

Active Monitoring of Voice over IP Services with Malden

Active Monitoring of Voice over IP Services with Malden Active Monitoring of Voice over IP Services with Malden Introduction Active Monitoring describes the process of evaluating telecommunications system performance with intrusive tests. It differs from passive

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering

More information

Voyager Legend. User Guide

Voyager Legend. User Guide Voyager Legend User Guide Contents What's in the Box 3 Accessories 4 Headset Overview 5 Pairing 6 Get Paired 6 Pair another phone 6 Charge 7 Fit 8 Change the eartip 8 Wear on the left or right 8 The Basics

More information

Optimizing Converged Cisco Networks (ONT)

Optimizing Converged Cisco Networks (ONT) Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations (Deploy) Calculating Bandwidth Requirements for VoIP Objectives Describe factors influencing encapsulation overhead and bandwidth

More information

HANDS FREE COMMUNICATION (UConnect ) IF EQUIPPED

HANDS FREE COMMUNICATION (UConnect ) IF EQUIPPED UConnect Hands Free Communications- Complete Instructions HANDS FREE COMMUNICATION (UConnect ) IF EQUIPPED UConnect is a voice-activated, hands-free, in- vehicle communications system. UConnect allows

More information

To help manage calls:

To help manage calls: Mobile Phone Feature Definitions To help manage calls: Call waiting and call hold Allows you to accept a second incoming call with out losing the original call, then switch back and forth between them.

More information

Ericsson T18s Voice Dialing Simulator

Ericsson T18s Voice Dialing Simulator Ericsson T18s Voice Dialing Simulator Mauricio Aracena Kovacevic, Anna Dehlbom, Jakob Ekeberg, Guillaume Gariazzo, Eric Lästh and Vanessa Troncoso Dept. of Signals Sensors and Systems Royal Institute of

More information

Indepth Voice over IP and SIP Networking Course

Indepth Voice over IP and SIP Networking Course Introduction SIP is fast becoming the Voice over IP protocol of choice. During this 3-day course delegates will examine SIP technology and architecture and learn how a functioning VoIP service can be established.

More information

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.)

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.) VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.) 1 Remember first the big picture VoIP network architecture and some terminologies Voice coders 2 Audio and voice quality measuring

More information

Bluetooth Handsfree Kit. Car Speakerphone (For Bluetooth Mobile Phones)

Bluetooth Handsfree Kit. Car Speakerphone (For Bluetooth Mobile Phones) Bluetooth Handsfree Kit Car Speakerphone (For Bluetooth Mobile Phones) Table of Contents 1. Product Description 3 2. Product Overview 3 3. Charging 4 4. Power On/Off 4 Power On 4 Power Off 4 5. Selecting

More information

Search keywords: Connect, Meeting, Collaboration, Voice over IP, VoIP, Acoustic Magic, audio, web conferencing, microphone, best practices

Search keywords: Connect, Meeting, Collaboration, Voice over IP, VoIP, Acoustic Magic, audio, web conferencing, microphone, best practices Title: Acoustic Magic Voice Tracker II array microphone improves operation with VoIP based Adobe Connect Meeting URL: www.acousticmagic.com By: Bob Feingold, President, Acoustic Magic Inc. Search keywords:

More information

Speech-Enabled Interactive Voice Response Systems

Speech-Enabled Interactive Voice Response Systems Speech-Enabled Interactive Voice Response Systems Definition Serving as a bridge between people and computer databases, interactive voice response systems (IVRs) connect telephone users with the information

More information

Voice Activity Detection in the Tiger Platform. Hampus Thorell

Voice Activity Detection in the Tiger Platform. Hampus Thorell Voice Activity Detection in the Tiger Platform Examensarbete utfört i Reglerteknik av Hampus Thorell LiTH-ISY-EX--06/3817--SE Linköping 2006 Voice Activity Detection in the Tiger Platform Examensarbete

More information

1. Introduction to Spoken Dialogue Systems

1. Introduction to Spoken Dialogue Systems SoSe 2006 Projekt Sprachdialogsysteme 1. Introduction to Spoken Dialogue Systems Walther v. Hahn, Cristina Vertan {vhahn,vertan}@informatik.uni-hamburg.de Content What are Spoken dialogue systems? Types

More information

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN PESQ: An Introduction White Paper Prepared by: Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800 f: +44 (0) 1473 261 880 e: info@psytechnics.com September

More information

Linear Predictive Coding

Linear Predictive Coding Linear Predictive Coding Jeremy Bradbury December 5, 2000 0 Outline I. Proposal II. Introduction A. Speech Coding B. Voice Coders C. LPC Overview III. Historical Perspective of Linear Predictive Coding

More information

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals Thin Client Development and Wireless Markup Languages cont. David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh tipper@tele.pitt.edu http://www.sis.pitt.edu/~dtipper/2727.html

More information

Forum 500 Forum 5000 Voice Portal Planning System Forum 500(0) Auto Attendant

Forum 500 Forum 5000 Voice Portal Planning System Forum 500(0) Auto Attendant Forum 500 Forum 5000 Voice Portal Planning System Forum 500(0) Auto Attendant User Guide Welcome to Proximus Thank you for choosing a Proximus product that stands for the best in quality matched with high

More information

User Manual. Please read this manual carefully before using the Phoenix Octopus

User Manual. Please read this manual carefully before using the Phoenix Octopus User Manual Please read this manual carefully before using the Phoenix Octopus For additional help and updates, refer to our website To contact Phoenix Audio for support, please send a detailed e-mail

More information

On the move: technology Great technology makes for great sound

On the move: technology Great technology makes for great sound On the move: technology AcuSpeak (Pulsar 260) Corded headsets featuring AcuSpeak technology are ideal for people who work in noisy environments, because the intelligent microphone selects only the human

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

Voice Encryption over GSM:

Voice Encryption over GSM: End-to to-end Voice Encryption over GSM: A Different Approach Wesley Tanner Nick Lane-Smith www. Keith Lareau About Us: Wesley Tanner - Systems Engineer for a Software-Defined Radio (SDRF) company - B.S.

More information

Establishing the Uniqueness of the Human Voice for Security Applications

Establishing the Uniqueness of the Human Voice for Security Applications Proceedings of Student/Faculty Research Day, CSIS, Pace University, May 7th, 2004 Establishing the Uniqueness of the Human Voice for Security Applications Naresh P. Trilok, Sung-Hyuk Cha, and Charles C.

More information

Echo Cancellation. Definition. Overview. Topics

Echo Cancellation. Definition. Overview. Topics Echo Cancellation Definition Wireless phones are increasingly being regarded as essential communications tools, dramatically impacting how people approach day-to-day personal and business communications.

More information

B12 Troubleshooting & Analyzing VoIP

B12 Troubleshooting & Analyzing VoIP B12 Troubleshooting & Analyzing VoIP Phillip Sherlock Shade, Senior Forensics / Network Engineer Merlion s Keep Consulting phill.shade@gmail.com Phillip Sherlock Shade (Phill) phill.shade@gmail.com Phillip

More information

High Definition Wideband

High Definition Wideband Polaris Communications Whitepaper High Definition Wideband By extending telephone bandwidth to 7 khz and beyond, it is clear that one can markedly reduce fatigue, improve concentration and increase intelligibility

More information

VoIP Analysis Fundamentals with Wireshark. Phill Shade (Forensic Engineer Merlion s Keep Consulting)

VoIP Analysis Fundamentals with Wireshark. Phill Shade (Forensic Engineer Merlion s Keep Consulting) VoIP Analysis Fundamentals with Wireshark Phill Shade (Forensic Engineer Merlion s Keep Consulting) 1 Phillip D. Shade (Phill) phill.shade@gmail.com Phillip D. Shade is the founder of Merlion s Keep Consulting,

More information

VoiceXML Tutorial. Part 1: VoiceXML Basics and Simple Forms

VoiceXML Tutorial. Part 1: VoiceXML Basics and Simple Forms VoiceXML Tutorial Part 1: VoiceXML Basics and Simple Forms What is VoiceXML? XML Application W3C Standard Integration of Multiple Speech and Telephony Related Technologies Automated Speech Recognition

More information

Voice-Recognition Software An Introduction

Voice-Recognition Software An Introduction Voice-Recognition Software An Introduction What is Voice Recognition? Voice recognition is an alternative to typing on a keyboard. Put simply, you talk to the computer and your words appear on the screen.

More information

Introduction to Packet Voice Technologies and VoIP

Introduction to Packet Voice Technologies and VoIP Introduction to Packet Voice Technologies and VoIP Cisco Networking Academy Program Halmstad University Olga Torstensson 035-167575 olga.torstensson@ide.hh.se IP Telephony 1 Traditional Telephony 2 Basic

More information

User Guide. BlackBerry Storm 9530 Smartphone. Version: 4.7

User Guide. BlackBerry Storm 9530 Smartphone. Version: 4.7 BlackBerry Storm 9530 Smartphone Version: 4.7 SWD-490426-0909090640-001 Contents Shortcuts... 9 BlackBerry basics shortcuts... 9 Phone shortcuts... 9 Camera shortcuts... 9 Media shortcuts... 9 Typing shortcuts...

More information

Radio over Internet Protocol (RoIP)

Radio over Internet Protocol (RoIP) Radio over Internet Protocol (RoIP) Presenter : Farhad Fathi May 2012 What is VoIP? [1] Voice over Internet Protocol is a method for taking analog audio signals, like the kind you hear when you talk on

More information

VoIP Bandwidth Calculation

VoIP Bandwidth Calculation VoIP Bandwidth Calculation AI0106A VoIP Bandwidth Calculation Executive Summary Calculating how much bandwidth a Voice over IP call occupies can feel a bit like trying to answer the question; How elastic

More information

Monitoring VoIP Call Quality Using Improved Simplified E-model

Monitoring VoIP Call Quality Using Improved Simplified E-model Monitoring VoIP Call Quality Using Improved Simplified E-model Haytham Assem, David Malone Hamilton Institute, National University of Ireland, Maynooth Hitham.Salama.2012, David.Malone@nuim.ie Jonathan

More information

White Paper. ETSI Speech Quality Test Event Calling Testing Speech Quality of a VoIP Gateway

White Paper. ETSI Speech Quality Test Event Calling Testing Speech Quality of a VoIP Gateway White Paper ETSI Speech Quality Test Event Calling Testing Speech Quality of a VoIP Gateway A white paper from the ETSI 3rd SQTE (Speech Quality Test Event) Version 1 July 2005 ETSI Speech Quality Test

More information

4. H.323 Components. VOIP, Version 1.6e T.O.P. BusinessInteractive GmbH Page 1 of 19

4. H.323 Components. VOIP, Version 1.6e T.O.P. BusinessInteractive GmbH Page 1 of 19 4. H.323 Components VOIP, Version 1.6e T.O.P. BusinessInteractive GmbH Page 1 of 19 4.1 H.323 Terminals (1/2)...3 4.1 H.323 Terminals (2/2)...4 4.1.1 The software IP phone (1/2)...5 4.1.1 The software

More information

Call Recorder Oygo Manual. Version 1.001.11

Call Recorder Oygo Manual. Version 1.001.11 Call Recorder Oygo Manual Version 1.001.11 Contents 1 Introduction...4 2 Getting started...5 2.1 Hardware installation...5 2.2 Software installation...6 2.2.1 Software configuration... 7 3 Options menu...8

More information

Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31

Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31 Disclaimer: This document was part of the First European DSP Education and Research Conference. It may have been written by someone whose native language is not English. TI assumes no liability for the

More information

BRINGING VOIP TO THE CONFERENCE ROOM: HOW IT MANAGERS CAN ENHANCE THE USER EXPERIENCE

BRINGING VOIP TO THE CONFERENCE ROOM: HOW IT MANAGERS CAN ENHANCE THE USER EXPERIENCE BRINGING VOIP TO THE CONFERENCE ROOM: HOW IT MANAGERS CAN ENHANCE THE USER EXPERIENCE EXECUTIVE SUMMARY: Voice over IP is one of the fastest growing technologies and in just a few years will be in 80%

More information

Dragon speech recognition Nuance Dragon NaturallySpeaking 13 comparison by product. Feature matrix. Professional Premium Home.

Dragon speech recognition Nuance Dragon NaturallySpeaking 13 comparison by product. Feature matrix. Professional Premium Home. matrix Recognition accuracy Recognition speed System configuration Turns your voice into text with up to 99% accuracy New - Up to a 15% improvement to out-of-the-box accuracy compared to Dragon version

More information

C E D A T 8 5. Innovating services and technologies for speech content management

C E D A T 8 5. Innovating services and technologies for speech content management C E D A T 8 5 Innovating services and technologies for speech content management Company profile 25 years experience in the market of transcription/reporting services; Cedat 85 Group: Cedat 85 srl Subtitle

More information

Application Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement. Series. Overview

Application Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement. Series. Overview Application Note Title Series Date Nov 2014 Overview Voice Quality Measurement Voice over IP Performance Management This Application Note describes commonlyused call quality measurement methods, explains

More information

DeNoiser Plug-In. for USER S MANUAL

DeNoiser Plug-In. for USER S MANUAL DeNoiser Plug-In for USER S MANUAL 2001 Algorithmix All rights reserved Algorithmix DeNoiser User s Manual MT Version 1.1 7/2001 De-NOISER MANUAL CONTENTS INTRODUCTION TO NOISE REMOVAL...2 Encode/Decode

More information

From Concept to Production in Secure Voice Communications

From Concept to Production in Secure Voice Communications From Concept to Production in Secure Voice Communications Earl E. Swartzlander, Jr. Electrical and Computer Engineering Department University of Texas at Austin Austin, TX 78712 Abstract In the 1970s secure

More information

GSM VOICE CAPACITY EVOLUTION WITH VAMOS Strategic White Paper

GSM VOICE CAPACITY EVOLUTION WITH VAMOS Strategic White Paper GSM VOICE CAPACITY EVOLUTION WITH VAMOS Strategic White Paper Table of contents VAMOS increases your GSM voice capacity at minimum investment / 1 Take the full benefit of VAMOS / 1 Standard aspects / 1

More information

Beyond VoIP Protocols. Understanding Voice Technology and Networking Techniques for IP Telephony

Beyond VoIP Protocols. Understanding Voice Technology and Networking Techniques for IP Telephony Brochure More information from http://www.researchandmarkets.com/reports/2170384/ Beyond VoIP Protocols. Understanding Voice Technology and Networking Techniques for IP Telephony Description: In 1999 2000,

More information

A Smart Telephone Answering Machine with Voice Message Forwarding Capability

A Smart Telephone Answering Machine with Voice Message Forwarding Capability A Smart Telephone Answering Machine with Voice Message Forwarding Capability Chih-Hung Huang 1 Cheng Wen 2 Kuang-Chiung Chang 3 1 Department of Information Management, Lunghwa University of Science and

More information

Global System for Mobile Communication (GSM)

Global System for Mobile Communication (GSM) Global System for Mobile Communication (GSM) Definition Global system for mobile communication (GSM) is a globally accepted standard for digital cellular communication. GSM is the name of a standardization

More information

ACOUSTICAL CONSIDERATIONS FOR EFFECTIVE EMERGENCY ALARM SYSTEMS IN AN INDUSTRIAL SETTING

ACOUSTICAL CONSIDERATIONS FOR EFFECTIVE EMERGENCY ALARM SYSTEMS IN AN INDUSTRIAL SETTING ACOUSTICAL CONSIDERATIONS FOR EFFECTIVE EMERGENCY ALARM SYSTEMS IN AN INDUSTRIAL SETTING Dennis P. Driscoll, P.E. and David C. Byrne, CCC-A Associates in Acoustics, Inc. Evergreen, Colorado Telephone (303)

More information

Delivering reliable VoIP Services

Delivering reliable VoIP Services QoS Tips and Tricks for VoIP Services: Delivering reliable VoIP Services Alan Clark CEO, Telchemy alan.d.clark@telchemy.com 1 Objectives Clear understanding of: typical problems affecting VoIP service

More information

IP Telephony (Voice over IP)

IP Telephony (Voice over IP) (Voice over IP) Instructor Ai-Chun Pang, acpang@csie.ntu.edu.tw Office Number: 417, New building of CSIE Textbook Carrier Grade Voice over IP, D. Collins, McGraw-Hill, Second Edition, 2003. Requirements

More information

IP- PBX. Functionality Options

IP- PBX. Functionality Options IP- PBX Functionality Options With the powerful features integrated in the AtomOS system from AtomAmpd, installing & configuring a cost- effective and extensible VoIP solution is easily possible. 4/26/10

More information

HEAD acoustics. Standards on Audio Quality - from a system-level view. H. W. Gierlich HEAD acoustics GmbH. www.head-acoustics.

HEAD acoustics. Standards on Audio Quality - from a system-level view. H. W. Gierlich HEAD acoustics GmbH. www.head-acoustics. HEAD acoustics Standards on Audio Quality - from a system-level view H. W. Gierlich HEAD acoustics GmbH www.head-acoustics.de 27-Mar-03 #1 Overview Reference Points and Equalization free field-/diffuse

More information

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION 1 Recognition Accuracy Turns your voice into text with up to 99% accuracy NEW - Up to a 20% improvement to out-of-the-box accuracy compared to Dragon version 11 Recognition Speed Words appear on the screen

More information

Figure1. Acoustic feedback in packet based video conferencing system

Figure1. Acoustic feedback in packet based video conferencing system Real-Time Howling Detection for Hands-Free Video Conferencing System Mi Suk Lee and Do Young Kim Future Internet Research Department ETRI, Daejeon, Korea {lms, dyk}@etri.re.kr Abstract: This paper presents

More information

Speech Coding Methods, Standards, and Applications. Jerry D. Gibson

Speech Coding Methods, Standards, and Applications. Jerry D. Gibson Speech Coding Methods, Standards, and Applications Jerry D. Gibson Department of Electrical & Computer Engineering University of California, Santa Barbara Santa Barbara, CA 93106-6065 gibson@ece.ucsb.edu

More information