Subjective SNR measure for quality assessment of. speech coders \A cross language study
|
|
|
- Baldric Lee
- 9 years ago
- Views:
Transcription
1 Subjective SNR measure for quality assessment of speech coders \A cross language study Mamoru Nakatsui and Hideki Noda Communications Research Laboratory, Ministry of Posts and Telecommunications, 4-2-1, Nukuikita, Koganei, 184 Japan (Received 7 January 1994) The subjective speech-to-noise-ratio (SNR), derived from the forced-choice pair-comparison test using the psychometric analysis procedure, has well represented overall speech quality of speech coders in a single dimension. No significant speaker and listener variation has been found for a wide range of waveform coders at the tests conducted in two separate sessions 14 months apart using different groups of English speakers and listeners. The purpose of this study is to investigate reproducibility of the measure conducting the same framework test using Japanese speakers and listeners. The test result shows the subjective SNR measure gives quite reliable scores evaluated in different laboratories with different language backgrounds. Keywords: Subjective quality, Speech coder, Pair-comparison test, Cross language study PACS number: Gv 1. INTRODUCTION The ultimate performance measure for evaluating voice communication systems is the subjective quality of the received speech. Modern digital speech-coding techniques achieve high intelligibility. The high level of speech intelligibility is a necessary but insufficient condition for user acceptance of the systems. Quality, as well, must meet acceptability criteria. Two general surveys of the methods of speech quality assessment have been given by Munson and Karlin1) and Hecker and Guttman,2) with different aspects of classifications. Munson and Karlin have classified the subjective assessment methods into two broad classes depending upon whether listeners are exposed only to the new system to be evaluated or to both the test and reference systems. They report that in the former case, called "indirect comparison," responses of listeners depend upon their prior experience with other systems to make the desired decisions or categorizations and in the latter case, called "direct comparison," responses of listeners depend upon their overall preference judgments alone, and there is no requirement for categorizing the reasons for their preference. Hecker and Guttman have divided the methods for measuring overall quality into two types of approaches depending upon whether or not the method investigates the underlying psychological factors that govern the overall judgments obtained. The former approach, called "analytic," attempts to explore the psychological components of speech quality and to discover the acoustical correlates of these components and the latter approach, called "utilitarian," attempts to obtain a unidimensional measure of speech quality. They conclude that, even though the unidimensional scaling of speech quality is limited in the accuracy of the measure obtained, this utilitarian approach is of primary interest from an engineering point of view. Among the measures obtained through the methods taking the utilitarian approach, the most widely used are categorical ratings depending upon
2 indirect comparison.3) Comparability of ratings obtained with these methods on different occasions is, in general, small since there is considerable variability due to speakers and listeners. Another widely used measure is the relative preference rating, derived from preference judgments on paired testreference systems.4) This relative rating may be less influenced by speaker and listener variability than the absolute rating scale obtained through the categorical judgments, but a direct comparability of ratings obtained in different studies is difficult to ensure. The subjective assessment method expected to provide a practical engineering criterion on overall speech quality should satisfy the following requirements of engineering practice: (1) test administration and data reduction are simple enough to be carried out in most speechcommunication laboratories, without assistance of specialist in psychological experiments or of trained or professional listeners, (2) the measure provides and adequate single absolute scale which is intuitively understandable by most communication engineers, (3) reproducibility of the results across studies is high enough to enable one to compare directly the measures obtained on different occasions or at different laboratories. The subjective speech-to-noise ratio (SNR) has been proposed as one of such measures and experimental results of subjective tests with the measure have been reported.5) The test results show that 1) the subjective SNR measure provides an adequate single absolute scale for a wide range of speech waveform coders, 2) reliable score is available using reasonably small number of listeners and speakers, and 3) no significant speaker and listener variation is found in the scores of two separate test sessions 14 months apart using different groups of English speakers and listeners. The purpose of this study is to investigate whether reproducibility of the results across tests is high enough to enable one to compare directly the measures obtained at different laboratories with different nationalities and language backgrounds. 2. SUBJECTIVE SNR MEASURE The concept of a subjective SNR has been found in the iso-preference method originally introduced by Munson and Karlin.1) The subjective SNR is derived from the forced-choice pair-comparison test using the psychometric analysis procedure commonly used in the method of constants. A speech signal degraded by varying amounts of multiplicative white noise6) is selected as the reference system in our tests. 2.1 Reference System The sampled reference signal r(i) corrupted multiplicative white noise n(i) is defined as by (1a) (1b) where s(i) is original speech signal which is also served as input signal to the speech coders evaluated, k is a coefficient for SNR control, and e(i) is a random variable taking on values of +1 and -1 with equal probability and independently of s(i) for each sample. Since this reference signal is identical to one of the speech signals processed by the modulated-noise reference unit (MNRU) of ITU-T (CCITT)7) and the SNR of the speech signal degraded by MNRU is called Q, our subjective SNR measure can be called as Equivalent Q measure. 2.2 Subjective Test Format Each test signal to be evaluated is paired with five or six reference signals selected so that preference ranging from 0% to 100% would result. During the test, the listeners are presented with repeated signal pairs in the order of ABAB as shown in Fig. 1. The listeners are asked to mark that member of each pair which they will prefer as a source of information. 2.3 Test Data Analysis The proportion of listeners preferring the test signal p(i) is converted to unit normal deviate z(i) using the equation Applying Muller-Urban weighting to the converted Fig. 1 Time pattern of the test pair sequence. (2)
3 M. NAKATSUI and H. NODA: SUBJECTIVE SNR MEASURE Table 1 Experimental frameworks. Fig. 2 An example of the least-square fit of the preference data: black circles denote the rates preferring the test signal and a solid curve denote the normally fitted ogive. data, a weighted least square algorithm8) is used to fit a straight line to the data points: (3) where S represents SNR of the reference signal. Thus the subjective SNR (SNRsubj) and its standard deviation s are given by (4a) (4b) Figure 2 shows an example of the least-square fit of the preference data. The 95% confidence interval of SNRsubj as an estimate of population mean Đ is defined by (5) where t(ė, x) is the t distribution with ė degrees of freedom and x is the significance level (5%). 3. EXPERIMENTAL PROCEDURE Table 1 summarizes the experimental setups of the current test (Test III) and the previous tests (Tests I and II).5) Five kinds of coder configuration were simulated and evaluated in the current test, including 40 and 56 kb/s Đ-255 companded pulse-code modulations (PCMs), 16 and 32 kb/s dual-adaptive delta modulators (DADMs),9) and a 16 kb/s adaptive delta modulator with one-bit memory (ADM).10) Two PCM and a ADM configurations were served as anchor points connecting the current tests in Japan with the previous tests in Canada.5) Four short sentences spoken by two male and two female speakers were bandlimited from 200 to 3,400 Hz and digitized to 12 bits at sampling frequencies of 8, 16 and 32 khz. Those digitized speech samples were served as inputs to the two PCM, 32 kb/s DADM, and 16 kb/s ADM configurations to produce the test signals to be evaluated. The speech samples having non-standard handlimitation from 200 to 2,400 Hz were served as inputs to the 16 kb/s DADM. The reference signals were also processed by Eq. (1) using these speech samples. Two separate sessions of pair-comparison tests, one session using speech samples of standard bandwidth and the other session using those of nonstandard (narrow) bandwidth, were conducted with eleven untrained listeners. Participants in both sessions were native Japanese speakers. 4. RESULT AND DISCUSSION A subjective SNR of each test signal and its 95% confidence interval were estimated from the test data pooled over four utterances and all the listeners and shown in Fig. 3 together with the results of the previous tests.5) Estimates obtained from the speech data having non-standard (narrow) bandwidth are indicated by an arrow in the figure (please refer to the previous report5) concerning coder configurations of APC, ADPCM-V and ADPCM-F in the figure). No statistically significant difference between the subjective SNR estimates at the 5% level is found for the following test signal pairs: (1) 40 kb/s PCMs in tests I, II, and III, (2) 56 kb/s PCMs in the tests I and III, and (3) 16 kb/s ADMs in tests I and III. The subjective SNR measure gives quite reliable scores evaluated in different laboratories with different language backgrounds. No significant variation due to the factors of speaker and listener is found for the tests signals of the current test as has also been shown in the previous tests for the wave-
4 Excited Linear Prediction (CELP),12) whose distortions differ significantly from those of the reference signal in our test. In order to overcome such limitation, reference signal having colored noise that reflects distortions of the coder to be evaluated13) should be introduced in the test. ACKNOWLEDGMENTS The authors are grateful to Dr. Paul Mermelstein, Bell-Northern Research, Canada for his cooperation and suggestion in the initial phase of this series of tests and express their sincere thanks to all the subjects involved in the tests. Fig. 3 Subjective SNR and its 95 % confidence interval of the test signals evaluated in three tests. form coders.5) A mean opinion score (MOS), which is the most widely used subjective measure on overall speech quality and is the absolute scale derived from absolute (categorical) judgements, has shown remarkable variations in test results obtained in different countries for the same speech transmission system.3) A MOS equivalent Q measure11) aiming at interpreting a large set of MOS data pooled into subjective SNR scale, has shown saturation (non-linearity) in high quality range (56 to 64 kb/s PCMs) reflecting inadequacy of the MOS measure in the range. 5. CONCLUSION Subjective SNR measure, which is the absolute scale derived from relative judgements well represents wide range of overall speech quality in a single dimension. A reproducibility of the results across tests is high enough to enable one to compare directly the measures obtained at different laboratories with different language backgrounds. A limitation of the measure may be in preventing its extension to low-bit-rate coders, such as Code REFERENCES 1) W.A. Munson and J.E. Karlin, "Isopreference method for evaluating speech-transmission circuits," J. Acoust. Soc. Am. 34, (1962). 2) M.H.L. Hecker and N. Guttman, "Survey of methods for measuring speech quality," J. Aud. Eng. Soc. 15, (1967). 3) D.J. Goodman and R.D. Nash, "Subjective quality of the same transmission conditions in seven different countries," IEEE Trans. Commun. COM- 30, (1982). 4) P. Mermelstein, "Evaluation of a segmental SNR measure as an indicator of the quality of ADPCM coded speech," J. Acoust. Soc. Am. 66, (1979). 5) M. Nakatsui and P. Mermelstein, "Subjective speech-to-noise ratio as a measure of speech quality for digital waveform coders," J. Acoust. Soc. Am. 72, (1982). 6) M.R. Schroeder, "Reference signal for signal quality studies," J. Acoust. Soc. Am. 44, (1968). 7) ITU-T(CCITT) Recommendation, p.81, V, Blue Book (1988). 8) J.P. Gilford, Psychometric Methods (McGraw- Hill, New York, 1954). 9) M. Natatsui and K. Nakata, "Dual adaptive delta modulation for mobile voice channel and its DSP implementation," Proc. IEEE Int. Conf. Acoust. Speech Signal Process., (1985). 10) N.S. Jayant, "Adaptive delta modulation with a one-bit memory," Bell Syt. Tech. J. 49, (1970). 11) T. Watanabe, K. Itoh and N. Kitawaki, "Comparison of performance on voiceband codecs by several speech quality measures," Tech. Rep. Speech, Acoust. Soc. Jpn. S82-48 (1982) (in Japanese). 12) M.R. Schroeder and B.S. Atal, "Code-exited linear prediction (CELP): high-quality speech at very low bit rates," Proc. IEEE Int. Conf. Acoust. Speech Signal Process., (1985).
5 M. NAKATSUI and H. NODA: SUBJECTIVE SNR MEASURE 13) H. Nagabuchi, "A study on reference signals in subjective quality evaluation of low-bit-rate speech codings," Trans. Inst. Electr. Inf. Commun. Eng. J69-A, (1986) (in Japanese). Mamoru Nakatsui received the B. Eng. degree in 1963 from the University of Electro-Communications and the Dr. Eng. degree in 1976 from the Tohoku University. In 1963 he joined the Radio Research Laboratory (now called the Communications Res. Lab.) of MPT, where he is now the Associate Director General. From 1978 to 1980 he was an invited professor of INRS-Telecommunications, University of Quebec, Canada. His main interests lie in speech communications. Hideki Noda received B.E. and M.E. degrees in electronics engineering from Kyushu University in 1973 and 1975 respectively. He received Dr. Eng. degree in electrical engineering from Kyushu Institute of Technology in He worked at the Daini-Seikosha Ltd. from 1975 to 1978 and at the National Research Institute of Police Science, Japan National Police Agency from 1978 to From 1984 to 1985, he was a visiting scientist of the National Research Council of Canada. In 1989, he joined the Communications Research Laboratory, Japan Ministry of Posts and Telecommunications, where he is now the chief of the Auditory and Visual Informatics Section. His research interests include speech and image processing.
Analog-to-Digital Voice Encoding
Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample
A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman
A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints
Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles
Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude
Digital Speech Coding
Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html
THE MEASUREMENT OF SPEECH INTELLIGIBILITY
THE MEASUREMENT OF SPEECH INTELLIGIBILITY Herman J.M. Steeneken TNO Human Factors, Soesterberg, the Netherlands 1. INTRODUCTION The draft version of the new ISO 9921 standard on the Assessment of Speech
Simple Voice over IP (VoIP) Implementation
Simple Voice over IP (VoIP) Implementation ECE Department, University of Florida Abstract Voice over IP (VoIP) technology has many advantages over the traditional Public Switched Telephone Networks. In
Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:
Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional
Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31
Disclaimer: This document was part of the First European DSP Education and Research Conference. It may have been written by someone whose native language is not English. TI assumes no liability for the
White Paper. Comparison between subjective listening quality and P.862 PESQ score. Prepared by: A.W. Rix Psytechnics Limited
Comparison between subjective listening quality and P.862 PESQ score White Paper Prepared by: A.W. Rix Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800
Evolution from Voiceband to Broadband Internet Access
Evolution from Voiceband to Broadband Internet Access Murtaza Ali DSPS R&D Center Texas Instruments Abstract With the growth of Internet, demand for high bit rate Internet access is growing. Even though
White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN
PESQ: An Introduction White Paper Prepared by: Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800 f: +44 (0) 1473 261 880 e: [email protected] September
Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers
Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers Sung-won ark and Jose Trevino Texas A&M University-Kingsville, EE/CS Department, MSC 92, Kingsville, TX 78363 TEL (36) 593-2638, FAX
Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.
Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet
The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications
Forensic Science International 146S (2004) S95 S99 www.elsevier.com/locate/forsciint The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications A.
Establishing the Uniqueness of the Human Voice for Security Applications
Proceedings of Student/Faculty Research Day, CSIS, Pace University, May 7th, 2004 Establishing the Uniqueness of the Human Voice for Security Applications Naresh P. Trilok, Sung-Hyuk Cha, and Charles C.
Dynamic sound source for simulating the Lombard effect in room acoustic modeling software
Dynamic sound source for simulating the Lombard effect in room acoustic modeling software Jens Holger Rindel a) Claus Lynge Christensen b) Odeon A/S, Scion-DTU, Diplomvej 381, DK-2800 Kgs. Lynby, Denmark
Audio Coding Algorithm for One-Segment Broadcasting
Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient
QoS Mapping of VoIP Communication using Self-Organizing Neural Network
QoS Mapping of VoIP Communication using Self-Organizing Neural Network Masao MASUGI NTT Network Service System Laboratories, NTT Corporation -9- Midori-cho, Musashino-shi, Tokyo 80-88, Japan E-mail: [email protected]
Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm
1 Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm Hani Mehrpouyan, Student Member, IEEE, Department of Electrical and Computer Engineering Queen s University, Kingston, Ontario,
Workshop Perceptual Effects of Filtering and Masking Introduction to Filtering and Masking
Workshop Perceptual Effects of Filtering and Masking Introduction to Filtering and Masking The perception and correct identification of speech sounds as phonemes depends on the listener extracting various
Use of Human Big Data to Help Improve Productivity in Service Businesses
Hitachi Review Vol. 6 (216), No. 2 847 Featured Articles Use of Human Big Data to Help Improve Productivity in Service Businesses Satomi Tsuji Hisanaga Omori Kenji Samejima Kazuo Yano, Dr. Eng. OVERVIEW:
Tech Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement Understanding VoIP Performance. Title Series.
Tech Note Title Series Voice Quality Measurement Understanding VoIP Performance Date January 2005 Overview This tech note describes commonly-used call quality measurement methods, explains the metrics
Monitoring VoIP Call Quality Using Improved Simplified E-model
Monitoring VoIP Call Quality Using Improved Simplified E-model Haytham Assem, David Malone Hamilton Institute, National University of Ireland, Maynooth Hitham.Salama.2012, [email protected] Jonathan
From Concept to Production in Secure Voice Communications
From Concept to Production in Secure Voice Communications Earl E. Swartzlander, Jr. Electrical and Computer Engineering Department University of Texas at Austin Austin, TX 78712 Abstract In the 1970s secure
Effects of Pronunciation Practice System Based on Personalized CG Animations of Mouth Movement Model
Effects of Pronunciation Practice System Based on Personalized CG Animations of Mouth Movement Model Kohei Arai 1 Graduate School of Science and Engineering Saga University Saga City, Japan Mariko Oda
A TOOL FOR TEACHING LINEAR PREDICTIVE CODING
A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering
Synchronization of sampling in distributed signal processing systems
Synchronization of sampling in distributed signal processing systems Károly Molnár, László Sujbert, Gábor Péceli Department of Measurement and Information Systems, Budapest University of Technology and
Telephone Speech Quality Standards. for. IP Phone Terminals (handsets) CES-Q003-2. September 30, 2004
Telephone Speech Quality Standards for IP Phone Terminals (handsets) CES-Q003-2 September 30, 2004 Communications and Information Network Association of Japan (CIAJ) Table of contents Introduction...3
Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation
Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation Sunil Karthick.M PG Scholar Department of ECE Kongu Engineering College Perundurau-638052 Venkatachalam.S Assistant Professor
A Robust Method for Solving Transcendental Equations
www.ijcsi.org 413 A Robust Method for Solving Transcendental Equations Md. Golam Moazzam, Amita Chakraborty and Md. Al-Amin Bhuiyan Department of Computer Science and Engineering, Jahangirnagar University,
THE SIMULATION OF MOVING SOUND SOURCES. John M. Chowning
THE SIMULATION OF MOVING SOUND SOURCES John M. Chowning The Center for Computer Research in Music and Acoustics (CCRMA) Department of Music Stanford University, Stanford, CA 94306 [email protected] (first
Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP
Department of Electrical and Computer Engineering Ben-Gurion University of the Negev LAB 1 - Introduction to USRP - 1-1 Introduction In this lab you will use software reconfigurable RF hardware from National
Sigma- Delta Modulator Simulation and Analysis using MatLab
Computer and Information Science; Vol. 5, No. 5; 2012 ISSN 1913-8989 E-ISSN 1913-8997 Published by Canadian Center of Science and Education Sigma- Delta Modulator Simulation and Analysis using MatLab Thuneibat
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Introduction to Packet Voice Technologies and VoIP
Introduction to Packet Voice Technologies and VoIP Cisco Networking Academy Program Halmstad University Olga Torstensson 035-167575 [email protected] IP Telephony 1 Traditional Telephony 2 Basic
The Phase Modulator In NBFM Voice Communication Systems
The Phase Modulator In NBFM Voice Communication Systems Virgil Leenerts 8 March 5 The phase modulator has been a point of discussion as to why it is used and not a frequency modulator in what are called
Voice over IP Protocols And Compression Algorithms
University of Tehran Electrical and Computer Engineering School SI Lab. Weekly Presentations Voice over IP Protocols And Compression Algorithms Presented by: Neda Kazemian Amiri Agenda Introduction to
A novel sharp beam-forming flat panel loudspeaker using digitally driven speaker system
Audio Engineering Society Convention Paper Presented at the 131st Convention 2011 October 20 23 New York, NY, USA This Convention paper was selected based on a submitted abstract and 750-word precis that
Jitter Measurements in Serial Data Signals
Jitter Measurements in Serial Data Signals Michael Schnecker, Product Manager LeCroy Corporation Introduction The increasing speed of serial data transmission systems places greater importance on measuring
How To Recognize Voice Over Ip On Pc Or Mac Or Ip On A Pc Or Ip (Ip) On A Microsoft Computer Or Ip Computer On A Mac Or Mac (Ip Or Ip) On An Ip Computer Or Mac Computer On An Mp3
Recognizing Voice Over IP: A Robust Front-End for Speech Recognition on the World Wide Web. By C.Moreno, A. Antolin and F.Diaz-de-Maria. Summary By Maheshwar Jayaraman 1 1. Introduction Voice Over IP is
Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT)
Page 1 Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT) ECC RECOMMENDATION (06)01 Bandwidth measurements using FFT techniques
An Arabic Text-To-Speech System Based on Artificial Neural Networks
Journal of Computer Science 5 (3): 207-213, 2009 ISSN 1549-3636 2009 Science Publications An Arabic Text-To-Speech System Based on Artificial Neural Networks Ghadeer Al-Said and Moussa Abdallah Department
Non-Data Aided Carrier Offset Compensation for SDR Implementation
Non-Data Aided Carrier Offset Compensation for SDR Implementation Anders Riis Jensen 1, Niels Terp Kjeldgaard Jørgensen 1 Kim Laugesen 1, Yannick Le Moullec 1,2 1 Department of Electronic Systems, 2 Center
Convention Paper Presented at the 118th Convention 2005 May 28 31 Barcelona, Spain
Audio Engineering Society Convention Paper Presented at the 118th Convention 25 May 28 31 Barcelona, Spain 6431 This convention paper has been reproduced from the author s advance manuscript, without editing,
Predict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music
ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final
Digital Transmission of Analog Data: PCM and Delta Modulation
Digital Transmission of Analog Data: PCM and Delta Modulation Required reading: Garcia 3.3.2 and 3.3.3 CSE 323, Fall 200 Instructor: N. Vlajic Digital Transmission of Analog Data 2 Digitization process
Testing Voice Service for Next Generation Packet Voice Networks
Testing Voice Service for Next Generation Packet Voice Networks Next Generation voice networks combine voice and data on the same transmission path. The advantages are many, but because of the technology
ETSI TS 101 329-2 V1.1.1 (2000-07)
TS 101 329-2 V1.1.1 (2000-07) Technical Specification Telecommunications and Internet Protocol Harmonization Over Networks (TIPHON); End to End Quality of Service in TIPHON Systems; Part 2: Definition
MultiDSLA. Measuring Network Performance. Malden Electronics Ltd
MultiDSLA Measuring Network Performance Malden Electronics Ltd The Business Case for Network Performance Measurement MultiDSLA is a highly scalable solution for the measurement of network speech transmission
RECOMMENDATION ITU-R BO.786 *
Rec. ITU-R BO.786 RECOMMENDATION ITU-R BO.786 * MUSE ** system for HDTV broadcasting-satellite services (Question ITU-R /) (992) The ITU Radiocommunication Assembly, considering a) that the MUSE system
Agilent Creating Multi-tone Signals With the N7509A Waveform Generation Toolbox. Application Note
Agilent Creating Multi-tone Signals With the N7509A Waveform Generation Toolbox Application Note Introduction Of all the signal engines in the N7509A, the most complex is the multi-tone engine. This application
Active Monitoring of Voice over IP Services with Malden
Active Monitoring of Voice over IP Services with Malden Introduction Active Monitoring describes the process of evaluating telecommunications system performance with intrusive tests. It differs from passive
Tutorial about the VQR (Voice Quality Restoration) technology
Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport
Perceived Speech Quality Prediction for Voice over IP-based Networks
Perceived Speech Quality Prediction for Voice over IP-based Networks Lingfen Sun and Emmanuel C. Ifeachor Department of Communication and Electronic Engineering, University of Plymouth, Plymouth PL 8AA,
Emotion Detection from Speech
Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction
Voice Encoding Methods for Digital Wireless Communications Systems
SOUTHERN METHODIST UNIVERSITY Voice Encoding Methods for Digital Wireless Communications Systems BY Bryan Douglas Street address city state, zip e-mail address Student ID xxx-xx-xxxx EE6302 Section 324,
How To Encode Data From A Signal To A Signal (Wired) To A Bitcode (Wired Or Coaxial)
Physical Layer Part 2 Data Encoding Techniques Networks: Data Encoding 1 Analog and Digital Transmissions Figure 2-23.The use of both analog and digital transmissions for a computer to computer call. Conversion
Timing Errors and Jitter
Timing Errors and Jitter Background Mike Story In a sampled (digital) system, samples have to be accurate in level and time. The digital system uses the two bits of information the signal was this big
Proposal for a Slot Pair Array Having an Invariant Main Beam Direction with a Cosecant Radiation Pattern Using a Post-Wall Waveguide
176 IEICE TRANS. ELECTRON., VOL.E86 C, NO.2 FEBRUARY 2003 PAPER Special Issue on Circuit and Device Technology for High-Speed Wireless Communication Proposal for a Slot Pair Array Having an Invariant Main
Network Sensing Network Monitoring and Diagnosis Technologies
Network Sensing Network Monitoring and Diagnosis Technologies Masanobu Morinaga Yuji Nomura Takeshi Otani Motoyuki Kawaba (Manuscript received April 7, 2009) Problems that occur in the information and
ACOUSTICAL CONSIDERATIONS FOR EFFECTIVE EMERGENCY ALARM SYSTEMS IN AN INDUSTRIAL SETTING
ACOUSTICAL CONSIDERATIONS FOR EFFECTIVE EMERGENCY ALARM SYSTEMS IN AN INDUSTRIAL SETTING Dennis P. Driscoll, P.E. and David C. Byrne, CCC-A Associates in Acoustics, Inc. Evergreen, Colorado Telephone (303)
Current Status and Problems in Mastering of Sound Volume in TV News and Commercials
Current Status and Problems in Mastering of Sound Volume in TV News and Commercials Doo-Heon Kyon, Myung-Sook Kim and Myung-Jin Bae Electronics Engineering Department, Soongsil University, Korea [email protected],
Performance Evaluation of Online Image Compression Tools
Performance Evaluation of Online Image Compression Tools Rupali Sharma 1, aresh Kumar 1, Department of Computer Science, PTUGZS Campus, Bathinda (Punjab), India 1 [email protected], [email protected]
PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS
The 21 st International Congress on Sound and Vibration 13-17 July, 2014, Beijing/China PERCENTAGE ARTICULATION LOSS OF CONSONANTS IN THE ELEMENTARY SCHOOL CLASSROOMS Dan Wang, Nanjie Yan and Jianxin Peng*
Performance Analysis of Interleaving Scheme in Wideband VoIP System under Different Strategic Conditions
Performance Analysis of Scheme in Wideband VoIP System under Different Strategic Conditions Harjit Pal Singh 1, Sarabjeet Singh 1 and Jasvir Singh 2 1 Dept. of Physics, Dr. B.R. Ambedkar National Institute
Thirukkural - A Text-to-Speech Synthesis System
Thirukkural - A Text-to-Speech Synthesis System G. L. Jayavardhana Rama, A. G. Ramakrishnan, M Vijay Venkatesh, R. Murali Shankar Department of Electrical Engg, Indian Institute of Science, Bangalore 560012,
Sound absorption and acoustic surface impedance
Sound absorption and acoustic surface impedance CHRISTER HEED SD2165 Stockholm October 2008 Marcus Wallenberg Laboratoriet för Ljud- och Vibrationsforskning Sound absorption and acoustic surface impedance
TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS
TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS 1. Bandwidth: The bandwidth of a communication link, or in general any system, was loosely defined as the width of
Application Note. Introduction. Definition of Call Quality. Contents. Voice Quality Measurement. Series. Overview
Application Note Title Series Date Nov 2014 Overview Voice Quality Measurement Voice over IP Performance Management This Application Note describes commonlyused call quality measurement methods, explains
Lecture 1-6: Noise and Filters
Lecture 1-6: Noise and Filters Overview 1. Periodic and Aperiodic Signals Review: by periodic signals, we mean signals that have a waveform shape that repeats. The time taken for the waveform to repeat
SPEECH INTELLIGIBILITY and Fire Alarm Voice Communication Systems
SPEECH INTELLIGIBILITY and Fire Alarm Voice Communication Systems WILLIAM KUFFNER, M.A. Sc., P.Eng, PMP Senior Fire Protection Engineer Director Fire Protection Engineering October 30, 2013 Code Reference
INTER CARRIER INTERFERENCE CANCELLATION IN HIGH SPEED OFDM SYSTEM Y. Naveena *1, K. Upendra Chowdary 2
ISSN 2277-2685 IJESR/June 2014/ Vol-4/Issue-6/333-337 Y. Naveena et al./ International Journal of Engineering & Science Research INTER CARRIER INTERFERENCE CANCELLATION IN HIGH SPEED OFDM SYSTEM Y. Naveena
Objective Speech Quality Measures for Internet Telephony
Objective Speech Quality Measures for Internet Telephony Timothy A. Hall National Institute of Standards and Technology 100 Bureau Drive, STOP 8920 Gaithersburg, MD 20899-8920 ABSTRACT Measuring voice
CZECH PILOT STUDY OF ROAD HORIZONTAL ALIGNMENT OPTIMIZATION
The XXVIII International Baltic Road Conference CZECH PILOT STUDY OF ROAD HORIZONTAL ALIGNMENT OPTIMIZATION Jiří Ambros 1, Veronika Valentová 2 1,2 Centrum dopravního výzkumu, v.v.i., Brno, Czech Republic
Ericsson T18s Voice Dialing Simulator
Ericsson T18s Voice Dialing Simulator Mauricio Aracena Kovacevic, Anna Dehlbom, Jakob Ekeberg, Guillaume Gariazzo, Eric Lästh and Vanessa Troncoso Dept. of Signals Sensors and Systems Royal Institute of
SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS. J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID
SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID Renewable Energy Laboratory Department of Mechanical and Industrial Engineering University of
Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction
Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals Modified from the lecture slides of Lami Kaya ([email protected]) for use CECS 474, Fall 2008. 2009 Pearson Education Inc., Upper
application note Directional Microphone Applications Introduction Directional Hearing Aids
APPLICATION NOTE AN-4 Directional Microphone Applications Introduction The inability to understand speech in noisy environments is a significant problem for hearing impaired individuals. An omnidirectional
Optical Fibres. Introduction. Safety precautions. For your safety. For the safety of the apparatus
Please do not remove this manual from from the lab. It is available at www.cm.ph.bham.ac.uk/y2lab Optics Introduction Optical fibres are widely used for transmitting data at high speeds. In this experiment,
Data Transmission. Data Communications Model. CSE 3461 / 5461: Computer Networking & Internet Technologies. Presentation B
CSE 3461 / 5461: Computer Networking & Internet Technologies Data Transmission Presentation B Kannan Srinivasan 08/30/2012 Data Communications Model Figure 1.2 Studying Assignment: 3.1-3.4, 4.1 Presentation
PCM Encoding and Decoding:
PCM Encoding and Decoding: Aim: Introduction to PCM encoding and decoding. Introduction: PCM Encoding: The input to the PCM ENCODER module is an analog message. This must be constrained to a defined bandwidth
Effects of microphone arrangement on the accuracy of a spherical microphone array (SENZI) in acquiring high-definition 3D sound space information
November 1, 21 1:3 1 Effects of microphone arrangement on the accuracy of a spherical microphone array (SENZI) in acquiring high-definition 3D sound space information Shuichi Sakamoto 1, Jun ichi Kodama
Inter-Cell Interference Coordination (ICIC) Technology
Inter-Cell Interference Coordination (ICIC) Technology Dai Kimura Hiroyuki Seki Long Term Evolution (LTE) is a promising standard for next-generation cellular systems targeted to have a peak downlink bit
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXIV-5/W10
Accurate 3D information extraction from large-scale data compressed image and the study of the optimum stereo imaging method Riichi NAGURA *, * Kanagawa Institute of Technology [email protected]
VALAR: A BENCHMARK SUITE TO STUDY THE DYNAMIC BEHAVIOR OF HETEROGENEOUS SYSTEMS
VALAR: A BENCHMARK SUITE TO STUDY THE DYNAMIC BEHAVIOR OF HETEROGENEOUS SYSTEMS Perhaad Mistry, Yash Ukidave, Dana Schaa, David Kaeli Department of Electrical and Computer Engineering Northeastern University,
TCOM 370 NOTES 99-6 VOICE DIGITIZATION AND VOICE/DATA INTEGRATION
TCOM 370 NOTES 99-6 VOICE DIGITIZATION AND VOICE/DATA INTEGRATION (Please read appropriate parts of Section 2.5.2 in book) 1. VOICE DIGITIZATION IN THE PSTN The frequencies contained in telephone-quality
CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression
Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the
The Optimization of Parameters Configuration for AMR Codec in Mobile Networks
01 8th International Conference on Communications and Networking in China (CHINACOM) The Optimization of Parameters Configuration for AMR Codec in Mobile Networks Nan Ha,JingWang, Zesong Fei, Wenzhi Li,
MATLAB-based Applications for Image Processing and Image Quality Assessment Part II: Experimental Results
154 L. KRASULA, M. KLÍMA, E. ROGARD, E. JEANBLANC, MATLAB BASED APPLICATIONS PART II: EXPERIMENTAL RESULTS MATLAB-based Applications for Image Processing and Image Quality Assessment Part II: Experimental
This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.
This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. Title Transcription of polyphonic signals using fast filter bank( Accepted version ) Author(s) Foo, Say Wei;
LAB 7 MOSFET CHARACTERISTICS AND APPLICATIONS
LAB 7 MOSFET CHARACTERISTICS AND APPLICATIONS Objective In this experiment you will study the i-v characteristics of an MOS transistor. You will use the MOSFET as a variable resistor and as a switch. BACKGROUND
T = 1 f. Phase. Measure of relative position in time within a single period of a signal For a periodic signal f(t), phase is fractional part t p
Data Transmission Concepts and terminology Transmission terminology Transmission from transmitter to receiver goes over some transmission medium using electromagnetic waves Guided media. Waves are guided
Mobile Phone Tracking & Positioning Techniques
Mobile Phone Tracking & Positioning Techniques Laxmana Siridhara Arigela #1, Putta Aditya Veerendra *2, Simhadri Anvesh *2, Kolisetty Sandeep Satya Hanuman #3 1 Assistant Professor, Department of ECE,
Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.
Sampling Theorem We will show that a band limited signal can be reconstructed exactly from its discrete time samples. Recall: That a time sampled signal is like taking a snap shot or picture of signal
