Lecture 7: Audio compression and coding

Size: px
Start display at page:

Download "Lecture 7: Audio compression and coding"

Transcription

1 EE E682: Speech & Audio Processing & Recognition Lecture 7: Audio compression and coding Dan Ellis Michael Mandel Columbia University Dept. of Electrical Engineering dpwe/e682 March 31, 29 1 Information, Compression & Quantization 2 Speech coding 3 Wide-Bandwidth Audio Coding E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 1 / 37

2 Outline 1 Information, Compression & Quantization 2 Speech coding 3 Wide-Bandwidth Audio Coding E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 2 / 37

3 Compression & Quantization How big is audio data? What is the bitrate? F s frames/second (e.g. 8 or 441) x C samples/frame (e.g. 1 or 2 channels) x B bits/sample (e.g. 8 or 16) F s C B bits/second (e.g. 64 Kbps or 1.4 Mbps) bits / frame CD Audio 1.4 Mbps 32 8 Mobile 8 441!13 Kbps frames / sec Telephony 64 Kbps How to reduce? lower sampling rate less bandwidth (muffled) lower channel count no stereo image lower sample size quantization noise Or: use data compression E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 3 / 37

4 Data compression: Redundancy vs. Irrelevance Two main principles in compression: remove redundant information remove irrelevant information Redundant information is implicit in remainder e.g. signal bandlimited to 2kHz, but sample at 8kHz can recover every other sample by interpolation: In a bandlimited signal, the red samples can be exactly recovered by interpolating the blue samples sample Irrelevant info is unique but unnecessary e.g. recording a microphone signal at 8 khz sampling rate time E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 4 / 37

5 Irrelevant data in audio coding For coding of audio signals, irrelevant means perceptually insignificant an empirical property Compact Disc standard is adequate: 44 khz sampling for 2 khz bandwidth 16 bit linear samples for 96 db peak SNR Reflect limits of human sensitivity: 2 khz bandwidth, 1 db intensity sinusoid phase, detail of noise structure dynamic properties - hard to characterize Problem: separating salient & irrelevant E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 5 / 37

6 Quantization Represent waveform with discrete levels Q[x] x Q[x[n]] Equivalent to adding error e[n]: x[n] = Q [x[n]] + e[n] e[n] uncorrelated, uniform white noise p(e[n]) -D /2 +D /2 x[n] -2 error e[n] = x[n] - Q[x[n]] variance σ 2 e = D2 12 E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 6 / 37

7 Quantization noise (Q-noise) Uncorrelated noise has flat spectrum With a B bit word and a quantization step D max signal range (x) = (2 B 1 ) D... (2 B 1 1) D quantization noise (e) = D/2... D/2 Best signal-to-noise ratio (power) SNR = E[x 2 ]/E[e 2 ] = (2 B ) 2.. or, in db, 2 log 1 2 B 6 B db level / db -2-4 Quantized at 7 bits freq / Hz E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 7 / 37

8 Redundant information Redundancy removal is lossless Signal correlation implies redundant information e.g. if x[n] = x[n 1] + v[n] x[n] has a greater amplitude range uses more bits than v[n] sending v[n] = x[n] x[n 1] can reduce amplitude, hence bitrate x[n] - x[n-1] but: white noise sequence has no redundancy... Problem: separating unique & redundant E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 8 / 37

9 Optimal coding Shannon information: An unlikely occurrence is more informative p(a) =.5 p(b) =.5 ABBBBAAABBABBABBABB p(a) =.9 p(b) =.1 AAAAABBAAAAAABAAAAB A, B equiprobable A is expected; B is big news Information in bits I = log 2 (probability) clearly works when all possibilities equiprobable Optimal bitrate av.token length = entropy H = E[I ].. equal-length tokens are equally likely How to achieve this? transform signal to have uniform pdf nonuniform quantization for equiprobable tokens variable-length tokens Huffman coding E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 9 / 37

10 Quantization for optimum bitrate Quantization should reflect pdf of signal: p(x = x ) p(x < x ) x' cumulative pdf p(x < x ) maps to uniform x Or, codeword length per Shannon log 2 (p(x)): p(x) x Shannon info / bits Codewords xx 11111xx 1111xx 11xx 1xx xx 11xx 111xx 11111xx xx xx xx xx Huffman coding: tree-structured decoder E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 1 / 37

11 Vector Quantization Quantize mutually dependent values in joint space: 3 2 x x May help even if values are largely independent larger space x1,x2 is easier for Huffman E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

12 Compression & Representation As always, success depends on representation Appropriate domain may be naturally bandlimited e.g. vocal-tract-shape coefficients can reduce sampling rate without data loss In right domain, irrelevance is easier to get at e.g. STFT to separate magnitude and phase E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

13 Aside: Coding standards Coding only useful if recipient knows the code! Standardization efforts are important Federal Standards: Low bit-rate secure voice: FS115e: LPC Kbps FS116: 4.8 Kbps CELP ITU G.x series (also H.x for video) G.726 ADPCM G.729 Low delay CELP MPEG MPEG-Audio layers 1,2,3 (mp3) MPEG 2 Advanced Audio Codec (AAC) MPEG 4 Synthetic-Natural Hybrid Codec More recent standards proprietary: WMA, Skype... Speex... E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

14 Outline 1 Information, Compression & Quantization 2 Speech coding 3 Wide-Bandwidth Audio Coding E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

15 Speech coding Standard voice channel: analog: 4 khz slot ( 4 db SNR) digital: 64 Kbps = 8 bit µ-law x 8 khz How to compress? Redundant signal assumed to be a single voice, not any possible waveform Irrelevant need code only enough for intelligibility, speaker identification (c/w analog channel) Specifically, source-filter decomposition vocal tract & f change slowly Applications: live communications offline storage E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

16 Channel Vocoder (194s-196s) Basic source-filter decomposition filterbank breaks into spectral bands transmit slowly-changing energy in each band Encoder Bandpass filter 1 Smoothed energy Downsample & encode E 1 Decoder Bandpass filter 1 Input Bandpass filter N Smoothed energy Downsample & encode E N Bandpass filter 1 Output Voicing analysis V/UV Pitch Pulse generator Noise source 1-2 bands, perceptually spaced Downsampling? Excitation? pitch / noise model or: baseband + flattening... E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

17 LPC encoding The classic source-filter model Input s[n] Encoder Filter coefficients {ai} 1 /A(e j! ) LPC analysis f Residual e[n] t Represent & encode Represent & encode Decoder Excitation generator e[n] ^ All-pole filter H(z) = "a i z -i Output s[n] ^ Compression gains: filter parameters are slowly changing excitation can be represented many ways Filter parameters Excitation/pitch parameters 5 ms 2 ms E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

18 Encoding LPC filter parameters For communications quality : 8 khz sampling (4 khz bandwidth) 1th order LPC (up to 5 pole pairs) update every 2-3 ms 3-5 param/s Representation & quantization {ai } - poor distribution, can t interpolate reflection coefficients {ki }: guaranteed stable LSPs - lovely! a i k i f Li Bit allocation (filter): GSM (13 kbps): 8 LARs x 3-6 bits / 2 ms = 1.8 Kbps FS116 (4.8 kbps): 1 LSPs x 3-4 bits / 3 ms = 1.1 Kbps E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

19 Line Spectral Pairs (LSPs) LSPs encode LPC filter by a set of frequencies Excellent for quantization & interpolation Definition: zeros of P(z) = A(z) + z p 1 A(z 1 ) Q(z) = A(z) z p 1 A(z 1 ) z = e jω z 1 = e jω A(z) = A(z 1 ) on u.circ. P(z), Q(z) have (interleaved) zeros when {A(z)} = ± {z p 1 A(z 1 )} reconstruct P(z), Q(z) = i (1 ζ iz 1 ) etc. A(z) = [P(z) + Q(z)]/2 A(z -1 ) = A(z) = Q(z) = P(z) = E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

20 Encoding LPC excitation Excitation already better than raw signal: 5 Original signal LPC residual time / s save several bits/sample, but still > 32 Kbps Crude model: U/V flag + pitch period 7 bits / 5 ms = 1.4 Kbps 2.4 Kbps Pitch period values 16 ms frame boundaries time / s Band-limit then re-extend (RELP) E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 2 / 37

21 Encoding excitation Something between full-quality residual (32 Kbps) and pitch parameters (1.4 kbps)? Analysis by synthesis loop: Input s[n] LPC analysis Filter coefficients A(z) Excitation code Synthetic excitation c[n] control Pitch-cycle predictor + b z NL x[n] ^ LPC filter 1 A(z) + MSError minimization Excitation encoding Perceptual weighting A(z) W(z)= A(z/!) Perceptual weighting discounts peaks: gain / db 2 1/A(z) 1 1/A(z/!) -1 A(z) W(z)= A(z/!) ^x[n]*h a [n] - s[n] freq / Hz E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

22 Multi-Pulse Excitation (MPE-LPC) Stylize excitation as N discrete pulses mi 5 original LPC residual multi-pulse excitation ti time / samps encode as N (ti, m i ) pairs Greedy algorithm places one pulse at a time: E pcp = A(z) [ ] X (z) A(z/γ) A(z) S(z) = X (z) A(z/γ) R(z) A(z/γ) R(z) is residual of target waveform after inverse-filtering cross-correlate hγ and r h γ, iterate E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

23 CELP Represent excitation with codebook e.g. 512 sparse excitation vectors Codebook Search index excitation time / samples linear search for minimum weighted error? FS Kbps CELP (3ms frame = 144 bits): 1 LSPs 4x4 + 6x3 bits = 34 bits Pitch delay 4 x 7 bits = 28 bits Pitch gain 4 x 5 bits = 2 bits Codebk index 4 x 9 bits = 36 bits Codebk gain 4 x 5 bits = 2 bits 138 bits E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

24 Aside: CELP for nonspeech? CELP is sometimes called a hybrid coder: originally based on source-filter voice model CELP residual is waveform coding (no model) freq / khz 4 3 Original (mrzebra-8k) CELP does not break with multiple voices etc. just does the best it can kbps buzz-hiss 4.8 kbps CELP LPC filter models vocal tract; also matches auditory system? i.e. the source-filter separation is good for relevance as well as redundancy? E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37 time / sec

25 Outline 1 Information, Compression & Quantization 2 Speech coding 3 Wide-Bandwidth Audio Coding E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

26 Wide-Bandwidth Audio Coding Goals: transparent coding i.e. no perceptible effect general purpose - handles any signal Simple approaches (redundancy removal) Adaptive Differential PCM (ADPCM) X[n] + + D[n] Adaptive quantizer C[n] = Q[D[n]] C[n] X p [n] Predictor X p [n] = F[X'[n-i]] X'[n] D'[n] Dequantizer D'[n] = Q -1 [C[n]] as prediction gets smarter, becomes LPC e.g. shorten - lossless LPC encoding Larger compression gains needs irrelevance hide quantization noise with psychoacoustic masking E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

27 Noise shaping Plain Q-noise sounds like added white noise actually, not all that disturbing.. but worst-case for exploiting masking Have Q-noise scale with signal level i.e. quantizer step gets larger with amplitude minimum distortion for some center-heavy pdf mu-law quantization mu(x) Or: put Q-noise around peaks in spectrum key to getting benefit of perceptual masking x - mu(x) level / db 2-2 Signal Linear Q-noise -4-6 Transform Q-noise freq / Hz E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

28 Subband coding Idea: Quantize separately in separate bands Input Encoder Bandpass Analysis filters f Decoder Downsample Quantize M Q[ ] Q -1 [ ] M f Reconstruction Output (M channels) + filters f M Q[ ] Q -1 [ ] M Q-noise stays within band, gets masked Critical sampling 1/M of spectrum per band gain / db -1-2 alias energy normalized freq 1 some aliasing inevitable Trick is to cancel with alias of adjacent band quadrature-mirror filters E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

29 MPEG-Audio (layer I, II) Basic idea: Subband coding plus psychoacoustic masking model to choose dynamic Q-levels in subbands Input 32 band polyphase filterbank Psychoacoustic masking model Scale normalize Scale normalize Scale indices Per-band masking margins Quantize Quantize Format & pack bitstream Bitstream Data frame Control & scales 32 chans x 36 samples 24 ms 22 khz 32 equal bands = 69 Hz bandwidth 8 / 24 ms frames = 12 / 36 subband samples fixed bitrates Kbps/chan (1-6 bits/samp) scale factors are like LPC envelope? E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

30 MPEG Psychoacoustic model Based on simultaneous masking experiments Difficulties: noise energy masks 1 db better than tones masking level nonlinear in frequency & intensity complex, dynamic sounds not well understood Procedure pick tonal peaks in NB FFT spectrum remaining energy noisy peaks apply nonlinear spreading function sum all masking & threshold in power domain SPL / db SPL / db SPL / db Signal spectrum Tonal peaks Masking spread Non-tonal energy Resultant masking freq / Bark E682 (Ellis & Mandel) L7: Audio compression and coding March 31, 29 3 / 37

31 MPEG Bit allocation Result of psychoacoustic model is maximum tolerable noise per subband Masking tone level SNR ~ 6 B Masked threshold Safe noise level Quantization noise Subband N freq safe noise level required SNR bits B Bit allocation procedure (fixed bit rate): pick channel with worst noise-masker ratio improve its quantization by one step repeat while more bits available for this frame Bands with no signal above masking curve can be skipped E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

32 MPEG Audio Layer III Transform coder on top of subband coder Digital Audio Signal (PCM) (768 kbit/s) Subband 31 Filterbank 32 Subbands MDCT FFT 124 Points Window Switching Line 575 Psychoacoustic Model Distortion Control Loop Nonuniform Quantization Rate Control Loop Huffman Encoding Coding of Sideinformation External Control Bitstream Formatting CRC-Check Coded Audio Signal 192 kbit/s 32 kbit/s Blocks of 36 subband time-domain samples become 18 pairs of frequency-domain samples more redundancy in spectral domain finer control e.g. of aliasing, masking scale factors now in band-blocks Fixed Huffman tables optimized for audio data Power-law quantizer E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

33 Adaptive time window Time window relies on temporal masking single quantization level over 8-24 ms window Nightmare scenario: Pre-echo distortion backward masking saves in most cases Adaptive switching of time window: window level normal transition short time / ms E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

34 The effects of MP3 Before & after: freq / khz Josie - direct from CD 5 freq / khz After MP3 encode (16 kbps) and decode 5 freq / khz Residual (after aligning 1148 sample delay) chop off high frequency (above 16 khz) occasional other time-frequency holes quantization noise under signal time / sec E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

35 MP3 & Beyond MP3 is transparent at 128 Kbps for stereo (11x smaller than 1.4 Mbps CD rate) only decoder is standardized: better psychological models better encoders MPEG2 AAC rebuild of MP3 without backwards compatibility 3% better (stereo at 96 Kbps?) multichannel etc. MPEG4-Audio wide range of component encodings MPEG Audio, LSPs,... SAOL synthetic component of MPEG-4 Audio complete DSP/computer music language! how to encode into it? E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

36 Summary For coding, every bit counts take care over quantization domain & effects Shannon limits... Speech coding LPC modeling is old but good CELP residual modeling can go beyond speech Wide-band coding noise shaping hides quantization noise detailed psychoacoustic models are key E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

37 References Ben Gold and Nelson Morgan. Speech and Audio Signal Processing: Processing and Perception of Speech and Music. Wiley, July ISBN D. Pan. A tutorial on MPEG/audio compression. IEEE Multimedia, 2(2):6 74, Karlheinz Brandenburg. MP3 and AAC explained. In Proc. AES 17th International Conference on High Quality Audio Coding, pages 1 12, T. Painter and A. Spanias. Perceptual coding of digital audio. Proc. IEEE, 8(4): , Apr. 2. E682 (Ellis & Mandel) L7: Audio compression and coding March 31, / 37

Audio Coding, Psycho- Accoustic model and MP3

Audio Coding, Psycho- Accoustic model and MP3 INF5081: Multimedia Coding and Applications Audio Coding, Psycho- Accoustic model and MP3, NR Torbjørn Ekman, Ifi Nils Christophersen, Ifi Sverre Holm, Ifi What is Sound? Sound waves: 20Hz - 20kHz Speed:

More information

AUDIO CODING: BASICS AND STATE OF THE ART

AUDIO CODING: BASICS AND STATE OF THE ART AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603

More information

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final

More information

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional

More information

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream Signal to Mask Ratio

More information

Audio Coding Introduction

Audio Coding Introduction Audio Coding Introduction Lecture WS 2013/2014 Prof. Dr.-Ing. Karlheinz Brandenburg bdg@idmt.fraunhofer.de Prof. Dr.-Ing. Gerald Schuller shl@idmt.fraunhofer.de Page Nr. 1 Organisatorial Details - Overview

More information

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29. Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

Analog-to-Digital Voice Encoding

Analog-to-Digital Voice Encoding Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

Speech Compression. 2.1 Introduction

Speech Compression. 2.1 Introduction Speech Compression 2 This chapter presents an introduction to speech compression techniques, together with a detailed description of speech/audio compression standards including narrowband, wideband and

More information

Digital Audio Compression: Why, What, and How

Digital Audio Compression: Why, What, and How Digital Audio Compression: Why, What, and How An Absurdly Short Course Jeff Bier Berkeley Design Technology, Inc. 2000 BDTI 1 Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions

More information

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering

More information

The Theory Behind Mp3

The Theory Behind Mp3 The Theory Behind Mp3 Rassol Raissi December 2002 Abstract Since the MPEG-1 Layer III encoding technology is nowadays widely used it might be interesting to gain knowledge of how this powerful compression/decompression

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.)

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.) VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.) 1 Remember first the big picture VoIP network architecture and some terminologies Voice coders 2 Audio and voice quality measuring

More information

Simple Voice over IP (VoIP) Implementation

Simple Voice over IP (VoIP) Implementation Simple Voice over IP (VoIP) Implementation ECE Department, University of Florida Abstract Voice over IP (VoIP) technology has many advantages over the traditional Public Switched Telephone Networks. In

More information

Digital Audio Compression

Digital Audio Compression By Davis Yen Pan Abstract Compared to most digital data types, with the exception of digital video, the data rates associated with uncompressed digital audio are substantial. Digital audio compression

More information

Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT)

Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT) Page 1 Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT) ECC RECOMMENDATION (06)01 Bandwidth measurements using FFT techniques

More information

MP3 AND AAC EXPLAINED

MP3 AND AAC EXPLAINED MP3 AND AAC EXPLAINED KARLHEINZ BRANDENBURG ½ ½ Fraunhofer Institute for Integrated Circuits FhG-IIS A, Erlangen, Germany bdg@iis.fhg.de The last years have shown widespread proliferation of.mp3-files,

More information

White Paper: An Overview of the Coherent Acoustics Coding System

White Paper: An Overview of the Coherent Acoustics Coding System White Paper: An Overview of the Coherent Acoustics Coding System Mike Smyth June 1999 Introduction Coherent Acoustics is a digital audio compression algorithm designed for both professional and consumer

More information

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral GSM speech coding Forelesning INF 5080 Vårsemester 2004 Sources This part contains material from: Web pages Universität Bremen, Arbeitsbereich Nachrichtentechnik (ANT): Prof.K.D. Kammeyer, Jörg Bitzer,

More information

HD Radio FM Transmission System Specifications Rev. F August 24, 2011

HD Radio FM Transmission System Specifications Rev. F August 24, 2011 HD Radio FM Transmission System Specifications Rev. F August 24, 2011 SY_SSS_1026s TRADEMARKS HD Radio and the HD, HD Radio, and Arc logos are proprietary trademarks of ibiquity Digital Corporation. ibiquity,

More information

TCOM 370 NOTES 99-6 VOICE DIGITIZATION AND VOICE/DATA INTEGRATION

TCOM 370 NOTES 99-6 VOICE DIGITIZATION AND VOICE/DATA INTEGRATION TCOM 370 NOTES 99-6 VOICE DIGITIZATION AND VOICE/DATA INTEGRATION (Please read appropriate parts of Section 2.5.2 in book) 1. VOICE DIGITIZATION IN THE PSTN The frequencies contained in telephone-quality

More information

A Review of Algorithms for Perceptual Coding of Digital Audio Signals

A Review of Algorithms for Perceptual Coding of Digital Audio Signals A Review of Algorithms for Perceptual Coding of Digital Audio Signals Ted Painter, Student Member IEEE, and Andreas Spanias, Senior Member IEEE Department of Electrical Engineering, Telecommunications

More information

AC 2012-5055: MULTIMEDIA SYSTEMS EDUCATION INNOVATIONS I: SPEECH

AC 2012-5055: MULTIMEDIA SYSTEMS EDUCATION INNOVATIONS I: SPEECH AC -555: MULTIMEDIA SYSTEMS EDUCATION INNOVATIONS I: SPEECH Prof. Tokunbo Ogunfunmi, Santa Clara University Tokunbo Ogunfunmi is the Associate Dean for Research and Faculty Development in the School of

More information

L9: Cepstral analysis

L9: Cepstral analysis L9: Cepstral analysis The cepstrum Homomorphic filtering The cepstrum and voicing/pitch detection Linear prediction cepstral coefficients Mel frequency cepstral coefficients This lecture is based on [Taylor,

More information

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Quality Estimation for Scalable Video Codec Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Purpose of scalable video coding Multiple video streams are needed for heterogeneous

More information

MPEG Layer-3. An introduction to. 1. Introduction

MPEG Layer-3. An introduction to. 1. Introduction An introduction to MPEG Layer-3 MPEG Layer-3 K. Brandenburg and H. Popp Fraunhofer Institut für Integrierte Schaltungen (IIS) MPEG Layer-3, otherwise known as MP3, has generated a phenomenal interest among

More information

Solutions to Exam in Speech Signal Processing EN2300

Solutions to Exam in Speech Signal Processing EN2300 Solutions to Exam in Speech Signal Processing EN23 Date: Thursday, Dec 2, 8: 3: Place: Allowed: Grades: Language: Solutions: Q34, Q36 Beta Math Handbook (or corresponding), calculator with empty memory.

More information

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder.

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder. A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER Manoj Kumar 1 Mohammad Zubair 1 1 IBM T.J. Watson Research Center, Yorktown Hgts, NY, USA ABSTRACT The MPEG/Audio is a standard for both

More information

Introduction to image coding

Introduction to image coding Introduction to image coding Image coding aims at reducing amount of data required for image representation, storage or transmission. This is achieved by removing redundant data from an image, i.e. by

More information

Web-Conferencing System SAViiMeeting

Web-Conferencing System SAViiMeeting Web-Conferencing System SAViiMeeting Alexei Machovikov Department of Informatics and Computer Technologies National University of Mineral Resources Mining St-Petersburg, Russia amachovikov@gmail.com Abstract

More information

Linear Predictive Coding

Linear Predictive Coding Linear Predictive Coding Jeremy Bradbury December 5, 2000 0 Outline I. Proposal II. Introduction A. Speech Coding B. Voice Coders C. LPC Overview III. Historical Perspective of Linear Predictive Coding

More information

Analysis/resynthesis with the short time Fourier transform

Analysis/resynthesis with the short time Fourier transform Analysis/resynthesis with the short time Fourier transform summer 2006 lecture on analysis, modeling and transformation of audio signals Axel Röbel Institute of communication science TU-Berlin IRCAM Analysis/Synthesis

More information

Audio Coding Algorithm for One-Segment Broadcasting

Audio Coding Algorithm for One-Segment Broadcasting Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient

More information

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton Abstract This paper intends to give the reader some insight into the workings

More information

PCM Encoding and Decoding:

PCM Encoding and Decoding: PCM Encoding and Decoding: Aim: Introduction to PCM encoding and decoding. Introduction: PCM Encoding: The input to the PCM ENCODER module is an analog message. This must be constrained to a defined bandwidth

More information

Digital Audio and Video Data

Digital Audio and Video Data Multimedia Networking Reading: Sections 3.1.2, 3.3, 4.5, and 6.5 CS-375: Computer Networks Dr. Thomas C. Bressoud 1 Digital Audio and Video Data 2 Challenges for Media Streaming Large volume of data Each

More information

Multimedia Communications

Multimedia Communications Multimedia Communications Dr. Ing. Audio Processing and Coding MMC Overview 1. Introduction 2. Fundamentals (Signal Processing, Information Theorie) 3. Speech Processing & Coding 4. Audio Processing &

More information

Introduction to Packet Voice Technologies and VoIP

Introduction to Packet Voice Technologies and VoIP Introduction to Packet Voice Technologies and VoIP Cisco Networking Academy Program Halmstad University Olga Torstensson 035-167575 olga.torstensson@ide.hh.se IP Telephony 1 Traditional Telephony 2 Basic

More information

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN Zheng Lai Zhao Liu Meng Li Quan Yuan zl2215@columbia.edu zl2211@columbia.edu ml3088@columbia.edu qy2123@columbia.edu I. Overview Architecture The purpose

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information

Luigi Piroddi Active Noise Control course notes (January 2015)

Luigi Piroddi Active Noise Control course notes (January 2015) Active Noise Control course notes (January 2015) 9. On-line secondary path modeling techniques Luigi Piroddi piroddi@elet.polimi.it Introduction In the feedforward ANC scheme the primary noise is canceled

More information

Figure1. Acoustic feedback in packet based video conferencing system

Figure1. Acoustic feedback in packet based video conferencing system Real-Time Howling Detection for Hands-Free Video Conferencing System Mi Suk Lee and Do Young Kim Future Internet Research Department ETRI, Daejeon, Korea {lms, dyk}@etri.re.kr Abstract: This paper presents

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

AN-007 APPLICATION NOTE MEASURING MAXIMUM SUBWOOFER OUTPUT ACCORDING ANSI/CEA-2010 STANDARD INTRODUCTION CEA-2010 (ANSI) TEST PROCEDURE

AN-007 APPLICATION NOTE MEASURING MAXIMUM SUBWOOFER OUTPUT ACCORDING ANSI/CEA-2010 STANDARD INTRODUCTION CEA-2010 (ANSI) TEST PROCEDURE AUDIOMATICA AN-007 APPLICATION NOTE MEASURING MAXIMUM SUBWOOFER OUTPUT ACCORDING ANSI/CEA-2010 STANDARD by Daniele Ponteggia - dp@audiomatica.com INTRODUCTION The Consumer Electronics Association (CEA),

More information

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals Modified from the lecture slides of Lami Kaya (LKaya@ieee.org) for use CECS 474, Fall 2008. 2009 Pearson Education Inc., Upper

More information

Aalborg Universitet. Estimation and Modeling Problems in Parametric Audio Coding Christensen, Mads Græsbøll. Publication date: 2005

Aalborg Universitet. Estimation and Modeling Problems in Parametric Audio Coding Christensen, Mads Græsbøll. Publication date: 2005 Aalborg Universitet Estimation and Modeling Problems in Parametric Audio Coding Christensen, Mads Græsbøll Publication date: 2005 Document Version Publisher's PDF, also known as Version of record Link

More information

Lab 5 Getting started with analog-digital conversion

Lab 5 Getting started with analog-digital conversion Lab 5 Getting started with analog-digital conversion Achievements in this experiment Practical knowledge of coding of an analog signal into a train of digital codewords in binary format using pulse code

More information

Example/ an analog signal f ( t) ) is sample by f s = 5000 Hz draw the sampling signal spectrum. Calculate min. sampling frequency.

Example/ an analog signal f ( t) ) is sample by f s = 5000 Hz draw the sampling signal spectrum. Calculate min. sampling frequency. 1 2 3 4 Example/ an analog signal f ( t) = 1+ cos(4000πt ) is sample by f s = 5000 Hz draw the sampling signal spectrum. Calculate min. sampling frequency. Sol/ H(f) -7KHz -5KHz -3KHz -2KHz 0 2KHz 3KHz

More information

Digital Transmission of Analog Data: PCM and Delta Modulation

Digital Transmission of Analog Data: PCM and Delta Modulation Digital Transmission of Analog Data: PCM and Delta Modulation Required reading: Garcia 3.3.2 and 3.3.3 CSE 323, Fall 200 Instructor: N. Vlajic Digital Transmission of Analog Data 2 Digitization process

More information

GSM/EDGE Output RF Spectrum on the V93000 Joe Kelly and Max Seminario, Verigy

GSM/EDGE Output RF Spectrum on the V93000 Joe Kelly and Max Seminario, Verigy GSM/EDGE Output RF Spectrum on the V93000 Joe Kelly and Max Seminario, Verigy Introduction A key transmitter measurement for GSM and EDGE is the Output RF Spectrum, or ORFS. The basis of this measurement

More information

Basic principles of Voice over IP

Basic principles of Voice over IP Basic principles of Voice over IP Dr. Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications and Multimedia Faculty of Electrical Engineering University of Žilina, Slovakia Outline VoIP Transmission

More information

SPEECH CODING: FUNDAMENTALS AND APPLICATIONS

SPEECH CODING: FUNDAMENTALS AND APPLICATIONS SPEECH CODING: FUNDAMENTALS AND APPLICATIONS 1. INTRODUCTION MARK HASEGAWA-JOHNSON University of Illinois at Urbana Champaign Urbana, Illinois ABEER ALWAN University of California at Los Angeles Los Angeles,

More information

Sampling and Interpolation. Yao Wang Polytechnic University, Brooklyn, NY11201

Sampling and Interpolation. Yao Wang Polytechnic University, Brooklyn, NY11201 Sampling and Interpolation Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Basics of sampling and quantization A/D and D/A converters Sampling Nyquist sampling theorem

More information

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper Understanding the Transition From PESQ to POLQA An Ascom Network Testing White Paper By Dr. Irina Cotanis Prepared by: Date: Document: Dr. Irina Cotanis 6 December 2011 NT11-22759, Rev. 1.0 Ascom (2011)

More information

Network Traffic #5. Traffic Characterization

Network Traffic #5. Traffic Characterization Network #5 Section 4.7.1, 5.7.2 1 Characterization Goals to: Understand the nature of what is transported over communications networks. Use that understanding to improve network design Characterization

More information

MODULATION Systems (part 1)

MODULATION Systems (part 1) Technologies and Services on Digital Broadcasting (8) MODULATION Systems (part ) "Technologies and Services of Digital Broadcasting" (in Japanese, ISBN4-339-62-2) is published by CORONA publishing co.,

More information

FAST MIR IN A SPARSE TRANSFORM DOMAIN

FAST MIR IN A SPARSE TRANSFORM DOMAIN ISMIR 28 Session 4c Automatic Music Analysis and Transcription FAST MIR IN A SPARSE TRANSFORM DOMAIN Emmanuel Ravelli Université Paris 6 ravelli@lam.jussieu.fr Gaël Richard TELECOM ParisTech gael.richard@enst.fr

More information

T = 1 f. Phase. Measure of relative position in time within a single period of a signal For a periodic signal f(t), phase is fractional part t p

T = 1 f. Phase. Measure of relative position in time within a single period of a signal For a periodic signal f(t), phase is fractional part t p Data Transmission Concepts and terminology Transmission terminology Transmission from transmitter to receiver goes over some transmission medium using electromagnetic waves Guided media. Waves are guided

More information

Workshop Perceptual Effects of Filtering and Masking Introduction to Filtering and Masking

Workshop Perceptual Effects of Filtering and Masking Introduction to Filtering and Masking Workshop Perceptual Effects of Filtering and Masking Introduction to Filtering and Masking The perception and correct identification of speech sounds as phonemes depends on the listener extracting various

More information

ELEC 4801 THESIS PROJECT

ELEC 4801 THESIS PROJECT School of Information Technology and Electrical Engineering ELEC 4801 THESIS PROJECT Thesis: Speech Compression Using Wavelets Name: Nikhil Rao Student No: 33710745 Supervisor: Dr John Homer Submitted

More information

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers Sung-won ark and Jose Trevino Texas A&M University-Kingsville, EE/CS Department, MSC 92, Kingsville, TX 78363 TEL (36) 593-2638, FAX

More information

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different

More information

Lecture 1-10: Spectrograms

Lecture 1-10: Spectrograms Lecture 1-10: Spectrograms Overview 1. Spectra of dynamic signals: like many real world signals, speech changes in quality with time. But so far the only spectral analysis we have performed has assumed

More information

The Effect of Network Cabling on Bit Error Rate Performance. By Paul Kish NORDX/CDT

The Effect of Network Cabling on Bit Error Rate Performance. By Paul Kish NORDX/CDT The Effect of Network Cabling on Bit Error Rate Performance By Paul Kish NORDX/CDT Table of Contents Introduction... 2 Probability of Causing Errors... 3 Noise Sources Contributing to Errors... 4 Bit Error

More information

http://www.springer.com/0-387-23402-0

http://www.springer.com/0-387-23402-0 http://www.springer.com/0-387-23402-0 Chapter 2 VISUAL DATA FORMATS 1. Image and Video Data Digital visual data is usually organised in rectangular arrays denoted as frames, the elements of these arrays

More information

Convention Paper 5553

Convention Paper 5553 Audio Engineering Society Convention Paper 5553 Presented at the 112th Convention 2 May 1 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN PESQ: An Introduction White Paper Prepared by: Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800 f: +44 (0) 1473 261 880 e: info@psytechnics.com September

More information

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically. Sampling Theorem We will show that a band limited signal can be reconstructed exactly from its discrete time samples. Recall: That a time sampled signal is like taking a snap shot or picture of signal

More information

Appendix C GSM System and Modulation Description

Appendix C GSM System and Modulation Description C1 Appendix C GSM System and Modulation Description C1. Parameters included in the modelling In the modelling the number of mobiles and their positioning with respect to the wired device needs to be taken

More information

Objective Speech Quality Measures for Internet Telephony

Objective Speech Quality Measures for Internet Telephony Objective Speech Quality Measures for Internet Telephony Timothy A. Hall National Institute of Standards and Technology 100 Bureau Drive, STOP 8920 Gaithersburg, MD 20899-8920 ABSTRACT Measuring voice

More information

Real-Time Audio Watermarking Based on Characteristics of PCM in Digital Instrument

Real-Time Audio Watermarking Based on Characteristics of PCM in Digital Instrument Journal of Information Hiding and Multimedia Signal Processing 21 ISSN 273-4212 Ubiquitous International Volume 1, Number 2, April 21 Real-Time Audio Watermarking Based on Characteristics of PCM in Digital

More information

David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2.

David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2. VoIP QoS Factors David Tipper Associate Professor Department of Information Science and Telecommunications University of Pittsburgh Slides 2 VoIP QoS Internet Telephone Quality of Service factors Voice

More information

Lecture 1-6: Noise and Filters

Lecture 1-6: Noise and Filters Lecture 1-6: Noise and Filters Overview 1. Periodic and Aperiodic Signals Review: by periodic signals, we mean signals that have a waveform shape that repeats. The time taken for the waveform to repeat

More information

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A International Journal of Science, Engineering and Technology Research (IJSETR), Volume, Issue, January SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A N.Rama Tej Nehru, B P.Sunitha

More information

TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS

TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS 1. Bandwidth: The bandwidth of a communication link, or in general any system, was loosely defined as the width of

More information

1. (Ungraded) A noiseless 2-kHz channel is sampled every 5 ms. What is the maximum data rate?

1. (Ungraded) A noiseless 2-kHz channel is sampled every 5 ms. What is the maximum data rate? Homework 2 Solution Guidelines CSC 401, Fall, 2011 1. (Ungraded) A noiseless 2-kHz channel is sampled every 5 ms. What is the maximum data rate? 1. In this problem, the channel being sampled gives us the

More information

Evolution from Voiceband to Broadband Internet Access

Evolution from Voiceband to Broadband Internet Access Evolution from Voiceband to Broadband Internet Access Murtaza Ali DSPS R&D Center Texas Instruments Abstract With the growth of Internet, demand for high bit rate Internet access is growing. Even though

More information

Image Compression through DCT and Huffman Coding Technique

Image Compression through DCT and Huffman Coding Technique International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul

More information

MUSICAL INSTRUMENT FAMILY CLASSIFICATION

MUSICAL INSTRUMENT FAMILY CLASSIFICATION MUSICAL INSTRUMENT FAMILY CLASSIFICATION Ricardo A. Garcia Media Lab, Massachusetts Institute of Technology 0 Ames Street Room E5-40, Cambridge, MA 039 USA PH: 67-53-0 FAX: 67-58-664 e-mail: rago @ media.

More information

Voice Encryption over GSM:

Voice Encryption over GSM: End-to to-end Voice Encryption over GSM: A Different Approach Wesley Tanner Nick Lane-Smith www. Keith Lareau About Us: Wesley Tanner - Systems Engineer for a Software-Defined Radio (SDRF) company - B.S.

More information

HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING. Eric Humphrey. School of Music Engineering Technology University of Miami

HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING. Eric Humphrey. School of Music Engineering Technology University of Miami HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING Eric Humphrey School of Music Engineering Technology University of Miami ABSTRACT An investigation of current audio watermarking technology is provided,

More information

How To Encode Data From A Signal To A Signal (Wired) To A Bitcode (Wired Or Coaxial)

How To Encode Data From A Signal To A Signal (Wired) To A Bitcode (Wired Or Coaxial) Physical Layer Part 2 Data Encoding Techniques Networks: Data Encoding 1 Analog and Digital Transmissions Figure 2-23.The use of both analog and digital transmissions for a computer to computer call. Conversion

More information

A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song

A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song , pp.347-354 http://dx.doi.org/10.14257/ijmue.2014.9.8.32 A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song Myeongsu Kang and Jong-Myon Kim School of Electrical Engineering,

More information

IBM Research Report. CSR: Speaker Recognition from Compressed VoIP Packet Stream

IBM Research Report. CSR: Speaker Recognition from Compressed VoIP Packet Stream RC23499 (W0501-090) January 19, 2005 Computer Science IBM Research Report CSR: Speaker Recognition from Compressed Packet Stream Charu Aggarwal, David Olshefski, Debanjan Saha, Zon-Yin Shae, Philip Yu

More information

Dream DRM Receiver Documentation

Dream DRM Receiver Documentation Dream DRM Receiver Documentation Dream is a software implementation of a Digital Radio Mondiale (DRM) receiver. All what is needed to receive DRM transmissions is a PC with a sound card and a modified

More information

JPEG Image Compression by Using DCT

JPEG Image Compression by Using DCT International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Issue-4 E-ISSN: 2347-2693 JPEG Image Compression by Using DCT Sarika P. Bagal 1* and Vishal B. Raskar 2 1*

More information

Information, Entropy, and Coding

Information, Entropy, and Coding Chapter 8 Information, Entropy, and Coding 8. The Need for Data Compression To motivate the material in this chapter, we first consider various data sources and some estimates for the amount of data associated

More information

Compression techniques

Compression techniques Compression techniques David Bařina February 22, 2013 David Bařina Compression techniques February 22, 2013 1 / 37 Contents 1 Terminology 2 Simple techniques 3 Entropy coding 4 Dictionary methods 5 Conclusion

More information

MICROPHONE SPECIFICATIONS EXPLAINED

MICROPHONE SPECIFICATIONS EXPLAINED Application Note AN-1112 MICROPHONE SPECIFICATIONS EXPLAINED INTRODUCTION A MEMS microphone IC is unique among InvenSense, Inc., products in that its input is an acoustic pressure wave. For this reason,

More information

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS TECHNICAL PAPER Enhanced Voice Services (EVS) Codec Until now, telephone services have generally failed to offer a high-quality audio experience due to limitations such as very low audio bandwidth and

More information

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP Department of Electrical and Computer Engineering Ben-Gurion University of the Negev LAB 1 - Introduction to USRP - 1-1 Introduction In this lab you will use software reconfigurable RF hardware from National

More information

Voice over IP Protocols And Compression Algorithms

Voice over IP Protocols And Compression Algorithms University of Tehran Electrical and Computer Engineering School SI Lab. Weekly Presentations Voice over IP Protocols And Compression Algorithms Presented by: Neda Kazemian Amiri Agenda Introduction to

More information

Voice Encoding Methods for Digital Wireless Communications Systems

Voice Encoding Methods for Digital Wireless Communications Systems SOUTHERN METHODIST UNIVERSITY Voice Encoding Methods for Digital Wireless Communications Systems BY Bryan Douglas Street address city state, zip e-mail address Student ID xxx-xx-xxxx EE6302 Section 324,

More information

NICE-RJCS Issue 2011 Evaluation of Potential Effectiveness of Desktop Remote Video Conferencing for Interactive Seminars Engr.

NICE-RJCS Issue 2011 Evaluation of Potential Effectiveness of Desktop Remote Video Conferencing for Interactive Seminars Engr. NICE-RJCS Issue 2011 Evaluation of Potential Effectiveness of Desktop Remote Video Conferencing for Interactive Seminars Engr. Faryal Zia Abstract This research paper discusses various aspects of desktop

More information