Introduction to MPEG-1/2 L2 Audio 尤信程國立台北科技大學資訊工程系

Size: px
Start display at page:

Download "Introduction to MPEG-1/2 L2 Audio 尤信程國立台北科技大學資訊工程系"

Transcription

1 Introduction to MPEG-1/2 L2 Audio 尤信程國立台北科技大學資訊工程系

2 Contents Intro to audio coding Psychoacoustics Audio encoding Frame structure Audio decoding Conclusions

3 Intro to audio coding (1) Audio coding uses MPEG-1/2 (part 3) Layer II, originally developed by Philips MPEG-1 1 Layer III audio is known as MP-3 3 (by FhG-IIS) Sampling rate: MPEG-1 1 = 48 ks/s, MPEG-2 2 = 24 ks/s Each audio frame encodes 1152 PCM samples per channel. Audio frame duration: MPEG-1 1 = 24 ms, MPEG-2 2 = 48 ms.

4 Intro to audio coding (2) DAB audio uses MPEG-1 1 Layer II & MPEG-2 LSF Layer II coding DVB-T T audio uses MPEG-1 1 Layer II DVB-T T audio can optionally use other audio coding technique such as MPEG-2 2 MC Layer II & Dolby AC-3 DAB also uses PAD for carrying data DVB-T T does not use PAD as it can transmit data via TS

5 Intro to audio coding (3) MPEG-1 1 audio for higher bitrate, MPEG-2 2 for lower bitrate (lower audio quality). Contrary to common believing Lower sampling freq for low bitrate coding: common practice in audio society MPEG-2 2 Multi-Channel (MC) extension is not currently used in Taiwan MPEG-4 4 HE-AAC could be a future coding scheme

6 Intro to audio coding (4) Features of MPEG-1/2 audio standard psychoacoustic model 32-subband filterbank Bit allocation table SCaleFactor Selection Info (SCFSI) Data grouping Bitrate range: 32 kbps ~ 384 kbps

7 Psychoacoustics (1) Brief intro to psychoacoustics Absolute hearing threshold Frequency masking (main factor) Temporal masking Layer I/II uses psychoacoustic model I Layer III uses psychoacoustic model II

8 Psychoacoustics (2) Absolute hearing threshold is the minimum level of sound human beings can hear Every one has (slightly) different threshold SPL (Sound Pressure Level) = 20 log (P/P( 0 ) db. P 0 = 2 x 10 5 N/m 2 SPL Freq

9 Psychoacoustics (3) Frequency masking: Human ears can be modeled as many bandpass filters. Different signals within the passband of one filter will interfere each other. Stronger one masks weaker one. SPL Freq

10 Psychoacoustics (4) Passband of a bandpass filter is known as one critical band. There are around 25 critical bands. Noise Masks Tone (NMT) ~ -66 db Tone Masks Noise (TMN) ~ -18 db Inter-critical critical-band masking also exists Frequency masking assumes stationary signals. Not correct for transient signals.

11 Psychoacoustics (5) Temporal masking Pre-masking (5 ms) Simultaneous masking Post-masking (200 ms) Pre-masking can be used to mask pre-echoes. echoes. In general, temporal masking is more difficult to use.

12 Audio encoding (1) Simple block diagram of MPEG-1/2 encoding. Digital Audio Input Filter Bank Bit allocation and scalefact or calculatio n Quantized Samples Bitstream Formatting Encoded Bitstream Signal to Mask Ratio Psychoacoustic Model

13 Audio encoding (2) Filter-bank, also called subband analysis, has 32 bandpass filters with the same proto-type. type. Let g(n) ) be the impulse response of proto-type. type. Filter i in the filter bank is obtained by (2i + 1)( n 16) π h i ( n) = g( n) cos( ) 64 Recall MPY in time domain is convolution in frequency domain. Center frequency is shifted according to cosine term.

14 Audio encoding (3) The proto-type type filter is a 512-coefficient FIR filter. Coefficients are in the spec. Since the BW of each BPF is (1/32) of full BW, decimate its output samples by 32. PCM input h 0 (n) 32 h 31 (n) 32 Subband output

15 Audio encoding (4) Number of bits to quantize a subband sample based on PSY model is recorded in Bit Allocation (BA). 3 X 12 = 36 samples from the same subband share one BA info. 12 samples of one subband = 1 part. The amplification ratio of a subband sample in encoding is called SCaleFactor (SCF). It s shared by 12 subband samples.

16 Audio encoding (5) It s s possible to use one, two, or three SCF s s for 36 output samples of one subband. This info is called SCaleFactor Selection Information (SCFSI). Four different cases identified: Three different SCF s First-two two parts share one SCF Last-two two parts share one SCF All three parts share one SCF

17 Audio encoding (6) 1152 PCM samples per ch is packed in one audio frame. Subband samples in bitstream: 3 samples from L ch of SB 0 3 samples form R ch of SB 0 Same for SB 1 till SB 31 Repeat the above 12 times Remember SCF scope is 12 samples from the same SB.

18 Audio encoding (7) Quantized subband samples may be grouped together into one codeword. The following quantization levels use grouping in packing subband samples. 3 levels: use 5 bits 5 levels: use 7 bits 9 levels: use 10 bits

19 Audio frame structure (1) The audio frame in MPEG-1/2 has the following fields: header, CRC, audio data, and ancillary. Audio data has BA field, SCFSI field, SCF field, and sb sample field. DAB uses ancillary to pack F-PAD F (Fixed Program Associated Data), X-PAD X (Extended PAD), and SCF-CRC. CRC.

20 Audio frame structure (2) CRC field in MPEG-1/2 is optional, but is mandatory in DAB. Header CRC Audio data Ancillary BA SCFSCI SCF Subband samples Stuffing X-PAD SCF CRC F-PAD

21 Audio frame structure (3) SCF CRC = CRC for next frame s s SCF. F-PAD contains the following Dynamic Range Control (DRC) X-PAD info (no, short, long) Music/Speech flag ISRC (Int l l Standard Recording Code)

22 Audio frame structure (4) X-PAD has many application types, including label data ITTS (Interactive Text Transmit System) MOT (Multimedia Object Transfer protocol) etc.

23 Audio frame structure (5) Frame header has 32 bits, arranged as follows: AAAAAAAA AAAABCCD EEEEFFGH IIJJKLMM A = sync word = B = ID. 0 = MEPG-2, 1 = MPEG-1 C = layer ID. 10 = Layer 2. (DAB uses it only) D = protection bit. Use 0 (with CRC) for DAB. E = bitrate index. From 32 kbps ~ 384 kbps. F = Sampling frequency. 01 = 48/24 khz. G = padding = 0. No padding required H = private bit. Not used.

24 Audio frame structure (6) AAAAAAAA AAAABCCD EEEEFFGH IIJJKLMM. I = mode. 00 = stereo, 01 = joint stereo, 10 = dual channel, 11 = mono. J = mode extension. Used only if mode = = 4, 01 = 8, 10 = 11 = K = copyright. 0 = no, 1 = yes L = original/copy. 0 = copy, 1 = org. M = emphasis. Use 00 = no emphasis.

25 Audio Decoding (1) Sync frame header by finding Parse frame header to know the coding mode. Bitrate info can also be obtained from FIC. Use assigned BA table to parse BA. Parse SCFSI. No SCFSI for BA field of a SB = 0 Parse SCF based on SCFSI and BA. Parse SB samples. Watch out grouping codeword.

26 Audio Decoding (2) Check if frame CRC and SCF CRC are OK. Do error concealment if necessary. Re-quantize SB samples. Perform subband synthesis via better algorithms than ISO s. Synchronize CODEC s s Fs if necessary.

27 Audio Decoding (3)

28 Audio Decoding (4) Brief introduction to de-grouping and re- quantize the SB samples. 1. De-grouping: for i = 0 to 2 S[i] = c % nlevel c = c div nlevel 2. S S = Invert MSB of S. S S is 2 s-2 complement fractional number. 3. S = = C * (S + D). C & D in tables of spec. 4. S S = S S * SCF

29 Audio Decoding (5) Subband synthesis from ISO s s spec: Input 32 new subband samples, S i, i = 0,.., 31 Shifting: for i =1023 down to 64 do V[i] = V[i-64] Matrixing: : for i = 0 to 63 do + + = 31 (16 i)(2k 1) π V[i] cos k= 0 64 S k

30 Audio Decoding (6) Build a 512 vector U: for i = 0 to 7 do for j = 0 to 31 do U[i*64+j] = V[i*128+j] U[i*64+32+j] = V[i* j] Window by 512-coefficient matrix D: for i = 0 to 511 do W[i] = U[i] * D[i]

31 Audio Decoding (7) Calculate 32 PCM samples: for j = 0 to 31 do s j = 15 i= 0 W[j + 32i] Output 32 reconstructed PCM samples s j.

32 Audio Decoding (8) Subband synthesis algorithm mentioned previously is slow We can avoid data shifting by using circular queue. Matrixing can be more efficient if DCT is in use. Cf. IEEE SPL, vol. 1, no. 2, pp , A fully optimized code can be at least five times faster!

33 Conclusions Introduction to audio coding Brief intro to psychoacoustics Audio encoding flow Audio frame structure Audio decoding, including subband synthesis

Digital Audio Compression: Why, What, and How

Digital Audio Compression: Why, What, and How Digital Audio Compression: Why, What, and How An Absurdly Short Course Jeff Bier Berkeley Design Technology, Inc. 2000 BDTI 1 Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions

More information

The Theory Behind Mp3

The Theory Behind Mp3 The Theory Behind Mp3 Rassol Raissi December 2002 Abstract Since the MPEG-1 Layer III encoding technology is nowadays widely used it might be interesting to gain knowledge of how this powerful compression/decompression

More information

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream Signal to Mask Ratio

More information

AUDIO CODING: BASICS AND STATE OF THE ART

AUDIO CODING: BASICS AND STATE OF THE ART AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603

More information

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il

More information

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton Abstract This paper intends to give the reader some insight into the workings

More information

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder.

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder. A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER Manoj Kumar 1 Mohammad Zubair 1 1 IBM T.J. Watson Research Center, Yorktown Hgts, NY, USA ABSTRACT The MPEG/Audio is a standard for both

More information

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final

More information

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN Zheng Lai Zhao Liu Meng Li Quan Yuan zl2215@columbia.edu zl2211@columbia.edu ml3088@columbia.edu qy2123@columbia.edu I. Overview Architecture The purpose

More information

Audio Coding, Psycho- Accoustic model and MP3

Audio Coding, Psycho- Accoustic model and MP3 INF5081: Multimedia Coding and Applications Audio Coding, Psycho- Accoustic model and MP3, NR Torbjørn Ekman, Ifi Nils Christophersen, Ifi Sverre Holm, Ifi What is Sound? Sound waves: 20Hz - 20kHz Speed:

More information

Audio Coding Algorithm for One-Segment Broadcasting

Audio Coding Algorithm for One-Segment Broadcasting Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient

More information

MP3 AND AAC EXPLAINED

MP3 AND AAC EXPLAINED MP3 AND AAC EXPLAINED KARLHEINZ BRANDENBURG ½ ½ Fraunhofer Institute for Integrated Circuits FhG-IIS A, Erlangen, Germany bdg@iis.fhg.de The last years have shown widespread proliferation of.mp3-files,

More information

Digital terrestrial television broadcasting Audio coding

Digital terrestrial television broadcasting Audio coding Digital terrestrial television broadcasting Audio coding Televisão digital terrestre Codificação de vídeo, áudio e multiplexação Parte 2: Codificação de áudio Televisión digital terrestre Codificación

More information

For Articulation Purpose Only

For Articulation Purpose Only E305 Digital Audio and Video (4 Modular Credits) This document addresses the content related abilities, with reference to the module. Abilities of thinking, learning, problem solving, team work, communication,

More information

Audio Coding Introduction

Audio Coding Introduction Audio Coding Introduction Lecture WS 2013/2014 Prof. Dr.-Ing. Karlheinz Brandenburg bdg@idmt.fraunhofer.de Prof. Dr.-Ing. Gerald Schuller shl@idmt.fraunhofer.de Page Nr. 1 Organisatorial Details - Overview

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter An Optimised Software Solution for an ARM Powered TM MP3 Decoder By Barney Wragg and Paul Carpenter Abstract The market predictions for MP3-based appliances are extremely positive. The ability to maintain

More information

ETSI TS 102 563 V1.1.1 (2007-02)

ETSI TS 102 563 V1.1.1 (2007-02) TS 102 563 V1.1.1 (2007-02) Technical Specification Digital Audio Broadcasting (DAB); Transport of Advanced Audio Coding (AAC) audio uropean Broadcasting Union Union uropéenne de Radio-Télévision BU UR

More information

Datasheet EdgeVision

Datasheet EdgeVision Datasheet Multichannel Quality of Experience Monitoring Stay in control with customizable monitoring and interfaces. offers richly featured, Quality of Experience (QoE) monitoring across an entire network

More information

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Quality Estimation for Scalable Video Codec Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Purpose of scalable video coding Multiple video streams are needed for heterogeneous

More information

DAB + The additional audio codec in DAB

DAB + The additional audio codec in DAB DAB + The additional audio codec in DAB 2007 Contents Why DAB + Features of DAB + Possible scenarios with DAB + Comparison of DAB + and DMB for radio services Performance of DAB + Status of standardisation

More information

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.)

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.) VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.) 1 Remember first the big picture VoIP network architecture and some terminologies Voice coders 2 Audio and voice quality measuring

More information

White Paper: An Overview of the Coherent Acoustics Coding System

White Paper: An Overview of the Coherent Acoustics Coding System White Paper: An Overview of the Coherent Acoustics Coding System Mike Smyth June 1999 Introduction Coherent Acoustics is a digital audio compression algorithm designed for both professional and consumer

More information

RECOMMENDATION ITU-R BO.786 *

RECOMMENDATION ITU-R BO.786 * Rec. ITU-R BO.786 RECOMMENDATION ITU-R BO.786 * MUSE ** system for HDTV broadcasting-satellite services (Question ITU-R /) (992) The ITU Radiocommunication Assembly, considering a) that the MUSE system

More information

MPEG Layer-3. An introduction to. 1. Introduction

MPEG Layer-3. An introduction to. 1. Introduction An introduction to MPEG Layer-3 MPEG Layer-3 K. Brandenburg and H. Popp Fraunhofer Institut für Integrierte Schaltungen (IIS) MPEG Layer-3, otherwise known as MP3, has generated a phenomenal interest among

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

4 Digital Video Signal According to ITU-BT.R.601 (CCIR 601) 43

4 Digital Video Signal According to ITU-BT.R.601 (CCIR 601) 43 Table of Contents 1 Introduction 1 2 Analog Television 7 3 The MPEG Data Stream 11 3.1 The Packetized Elementary Stream (PES) 13 3.2 The MPEG-2 Transport Stream Packet.. 17 3.3 Information for the Receiver

More information

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29. Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet

More information

Alarms of Stream MultiScreen monitoring system

Alarms of Stream MultiScreen monitoring system STREAM LABS Alarms of Stream MultiScreen monitoring system Version 1.0, June 2013. Version history Version Author Comments 1.0 Krupkin V. Initial version of document. Alarms for MPEG2 TS, RTMP, HLS, MMS,

More information

Ogg Vorbis Audio Decoder Jon Stritar and Matt Papi 6.111 December 14, 2005

Ogg Vorbis Audio Decoder Jon Stritar and Matt Papi 6.111 December 14, 2005 Ogg Vorbis Audio Decoder Jon Stritar and Matt Papi 6.111 December 14, 2005 Abstract The goal of this project was to design and implement an Ogg Vorbis decoder in hardware. Ogg Vorbis is a highly dynamic

More information

MP3/mp3PRO plug-in. How you can make an audio CD from mp3 or mp3pro files

MP3/mp3PRO plug-in. How you can make an audio CD from mp3 or mp3pro files MP3/mp3PRO plug-in How you can make an audio CD from mp3 or mp3pro files...1 The mp3pro encoder...2 How you can make your own mp3pro files with Nero...3 How you can make your own MP3 files with Nero...12

More information

APPLICATION BULLETIN AAC Transport Formats

APPLICATION BULLETIN AAC Transport Formats F RA U N H O F E R I N S T I T U T E F O R I N T E G R A T E D C I R C U I T S I I S APPLICATION BULLETIN AAC Transport Formats INITIAL RELEASE V. 1.0 2 18 1 AAC Transport Protocols and File Formats As

More information

Dream DRM Receiver Documentation

Dream DRM Receiver Documentation Dream DRM Receiver Documentation Dream is a software implementation of a Digital Radio Mondiale (DRM) receiver. All what is needed to receive DRM transmissions is a PC with a sound card and a modified

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6 Page 1 of 6 1. Scope. This operational practice sets out the requirements for downmixing 5.1 and 5.0 channel surround sound audio mixes to 2 channel stereo. This operational practice recommends a number

More information

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques Vineela Behara,Y Ramesh Department of Computer Science and Engineering Aditya institute of Technology and

More information

MINIMUM SPECIFICATIONS FOR DAB AND DAB+ IN-VEHICLE DIGITAL RADIO RECEIVERS AND ADAPTORS

MINIMUM SPECIFICATIONS FOR DAB AND DAB+ IN-VEHICLE DIGITAL RADIO RECEIVERS AND ADAPTORS Department for Culture, Media and Sport 1 MINIMUM SPECIFICATIONS FOR DAB AND DAB+ IN-VEHICLE DIGITAL RADIO RECEIVERS AND ADAPTORS Digital Radio Action Plan Report Published February 2013 Department for

More information

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services ARIB STD-T-C.S00 v.0 Circuit-Switched Video Conferencing Services Refer to "Industrial Property Rights (IPR)" in the preface of ARIB STD-T for Related Industrial Property Rights. Refer to "Notice" in the

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Version ECE IIT, Kharagpur Lesson H. andh.3 Standards Version ECE IIT, Kharagpur Lesson Objectives At the end of this lesson the students should be able to :. State the

More information

Chapter 6: Broadcast Systems. Mobile Communications. Unidirectional distribution systems DVB DAB. High-speed Internet. architecture Container

Chapter 6: Broadcast Systems. Mobile Communications. Unidirectional distribution systems DVB DAB. High-speed Internet. architecture Container Mobile Communications Chapter 6: Broadcast Systems Unidirectional distribution systems DAB DVB architecture Container High-speed Internet Prof. Dr.-Ing. Jochen Schiller, http://www.jochenschiller.de/ MC

More information

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional

More information

RFS-805. Digital Modulator AV to COFDM. User Manual

RFS-805. Digital Modulator AV to COFDM. User Manual RFS-805 Digital Modulator AV to COFDM User Manual 1. Purpose of use RFS-805 is a digital modulator designed for a processing audio and video signals into COFDM (DVB-T) multiplex. 2. Installation The connections

More information

Multimedia Communications

Multimedia Communications Multimedia Communications Dr. Ing. Audio Processing and Coding MMC Overview 1. Introduction 2. Fundamentals (Signal Processing, Information Theorie) 3. Speech Processing & Coding 4. Audio Processing &

More information

LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING

LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING RasPi Kaveri Ratanpara 1, Priyan Shah 2 1 Student, M.E Biomedical Engineering, Government Engineering college, Sector-28, Gandhinagar (Gujarat)-382028,

More information

Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem

Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem A C ombining Moseley s unparalleled reputation for high quality RF aural Studio-Transmitter Links (STLs) with the performance and speed of today s

More information

Estimation of Loudness by Zwicker's Method

Estimation of Loudness by Zwicker's Method Estimation of Loudness by Zwicker's Method Loudness is one category in the list of human perceptions of sound. There are many methods of estimating Loudness using objective measurements. No method is perfect.

More information

A Framework for Robust and Scalable Audio Streaming

A Framework for Robust and Scalable Audio Streaming A Framework for Robust and Scalable Audio Streaming Ye Wang, Wendong Huang, Jari Korhonen School of Computing, National University of Singapore {wangye, huangwd, jari}@comp.nus.edu.sg ABSTRACT We propose

More information

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 4, JULY 2003 365 High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform Dai Yang, Member, IEEE, Hongmei Ai, Member, IEEE, Chris

More information

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically. Sampling Theorem We will show that a band limited signal can be reconstructed exactly from its discrete time samples. Recall: That a time sampled signal is like taking a snap shot or picture of signal

More information

Digital Audio Compression

Digital Audio Compression By Davis Yen Pan Abstract Compared to most digital data types, with the exception of digital video, the data rates associated with uncompressed digital audio are substantial. Digital audio compression

More information

(51) Int Cl.: H04S 3/00 (2006.01)

(51) Int Cl.: H04S 3/00 (2006.01) (19) TEPZZ_9 7 66B_T (11) EP 1 927 266 B1 (12) EUROPEAN PATENT SPECIFICATION (4) Date of publication and mention of the grant of the patent: 14.0.14 Bulletin 14/ (21) Application number: 0679846.2 (22)

More information

MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND

MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND Version 1.0 Prepared by: Technical Subgroup of Digital Sound Broadcasting Committee National

More information

Real-Time Audio Watermarking Based on Characteristics of PCM in Digital Instrument

Real-Time Audio Watermarking Based on Characteristics of PCM in Digital Instrument Journal of Information Hiding and Multimedia Signal Processing 21 ISSN 273-4212 Ubiquitous International Volume 1, Number 2, April 21 Real-Time Audio Watermarking Based on Characteristics of PCM in Digital

More information

Image Compression through DCT and Huffman Coding Technique

Image Compression through DCT and Huffman Coding Technique International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul

More information

Lab Exercise 802.11. Objective. Requirements. Step 1: Fetch a Trace

Lab Exercise 802.11. Objective. Requirements. Step 1: Fetch a Trace Lab Exercise 802.11 Objective To explore the physical layer, link layer, and management functions of 802.11. It is widely used to wireless connect mobile devices to the Internet, and covered in 4.4 of

More information

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- sumedha.phatak@mavs.uta.edu Objective: A study, implementation and comparison of

More information

Broadcasting your attack: Security testing DAB radio in cars

Broadcasting your attack: Security testing DAB radio in cars Broadcasting your attack: Security testing DAB radio in cars Andy Davis, Research Director Image: computerworld.com.au Agenda Who am I and why am I interested in security testing DAB? Overview of DAB How

More information

All About Audio Metadata. The three Ds: dialogue level, dynamic range control, and downmixing

All About Audio Metadata. The three Ds: dialogue level, dynamic range control, and downmixing Information All About Audio Metadata Metadata, the data about the audio data that travels along with the multichannel audio bitstream in Dolby Digital, makes life easier for broadcasters while also increasing

More information

Basic principles of Voice over IP

Basic principles of Voice over IP Basic principles of Voice over IP Dr. Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications and Multimedia Faculty of Electrical Engineering University of Žilina, Slovakia Outline VoIP Transmission

More information

73M2901CE Programming the Imprecise Call Progress Monitor Filter

73M2901CE Programming the Imprecise Call Progress Monitor Filter A Maxim Integrated Products Brand 73M2901CE Programming the Imprecise Call Progress Monitor Filter APPLICATION NOTE AN_2901CE_042 March 2009 Introduction The Teridian 73M2901CE integrated circuit modem

More information

Video Encoding Best Practices

Video Encoding Best Practices Video Encoding Best Practices SAFARI Montage Creation Station and Managed Home Access Introduction This document provides recommended settings and instructions to prepare user-created video for use with

More information

Case Study: Real-Time Video Quality Monitoring Explored

Case Study: Real-Time Video Quality Monitoring Explored 1566 La Pradera Dr Campbell, CA 95008 www.videoclarity.com 408-379-6952 Case Study: Real-Time Video Quality Monitoring Explored Bill Reckwerdt, CTO Video Clarity, Inc. Version 1.0 A Video Clarity Case

More information

FAST MIR IN A SPARSE TRANSFORM DOMAIN

FAST MIR IN A SPARSE TRANSFORM DOMAIN ISMIR 28 Session 4c Automatic Music Analysis and Transcription FAST MIR IN A SPARSE TRANSFORM DOMAIN Emmanuel Ravelli Université Paris 6 ravelli@lam.jussieu.fr Gaël Richard TELECOM ParisTech gael.richard@enst.fr

More information

DENSITÉ SERIES XVP-3901

DENSITÉ SERIES XVP-3901 up down cross Simultaneous 3G/HD & SD outputs audio fiber Rethink what s possible with one card DENSITÉ SERIES XVP-3901 3Gbps/hd/sd UNIVERSAL video & audio processor DENSITÉ SERIES XVP-3901 3Gbps/HD/SD

More information

Preservation Handbook

Preservation Handbook Preservation Handbook Digital Audio Author Gareth Knight & John McHugh Version 1 Date 25 July 2005 Change History Page 1 of 8 Definition Sound in its original state is a series of air vibrations (compressions

More information

Introduction to Medical Image Compression Using Wavelet Transform

Introduction to Medical Image Compression Using Wavelet Transform National Taiwan University Graduate Institute of Communication Engineering Time Frequency Analysis and Wavelet Transform Term Paper Introduction to Medical Image Compression Using Wavelet Transform 李 自

More information

AN3998 Application note

AN3998 Application note Application note PDM audio software decoding on STM32 microcontrollers 1 Introduction This application note presents the algorithms and architecture of an optimized software implementation for PDM signal

More information

The AAC audio Coding Family For

The AAC audio Coding Family For White PapER The AAC audio Coding Family For Broadcast and Cable TV Over the last few years, the AAC audio codec family has played an increasingly important role as an enabling technology for state-of-the-art

More information

Digital Audio and Video Data

Digital Audio and Video Data Multimedia Networking Reading: Sections 3.1.2, 3.3, 4.5, and 6.5 CS-375: Computer Networks Dr. Thomas C. Bressoud 1 Digital Audio and Video Data 2 Challenges for Media Streaming Large volume of data Each

More information

HD Radio FM Transmission System Specifications Rev. F August 24, 2011

HD Radio FM Transmission System Specifications Rev. F August 24, 2011 HD Radio FM Transmission System Specifications Rev. F August 24, 2011 SY_SSS_1026s TRADEMARKS HD Radio and the HD, HD Radio, and Arc logos are proprietary trademarks of ibiquity Digital Corporation. ibiquity,

More information

PCM Encoding and Decoding:

PCM Encoding and Decoding: PCM Encoding and Decoding: Aim: Introduction to PCM encoding and decoding. Introduction: PCM Encoding: The input to the PCM ENCODER module is an analog message. This must be constrained to a defined bandwidth

More information

Technical Paper. Dolby Digital Plus Audio Coding

Technical Paper. Dolby Digital Plus Audio Coding Technical Paper Dolby Digital Plus Audio Coding Dolby Digital Plus is an advanced, more capable digital audio codec based on the Dolby Digital (AC-3) system that was introduced first for use on 35 mm theatrical

More information

Study and Implementation of Video Compression standards (H.264/AVC, Dirac)

Study and Implementation of Video Compression standards (H.264/AVC, Dirac) Study and Implementation of Video Compression standards (H.264/AVC, Dirac) EE 5359-Multimedia Processing- Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) Objective A study, implementation and comparison

More information

Audio and Video Synchronization:

Audio and Video Synchronization: White Paper Audio and Video Synchronization: Defining the Problem and Implementing Solutions Linear Acoustic Inc. www.linearacaoustic.com 2004 Linear Acoustic Inc Rev. 1. Introduction With the introduction

More information

A Review of Algorithms for Perceptual Coding of Digital Audio Signals

A Review of Algorithms for Perceptual Coding of Digital Audio Signals A Review of Algorithms for Perceptual Coding of Digital Audio Signals Ted Painter, Student Member IEEE, and Andreas Spanias, Senior Member IEEE Department of Electrical Engineering, Telecommunications

More information

Image Authentication Scheme using Digital Signature and Digital Watermarking

Image Authentication Scheme using Digital Signature and Digital Watermarking www..org 59 Image Authentication Scheme using Digital Signature and Digital Watermarking Seyed Mohammad Mousavi Industrial Management Institute, Tehran, Iran Abstract Usual digital signature schemes for

More information

Digital Transmission of Analog Data: PCM and Delta Modulation

Digital Transmission of Analog Data: PCM and Delta Modulation Digital Transmission of Analog Data: PCM and Delta Modulation Required reading: Garcia 3.3.2 and 3.3.3 CSE 323, Fall 200 Instructor: N. Vlajic Digital Transmission of Analog Data 2 Digitization process

More information

For version 3.7.12p (September 4, 2012)

For version 3.7.12p (September 4, 2012) Zephyr Xstream INSTALLATION For version 3.7.12p (September 4, 2012) The following information applies to Zephyr Xstream units currently running a version ending in p or i. If your Xstream is running software

More information

Convention Paper 5553

Convention Paper 5553 Audio Engineering Society Convention Paper 5553 Presented at the 112th Convention 2 May 1 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

Digital Radio Certification Mark

Digital Radio Certification Mark Digital Radio Certification Mark Minimum requirements for domestic and in-vehicle digital radio receivers Test specifications for technologies and products Version 1.0r2 Changes to this document A process

More information

HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING. Eric Humphrey. School of Music Engineering Technology University of Miami

HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING. Eric Humphrey. School of Music Engineering Technology University of Miami HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING Eric Humphrey School of Music Engineering Technology University of Miami ABSTRACT An investigation of current audio watermarking technology is provided,

More information

Overview ISDB-T for sound broadcasting Terrestrial Digital Radio in Japan. Shunji NAKAHARA. NHK (Japan Broadcasting Corporation)

Overview ISDB-T for sound broadcasting Terrestrial Digital Radio in Japan. Shunji NAKAHARA. NHK (Japan Broadcasting Corporation) Overview ISDB-T for sound broadcasting Terrestrial Digital Radio in Japan Shunji NAKAHARA NHK (Japan Broadcasting Corporation) 2003/11/04 1 Contents Features of ISDB-T SB system Current status of digital

More information

DVB-T BER MEASUREMENTS IN THE PRESENCE OF ADJACENT CHANNEL AND CO-CHANNEL ANALOGUE TELEVISION INTERFERENCE

DVB-T BER MEASUREMENTS IN THE PRESENCE OF ADJACENT CHANNEL AND CO-CHANNEL ANALOGUE TELEVISION INTERFERENCE DVB-T MEASUREMENTS IN THE PRESENCE OF ADJACENT CHANNEL AND CO-CHANNEL ANALOGUE TELEVISION INTERFERENCE M. Mª Vélez (jtpveelm@bi.ehu.es), P. Angueira, D. de la Vega, A. Arrinda, J. L. Ordiales UNIVERSITY

More information

Classes of multimedia Applications

Classes of multimedia Applications Classes of multimedia Applications Streaming Stored Audio and Video Streaming Live Audio and Video Real-Time Interactive Audio and Video Others Class: Streaming Stored Audio and Video The multimedia content

More information

Red Bee Media. Technical file and tape delivery specification for Commercials. Applies to all channels transmitted by Red Bee Media

Red Bee Media. Technical file and tape delivery specification for Commercials. Applies to all channels transmitted by Red Bee Media Red Bee Media Technical file and tape delivery specification for Commercials Applies to all channels transmitted by Red Bee Media Red Bee Media Rev. November 2010 Introduction Commercial copy is delivered

More information

How To Make A Multi-User Communication Efficient

How To Make A Multi-User Communication Efficient Multiple Access Techniques PROF. MICHAEL TSAI 2011/12/8 Multiple Access Scheme Allow many users to share simultaneously a finite amount of radio spectrum Need to be done without severe degradation of the

More information

2K Processor AJ-HDP2000

2K Processor AJ-HDP2000 Jan.31, 2007 2K Processor AJ-HDP2000 Technical Overview Version 2.0 January 31, 2007 Professional AV Systems Business Unit Panasonic AVC Networks Company Panasonic Broadcast & Television Systems Company

More information

CHAPTER 8 MULTIPLEXING

CHAPTER 8 MULTIPLEXING CHAPTER MULTIPLEXING 3 ANSWERS TO QUESTIONS.1 Multiplexing is cost-effective because the higher the data rate, the more cost-effective the transmission facility.. Interference is avoided under frequency

More information

Figure 1: Relation between codec, data containers and compression algorithms.

Figure 1: Relation between codec, data containers and compression algorithms. Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order

More information

Understanding CIC Compensation Filters

Understanding CIC Compensation Filters Understanding CIC Compensation Filters April 2007, ver. 1.0 Application Note 455 Introduction f The cascaded integrator-comb (CIC) filter is a class of hardware-efficient linear phase finite impulse response

More information

Combating Anti-forensics of Jpeg Compression

Combating Anti-forensics of Jpeg Compression IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 6, No 3, November 212 ISSN (Online): 1694-814 www.ijcsi.org 454 Combating Anti-forensics of Jpeg Compression Zhenxing Qian 1, Xinpeng

More information

UNIFORM POLYPHASE FILTER BANKS FOR USE IN HEARING AIDS: DESIGN AND CONSTRAINTS

UNIFORM POLYPHASE FILTER BANKS FOR USE IN HEARING AIDS: DESIGN AND CONSTRAINTS 6th European Signal Processing Conference (EUSIPCO 28), Lausanne, Switzerland, August 25-29, 28, copyright by EURASIP UNIFORM POLYPHASE FILTER BANKS FOR USE IN HEARING AIDS: DESIGN AND CONSTRAINTS Robert

More information

Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31

Implementing an In-Service, Non- Intrusive Measurement Device in Telecommunication Networks Using the TMS320C31 Disclaimer: This document was part of the First European DSP Education and Research Conference. It may have been written by someone whose native language is not English. TI assumes no liability for the

More information

LoRa FAQs. www.semtech.com 1 of 4 Semtech. Semtech Corporation LoRa FAQ

LoRa FAQs. www.semtech.com 1 of 4 Semtech. Semtech Corporation LoRa FAQ LoRa FAQs 1.) What is LoRa Modulation? LoRa (Long Range) is a modulation technique that provides significantly longer range than competing technologies. The modulation is based on spread-spectrum techniques

More information

CBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC

CBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC CBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC 1. INTRODUCTION The CBS Records CD-1 Test Disc is a highly accurate signal source specifically designed for those interested in making

More information

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral GSM speech coding Forelesning INF 5080 Vårsemester 2004 Sources This part contains material from: Web pages Universität Bremen, Arbeitsbereich Nachrichtentechnik (ANT): Prof.K.D. Kammeyer, Jörg Bitzer,

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information