Multimedia Communications

Size: px
Start display at page:

Download "Multimedia Communications"

Transcription

1 Multimedia Communications Dr. Ing. Audio Processing and Coding

2 MMC Overview 1. Introduction 2. Fundamentals (Signal Processing, Information Theorie) 3. Speech Processing & Coding 4. Audio Processing & Coding 5. Still Image Coding (JPEG, etc.) 6. Video Coding (MPEG, etc.) 7. MPEG-4 Multimedia Framework, MPEG D Video and Free Viewpoint Video Properties of audition Audio Overview Motivation, requirements, parameters, quality of audio coding PCM parameters of audio coding MPEG-4 Audio Lossless Coding (TU-Berlin, Liebchen/Noll) Standard Audio coding: MPEG-1 (MP3), MPEG-2/4 AAC Multi channel audio (e.g. 5.1 systems), 3D audio (MPEG-4)

3 Materials Liebchen/Noll, FG, NUE, TU-Berlin, Lossless Audio Coding: berlin.de/menue/forschung/projekte/beendete_projekte/mpeg- 4_audio_lossless_coding_als/ K.H. Brandenburg, MP3 and AAC Explained, AES 17th International Conference on High Quality Audio Coding Book M. Bossi, R.E. Goldberg, Introduction to Digital Audio Coding and Standards, Kluwer Acad. Publishers, Examples MP3 Audio Coding Audio\MP3\Karma_Police.wav Audio\MP3\128_Karma_Police.mp3 Audio\MP3\64_Karma_Police.mp3 Audio\MP3\32_Karma_Police.mp3 Audio\MP3\20_Karma_Police.mp3 Original 128 kbit/s 64 kbit/s 32 kbit/s 20 kbit/s

4 Term Audio audire = to hear Audio = everything related to the human aural sense Also used for technical equipment Sound = oscillations of air pressure Ear = Measuring instrument for oscillations of air pressure Physiology of Audition Ossicle Balance organ Acoustic nerve Snail Auditory canal Eardrum Eustachian tube

5 Physiology of Audition External ear: Auricle (ear conch), auditory canal Resonator Amplification, direction Physiology of Audition Middle ear: Eardrum Ossicle: Amplification (leverage, area transformation) Protection, blocking of ossicle by a muscle, but: reaction time ms Eustachian tube to mouth, pressure compensation

6 Physiology of Audition Cochlea, 32mm long, 2 ½ windings Filled with fluid 3 canals separated by membranes On the Basilar membrane: hair cells (Corti organ) Physiology of Audition

7 Physiology of Audition Physiology of Audition

8 Process of hearing: Eardrum is set into oscillation by sound Motion of ossicle Transmission to the inner Cochlea A fluid wave emerges in the Cochlea Hair cells are sheared Physiology of Audition Position of maximum displacement of membrane determines frequency (position frequency transform) Stimulus is transmitted via acoustic nerve to brain Sound Intensity Level Sound intensity is measured as sound intensity level L 20 log p p 0 db Reference pressure: auditory threshold at 1 khz p 20 0 Pa Logarithmic scale for adaptation to characteristics of audition, (doubling of pressure = 6 db increase of sound intensity level)

9 Parameters of Audition Frequency range: Hz Parameters of Audition Dynamic range: 120 db Frequency resolution: 850 tones sounds Ear performs a short term spectral anaysis, perception of frequencies Phase difference is mainly important for sound directions

10 Masking by White Noise Masking by Narrow Band limited Noise inaudible audible

11 Masking by Narrow Band limited Noise Masking by Harmonic Sound

12 Relative Perception of Tones Relative Perception of Tones Tones that are close together in frequency are not perceived separatly Minimum distance of two tones to be perceived separately is dependent on absolute frequency: The higher the absolute frequency, the larger the minimum distance

13 Relative Perception of Tones Audition works a band pass filter bank (third octave) Separation of audible frequency range in 25 frequency groups (each a third octave) Usage e.g. for coding (sub band coding in mp3) Temporal Masking

14 Requirements on Audio Coding High Quality Lossless Transparent Acceptable quality Robustness against channel errors Low complexity, power consumption Low delay Graceful Degradation Editing capability (for professional applications) Quality Parameters of Audio Coding Distortion, signal to noise ratio (SNR, MOS) Frequency range ( Hz) Dynamic range ( db)

15 Subjective Quality Measures MOS: Mean opinion score 5 Imperceptible 4 Perceptible, but not annoying 3 Slightly annoying 2 Annoying 1 Very annoying Subjective Tests many participants (experts or average?) Representative test material (average, challenging material?) Test conditions (speakers, room acoustics, distraction) PCM Coding Mono signals Bereich [Hz] Abtastrate [khz] Bit / Wert Bitrate [kbit/s] Telefon Sprache Breitband Audio Schmalband Audio Breitband

16 Redundancy for error protection Stereo 44,1 khz PCM Coding Gerät Rate [Mbit/s] Redundanz [Mbit/s] Gesamtrate [Mbit/s] Compact Disc (CD) 1,41 2,91 4,32 Digital Audio Tape (DAT) 1,41 1,67 3,08 PCM Coding Further parameters, e.g. for professional applications (studios) Sampling rates: 94 khz, 192 khz better filtering security range for effects, mixing, Resolution: 24 bit, 32 bit higher dynamic range security range for amlification, effects, mixing, Significant increase of raw PCM data rate

17 PCM Coding Multi channel audio, e.g. Dolby surround (cinema, DVD) 5.1 = 6 separate audio channels Significant increase of raw PCM data rate Lossless Audio Coding Motivation: Storage and archiving for higher quality requirements, e.g. in studios Pure redundancy reduction, i.e. the signal can be reconstructed perfectly Irrelevancy can turn into relevancy by editing (filtering, effects, mixing, ) Applications also in consumer area (highest-quality audio)

18 Example audio studio: Lossless Audio Coding highest-quality audio: 96 khz x 24 bit => 2.3 Mbit/s 4 min song: ca. 69 MByte 48 tracks: ca. 3.3 GByte storage Several takes, songs per album, artists, => Extreme storage requirements Lossless Audio Coding Example film audio, multi channel 5.1 highest-quality audio: 96 khz x 24 bit => 2.3 Mbit/s 90 min: ca GByte 6 channels: ca. 7.5 Gbyte storage Mixed from different takes, versions, music, speech, sounds, effects, => Extreme storage requirements

19 Part of MPEG-4 MPEG 4 Lossless Coding Based on a proposal of Tilman Liebchen und Prof. Peter Noll, FG Nachrichtenübertragung, TU-Berlin Principle: ADPCM + Entropy coding MPEG 4 Lossless Coding

20 MPEG 4 Lossless Coding MPEG 4 Lossless Coding Forward ADPCM: coefficients have to be transmitted The more coefficients are used for prediction the lower the error variance and with that the bitrate But: the more coefficients have to be transmitted, bitrate increases Optimization problem regarding overall bitrate Optimum predictor length and optimum coefficients are transmitted

21 MPEG 4 Lossless Coding Processing in time windows of 2048 Samples at 48 khz (ca. 43 ms) Division into 4 sub-segments in case of instationary signals Joint Stereo Coding: exploitation of redundancies between channels by prediction Entropy coding (Rice Coding) MPEG 4 Lossless Coding Results: compression related to PCM in % (same quality, lossless!) Representative MPEG test set, average values

22 MPEG 4 Lossless Coding Limited compression, factor More is hardly possible only by redundancy reduction Nevertheless: important economic factor in professional applications (storage/archiving in studios) and others Sub band Coding Band-pass filter bank Often separated in analogy to human perception (third octaves) h 1 (n) COD 1 x(n) h 2 (n) COD 2 Multiplex Bitstrom h M (n) COD M

23 Irrelevancy reduction Lossy Audio Coding Perceptual Coding Things that cannot be perceived are omitted The error is adjusted just below the threshold of perception Perceptual Coding Analysis of the signal related to a psycho acoustic model, which models the characteristic of audition Calculation of auditory thresholds, masking, temporal masking, etc. Quantization of the signal related to the audibility of the error Audio in Filterbank Quantisierung Codierung MUX Bitstream out Psychoakust. Modell

24 Lossy Audio Coding Main technology: Sub-band coding and transform coding (DCT) Coding in the frequency domain Adapted to human audition, which perceives frequency characteristics in time windows Separation of the signal in frequency components and separate coding Lossy Audio Coding Sub-band coding: filter bank, relatively low frequency resolution Transform coding (DCT): relatively high frequency resolution Windows: processing in blocks of samples (e.g. 8 ms) Problem: pre-echoes in sudden percussive sounds, error is distributed over the whole window and may become audible in silent passages

25 Pre-Echoes Time window of samples Transformation Quantization Inv. Transf. Distribution of the error over the whole window MPEG 1 Audio Layer 1+2 Filterbank Analyse FFT Mithörschwellen Skalierer Quantisierer Dynamische Bitzuweisung & Codierung Mux Filterbank Synthese Deskalierer Dequantisierer Dynamische Bitzuweisung & Decodierung De- Mux

26 MPEG 1 Audio Layer 1+2 Layer 1, simplest form, 32 frequency bands Layer 2 some improvements finer quantization Operation modi Mono Stereo (separately coded) Dual separate channels (e.g. for bilingual programs) Joint Stereo (exploitation of stereo redundancy) MPEG 1 Audio Layer 3 / MP3 Filterbank Analyse FFT MDCT dyn. Fensterung Mithörschwellen Skalierer Quantisierer Huffman- Coder Coder Zusatzinfo Mux Filterbank Synthese Inverse MDCT dyn. Fensterung Deskalierer Dequantisierer Huffman- Decoder Decodier Zusatzinfo De- Mux

27 Improvements of MP3 Hybrid coding, sub-band + MDCT for higher frequency resolution 32 bands -> up to 576 spectral lines Better bit assignment Entropy coding Pre-echo control Dynamic windows, overlapped MPEG Standards MPEG Standards only specify the syntax of the bitstream and the decoding process The encoder can be implemented by any provider differently as long as the right syntax is created (same language) Allows competition among providers while ensuring interoperability of systems A good MP3 Encoder: Fraunhofer IIS, esp. Psycho-acoustic model

28 MPEG Standards Principle of asymmetric coding: encoder is much more complex than decoder (complex signal analysis only in encoder) Receivers can be realized simple, cheap High complexity is often acceptable at sender side (broadcast, DVD production) Not the case in telecommunication, but that is not the main focus of MPEG MPEG 1 Audio Compression Gains Layer Stereo-Rate [kbit/s] Gewinn Anwendung DCC DAB MP3

29 5.1 Surround Setup MPEG 2 BC Backward compatible Supports 5.1 and is compatible to MPEG-1 Supports lower sampling frequencies (16, 22,05, 24 khz) for special applications at very low bitrates (MPEG-1: 32, 44,1, 48 khz)

30 MPEG 2 AAC Abandoning backward compatibility even higher compression ratios can be achieved Advanced Audio Coding (AAC) Same principle as in MP3, perceptual coding Some new algorithms that in sum provide further improvements At 96 kbit/s same quality as MP3 at 128 kbit/s Suitable for very low bitrates down to 32 kbit/s and below MPEG 2 AAC n Modified DCT (128/1024) Psychoakustisches Modell Temporal Noise Shaping Temporal Prediction Stereo Coding Skalierung und Quantisierung Huffman- Codierung Codierung der Zusatzinformation M u l t i p l e x

31 MPEG 4 AAC Some small improvements over MPEG-2 AAC Spectral band replication Not only coding of audio MPEG 4 Audio Toolbox Definition of an audio-visual scene (2D/3D), e.g. distribution of AV-objects in a virtual 3D room AV-scene consists of audio, video and synthetic objects => the scene is composed Described in a specific script language (BInary Format for Scenes, BIFS, superset of VRML) The AV-scene is presented, edited, decoded, one interacts

32 MPEG 4 Audio Toolbox MPEG 4 Audio Toolbox Specification of different codecs, optimized for different types of data, e.g. General Audio (MPEG-4 AAC) Speech codecs (CELP, text-to-speech) Wave table codecs für synthetic audio (MIDI) Specification of interaction mechanisms with audio objects Switch on/off, move, increase/decrease volume, copy, Final mix of the scene at the decoder Decoding, interpreting interaction, rendering, display

33 Parametric Audio Coding Separation of audio into components (analysis), e.g. harmonic, sine, noise components Separate optimized coding Modellbasierte Signalzerlegung Harmonische Komponenten Sinusoidale Komponenten Quantisierung Parametercodierung Psychoakustisches Modell Codierung der Zusatzinformation Rausch- Komponenten Selektion relevanter Komponenten M u l t i p l e x Parametric Audio Coding

Introduction FOR CLASS USE ONLY DO NOT DISTRIBUTE. MMC Overview

Introduction FOR CLASS USE ONLY DO NOT DISTRIBUTE. MMC Overview Introduction MMC Overview 1. Introduction 2. Fundamentals (Signal Processing, Information Theorie) 3. Speech Processing & Coding 4. Audio Processing & Coding 5. Still Image Coding (JPEG, etc.) 6. Video

More information

AUDIO CODING: BASICS AND STATE OF THE ART

AUDIO CODING: BASICS AND STATE OF THE ART AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603

More information

Introduction FOR CLASS USE ONLY DO NOT DISTRIBUTE. MMC Overview

Introduction FOR CLASS USE ONLY DO NOT DISTRIBUTE. MMC Overview Introduction MMC Overview 1. Introduction 2. Fundamentals (Signal Processing, Information Theorie) 3. Speech Processing & Coding 4. Audio Processing & Coding 5. Still Image Coding (JPEG, etc.) 6. Video

More information

Audio Coding, Psycho- Accoustic model and MP3

Audio Coding, Psycho- Accoustic model and MP3 INF5081: Multimedia Coding and Applications Audio Coding, Psycho- Accoustic model and MP3, NR Torbjørn Ekman, Ifi Nils Christophersen, Ifi Sverre Holm, Ifi What is Sound? Sound waves: 20Hz - 20kHz Speed:

More information

Digital Audio Compression: Why, What, and How

Digital Audio Compression: Why, What, and How Digital Audio Compression: Why, What, and How An Absurdly Short Course Jeff Bier Berkeley Design Technology, Inc. 2000 BDTI 1 Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions

More information

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final

More information

Audio Coding Introduction

Audio Coding Introduction Audio Coding Introduction Lecture WS 2013/2014 Prof. Dr.-Ing. Karlheinz Brandenburg bdg@idmt.fraunhofer.de Prof. Dr.-Ing. Gerald Schuller shl@idmt.fraunhofer.de Page Nr. 1 Organisatorial Details - Overview

More information

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

Lecture 4: Jan 12, 2005

Lecture 4: Jan 12, 2005 EE516 Computer Speech Processing Winter 2005 Lecture 4: Jan 12, 2005 Lecturer: Prof: J. Bilmes University of Washington Dept. of Electrical Engineering Scribe: Scott Philips

More information

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream Signal to Mask Ratio

More information

The Theory Behind Mp3

The Theory Behind Mp3 The Theory Behind Mp3 Rassol Raissi December 2002 Abstract Since the MPEG-1 Layer III encoding technology is nowadays widely used it might be interesting to gain knowledge of how this powerful compression/decompression

More information

For Articulation Purpose Only

For Articulation Purpose Only E305 Digital Audio and Video (4 Modular Credits) This document addresses the content related abilities, with reference to the module. Abilities of thinking, learning, problem solving, team work, communication,

More information

Audio Coding Algorithm for One-Segment Broadcasting

Audio Coding Algorithm for One-Segment Broadcasting Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient

More information

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton Abstract This paper intends to give the reader some insight into the workings

More information

DAB + The additional audio codec in DAB

DAB + The additional audio codec in DAB DAB + The additional audio codec in DAB 2007 Contents Why DAB + Features of DAB + Possible scenarios with DAB + Comparison of DAB + and DMB for radio services Performance of DAB + Status of standardisation

More information

MP3 AND AAC EXPLAINED

MP3 AND AAC EXPLAINED MP3 AND AAC EXPLAINED KARLHEINZ BRANDENBURG ½ ½ Fraunhofer Institute for Integrated Circuits FhG-IIS A, Erlangen, Germany bdg@iis.fhg.de The last years have shown widespread proliferation of.mp3-files,

More information

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Quality Estimation for Scalable Video Codec Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Purpose of scalable video coding Multiple video streams are needed for heterogeneous

More information

Analog-to-Digital Voice Encoding

Analog-to-Digital Voice Encoding Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample

More information

MPEG Layer-3. An introduction to. 1. Introduction

MPEG Layer-3. An introduction to. 1. Introduction An introduction to MPEG Layer-3 MPEG Layer-3 K. Brandenburg and H. Popp Fraunhofer Institut für Integrierte Schaltungen (IIS) MPEG Layer-3, otherwise known as MP3, has generated a phenomenal interest among

More information

FAST MIR IN A SPARSE TRANSFORM DOMAIN

FAST MIR IN A SPARSE TRANSFORM DOMAIN ISMIR 28 Session 4c Automatic Music Analysis and Transcription FAST MIR IN A SPARSE TRANSFORM DOMAIN Emmanuel Ravelli Université Paris 6 ravelli@lam.jussieu.fr Gaël Richard TELECOM ParisTech gael.richard@enst.fr

More information

The Design and Implementation of Multimedia Software

The Design and Implementation of Multimedia Software Chapter 10 Auditory Content The Design and Implementation of Multimedia Software David Bernstein Jones and Bartlett Publishers www.jbpub.com David Bernstein (jbpub.com) Multimedia Software Jones and Bartlett

More information

Convention Paper 5553

Convention Paper 5553 Audio Engineering Society Convention Paper 5553 Presented at the 112th Convention 2 May 1 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

Digital Audio Compression

Digital Audio Compression By Davis Yen Pan Abstract Compared to most digital data types, with the exception of digital video, the data rates associated with uncompressed digital audio are substantial. Digital audio compression

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional

More information

Digital terrestrial television broadcasting Audio coding

Digital terrestrial television broadcasting Audio coding Digital terrestrial television broadcasting Audio coding Televisão digital terrestre Codificação de vídeo, áudio e multiplexação Parte 2: Codificação de áudio Televisión digital terrestre Codificación

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

A Review of Algorithms for Perceptual Coding of Digital Audio Signals

A Review of Algorithms for Perceptual Coding of Digital Audio Signals A Review of Algorithms for Perceptual Coding of Digital Audio Signals Ted Painter, Student Member IEEE, and Andreas Spanias, Senior Member IEEE Department of Electrical Engineering, Telecommunications

More information

White Paper: An Overview of the Coherent Acoustics Coding System

White Paper: An Overview of the Coherent Acoustics Coding System White Paper: An Overview of the Coherent Acoustics Coding System Mike Smyth June 1999 Introduction Coherent Acoustics is a digital audio compression algorithm designed for both professional and consumer

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

Hearing and Deafness 1. Anatomy & physiology

Hearing and Deafness 1. Anatomy & physiology Hearing and Deafness 1. Anatomy & physiology Chris Darwin Web site for lectures, lecture notes and filtering lab: http://www.lifesci.susx.ac.uk/home/chris_darwin/ safari 1 Outer, middle & inner ear Capture;

More information

JPEG Image Compression by Using DCT

JPEG Image Compression by Using DCT International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Issue-4 E-ISSN: 2347-2693 JPEG Image Compression by Using DCT Sarika P. Bagal 1* and Vishal B. Raskar 2 1*

More information

HE-AAC v2. MPEG-4 HE-AAC v2 (also known as aacplus v2 ) is the combination of three technologies:

HE-AAC v2. MPEG-4 HE-AAC v2 (also known as aacplus v2 ) is the combination of three technologies: HE- v2 MPEG-4 audio coding for today s digital media world Stefan Meltzer and Gerald Moser Coding Technologies Delivering broadcast-quality content to consumers is one of the most challenging tasks in

More information

UNIVERSITY OF CALICUT

UNIVERSITY OF CALICUT UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION BMMC (2011 Admission) V SEMESTER CORE COURSE AUDIO RECORDING & EDITING QUESTION BANK 1. Sound measurement a) Decibel b) frequency c) Wave 2. Acoustics

More information

Preservation Handbook

Preservation Handbook Preservation Handbook Digital Audio Author Gareth Knight & John McHugh Version 1 Date 25 July 2005 Change History Page 1 of 8 Definition Sound in its original state is a series of air vibrations (compressions

More information

Technical Paper. Dolby Digital Plus Audio Coding

Technical Paper. Dolby Digital Plus Audio Coding Technical Paper Dolby Digital Plus Audio Coding Dolby Digital Plus is an advanced, more capable digital audio codec based on the Dolby Digital (AC-3) system that was introduced first for use on 35 mm theatrical

More information

Figure 1: Relation between codec, data containers and compression algorithms.

Figure 1: Relation between codec, data containers and compression algorithms. Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order

More information

Aalborg Universitet. Estimation and Modeling Problems in Parametric Audio Coding Christensen, Mads Græsbøll. Publication date: 2005

Aalborg Universitet. Estimation and Modeling Problems in Parametric Audio Coding Christensen, Mads Græsbøll. Publication date: 2005 Aalborg Universitet Estimation and Modeling Problems in Parametric Audio Coding Christensen, Mads Græsbøll Publication date: 2005 Document Version Publisher's PDF, also known as Version of record Link

More information

Introduction to image coding

Introduction to image coding Introduction to image coding Image coding aims at reducing amount of data required for image representation, storage or transmission. This is achieved by removing redundant data from an image, i.e. by

More information

encoding compression encryption

encoding compression encryption encoding compression encryption ASCII utf-8 utf-16 zip mpeg jpeg AES RSA diffie-hellman Expressing characters... ASCII and Unicode, conventions of how characters are expressed in bits. ASCII (7 bits) -

More information

Simple Voice over IP (VoIP) Implementation

Simple Voice over IP (VoIP) Implementation Simple Voice over IP (VoIP) Implementation ECE Department, University of Florida Abstract Voice over IP (VoIP) technology has many advantages over the traditional Public Switched Telephone Networks. In

More information

Basic principles of Voice over IP

Basic principles of Voice over IP Basic principles of Voice over IP Dr. Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications and Multimedia Faculty of Electrical Engineering University of Žilina, Slovakia Outline VoIP Transmission

More information

PRIMER ON PC AUDIO. Introduction to PC-Based Audio

PRIMER ON PC AUDIO. Introduction to PC-Based Audio PRIMER ON PC AUDIO This document provides an introduction to various issues associated with PC-based audio technology. Topics include the following: Introduction to PC-Based Audio Introduction to Audio

More information

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder.

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder. A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER Manoj Kumar 1 Mohammad Zubair 1 1 IBM T.J. Watson Research Center, Yorktown Hgts, NY, USA ABSTRACT The MPEG/Audio is a standard for both

More information

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different

More information

Digital Audio and Video Data

Digital Audio and Video Data Multimedia Networking Reading: Sections 3.1.2, 3.3, 4.5, and 6.5 CS-375: Computer Networks Dr. Thomas C. Bressoud 1 Digital Audio and Video Data 2 Challenges for Media Streaming Large volume of data Each

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem

Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem A C ombining Moseley s unparalleled reputation for high quality RF aural Studio-Transmitter Links (STLs) with the performance and speed of today s

More information

Trigonometric functions and sound

Trigonometric functions and sound Trigonometric functions and sound The sounds we hear are caused by vibrations that send pressure waves through the air. Our ears respond to these pressure waves and signal the brain about their amplitude

More information

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN Zheng Lai Zhao Liu Meng Li Quan Yuan zl2215@columbia.edu zl2211@columbia.edu ml3088@columbia.edu qy2123@columbia.edu I. Overview Architecture The purpose

More information

DOLBY SR-D DIGITAL. by JOHN F ALLEN

DOLBY SR-D DIGITAL. by JOHN F ALLEN DOLBY SR-D DIGITAL by JOHN F ALLEN Though primarily known for their analog audio products, Dolby Laboratories has been working with digital sound for over ten years. Even while talk about digital movie

More information

Welcome to the United States Patent and TradeMark Office

Welcome to the United States Patent and TradeMark Office Welcome to the United States Patent and TradeMark Office an Agency of the United States Department of Commerce United States Patent 5,159,703 Lowery October 27, 1992 Silent subliminal presentation system

More information

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A International Journal of Science, Engineering and Technology Research (IJSETR), Volume, Issue, January SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A N.Rama Tej Nehru, B P.Sunitha

More information

Your Hearing ILLUMINATED

Your Hearing ILLUMINATED Your Hearing ILLUMINATED INFORMATION FROM YOUR HEARING CARE PROFESSIONAL REDISCOVER your hearing and reconnect 1 with the important things you might have been missing. Your sense of hearing is a vital

More information

Analog Representations of Sound

Analog Representations of Sound Analog Representations of Sound Magnified phonograph grooves, viewed from above: The shape of the grooves encodes the continuously varying audio signal. Analog to Digital Recording Chain ADC Microphone

More information

Video compression: Performance of available codec software

Video compression: Performance of available codec software Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes

More information

Compression techniques

Compression techniques Compression techniques David Bařina February 22, 2013 David Bařina Compression techniques February 22, 2013 1 / 37 Contents 1 Terminology 2 Simple techniques 3 Entropy coding 4 Dictionary methods 5 Conclusion

More information

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter An Optimised Software Solution for an ARM Powered TM MP3 Decoder By Barney Wragg and Paul Carpenter Abstract The market predictions for MP3-based appliances are extremely positive. The ability to maintain

More information

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29. Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet

More information

Classes of multimedia Applications

Classes of multimedia Applications Classes of multimedia Applications Streaming Stored Audio and Video Streaming Live Audio and Video Real-Time Interactive Audio and Video Others Class: Streaming Stored Audio and Video The multimedia content

More information

Chapter 6: Broadcast Systems. Mobile Communications. Unidirectional distribution systems DVB DAB. High-speed Internet. architecture Container

Chapter 6: Broadcast Systems. Mobile Communications. Unidirectional distribution systems DVB DAB. High-speed Internet. architecture Container Mobile Communications Chapter 6: Broadcast Systems Unidirectional distribution systems DAB DVB architecture Container High-speed Internet Prof. Dr.-Ing. Jochen Schiller, http://www.jochenschiller.de/ MC

More information

2K Processor AJ-HDP2000

2K Processor AJ-HDP2000 Jan.31, 2007 2K Processor AJ-HDP2000 Technical Overview Version 2.0 January 31, 2007 Professional AV Systems Business Unit Panasonic AVC Networks Company Panasonic Broadcast & Television Systems Company

More information

T-61.184. Automatic Speech Recognition: From Theory to Practice

T-61.184. Automatic Speech Recognition: From Theory to Practice Automatic Speech Recognition: From Theory to Practice http://www.cis.hut.fi/opinnot// September 27, 2004 Prof. Bryan Pellom Department of Computer Science Center for Spoken Language Research University

More information

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet DICTA2002: Digital Image Computing Techniques and Applications, 21--22 January 2002, Melbourne, Australia Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet K. Ramkishor James. P. Mammen

More information

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Yao Wang, 2003 EE4414: Video Coding Standards 2 Outline Overview of Standards and Their Applications ITU-T

More information

A Digital Audio Watermark Embedding Algorithm

A Digital Audio Watermark Embedding Algorithm Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 3008, China tangxh@hziee.edu.cn,

More information

Peter Eisert, Thomas Wiegand and Bernd Girod. University of Erlangen-Nuremberg. Cauerstrasse 7, 91058 Erlangen, Germany

Peter Eisert, Thomas Wiegand and Bernd Girod. University of Erlangen-Nuremberg. Cauerstrasse 7, 91058 Erlangen, Germany RATE-DISTORTION-EFFICIENT VIDEO COMPRESSION USING A 3-D HEAD MODEL Peter Eisert, Thomas Wiegand and Bernd Girod Telecommunications Laboratory University of Erlangen-Nuremberg Cauerstrasse 7, 91058 Erlangen,

More information

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques Vineela Behara,Y Ramesh Department of Computer Science and Engineering Aditya institute of Technology and

More information

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- sumedha.phatak@mavs.uta.edu Objective: A study, implementation and comparison of

More information

BASS MANAGEMENT SYSTEM

BASS MANAGEMENT SYSTEM E BASS MANAGEMENT SYSTEM 1 What is Bass Management? In stereo reproduction, signals from 20Hz to 20kHz need to be replayed. Multi-way speaker systems will reproduce that bandwidth evenly. With multichannel

More information

GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS

GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS Digitization Best Practices for Audio This document sets forth guidelines for digitizing audio materials for CARLI Digital Collections. The issues described

More information

EE3414 Multimedia Communication Systems Part I

EE3414 Multimedia Communication Systems Part I EE3414 Multimedia Communication Systems Part I Spring 2003 Lecture 1 Yao Wang Electrical and Computer Engineering Polytechnic University Course Overview A University Sequence Course in Multimedia Communication

More information

!"#$"%&' What is Multimedia?

!#$%&' What is Multimedia? What is Multimedia? %' A Big Umbrella Goal of This Course Understand various aspects of a modern multimedia pipeline Content creating, editing Distribution Search & mining Protection Hands-on experience

More information

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 4, JULY 2003 365 High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform Dai Yang, Member, IEEE, Hongmei Ai, Member, IEEE, Chris

More information

Dream DRM Receiver Documentation

Dream DRM Receiver Documentation Dream DRM Receiver Documentation Dream is a software implementation of a Digital Radio Mondiale (DRM) receiver. All what is needed to receive DRM transmissions is a PC with a sound card and a modified

More information

Network Traffic #5. Traffic Characterization

Network Traffic #5. Traffic Characterization Network #5 Section 4.7.1, 5.7.2 1 Characterization Goals to: Understand the nature of what is transported over communications networks. Use that understanding to improve network design Characterization

More information

Overview: Video Coding Standards

Overview: Video Coding Standards Overview: Video Coding Standards Video coding standards: applications and common structure Relevant standards organizations ITU-T Rec. H.261 ITU-T Rec. H.263 ISO/IEC MPEG-1 ISO/IEC MPEG-2 ISO/IEC MPEG-4

More information

Tonal Detection in Noise: An Auditory Neuroscience Insight

Tonal Detection in Noise: An Auditory Neuroscience Insight Image: http://physics.ust.hk/dusw/ Tonal Detection in Noise: An Auditory Neuroscience Insight Buddhika Karunarathne 1 and Richard H.Y. So 1,2 1 Dept. of IELM, Hong Kong University of Science & Technology,

More information

Convention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany

Convention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany Audio Engineering Society Convention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral GSM speech coding Forelesning INF 5080 Vårsemester 2004 Sources This part contains material from: Web pages Universität Bremen, Arbeitsbereich Nachrichtentechnik (ANT): Prof.K.D. Kammeyer, Jörg Bitzer,

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6 Page 1 of 6 1. Scope. This operational practice sets out the requirements for downmixing 5.1 and 5.0 channel surround sound audio mixes to 2 channel stereo. This operational practice recommends a number

More information

http://www.springer.com/0-387-23402-0

http://www.springer.com/0-387-23402-0 http://www.springer.com/0-387-23402-0 Chapter 2 VISUAL DATA FORMATS 1. Image and Video Data Digital visual data is usually organised in rectangular arrays denoted as frames, the elements of these arrays

More information

VIDEOTELEPHONY AND VIDEOCONFERENCE OVER ISDN

VIDEOTELEPHONY AND VIDEOCONFERENCE OVER ISDN VIDEOTELEPHONY AND VIDEOCONFERENCE OVER ISDN Fernando Pereira Instituto Superior Técnico Digital Video Video versus Images Still Image Services No strong temporal requirements; no realtime notion. Video

More information

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals Modified from the lecture slides of Lami Kaya (LKaya@ieee.org) for use CECS 474, Fall 2008. 2009 Pearson Education Inc., Upper

More information

Music technology. Draft GCE A level and AS subject content

Music technology. Draft GCE A level and AS subject content Music technology Draft GCE A level and AS subject content July 2015 Contents The content for music technology AS and A level 3 Introduction 3 Aims and objectives 3 Subject content 4 Recording and production

More information

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN PESQ: An Introduction White Paper Prepared by: Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800 f: +44 (0) 1473 261 880 e: info@psytechnics.com September

More information

A Framework for Robust and Scalable Audio Streaming

A Framework for Robust and Scalable Audio Streaming A Framework for Robust and Scalable Audio Streaming Ye Wang, Wendong Huang, Jari Korhonen School of Computing, National University of Singapore {wangye, huangwd, jari}@comp.nus.edu.sg ABSTRACT We propose

More information

DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life

DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life White Paper Document No. 9302K05100 Revision A Effective Date: May 2011 DTS, Inc. 5220 Las Virgenes Road Calabasas, CA 91302 USA www.dts.com

More information

What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing

What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing What Audio Engineers Should Know About Human Sound Perception Part 2. Binaural Effects and Spatial Hearing AES 112 th Convention, Munich AES 113 th Convention, Los Angeles Durand R. Begault Human Factors

More information

5th Congress of Alps-Adria Acoustics Association NOISE-INDUCED HEARING LOSS

5th Congress of Alps-Adria Acoustics Association NOISE-INDUCED HEARING LOSS 5th Congress of Alps-Adria Acoustics Association 12-14 September 2012, Petrčane, Croatia NOISE-INDUCED HEARING LOSS Davor Šušković, mag. ing. el. techn. inf. davor.suskovic@microton.hr Abstract: One of

More information

A diagram of the ear s structure. The outer ear includes the portion of the ear that we see the pinna/auricle and the ear canal.

A diagram of the ear s structure. The outer ear includes the portion of the ear that we see the pinna/auricle and the ear canal. A diagram of the ear s structure THE OUTER EAR The outer ear includes the portion of the ear that we see the pinna/auricle and the ear canal. The pinna or auricle is a concave cartilaginous structure,

More information

Digital Transmission of Analog Data: PCM and Delta Modulation

Digital Transmission of Analog Data: PCM and Delta Modulation Digital Transmission of Analog Data: PCM and Delta Modulation Required reading: Garcia 3.3.2 and 3.3.3 CSE 323, Fall 200 Instructor: N. Vlajic Digital Transmission of Analog Data 2 Digitization process

More information

Networked AV Systems Pretest

Networked AV Systems Pretest Networked AV Systems Pretest Instructions Choose the best answer for each question. Score your pretest using the key on the last page. If you miss three or more out of questions 1 11, consider taking Essentials

More information

Dr. Abdel Aziz Hussein Lecturer of Physiology Mansoura Faculty of Medicine

Dr. Abdel Aziz Hussein Lecturer of Physiology Mansoura Faculty of Medicine Physiological Basis of Hearing Tests By Dr. Abdel Aziz Hussein Lecturer of Physiology Mansoura Faculty of Medicine Introduction Def: Hearing is the ability to perceive certain pressure vibrations in the

More information

Technical Advances in Digital Audio Radio Broadcasting

Technical Advances in Digital Audio Radio Broadcasting Technical Advances in Digital Audio Radio Broadcasting CHRISTOF FALLER, BIING-HWANG JUANG, FELLOW, IEEE, PETER KROON, FELLOW, IEEE, HUI-LING LOU, MEMBER, IEEE, SEAN A. RAMPRASHAD, MEMBER, IEEE, AND CARL-ERIK

More information

New Methods of Stereo Encoding for FM Radio Broadcasting Based on Digital Technology

New Methods of Stereo Encoding for FM Radio Broadcasting Based on Digital Technology 12 P. STRAŇÁK, NEW METHODS OF STEREO ENCODING FOR FM RADIO BROADCASTING BASED ON DIGITAL TECHNOLOGY New Methods of Stereo Encoding for FM Radio Broadcasting Based on Digital Technology Pavel STRAŇÁK Phobos

More information

White paper. An explanation of video compression techniques.

White paper. An explanation of video compression techniques. White paper An explanation of video compression techniques. Table of contents 1. Introduction to compression techniques 4 2. Standardization organizations 4 3. Two basic standards: JPEG and MPEG 4 4. The

More information