Lecture 11 MP3 and MP4 Audio (Part 7)

Size: px
Start display at page:

Download "Lecture 11 MP3 and MP4 Audio (Part 7)"

Transcription

1 CS 414 Multimedia Systems Design Lecture 11 MP3 and MP4 Audio (Part 7) Klara Nahrstedt Spring 2012

2 Administrative MP1 deadline February 18

3 Outline MP3 Audio Encoding MP4 Audio Reading: Media Coding book, Section Recommended Paper on MP3: Davis Pan, A Tutorial on MPEG/Audio Compression, IEEE Multimedia, pp. 6-74, 1995 Recommended books on JPEG/ MPEG Audio/Video Fundamentals: Haskell, Puri, Netravali, Digital Video: An Introduction to MPEG-2, Chapman and Hall, 1996

4 Why Compression is Needed Data rate = sampling rate * quantization bits * channels (+ control information) For example (digital audio): Hz; 16 bits; 2 channels generates about 1.4M of data per second; 84M per minute; 5G per hour

5 MPEG-1 Audio Lossy compression of audio In late 1980 s ISO s MPEG group started to standardize TV broadcasting Use of Audio on CD-ROM (later DVD) MPEG-1 Audio 1992 MPEG-2 Audio MPEG-1 Audio Layer I, II, III

6 Criteria for A Good Standard Achieve desired outcome Be comprehensible Allow efficient implementation Support competition Give benchmark tests Be supported by industry Be good for end users. Two models: implement first, then standardize standardize first, then implement

7 MPEG-1 Audio Layer II Called MP2 Dominant standard for audio broadcasting DAB digital radio and DVB digital television Came out of MUSICAM codecs with bit rates kbps MUSICAM audio coding - basis for MPEG-1 and MPEG-2 audio Sampling rates: 32, 44.1, 48 khz Bit rates: 32, 48, 56, 64, 80, 96, 384 kbps Format: mono, stereo, dual channel, MP2 sub-band audio encoder in time domain

8 MPEG-1 Audio Layer III MPEG-1 Layer III is called MP3 format Popular for PC and Internet applications Goal to compress to 128 kbps, but can be compressed to higher or lower resulting quality Utilization of psychoacoustics Scientific study of sound perception.

9 MPEG Audio MP3 First psychoacoustic masking code was proposed in 1979 in AT&T Bell Labs, Murray Hill. MP3 based on OCF (optimum coding in frequency domain) and PXFM (Perceptual transform coding) MPEG-1 Audio Layer III public release 1993 MPEG-2 Audio III public release 1995

10 MPEG Audio MP mp3.com offering thousands of MP3s created by independent artists for free 1999 Napster MP3 peer-to-peer file sharing Problem: copyright infringement Authorized services: Amazon.com, Rhapsody, Juno Records,..

11 MPEG-1 Audio Encoding Characteristics Precision 16 bits Sampling frequency: 32KHz, 44.1 KHz, 48 KHz 3 compression layers: Layer 1, Layer 2, Layer 3 (MP3) Layer 3: kbps, target 64 kbps Layer 2: kbps, target 128 kbps Layer 1: kbps, target 192 kbps

12 MPEG Audio Encoding Steps

13 MPEG Audio Filter Bank Filter bank divides input into multiple sub-bands (32 equal frequency sub-bands) Sub-band i defined St[ i] 7 k j 0 (2i cos( i [ 0,31], S [ i] 1)( k 16) *( C[ k j]* x[ k 64 j] t - filter output sample for sub-band i at time t, C[n] one of 512 coefficients, x[n] audio input sample from 512 sample buffer

14 MPEG Audio Psycho-acoustic Model MPEG audio compresses by removing acoustically irrelevant parts of audio signals Takes advantage of human auditory systems inability to hear quantization noise under auditory masking Auditory masking: occurs when ever the presence of a strong audio signal makes a temporal or spectral neighborhood of weaker audio signals imperceptible.

15

16 Loudness and Pitch (Review on Psychoacoustic Effects) More sensitive to loudness at mid frequencies than at other frequencies intermediate frequencies at [500hz, 5000hz] Human hearing frequencies at [20hz,20000hz] Perceived loudness of a sound changes based on frequency of that sound basilar membrane reacts more to intermediate frequencies than other frequencies

17 Fletcher-Munson Contours Each contour represents an equal perceived sound Perception sensitivity (loudness) is not linear across all frequencies and intensities

18 Masking Effects (Review of Psychoacoustic Effects) Frequency masking Temporal masking

19 MPEG/audio divides audio signal into frequency sub-bands that approximate critical bands. Then we quantize each sub-band according to the audibility of quantization noise within the band

20 MPEG Audio Bit Allocation This process determines number of code bits allocated to each sub-band based on information from the psychoacoustic model Algorithm: 1. Compute mask-to-noise ratio: MNR=SNR-SMR Standard provides tables that give estimates for SNR resulting from quantizing to a given number of quantizer levels 2. Get MNR for each sub-band 3. Search for sub-band with the lowest MNR 4. Allocate code bits to this sub-band. If sub-band gets allocated more code bits than appropriate, look up new estimate of SNR and repeat step 1

21 Audio Quality Bitrate With too low bit rate, we get compression artifacts Ringing Pre-echo sound is heard before it occurs. It is most noticeable in impulsive sounds from percussion instruments such as cymbals Occurs in transform-based audio compression algorithms Quality of encoder and encoding parameters Constant Bit rate encoding Variable Bit rate encoding

22 MP3 Audio Format Source:

23 MPEG Audio Comments Precision of 16 bits per sample is needed to get good SNR ratio Noise we are getting is quantization noise from the digitization process For each added bit, we get 6dB better SNR ratio Masking effect means that we can raise the noise floor around a strong sound because the noise will be masked away Raising noise floor is the same as using less bits and using less bits is the same as compression

24 Successor of MP3 Advanced Audio Coding (AAC) now part of MPEG-4 Audio Inclusion of 48 full-bandwidth audio channels Default audio format for iphone, ipad, Nintendo, PlayStation, Nokia, Android, BlackBerry Introduced 1997 as MPEG-2 Part 7 In 1999 updated and included in MPEG-4

25 AAC s Improvements over MP3 More sample frequencies (8-96 khz) Arbitrary bit rates and variable frame length Higher efficiency and simpler filterbank Uses pure MDCT (modified discrete cosine transform) Used in Windows Media Audio

26 MPEG-4 Audio Variety of applications General audio signals Speech signals Synthetic audio Synthesized speech (structured audio)

27 MPEG-4 Audio Part 3 Includes variety of audio coding technologies Lossy speech coding (e.g., CELP) CELP code-excited linear prediction speech coding General audio coding (AAC) Lossless audio coding Text-to-Speech interface Structured Audio (e.g., MIDI)

28 MPEG-4 Part 14 Called MP4 with Extension.mp4 Multimedia container format Stores digital video and audio streams and allows streaming over Internet Container or wrapper format meta-file format whose spec describes how different data elements and metadata coesit in computer file

29 MPEG-4 Audio Bit-rate 2-64kbps Scalable for variable rates MPEG-4 defines set of coders Parametric Coding Techniques: low bit-rate 2-6kbps, 8kHz sampling frequency Code Excited Linear Prediction: medium bit-rates 6-24 kbps, 8 and 16 khz sampling rate Time Frequency Techniques: high quality audio 16 kbps and higher bit-rates, sampling rate > 7 khz CS Spring 2011

30 Conclusion MPEG Audio is an integral part of the MPEG standard to be considered together with video MPEG-4 Audio represents an major extension in terms of capabilities to MPEG-1 Audio

AUDIO CODING: BASICS AND STATE OF THE ART

AUDIO CODING: BASICS AND STATE OF THE ART AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

Audio Coding Introduction

Audio Coding Introduction Audio Coding Introduction Lecture WS 2013/2014 Prof. Dr.-Ing. Karlheinz Brandenburg bdg@idmt.fraunhofer.de Prof. Dr.-Ing. Gerald Schuller shl@idmt.fraunhofer.de Page Nr. 1 Organisatorial Details - Overview

More information

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il

More information

Multimedia Communications

Multimedia Communications Multimedia Communications Dr. Ing. Audio Processing and Coding MMC Overview 1. Introduction 2. Fundamentals (Signal Processing, Information Theorie) 3. Speech Processing & Coding 4. Audio Processing &

More information

Audio Coding, Psycho- Accoustic model and MP3

Audio Coding, Psycho- Accoustic model and MP3 INF5081: Multimedia Coding and Applications Audio Coding, Psycho- Accoustic model and MP3, NR Torbjørn Ekman, Ifi Nils Christophersen, Ifi Sverre Holm, Ifi What is Sound? Sound waves: 20Hz - 20kHz Speed:

More information

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final

More information

Digital Audio Compression: Why, What, and How

Digital Audio Compression: Why, What, and How Digital Audio Compression: Why, What, and How An Absurdly Short Course Jeff Bier Berkeley Design Technology, Inc. 2000 BDTI 1 Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions

More information

Audio Coding Algorithm for One-Segment Broadcasting

Audio Coding Algorithm for One-Segment Broadcasting Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient

More information

DAB + The additional audio codec in DAB

DAB + The additional audio codec in DAB DAB + The additional audio codec in DAB 2007 Contents Why DAB + Features of DAB + Possible scenarios with DAB + Comparison of DAB + and DMB for radio services Performance of DAB + Status of standardisation

More information

For Articulation Purpose Only

For Articulation Purpose Only E305 Digital Audio and Video (4 Modular Credits) This document addresses the content related abilities, with reference to the module. Abilities of thinking, learning, problem solving, team work, communication,

More information

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton Abstract This paper intends to give the reader some insight into the workings

More information

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream Signal to Mask Ratio

More information

Study and Implementation of Video Compression standards (H.264/AVC, Dirac)

Study and Implementation of Video Compression standards (H.264/AVC, Dirac) Study and Implementation of Video Compression standards (H.264/AVC, Dirac) EE 5359-Multimedia Processing- Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) Objective A study, implementation and comparison

More information

Digital Audio Compression

Digital Audio Compression By Davis Yen Pan Abstract Compared to most digital data types, with the exception of digital video, the data rates associated with uncompressed digital audio are substantial. Digital audio compression

More information

Preservation Handbook

Preservation Handbook Preservation Handbook Digital Audio Author Gareth Knight & John McHugh Version 1 Date 25 July 2005 Change History Page 1 of 8 Definition Sound in its original state is a series of air vibrations (compressions

More information

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- sumedha.phatak@mavs.uta.edu Objective: A study, implementation and comparison of

More information

Tutorial about the VQR (Voice Quality Restoration) technology

Tutorial about the VQR (Voice Quality Restoration) technology Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport

More information

!"#$"%&' What is Multimedia?

!#$%&' What is Multimedia? What is Multimedia? %' A Big Umbrella Goal of This Course Understand various aspects of a modern multimedia pipeline Content creating, editing Distribution Search & mining Protection Hands-on experience

More information

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Quality Estimation for Scalable Video Codec Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Purpose of scalable video coding Multiple video streams are needed for heterogeneous

More information

DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life

DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life White Paper Document No. 9302K05100 Revision A Effective Date: May 2011 DTS, Inc. 5220 Las Virgenes Road Calabasas, CA 91302 USA www.dts.com

More information

Digital terrestrial television broadcasting Audio coding

Digital terrestrial television broadcasting Audio coding Digital terrestrial television broadcasting Audio coding Televisão digital terrestre Codificação de vídeo, áudio e multiplexação Parte 2: Codificação de áudio Televisión digital terrestre Codificación

More information

The Theory Behind Mp3

The Theory Behind Mp3 The Theory Behind Mp3 Rassol Raissi December 2002 Abstract Since the MPEG-1 Layer III encoding technology is nowadays widely used it might be interesting to gain knowledge of how this powerful compression/decompression

More information

UNIVERSITY OF CALICUT

UNIVERSITY OF CALICUT UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION BMMC (2011 Admission) V SEMESTER CORE COURSE AUDIO RECORDING & EDITING QUESTION BANK 1. Sound measurement a) Decibel b) frequency c) Wave 2. Acoustics

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6 Page 1 of 6 1. Scope. This operational practice sets out the requirements for downmixing 5.1 and 5.0 channel surround sound audio mixes to 2 channel stereo. This operational practice recommends a number

More information

MPEG Layer-3. An introduction to. 1. Introduction

MPEG Layer-3. An introduction to. 1. Introduction An introduction to MPEG Layer-3 MPEG Layer-3 K. Brandenburg and H. Popp Fraunhofer Institut für Integrierte Schaltungen (IIS) MPEG Layer-3, otherwise known as MP3, has generated a phenomenal interest among

More information

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids Synergies and Distinctions Peter Vary RWTH Aachen University Institute of Communication Systems WASPAA, October 23, 2013 Mohonk Mountain

More information

Speech Signal Processing: An Overview

Speech Signal Processing: An Overview Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech

More information

Digital Audio and Video Data

Digital Audio and Video Data Multimedia Networking Reading: Sections 3.1.2, 3.3, 4.5, and 6.5 CS-375: Computer Networks Dr. Thomas C. Bressoud 1 Digital Audio and Video Data 2 Challenges for Media Streaming Large volume of data Each

More information

Dream DRM Receiver Documentation

Dream DRM Receiver Documentation Dream DRM Receiver Documentation Dream is a software implementation of a Digital Radio Mondiale (DRM) receiver. All what is needed to receive DRM transmissions is a PC with a sound card and a modified

More information

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering

More information

Figure 1: Relation between codec, data containers and compression algorithms.

Figure 1: Relation between codec, data containers and compression algorithms. Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order

More information

MP3 AND AAC EXPLAINED

MP3 AND AAC EXPLAINED MP3 AND AAC EXPLAINED KARLHEINZ BRANDENBURG ½ ½ Fraunhofer Institute for Integrated Circuits FhG-IIS A, Erlangen, Germany bdg@iis.fhg.de The last years have shown widespread proliferation of.mp3-files,

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

JPEG Image Compression by Using DCT

JPEG Image Compression by Using DCT International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Issue-4 E-ISSN: 2347-2693 JPEG Image Compression by Using DCT Sarika P. Bagal 1* and Vishal B. Raskar 2 1*

More information

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder.

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder. A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER Manoj Kumar 1 Mohammad Zubair 1 1 IBM T.J. Watson Research Center, Yorktown Hgts, NY, USA ABSTRACT The MPEG/Audio is a standard for both

More information

The AAC audio Coding Family For

The AAC audio Coding Family For White PapER The AAC audio Coding Family For Broadcast and Cable TV Over the last few years, the AAC audio codec family has played an increasingly important role as an enabling technology for state-of-the-art

More information

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different

More information

How To Test Video Quality With Real Time Monitor

How To Test Video Quality With Real Time Monitor White Paper Real Time Monitoring Explained Video Clarity, Inc. 1566 La Pradera Dr Campbell, CA 95008 www.videoclarity.com 408-379-6952 Version 1.0 A Video Clarity White Paper page 1 of 7 Real Time Monitor

More information

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques Vineela Behara,Y Ramesh Department of Computer Science and Engineering Aditya institute of Technology and

More information

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN PESQ: An Introduction White Paper Prepared by: Psytechnics Limited 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN t: +44 (0) 1473 261 800 f: +44 (0) 1473 261 880 e: info@psytechnics.com September

More information

EE3414 Multimedia Communication Systems Part I

EE3414 Multimedia Communication Systems Part I EE3414 Multimedia Communication Systems Part I Spring 2003 Lecture 1 Yao Wang Electrical and Computer Engineering Polytechnic University Course Overview A University Sequence Course in Multimedia Communication

More information

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29. Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet

More information

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals Modified from the lecture slides of Lami Kaya (LKaya@ieee.org) for use CECS 474, Fall 2008. 2009 Pearson Education Inc., Upper

More information

Introduction to image coding

Introduction to image coding Introduction to image coding Image coding aims at reducing amount of data required for image representation, storage or transmission. This is achieved by removing redundant data from an image, i.e. by

More information

Any Video Converter Professional User Manual 1. Any Video Converter Professional. User Manual

Any Video Converter Professional User Manual 1. Any Video Converter Professional. User Manual Any Video Converter Professional User Manual 1 Any Video Converter Professional User Manual Any Video Converter Professional User Manual 2 1. Welcome to Any Video Converter Professional...3 1.1 Main Window

More information

Figure1. Acoustic feedback in packet based video conferencing system

Figure1. Acoustic feedback in packet based video conferencing system Real-Time Howling Detection for Hands-Free Video Conferencing System Mi Suk Lee and Do Young Kim Future Internet Research Department ETRI, Daejeon, Korea {lms, dyk}@etri.re.kr Abstract: This paper presents

More information

FAST MIR IN A SPARSE TRANSFORM DOMAIN

FAST MIR IN A SPARSE TRANSFORM DOMAIN ISMIR 28 Session 4c Automatic Music Analysis and Transcription FAST MIR IN A SPARSE TRANSFORM DOMAIN Emmanuel Ravelli Université Paris 6 ravelli@lam.jussieu.fr Gaël Richard TELECOM ParisTech gael.richard@enst.fr

More information

encoding compression encryption

encoding compression encryption encoding compression encryption ASCII utf-8 utf-16 zip mpeg jpeg AES RSA diffie-hellman Expressing characters... ASCII and Unicode, conventions of how characters are expressed in bits. ASCII (7 bits) -

More information

Born-digital media for long term preservation and access: Selection or deselection of media independent music productions

Born-digital media for long term preservation and access: Selection or deselection of media independent music productions Submitted on: 29.07.2015 Born-digital media for long term preservation and access: Selection or deselection of media independent music productions Trond Valberg Head Curator, Music Section, National Library

More information

PRIMER ON PC AUDIO. Introduction to PC-Based Audio

PRIMER ON PC AUDIO. Introduction to PC-Based Audio PRIMER ON PC AUDIO This document provides an introduction to various issues associated with PC-based audio technology. Topics include the following: Introduction to PC-Based Audio Introduction to Audio

More information

Convention Paper 5553

Convention Paper 5553 Audio Engineering Society Convention Paper 5553 Presented at the 112th Convention 2 May 1 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

Lecture 1-6: Noise and Filters

Lecture 1-6: Noise and Filters Lecture 1-6: Noise and Filters Overview 1. Periodic and Aperiodic Signals Review: by periodic signals, we mean signals that have a waveform shape that repeats. The time taken for the waveform to repeat

More information

Creating Content for ipod + itunes

Creating Content for ipod + itunes apple Apple Education Creating Content for ipod + itunes This guide provides information about the file formats you can use when creating content compatible with itunes and ipod. This guide also covers

More information

A Digital Audio Watermark Embedding Algorithm

A Digital Audio Watermark Embedding Algorithm Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 3008, China tangxh@hziee.edu.cn,

More information

Classes of multimedia Applications

Classes of multimedia Applications Classes of multimedia Applications Streaming Stored Audio and Video Streaming Live Audio and Video Real-Time Interactive Audio and Video Others Class: Streaming Stored Audio and Video The multimedia content

More information

HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING. Eric Humphrey. School of Music Engineering Technology University of Miami

HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING. Eric Humphrey. School of Music Engineering Technology University of Miami HIGH-QUALITY FREQUENCY DOMAIN-BASED AUDIO WATERMARKING Eric Humphrey School of Music Engineering Technology University of Miami ABSTRACT An investigation of current audio watermarking technology is provided,

More information

Basic principles of Voice over IP

Basic principles of Voice over IP Basic principles of Voice over IP Dr. Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications and Multimedia Faculty of Electrical Engineering University of Žilina, Slovakia Outline VoIP Transmission

More information

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

APPLICATION BULLETIN AAC Transport Formats

APPLICATION BULLETIN AAC Transport Formats F RA U N H O F E R I N S T I T U T E F O R I N T E G R A T E D C I R C U I T S I I S APPLICATION BULLETIN AAC Transport Formats INITIAL RELEASE V. 1.0 2 18 1 AAC Transport Protocols and File Formats As

More information

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Yao Wang, 2003 EE4414: Video Coding Standards 2 Outline Overview of Standards and Their Applications ITU-T

More information

Technical Paper. Dolby Digital Plus Audio Coding

Technical Paper. Dolby Digital Plus Audio Coding Technical Paper Dolby Digital Plus Audio Coding Dolby Digital Plus is an advanced, more capable digital audio codec based on the Dolby Digital (AC-3) system that was introduced first for use on 35 mm theatrical

More information

MPEG-H Audio System for Broadcasting

MPEG-H Audio System for Broadcasting MPEG-H Audio System for Broadcasting ITU-R Workshop Topics on the Future of Audio in Broadcasting Jan Plogsties Challenges of a Changing Landscape Immersion Compelling sound experience through sound that

More information

H.264/MPEG-4 AVC Video Compression Tutorial

H.264/MPEG-4 AVC Video Compression Tutorial Introduction The upcoming H.264/MPEG-4 AVC video compression standard promises a significant improvement over all previous video compression standards. In terms of coding efficiency, the new standard is

More information

EUROPEAN COMPUTER DRIVING LICENCE. Multimedia Audio Editing. Syllabus

EUROPEAN COMPUTER DRIVING LICENCE. Multimedia Audio Editing. Syllabus EUROPEAN COMPUTER DRIVING LICENCE Multimedia Audio Editing Syllabus Purpose This document details the syllabus for ECDL Multimedia Module 1 Audio Editing. The syllabus describes, through learning outcomes,

More information

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP Department of Electrical and Computer Engineering Ben-Gurion University of the Negev LAB 1 - Introduction to USRP - 1-1 Introduction In this lab you will use software reconfigurable RF hardware from National

More information

The Design and Implementation of Multimedia Software

The Design and Implementation of Multimedia Software Chapter 10 Auditory Content The Design and Implementation of Multimedia Software David Bernstein Jones and Bartlett Publishers www.jbpub.com David Bernstein (jbpub.com) Multimedia Software Jones and Bartlett

More information

Objective Speech Quality Measures for Internet Telephony

Objective Speech Quality Measures for Internet Telephony Objective Speech Quality Measures for Internet Telephony Timothy A. Hall National Institute of Standards and Technology 100 Bureau Drive, STOP 8920 Gaithersburg, MD 20899-8920 ABSTRACT Measuring voice

More information

C Implementation & comparison of companding & silence audio compression techniques

C Implementation & comparison of companding & silence audio compression techniques ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 26 C Implementation & comparison of companding & silence audio compression techniques Mrs. Kruti Dangarwala 1 and Mr. Jigar Shah 2 1 Department of Computer

More information

Web-Conferencing System SAViiMeeting

Web-Conferencing System SAViiMeeting Web-Conferencing System SAViiMeeting Alexei Machovikov Department of Informatics and Computer Technologies National University of Mineral Resources Mining St-Petersburg, Russia amachovikov@gmail.com Abstract

More information

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper

Understanding the Transition From PESQ to POLQA. An Ascom Network Testing White Paper Understanding the Transition From PESQ to POLQA An Ascom Network Testing White Paper By Dr. Irina Cotanis Prepared by: Date: Document: Dr. Irina Cotanis 6 December 2011 NT11-22759, Rev. 1.0 Ascom (2011)

More information

Video compression: Performance of available codec software

Video compression: Performance of available codec software Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes

More information

Analog-to-Digital Voice Encoding

Analog-to-Digital Voice Encoding Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample

More information

Frequently asked QUESTIONS. about DOLBY DIGITAL

Frequently asked QUESTIONS. about DOLBY DIGITAL Frequently asked QUESTIONS about DOLBY DIGITAL Table of Contents 1. What is Dolby Digital?... 1 2. What program sources deliver Dolby Digital audio?... 1 3. Can I hear Dolby Digital programs over a regular

More information

Loudness and Dynamic Range

Loudness and Dynamic Range Loudness and Dynamic Range in broadcast audio the Dolby solution Tony Spath Dolby Laboratories, Inc. Digital delivery media offer a wider dynamic range for audio than their analogue predecessors. This

More information

A Framework for Robust and Scalable Audio Streaming

A Framework for Robust and Scalable Audio Streaming A Framework for Robust and Scalable Audio Streaming Ye Wang, Wendong Huang, Jari Korhonen School of Computing, National University of Singapore {wangye, huangwd, jari}@comp.nus.edu.sg ABSTRACT We propose

More information

From Concept to Production in Secure Voice Communications

From Concept to Production in Secure Voice Communications From Concept to Production in Secure Voice Communications Earl E. Swartzlander, Jr. Electrical and Computer Engineering Department University of Texas at Austin Austin, TX 78712 Abstract In the 1970s secure

More information

Polycom Video Communications

Polycom Video Communications Polycom Video Communications Advanced Audio Technology for Video Conferencing Author: Peter L. Chu September 2004 Connect. Any Way You Want. If you ask frequent users of video conferencing technology what

More information

What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing

What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing What Audio Engineers Should Know About Human Sound Perception Part 2. Binaural Effects and Spatial Hearing AES 112 th Convention, Munich AES 113 th Convention, Los Angeles Durand R. Begault Human Factors

More information

Analog Representations of Sound

Analog Representations of Sound Analog Representations of Sound Magnified phonograph grooves, viewed from above: The shape of the grooves encodes the continuously varying audio signal. Analog to Digital Recording Chain ADC Microphone

More information

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter An Optimised Software Solution for an ARM Powered TM MP3 Decoder By Barney Wragg and Paul Carpenter Abstract The market predictions for MP3-based appliances are extremely positive. The ability to maintain

More information

Lecture 1-10: Spectrograms

Lecture 1-10: Spectrograms Lecture 1-10: Spectrograms Overview 1. Spectra of dynamic signals: like many real world signals, speech changes in quality with time. But so far the only spectral analysis we have performed has assumed

More information

White Paper: An Overview of the Coherent Acoustics Coding System

White Paper: An Overview of the Coherent Acoustics Coding System White Paper: An Overview of the Coherent Acoustics Coding System Mike Smyth June 1999 Introduction Coherent Acoustics is a digital audio compression algorithm designed for both professional and consumer

More information

Understanding Compression Technologies for HD and Megapixel Surveillance

Understanding Compression Technologies for HD and Megapixel Surveillance When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance

More information

high-quality surround sound at stereo bit-rates

high-quality surround sound at stereo bit-rates FRAUNHOFER Institute For integrated circuits IIS MPEG Surround high-quality surround sound at stereo bit-rates Benefits exciting new next generation services MPEG Surround enables new services such as

More information

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A International Journal of Science, Engineering and Technology Research (IJSETR), Volume, Issue, January SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A N.Rama Tej Nehru, B P.Sunitha

More information

CM0340 SOLNS. Do not turn this page over until instructed to do so by the Senior Invigilator.

CM0340 SOLNS. Do not turn this page over until instructed to do so by the Senior Invigilator. CARDIFF UNIVERSITY EXAMINATION PAPER Academic Year: 2008/2009 Examination Period: Examination Paper Number: Examination Paper Title: SOLUTIONS Duration: Autumn CM0340 SOLNS Multimedia 2 hours Do not turn

More information

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification (Revision 1.0, May 2012) General VCP information Voice Communication

More information

Best practices for producing quality digital video files

Best practices for producing quality digital video files University of Michigan Deep Blue deepblue.lib.umich.edu 2011-03-09 Best practices for producing quality digital video files Formats Group, Deep Blue http://hdl.handle.net/2027.42/83222 Best practices for

More information

Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem

Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem Starlink 9003T1 T1/E1 Dig i tal Trans mis sion Sys tem A C ombining Moseley s unparalleled reputation for high quality RF aural Studio-Transmitter Links (STLs) with the performance and speed of today s

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2720 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral

GSM speech coding. Wolfgang Leister Forelesning INF 5080 Vårsemester 2004. Norsk Regnesentral GSM speech coding Forelesning INF 5080 Vårsemester 2004 Sources This part contains material from: Web pages Universität Bremen, Arbeitsbereich Nachrichtentechnik (ANT): Prof.K.D. Kammeyer, Jörg Bitzer,

More information

Introduzione alle Biblioteche Digitali Audio/Video

Introduzione alle Biblioteche Digitali Audio/Video Introduzione alle Biblioteche Digitali Audio/Video Biblioteche Digitali 1 Gestione del video Perchè è importante poter gestire biblioteche digitali di audiovisivi Caratteristiche specifiche dell audio/video

More information

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services ARIB STD-T-C.S00 v.0 Circuit-Switched Video Conferencing Services Refer to "Industrial Property Rights (IPR)" in the preface of ARIB STD-T for Related Industrial Property Rights. Refer to "Notice" in the

More information

Chapter 6: Broadcast Systems. Mobile Communications. Unidirectional distribution systems DVB DAB. High-speed Internet. architecture Container

Chapter 6: Broadcast Systems. Mobile Communications. Unidirectional distribution systems DVB DAB. High-speed Internet. architecture Container Mobile Communications Chapter 6: Broadcast Systems Unidirectional distribution systems DAB DVB architecture Container High-speed Internet Prof. Dr.-Ing. Jochen Schiller, http://www.jochenschiller.de/ MC

More information

HIGH QUALITY AUDIO RECORDING IN NOKIA LUMIA SMARTPHONES. 1 Nokia 2013 High quality audio recording in Nokia Lumia smartphones

HIGH QUALITY AUDIO RECORDING IN NOKIA LUMIA SMARTPHONES. 1 Nokia 2013 High quality audio recording in Nokia Lumia smartphones HIGH QUALITY AUDIO RECORDING IN NOKIA LUMIA SMARTPHONES 1 Nokia 2013 High quality audio recording in Nokia Lumia smartphones HIGH QUALITY AUDIO RECORDING IN NOKIA LUMIA SMARTPHONES This white paper describes

More information

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 4, JULY 2003 365 High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform Dai Yang, Member, IEEE, Hongmei Ai, Member, IEEE, Chris

More information

MP3/mp3PRO plug-in. How you can make an audio CD from mp3 or mp3pro files

MP3/mp3PRO plug-in. How you can make an audio CD from mp3 or mp3pro files MP3/mp3PRO plug-in How you can make an audio CD from mp3 or mp3pro files...1 The mp3pro encoder...2 How you can make your own mp3pro files with Nero...3 How you can make your own MP3 files with Nero...12

More information