Digital Multi-Channel Audio Compression and Metadata

Similar documents
All About Audio Metadata. The three Ds: dialogue level, dynamic range control, and downmixing

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6

Loudness and Dynamic Range

TECHNICAL OPERATING SPECIFICATIONS

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting

TV 2 AS HD DELIVERY FOR TV 2 AS

5.1 audio. How to get on-air with. Broadcasting in stereo. the Dolby "5.1 Cookbook" for broadcasters. Tony Spath Dolby Laboratories, Inc.

LOUDNESS MONITORING AND CALM COMPLIANCE (UNFOLDING EVENTS) Jim Welch. IneoQuest Technologies

Monitoring Surround-Sound Audio

Multichannel Audio Line-up Tones

DeNoiser Plug-In. for USER S MANUAL

MPEG-H Audio System for Broadcasting

Technical Paper. Dolby Digital Plus Audio Coding

RECOMMENDATION ITU-R BR *, ** Parameters for international exchange of multi-channel sound recordings with or without accompanying picture ***

Digital terrestrial television broadcasting Audio coding

Frequently asked QUESTIONS. about DOLBY DIGITAL

How To Control Loudness On A Tv Or Radio

DAB + The additional audio codec in DAB

ATSC Recommended Practice: Techniques for Establishing and Maintaining Audio Loudness for Digital Television

Audio DSP Features. 5.1-Ch Loudness Processing. Upmix. Dolby D+ 3/2L Encode L R. LFE Ls Rs. Figure 1 Example Routing and Dolby Re-Encode Processing

SoundCode For Dolby Digital 2

Multichannel stereophonic sound system with and without accompanying picture

The loudness war is fought with (and over) compression

AUDIO/VIDEO MULTI-CHANNEL RECEIVER VSX-D411 VSX-D511

Audio Engineering Society. Convention Paper. Presented at the 128th Convention 2010 May London, UK

Preparing for the Broadcast Analog Television Turn-Off: How to Keep Cable Subscribers TVs from Going Dark

COMMON SURROUND SOUND FORMATS

Multichannel audio: From studio to listener

BASS MANAGEMENT SYSTEM

DTS-HD Audio. Consumer White Paper for Blu-ray Disc and HD DVD Applications

Sonic Studio Strix Series. User Guide

Datasheet EdgeVision

UNIVERSITY OF CALICUT

The AAC audio Coding Family For

AUDIO CODING: BASICS AND STATE OF THE ART

Digital Audio Compression: Why, What, and How

high-quality surround sound at stereo bit-rates

DELIVERY SPECIFICATIONS FOR COMMERCIALS AND BILLBOARDS Release January 2016

Loudness. The second line of defence. don t forget the distribution chain! Richard van Everdingen Delta Sigma Consultancy

TECHNICAL SPECIFICATIONS FOR PROGRAM DELIVERY

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

Audiometry and Hearing Loss Examples

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

Overview. Dolby PC Entertainment Experience v4

Dolby DP600 and DP600-C Program Optimizer Overview for Cable and IPTV Operators

Sound System Buying Guide

How To Test Video Quality With Real Time Monitor

Technical Recommendations. CST - RT 017 TV v Technical Recommendations for Ready For Broadcast Broadcasters (CST/FICAM/HDFORUM)

Convention Paper 7896

Technical Specifications for Standard Definition Programmes

Engineering Bulletin

The Waves Dorrough Meter Collection. User Guide

What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing

Receiver Customization

Creating Content for ipod + itunes

Home Theater System HT-DDW650. Operating Instructions. Owner s Record (1) 2003 Sony Corporation

Dolby DP600 and DP600-C Program Optimizer Overview for Postproduction Facilities

Tutorial about the VQR (Voice Quality Restoration) technology

RX-6010RBK / RX-6012RSL

DENSITÉ SERIES XVP-3901

Case Study: Real-Time Video Quality Monitoring Explored

JBL CINEMA BASE. Home Cinema 2.2 all-in-one soundbase for television OWNER S MANUAL

INTRODUCTION. Please read this manual carefully for a through explanation of the Decimator ProRackG and its functions.

Wireless. with Personal Mix Control and EP3 Dynamic Earphones

bel canto The Computer as High Quality Audio Source A Primer

Design concepts and understanding a stereo surround audio system for the Science On the Sphere. By David Eltzroth

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Multichannel stereophonic sound system with and without accompanying picture

Polycom Video Communications

HE-AAC v2. MPEG-4 HE-AAC v2 (also known as aacplus v2 ) is the combination of three technologies:

multi-channel SURROUND SOUND FROM A single component...

Audio Coding Algorithm for One-Segment Broadcasting

Any Time, Anywhere Audio

Authoring for Dolby Atmos Cinema Sound Manual

DTS Enhance : Smart EQ and Bandwidth Extension Brings Audio to Life

Radio TV Forum 2014 Rohde & Schwarz Solutions

Preservation Handbook

RECORDING AND CAPTURING SOUND

T 175 AV Tuner Preamplifier

Audio Coding Introduction

LT-82 Stationary IR Transmitter

Loudness Normalization: The Future of File-Based Playback

Receiver Customization

HIGH PERFORMANCE CAR AUDIO MODEL:CR-82 8 CHANNEL ELECTRONIC CROSSOVER NETWORK SYSTEM.

R S E R I E S M I X E R S R R R

User Guide. VT1708A VIA HD Audio Adeck For Windows 2000, Windows XP & Server Jun Revision 1.1e

Hear The Future...Now! SIEM-2T/SIEM-2R

HIGH QUALITY AUDIO RECORDING IN NOKIA LUMIA SMARTPHONES. 1 Nokia 2013 High quality audio recording in Nokia Lumia smartphones

PRIMER ON PC AUDIO. Introduction to PC-Based Audio

Cable TV Headend Solutions

The TASA Standard. Recommendations from the TASA Ad Hoc Committee for regulating motion picture trailer audio volume. INTRODUCTION

Your Hearing ILLUMINATED

High Quality Podcast Recording

Audio and Video Synchronization:

Welcome to the United States Patent and TradeMark Office

AI Audio 2 (SoundMAX High Definition Audio utility)

DOLBY SR-D DIGITAL. by JOHN F ALLEN

Transcription:

Digital Multi-Channel Audio Compression and Metadata Dolby E and Dolby Digital (AC3) surround sound, concept of Dolby metadata #1 assuredcommunications Feb 19, 20

Digital Audio Compression Multi-Channel Programming Enhances the home listening experience Desired by consumers Extra features Additional languages, SAP Emergency audio services Descriptive comments for viewing impaired and so on Backward compatibility to Mono, Stereo #2 assuredcommunications Feb 19, 20

Implementation L R C Ls Rs LFE Lt Rt VTR / Server Codec / Encoder Typical 8 Channels (4 AES pr.) L R C Ls Rs LFE Lt Rt Metadata Using un-compressed signals, all channels of VTR / Server used. Requires 8 levels of audio router. External connection of serial Metadata stream (routing?) #3 assuredcommunications Feb 19, 20

Implementation 5.1 (+2) Lt/Rt VO EFX SAP-1 SAP-2 Other? Other? VTR / Server Codec / Encoder Typical 8 Channels (4 AES pr.) 5.1 (+2) Lt/Rt VO EFX SAP-1 SAP-2 Other? Other? Metadata Using compressed 5.1 signals, allows more programs and monitoring of Lt/Rt downmix without decoder. #4 assuredcommunications Feb 19, 20

Digital Audio Compression Convenience Digitally encoded and compressed signals less susceptible to impairments due to signal path Impulse Noise Ground Loops Random Noise Amplitude/Phase Distortion Single stream can be transported in video ancillary Reduce physical routing layers ($$) A/V timing advantages #5 assuredcommunications Feb 19, 20

Digital Audio Compression Perceptual Coding (AC-3) Reduce the data rate of a digital audio signal without introducing any perceivable audible changes. Several physiological limitations of the human hearing system. Predicts which sounds your ears will and will not hear, and only encodes audible sounds. Uses the human ear s - hearing threshold phenomenon. Ear is not equally sensitive at all frequencies Detect quiet signals in the 2 khz - 4 khz midrange Less sensitive to quiet signals at very low or very high frequencies #6 assuredcommunications Feb 19, 20

Digital Audio Compression Perceptual Coding (AC-3) Examples: Louder Sounds Mask Quiet Sounds (Relative) Orchestra, flutes quiet during loud passage - only hear flutes if you are in the orchestra Out of doors - listen to a bird sing - truck passes Some sounds under threshold of hearing #7 assuredcommunications Feb 19, 20

Digital Audio Compression Perceptual Coding (AC-3) Low Frequencies db Speech and Primary Frequencies Harmonics and Incidental High Frequency Content Eliminated with Perceptual Coding 0 Hz 100 Hz 500 Hz 1 Khz 10K Hz 20K Hz Frequency Threshold of Hearing #8 assuredcommunications Feb 19, 20

Metadata Metadata used to shape consumer listening experience to particular requirements Transmission Bitstream - Intended for the transmission of audio to the home through digital television broadcast (either high or standard definition), Set Top Box, DVD, or other media. Defines a single channel of audio through a full 5.1-channel program, including Metadata in both D-TV and DVD. AC-3 designed for maximum fidelity and space efficiency, and only passes through one encode/decode cycle. #9 assuredcommunications Feb 19, 20

Metadata Additional control information carried with the encoded audio program and provides essential information about the audio to an AC-3 decoder. Data describing the audio data format Created at time of program origination Provides many important functions including the three D s : Dynamic range control Dialnorm Downmix Complete Control of final audio listening environment! #10 assuredcommunications Feb 19, 20

Metadata Flow Receiver Metadata Modifies Listening Levels Audio Mixer Audio Multi-Channel Encoder Audio Metadata VTR / Server Audio Metadata MPEG Encoder Mix set in Production Video #11 assuredcommunications Feb 19, 20

Metadata Dialog Level (Dialog Normalization or Dialnorm) Dialnorm (loudness uniformity) Describes the average program volume Level variations are undesirable Between different programs Between program segments (station breaks and commercials) Metadata contains Dialnorm value, used by decoder #12 assuredcommunications Feb 19, 20

Metadata Dialog Level (Dialog Normalization or Dialnorm) Set by the program producer or the broadcaster Defined as the level of normal spoken dialogue with respect to Full Scale Digital Dialnorm values range between -31(no level shift in the home decoder) to -1 (maximum level shift in the home decoder) Dialnorm also applies to other types of program material, like music videos and concerts #13 assuredcommunications Feb 19, 20

Metadata Dialog Level (Dialog Normalization or Dialnorm) Turn It Down! Output Level Program 1 Dialog Level Program 2 Dialog Level Comfortable Listening Level Input Level #14 assuredcommunications Feb 19, 20

Metadata Dialog Level, Dialog Normalization, or Dialnorm Why -27dB? Film Soundtrack Dialog Level 78 db AC-3 has 105 db of dynamic range, Loudest level is "0 db, Quietest level is "-105 db". -27 db aligns with movie soundtracks in that 78 db (above silence) 105-27=78 is an accepted level for speech. Dialnorm = value means the level that dialogue is lower than the peak (0 db), Value of "-31" is 31 db below the peak (the value at which no volume adjustment is performed by a consumer decoder). A value of -27 causes the decoder to reduce the program by 4 db AC-3 Output Level #15 assuredcommunications Feb 19, 20

Metadata Downmix Three types of downmix: Surround downmix Lt / Rt Left total / Right total, for Pro Logic compatibility No Monaural Compatibility Stereo downmix Lo / Ro Left only / Right only, possibly for headphones Correct audio phase, is monaural compatible Mono downmix (DVB-H) From Lo / Ro Metadata controls C and S mix level #16 assuredcommunications Feb 19, 20

Metadata DRC, Dynamic Range Control Some listeners want full dynamic range Some listeners do not! listening conditions vary ambient noise problems late night listening the kids are asleep the neighbors are complaining #17 assuredcommunications Feb 19, 20

Metadata DRC, Dynamic Range Control The level of audio that falls above the dialog area (as defined by the dialnorm value) is cut. -31 consumer dialnorm Output Level Dialog Level Audio that falls within the dialog area (a.k.a. the null band ) is unaffected. The level of audio that falls below the dialog area (as defined by the dialnorm value) is boosted. Input Level #18 assuredcommunications Feb 19, 20

Metadata Compression (AC-3 decoder) RF Link Heavy Protect peak levels, transmission paths with small dynamic range Line Light Use in noisy environments, small reduction of dynamics None Automatic or User Controlled (Set top box design) #19 assuredcommunications Feb 19, 20

AC3 Metadata Variables #20 assuredcommunications Feb 19, 20

Summary Metadata - Specific Data about the audio data Controls listening experience Contains Information about the content Program Name Stream Type Time Reference Controls downmixing Center Surround Provides some user control #21 assuredcommunications Feb 19, 20

Questions? TVM Series VTM Series #22 assuredcommunications Feb 19, 20