Levelling and Loudness



Similar documents
Loudness and Dynamic Range

RECOMMENDATION ITU-R BR *, ** Parameters for international exchange of multi-channel sound recordings with or without accompanying picture ***

Target Loudness Level and Dynamic Range

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting

Audio Loudness Normalization and Measurement of Broadcast Programs

Multichannel Audio Line-up Tones

R 128 LOUDNESS NORMALISATION AND PERMITTED MAXIMUM LEVEL OF AUDIO SIGNALS

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6

Engineering Bulletin

Multichannel stereophonic sound system with and without accompanying picture

RECOMMENDATION ITU-R BS *,** Audio quality parameters for the performance of a high-quality sound-programme transmission chain

Convention Paper 7896

DAB + The additional audio codec in DAB

RECOMMENDATION ITU-R BS.704 *, ** Characteristics of FM sound broadcasting reference receivers for planning purposes

Red Bee Media. Technical file and tape delivery specification for Commercials. Applies to all channels transmitted by Red Bee Media

Current Status and Problems in Mastering of Sound Volume in TV News and Commercials

1 Multi-channel frequency division multiplex frequency modulation (FDM-FM) emissions

MPEG-H Audio System for Broadcasting

5.1 audio. How to get on-air with. Broadcasting in stereo. the Dolby "5.1 Cookbook" for broadcasters. Tony Spath Dolby Laboratories, Inc.

DVB-T. C/N in a static Rayleigh channel. C/N values for portable single and diversity reception. Measurement method.

Application Note. Introduction to. Page 1 / 10

The recording angle based on localisation curves

The Waves Dorrough Meter Collection. User Guide

Convention Paper Presented at the 112th Convention 2002 May Munich, Germany

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Program Loudness: Nuts & Bolts

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Timing Errors and Jitter

RECORDING AND CAPTURING SOUND

Electronic Communications Committee (ECC) within the European Conference of Postal and Telecommunications Administrations (CEPT)

Sound-field parameter

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Audio Coding Algorithm for One-Segment Broadcasting

BASS MANAGEMENT SYSTEM

UNIVERSITY OF CALICUT

CBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC

Technical Specifications for Standard Definition Programmes

STI 9 Edition 2/October 2000

Audio Engineering Society. Convention Paper. Presented at the 127th Convention 2009 October 9 12 New York, NY, USA

How To Make A Radio Accessible To People With Disabilities

DT3: RF On/Off Remote Control Technology. Rodney Singleton Joe Larsen Luis Garcia Rafael Ocampo Mike Moulton Eric Hatch

Hierarchical. modulation. 1. Principles of DVB-T and hierarchical modulation Parameters of DVB-T

MSB MODULATION DOUBLES CABLE TV CAPACITY Harold R. Walker and Bohdan Stryzak Pegasus Data Systems ( 5/12/06) pegasusdat@aol.com

DVB-T BER MEASUREMENTS IN THE PRESENCE OF ADJACENT CHANNEL AND CO-CHANNEL ANALOGUE TELEVISION INTERFERENCE

Loudness. The second line of defence. don t forget the distribution chain! Richard van Everdingen Delta Sigma Consultancy

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.

22.2 Multichannel Audio Format Standardization Activity

How To Control Loudness On A Tv Or Radio

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

Born-digital media for long term preservation and access: Selection or deselection of media independent music productions

RECOMMENDATION ITU-R BR *, ** Analogue composite television tape recording

Direct Digital Amplification (DDX ) The Evolution of Digital Amplification

Where to go with digital sound broadcasting

Analog-to-Digital Voice Encoding

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA

AM/FM/ϕM Measurement Demodulator FS-K7

RECOMMENDATION ITU-R BR Analogue composite television tape recording

COMPATIBILITY AND SHARING ANALYSIS BETWEEN DVB T AND RADIO MICROPHONES IN BANDS IV AND V

BROADCASTING ACT (CHAPTER 28) Code of Practice for Television Broadcast Standards

Digital terrestrial television broadcasting Audio coding

Audio and Video Synchronization:

All About Audio Metadata. The three Ds: dialogue level, dynamic range control, and downmixing

DVB-T. The echo performance of. receivers. Theory of echo tolerance. Ranulph Poole BBC Research and Development

Audio Engineering Society. Convention Paper. Presented at the 128th Convention 2010 May London, UK

ATSC Recommended Practice: Techniques for Establishing and Maintaining Audio Loudness for Digital Television

CLIO 8 CLIO 8 CLIO 8 CLIO 8

MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND

PCM Encoding and Decoding:

Convention Paper Presented at the 118th Convention 2005 May Barcelona, Spain

How To Test Video Quality With Real Time Monitor

Technical Instruction Manual

For Articulation Purpose Only

Making Spectrum Measurements with Rohde & Schwarz Network Analyzers

RECOMMENDATION ITU-R BO.786 *

TECHNICAL OPERATING SPECIFICATIONS

MONITOR STATION. Studio Control Center. User s Manual v , PreSonus Audio Electronics, Inc. All Rights Reserved.

AUDIO CODING: BASICS AND STATE OF THE ART

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

Aus: von Jon Risch. The New Test Signal Phi Spectral Contamination

TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS

Understanding the DriveRack PA. The diagram below shows the DriveRack PA in the signal chain of a typical sound system.

Minimum requirements for DVB-T receiving antennas for portable indoor and portable outdoor reception

MATRIX TECHNICAL NOTES

DAB Digital Radio Broadcasting. Dr. Campanella Michele

for the Operation of Digital Cable Television Systems

White Paper. PESQ: An Introduction. Prepared by: Psytechnics Limited. 23 Museum Street Ipswich, Suffolk United Kingdom IP1 1HN

SSL 4000 Collection. SSL E-Channel & G-Channel User Manual

RESULTS OF TESTS WITH DOMESTIC RECEIVER IC S FOR DVB-T. C.R. Nokes BBC R&D, UK ABSTRACT

Sound Quality Aspects for Environmental Noise. Abstract. 1. Introduction

DIGITAL-TO-ANALOGUE AND ANALOGUE-TO-DIGITAL CONVERSION

Tutorial about the VQR (Voice Quality Restoration) technology

T = 1 f. Phase. Measure of relative position in time within a single period of a signal For a periodic signal f(t), phase is fractional part t p

Preservation Handbook

Waveform Monitor Audio & Loudness Logging

Loudness Normalization: The Future of File-Based Playback

Product Information S N O. Portable VIP protection CCTV & Alarm System 2

Introduction to FM-Stereo-RDS Modulation

MICROPHONE SPECIFICATIONS EXPLAINED

Transcription:

Levelling and Loudness Gerhard Spikofski and Siegfried Klar Institut für Rundfunktechnik GmbH (IRT) in radio and television broadcasting Sudden differences in loudness between and even within radio and television programmes have been well known for a long time. With the more-recent introduction of digital techniques, combined with the parallel transmission of digital and analogue broadcasts, this problem is again becoming highly significant. This article presents some solutions for avoiding loudness differences in radio and television broadcasting, based on levelling recommendations and a newly-developed loudness algorithm. Listeners and viewers are becoming increasingly concerned over sudden variations in programme loudness. These loudness jumps are most apparent when zapping through European DVB television and radio channels. The loudness differences between film dialogues and highly-compressed commercial breaks (adverts) are perceived as being particularly jarring. Both under-levelling and over-levelling can be observed, resulting in level differences of more than 15 db. The reasons for such programme level and loudness variations are, among others:! apparent inexperience in the levelling of sound channels;! the use of different and sometimes non-standardized programme level meters;! no standardized loudness meter has been available until now;! archive material (both analogue and digital) has not, to any great extent, been adapted to the types of sound channel being used today. In FM radio broadcasting, loudness is mainly balanced or controlled by means of compressors and limiters that prevent the frequency deviation of the transmitter from exceeding the permissable limits. In the case of digital broadcasting, it should also be possible to achieve balanced loudness profiles by following the existing international recommendations of the ITU and the EBU. These profiles should be met not only when comparing different programmes / channels, but also between different contributions within any single programme. Characteristics of radio programme meters Alignment level ITU Recommendation ITU-R BS.645-2 [1] defines the programme level of radio channels by means of an alignment signal (1 khz sine wave). The specified level of the sine wave corresponds approximately to fullscale programme level, in terms of loudness. EBU TECHNICAL REVIEW January 24 1 / 12

Table 1 Audio levels in studio and transmission environments AUDIO LOUDNESS Recommendations for analogue & digital audio levels ITU-R BS.645-2 Transmission Level (international) ARD HFBL-K Studio Level (national) US Reference Level (national) EBU digital Transmission & Studio Level (international) Alignment Level (AL) 9 db (35%) dbu b Nominal Level (PML) a db (1%) +9 dbu 3 dbu (adaptation) +6 dbu (adaptation) +4 dbu (adaptation) 18 dbfs 9 dbfs c a. PML = Permitted Maximum Level b. dbu =.775 V rms (sine wave) = 1.1 V peak c. dbfs = Clipping Level (FS = Full Scale) As the alignment signal is static, it can be measured by means of typical RMS meters as well as specific programme meters. It should be noted that the analogue alignment level (AL), and the nominal or permitted maximum level (PML), are specified diversely due to the different national and international recommendations in use (see Table 1). In the case of digital audio channels, the relationship between the alignment signal and the full-scale or clipping level was already specified in 1992 (EBU Rec. R68 [2]). When following this recommendation, the difference between full-scale (or clipping) level and the alignment level is 18 db (Table 1). In other words, the alignment level should be 18 dbfs. Audio programme meters for broadcasting Today, many different programme meters are in use at professional studios, with widely varying ballistical features (see Table 2 and Fig. 1b). Table 2 Programme meters used in international transmission and studio environments Programme Meter Type Recommendation VU Meter ANSI C 16.5 IEC 268-17 DIN PPM (QPPM) BBC PPM (QPPM) EBU PPM Std (QPPM) EBU Digi PPM (QPPM) IRT Digi PPM (QPPM) DIN 4546 IEC 268-1/1 ARD Pfl.H.3/6 IEC 268-1 / IIa EBU 325 E IEC 268-1 / IIb EBU IEC 268-18 IRT proposal PML a 1% VU + dbu dbr +9 dbu 6' +8 dbu +9 db +9 dbu Limit Level +16 dbr +25 dbu Scale 2 to +3 [db] 5 to +5 [db] 1 to 7 [ ] 12 to +12 [db] 9 dbfs dbfs 4 to + [db] dbr 1% +1 dbr 5 to +1 [db] Attack time (integration) Decay time (fall-back) Invisible peaks 3 ms / 9% 3 ms / 1% +13... +16 db 1 ms / 9% 5 ms / 8% 2 db / 1.5 s =13dB/s 1 ms / 8% 24 db / 2.8 s =8.6dB/s 1 ms / 8% 24 db / 2.8 s =8.6dB/s 5 ms / 8% 5... 1 ms / to 8% 2 db / 1.7 s =12dB/s 2 db / 1.7 s =12dB/s +3... +4 db +4... +6 db +4... +6 db +3... +4 db +3... +4 db a. Permitted Maximum Level (PML): 1% Modulation = +9 dbu = 9 dbfs for transmission lines [1][2] +6 dbu ARD Nominal Studio Level [3]. EBU TECHNICAL REVIEW January 24 2 / 12

Broadcast peak programme meter for analogue and digital audio Switchable attack time QPPM (5 ms) / Fast PPM ( ms) Modulation Range Headroom Programme level 9% 35% 1% PPM FAST >5 ms 8% indication 51 relative level 21 9 +5 +9 db >5 ms 8% indication 9% 35% 1% 18% L R AUDIO LOUDNESS Whereas in America and Australia, VU meters [4] are mainly used, the peak programme meter (PPM) is recommended by the EBU [5] for use in European countries. They are specified in the following IEC recommendations:! IEC 268-1 [6] (analogue PPM);! IEC 268-18 [7] (digital PPM). 6 digital level 18 9 dbfs 45 analogue level ITU-R +9 +14 +18 dbm 45 analogue level ARD 3 +6 +11 +15 dbm IRT/ AS - sk EBU Alignment Level Figure 1a Recommended broadcast peak-programme meter clipping level single peak IRT/ AS - sk QPPM headroom Ls AL PPM FAST ms to 1% indication 3 6 9 18 dbfs DIN 5 ms to 8% indication IEC QPPM +9 +6 +3 BBC 1 ms to 8% indication +12 +9 9 VU ARD HFBL-K +9 dbm = 3.9 V peak different attack times 3 ms integ. +3 db dbµ db 24 15 6 6 Ballistical behaviour of programme meter types 9 dbfs 18 dbfs The IEC category of PPM is the so-called quasi-peak programme meter (QPPM) which neglects any short-duration signal variations. For digital PPMs, the EBU recommends almost the same ballistical characteristics as that described in IEC 268-1 (Type 1). Since the introduction of digital audio techniques in broadcasting, additional but not precisely specified PPMs have caused some confusion. Besides their different scale layouts, these PPMs primarily vary in their ballistical features described by parameters such as attack time or integration time, and fall-back time or decay time. Table 2 shows the PPMs that are currently used in Europe. Regarding the layout of the scale, the fullscale tag (1% tag = db) and also the specified headroom should take into account the attack time of the programme meter. As an example, the VU meter which can be considered as relatively slow obviously needs an appropriate headroom because of the invisible signal peaks. Consequently, the difference between the 1% tag and the alignment level has to be smaller than in the case of other meter types. Note: The attack time of the PPM used by the German broadcasters ARD and ZDF [8] is specified as 1 ms / 9%. This means that it takes 1 ms to reach the 9% tag. The IEC meter type which is used by the BBC is specified slightly differently (1 ms / 8%). In the case of the fast digital sample programme meter (SPPM), theoretically no headroom is needed. These meters are appropriate for controlling signal peaks with respect to clipping but they are not as suitable as QPPMs for normal programme levelling. For example, signals with a high proportion of peaks tend to be under-levelled whereas heavily-compressed signals with limited peaks tend to be over-levelled. This can result in huge jumps in loudness, which seem to be more intensive than when using a QPPM. The use of unspecified level meters is widely observed in the digital audio field. If sound engineers are reasonably familiar with a particular level meter, the use of an unspecified device could result in severe levelling EBU TECHNICAL REVIEW January 24 3 / 12 1% VU different headrooms 1% QPPM Loudness Level Alignment Level (35%) PPM display decay = 12 db/s Figure 1b Ballistical characteristics of different broadcast programme meters

mistakes such as clipping and loudness jumps. Because unspecified instruments offer a wide gamut of characteristics, it is difficult to become familiar with them and gain sufficient experience in levelling. Digital programme meters are frequently software applications. As is well known from such applications, there are infinite error sources. Because the attack time tends towards ms, this means that the peak samples are indicated correctly. However, there are wide variations in the decay time. Those effects can result in different displays as well as differences in level. In Germany, the QPPM is precisely specified in ARD-Pflichtenheft 3/6 [8]. This meter is recommended for the levelling of both analogue and digital audio signals. Additional PPMs with attack times shorter than 1 ms are also specified here, but should only be used for monitoring not for levelling. In order to avoid confusion, the IRT recommends that the scale layout of the digital PPM is adapted to that of the analogue QPPM [6] (Fig. 1a, Table 2). That means that the 1% tag has to be 9 db below full scale. Dynamic range of digital audio systems Programme levelling and headroom As already mentioned, the levelling range and the necessary headroom depend on the ballistical features of the meter in use. Whereas VU meters need up to 18 db headroom, PPMs only require 9 db [1][9][1]. The 9 db headroom of the EBU PPM is strictly linked to QPPMs that accord with [1] and the alignment level specified in [2]. Using instruments with different ballistical features obviously results in other headroom recommendations. Headroom has to be considered as a buffer range between the nominal and clipping levels. If the European recommendation is followed, the exchange of programme material is guaranteed to have no levelling problems. German broadcasters have accepted this recommendation and the headroom is specified in document ARD HFBL-K Rec. 15 IRT [3], which accords with the EBU recommendation. In the case of analogue signals and also devices that involve A/D and D/A conversions, the absolute audio limit at German broadcast studios is +15 dbu (1% tag = +6 dbu, plus another 9 db headroom) (see Fig. 1a). Usable dynamic range objective and subjective considerations When discussing headroom and footroom [1], the question always arises whether the resulting system dynamics are sufficient to accommodate the full dynamic range of the human ear. In other words, which quantization level or how many bits are necessary to guarantee the transmission of music signals without any perceivable noise. One answer to this question was given in a paper published in 1985 [11]. In the following section, the conditions and results of this 1985 study are presented. This investigation was conducted before the era of bitrate reduction systems such as MiniDisc (ATRAC), MPEG-1 Layer 2 (mp2) and Layer 3 (mp3). Bitrate reduction systems are therefore not considered in this context. Compared to PCM (Pulse Code Modulation) systems, the aforementioned bitrate reduction systems evidently need less quantization. The fact that they allow noise-free recordings nevertheless shows that, in these cases, other quality features have to be considered. In PCM systems, the dynamic range of a system is defined as the level differences between full-scale programme level and the inherent noise level of the system. The dynamic range, the signal-to-noise ratio and the quantization noise can be calculated by means of the following formula:... where n = quantization level (number of bits). S/N [db] = 6n + 2 EBU TECHNICAL REVIEW January 24 4 / 12

Table3 Achievable signal-to-noise ratios for different quantizations and noise-level measurements Noise voltage level 16-bit 2-bit 24-bit RMS (db) 98 122 146 DIN 45 45 (db) [12] 9 114 138 ITU 468 (dbqps) [13] 86 11 134 The calculated value with a negative sign corresponds to the RMS value of the quantization noise, relative to dbfs programme level (the Full Scale / Clipping Level of a digital system). Table 3 shows the RMS noise values for three typical quantizations. These absolute values represent the maximum dynamic range (in db) for each of the three quantizations shown in the table. If we consider a headroom of 9 db [2] and a footroom of 2 db [1], the derived values for dynamic range are shown in Fig. 2 as a function of quantization. Usable system dynamics (db) 14 12 1 8 6 4 2 Footroom Effectively usable system dynamics Headroom Objective measurement pmax = dbfs Quantization noise 16-bit system 2-bit system 24-bit system CD Loudspeaker 9% values Subjective measurement SPLmax = 1 dba Perception limit SPLmax = 14 dba Headphone (cumulative) Figure 2 Usable dynamic range of digital PCM systems both subjective and objective In principle, the reference values for the dynamic range the maximum full-scale programme level on the one hand and the system noise level on the other correspond to certain sound pressure levels in the reproduction of music signals. The relevant sound pressure levels are the maximum listening level and the just imperceptible noise level. These two levels were actually investigated separately by the IRT, despite the fact that they are linked together as system features. The five selected test items (female speech, male speech, orchestral, string quartet and rock music) were only used to determine the maximum listening level. The representative noise signals (idle channel noises, white noise, etc.) were investigated in the absence of programme signals. That meant that the disturbing noises were only assessed during music pauses without having to consider the masking effect that would occur in the presence of programme signals. Subjective experiments were carried out with 2 normal listeners in individual sessions. The listening setup met the requirements for professional listening evaluations, including stereo loudspeaker and headphone reproduction [14][15]. The results of the investigation are presented in Fig. 2. The two aforementioned reference values correspond to (i) the 9% value of the cumulative frequency distribution of the maximum listening levels (dba) and (ii) the average value of the individual perception limits for the system noises that were investigated. In the left part of Fig. 2, the relationship between quantization and system dynamics is shown for three linear PCM systems (16-bit, 2-bit and 24-bit). In each case, the recommended headroom of 9 db and footroom of 2 db have been included. The results show that a linear 16-bit system, such as CD, just meets the requirements of the human ear for loudspeaker reproduction. In the case of headphone reproduction, the human requirements are only met if the headroom allowance is relinquished which is normally the case with CD production today. Consequently, for digital audio studio production, where headroom and footroom are essential, the test results presented show that professional audio production needs at least 18-bit systems. EBU TECHNICAL REVIEW January 24 5 / 12

Programme levelling and loudness Programme levelling AUDIO LOUDNESS Level adjustments are controlled by means of a level meter (e.g. QPPM) such that the maximum programme levels almost meet but do not exceed the 1% tag. In German broadcasting, the level meter QPPM accords with IEC 268-1 [6] and is standardized for both analogue and digital signals. Meeting the 1% tag, which implies a 9 db headroom, guarantees transmissions that are free of distortions. This does not mean that no amplitudes greater than 1% occur. Any short-term peaks that are invisible to the sound engineer should not generally produce clipping because a sufficient headroom of 9 db is provided, as a result of extensive programme signal analysis [9]. Programme loudness As is generally known, the same levelling applied to different programme signals does not normally result in the same loudness impression. This discrepancy is especially evident when comparing music and speech. In order to reach a uniform loudness balance in mixed broadcast programming, special levelling recommendations have been defined following detailed investigations [17][18]. Meeting these recommendations in situations where speech is more important (e.g. magazines, motoring programmes and commercials), the speech should be levelled to db and the music to between 8 db and 4 db. Those recommendations are useful for avoiding extreme loudness differences between and within broadcast programmes. However, adapting the programme loudness to suit the requirements of the human ear cannot always be achieved by this means alone. This is particularly true when using special audio processors. In this case, when adapting the loudness of broadcast programmes to the characteristics of the human ear, an additional loudness meter is necessary along with the level meter which is controlling the technical levels. Although some investigations had been carried out in this field [16][19][2][21], no standardized loudness meter is available at the moment. Loudness corrections today still have to be done manually by the control engineer. This, of course, is not practicable when most of the control functions are handled automatically. However, new investigations have shown that a studio loudness meter may be realizable [21], using new loudness algorithms based on measuring both the signal level and the signal power. The following methods were tested:! loudness measurement RTW [16];! loudness measurement EMMETT [19];! signal level QPPM;! signal power level PWR. The study dealt with both the subjective and objective aspects of loudness measurements. In the former case, psychoacoustic measurements were carried out to determine the subjectively-perceived loudness of the selected broadcast programme material. In the latter case, objective measurements were aimed at deriving rel- Abbreviations A/D Analogue-to-Digital ISO International Organization for Standardization ADR Astra Digital Radio ITU International Telecommunication Union AL Alignment Level MPEG (ISO/IEC) Moving Picture Experts Group dbfs db relative to Full-Scale reading PCM Pulse Code Modulation D/A Digital-to-Analogue PML Permitted Maximum Level DAB Digital Audio Broadcasting (Eureka-147) PPM Peak Programme Meter DSR Digital Satellite Radio QPPM Quasi-Peak Programme Meter DVB Digital Video Broadcasting RMS Root-Mean-Square FM Frequency Modulation SPPM Sample Peak Programme Meter IEC International Electrotechnical Commission VU (Audio) Volume Units EBU TECHNICAL REVIEW January 24 6 / 12

QPPM / Piano QPPM / Piano Cumulative freqency distribution ( %) 1 8 6 4 2 1% level 3% level 5% level QPPM level (db) [ db = 9 dbfs ] 3 6 9 12 15 18 21 1% level 3% level 5% level 3 25 2 15 1 5 Level (db) Figure 3 Principal analysis of the cumulative frequency distribution of objective loudness levels 24 2 4 6 8 1 12 14 16 18 2 Time (sec) Figure 4 QPPM level and cumulative frequency rates under test (test item = piano) Spearman Rank Correlation 1 8 6 4 2 [ A ] 1 % Subjective (average values) and Objective Loudness [ A ] 3 % [ A ] 5 % [ B ] 1 % [ B ] 3 % [ B ] 5 % [ C ] 1 % [ C ] 3 % [ C ] 5 % [ D ] 3 % [ D ] 5 % [ E ] 3 % [ E ] 5 % [ F ] 3 % [ F ] 5 % Spearman Rank Correlation (%) 1% 9% 8% 7% 6% 5% 4% 3% 2% 1% % Subjective Loudness & Analysing Time Cumulative frequency distribution of QPPM = 5% ( 56 items ) 1s 3s 5s 7s 1s Total time Figure 5 Spearman Rank Correlation between subjective (average values) and objective loudness parameters (A - F, not specified, are the six loudness algorithms under test) Figure 6 Correlation between subjective and objective QPPM loudness variation of analysing time evant signal parameters which would allow us to define objective loudness. The performance/accuracy of the objective parameters was assessed by correlating them with the associated subjective loudness values. The test material comprised recordings of DSR (Digital Satellite Radio) with 16 stereo radio programmes, recorded in 1984. Each of the 16 programmes was represented by audio clips of about 15 s duration. The 56 clips eventually chosen contained announcements, orchestral, chamber, piano, vocal and pop music. This selection of clips was considered to be representative of actual radio programming at the time, especially with respect to levelling and audio processing. In order to derive relevant objective parameters for each of the loudness algorithms and programmes, audiolevel histograms (frequency of specific level values within the item duration) were analysed. In each case, the cumulative frequency distribution was plotted, to illustrate how programme levels were being exceeded for 1%, 3% and 5% of the time (Fig. 3). As an example, the measurement of QPPM vs. Time (incorporating the analysed cumulative frequency distribution) is presented in Fig. 4. The measurements were made using the ARD-Pflichtenheft Nr. 3/6 level meter, with 1 ms integration time and a release time of 1.5 s [8]. The criterion used for assessing the performance of these loudness algorithms is the Spearman Rank Correlation between the subjective and objective loudness measurements. Whereas subjective loudness is represented EBU TECHNICAL REVIEW January 24 7 / 12

Spearman Rank Correlation (%) 1 9 8 7 6 5 4 3 2 1 Subjective Loudness & Cumulative Frequency Distribution of QPPM ( 56 items ) 8% level 7% level 6% level 5% level Figure 7 Correlation between subjective and objective QPPM loudness variation of cumulative frequency rate AUDIO LOUDNESS by the average values of the subjective loudness assessments, the corresponding objective parameters are the levels that were exceeded for 1%, 3% and 5% of the time. With reference to Fig. 5, it can be seen that the 5% level displays the highest correlation, for all the algorithms tested. If we consider just the 5% values in Fig. 5, a correlation > 67% is achieved with each algorithm (labelled A - F), whereas algorithms A and B display the highest correlation (78%). Because of the high correlation coefficients of algorithms A and B, and because of the relatively small deviations between the subjective and objective loudness parameters [21], these two algorithms form a good basis for developing a studio loudness meter. It can be stated that these programme meters are in accordance with the meter specified in [8], with an integration time of 1 ms. In order to optimize the loudness algorithm with the level meter specified in [6] (with 1 ms integration time and 1.5 s release time), additional measurements were carried out. Among other parameters, the cumulative frequency distribution (6%, 7% and 8%) and the analysing time (1 s, 3 s, 5 s, 7 s and 1 s) were tested. The corresponding results are presented in Figs 6-8. After optimizing the parameters under test, the resulting correlation between subjective and objective loudness amounts to 9%. The individual results of subjective and objective loudness are presented in Fig. 8 with additional indication of the average values and the 95% confidence intervals of the subjective loudness levels. Based on these results, a loudness algorithm was defined and a prototype of the studio loudness meter was developed. At the moment, this prototype is undergoing tests with special emphasis being given to practical performance problems. phon Subjective & Objective loudness Spearman Rank Correlation = 9% ( 56 items ) 9 OBJ SUBJ 8 7 6 5 4 Item 12 Item 11 Item 4 Item 3 Item 55 Item 46 Item 9 Item 2 Item 6 Item 29 Item 45 Item 33 Item 56 Item 41 Item 59 Item 6 Item 58 Item 27 Item 37 Item 43 Item 5 Item 25 Item 4 Item 42 Item 19 Item 52 Item 34 Item 39 Figure 8 Subjective (averages and 95% confidence intervals) and objective QPPM loudness EBU TECHNICAL REVIEW January 24 8 / 12

Signal amplitude 9 6 3 3 6 9 12 15 18 3 4 2 1 5 Clipping Level 28% 1% QPPM Programme Level Nominal Level dbr 21 1E-6 1E-3 1 1 3 6 8 Signal level (db) DVB Radio & TV Sound (R/TV): [1] R, news, es [2] R, rock music fr [3] R, easy listening music uk [4] TV, news, fem. de [5] test, pink noise > db QPPM Probability of excessive amplitude (%) Figure 9 Amplitude statistics of DVB radio and TV signals 9 6 3 3 6 9 12 Alignment Level 35% clipping level +9 db db = 1% QPPM true peaks SPM1 15 18 pow1 (1s-rms) LsM DVB Radio Channel 4 21 3 6 9 12 15 18 21 24 27 3 time (sec) Figure 1 Programme levels (QPPM, SPPM, PWR 1s, QPPM Loudness LsM) female speaker (no compression) Signal level (db) 9 6 3 3 6 9 12 15 18 21 pow1s (1s-rms) DVB Radio Channel 2 db = 1% QPPM clipping level +9 db QPPM QPPM 3 6 9 12 15 18 21 24 27 3 time (sec) Figure 11 Programme levels (QPPM, SPPM, PWR 1s, QPPM Loudness LsM) pop music (high compression) SPM1 LsM AUDIO LOUDNESS Programme analysis of DVB channels In order to gain experience with this newly-developed studio loudness meter, audio measurements were carried out on different European DVB channels. Besides the loudness levels (LsM) and the signal levels (PPM, QPPM), the signal amplitudes were also included in the measurement campaign. The following two methods of analysing the derived data were considered as appropriate:! Amplitude statistics analysis of the cumulative frequency distribution of the audio samples. The form of the diagrams presented here (signal amplitude vs. probability of exceeding the amplitude) yield interesting information about loudness and compression features of the analysed signals (Fig. 9).! Level registration vs. time recording the normally displayed levels (e.g. QPPM, SPPM, PWR 1s, QPPM- Loudness LsM) for later evaluation of the programme signals (Figs 1-11). The measurement results presented in Figs 9-11 show beyond doubt that there are tremendous differences between the DVB channels under test, when considering amplitude statistics, QPPM, PPM and LsM. In other words, the results clearly display non-adherence to the relevant ITU levelling recommendations [1]. Programme and loudness levelling in digital sound broadcasting General aspects Digital Radio offers the chance to get rid of those constraints that EBU TECHNICAL REVIEW January 24 9 / 12

are well known in analogue FM radio. In Digital Radio, there is no relationship between loudness and transmission range that requires audio processing. Therefore, the wide dynamic range of Digital Radio can be used to good effect, e.g. to broadcast the full dynamic range of top-quality CD recordings. First of all, the transmitters have to be levelled correctly according to the relevant ITU/EBU Recommendations [1][2]. This should prevent the occurrence of extreme variations in programme loudness. In today s European radio channels (DVB, DAB and ADR), programme signals equivalent to 2-bit PCM quantization can be transmitted with a headroom of 9 db and without compromising the perceived audio quality. These arguments support the 9 db EBU headroom as well as the use of QPPM in the broadcast studios, and should result in a much-needed homogenisation of engineering operations and maintenance. With respect to manual levelling, only specified and correctly calibrated IEC instruments (QPPM) should be used (see Table 2). In order to control the loudness profile within a single programme, an additional loudness meter such as the algorithm proposed in this article should be used. The proposed loudness meter, moreover, offers the opportunity to control the loudness profile automatically. Automatic pre-fading... and adjusting archive programme material Because of level and loudness differences in archive material, an accompanying archive (database) of level and loudness correction values would be useful for automatic broadcast operations. Digital Broadcast Studio A Archive BS K AF PPM & LsM A D C SP VP db F1 F2 Audio metering QPPM LsM A = Archive (CD player) AF = Automatic fader BS = Broadcast Server LsM = Loudness meter DVB = Digital sound channel K = Level-correction data SP/VP = Sound/voice processor FM = Analogue Sound Channel 1% + ext 1% Fig. 12 shows a possible signal Metadata processing scheme for computeraided radio (CAR). The archive Live material is pre-levelled by means of an automatic fader (AF). The archive contribution on the broadcast server (BS) can be levelled optimally before broadcasting, by means of level correction (K) and loudness correction (LsM), which is realized by the automatic fader. Controlling all the contributions in the sum channel are the QPPM and the proposed loudness meter (LsM). Head- Ad [Ls-Lim] FM limiter Figure 12 Proposed levelling scheme for digital sound broadcasting Audio channel 1% = 9 dbfs DVB 1% = 4 khz FM Deviation FM Loudness metering In addition to the 1% tag of QPPM, the loudness meter (LsM) also needs a 1% tag. For optimal levelling of digital sound channels, an additional limit value has to be defined along with the headroom. Unwanted high-level signals could be controlled by means of loudness limitation (Ls-Lim). The loudness limiter can be realized by means of an automatic fader that is controlled by the proposed loudness meter. By ensuring that the velocity of the loudness fading matches that of manual fading by a sound engineer, audible distortion could be avoided. This operation could be described as headroom adaptation. Conclusions If the relevant recommendations of the ITU [1] and EBU [2] are met, and the broadcast signal is levelled optimally by means of QPPM [6], a certain loudness balance could be achieved thus avoiding extreme jumps in loudness. Nevertheless, loudness differences will remain because of diverse recording and audio processing EBU TECHNICAL REVIEW January 24 1 / 12

Gerhard Spikofski studied electrical engineering at Berlin Technical University, one of his main areas of study being technical acoustics. Since 198, he has been on the scientific staff of the Institut für Rundfunktechnik, Munich (IRT). His field of interest covers development and optimization of audio systems in broadcasting, with special reference to the psychoacoustic aspects. Dipl.-Ing. Spikofski has published many articles in national and international specialist journals and is a regular speaker at national and international technical conferences. He is also a member of various national and international standardization bodies. Siegfried Klar studied communications engineering at the academy of Giessen (Germany). Since 1978, he has been on the scientific staff of the Institut für Rundfunktechnik, Munich (IRT). After dealing with video measurement engineering, he changed to the radio broadcast department. In this new working field, he concentrated on problems addressing analogue and digital audio processing and the broadcasting of radio and TV signals. Dipl.-Ing. Klar's current area of activity covers the analysis and optimization of digital audio broadcasting systems. techniques. These remaining loudness differences can be controlled by an additional loudness meter at the studio output. In order to achieve loudness balancing of digital audio broadcasts (such as DVB, DAB and ADR), the first step is to meet the 9 db headroom proposal. The resulting reduction of the available dynamic range is of no consequence to current Digital Radio and TV sound channels with their quasi 2-bit resolution. As high audio levels cannot be avoided in practice and, at the same time, in order to guarantee an agreed loudness limit, an automatic loudness limiter is suggested a so called headroom adapter. This solution (to avoid clipping of the signal) seems to be preferable to that of using limiters. The automatic controlling of both level and loudness is achieved by the proposed loudness meter. Because of the different requirements of archive and broadcast material, it is advisable to distinguish between the levelling of archive and broadcast material. In the case of archive material that will be broadcast, it is highly recommended that this programme material is properly adjusted to suit the new transmission channels available today. Bibliography [1] ITU-R Recommendation BS.645-2: Test signals and metering to be used on international sound programme connections ITU, Geneva, 1992. [2] EBU Recommendation R68-2: Alignment level in digital audio production equipment and in digital audio recorders EBU, Geneva, 2. [3] Empfehlung 15 IRT der ARD-Hörfunk-betriebsleiterkonferenz: Headroom bei digitalen Tonsignalen (Headroom in digital audio) Institut für Rundfunktechnik, München, Okt. 1994. [4] International Standard IEC 268-17: Sound system equipment, Standard volume indicators IEC, Geneva, 199. [5] EBU Tech. 325-E: EBU Standard peak programme meter for the control of international transmissions. EBU, Geneva, 1979. EBU TECHNICAL REVIEW January 24 11 / 12

[6] International Standard IEC 268-1, 2 nd Edition: Sound system equipment, Peak programme level meters IEC, Geneva, 1991. [7] International Standard IEC 268-18: Sound system equipment, Peak programme level meters - Digital audio peak level meter IEC, Geneva, 1995. [8] ARD Pflichtenheft 3/6: Aussteuerungsmesser (Level meter) Institut für Rundfunktechnik, München, Jan. 1977 / März. 1998 (Technische Pflichtenhefte der öffentlich-rechtlichen Rundfunkanstalten in der Bundesrepublik Deutschland; 3/6) [9] Horst Jakubowski: Analyse des Programmaterials des Hörrundfunks (Analysis of radio programme material) Rundfunktechnische Mitteilungen (RTM) 15 (198), H. 5, S. 197-22. [1] Horst Jakubowski: Aussteuerung in der digitalen Tonstudiotechnik (Levelling in digital audio) Rundfunktechnische Mitteilungen (RTM) 28 (1984), H. 5, S. 213-219. [11] Gerhard Spikofski: Signal-to-noise-ratio for digital transmission systems Preprint No. 2196, 77 th AES Convention, Hamburg, 1985. [12] DIN 4545: Störspannungsmessung in der Tontechnik (Measurement of disturbance voltage in audio) Deutsch Normen, Nov. 1983. [13] ITU-R Recommendation BS.468: Measurement of audio-frequency noise voltage level in sound broadcasting ITU, Geneva, 199, 1997. [14] ITU-R Recommendation BS.78: Determination of the electro-acoustical properties of studio monitor headphones ITU, Geneva, 199, 1997. [15] EBU Tech. 3276-E-2nd edition: Listening conditions for the assessment of sound programme material: Monophonic and two-channel stereophonic EBU, Geneva, 1998. [16] Die Lautheitsanzeige in RTW Peakmetern (Loudness display of RTW peak meters) RTW (Radio-Technische Werkstätten GmbH & Co. KG), 1997 [17] Jens Blauert and Jobst P. Fricke: Optimale Aussteuerung in der Sendung (Optimal levelling in broadcasting) Rundfunktechnische Mitteilungen (RTM) 24 (198), S. 63-71. [18] Horst Jakubowski: Das Problem der Programmlautstärke (The problem of programme loudness) Rundfunktechnische Mitteilungen (RTM) 12 (1968), S. 53 ff. [19] John Emmett and Charles Girdwood: Programme Loudness Metering. http://www.bpr.org.uk [2] John Emmett: Programme Loudness Metering and Control. Preprint No. 3295, 92 nd AES Convention, Vienna, March 1992. [21] Gerhard Spikofski,: Lautstärkemessung im Rundfunk-Sendestudio (Loudness measurement in broadcast studios) Tonmeistertagung < 21, 2, Hannover >: Bericht. München: Saur, 21, S. 64-618. EBU TECHNICAL REVIEW January 24 12 / 12