How To Write A Dra Audio Coding Standard

Size: px
Start display at page:

Download "How To Write A Dra Audio Coding Standard"

Transcription

1 Chinese Journal of Electronics Vol.23, No.3, July 2014 DRA Audio Coding Standard MA Wenhua 1,XUJing 2, MA Yuanzhe 3 and YOU Yuli 4 (1.Department of Computers, Cisco School of Informatics, Guangdong University of Foreign Studies, Guangzhou , China) (2.Department of Science and Technology, Guangdong Rising Assets Management Co., Guangzhou , China) (3.Department of Biomedical Science and Technology, South China University of Technology, Guangzhou , China) (4.Guangdong Provincial Key Laboratary for Digital Audio Technology, Guangzhou , China) Abstract DRA (Dynamic resolution adaptation) audio coding standard was shown to deploy transientlocalized MDCT to effectively suppress pre-echo artifacts and statistic allocation of codebooks to improve the compression efficiency of Huffman coding. Its quantizers and Huffman codebooks are designed in such a way that a signal path of 24 bits is provided throughout the codec so that high audio quality can be delivered if bit rate suffices. Although simple, it delivers state-of-the-arts compression efficiency as shown by five rounds of ITU-R BS compliant subjective listening tests. Key words Audio coding, Standard, Listening test, Modified discrete cosine transform (MDCT), Huffman coding. I. Introduction There have been extensive standardization activities for audio coding in the past twenty years. MPEG-1 may be the first international standard for perceptual high quality audio coding [1,2]. It is essentially a subband coder that deploys a 32- band QMF (Quadrature mirror filter bank). Its Layer 3 applies switched MDCT (Modified discrete cosine transform) to the subband signals output from the QMF for increased frequency resolution. MPEG-1 was extended by MPEG-2 BC (Backward compatible) to provide for lower sample rates and multichannel surround sounds [2,3]. MPEG-2 AAC abandons backward compatibility with MPEG-1 and MPEG-2 BC in order to achieve significant improvement in compression efficiency [2,4]. AAC uses an MDCT that switches between 1024 and 128 MDCT coefficients. AAC is carried over into MPEG-4 AAC with the addition of more coding tools, such as Perceptual Noise Shaping and Long Term Prediction, and coder configurations [5].In order to effectively handle low-bitrate applications such as internet streaming, HE-AAC version 1 profile (HE-AAC v1) introduces Spectral band replication (SBR) to enhance the compression efficiency in the frequency domain [6] and HE-AAC version 2 profile (HE-AAC v2) adds to it Parametric stereo (PS) to enhance its compression efficiency of stereo signals [7]. Dolby AC-3 is probably the most commercially successful audio coding standard [2,8]. ItusesMDCTthatswitches between 128 and 256 MDCT coefficients. WMA (Windows media audio) of Microsoft uses MDCT that switches between 64, 128, 256, 512, 1024, and 2048 MDCT coefficients [9]. Vorbis, an open source codec offered by the Xiph.Org Foundation, uses MDCT that switches between 256 and 1024 MDCT coefficients [10]. DRA (Dynamic resolution adaptation) audio coding algorithm from Digital rise technologies was first adopted as an audio coding standard for the electronic industry by China s Ministry of Information Industry in 2007 [11] and quickly became a mandatory audio coding standard for China mobile multimedia broadcasting (CMMB) lunched by China s State Administration of Radio, Film and Television [12]. In early 2009, it was adopted as part of the Blu-ray disc format by the Blu-ray Disc Association [13] and then as a national standard of China by China s State Standardization Commission [14].It became a mandatory audio coding standard for China s Terrestrial DTV Broadcasting in 2011 [15]. This paper presents an overview of DRA audio coding standard. Section II outlines its algorithmic architecture. Section III depicts its transient-localized MDCT that provides improved pre-echo suppression with minimal bit and computational overheads and Section IV explains its statistic approach to the allocation of Huffman codebooks to enhance its compression efficiency. Section V summarizes the results of five rounds of ITU-RBS.1116 compliant subjective listening tests administrated by Chinese government during DRAs standardization process, which clearly indicates that DRA audio coding standard delivers state-of-art compression efficiency. Section VI concludes this paper. II. Algorithmic Architecture The encoder and decoder of DRA audio coding standard Manuscript Received Mar. 2013; Accepted May This work is supported by the 2012 Guangdong Science and Technology Plan for Commercialization of Advanced and New Technologies (No.2012B ), Collaboration of Guangdong and Hong Kong for Major Break-Through in Key Technical Fields (No.2005A ) and by Fund for Guangdong Provincial Key Labs (No.2005B-60149).

2 522 Chinese Journal of Electronics 2014 are shown in Figs.1 and 2, respectively. It is apparent that DRA audio coding algorithm is a simple, essentially bare-bone, adaptive transform coder. The quantization step sizes are themselves logarithmically quantized with a step size of 0.2dB. When the quantization step size is one, the maximum allowed quantization index is ±(2 23 1) and the Huffman codebooks are designed to accommodate this. Consequently, a signal path of up to 24 bits is provided throughout the codec so that audio quality far exceeds the perceptual capability of the human ear can be delivered if bit rate suffices. The quantization indexes thus obtained are Huffmanencoded with codebooks assigned from a library by the Statistics-adaptive codebook assignment tool, to be discussed in Section IV. While transient detection, perceptual model and global bit allocation are necessary components of DRA encoder, they are not part of DRA decoder and there is little, if any, restriction on their implementation, so they are not stipulated in the DRA standard and are thereby not discussed here. Fig. 1. DRA encoder Fig. 2. DRA decoder The encoding process (see Fig.1) starts with the transformation of the input PCM samples into the frequency domain by Transient-localized MDCT. It is a simple and effective tool to suppress pre-echo artifacts that frequently impedes many audio coding algorithms and will be discussed in Section III. The resulting MDCT coefficients are optionally (deployable only at low bitrates) processed by joint channel coding: sum/difference coding and joint intensity coding. While the implementation for sum/difference coding is regular, the joint intensity coding is a little different. Instead of joining stereo pairs, it joins all channels into the left channel, thereby providing significant bit rate reduction when surround sounds are involved. Linear scalar quantization is used to quantize the MDCT coefficients. Toward this end, the MDCT coefficients are first segmented into quantization units, each of which is boxed in the frequency domain by the critical bands and in the time domain by the MDCT blocks that are statistically similar. Then each quantization unit is assigned a quantization step size by the global bit allocation algorithm. This step size is subsequently used to uniformly (linearly) quantize all MDCT coefficients inside the unit. III. Transient-Localized MDCT Given that audio signals are usually quasi-stationary with very strong tonal components, but are frequently interrupted by dramatic transients, it is, therefore, critical for an audio coding algorithm to employ a filter bank that can adapt its temporal-frequency resolution to this piecewise quasistationary nature of audio signals, i.e., having high frequency resolution during quasi-stationary episodes and high temporal resolution around transients. One widely used filter bank is MDCT [2,16],whichcanbe described by the following basis function: r» 2 π h(k, n) =w(n) M cos n + M +1 «k + 1 «M 2 2 where k =0, 1,,M 1; n =0, 1,, 2M 1; and w(n) isa window function of size 2M. The temporal-frequency resolution is largely determined by M, which may be referred to as block size. A large M means low temporal resolution but high frequency resolution, while a small M means high temporal resolution and low frequency resolution. Many widely used algorithms, such as MPEG-1 Layer 3 [1], Dolby AC-3 [8], and MPEG-2/4 ACC [4,5], use two block sizes (L for long and S for short windows, respectively) in order to adapt their resolutions to an input audio signal. The window functions corresponding to these two block sizes are illustrated as (a) and(b) in Fig.3. In order for the MDCT to be able to properly switch between these two block sizes, the perfect reconstruction conditions [2,16] call for the use of the three transitional window functions illustrated in Fig.3 as (c), (d), and (e). Window function (Fig.3(e)) may not be needed if certain restriction on the application of short window is imposed. Because transients in audio signals typically consist of no more than a few samples, the short block size for a frame of detected transient should be a few samples as well, thereby matching the filter s temporal resolution to the transient. Unfortunately, such a choice results in extremely poor frequency resolution within the frame and, therefore, is inappropriate for the rest of the samples in the frame because such other (1)

3 DRA Audio Coding Standard 523 samples, provided they are sufficiently far away from the transient, are quasi-stationary and therefore are better processed using high frequency resolution. This conflict conventionally has resulted in a compromise short block size that is neither optimal for the transient samples nor for the quasi-stationary samples in the same frame. One consequence of this is pre-echo artifacts. Fig. 3. Window functions Temporal noise shaping (TNS) [17],usedbyAAC [5],isone of the pre-echo control methods [2]. It deploys linear prediction over the MDCT coefficients for the block with transient, in order to boost the coding gain for the block and also to shape some of the pre-echo quantization noise to behind the transient. TNS is computationally intensive because of the linear prediction and the overhead for transferring the description of the predictor, including the prediction filter coefficients, is remarkable. DRA audio coding standard [18,19] deploys a very simple method for pre-echo suppression: the transient samples are covered by a special short window, which has the same window span as the normal short window, but has a significantly shorter effective size. The shorter effective size provides better transient localization while the normal window span leaves the frequency resolution unchanged. Pre-echo is better suppressed because transient localization reduces the spread of both quantization noise and high bit rates associated with transients. 1. Brief window functions This special short window is illustrated as (f) in Fig.3 and is referred to as the brief window function and labeled as WS B2B. It is still a short window of length S, thesame length as other windows within the frame. However, unlike those other windows, brief window WS B2B uses only a central portion of its overall length for signal shaping, employing a number of leading and trailing zeros in order to improve its temporal resolution. Obviously, its effective size is reduced. For example, if the short window size S = 256 samples, the brief window WS B2B may be designed such that it is nonzero within the central 160 samples, with the first 16 and last 16 of such samples overlapping the respective transition windows that are adjacent to it, and with zeros for the first 48 and the last 48 samples of the window. Its effective window size is reduced from 256 to 160. In order to switch to/from this brief window from/to the long (WL L2L) and short (WS S2S) windows, the perfect reconstruction conditions [2,16] call for the addition of transitional windows which are illustrated as (g), (h), (i), (j), (k), (l), and (m) in Fig.3. Obviously, these and the brief window need more memory to store. This brief window is applied only to the block of samples containing a transient, while the short and/or the appropriate transition windows are applied to the quasi-stationary samples in the remainder of the frame. 2. Enhanced pre-echo control The significantly reduced effective size of the brief window offers better pre-echo suppression for two reasons. The first is that a finer temporal resolution is deployed to transient samples and high bit rates associated with transients are constrained to fewer samples. The second reason is that the spread of quantization noise is reduced. For the example at the beginning of this section where a short window size S = 256 samples, its spread of quantization noise is 256 samples, but this is reduced to only 160 samples for the brief window. For a typical sample rate of 48 khz, they amount to 5.33 and 3.3ms, respectively. Given that significant premasking tends to last about 1 2ms [2],thespread of quantization noise that may cause pre-echo is reduced from to ms, a significant reduction. 3. Window sequencing If there is no transient detected within the current frame, select a long window, the specific shape of which depending on the existence and location of any transient in the previous and next frame, as shown in Table 1. Table 1. Determination of long windows for a frame without detected transient Frame Previous Current Subsequent WL L2L No transient No transient WL L2S Transient, but not in the first block WL L2B Transient in the first block WL S2L No transient Transient not WL S2S Transient, but not in the first block in last block WL S2B Transient in the first block WL B2L No transient Transient in WL B2S Transient, but not in the first block last block WL B2B Transient in the first block Due to the increased number of windows as compared with conventional methods, the determination of appropriate window sequence is more involved, but a short procedure [18],out-

4 524 Chinese Journal of Electronics 2014 lined as follows, that applies the perfect reconstruction conditions suffices. If a transient has been detected in the current frame, the selection of windows is a little bit more involved. First of all, identify the location(s) of the transient(s). Around a transient, select a series of short windows according to the following principles: WS B2B is applied to the block where a transient occurs, in order to improve the temporal resolution of the MDCT. The window for the block that is immediately before this transient block has a designation of the form...2b. The window for the block that is immediately after this transient block has a designation of the form... B2.... Consequently, the allowed placement of windows is summarized in Table 2. For the rest of the frame (away from the transient), short window WS S2S should be deployed, except for the first and last blocks of the frame. For the first block of the frame, if there is no transient in the last block of the previous frame, short window WS S2S should be used; otherwise, short window WS B2S should be used. For the last block of the frame, if there is no transient in the first block of the subsequent frame, short window WS S2S should be used; otherwise, short window WS S2B should be used. Table 2. Allowed placement of windows around a block with a detected transient Pre-transient Transient Post-transient WL L2B WL B2L WL S2B WL B2S WL B2B WS B2B WL B2B WS S2B WS B2S WS B2B WS B2B Some example window sequences are shown in Fig.4. Fig.4(a) is an example for the conventional approach; (b)shows that a transient occurs in the first block of the frame, so brief window WS B2B is deployed for this block, WL L2B is deployed for the previous frame and WS B2S is deployed for the second block of the frame; (c) shows that a transient occurs in the third block of the frame; (d) shows that two transients occur in the third and sixth blocks, so two brief windows are placed, respectively; (e) shows that a transient occurs in the last block of the frame. 4. Indication of window sequence to decoder The encoder needs to indicate to the decoder the window(s) that it used to encode the current frame so that the decoder can use the same window(s) to decode the frame. This can be accomplished by a window index with the conventional methods. This is also true for the present method. For a frame without a detected transient, one label from those in Table 1 needs to be transferred to the decoder. For a frame with detected transient, the window sequencing procedure outlined in Section III needs to know (1) transient location(s), which the conventional methods also need, and (2) whether there is a transient in the first block of the current frame and of the subsequent frame, respectively. This later information may be conveyed by the following nomenclature: WS CurrentSubs, where Current (S =no, B =yes) identifies if there is transient in the first block of current frame, and Subs (S =no, B =yes) identifies if there is transient in the first block of the subsequent frame. This is shown in Table 3. Obviously, the first column of Table 3 is the same set of labels used for the short windows, i.e., (a), (f), (g), and (h) in Fig.3. Fig. 4. Window sequence examples. (a) Window sequence for conventional methods; (b) Window sequence for the proposed method when a transient occurs in the first block; (c) When a transient occurs in the third block; (d) When two transients occur in the third and sixth blocks; (e) Whenatransientoccursinthelastblock Table 3. Encoding of the existence or absence of transient in the first block of the current/subsequent frame Label Transient in the first block of current frame subsequent frame WS B2B yes yes WS B2S yes no WS S2B no yes WS S2S no no Combining the labels in Table 1 and in Table 3, we get the complete set of window labels shown in Fig.3, and they are sufficient to convey the window sequencing information. The total number of window labels is now 13, while that number is 5 or 4 for the conventional approach (depending on whether or not window WL S2S is forbidden), so one or two more bits (4 versus 3 or 2) are needed to transfer the window sequencing information. IV. Statistics-Adaptive Codebook Assignment Another special feature of DRA audio coding standard is the assignment of its Huffman codebooks. Conventional approach to codebook allocation usually assigns a Huffman codebook to encode all MDCT coefficients within a quantization unit. This codebook is usually the smallest one that can accommodate the largest quantization index within the unit. Consequently, once the quantization step size is determined, all quantization indexes within the unit are also determined. And so is the Huffman codebook, there is no other option.

5 DRA Audio Coding Standard 525 Since the quantization indexes within a quantization unit do not necessarily share the same statistic properties, the traditional approach does not provide a good match, if any, between the statistic properties of the Huffman codebooks and those of the quantization indexes. This motivates a statisticsadaptive approach to codebook assignment, whose steps are outlined as follows: (1) The quantization indexes are grouped into granules of four, the smallest codebook that can accommodate the largest quantization index in the granule is assigned to the granule. (2) Segment the indexes of these codebook into large segments based on their local statistic properties. (3) Select the largest codebook within each segment as the codebook for that segment. The advantage of this approach is illustrated in Fig.5. Since the largest quantization index falls into quantization unit d, so a large codebook is assigned using previous methods, which is obviously not a good match because most of the indexes in the unit are much smaller. Using the DRA approach, however, the largest quantization index is segmented into segment C, so share a codebook with other large quantization indexes. Also, all quantization indexes in segment D are small, so a small codebook is selected. This obviously results in fewer bits for coding the quantization indexes. standard went through five rounds of ITU-R BS.1116 [1] compliant subjective listening test. The results of them are shown in Table 4. Table 4. Results of ITU-R BS compliant subjective listening tests. All listening panels are expert listeners and most are young audio professional Test Test Stereo lab date 128 kbps 320 kbps 384 kbps NTICRT 08/ SLDST 10/ SLDST 01/ SLDST 07/ SLDST 08/ The bit rate of all these tests was specified in such a way that it is the upper limit absolutely not to be exceeded in any frame. For example, if the sample rate is 48 khz, the bit rate of 128 kbps translates into 2730 bits per frame because a DRA frame consists of 1024 samples. No frame can use more than 2730 bits and no bit reservoir is allowed. The first test was conducted by National Testing and Inspection Center for Radio and TV Products of China (NTI- CRT) in August Ten stereo sound tracks selected mostly from SQAM CD [6] and five 5.1 surround sound tracks were used in the test. The test subjects were all expert listeners consisting of conductors, musicians, recording engineers, and audio engineers. The other four tests were all performed by the State Lab for DTV System Testing (SLDST) under China s State Administration for Radio, Film, and TV. Some of China s famous conductors, musicians and recording engineers were among the listening panels. Seniors and graduate students from the School of Recording Arts, Communication University of China, and the Department of Sound Recording, Beijing Film Academy, were found to possess better acuity due to their age and training, so they had become the majority in the later tests. Other than a few Chinese sound tracks, most of the test materials were selected from the SQAM CD [6] and a pool of surround sound tracks used by EBU and MPEG, including Pitch pipe, Harpsichord, and Elloit1. Fig. 5. Advantages of statistic allocation of Huffman codebooks The conventional methods only need to transfer the codebook indexes because the scopes of codebook application are the same as the quantization units, which are pre-determined. DRA approach, however, need to transfer the scopes of codebook application in addition to the codebook indexes, since the scopes are unrelated to the quantization units. This overhead must be contained in order for DRA approach to offer any advantage. This can be easily achieved by imposing a limit to the number of segments. V. Subjective Listening Tests During its standardization process, DRA audio coding VI. Conclusion DRA audio coding standard was shown to use transientlocalized MDCT to effectively suppress pre-echo artifacts and statistic allocation of codebooks to improve its entropy coding efficiency. Its quantizers and Huffman codebooks are designed in such a way that a signal path of 24 bits is provided throughout the codec so that highest audio quality can be delivered if bit rate suffices. The final results of five ITU-R BS compliant subjective listening tests indicate that DRA audio coding standard has achieved transparency for stereo at 128 kbps and 5.1 surround at 384 kbps. References [1] ISO/IEC :1993 Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to About 1, 5

6 526 Chinese Journal of Electronics 2014 Mbit/s Part [2] T. Painter and A. Spanias, Perceptual coding of digital audio, Proceedings of the IEEE, Vol.88, No.4, pp , [3] ISO/IEC :1997 Information Technology: Generic Coding [4] ISO/IEC :1997 Information Technology: Generic Coding 7: Advanced Audio Coding (AAC). [5] ISO/IEC :1999 Information Technology: Generic Coding [6] ISO/IEC :2001 Information Technology: Generic Coding [7] ISO/IEC :2005 Information Technology: Generic Coding [8] Dolby Laboratories, Digital Audio Compression Standard A/52B, Advanced Television Systems Committee (ATSC), [9] Windows media audio, Media Audio, [10] Vorbis I specification, Xiph.org Foundation, [11] SJ/T , Electronic Industry Standard: DRA Audio Coding Technology. [12] GY/T , Mobile Multimedia Broadcasting, Part 2: Multiplexing. [13] Q/DCHL , BD-ROM 2.3 Specification: DRA Audio Coding Technology. [14] GB/T , National Standard: DRA Audio Coding Technology. [15] GB/T , Terrestrial DTV Receiver Specification. [16] John P. Princen and Alan B. Bradley, Analysis/synthesis filter bank design based on time domain aliasing cancellation, IEEE Transactions on ASSP, Vol.34, No.5, pp , [17] J. Herre and J.D. Johnston, Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS), 101st AES Convention, 1996 Reprint#4384. [18] Yuli You, Audio Coding: Theory and Practice, Springer, USA, [19] Yuli You and Wenhua Ma, Modile Multimedin Broadcasting Standards: Technology and Practice, Springer, USA, MA Wenhua is an associate professor at Guangdong University of Foreign Studies and a senior member of Chinese Institute of Electronics. His research interests include signal processing and embedded systems. ( @qq.com) XU Jing received the MBA and M.S. degrees from Sun Yat-Sen University and is the director of Department of Science and Technology, Guangdong Rising Asset Management Co., Ltd., Guangzhou, China. His research interests include audio coding and streaming technologies. MA Yuanzhe is pursuing the B.S. degree in biomedical engineering in South China University of Technology, Guangzhou, China. YOU Yuli received the Ph.D. degree from the University of Minnesota in Twin Cities, Minneapolis, MN 55455, USA in 1995 and is the inventor of DRA audio coding standard. His research interests include audio, image and video processing.

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final

More information

AUDIO CODING: BASICS AND STATE OF THE ART

AUDIO CODING: BASICS AND STATE OF THE ART AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603

More information

Audio Coding Algorithm for One-Segment Broadcasting

Audio Coding Algorithm for One-Segment Broadcasting Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient

More information

Digital Audio Compression: Why, What, and How

Digital Audio Compression: Why, What, and How Digital Audio Compression: Why, What, and How An Absurdly Short Course Jeff Bier Berkeley Design Technology, Inc. 2000 BDTI 1 Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions

More information

Technical Paper. Dolby Digital Plus Audio Coding

Technical Paper. Dolby Digital Plus Audio Coding Technical Paper Dolby Digital Plus Audio Coding Dolby Digital Plus is an advanced, more capable digital audio codec based on the Dolby Digital (AC-3) system that was introduced first for use on 35 mm theatrical

More information

Audio Coding Introduction

Audio Coding Introduction Audio Coding Introduction Lecture WS 2013/2014 Prof. Dr.-Ing. Karlheinz Brandenburg bdg@idmt.fraunhofer.de Prof. Dr.-Ing. Gerald Schuller shl@idmt.fraunhofer.de Page Nr. 1 Organisatorial Details - Overview

More information

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de

MPEG-1 / MPEG-2 BC Audio. Prof. Dr.-Ing. K. Brandenburg, bdg@idmt.fraunhofer.de Dr.-Ing. G. Schuller, shl@idmt.fraunhofer.de MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream Signal to Mask Ratio

More information

DAB + The additional audio codec in DAB

DAB + The additional audio codec in DAB DAB + The additional audio codec in DAB 2007 Contents Why DAB + Features of DAB + Possible scenarios with DAB + Comparison of DAB + and DMB for radio services Performance of DAB + Status of standardisation

More information

Digital terrestrial television broadcasting Audio coding

Digital terrestrial television broadcasting Audio coding Digital terrestrial television broadcasting Audio coding Televisão digital terrestre Codificação de vídeo, áudio e multiplexação Parte 2: Codificação de áudio Televisión digital terrestre Codificación

More information

5.1 audio. How to get on-air with. Broadcasting in stereo. the Dolby "5.1 Cookbook" for broadcasters. Tony Spath Dolby Laboratories, Inc.

5.1 audio. How to get on-air with. Broadcasting in stereo. the Dolby 5.1 Cookbook for broadcasters. Tony Spath Dolby Laboratories, Inc. 5.1 audio How to get on-air with the Dolby "5.1 Cookbook" for broadcasters Tony Spath Dolby Laboratories, Inc. This article is aimed at television broadcasters who want to go on-air with multichannel audio

More information

Audio Coding, Psycho- Accoustic model and MP3

Audio Coding, Psycho- Accoustic model and MP3 INF5081: Multimedia Coding and Applications Audio Coding, Psycho- Accoustic model and MP3, NR Torbjørn Ekman, Ifi Nils Christophersen, Ifi Sverre Holm, Ifi What is Sound? Sound waves: 20Hz - 20kHz Speed:

More information

For Articulation Purpose Only

For Articulation Purpose Only E305 Digital Audio and Video (4 Modular Credits) This document addresses the content related abilities, with reference to the module. Abilities of thinking, learning, problem solving, team work, communication,

More information

DOLBY SR-D DIGITAL. by JOHN F ALLEN

DOLBY SR-D DIGITAL. by JOHN F ALLEN DOLBY SR-D DIGITAL by JOHN F ALLEN Though primarily known for their analog audio products, Dolby Laboratories has been working with digital sound for over ten years. Even while talk about digital movie

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting Page 1 of 9 1. SCOPE This Operational Practice is recommended by Free TV Australia and refers to the measurement of audio loudness as distinct from audio level. It sets out guidelines for measuring and

More information

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il

More information

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder.

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder. A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER Manoj Kumar 1 Mohammad Zubair 1 1 IBM T.J. Watson Research Center, Yorktown Hgts, NY, USA ABSTRACT The MPEG/Audio is a standard for both

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6 Page 1 of 6 1. Scope. This operational practice sets out the requirements for downmixing 5.1 and 5.0 channel surround sound audio mixes to 2 channel stereo. This operational practice recommends a number

More information

high-quality surround sound at stereo bit-rates

high-quality surround sound at stereo bit-rates FRAUNHOFER Institute For integrated circuits IIS MPEG Surround high-quality surround sound at stereo bit-rates Benefits exciting new next generation services MPEG Surround enables new services such as

More information

MP3 AND AAC EXPLAINED

MP3 AND AAC EXPLAINED MP3 AND AAC EXPLAINED KARLHEINZ BRANDENBURG ½ ½ Fraunhofer Institute for Integrated Circuits FhG-IIS A, Erlangen, Germany bdg@iis.fhg.de The last years have shown widespread proliferation of.mp3-files,

More information

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles

Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude

More information

MPEG-H Audio System for Broadcasting

MPEG-H Audio System for Broadcasting MPEG-H Audio System for Broadcasting ITU-R Workshop Topics on the Future of Audio in Broadcasting Jan Plogsties Challenges of a Changing Landscape Immersion Compelling sound experience through sound that

More information

AVP 4000. SW9.5 07-2013-v2

AVP 4000. SW9.5 07-2013-v2 AVP 4000 In a fast changing, highly competitive market, media organisations need encoding solutions that deliver high quality, high reliability and operational flexibility. The number of channels continues

More information

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton Abstract This paper intends to give the reader some insight into the workings

More information

APPLICATION BULLETIN AAC Transport Formats

APPLICATION BULLETIN AAC Transport Formats F RA U N H O F E R I N S T I T U T E F O R I N T E G R A T E D C I R C U I T S I I S APPLICATION BULLETIN AAC Transport Formats INITIAL RELEASE V. 1.0 2 18 1 AAC Transport Protocols and File Formats As

More information

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- sumedha.phatak@mavs.uta.edu Objective: A study, implementation and comparison of

More information

The AAC audio Coding Family For

The AAC audio Coding Family For White PapER The AAC audio Coding Family For Broadcast and Cable TV Over the last few years, the AAC audio codec family has played an increasingly important role as an enabling technology for state-of-the-art

More information

Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA

Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA Audio Engineering Society Convention Paper Presented at the 9th Convention 5 October 7 New York, New York USA This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

H.264/MPEG-4 AVC Video Compression Tutorial

H.264/MPEG-4 AVC Video Compression Tutorial Introduction The upcoming H.264/MPEG-4 AVC video compression standard promises a significant improvement over all previous video compression standards. In terms of coding efficiency, the new standard is

More information

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN Zheng Lai Zhao Liu Meng Li Quan Yuan zl2215@columbia.edu zl2211@columbia.edu ml3088@columbia.edu qy2123@columbia.edu I. Overview Architecture The purpose

More information

Multichannel Audio Line-up Tones

Multichannel Audio Line-up Tones EBU Tech 3304 Multichannel Audio Line-up Tones Status: Technical Document Geneva May 2009 Tech 3304 Multichannel audio line-up tones Contents 1. Introduction... 5 2. Existing line-up tone arrangements

More information

Multichannel stereophonic sound system with and without accompanying picture

Multichannel stereophonic sound system with and without accompanying picture Recommendation ITU-R BS.775-2 (07/2006) Multichannel stereophonic sound system with and without accompanying picture BS Series Broadcasting service (sound) ii Rec. ITU-R BS.775-2 Foreword The role of the

More information

Convention Paper 5553

Convention Paper 5553 Audio Engineering Society Convention Paper 5553 Presented at the 112th Convention 2 May 1 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

TECHNICAL OPERATING SPECIFICATIONS

TECHNICAL OPERATING SPECIFICATIONS TECHNICAL OPERATING SPECIFICATIONS For Local Independent Program Submission September 2011 1. SCOPE AND PURPOSE This TOS provides standards for producing programs of a consistently high technical quality

More information

RECOMMENDATION ITU-R SM.1792. Measuring sideband emissions of T-DAB and DVB-T transmitters for monitoring purposes

RECOMMENDATION ITU-R SM.1792. Measuring sideband emissions of T-DAB and DVB-T transmitters for monitoring purposes Rec. ITU-R SM.1792 1 RECOMMENDATION ITU-R SM.1792 Measuring sideband emissions of T-DAB and DVB-T transmitters for monitoring purposes (2007) Scope This Recommendation provides guidance to measurement

More information

HE-AAC v2. MPEG-4 HE-AAC v2 (also known as aacplus v2 ) is the combination of three technologies:

HE-AAC v2. MPEG-4 HE-AAC v2 (also known as aacplus v2 ) is the combination of three technologies: HE- v2 MPEG-4 audio coding for today s digital media world Stefan Meltzer and Gerald Moser Coding Technologies Delivering broadcast-quality content to consumers is one of the most challenging tasks in

More information

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions Edith Cowan University Research Online ECU Publications Pre. JPEG compression of monochrome D-barcode images using DCT coefficient distributions Keng Teong Tan Hong Kong Baptist University Douglas Chai

More information

Convention Paper 7896

Convention Paper 7896 Audio Engineering Society Convention Paper 7896 Presented at the 127th Convention 2009 October 9 12 New York, NY, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

MPEG Layer-3. An introduction to. 1. Introduction

MPEG Layer-3. An introduction to. 1. Introduction An introduction to MPEG Layer-3 MPEG Layer-3 K. Brandenburg and H. Popp Fraunhofer Institut für Integrierte Schaltungen (IIS) MPEG Layer-3, otherwise known as MP3, has generated a phenomenal interest among

More information

DESIGN AND SIMULATION OF TWO CHANNEL QMF FILTER BANK FOR ALMOST PERFECT RECONSTRUCTION

DESIGN AND SIMULATION OF TWO CHANNEL QMF FILTER BANK FOR ALMOST PERFECT RECONSTRUCTION DESIGN AND SIMULATION OF TWO CHANNEL QMF FILTER BANK FOR ALMOST PERFECT RECONSTRUCTION Meena Kohli 1, Rajesh Mehra 2 1 M.E student, ECE Deptt., NITTTR, Chandigarh, India 2 Associate Professor, ECE Deptt.,

More information

The Theory Behind Mp3

The Theory Behind Mp3 The Theory Behind Mp3 Rassol Raissi December 2002 Abstract Since the MPEG-1 Layer III encoding technology is nowadays widely used it might be interesting to gain knowledge of how this powerful compression/decompression

More information

Web-Conferencing System SAViiMeeting

Web-Conferencing System SAViiMeeting Web-Conferencing System SAViiMeeting Alexei Machovikov Department of Informatics and Computer Technologies National University of Mineral Resources Mining St-Petersburg, Russia amachovikov@gmail.com Abstract

More information

Analog-to-Digital Voice Encoding

Analog-to-Digital Voice Encoding Analog-to-Digital Voice Encoding Basic Voice Encoding: Converting Analog to Digital This topic describes the process of converting analog signals to digital signals. Digitizing Analog Signals 1. Sample

More information

What is 802.11? Why are standards important?

What is 802.11? Why are standards important? What is 802.11? The 802.11 standards are a group of evolving specifications defined by the Institute of Electrical and Electronic Engineers (IEEE). Commonly referred to as Wi Fi the 802.11 standards define

More information

PCM Encoding and Decoding:

PCM Encoding and Decoding: PCM Encoding and Decoding: Aim: Introduction to PCM encoding and decoding. Introduction: PCM Encoding: The input to the PCM ENCODER module is an analog message. This must be constrained to a defined bandwidth

More information

RECOMMENDATION ITU-R BR.1384-1*, ** Parameters for international exchange of multi-channel sound recordings with or without accompanying picture ***

RECOMMENDATION ITU-R BR.1384-1*, ** Parameters for international exchange of multi-channel sound recordings with or without accompanying picture *** Rec. ITU-R BR.1384-1 1 RECOMMENDATION ITU-R BR.1384-1*, ** Parameters for international exchange of multi-channel sound recordings with or without accompanying picture *** (Question ITU-R 58/6) (1998-2005)

More information

Video compression: Performance of available codec software

Video compression: Performance of available codec software Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes

More information

Preservation Handbook

Preservation Handbook Preservation Handbook Digital Audio Author Gareth Knight & John McHugh Version 1 Date 25 July 2005 Change History Page 1 of 8 Definition Sound in its original state is a series of air vibrations (compressions

More information

Study and Implementation of Video Compression standards (H.264/AVC, Dirac)

Study and Implementation of Video Compression standards (H.264/AVC, Dirac) Study and Implementation of Video Compression standards (H.264/AVC, Dirac) EE 5359-Multimedia Processing- Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) Objective A study, implementation and comparison

More information

Figure 1: Relation between codec, data containers and compression algorithms.

Figure 1: Relation between codec, data containers and compression algorithms. Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order

More information

Mike Perkins, Ph.D. perk@cardinalpeak.com

Mike Perkins, Ph.D. perk@cardinalpeak.com Mike Perkins, Ph.D. perk@cardinalpeak.com Summary More than 28 years of experience in research, algorithm development, system design, engineering management, executive management, and Board of Directors

More information

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services ARIB STD-T-C.S00 v.0 Circuit-Switched Video Conferencing Services Refer to "Industrial Property Rights (IPR)" in the preface of ARIB STD-T for Related Industrial Property Rights. Refer to "Notice" in the

More information

Audio and Video Synchronization:

Audio and Video Synchronization: White Paper Audio and Video Synchronization: Defining the Problem and Implementing Solutions Linear Acoustic Inc. www.linearacaoustic.com 2004 Linear Acoustic Inc Rev. 1. Introduction With the introduction

More information

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform

High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 4, JULY 2003 365 High-Fidelity Multichannel Audio Coding With Karhunen-Loève Transform Dai Yang, Member, IEEE, Hongmei Ai, Member, IEEE, Chris

More information

Introduction to image coding

Introduction to image coding Introduction to image coding Image coding aims at reducing amount of data required for image representation, storage or transmission. This is achieved by removing redundant data from an image, i.e. by

More information

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING A TOOL FOR TEACHING LINEAR PREDICTIVE CODING Branislav Gerazov 1, Venceslav Kafedziski 2, Goce Shutinoski 1 1) Department of Electronics, 2) Department of Telecommunications Faculty of Electrical Engineering

More information

BlazeVideo HDTV Player v6.0r User s Manual. Table of Contents

BlazeVideo HDTV Player v6.0r User s Manual. Table of Contents BlazeVideo HDTV Player v6.0r User s Manual Table of Contents Ⅰ. Overview... 2 1.1 Introduction... 2 1.2 Features... 2 1.3 System Requirements... 2 Ⅱ. Appearance & Menus... 4 Ⅲ. Operation Guide... 7 3.1

More information

Image Compression through DCT and Huffman Coding Technique

Image Compression through DCT and Huffman Coding Technique International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul

More information

Khalid Sayood and Martin C. Rost Department of Electrical Engineering University of Nebraska

Khalid Sayood and Martin C. Rost Department of Electrical Engineering University of Nebraska PROBLEM STATEMENT A ROBUST COMPRESSION SYSTEM FOR LOW BIT RATE TELEMETRY - TEST RESULTS WITH LUNAR DATA Khalid Sayood and Martin C. Rost Department of Electrical Engineering University of Nebraska The

More information

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter

An Optimised Software Solution for an ARM Powered TM MP3 Decoder. By Barney Wragg and Paul Carpenter An Optimised Software Solution for an ARM Powered TM MP3 Decoder By Barney Wragg and Paul Carpenter Abstract The market predictions for MP3-based appliances are extremely positive. The ability to maintain

More information

White Paper: An Overview of the Coherent Acoustics Coding System

White Paper: An Overview of the Coherent Acoustics Coding System White Paper: An Overview of the Coherent Acoustics Coding System Mike Smyth June 1999 Introduction Coherent Acoustics is a digital audio compression algorithm designed for both professional and consumer

More information

Loudness and Dynamic Range

Loudness and Dynamic Range Loudness and Dynamic Range in broadcast audio the Dolby solution Tony Spath Dolby Laboratories, Inc. Digital delivery media offer a wider dynamic range for audio than their analogue predecessors. This

More information

BROADCASTING ACT (CHAPTER 28) Code of Practice for Television Broadcast Standards

BROADCASTING ACT (CHAPTER 28) Code of Practice for Television Broadcast Standards BROADCASTING ACT (CHAPTER 28) Code of Practice for Television Broadcast Standards 1 In exercise of the powers conferred by section 6 of the Broadcasting Act (Cap. 28), the Media Development Authority of

More information

encoding compression encryption

encoding compression encryption encoding compression encryption ASCII utf-8 utf-16 zip mpeg jpeg AES RSA diffie-hellman Expressing characters... ASCII and Unicode, conventions of how characters are expressed in bits. ASCII (7 bits) -

More information

FAST FACTS. Fraunhofer Institute for Integrated Circuits IIS

FAST FACTS. Fraunhofer Institute for Integrated Circuits IIS FAST FACTS HE-AAC - The codec of choice for broadcast and streaming MPEG HE-AAC (High Efficiency Advanced Audio Coding) has become one of the most important enabling technologies for state-of-the-art multimedia

More information

Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network

Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network Xiaoan Lu, David Goodman, Yao Wang, and Elza Erkip Electrical and Computer Engineering, Polytechnic University, Brooklyn,

More information

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet DICTA2002: Digital Image Computing Techniques and Applications, 21--22 January 2002, Melbourne, Australia Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet K. Ramkishor James. P. Mammen

More information

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29. Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet

More information

LOUDNESS MONITORING AND CALM COMPLIANCE (UNFOLDING EVENTS) Jim Welch. IneoQuest Technologies

LOUDNESS MONITORING AND CALM COMPLIANCE (UNFOLDING EVENTS) Jim Welch. IneoQuest Technologies LOUDNESS MONITORING AND CALM COMPLIANCE (UNFOLDING EVENTS) Jim Welch IneoQuest Technologies Topics What s the issue? What s the history? What are the new rules? What are the key concepts? What are the

More information

WHITE PAPER. H.264/AVC Encode Technology V0.8.0

WHITE PAPER. H.264/AVC Encode Technology V0.8.0 WHITE PAPER H.264/AVC Encode Technology V0.8.0 H.264/AVC Standard Overview H.264/AVC standard was published by the JVT group, which was co-founded by ITU-T VCEG and ISO/IEC MPEG, in 2003. By adopting new

More information

ATSC Standard: 3D-TV Terrestrial Broadcasting, Part 2 Service Compatible Hybrid Coding Using Real-Time Delivery

ATSC Standard: 3D-TV Terrestrial Broadcasting, Part 2 Service Compatible Hybrid Coding Using Real-Time Delivery ATSC Standard: 3D-TV Terrestrial Broadcasting, Part 2 Service Compatible Hybrid Coding Using Real-Time Delivery Doc. A/104 Part 2 26 December 2012 Advanced Television Systems Committee 1776 K Street, N.W.

More information

JPEG Image Compression by Using DCT

JPEG Image Compression by Using DCT International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-4, Issue-4 E-ISSN: 2347-2693 JPEG Image Compression by Using DCT Sarika P. Bagal 1* and Vishal B. Raskar 2 1*

More information

Fraunhofer Institute for Integrated Circuits IIS. Director Prof. Dr.-Ing. Albert Heuberger Am Wolfsmantel 33 91058 Erlangen www.iis.fraunhofer.

Fraunhofer Institute for Integrated Circuits IIS. Director Prof. Dr.-Ing. Albert Heuberger Am Wolfsmantel 33 91058 Erlangen www.iis.fraunhofer. WHITE PAPER MPEG audio encoders and decoders on various platforms Fraunhofer IIS offers quality- and resource optimized software implementations of the MPEG-4 audio en- and decoding algorithms on various

More information

TV 2 AS HD DELIVERY FOR TV 2 AS

TV 2 AS HD DELIVERY FOR TV 2 AS TV 2 AS HD DELIVERY FOR TV 2 AS Version 1.0 Contents 1. Delivery Format... 4 2. Standard Definition... 4 2.1. Standard definition... 4 3. Video Standards... 4 4. Action and Caption safe areas... 5 5. Audio...

More information

Overview ISDB-T for sound broadcasting Terrestrial Digital Radio in Japan. Shunji NAKAHARA. NHK (Japan Broadcasting Corporation)

Overview ISDB-T for sound broadcasting Terrestrial Digital Radio in Japan. Shunji NAKAHARA. NHK (Japan Broadcasting Corporation) Overview ISDB-T for sound broadcasting Terrestrial Digital Radio in Japan Shunji NAKAHARA NHK (Japan Broadcasting Corporation) 2003/11/04 1 Contents Features of ISDB-T SB system Current status of digital

More information

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman A Comparison of Speech Coding Algorithms ADPCM vs CELP Shannon Wichman Department of Electrical Engineering The University of Texas at Dallas Fall 1999 December 8, 1999 1 Abstract Factors serving as constraints

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Version ECE IIT, Kharagpur Lesson H. andh.3 Standards Version ECE IIT, Kharagpur Lesson Objectives At the end of this lesson the students should be able to :. State the

More information

PRIMER ON PC AUDIO. Introduction to PC-Based Audio

PRIMER ON PC AUDIO. Introduction to PC-Based Audio PRIMER ON PC AUDIO This document provides an introduction to various issues associated with PC-based audio technology. Topics include the following: Introduction to PC-Based Audio Introduction to Audio

More information

An Efficient Architecture for Image Compression and Lightweight Encryption using Parameterized DWT

An Efficient Architecture for Image Compression and Lightweight Encryption using Parameterized DWT An Efficient Architecture for Image Compression and Lightweight Encryption using Parameterized DWT Babu M., Mukuntharaj C., Saranya S. Abstract Discrete Wavelet Transform (DWT) based architecture serves

More information

All About Audio Metadata. The three Ds: dialogue level, dynamic range control, and downmixing

All About Audio Metadata. The three Ds: dialogue level, dynamic range control, and downmixing Information All About Audio Metadata Metadata, the data about the audio data that travels along with the multichannel audio bitstream in Dolby Digital, makes life easier for broadcasters while also increasing

More information

RECOMMENDATION ITU-R BO.786 *

RECOMMENDATION ITU-R BO.786 * Rec. ITU-R BO.786 RECOMMENDATION ITU-R BO.786 * MUSE ** system for HDTV broadcasting-satellite services (Question ITU-R /) (992) The ITU Radiocommunication Assembly, considering a) that the MUSE system

More information

Dithering in Analog-to-digital Conversion

Dithering in Analog-to-digital Conversion Application Note 1. Introduction 2. What is Dither High-speed ADCs today offer higher dynamic performances and every effort is made to push these state-of-the art performances through design improvements

More information

Improved user experiences are possible with enhanced FM radio data system (RDS) reception

Improved user experiences are possible with enhanced FM radio data system (RDS) reception Improved user experiences are possible with enhanced FM radio data system (RDS) reception Aravind Ganesan, Senior Systems Engineer, and Jaiganesh Balakrishnan, Senior Member of Technical Staff, Wireless

More information

(51) Int Cl.: H04S 3/00 (2006.01)

(51) Int Cl.: H04S 3/00 (2006.01) (19) TEPZZ_9 7 66B_T (11) EP 1 927 266 B1 (12) EUROPEAN PATENT SPECIFICATION (4) Date of publication and mention of the grant of the patent: 14.0.14 Bulletin 14/ (21) Application number: 0679846.2 (22)

More information

COMMON SURROUND SOUND FORMATS

COMMON SURROUND SOUND FORMATS COMMON SURROUND SOUND FORMATS The range of decoding formats used to deliver the theatre surround sound experience include: Dolby Pro Logic: The Longstanding stereo matrix encoded format. Extracting four

More information

MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND

MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND MINIMUM TECHNICAL AND EXPLOITATION REQUIREMENTS FOR DIGITAL SOUND BROADCASTING DAB+ RECEIVER DESIGNED FOR POLAND Version 1.0 Prepared by: Technical Subgroup of Digital Sound Broadcasting Committee National

More information

HD Radio FM Transmission System Specifications Rev. F August 24, 2011

HD Radio FM Transmission System Specifications Rev. F August 24, 2011 HD Radio FM Transmission System Specifications Rev. F August 24, 2011 SY_SSS_1026s TRADEMARKS HD Radio and the HD, HD Radio, and Arc logos are proprietary trademarks of ibiquity Digital Corporation. ibiquity,

More information

Introduction to Digital Audio

Introduction to Digital Audio Introduction to Digital Audio Before the development of high-speed, low-cost digital computers and analog-to-digital conversion circuits, all recording and manipulation of sound was done using analog techniques.

More information

1-MINIMUM REQUIREMENT SPECIFICATIONS FOR DVB-T SET-TOP-BOXES RECEIVERS (STB) FOR SDTV

1-MINIMUM REQUIREMENT SPECIFICATIONS FOR DVB-T SET-TOP-BOXES RECEIVERS (STB) FOR SDTV 1-MINIMUM REQUIREMENT SPECIFICATIONS FOR DVB-T SET-TOP-BOXES RECEIVERS (STB) FOR SDTV Standard definition television Terminology SHALL (mandatory): Denotes that the item is mandatory. SHOULD (recommended):

More information

A Digital Audio Watermark Embedding Algorithm

A Digital Audio Watermark Embedding Algorithm Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin Xianghong Tang, Yamei Niu, Hengli Yue, Zhongke Yin School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 3008, China tangxh@hziee.edu.cn,

More information

A Framework for Robust and Scalable Audio Streaming

A Framework for Robust and Scalable Audio Streaming A Framework for Robust and Scalable Audio Streaming Ye Wang, Wendong Huang, Jari Korhonen School of Computing, National University of Singapore {wangye, huangwd, jari}@comp.nus.edu.sg ABSTRACT We propose

More information

Digital Audio Compression

Digital Audio Compression By Davis Yen Pan Abstract Compared to most digital data types, with the exception of digital video, the data rates associated with uncompressed digital audio are substantial. Digital audio compression

More information

FAST MIR IN A SPARSE TRANSFORM DOMAIN

FAST MIR IN A SPARSE TRANSFORM DOMAIN ISMIR 28 Session 4c Automatic Music Analysis and Transcription FAST MIR IN A SPARSE TRANSFORM DOMAIN Emmanuel Ravelli Université Paris 6 ravelli@lam.jussieu.fr Gaël Richard TELECOM ParisTech gael.richard@enst.fr

More information

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional

More information

DAB Digital Radio Broadcasting. Dr. Campanella Michele

DAB Digital Radio Broadcasting. Dr. Campanella Michele DAB Digital Radio Broadcasting Dr. Campanella Michele Intel Telecomponents Via degli Ulivi n. 3 Zona Ind. 74020 Montemesola (TA) Italy Phone +39 0995664328 Fax +39 0995932061 Email:info@telecomponents.com

More information

From Concept to Production in Secure Voice Communications

From Concept to Production in Secure Voice Communications From Concept to Production in Secure Voice Communications Earl E. Swartzlander, Jr. Electrical and Computer Engineering Department University of Texas at Austin Austin, TX 78712 Abstract In the 1970s secure

More information

4 Digital Video Signal According to ITU-BT.R.601 (CCIR 601) 43

4 Digital Video Signal According to ITU-BT.R.601 (CCIR 601) 43 Table of Contents 1 Introduction 1 2 Analog Television 7 3 The MPEG Data Stream 11 3.1 The Packetized Elementary Stream (PES) 13 3.2 The MPEG-2 Transport Stream Packet.. 17 3.3 Information for the Receiver

More information

Chapter 3 ATM and Multimedia Traffic

Chapter 3 ATM and Multimedia Traffic In the middle of the 1980, the telecommunications world started the design of a network technology that could act as a great unifier to support all digital services, including low-speed telephony and very

More information

WHITE PAPER. Ad Insertion within a statistical multiplexing pool: Monetizing your content with no compromise on picture quality

WHITE PAPER. Ad Insertion within a statistical multiplexing pool: Monetizing your content with no compromise on picture quality Ad Insertion within a statistical multiplexing pool: Monetizing your content with no compromise on picture quality Pascal Jezequel, May 2013 Operators or broadcasters can increase their ad revenue by specifically

More information

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different

More information