APPLICATION OF FILTER BANK THEORY TO SUBBAND CODING OF IMAGES

Similar documents
DESIGN AND SIMULATION OF TWO CHANNEL QMF FILTER BANK FOR ALMOST PERFECT RECONSTRUCTION

Admin stuff. 4 Image Pyramids. Spatial Domain. Projects. Fourier domain 2/26/2008. Fourier as a change of basis

Introduction to Medical Image Compression Using Wavelet Transform

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.


SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A

Sachin Patel HOD I.T Department PCST, Indore, India. Parth Bhatt I.T Department, PCST, Indore, India. Ankit Shah CSE Department, KITE, Jaipur, India

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

CHAPTER 2 LITERATURE REVIEW

An Efficient Architecture for Image Compression and Lightweight Encryption using Parameterized DWT

Design of Efficient Digital Interpolation Filters for Integer Upsampling. Daniel B. Turek

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

Understanding CIC Compensation Filters

A HIGH PERFORMANCE SOFTWARE IMPLEMENTATION OF MPEG AUDIO ENCODER. Figure 1. Basic structure of an encoder.



Non-Data Aided Carrier Offset Compensation for SDR Implementation

CHAPTER 6 Frequency Response, Bode Plots, and Resonance

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

Image Compression through DCT and Huffman Coding Technique

Analog-to-Digital Voice Encoding

Matlab GUI for WFB spectral analysis

Probability and Random Variables. Generation of random variables (r.v.)

Superresolution images reconstructed from aliased images

PCM Encoding and Decoding:

Speech Signal Processing: An Overview

Frequency Response of Filters

SIGNAL PROCESSING & SIMULATION NEWSLETTER

Audio Coding, Psycho- Accoustic model and MP3

Region of Interest Access with Three-Dimensional SBHP Algorithm CIPR Technical Report TR

MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN.

An Introduction to Digital Filters

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers

A Novel Method to Improve Resolution of Satellite Images Using DWT and Interpolation

MEDICAL IMAGE COMPRESSION USING HYBRID CODER WITH FUZZY EDGE DETECTION

Redundant Wavelet Transform Based Image Super Resolution

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Khalid Sayood and Martin C. Rost Department of Electrical Engineering University of Nebraska

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

How to Design 10 khz filter. (Using Butterworth filter design) Application notes. By Vadim Kim

A Comparison of Speech Coding Algorithms ADPCM vs CELP. Shannon Wichman

*EP B1* EP B1 (19) (11) EP B1 (12) EUROPEAN PATENT SPECIFICATION

UNIFORM POLYPHASE FILTER BANKS FOR USE IN HEARING AIDS: DESIGN AND CONSTRAINTS

Sampling and Interpolation. Yao Wang Polytechnic University, Brooklyn, NY11201

Introduction to IQ-demodulation of RF-data

FFT Algorithms. Chapter 6. Contents 6.1

HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER

Video-Conferencing System

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet

Final Year Project Progress Report. Frequency-Domain Adaptive Filtering. Myles Friel. Supervisor: Dr.Edward Jones

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

Image Authentication Scheme using Digital Signature and Digital Watermarking

Implementation of Digital Signal Processing: Some Background on GFSK Modulation

Analysis/resynthesis with the short time Fourier transform

Time series analysis Matlab tutorial. Joachim Gross

AUDIO CODING: BASICS AND STATE OF THE ART

A Digital Audio Watermark Embedding Algorithm

COMP 250 Fall 2012 lecture 2 binary representations Sept. 11, 2012

The Effective Number of Bits (ENOB) of my R&S Digital Oscilloscope Technical Paper

Spike-Based Sensing and Processing: What are spikes good for? John G. Harris Electrical and Computer Engineering Dept

Solutions to Exam in Speech Signal Processing EN2300

CBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC

An Algorithm for Automatic Base Station Placement in Cellular Network Deployment

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201

AN-007 APPLICATION NOTE MEASURING MAXIMUM SUBWOOFER OUTPUT ACCORDING ANSI/CEA-2010 STANDARD INTRODUCTION CEA-2010 (ANSI) TEST PROCEDURE

Simulation of Frequency Response Masking Approach for FIR Filter design

Noise Reduction in Video Images Using Coring on QMF Pyramids by. Arthur J. Kalb

Analysis of Filter Coefficient Precision on LMS Algorithm Performance for G.165/G.168 Echo Cancellation

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques

Filter Comparison. Match #1: Analog vs. Digital Filters

A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton

An Implementation of a High Capacity 2D Barcode

Digital Audio and Video Data

JPEG File Interchange Format

(2) (3) (4) (5) 3 J. M. Whittaker, Interpolatory Function Theory, Cambridge Tracts

MODULATION Systems (part 1)

Analog Filters. A common instrumentation filter application is the attenuation of high frequencies to avoid frequency aliasing in the sampled data.

RF Measurements Using a Modular Digitizer

ADC and DAC. Quantization

Performance of Quasi-Constant Envelope Phase Modulation through Nonlinear Radio Channels

Locating and Decoding EAN-13 Barcodes from Images Captured by Digital Cameras

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

Jitter Measurements in Serial Data Signals

TCOM 370 NOTES 99-4 BANDWIDTH, FREQUENCY RESPONSE, AND CAPACITY OF COMMUNICATION LINKS

Laboratory #5: RF Filter Design

(Refer Slide Time: 00:01:16 min)

Analog Representations of Sound

Transcription:

EC 623 ADVANCED DIGITAL SIGNAL PROCESSING TERM-PROJECT APPLICATION OF FILTER BANK THEORY TO SUBBAND CODING OF IMAGES Y. PRAVEEN KUMAR 03010240 KANCHAN MISHRA 03010242 Supervisor: Dr. S.R.M. Prasanna Department of Electronics & Communication Engineering Indian Institute of Technology, Guwahati

Abstract This project has been implemented as a part of EC 623 course- Advanced Digital Signal Processing, under the supervision of Dr. SRM Prasanna. The aim of this project is to study the basic principles of sub-band coding using the filter bank approach and its application in decomposition and coding of images. Also we wish to demonstrate the concept of sub-band coding of images by making use of some standard sub-band coding filters. 1. INTRODUCTION One of the popular schemes for decomposition and coding of speech and image signals is Transform coding technique. Transform coding techniques decompose the source output into different frequency bands using block transforms. The transform coefficients have different characteristics and differing perceptual importance. These differences are exploited in allocating bits for encoding the different coefficients. This variable bit allocation results in a decrease in the average number of bits required to encode the source output. One of the drawbacks of transform coding is the artificial division of the source output into blocks, which results in the generation of coding artifacts at the block edges, or blocking. A popular approach to decomposing the image into different frequency bands without the imposition of an arbitrary block structure is sub-band coding. After the input has been decomposed into its constituents, we can use the coding technique best suited to each constituent to improve compression performance. Furthermore, each component of the source output may have different perceptual characteristics. Quantization error that is perceptually objectionable in one component may be acceptable in a different component of the source output. Therefore, a coarser quantizer may be used for perceptually less important components. This is how the concept of sub-band coding comes into picture. 2. SUB-BAND CODING ALGORITHM The basic sub-band coding scheme is shown in Fig. 1. 2.1 ANALYSIS Fig. 1 Block diagram of sub-band coding system The source output is passed through a bank of filters, called the analysis filter bank, which covers the range of frequencies that make up the source output. The passbands of

the filter can be non-overlapping or overlapping. Non-overlapping and overlapping filter banks are shown in Fig. 2. The outputs of the filters are then sub-sampled. The justification for the sub-sampling is the Nyquist Rule and its generalization, which tells us that we need only twice as many samples per second as the range of frequencies. This means that we can reduce the number of samples at the output of the filter because the range of frequencies at the output of the filter is less than the range of frequencies at the input to the filter. This process of reducing the number of samples is called decimation, or down-sampling. The amount of decimation depends on the ratio of the band-width of the filter output to the filter input. If the band width at the output of the filter is 1/M of the band-width at the input to the filter, we would decimate the output by a factor of M by keeping every Mth sample. Once the output of the filters has been decimated, the output is encoded using one of the several schemes, including ADPCM, PCM, and vector quantization. Fig. 2 Non-overlapping and overlapping filter banks 2.2 QUANTIZATION & CODING Along with the selection of the compression scheme, the allocation of bits between the sub-bands is an important design parameter. Different sub-bands contain different amounts of information. Therefore, we need to allocate the available bits among the subbands according to some measure of the information content. The bit allocation procedure can have significant impact on the quality of reconstruction, especially when the information content of different bands is very different. If we use the variance of the output of each filter as a measure of information, and assume that the compression scheme is scalar quantization, we can arrive at several simple bit allocation schemes. One of the frequently used schemes for quantization and coding of transform coefficients is described below. If the amount of information conveyed by each coefficient is different, it makes sense to assign differing number of bits to the different coefficients. There are two approaches to assigning bits. One approach relies on the average properties of the transform coefficients, while the other approach assigns bits as needed by individual transform coefficients.

In the first approach, we first obtain an estimate of the variances of the transform coefficients. These estimates can be used by one of the two algorithms to assign the number of bits used to quantize each of the coefficients. We assume that the relative variance of the coefficients corresponds to the amount of information contained in each coefficient. Thus, coefficients with higher variance are assigned more bits than coefficients with smaller variance. We use the bit allocation that minimizes the distortion caused by quantization and encoding. If the average number of bits to be used per sample by the transform coding is R, and the average number of bits per sample used by the k th coefficient is R k, then (1) where, M is the number of transform coefficients. The reconstruction error variance for the kth quantizer is related to the kth quantizer input variance by the following: where, is a factor that depends on the input distribution and the quantizer. (2) The total reconstruction error is given by (3) The objective of the bit allocation procedure is to find R k to minimize (3) subject to the constraint of (1). The following expression for R k, minimizes (3), (4) Although these values of R k minimize (3), they are not guaranteed to be integers, or even positive. The standard approach at this point is to set the negative R k s to zero. This will increase the average bit rate above the constraint. Therefore, the non-zero R k s are uniformly reduced until the average rate is equal to R. The second algorithm that uses estimates of the variance is a recursive algorithm and functions as follows: 1. Compute for each coefficient. 2. Set R k =0 for all k and set R b =MR, where R b is the total number of bits available for distribution. 3. Sort the variances { }. Suppose is the maximum. 4. Increment R l by 1, and divide by 2. 5. Decrement R b by 1. If R b =0, then stop; otherwise go to 3. If we follow this procedure, we end up allocating more bits to the coefficients with higher variance. This form of bit allocation is called zonal sampling. The advantage to this approach is its simplicity. Once the bit allocation has been obtained, every coefficient at a particular location is always quantized using the same number of bits. The disadvantage

is that, because the bit allocations are performed based on average value, variations that occur on the local level are not reconstructed properly. This problem can be avoided by using a different approach to bit allocation known as threshold coding. In this approach, which coefficient to keep, and which to discard, is not decided a priori. In the simplest form of threshold coding, we specify a threshold value. Coefficients with magnitude below this threshold are discarded, while the remaining are quantized and transmitted. The information about which coefficients have been retained is sent to the receiver as side information. An efficient approach is to scan the block of transformed coefficients in a zig-zag fashion. If we scan the block in this fashion, a large section of the tail-end of the scan will consist of zeros. 2.3 SYNTHESIS The quantized and coded coefficients are used to reconstruct a representation of the original signal at the decoder. First, the encoded samples from each sub-band are decoded at the receiver. These decoded values are then upsampled by inserting an appropriate number of 0s between the samples. Once the number of samples per second is brought back to the original rate, the upsampled signals are passed through a bank of reconstruction filters. The outputs of the reconstruction filters are added to give the final reconstructed output. Thus, the three major components of sub-band system are analysis and synthesis filters, the bit allocation scheme, and the encoding scheme. 3. SOME FILTERS USED IN SUB-BAND CODING The most frequently used filters in sub-band coding consist of a cascade of stages, where each stage consists of a low-pass filter and a high-pass filter. The most popular among these are quadrature mirror filters (QMF). These filters have the property that if the impulse response of the low-pass filter is given by {h n }, then the high-pass impulse response is given by {(-1) n h N-1-n }. The QMF filters designed by Johnston are widely used in a number of applications. The filter coefficients for 8-, and 16- tap filters are given in Tables 1 and 2. These filters are symmetric: h n = h N-1-n n = 0,1,2,.,N/2-1. (5) The filters with fewer taps are less efficient in their decomposition than the filters with more taps. However, the number of taps dictates the number of multiply-add operations necessary to generate the filter outputs. Thus, more efficient decomposition are obtained at the cost of increasing the computation. Table 1 Coefficients for the 8-tap Johnston low-pass filter. h0, h7 0.00938715 h1, h6 0.06942827 h2, h5-0.07065183 h3, h4 0.48998080

Table 2 Coefficients for the 16-tap Johnston low-pass filter. h0, h15 0.002898163 h1, h14-0.009972252 h2, h13-0.001920936 h3, h12 0.03596853 h4, h11-0.01611869 h5, h10-0.09530234 h6, h9 0.1067987 h7, h8 0.4773469 Another popular set of filters is the Smith-Barnwell filters, some of which are shown in Table 3. These families of filters differ in a number of ways. The magnitude transfer functions of 8-tap filters of these two families are plotted in Fig. 3. As can be seen, the cut-off for the Smith- Barnwell filter is much sharper than the cut-off for the Johnston filter. This means that the separation provided by the 8-tap Johnston filter is not as good as that provided by the 8-tap Smith-Barnwell filter. 0 M a g n i t u d e r e s p o n s e o f 8 - T a p J o h n s t o n a n a l y s i s f i l t e r s - 5-1 0-1 5-2 0 db - 2 5-3 0-3 5-4 0-4 5 0 5 0 0 1 0 0 0 1 5 0 0 2 0 0 0 2 5 0 0 3 0 0 0 3 5 0 0 4 0 0 0 F r e q u e n c y i n H Z - > M a g n i t u d e r e s p o n s e o f 8 - T a p S m i t h - B a r n w e l l a n a l y s i s f i l t e r s 1 0 0-1 0-2 0-3 0 db - 4 0-5 0-6 0-7 0-8 0 0 5 0 0 1 0 0 0 1 5 0 0 2 0 0 0 2 5 0 0 3 0 0 0 3 5 0 0 4 0 0 0 F r e q u e n c y i n H Z - > Fig. 3 Magnitude transfer functions of the (a) 8-tap Johnston, and (b) 8-tap Smith- Barnwell Filters. 3.1 M-BAND QMF FILTER BANKS Various filter bank structures have been studied, in order to find filters that are simple to implement and provide good separation between the frequency bands. This section discusses some of the techniques used in the design of filter banks.

Table 3 Coefficients of the 8-tap Smith-Barnwell low-pass filter. h0 0.0348975582178515 h1-0.01098301946252854 h2-0.06286453934951963 h3 0.223907720892568 h4 0.556856993531445 h5 0.357976304997285 h6-0.02390027056113145 h7-0.07594096379188282 In many applications it is necessary to divide the input into multiple bands. We can do this by a recursive two-band splitting as shown in Fig. 4, or we can obtain banks of filters that directly split into multiple bands. Given that we have good filters that provide twoband splitting, recursive splitting is an efficient way to design M-band QMF. However, even when the spectral characteristics of two-band filters are quite good, when applied to the tree structure of Fig. 4, the spectral characteristic may not be very good. This is because as we proceed through the tree, there is significant overlap between the bands, and hence there will be significant amount of aliasing in each band. Fig. 4. Decomposition of an input sequence into multiple bands by recursively using a two-band split. An M-band filter bank has two set of filters. The input sequence is split into M frequency bands using analysis bank of M-filters, and reconstructed by passing the sub-bands through synthesis bank of M-filters. 4. APPLICATION TO IMAGE COMPRESSION So far, we have been concerned with sub-band coding of one-dimensional sequences. With two-dimensional sequences such as image, we need to use two-dimensional filters that separate the output into components based on both horizontal and vertical frequencies. Fortunately, in most cases, this two-dimensional filter can be implemented as two- one dimensional filters, which can be applied first in one-dimension, then in the other dimension. Filters that have this property are called separable filters.

Generally, for sub-band coding of images we filter each row of the image separately by using a high pass and a low-pass filter. The output of the filter is decimated by a factor of two. Assume that the images were of size NxN. After the first stage, we have two NxN/2 images. We then process each image column-wise, and obtain four images of size N/2 x N/2. We can stop at this point or continue the decomposition process with one or more of the sub-images. Generally, of the four original sub-images, only one or two are further decomposed. The reason for not decomposing the other sub-images is that many of the pixel values in the high frequency sub-images are close to zero. Thus, there is little reason to spend computational power to decompose these sub-images. 4.1 DECOMPOSING AN IMAGE Earlier a set of filters was provided to be used in one-dimensional sub-band coding. We can use those same filters to decompose an image into its sub-bands. Original Image Decomposed image using 8-tap Johnston Filterbank (a) (b) Decomposed image using 16-tap Johnston Filterbank Decomposed image using 8-tap Smith-Barnwell Filterbank (c) (d) Fig. 5 Decomposition of (a) original image, using (b) Johnston 8-tap, (c) Johnston 16-tap, (d) Smith-Barnwell.

As can be seen, there is much less energy in the higher sub-bands. This difference in energy compaction can have drastic effect on the reconstruction. Fig. 5(c) shows the decomposition of the image using 16-tap Johnston Filter. As is evident, increasing the size of the filter improves the energy compaction. Fig. 5(d) illustrates decomposition of an image using 8-tap Smith-Barnwell filter. The results are almost identical to a 16-tap Johnston filter. Therefore, rather than increase the computational load by going to a 16- tap filter, we can keep the same computational load and use a different filter. 4.2 CODING THE SUB-BANDS Once we have decomposed an image into sub-bands, we need to find the best encoding scheme for coding each sub-band. In this project, we use the coding scheme explained in 2.2. Fig. 6(b), and Fig. 6(d) compare the image coded at 0.5 bits per pixel using 8-tap Johnston filter and 8-tap Smith-Barnwell filter. Original Image reconstructed image for johnston 8tap for johnston 16 tap for smith-barnwell Fig. 6 (a) Original image, (b)image coded at 0.5-bits per pixel using 8-tap Johnston filter, (c) 16-tap Johnston filter, (d) 8-tap Smith-Barnwell filter. In the image decomposed using Johnston filter (Fig.5 (b)), there was significant energy in the high-low band. The algorithm allocated 1-bit to the low-low band and 1-bit to the high-low band. This resulted in poor encoding for both, and subsequently, poor reconstruction. There was very low energy in any of the bands other than the low-low band for the image decomposed using Smith-Barnwell filter. Therefore, the bit allocation

algorithm assigned both bits to the low-low band which provided a reasonable reconstruction. The issue of energy compaction becomes a very important factor in the reconstruction quality. Filters that allow for more energy compaction permit the allocation of bits to a smaller number of sub-bands. This in turn results in a better reconstruction. 5. CONCLUSION Sub-band coding is any form of transform coding that breaks a signal into a number of different frequency bands and encodes each one independently. It is an important extension of Filter-bank theory and widely used in data compression. 6. REFERENCES [1] Subband coding of images, John W.Woods, Sean D. Oneil, IEEE transactions on acoustic speech and signal processing, vol - assp 34. [2] Subband coding : Application to image compression, 'Data Compression', Khalid Sayood, Morgan series, 2nd edition. [3] Subband coding of images using vector quantization, Peter H. Westerink, Dick E. Boekee, Jan Biemond, IEEE transactions on communications, vol -36. [4] Optimized subband coding of images, Kyu-Sik Park, Sung-Chan Hong, Hyeong Kyo kim, Proceedings of IEEE asia pacific conference on circuits and systems.