CM0340 SOLNS. Do not turn this page over until instructed to do so by the Senior Invigilator.



Similar documents
Figure 1: Relation between codec, data containers and compression algorithms.

H 261. Video Compression 1: H 261 Multimedia Systems (Module 4 Lesson 2) H 261 Coding Basics. Sources: Summary:

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201

For Articulation Purpose Only

Video-Conferencing System

FFT Algorithms. Chapter 6. Contents 6.1


Image Compression through DCT and Huffman Coding Technique

Introduction to image coding

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

Final Year Project Progress Report. Frequency-Domain Adaptive Filtering. Myles Friel. Supervisor: Dr.Edward Jones

Reading.. IMAGE COMPRESSION- I IMAGE COMPRESSION. Image compression. Data Redundancy. Lossy vs Lossless Compression. Chapter 8.

Performance Analysis and Comparison of JM 15.1 and Intel IPP H.264 Encoder and Decoder

*EP B1* EP B1 (19) (11) EP B1 (12) EUROPEAN PATENT SPECIFICATION

H.264/MPEG-4 AVC Video Compression Tutorial

The continuous and discrete Fourier transforms

Compression techniques

4 Digital Video Signal According to ITU-BT.R.601 (CCIR 601) 43

Data Storage. Chapter 3. Objectives. 3-1 Data Types. Data Inside the Computer. After studying this chapter, students should be able to:

TTT4120 Digital Signal Processing Suggested Solution to Exam Fall 2008

Video compression: Performance of available codec software

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm

Overview: Video Coding Standards

Classes of multimedia Applications

(Refer Slide Time: 01:11-01:27)

THE EMERGING JVT/H.26L VIDEO CODING STANDARD

Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Introduction to Medical Image Compression Using Wavelet Transform

Data Storage 3.1. Foundations of Computer Science Cengage Learning

Introduction to Digital Filters

A NEW LOSSLESS METHOD OF IMAGE COMPRESSION AND DECOMPRESSION USING HUFFMAN CODING TECHNIQUES

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201

Digital Audio Compression: Why, What, and How

White paper. H.264 video compression standard. New possibilities within video surveillance.

Signal to Noise Instrumental Excel Assignment

Probability and Random Variables. Generation of random variables (r.v.)

Figure1. Acoustic feedback in packet based video conferencing system

Understanding Network Video Security Systems

Understanding Compression Technologies for HD and Megapixel Surveillance

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

LOW COST HARDWARE IMPLEMENTATION FOR DIGITAL HEARING AID USING

SECTION 6 DIGITAL FILTERS

Comparison of different image compression formats. ECE 533 Project Report Paula Aguilera

Auto-Tuning Using Fourier Coefficients

Video Encryption Exploiting Non-Standard 3D Data Arrangements. Stefan A. Kramatsch, Herbert Stögner, and Andreas Uhl

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet

RECOMMENDATION ITU-R BO.786 *

A Proposal for OpenEXR Color Management

Advanced Signal Processing and Digital Noise Reduction

TTT4110 Information and Signal Theory Solution to exam

Information, Entropy, and Coding

Implementation of Digital Signal Processing: Some Background on GFSK Modulation

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

Understanding HD: Frame Rates, Color & Compression

Multihypothesis Prediction using Decoder Side Motion Vector Derivation in Inter Frame Video Coding

Digital Video Coding Standards and Their Role in Video Communications

Video codecs in multimedia communication

Frequency Response of FIR Filters

The Fourier Analysis Tool in Microsoft Excel

Software Audio Processor. Users Guide

How To Improve Performance Of The H264 Video Codec On A Video Card With A Motion Estimation Algorithm

A JPEG Decoder Implementation in C Chris Tralie ELE 201 Fall 2007

Direct and Reflected: Understanding the Truth with Y-S 3

We are presenting a wavelet based video conferencing system. Openphone. Dirac Wavelet based video codec

LMS is a simple but powerful algorithm and can be implemented to take advantage of the Lattice FPGA architecture.

IMPACT OF COMPRESSION ON THE VIDEO QUALITY

REIHE INFORMATIK 7/98 Efficient Video Transport over Lossy Networks Christoph Kuhmünch and Gerald Kühne Universität Mannheim Praktische Informatik IV

Design of FIR Filters

Transform-domain Wyner-Ziv Codec for Video

Analysis/resynthesis with the short time Fourier transform

Em bedded DSP : I ntroduction to Digital Filters

SIGNAL PROCESSING & SIMULATION NEWSLETTER

Audio Coding, Psycho- Accoustic model and MP3

How to Send Video Images Through Internet

Introduction to Digital Audio

Time and Frequency Domain Equalization

PCM Encoding and Decoding:

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

RF Measurements Using a Modular Digitizer

Fast Arithmetic Coding (FastAC) Implementations

Convolution, Correlation, & Fourier Transforms. James R. Graham 10/25/2005

Multidimensional Transcoding for Adaptive Video Streaming

Audio Analysis & Creating Reverberation Plug-ins In Digital Audio Engineering

Convolution. The Delta Function and Impulse Response

Motion Estimation. Macroblock Partitions. Sub-pixel Motion Estimation. Sub-pixel Motion Estimation

Department of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP

Automatic Detection of Emergency Vehicles for Hearing Impaired Drivers

The Z transform (3) 1

The Calculation of G rms

The Essence of Image and Video Compression 1E8: Introduction to Engineering Introduction to Image and Video Processing

Transcription:

CARDIFF UNIVERSITY EXAMINATION PAPER Academic Year: 2008/2009 Examination Period: Examination Paper Number: Examination Paper Title: SOLUTIONS Duration: Autumn CM0340 SOLNS Multimedia 2 hours Do not turn this page over until instructed to do so by the Senior Invigilator. Structure of Examination Paper: There are 14 pages. There are 4 questions in total. There are no appendices. The mark obtainable for a question or part of a question is shown in brackets alongside the question. Students to be provided with: The following items of stationery are to be provided: ONE answer book. Instructions to Students: Answer 3 questions. The use of translation dictionaries between English or Welsh and a foreign language bearing an appropriate departmental stamp is permitted in this examination. 1 PLEASE TURN OVER

Q1. (a) Briefly outline the four broad classes of approach that one may exploit to compress multimedia data. Do not detail any specific compression algorithms. Compression basically employs redundancy in the data: Temporal in 1D data, 1D signals, Audio etc. correlation between sequential data points. Spatial correlation between neighbouring pixels or data items in 2D. Spectral This uses the frequency domain to exploit relationships between frequency of change in data. E.g. in video/imagery, correlation between colour or luminescence components. Psycho-visual exploit perceptual properties of the human auditory/visual system. 8 MARKS BOOKWORK Give one example of a compression algorithm for each class. EXAMPLES: Temporal Any Audio/Video compression method, Zero length suppression, pattern substitution, Pulse Code Modulation (a few variants), MPEG Audio, MPEG Video, H.264 [1] Spatial Any Image/Video compression algorithm, GIF, JPEG, MPEG, H.264. [1] Spectral JPEG, MPEG, H.264. [1] Psycho-visual MPEG audio, MPEG Video, JPEG (colour conversion). [1] 4 MARKS BOOKWORK (b) What advantage does arithmetic coding offer over Huffman coding for data compression? Huffman coding assumes an integer number (k) of bits for each symbol hence k is never less than 1 [1] Arithmetic coding can represent fractional number of bits and can achieve better compression ratios. [1] 2 MARKS BOOKWORK (c) Briefly state an algorithm for arithmetic decoding. SEEN In LECTURE Coding: The idea behind arithmetic coding is To have a probability line, 0 1, and Assign to every symbol a range in this line based on its probability, 2

Order in terms of probability highest first. Note: The higher the probability, the higher range which assigns to it. For each symbol in the sequence assign a code based in symbols probability and then subdivide for all the symbols: range = high - low; high = low + range * high_range of the symbol being coded; low = low + range * low_range of the symbol being coded; Decoding is the opposite so need to work out (unseen in lectures) For current code value: look up in table and assign symbol [1] Eliminate symbol effect by subtracting the low value in the range and divide by range Repeat above two steps until zero reached see last part of problem. Total 5 marks Unseen. (Coding algorithm discussed in lectures, decoding simply mentioned as the reverse process) (d) Given the following table of frequency counts, probabilities and probability ranges for the following characters: Char Freq Prob. Range A 2 0.5 [0.0, 0.5) B 1 0.25 [0.5, 0.75) C 1 0.25 [0.75, 1.0) What is the 4 character sequence for the arithmetic coding: 0.59375? Char Code-Low Range B 0. 0.59375-0.5 = 0.09375 0.09375/0.25 = 0.375 A 0.375-0.0 = 0.375 0.375/0.5 = 0.75 C 0.75-0.75 = 0.0 0.0/0.25 = 0.0 A 0.0-0.0 = 0.0 0.0/0.5 = 0.0 4 marks for each step in computation of the code It is possible for the decoder to return a zero value which corresponds to the symbol in a probability range rather than the end of the decoding process. How can this problem be avoided in the arithmetic decoder? As can be seen in the above decoding it third step decoding to C returns 0 but we need A also 0 as last step. Solution: Need some end of input (end-of-file) additional symbol. 4 Marks TOTAL 8 MARKS UNSEEN. Coding algorithm discussed in lectures, decoding simply mentioned as the reverse process. 3 PLEASE TURN OVER

Q2. (a) In a digital signal processing system, what are meant by block and sample-bysample processing. Give one example of an application of each type. Block processing : data is transferred into a memory buffer and then processed each time the buffer is filled with new data. [1] E.g. fast Fourier transforms (FFT), Discrete Cosine Transform (DCT), convolution, convolution reverb more soon [1] Sample-by-sample processing : input is processed on individual sample data. [1] E.g. volume control, envelope shaping, modulation (ring/amplitude), IIR/- FIR filtering... [1] 4 Marks Total Bookwork (b) Give definitions of the transfer function and frequency response of a digital system, in terms of its impulse response. Given an impulse response h(n) simply apply the Z-Transform: H(z) = n= h(n).z n to get the transfer function H(z). [1] Similarly apply the Fourier Transform: H(f) = h(n).e i2πfn/fs n= to get the Frequency Response H(f). [1] 2 Marks Total - Bookwork 4

(c) Briefly discuss three algorithmic approaches to implementing filtering in a digital system Infinite Impulse Response Filter (IIR) : A simple IIR system can be described as follows: y(n) = x(n) a 1 y(n 1) a 2 y(n 2) The output signal y(n) is fed back through a series of delays Each delay is weighted Fed back weighted delay summed and passed to new output. Such a feedback system is called a recursive system x(n) + y(n) T y(n 1) = x H1 (n) a 1 T y(n 2) = x H2 (n) a 2 1 5 PLEASE TURN OVER

Finite Impulse Response Filter (FIR) : A simple FIR system can be described as follows: y(n) = b 0 x(n) + b 1 x(n 1) + b 2 x(n 2) The input is fed through delay elements Weighted sum of delays give y(n) x(n) + y(n) b 0 T x(n 1) = x H1 (n) T b 1 x(n 2) = x H2 (n) Fourier Space Filtering : b 2 F (u, v) is the Fourier transform of the original image, H(u, v) is a filter function (in Fourier Space) could be inverse of Fourier Transform of a real space filter function, G(u, v = H(u, v)f (u, v)) is the Fourier transform of the improved image. Inverse Fourier transform G(u, v) to get g(x, y) our improved image TOTAL 9 Marks 3 marks per method 1 6

(d) Given the following difference equation construct its signal flow diagram: y(n) = 6x(n) + 3x(n 1) + 1x(n 2) 5y(n 1) 4y(n 2) Solution: x(n) + + y(n) 6 T + 5 T 3 4 1 [6] 6 Marks Unseen. Possible other less efficient solutions with more than two delay units (not shared for y(n 1)/x(n 1)... tapping award less marks if so 1 7 PLEASE TURN OVER

(e) Given the following signal flow diagram construct its difference equation, y(n): 0.5 x(n) 0.8 + T 3 (= z 3 ) + y(n) -0.3 x h (n) = x(n) 0.3x(n 3) y(n) = 0.5x h (n) + 0.8x h (n 3) Trick is to break up feedback loop into sub equation x h (n). 0.5 0.5x h (n) x(n) x h (n) 0.8 x 1 h (n 3) T 3 (= z 3 ) + + y(n) -0.3 6 Marks Unseen. [6] 8

Q3. (a) What is the difference between reverb and echo? Echo implies a distinct, delayed version of a sound, [1] Reverb each delayed sound wave arrives in such a short period of time such that we do not perceive each reflection as a copy of the original sound. [1] TOTAL 2 Marks Bookwork (b) Give the names of two filter based approaches to simulating the reverb effect in digitial audio. Schroeder s Reverberator [1] Moorer s Reverberator [1] Comment on how one approach builds on the other and how filters are used to achieve the desired effect.par Schroeder s Reverberator : Early digital reverberation algorithms tried to mimic the a rooms reverberation by primarily using two types of infinite impulse response (IIR) filters. Comb filter usually in parallel banks Allpass filter usually sequentially after comb filter banks A delay is (set via the feedback loops allpass filter) aims to make the output would gradually decay. Moorer s Reverberator : Moorer s reverberator build s on Schroeder: Parallel comb filters with different delay lengths are used to simulate modes of a room, and sound reflecting between parallel walls [1] Allpass filters to increase the reflection density (diffusion). [1] Lowpass filters inserted in the feedback loops to alter the reverberation time as a function of frequency [1] Shorter reverberation time at higher frequencies is caused by air absorption and reflectivity characteristics of wall). [1] Implement a dc-attenuation, and a frequency dependent attenuation. [1] Different in each comb filter because their coefficients depend on the delay line length [1] 6 Marks Bookwork 9 PLEASE TURN OVER

(c) State one alternative approach to reverb simulation that does not employ filters. Convolution Reverb [1] 1 Mark Bookwork Briefly, giving no mathematical detail, describe how this approach is implemented. record impulse response of room g(x), input audio is f(x) [1] compute Fourier transform of impulse response G(u and audio signal, F (u) [1] compute convolution of two signals, multiply both Fourier transforms H(u) = F (u).g(u) [1] compute inverse Fourier transform of H(u), h(x) the reverberated signal.[1] Total 4 marks Bookwork (d) For each of the three reverb methods you have described above discuss how, in the following two scenarios, the sounds recorded by the microphone could be modelled: i. A long hallway where the long walls are lined with a high frequency absorbing acoustic panels. The sound source is placed at one end of the hallway and a microphone is placed at the other end. Schroeder s Reverberator : estimate time of sound bouncing down corridor to set delay estimate some filtering of high frequencies Moorer s Reverberator : estimate time of sound bouncing down corridor to set comb filter delay estimate some filtering of high frequencies for allpass filters and lowpass filters Convolution Reverb : Record impulse response of hallway Perform convolution reverb computation. Total 6 marks unseen 10

ii. A cardoid microphone is a microphone that accepts sound from the front and sides but not the back of the microphone. In a square recording studio, with uniform surfaces, a cardoid microphone is placed directly facing a sound source a few feet away. Schroeder s Reverberator : Not much one can model except as before, estimate time of set short delay to account for no reflections recorded and estimate some filtering of high frequencies Can t easily model Cardoid response Moorer s Reverberator : There will be little recording of back reflections so allow little feedback to comb filters Tapped delay lines which simulate early reflections could have delay and frequency filters set. Convolution Reverb : Record impulse response of hallway with a cardoid microphone. Perform convolution reverb computation. Total 6 marks unseen 11 PLEASE TURN OVER

Q4. (a) Briefly outline the basic principles of Intra-Frame Coding in Video Compression, as used in MPEG-2. This is a basic Intra Frame Coding Scheme is as follows: Convert tp more effective color space: YUV (YCbCr). [1] A macroblock usually consists of 4 Y blocks, 1 Cr block, and 1 Cb block. (4:2:0 chroma subsampling) Since eye most sensitive luminance, less sensitive chrominance. [1] Break frame up into macroblocks which are typically 16x16 pixels. [1] Perform DCT on each Macroblock [1] Quantization is by constant value for all DCT coefficients. I.e., no quantization table as in JPEG. [1] Zig-zag vectorisation of quantised DCT coefficients [1] Run length encoding (RLE) on zig-zaf vector [1] Huffman coding on RLE values [1] Total 8 Marks bookwork (b) What is the key difference between I-Frames, P-Frames and B-Frames? I-Frame Basic Reference frame for each Group of pictures essentially a JPEG Compressed image. [1] P-Frame Coded forward Difference frame w.r.t last I or P frame [1] B-Frame Coded backward Difference frame w.r.t last I or P frame [1] Total 3 Marks Bookwork 12

(c) Why are I-frames inserted into the compressed output stream relatively frequently? Differences between frames get too large large errors hard to track fast blocks etc. So need to restart card with a new I-frame. Total 2 Marks Bookwork (d) Given the following coding order of a group of frames in MPEG-2: I P B B B P B B B I B B B I P B P 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 What is display order of the frames? Display Order is: I B B B P B B B P I B B B I B P P 1 3 4 5 2 7 8 9 6 10 11 12 13 14 16 15 17 [7] 2 Marks for decoding the first IBBBP 1 Mark for decoing next BBBP (essentially a repeat of first block) 2 Marks for next IBBB I frames cant change order so no change in order. 2 Marks for IBPP only one (first in this sequence) P frame changes order. 7 Marks Unseen 13 PLEASE TURN OVER

(e) The following macroblock window has a best sum of absolute difference (SAD) match of 1 to a given MPEG Interframe search: 4 2 3 5 Should inter or intraframe coding be employed to code this macroblock, and why? Method from lecture notes: Based upon the motion estimation a decision is made on whether INTRA or INTER coding is made. To determine INTRA/INTER MODE we do the following calculation: MB mean = ΣN 1 i=0,j=0 C(i,j) N A = Σ n,m i=0,j=0 C(i, j) MB mean If A < (SAD 2N) INTRA Mode is chosen. So for this problem: MB mean = 7 A = 4 7 + 2 7 + 3 7 + 5 7 = 3 + 5 + 4 + 2 = 14 SAD 2N = 3 So A is not less than (SAD 2N) we choose INTER FRAME Coding [1] 7 marks unseen problem application of bookwork formula. 14X END OF EXAMINATION