DISCOVER Monoview Video Codec
|
|
|
- Scott Hudson
- 10 years ago
- Views:
Transcription
1 DISCOVER Monoview Video Codec Fernando Pereira Instituto Superior Técnico, Portugal on behalf of the DISCOVER project DISCOVER Workshop on Recent Advances in Distributed Video Coding November 6, 007, Lisboa - Portugal
2 Outline. DVC Before DISCOVER. PRISM Solution (Univ. Berkeley). Feedback-channel based Solution (Univ. Stanford). DISCOVER Threats and Opportunities. Distributed Video Coding: The Challenges. Promising Applications. DISCOVER Monoview Video Codec. Architecture. Problems to Address. Encoder Modules 4. Decoder Modules 4. DISCOVER Performance 5. DISCOVER Future
3 DVC Before DISCOVER
4 The DVC World in 004 PRISM (Power-efficient, Robust, high compression Syndrome based Multimedia coding) solution developed at Univ. Berkeley by Prof. Ramchandran s team. 004 Feedback-channel based solution developed at Univ. Stanford by Prof. Girod s team. 4
5 PRISM: Encoder Encoder: Divides frame n in blocks. Selects skip/intra/wz coding with different syndrome modes based on Frame Differences. For WZ blocks, sends a syndrome and a CRC (hash). 5
6 PRISM: Encoder LSB MSB Channel Code Encoding of a WZ block: Compute DCT. Syndrome on LSB of low frequency coefficients. CRC for these low frequency coefficients. Conventional coding for high frequency coefficients. Position of low vs high frontier depends on correlation strength. 6
7 PRISM: Decoder Decoder: For every WZ block in frame n: Motion search at the decoder by triying every Side Information candidate block in frame (n-). Corrects the (lower frequency) DCT coefficients with the syndrome. Check if the CRC is correct. Keeps the first predictor that provides a correct CRC. Joints the high frequency coefficients which have been conventionally coded. 7
8 Feedback-channel Solution: Encoder Encoder: Creates groups of frames with one key frame and (N-) WZ coded frames. For every WZ frame: DCT + bitplanes (T-domain) or bitplanes of pixel values (pixel domain). Bitplanes are fed to a turbo encoder and parity bits are generated to be send on (decoder) request. 8
9 Feedback-channel Solution: Decoder Decoder: Constructs an estimation of the WZ frame using motion compensated interpolation using previous (n-) and next (n+) key frames sent conventionally (if GOP size=). Corrects the bitplanes of the estimation using the received parity bits and a noise correlation model. Requests more parity bits to the encoder through the feedback channel if necessary to converge the decoding process. 9
10 Pros and Cons PRISM (Univ. Berkeley) Block base approach No need for a feedback channel for rate control A fixed high number of bits per WZ coded block Encoder more complex (mode decision) Decoder more complex (motion search) Feedback-based codec (Univ. Stanford) Finer rate control Simpler encoder and decoder (?) Frame based approach Feedback channel, latency 0
11 DISCOVER Threats and Opportunities
12 The Challenges The Conceptual Challenge The Coding Efficiency Challenge The Complexity Challenge The Error Robustness Challenge The Scalability Challenge The Multiview Video Challenge
13 Conventional Coding is a Winner
14 Emerging Challenges Applications (from down-link to up-link) Wireless digital video cameras Multimedia mobile phones and PDAs Low-power video sensors and surveillance cameras Wireless video teleconferencing systems Requirements Light and flexible distribution codec complexity Robustness to packet/frame losses High compression efficiency Low latency Target Inter coding efficiency Intra coding complexity (encoder) Intra coding robustness Heavy encoder light Transcoding Light decoder 4
15 DISCOVER Studied Applications. Wireless Video Cameras. Wireless Low-Power Surveillance. Visual Sensor Networks 4. Networked Camcorders 5. Distributed Video Streaming 6. Mobile Document Scanner 7. Video Conferencing with Mobile Devices 8. Mobile Video Mail 9. Disposable Video Cameras 0. Multiview Image Acquisition. Wireless Capsule Endoscopy 5
16 DISCOVER Monoview Codec 6
17 Selecting an Architecture PRISM (Univ. Berkeley) Block base approach No need for a feedback channel for rate control A fixed high number of bits per WZ coded block Encoder more complex (mode decision) Decoder more complex (motion search) Feedback-based codec (Univ. Stanford) Finer rate control Simpler encoder and decoder (?) Frame based approach Feedback channel, latency 7
18 Selecting an Architecture PRISM (Univ. Berkeley) Block base approach No need for a feedback channel for rate control DIFFICULT TO OBTAIN DETAILED SPECIFICATION A fixed high number of bits per WZ coded block Encoder more complex (mode decision) Decoder more complex ( motion search) September 007 Feedback-based codec (Univ. Stanford) Finer rate control Simpler encoder and decoder (?) A BIT LESS DIFFICULT TO OBTAIN DETAILED SPECIFICATION SOFTWARE IMPLEMENTATION AVAILABLE (from IST and VISNET) Frame based approach Feedback channel, latency 8
19 DISCOVER Architecture Wyner-Ziv Encoder 8 Wyner-Ziv Decoder a b c d 8a 8b 8c 8d T Q Channel Encoder Buffer Channel Decoder Decoder Succ. / Failure Q - and Reconst. T - e Minimum Rate Estimation 7 7b Soft Input Computation 7a T 6 Virtual Channel Model 4 5 WZ and Conventional Video Splitting Conventional Video Encoder Conventional Video Decoder Side Information Extraction Based on the feedback-channel solution from Univ. Stanford. Based on a split between Wyner-Ziv (WZ) and key frames. Key frames used with a regular (GOP size) or dynamic periodicity. Key frames coded with H.64/AVC Intra. 9
20 Main Problems to Address Elimination of architectural limitations Coded key frames (not lossless) No original frames for decoder request control No original frames at decoder for correlation noise modeling Efficient exploitation of temporal correlation at encoder by controlling the GOP size. Improvement of the accuracy of the side information interpolation/extrapolation. Improvement of the accuracy of correlation noise estimation at decoder. Elimination or reduction of feedback-channel usage through encoder or hybrid rate control. 0
21 Encoder Modules: Adaptive GOP Size To better exploit the temporal redundancy in the video, the encoder performs GOP length selection depending on the motion activity in the sequence: High motion low correlation smaller GOP sizes Low motion high correlation longer GOPs sizes
22 Encoder Modules: Adaptive GOP Size To perform GOP size control, it is proposed to: Measure at the encoder the amount of motion in a video sequence using adequate (low complexity) metrics. Perform hierarchical clustering of motion activity data - group frames which accumulate less motion using four (simple) motion activity metrics.
23 Enc. Modules: Transform and Quantization Transform: Wyner-Ziv frames are transformed using a 4 4 Discrete Cosine Transform, the one from H.64/AVC, whose coefficients are organized in (4 4) 6 bands. Independent Quantization: Each DCT band is quantized separately using a predefined number of levels, depending on the target quality for the WZ frame. DC Quantization: A uniform scalar quantizer is used for the DC band, assuming the data range. AC Quantization: For AC bands, a dead-zone quantizer with doubled zero interval is applied. The dynamic data range is calculated separately for each bth band, b>, to be quantized, and transmitted to the decoder in the coded bit stream. Bitplane Coding: The quantization indices of each DCT band b are then organized in bitplanes and fed to the channel encoder.
24 Encoder Modules: Channel Coding Turbo Codes Turbo Encoder identical Recursive Systematic Convolutional (RSC) encoders. Pseudo-random interleaver. Puncturing for lower rates. Turbo Decoder Two Soft-Input Soft-Output (SISO) decoders. Maximum A Posteriori (MAP) algorithm. Laplacian distribution to model the X,Y correlation. LDPC (Low-Density Parity-Check) Codes LDPC Accumulate (LDPCA) codec as developed by D. Varodayan, et al. in Rate-Adaptive Codes for Distributed Source Coding, EURASIP Signal Processing Journal, Special Issue on Distributed Source Coding, pp. - 0, vol. 86, nº, Nov
25 Encoder Modules: Minimum Rate Estimator To reduce the number of requests to be made by the decoder (with a strong impact on the decoding complexity), the encoder can estimate a minimum number of accumulated syndromes to be sent per bitplane and per band. The DISCOVER codec solution is based on the Wyner-Ziv ratedistortion bound for two correlated Gaussian sources which defines the minimal rate at which one source (X) can be transmitted at a given distortion D X, to be, where σ is the variance of the correlation noise between the two sources, given that the second source (Y, the Side Information) is known perfectly at the decoder. A separate rate for each bitplane can be obtained by estimating the reduction of distortion brought by each bitplane with respect to previously decoded bitplanes (for each band). σ is a parameter of the noise correlation channel model, which is estimated at the decoder side and sent back to the encoder via the feedback-channel. 5
26 Encoder Modules: Encoder Rate Control The DISCOVER codec assumes Decoder Rate Control based on a feedback channel but In some applications, the feedback channel is not available. The feedback channel introduces delay in the system. So, it may be important to perform efficient Encoder Rate Control (ERC) for transform domain (TD) WZ video coding. 6
27 Encoder Modules: Encoder Rate Control An estimate of the SI frame is generated at the encoder using a lowcomplexity estimation technique (adjacent original key frames are used as input). The same 4x4 DCT transform is applied over the SI frame estimate and each DCT band is uniformly quantized. The conditional entropy B is computed for each bitplane. The relative error probability p between corresponding DCT band bitplanes of the SI and WZ frames is computed. The parity rate associated to each DCT band bitplane is computed as a function of p and H B. X b X f Ŷ Yˆ DCT X Y ˆ H H XY ˆ B X Y ˆ X DCT H B X Y ˆ Rˆ j Rˆ j B j B H j = N H e X Yˆ + p X Yˆ Yˆ X SI frame estimate WZ frame X DCT 7
28 The Clever Guy But opposite to conventional video coding, the decoder (not anymore the encoder!) is the KING 8
29 Decoder Modules: Side Information Creation Since the RD performance is highly dependent on the quality of the side information, it is essential to find efficient encoder and decoder tools to generate the highest quality side information. 9
30 Decoder Modules: Side Information Creation Trajectory-based Motion Interpolation: Hash-based Motion Estimation: 0
31 Dec. Modules: Correlation Noise Estimation Performing efficient decoder (online) correlation noise estimation for WZ video coding Is essential for a more realistic/practical PDWZ video coding scenario. Implies the dynamic estimation of the correlation noise distribution parameter assuming a Laplacian distribution. Targets to be as efficient as the offline estimation based on the original information.
32 Dec. Modules: Correlation Noise Estimation Correlation noise estimation for WZ video coding: Made at the decoder, based on the key frames realistic scenario. Exploits temporal correlation by using the motion compensated residual. Different spatial granularity levels may be used to achieve better adaptation to the correlation noise statistics: Frame level Block level Pixel level Motion compensated residual frame R Frame level? Yes Compute R frame variance Compute CN parameter at frame level as function of R frame variance Next frame No No Block level? Yes Compute block variance Compute CN parameter at block level as function of block variance Last R frame block? Yes Next frame No No Pixel level? Yes Compute CN parameter at pixel level Last R frame pixel? Yes Next frame
33 Dec. Modules: Request Stopping Criteria To establish if decoding is successful, the decoder convergence is tested by computing the syndrome check error, i.e. the Hamming distance between the received syndrome and the one generated using the decoded bitplane, followed by a cyclic redundancy check (CRC). If the Hamming distance is different from zero, then the decoder proceeds to the next iteration. After a certain amount of iterations ( 00), if the Hamming distance remains different from zero, then the bitplane is assumed to be erroneously decoded and the LDPCA decoder requests for more syndromes via the return channel. If the Hamming distance is equal to zero, then the successfulness of the decoding operation is verified using a 8-bit CRC sum. If the CRC sum computed on the decoded bitplane matches the value received from the encoder, the decoding is declared successful and the decoded bitplane is sent to the reconstruction module. Otherwise, the decoder requests more accumulated syndromes and thus a final low error probability is always guaranteed. If the compression factor comes to, no further requests are made since the code is invertible.
34 Decoder Modules: Reconstruction The decoded value is reconstructed in a mean squared erroroptimal way as the expectation of x given the decoded quantization index, q, and the side information value, y, this means. The calculation of this expectation value is performed using closed-form expressions derived for a Laplacian correlation model. Those frequency bands for which no information was transmitted from the encoder are taken directly from the Side Information. After that, the inverse 4x4 DCT transform is applied, and the whole WZ frame is restored in the pixel domain. 4
35 DISCOVER Performance 5
36 Test Conditions Frames: all frames this means 99 for Foreman, 9 for Hall Monitor, 99 for Coast Guard, and 99 for Soccer. Spatial resolution: QCIF. Temporal resolution: 5 Hz and 0 Hz which means 7.5 or 5 Hz for the WZ frames when GOP= is used. GOP length:, 4 and 8 (a) (b) (c) (d) (e) (f) (g) (h) 6
37 Evaluation Metrics Forward Channel Performance Evaluation Measuring the Overall Rate-Distortion Performance Measuring the Quality Evolution of WZ Decoded Frames Measuring the Bitplane Compression Factor Measuring the Decoded Quality Versus the Side Information Quality Feedback Channel Performance Evaluation Measuring the Number of Requests Measuring the Feedback Channel Rate Measuring the Number of Errors Versus the Number of Requests Measuring the Number of Requests Versus Side Information Quality Complexity Performance Evaluation Encoding Complexity Decoding Complexity 7
38 RD Performance (GOP ) QCIF, 5 Hz Coast Guard Soccer PSNR [db] 4 0 PSNR [db] DISCOVER H.64/AVC (Intra) H.6+ (Intra) H.64/AVC (No Motion) DISCOVER H.64/AVC (Intra) H.6+ (Intra) H.64/AVC (No Motion) Rate [kbps] Rate [kbps] Hall Monitor Foreman PSNR [db] DISCOVER H.64/AVC (Intra) 7 H.6+ (Intra) H.64/AVC (No Motion) Rate [kbps] PSNR [db] DISCOVER H.64/AVC (Intra) H.6+ (Intra) H.64/AVC (No Motion) Rate [kbps] 8
39 RD Performance (GOP,4,8) QCIF, 5 Hz 8 Coast Guard 9 Soccer PSNR [db] 0 PSNR [db] LDPC - GOP LDPC - GOP 4 LDPC - GOP LDPC - GOP LDPC - GOP 4 LDPC - GOP Rate [kbps] Rate [kbps] 4 Hall Monitor 4 Foreman 9 9 PSNR [db] LDPC - GOP LDPC - GOP 4 LDPC - GOP Rate [kbps] PSNR [db] Rate [kbps] LDPC - GOP LDPC - GOP 4 LDPC - GOP 8 9
40 LDPC versus Turbo Codes PSNR [db] Coast Guard Rate [kbps] LDPC - GOP TC - GOP LDPC - GOP 4 TC - GOP 4 LDPC - GOP 8 TC - GOP 8 PSNR [db] Soccer Rate [kbps] LDPC - GOP TC - GOP LDPC - GOP 4 TC - GOP 4 LDPC - GOP 8 TC - GOP Hall Monitor Foreman PSNR [db] Rate [kbps] LDPC - GOP TC - GOP LDPC - GOP 4 TC - GOP 4 LDPC - GOP 8 TC - GOP 8 PSNR [db] Rate [kbps] LDPC - GOP TC - GOP LDPC - GOP 4 TC - GOP 4 LDPC - GOP 8 TC - GOP 8 40
41 Bitplane Compression Factor (Qi 4) Compression Factor (Qi=8) 40 5 Coastguard Foreman Compression Factor DC AC AC AC AC4 AC5 AC6 AC7 AC8 AC9 AC0 AC AC AC AC Bitplane Number Compression Factor (Qi=8) 40 Hall Monitor Soccer 5 Compression Factor DC AC AC AC AC4 AC5 AC6 AC7 AC8 AC9 AC0 AC AC AC AC Bitplane Number 4
42 Number of Requests (Qi 8) Number of Requests (Qi=8) 5 Coastguard Foreman Number of Requests DC AC AC AC AC4 AC5 AC6 AC7 AC8 AC9 AC0 AC AC AC AC Bitplane Number Number of Requests (Qi=8) 5 Hall Monitor Soccer Number of Requests DC AC AC AC AC4 AC5 AC6 AC7 AC8 AC9 AC0 AC AC AC AC Bitplane Number 4
43 Encoding Complexity (GOP ) DISCOVER (WZ Frames) H.64/AVC (Intra) DISCOVER (Key Frames) H.64/AVC (No Motion) Coast Guard DISCOVER (WZ Frames) H.64/AVC (Intra) DISCOVER (Key Frames) H.64/AVC (No Motion) Soccer Time (sec) Time (sec) Qi Qi DISCOVER (WZ Frames) H.64/AVC (Intra) DISCOVER (Key Frames) H.64/AVC (No Motion) Hall Monitor DISCOVER (WZ Frames) H.64/AVC (Intra) DISCOVER (Key Frames) H.64/AVC (No Motion) Foreman 40 Time (sec) 0 0 Time (sec) Qi Qi 4
44 Decoding Complexity (GOP ) DISCOVER (WZ Frames) H.64/AVC (Intra) Coast Guard DISCOVER (Key Frames) H.64/AVC (No Motion) DISCOVER (WZ Frames) H.64/AVC (Intra) Soccer DISCOVER (Key Frames) H.64/AVC (No Motion) Time (sec) Time (sec) Qi Qi 400 Hall Monitor 500 Foreman 00 DISCOVER (WZ Frames) H.64/AVC (Intra) DISCOVER (Key Frames) H.64/AVC (No Motion) 000 DISCOVER (WZ Frames) H.64/AVC (Intra) DISCOVER (Key Frames) H.64/AVC (No Motion) Time (sec) Time (sec) Qi Qi 44
45 Performance Conclusions In terms of RD performance, the DISCOVER codec already wins against the H.64/AVC Intra codec, for most test sequences, and for GOP=; for more quiet sequences, the DISCOVER codec already wins against the H.64/AVC No Motion codec. For longer GOP sizes, winning against H.64/AVC Intra is more difficult highlighting the importance and difficulty of side information, notably when key frames are farther away. The total bitrate for the feedback channel is rather low but the feedback adds delay and requires a real-time setup. DISCOVER encoding complexity is always much lower than the H.64/AVC Intra encoding complexity, even for GOP= where it performs better in terms of RD performance. 45
46 DISCOVER (the) Future 46
47 Main Conclusions Since the DISCOVER monoview codec performs better than H.64/AVC Intra for GOP=, for most sequences, this highlights that Wyner-Ziv is already a credible coding solution when encoding complexity is a very critical requirement (even if at the cost of some additional decoding complexity). The results achieved during the lifetime of DISCOVER allowed to improve the compression performance of monoview WZ codecs but it is clear that much research is still to be made to approach the theoretical limits Further research should address side information creation, correlation noise modeling, channel codes, rate control, reconstruction, WZ selective coding, etc 47
48 DISCOVER for the World The DISCOVER Codec may be downloaded at The executable codec, along with sample configuration and test files, can be downloaded for: Windows Linux/-bit Linux/64-bit An overview paper and a detailed performance evaluation with precise test conditions are also available. 48
49 Main References General J. Ascenso, C. Brites, and F. Pereira, Improving frame interpolation with spatial motion smoothing for pixel domain distributed video coding, in Proc. 5th EURASIP Conf. Speech Image Processing, Multimedia Commun. Services, Smolenice, Slovak Republic, July 005. J. Ascenso, C. Brites, F. Pereira, Content adaptive Wyner-Ziv video coding driven by motion activity, IEEE International Conference on Image Processing, Atlanta, USA, October 8-, 006. X. Artigas, J. Ascenso, M. Dalai, S. Klomp, D. Kubasov, M. Ouaret, The DISCOVER codec: architecture, techniques and evaluation, Picture Coding Symposium, Lisboa, Portugal, November 007. C. Guillemot, F. Pereira, L. Torres, T. Ebrahimi, R. Leonardi, J. Ostermann, Distributed monoview and multiview video coding, IEEE Signal Processing Magazine, vol. 4, nº 5, pp , September 007. Codec J. Ascenso, C. Brites, F. Pereira, "Content adaptive Wyner-Ziv video coding driven by motion activity", IEEE International Conference on Image Processing, Atlanta, USA, October 006. J. Ascenso, F.Pereira, "Adaptive hash based side information exploitation for efficient Wyner-Ziv video coding", IEEE International Conference on Image Processing, San Antonio, USA, September 007. Encoder C. Brites, F. Pereira, Encoder rate control for transform domain Wyner-Ziv Video coding, IEEE International Conference on Image Processing, San Antonio, Texas, USA, September 007. D. Kubasov, K. Lajnef, and C. Guillemot, A hybrid encoder/decoder rate control for a Wyner-Ziv video codec with a feedback channel, IEEE Multimedia Signal Processing Workshop, MMSP, Chania, Crete, Greece, Oct Decoder C. Brites, J. Ascenso, F. Pereira, Modeling correlation noise statistics at decoder for pixel based Wyner-Ziv video coding, Picture Coding Symposium, Beijing, China, April 006. C. Brites, J. Ascenso, F. Pereira, Studying temporal correlation noise modeling for pixel based Wyner-Ziv video coding, IEEE International Conference on Image Processing, Atlanta, USA, October 006 D. Kubasov, J. Nayak, C. Guillemot, Optimal reconstruction in Wyner-Ziv video coding with multiple side information, IEEE MultiMedia Signal Processing Workshop, Chania, Crete, October. -,
50 IST DISCOVER Team Thanks for your attention! More information at 50
Transform-domain Wyner-Ziv Codec for Video
Transform-domain Wyner-Ziv Codec for Video Anne Aaron, Shantanu Rane, Eric Setton, and Bernd Girod Information Systems Laboratory, Department of Electrical Engineering Stanford University 350 Serra Mall,
DISTRIBUTED VIDEO CODING: CODEC ARCHITECTURE AND IMPLEMENTATION
Signal & Image Processing : An International Journal(SIPIJ) Vol.2, No., March 20 DISTRIBUTED VIDEO CODING: CODEC ARCHITECTURE AND IMPLEMENTATION Vijay Kumar Kodavalla and Dr. P.G. Krishna Mohan 2 Semiconductor
Alberto Corrales-García, Rafael Rodríguez-Sánchez, José Luis Martínez, Gerardo Fernández-Escribano, José M. Claver and José Luis Sánchez
Alberto Corrales-García, Rafael Rodríguez-Sánchez, José Luis artínez, Gerardo Fernández-Escribano, José. Claver and José Luis Sánchez 1. Introduction 2. Technical Background 3. Proposed DVC to H.264/AVC
Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)
Quality Estimation for Scalable Video Codec Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Purpose of scalable video coding Multiple video streams are needed for heterogeneous
Multi-Hypothesis based Distributed Video Coding using LDPC Codes
Multi-Hypothesis based Distributed Video Coding using LDPC Codes Kiran Misra, Shirish Karande, Hayder Radha Department of Electrical and Computer Engineering 2120, Engineering Building Michigan State University
Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)
Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- [email protected] Objective: A study, implementation and comparison of
Study and Implementation of Video Compression standards (H.264/AVC, Dirac)
Study and Implementation of Video Compression standards (H.264/AVC, Dirac) EE 5359-Multimedia Processing- Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) Objective A study, implementation and comparison
Performance Analysis and Comparison of JM 15.1 and Intel IPP H.264 Encoder and Decoder
Performance Analysis and Comparison of 15.1 and H.264 Encoder and Decoder K.V.Suchethan Swaroop and K.R.Rao, IEEE Fellow Department of Electrical Engineering, University of Texas at Arlington Arlington,
Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm
Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm Nandakishore Ramaswamy Qualcomm Inc 5775 Morehouse Dr, Sam Diego, CA 92122. USA [email protected] K.
We are presenting a wavelet based video conferencing system. Openphone. Dirac Wavelet based video codec
Investigating Wavelet Based Video Conferencing System Team Members: o AhtshamAli Ali o Adnan Ahmed (in Newzealand for grad studies) o Adil Nazir (starting MS at LUMS now) o Waseem Khan o Farah Parvaiz
Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected]
Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected] Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different
Parametric Comparison of H.264 with Existing Video Standards
Parametric Comparison of H.264 with Existing Video Standards Sumit Bhardwaj Department of Electronics and Communication Engineering Amity School of Engineering, Noida, Uttar Pradesh,INDIA Jyoti Bhardwaj
Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet
DICTA2002: Digital Image Computing Techniques and Applications, 21--22 January 2002, Melbourne, Australia Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet K. Ramkishor James. P. Mammen
JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions
Edith Cowan University Research Online ECU Publications Pre. JPEG compression of monochrome D-barcode images using DCT coefficient distributions Keng Teong Tan Hong Kong Baptist University Douglas Chai
Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur
Module 8 VIDEO CODING STANDARDS Version ECE IIT, Kharagpur Lesson H. andh.3 Standards Version ECE IIT, Kharagpur Lesson Objectives At the end of this lesson the students should be able to :. State the
Figure 1: Relation between codec, data containers and compression algorithms.
Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order
Distributed Video Coding: Trends and Perspectives
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Distributed Video Coding: Trends and Perspectives Frederic Dufaux, Wen Gao, Stefano Tubaro, Anthony Vetro TR2010-026 June 2010 Abstract This
Introduction to image coding
Introduction to image coding Image coding aims at reducing amount of data required for image representation, storage or transmission. This is achieved by removing redundant data from an image, i.e. by
Image Compression through DCT and Huffman Coding Technique
International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul
Video compression: Performance of available codec software
Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes
Gaming as a Service. Prof. Victor C.M. Leung. The University of British Columbia, Canada www.ece.ubc.ca/~vleung
Gaming as a Service Prof. Victor C.M. Leung The University of British Columbia, Canada www.ece.ubc.ca/~vleung International Conference on Computing, Networking and Communications 4 February, 2014 Outline
Fast Hybrid Simulation for Accurate Decoded Video Quality Assessment on MPSoC Platforms with Resource Constraints
Fast Hybrid Simulation for Accurate Decoded Video Quality Assessment on MPSoC Platforms with Resource Constraints Deepak Gangadharan and Roger Zimmermann Department of Computer Science, National University
A comprehensive survey on various ETC techniques for secure Data transmission
A comprehensive survey on various ETC techniques for secure Data transmission Shaikh Nasreen 1, Prof. Suchita Wankhade 2 1, 2 Department of Computer Engineering 1, 2 Trinity College of Engineering and
WHITE PAPER. H.264/AVC Encode Technology V0.8.0
WHITE PAPER H.264/AVC Encode Technology V0.8.0 H.264/AVC Standard Overview H.264/AVC standard was published by the JVT group, which was co-founded by ITU-T VCEG and ISO/IEC MPEG, in 2003. By adopting new
Complexity-rate-distortion Evaluation of Video Encoding for Cloud Media Computing
Complexity-rate-distortion Evaluation of Video Encoding for Cloud Media Computing Ming Yang, Jianfei Cai, Yonggang Wen and Chuan Heng Foh School of Computer Engineering, Nanyang Technological University,
How To Improve Performance Of The H264 Video Codec On A Video Card With A Motion Estimation Algorithm
Implementation of H.264 Video Codec for Block Matching Algorithms Vivek Sinha 1, Dr. K. S. Geetha 2 1 Student of Master of Technology, Communication Systems, Department of ECE, R.V. College of Engineering,
Internet Video Streaming and Cloud-based Multimedia Applications. Outline
Internet Video Streaming and Cloud-based Multimedia Applications Yifeng He, [email protected] Ling Guan, [email protected] 1 Outline Internet video streaming Overview Video coding Approaches for video
Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected]
Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected] Yao Wang, 2003 EE4414: Video Coding Standards 2 Outline Overview of Standards and Their Applications ITU-T
Video-Conferencing System
Video-Conferencing System Evan Broder and C. Christoher Post Introductory Digital Systems Laboratory November 2, 2007 Abstract The goal of this project is to create a video/audio conferencing system. Video
Multimedia Data Transmission over Wired/Wireless Networks
Multimedia Data Transmission over Wired/Wireless Networks Bharat Bhargava Gang Ding, Xiaoxin Wu, Mohamed Hefeeda, Halima Ghafoor Purdue University Website: http://www.cs.purdue.edu/homes/bb E-mail: [email protected]
Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network
Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network Xiaoan Lu, David Goodman, Yao Wang, and Elza Erkip Electrical and Computer Engineering, Polytechnic University, Brooklyn,
For Articulation Purpose Only
E305 Digital Audio and Video (4 Modular Credits) This document addresses the content related abilities, with reference to the module. Abilities of thinking, learning, problem solving, team work, communication,
Video Codec Requirements and Evaluation Methodology
-47pt -30pt :white Font : edium t Video Codec Requirements and Evaluation Methodology www.huawei.com draft-filippov-netvc-requirements-02 Alexey Filippov, Jose Alvarez (Huawei Technologies) Contents An
302 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 2, FEBRUARY 2009
302 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 2, FEBRUARY 2009 Transactions Letters Fast Inter-Mode Decision in an H.264/AVC Encoder Using Mode and Lagrangian Cost Correlation
DTU FOTONIK. Ann Ukhanova, Lars Dittmann and Søren Forchhammer. Technical University of Denmark
DTU FOTONIK Ann Ukhanova, Lars Dittmann and Søren Forchhammer Technical University of Denmark TECHNICAL UNIVERSITY of DENMARK DTU Established Nov. 5 1829 by H.C. Ørsted DTU today: 4500 employees in 18
TRANSPARENT ENCRYPTION FOR HEVC USING BIT-STREAM-BASED SELECTIVE COEFFICIENT SIGN ENCRYPTION. Heinz Hofbauer Andreas Uhl Andreas Unterweger
TRANSPARENT ENCRYPTION FOR HEVC USING BIT-STREAM-BASED SELECTIVE COEFFICIENT SIGN ENCRYPTION Heinz Hofbauer Andreas Uhl Andreas Unterweger University of Salzburg, Jakob Haringer Str. 2, 2 Salzburg, Austria
Multiple Description Coding (MDC) and Scalable Coding (SC) for Multimedia
Multiple Description Coding (MDC) and Scalable Coding (SC) for Multimedia Gürkan Gür PhD. Candidate e-mail: [email protected] Dept. Of Computer Eng. Boğaziçi University Istanbul/TR ( Currenty@UNITN)
CODING MODE DECISION ALGORITHM FOR BINARY DESCRIPTOR CODING
CODING MODE DECISION ALGORITHM FOR BINARY DESCRIPTOR CODING Pedro Monteiro and João Ascenso Instituto Superior Técnico - Instituto de Telecomunicações ABSTRACT In visual sensor networks, local feature
http://www.springer.com/0-387-23402-0
http://www.springer.com/0-387-23402-0 Chapter 2 VISUAL DATA FORMATS 1. Image and Video Data Digital visual data is usually organised in rectangular arrays denoted as frames, the elements of these arrays
Overview: Video Coding Standards
Overview: Video Coding Standards Video coding standards: applications and common structure Relevant standards organizations ITU-T Rec. H.261 ITU-T Rec. H.263 ISO/IEC MPEG-1 ISO/IEC MPEG-2 ISO/IEC MPEG-4
THE EMERGING JVT/H.26L VIDEO CODING STANDARD
THE EMERGING JVT/H.26L VIDEO CODING STANDARD H. Schwarz and T. Wiegand Heinrich Hertz Institute, Germany ABSTRACT JVT/H.26L is a current project of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC
Region of Interest Access with Three-Dimensional SBHP Algorithm CIPR Technical Report TR-2006-1
Region of Interest Access with Three-Dimensional SBHP Algorithm CIPR Technical Report TR-2006-1 Ying Liu and William A. Pearlman January 2006 Center for Image Processing Research Rensselaer Polytechnic
MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music
ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final
White paper. H.264 video compression standard. New possibilities within video surveillance.
White paper H.264 video compression standard. New possibilities within video surveillance. Table of contents 1. Introduction 3 2. Development of H.264 3 3. How video compression works 4 4. H.264 profiles
Performance Evaluation of VoIP Services using Different CODECs over a UMTS Network
Performance Evaluation of VoIP Services using Different CODECs over a UMTS Network Jianguo Cao School of Electrical and Computer Engineering RMIT University Melbourne, VIC 3000 Australia Email: [email protected]
H 261. Video Compression 1: H 261 Multimedia Systems (Module 4 Lesson 2) H 261 Coding Basics. Sources: Summary:
Video Compression : 6 Multimedia Systems (Module Lesson ) Summary: 6 Coding Compress color motion video into a low-rate bit stream at following resolutions: QCIF (76 x ) CIF ( x 88) Inter and Intra Frame
MULTI-STREAM VOICE OVER IP USING PACKET PATH DIVERSITY
MULTI-STREAM VOICE OVER IP USING PACKET PATH DIVERSITY Yi J. Liang, Eckehard G. Steinbach, and Bernd Girod Information Systems Laboratory, Department of Electrical Engineering Stanford University, Stanford,
An Efficient Compression of Strongly Encrypted Images using Error Prediction, AES and Run Length Coding
An Efficient Compression of Strongly Encrypted Images using Error Prediction, AES and Run Length Coding Stebin Sunny 1, Chinju Jacob 2, Justin Jose T 3 1 Final Year M. Tech. (Cyber Security), KMP College
Appendix C GSM System and Modulation Description
C1 Appendix C GSM System and Modulation Description C1. Parameters included in the modelling In the modelling the number of mobiles and their positioning with respect to the wired device needs to be taken
Multidimensional Transcoding for Adaptive Video Streaming
Multidimensional Transcoding for Adaptive Video Streaming Jens Brandt, Lars Wolf Institut für Betriebssystem und Rechnerverbund Technische Universität Braunschweig Germany NOSSDAV 2007, June 4-5 Jens Brandt,
Video Coding Technologies and Standards: Now and Beyond
Hitachi Review Vol. 55 (Mar. 2006) 11 Video Coding Technologies and Standards: Now and Beyond Tomokazu Murakami Hiroaki Ito Muneaki Yamaguchi Yuichiro Nakaya, Ph.D. OVERVIEW: Video coding technology compresses
How To Improve Performance Of H.264/Avc With High Efficiency Video Coding (Hevc)
Evaluation of performance and complexity comparison for coding standards HEVC vs. H.264/AVC Zoran M. Milicevic and Zoran S. Bojkovic Abstract In order to compare the performance and complexity without
Michael W. Marcellin and Ala Bilgin
JPEG2000: HIGHLY SCALABLE IMAGE COMPRESSION Michael W. Marcellin and Ala Bilgin Department of Electrical and Computer Engineering, The University of Arizona, Tucson, AZ 85721. {mwm,bilgin}@ece.arizona.edu
Video Encryption Exploiting Non-Standard 3D Data Arrangements. Stefan A. Kramatsch, Herbert Stögner, and Andreas Uhl [email protected].
Video Encryption Exploiting Non-Standard 3D Data Arrangements Stefan A. Kramatsch, Herbert Stögner, and Andreas Uhl [email protected] Andreas Uhl 1 Carinthia Tech Institute & Salzburg University Outline
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference Jue Wang, Michael F. Cohen Department of Electrical Engineering, University of Washington Microsoft Research Abstract Providing the best
Efficient Stream-Reassembling for Video Conferencing Applications using Tiles in HEVC
Efficient Stream-Reassembling for Video Conferencing Applications using Tiles in HEVC Christian Feldmann Institut für Nachrichtentechnik RWTH Aachen University Aachen, Germany [email protected]
Efficient Motion Estimation by Fast Three Step Search Algorithms
Efficient Motion Estimation by Fast Three Step Search Algorithms Namrata Verma 1, Tejeshwari Sahu 2, Pallavi Sahu 3 Assistant professor, Dept. of Electronics & Telecommunication Engineering, BIT Raipur,
Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.
Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet
WATERMARKING FOR IMAGE AUTHENTICATION
WATERMARKING FOR IMAGE AUTHENTICATION Min Wu Bede Liu Department of Electrical Engineering Princeton University, Princeton, NJ 08544, USA Fax: +1-609-258-3745 {minwu, liu}@ee.princeton.edu ABSTRACT A data
Overview of the Scalable Video Coding Extension of the H.264/AVC Standard
To appear in IEEE Transactions on Circuits and Systems for Video Technology, September 2007. 1 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard Heiko Schwarz, Detlev Marpe, Member,
Redundant Wavelet Transform Based Image Super Resolution
Redundant Wavelet Transform Based Image Super Resolution Arti Sharma, Prof. Preety D Swami Department of Electronics &Telecommunication Samrat Ashok Technological Institute Vidisha Department of Electronics
X264: A HIGH PERFORMANCE H.264/AVC ENCODER. Loren Merritt and Rahul Vanam*
X264: A HIGH PERFORMANCE H.264/AVC ENCODER Loren Merritt and Rahul Vanam* In Preparation *Dept. of Electrical Engineering, University of Washington, Seattle, WA 98195-2500 Email: {lorenm, rahulv}@u.washington.edu
View Sequence Coding using Warping-based Image Alignment for Multi-view Video
View Sequence Coding using Warping-based mage Alignment for Multi-view Video Yanwei Liu, Qingming Huang,, Wen Gao 3 nstitute of Computing Technology, Chinese Academy of Science, Beijing, China Graduate
IMPACT OF COMPRESSION ON THE VIDEO QUALITY
IMPACT OF COMPRESSION ON THE VIDEO QUALITY Miroslav UHRINA 1, Jan HLUBIK 1, Martin VACULIK 1 1 Department Department of Telecommunications and Multimedia, Faculty of Electrical Engineering, University
Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding
JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 13, NO. 2, JUNE 2015 97 Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding Wei-Hsiang Chang, Mei-Juan Chen, Gwo-Long
Image Authentication Scheme using Digital Signature and Digital Watermarking
www..org 59 Image Authentication Scheme using Digital Signature and Digital Watermarking Seyed Mohammad Mousavi Industrial Management Institute, Tehran, Iran Abstract Usual digital signature schemes for
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXIV-5/W10
Accurate 3D information extraction from large-scale data compressed image and the study of the optimum stereo imaging method Riichi NAGURA *, * Kanagawa Institute of Technology [email protected]
Development and Evaluation of Point Cloud Compression for the Point Cloud Library
Development and Evaluation of Point Cloud Compression for the Institute for Media Technology, TUM, Germany May 12, 2011 Motivation Point Cloud Stream Compression Network Point Cloud Stream Decompression
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com
Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics:
Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional
Introduction to Medical Image Compression Using Wavelet Transform
National Taiwan University Graduate Institute of Communication Engineering Time Frequency Analysis and Wavelet Transform Term Paper Introduction to Medical Image Compression Using Wavelet Transform 李 自
Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc
(International Journal of Computer Science & Management Studies) Vol. 17, Issue 01 Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc Dr. Khalid Hamid Bilal Khartoum, Sudan [email protected]
Combating Anti-forensics of Jpeg Compression
IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 6, No 3, November 212 ISSN (Online): 1694-814 www.ijcsi.org 454 Combating Anti-forensics of Jpeg Compression Zhenxing Qian 1, Xinpeng
Quality Optimal Policy for H.264 Scalable Video Scheduling in Broadband Multimedia Wireless Networks
Quality Optimal Policy for H.264 Scalable Video Scheduling in Broadband Multimedia Wireless Networks Vamseedhar R. Reddyvari Electrical Engineering Indian Institute of Technology Kanpur Email: [email protected]
High Quality Image Magnification using Cross-Scale Self-Similarity
High Quality Image Magnification using Cross-Scale Self-Similarity André Gooßen 1, Arne Ehlers 1, Thomas Pralow 2, Rolf-Rainer Grigat 1 1 Vision Systems, Hamburg University of Technology, D-21079 Hamburg
Using AVC/H.264 and H.265 expertise to boost MPEG-2 efficiency and make the 6-in-6 concept a reality
Using AVC/H.264 and H.265 expertise to boost MPEG-2 efficiency and make the 6-in-6 concept a reality A Technical Paper prepared for the Society of Cable Telecommunications Engineers By Anais Painchault
Proactive Video Assurance through QoE and QoS Correlation
A Complete Approach for Quality and Service Assurance W H I T E P A P E R Introduction Video service providers implement new technologies to maximize the quality and diversity of their entertainment program
VoIP Shim for RTP Payload Formats
PITALS 50 pt 32 pt VoIP Shim for RTP Payload Formats draft-johansson-avt-rtp-shim Ingemar Johansson, Ericsson AB Outline MTSI in 3GPP Voice service requirements Problems with RTCP Why is inband signaling
H.264/MPEG-4 AVC Video Compression Tutorial
Introduction The upcoming H.264/MPEG-4 AVC video compression standard promises a significant improvement over all previous video compression standards. In terms of coding efficiency, the new standard is
Classes of multimedia Applications
Classes of multimedia Applications Streaming Stored Audio and Video Streaming Live Audio and Video Real-Time Interactive Audio and Video Others Class: Streaming Stored Audio and Video The multimedia content
Understanding Compression Technologies for HD and Megapixel Surveillance
When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance
SteganographyinaVideoConferencingSystem? AndreasWestfeld1andGrittaWolf2 2InstituteforOperatingSystems,DatabasesandComputerNetworks 1InstituteforTheoreticalComputerScience DresdenUniversityofTechnology
