The H.264/MPEG-4 Advanced Video Coding (AVC) Standard




International Telecommunication Union
The H.264/MPEG-4 Advanced Video Coding (AVC) Standard
Gary J. Sullivan, Ph.D.
ITU-T VCEG Rapporteur | Chair
ISO/IEC MPEG Video Rapporteur | Co-Chair
ITU/ISO/IEC JVT Rapporteur | Co-Chair
July 2005
ITU-T VICA Workshop, 22-23 July 2005, ITU Headquarters, Geneva

The Advanced Video Coding Project
- AVC = ITU-T H.264 / MPEG-4 part 10
- History: ITU-T Q.6/SG16 (VCEG, Video Coding Experts Group) "H.26L" standardization activity (where the "L" stood for "long-term")
- August 1999: 1st test model (TML-1)
- July 2001: MPEG open call for technology; H.26L demo'ed
- December 2001: Formation of the Joint Video Team (JVT) between VCEG and MPEG to finalize H.26L as a new joint project (similar to MPEG-2/H.262)
- July 2002: Final Committee Draft status in MPEG
- Dec '02: technical freeze, FCD ballot approved
- May '03: completed in both orgs
- July '04: Fidelity Range Extensions (FRExt) completed
- January '05: Scalable Video Coding launched
H.264/AVC, July 05, Gary Sullivan

AVC Objectives
- Primary technical objectives:
  - Significant improvement in coding efficiency
  - High loss/error robustness
  - "Network friendliness" (carry it well on MPEG-2 or RTP or H.32x or in MPEG-4 file format or MPEG-4 systems or ...)
  - Low latency capability (better quality for higher latency)
  - Exact match decoding
- Additional version 2 objectives (in FRExt):
  - Professional applications (more than 8 bits per sample, 4:4:4 color sampling, etc.)
  - Higher-quality high-resolution video
  - Alpha plane support (a degree of object functionality)

Relating to Other ITU & MPEG Standards
- Same design to be approved in both ITU-T VCEG and ISO/IEC MPEG
- In ITU-T VCEG this is a new & separate standard
  - ITU-T Recommendation H.264
  - ITU-T Systems (H.32x) support it
- In ISO/IEC MPEG this is a new "part" in the MPEG-4 suite
  - Separate codec design from prior MPEG-4 visual
  - New part 10 called "Advanced Video Coding" (AVC; similar to "AAC" position in MPEG-2 as separate codec)
  - Not backward or forward compatible with prior standards (incl. the prior MPEG-4 visual spec.; core technology is different)
  - MPEG-4 Systems / File Format supports it
- H.222.0 | MPEG-2 Systems also supports it

A Comparison of Performance
- Test of different standards (ICIP 2002 study)
- Using same rate-distortion optimization techniques for all codecs
- "Streaming" test: high-latency (included B frames)
  - Four QCIF sequences coded at 10 Hz and 15 Hz (Foreman, Container, News, Tempete)
  - Four CIF sequences coded at 15 Hz and 30 Hz (Bus, Flower Garden, Mobile and Calendar, and Tempete)
- "Real-time conversation" test: no B frames
  - Four QCIF sequences encoded at 10 Hz and 15 Hz (Akiyo, Foreman, Mother and Daughter, and Silent Voice)
  - Four CIF sequences encoded at 15 Hz and 30 Hz (Carphone, Foreman, Paris, and Sean)
- Compare four codecs using PSNR measure:
  - MPEG-2 (in high-latency/streaming test only)
  - H.263 (high-latency profile, conversational high-compression profile, baseline profile)
  - MPEG-4 Visual (simple and advanced simple profiles with & without B pictures)
  - H.264/AVC (with & without B pictures)
- Note: These test results are from a private study and are not an endorsed report of the JVT, VCEG or MPEG organizations.
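
The comparison above relies on the PSNR measure, reported for the luma (Y) component. The following is a minimal sketch of how Y-PSNR is commonly computed for 8-bit video; it is not taken from the study itself, and the function name `y_psnr` and the list-of-rows plane layout are assumptions made for illustration.

```python
import math


def y_psnr(reference, decoded, peak=255):
    """Return Y-PSNR in dB between two equally sized luma planes.

    Both planes are lists of rows of integer samples. `peak` is the
    largest possible sample value (255 for 8-bit video).
    """
    squared_error = 0
    count = 0
    for ref_row, dec_row in zip(reference, decoded):
        for r, d in zip(ref_row, dec_row):
            squared_error += (r - d) ** 2
            count += 1
    if squared_error == 0:
        return math.inf  # identical planes: PSNR is unbounded
    mse = squared_error / count
    return 10.0 * math.log10(peak * peak / mse)


# Tiny example: two of four samples are off by one, so the MSE is 0.5.
ref = [[100, 100], [100, 100]]
dec = [[101, 99], [100, 100]]
print(round(y_psnr(ref, dec), 2))
```

For a sequence, the per-frame values are usually averaged; whether the study averaged PSNR or MSE across frames is not stated on the slide.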

Comparison to MPEG-2, H.263, MPEG-4p2
[Figure: Quality, Y-PSNR [dB] (27 to 39) versus bit-rate [kbit/s] (0 to 250), Foreman QCIF 10 Hz; curves for JVT/H.264/AVC, MPEG-4 Visual, MPEG-2, and H.263]

Comparison to MPEG-2, H.263, MPEG-4p2
[Figure: Quality, Y-PSNR [dB] (25 to 38) versus bit-rate [kbit/s] (0 to 3500), Tempete CIF 30 Hz; curves for JVT/H.264/AVC, MPEG-4 Visual, MPEG-2, and H.263]

Caution: Your Mileage Will Vary
- This encoding software may not represent implementation quality
- These tests only up to CIF (quarter-standard-definition) resolution
- Interlace, SDTV, and HDTV not tested in this test
- Test sequences may not be representative of the variety of content encountered by applications
- These tests so far not aligned with profile designs
- This study reports PSNR, but perceptual quality is what matters

Computing Resources for the New Design
- New design includes relaxation of traditional bounds on computing resources: rough guess 2-3x the MIPS, ROM & RAM requirements of MPEG-2 for decoding, 3-4x for encoding
- Particularly an issue for low-power (e.g., mobile) devices
- Problem areas:
  - Smaller block sizes for motion compensation (cache access issues)
  - Longer filters for motion compensation (more memory access)
  - Multi-frame motion compensation (more memory for reference frame storage)
  - In-loop deblocking filter (more processing & memory access)
  - More segmentations of macroblock to choose from (more searching in the encoder)
  - More methods of predicting intra data (more searching)
  - Arithmetic coding (adaptivity, computation on output bits)

AVC Structure
[Figure: encoder block diagram. Input video signal, split into macroblocks of 16x16 pixels. Blocks: coder control, transform/scaling/quantization, scaling & inverse transform, deblocking filter, intra-frame prediction, motion compensation, intra/inter switch, motion estimation, entropy coding. The embedded decoder produces the output video signal. Signals into entropy coding: control data, quantized transform coefficients, motion data.]

Motion Compensation Accuracy
[Figure: encoder block diagram, as on the AVC Structure slide]
- Macroblock (MB) types: 16x16, 16x8, 8x16, 8x8
- 8x8 types: 8x8, 8x4, 4x8, 4x4
- Motion vector accuracy 1/4 sample (6-tap filter)
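
The slide names a 6-tap filter but does not list its coefficients. In the published H.264 design the luma half-sample filter uses the taps (1, -5, 20, 20, -5, 1) with a divide by 32, and quarter-sample positions are rounded averages of neighboring full and half samples. The sketch below applies that to a single row of samples; the function names, the 1-D simplification, and the edge-replication rule are assumptions made for illustration.

```python
TAPS = (1, -5, 20, 20, -5, 1)  # luma half-sample filter taps; they sum to 32


def half_sample(samples, i):
    """Half-sample value between samples[i] and samples[i + 1] (1-D sketch)."""
    last = len(samples) - 1
    acc = 0
    for k, tap in enumerate(TAPS):
        pos = min(max(i - 2 + k, 0), last)  # replicate samples at the row edges
        acc += tap * samples[pos]
    # Round, divide by 32, and clip to the 8-bit sample range.
    return min(max((acc + 16) >> 5, 0), 255)


def quarter_sample(samples, i):
    """Quarter-sample value just right of samples[i]."""
    return (samples[i] + half_sample(samples, i) + 1) >> 1


row = [0, 0, 0, 0, 255, 255, 255, 255]
print(half_sample(row, 3), quarter_sample(row, 3))
```

A real decoder filters in two dimensions and keeps intermediate precision for the center half-sample position; this sketch only shows the horizontal case.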

Multiple Reference Frames
[Figure: encoder block diagram, as on the AVC Structure slide]
- Multiple reference frames
- Generalized B frames

Intra Prediction
[Figure: encoder block diagram, as on the AVC Structure slide]
- Directional spatial prediction (9 types for luma, 1 chroma)
- Neighboring samples Q, A to H (row above) and I to P (column to the left) predict the 4x4 block a to p:

  Q A B C D E F G H
  I a b c d
  J e f g h
  K i j k l
  L m n o p
  M
  N
  O
  P

- e.g., Mode 3: diagonal down/right prediction; a, f, k, p are predicted by (A + 2Q + I + 2) >> 2
[Figure: prediction directions, labeled with mode numbers]
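
The diagonal down/right example can be extended to the whole 4x4 block. The sketch below assumes the usual rule that every predicted sample is a (1, 2, 1) weighted average of three consecutive neighbors along the block edge, which reproduces the slide's formula (A + 2Q + I + 2) >> 2 on the main diagonal. The function name and argument layout are hypothetical.

```python
def predict_diagonal_down_right(top, left, corner):
    """Predict a 4x4 block from already reconstructed neighbors.

    top    -- samples A, B, C, D from the row above the block
    left   -- samples I, J, K, L from the column left of the block
    corner -- sample Q, above and to the left of the block
    """
    # Walk the neighbors as one edge: L, K, J, I, Q, A, B, C, D.
    edge = list(reversed(left)) + [corner] + list(top)
    block = []
    for y in range(4):
        row = []
        for x in range(4):
            i = 4 + x - y  # edge position reached by following the diagonal
            row.append((edge[i - 1] + 2 * edge[i] + edge[i + 1] + 2) >> 2)
        block.append(row)
    return block


pred = predict_diagonal_down_right(top=[10, 20, 30, 40], left=[50, 60, 70, 80], corner=0)
# Samples a, f, k, p lie on the main diagonal and share one value.
print([pred[n][n] for n in range(4)])
```

Samples above the main diagonal are predicted from the row above only, and samples below it from the left column only, so the block content is propagated along 45-degree lines.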

Transform Coding
[Figure: encoder block diagram, as on the AVC Structure slide]
- 4x4 block integer transform:

      [ 1   1   1   1 ]
  H = [ 2   1  -1  -2 ]
      [ 1  -1  -1   1 ]
      [ 1  -2   2  -1 ]

- Repeated transform of DC coeffs for 8x8 chroma and 16x16 Intra luma blocks
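
As a rough illustration of the 4x4 integer transform, the sketch below applies the core matrix H of the H.264 design to the rows and columns of a residual block (Y = H X H^T). The scaling that normally follows is folded into quantization and is left out here; the helper names are hypothetical.

```python
H = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]


def matmul(a, b):
    """Multiply two small integer matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(col) for col in zip(*m)]


def forward_transform(block):
    """Core 4x4 transform Y = H * X * H^T, using integer arithmetic only."""
    return matmul(matmul(H, block), transpose(H))


# A flat residual block puts all of its energy into the DC coefficient.
coeffs = forward_transform([[3] * 4 for _ in range(4)])
print(coeffs[0][0])  # 48
```

Because every entry of H is 1 or 2 in magnitude, the transform needs only additions and shifts, and encoder and decoder obtain bit-identical results, which matches the "exact match decoding" objective stated earlier in the deck.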

Deblocking Filter
[Figure: encoder block diagram, as on the AVC Structure slide]

Entropy Coding
[Figure: encoder block diagram. Blocks: coder control, transform/quantizer, dequantizer/inverse transform, intra/inter switch (0 for intra), motion-compensated predictor, motion estimator, entropy coding. Signals into entropy coding: control data, quantized transform coefficients, motion data.]

AVC Version 1 Profiles
- Three profiles in version 1: Baseline, Main, and Extended
- Baseline (esp. Videoconferencing & Wireless)
  - I and P progressive-scan picture coding (not B)
  - In-loop deblocking filter
  - 1/4-sample motion compensation
  - Tree-structured motion segmentation down to 4x4 block size
  - VLC-based entropy coding
  - Some enhanced error resilience features
    - Flexible macroblock ordering/arbitrary slice ordering
    - Redundant slices
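
The slide does not say which variable-length codes are meant by "VLC-based entropy coding". In the published standard, many syntax elements are written with unsigned Exp-Golomb codes, so a minimal sketch of that code family may help make the idea concrete. The function names and the use of strings of '0' and '1' characters are assumptions chosen for readability, not how a real bitstream writer works.

```python
def ue_encode(value):
    """Unsigned Exp-Golomb code word for a non-negative integer, as a bit string."""
    code = value + 1
    prefix_zeros = code.bit_length() - 1
    return "0" * prefix_zeros + format(code, "b")


def ue_decode(bits):
    """Decode one unsigned Exp-Golomb code word from the start of a bit string."""
    zeros = len(bits) - len(bits.lstrip("0"))
    return int(bits[zeros:2 * zeros + 1], 2) - 1


for v in range(4):
    print(v, ue_encode(v))
```

Small values get short code words (0 maps to a single bit), which suits syntax elements whose small values are the most frequent.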

Non-Baseline AVC Version 1 Profiles
- Main Profile (esp. Broadcast)
  - All Baseline features except enhanced error resilience features
  - Interlaced video handling
  - Generalized B pictures
  - Adaptive weighting for B and P picture prediction
  - CABAC (arithmetic entropy coding)
- Extended Profile (esp. Streaming)
  - All Baseline features
  - Interlaced video handling
  - Generalized B pictures
  - Adaptive weighting for B and P picture prediction
  - More error resilience: Data partitioning
  - SP/SI switching pictures

Amendment 1: Fidelity-Range Extensions
- AVC standard finished 2003
  - ITU-T/H.264 finalized May, 2003
  - MPEG-4 AVC finalized July, 2003 (same thing)
  - Only corrigenda (bug fixes) since then
- Fidelity-Range Extensions (FRExt)
  - New work item initiated in July 2003
  - More than 8 bits, color other than 4:2:0
  - Alpha coding
  - More coding efficiency capability
  - Also new supplemental information

FRExt Finished July '04
- Project initiated July 2003
- Motivations
  - Higher quality, higher rates
  - 4:4:4, 4:2:2, and also 4:2:0
  - 8, 10, or 12 bits (14 bits considered and not included)
  - Lossless
  - Stereo
- Finished in one year! (July '04)

New Things in FRExt Part 1
- Larger transforms
  - 8x8 transform (again!)
  - Drop 4x8, 8x4, or larger, 16-point
- Filtered intra prediction modes for 8x8 block size
- Quantization matrix
  - 4x4, 8x8, intra, inter trans. coefficients weighted differently
  - Old idea, dating to JPEG and before (circa 1986?)
  - Full capabilities not yet explored (visual weighting)
- Coding in various color spaces
  - 4:4:4, 4:2:2, 4:2:0, Monochrome, with/without Alpha
  - New integer color transform (a VUI-message item)

New Things in FRExt Part 2
- Efficient lossless interframe coding
- Film grain characterization for analysis/synthesis representation
- Stereo-view video support
- Deblocking filter display preference

8x8 16-Bit (Bossen) Transform

  [  8    8    8    8    8    8    8    8 ]
  [ 12   10    6    3   -3   -6  -10  -12 ]
  [  8    4   -4   -8   -8   -4    4    8 ]
  [ 10   -3  -12   -6    6   12    3  -10 ]
  [  8   -8   -8    8    8   -8   -8    8 ]
  [  6  -12    3   10  -10   -3   12   -6 ]
  [  4   -8    8   -4   -4    8   -8    4 ]
  [  3   -6   10  -12   12  -10    6   -3 ]
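
The 8x8 integer transform matrix of the H.264 High profiles, as far as it can be matched to this slide, has mutually orthogonal rows whose norms differ, so the normalization has to be absorbed by the scaling in quantization. The sketch below checks that property; the matrix values are taken from the published design rather than read off the slide, and the names `T8` and `gram` are hypothetical.

```python
T8 = [
    [8,   8,   8,   8,   8,   8,   8,   8],
    [12,  10,  6,   3,  -3,  -6, -10, -12],
    [8,   4,  -4,  -8,  -8,  -4,   4,   8],
    [10, -3, -12,  -6,   6,  12,   3, -10],
    [8,  -8,  -8,   8,   8,  -8,  -8,   8],
    [6, -12,   3,  10, -10,  -3,  12,  -6],
    [4,  -8,   8,  -4,  -4,   8,  -8,   4],
    [3,  -6,  10, -12,  12, -10,   6,  -3],
]


def gram(matrix):
    """Return matrix * matrix^T: the dot product of every pair of rows."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in matrix] for r1 in matrix]


g = gram(T8)
off_diagonal = [g[i][j] for i in range(8) for j in range(8) if i != j]
print(all(v == 0 for v in off_diagonal))  # True: rows are orthogonal
print([g[i][i] for i in range(8)])        # squared row norms
```

The squared row norms take only three distinct values (512, 578, 320), so a small table of scaling factors is enough to compensate for them.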

8x8 Transform Advantage (JVT-K02, IBBP coding, prog. scan)

  Sequence    % BD bit-rate reduction
  Movie 1     11.59
  Movie 2     12.71
  Movie 3     12.01
  Movie 4     11.06
  Movie 5     13.46
  Crawford    10.93
  Riverbed    15.65
  Average     12.48

Quantization Matrix
- Similar concept to MPEG-2 design
  - Vary step size based on frequency
  - Adapted to modified transform structure
  - More efficient representation of weights
- Eight downloadable matrices (at least 4:2:0)
  - Intra 4x4 Y, Cb, Cr
  - Intra 8x8 Y
  - Inter 4x4 Y, Cb, Cr
  - Inter 8x8 Y
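
"Vary step size based on frequency" can be pictured as scaling one base step size by a per-coefficient weight. The sketch below is a simplified illustration and not the normative integer arithmetic of the standard: it assumes that a weight of 16 means "flat" (no change to the step size), and the weight values and function name are hypothetical.

```python
FLAT_WEIGHT = 16  # assumed neutral weight: leaves the base step size unchanged


def quantize(coeffs, base_step, weights):
    """Quantize transform coefficients with a frequency-dependent step size."""
    levels = []
    for coeff_row, weight_row in zip(coeffs, weights):
        row = []
        for c, w in zip(coeff_row, weight_row):
            step = base_step * w / FLAT_WEIGHT
            magnitude = int(abs(c) / step + 0.5)  # round to nearest level
            row.append(magnitude if c >= 0 else -magnitude)
        levels.append(row)
    return levels


coeffs = [[64, 64], [64, 64]]
weights = [[16, 32], [32, 64]]  # coarser steps toward higher frequencies
print(quantize(coeffs, base_step=4, weights=weights))  # [[16, 8], [8, 4]]
```

With all weights equal to 16 the result is the same as plain uniform quantization, which corresponds to the flat matrices recommended later in the deck for PSNR testing.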

New Profiles Created by FRExt

  Format           Profile
  4:2:0, 8-bit     High (HP)
  4:2:0, 10-bit    High 10 (Hi10)
  4:2:2, 10-bit    High 4:2:2 (Hi422)
  4:4:4, 12-bit    High 4:4:4 (Hi444)

Effectively the same tools, but acting on different input data (with a couple of exceptions in the 4:4:4 profile)

Some Notes on Quality Testing
- Use appropriate High profile (incl. adaptive transform)
- If testing for PSNR, use flat quant matrices
- Otherwise, use non-flat quant matrices
- Use more than 1 or 2 reference pictures
- Use hierarchical reference frames coding structure
- Use CABAC entropy coding
- If testing high-quality PSNR, use adaptive quantization*
- Use rate-distortion optimization in encoder
- Use large-range good-quality motion search
* = See G. Sullivan & S. Sun, On Dead-Zone, VCIP 2005/JVT-N011

A Performance Test for High Profile (from JVT-L033 - Panasonic)
- Subjective tests by Blu-Ray Disk Founders of FRExt HP 4:2:0/8 (HP)
- 1920x1080x24p (1080p), 3 clips
- Nominal 3:1 advantage to MPEG-2
  - 8 Mbps HP scored better than 24 Mbps MPEG-2
  - Apparent transparency at 16 Mbps

Figure 1: Results of subjective test with studio participants (Blu-Ray Disk Founders)

  Condition                          Mean Opinion Score
  H.264/AVC FRExt 8 Mbps             3.65
  H.264/AVC FRExt 12 Mbps            3.71
  H.264/AVC FRExt 16 Mbps            4.00
  H.264/AVC FRExt 20 Mbps            3.90
  Original                           4.03
  MPEG-2 24 Mbps, DVHS emulation     3.59

Scale: 5: Perfect, 4: Good, 3: Fair (OK for DVD), 2: Poor, 1: Very Poor

For Further Information
- JVT, MPEG, and VCEG management team members:
  - Gary Sullivan (garysull@microsoft.com)
  - Jens Ohm (ohm@ient.rwth-aachen.de)
  - Ajay Luthra (aluthra@motorola.com)
  - Thomas Wiegand (wiegand@hhi.de)
- IEEE Transactions on Circuits and Systems for Video Technology, Special Issue on H.264/AVC (July 2003)
- Paper in Proceedings of IEEE, January 2005
- I. Richardson, H.264 and MPEG-4 Video Compression
- Overview including FRExt: SPIE, Aug 2004 (Sullivan, Topiwala, and Luthra)
- Paper at VCIP 2005: Meta-overview and deployment