New coding techniques, standardisation, and quality metrics
Touradj Ebrahimi
Touradj.Ebrahimi@epfl.ch
Background
Multimedia

- Means of expression aimed at being consumed by humans
- Modalities:
  - Audio (music, song, radio, ...)
  - Image (paintings, photos, press, ...)
  - Video (cinema, TV, ...)
- Closely linked to content creation
Digital revolution

- Representation of media in digital form:
  - Ease of manipulation by computers
  - Efficient transmission through various communication channels
  - Robustness to errors
  - Infinitely reproducible
  - Low cost
  - Easily searchable and accessible
- Moore's law (exponential growth):
  - Computing power
  - Storage
  - Communication channel bitrates
Moore's law

- The number of transistors on a chip doubles every 18 months
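The 18-month doubling rule above is easy to turn into arithmetic. A minimal sketch (the function name and starting count are illustrative, not from the slides):

```python
# Illustrative sketch: exponential growth under Moore's law,
# doubling every 18 months. Names and numbers are assumptions.
def moore_growth(start_count: float, years: float, doubling_months: float = 18.0) -> float:
    """Project a transistor count forward by `years`, doubling every `doubling_months`."""
    doublings = (years * 12.0) / doubling_months
    return start_count * (2.0 ** doublings)

# Over 3 years there are exactly two 18-month doubling periods: x4.
print(moore_growth(1_000_000, 3))  # 4000000.0
```

The same function describes any of the exponential trends on the slide (computing power, storage, channel bitrates) by changing the doubling period.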
Multimedia challenges

- Applications: compression as an enabler and essential tool in many applications
- Importance of international standards: seamless exchange of information
- Media security, ownership protection, privacy
- Information management
- Quality
- New information modalities
Digital Media Roadmap
Multimedia anywhere, anytime, accessible to everybody
In many forms, applications and services
Phase 1 (yesterday): Era of all digital

- World of acronyms:
  - CD (digital music)
  - DVD (digital video)
  - DAB (digital radio)
  - DVB (digital TV)
  - ISDN (digital phone)
  - GSM/UMTS (digital mobile phone)
- Media in bits!
- Essentially the same experience as before, but more efficiently deployed
Phase 2 (today): Era of multimedia

- Synthetic and natural media
- Natural (audiovisual) content can be acquired easily and cost-efficiently
- Content is no longer only the result of acquiring the real world; it also spans:
  - Virtual reality
  - Augmented reality
  - Mixed reality
Cyber glasses
Phase 2 (today): Era of multimedia

- New media, rich media
- New types of content (media) and experience start appearing thanks to the digital revolution
- Content is increasingly rich (associated metadata) and accessible
- Content democratisation (consumer as producer)
Phase 3 (tomorrow): Era of interface

- Moore's law does not apply to the interface between man and machine:
  - We have basically used the same interface devices for the last 50 years (keyboard, mouse, screen, ...)
  - The communication bitrate between man and machine remains very low
  - Interfaces remain unnatural
- New man-machine interfaces:
  - Computers/devices that can see and hear
  - Speech recognition
  - Gesture recognition
Phase 4 (day after tomorrow): Era of extended media and interface

- Multimedia beyond audiovisual:
  - Olfaction (smell)
  - Haptics (touch)
  - Gustation (taste)
- New interfaces:
  - Emotional computers
  - Brain-computer interface
Digital Media Roadmap

[Timeline, 1985-2020: Phase 1 (All digital), Phase 2 (Multimedia), Phase 3 (Interface), Phase 4 (Extended media and interface)]
Phase 5: Era of symbiosis

- Neural implants
- Brain co-processors
- Communication beyond the five senses
Phase 5: Era of symbiosis

Quote from Murray Gell-Mann, Nobel Prize laureate and founder of the Santa Fe Institute: "One day, for the good or for the bad of it, such interconnections will be possible. Human beings will be able to connect to computers, without need to go through the interface of spoken language or a display, and through these computers to one or more other human beings. There will be a total sharing of thoughts and feelings, without any of the limitations of the language."
Media Standards
Standards

- Communication convention (interoperability)
- ISO/IEC, ITU-T
- Increasingly important in the global village
- Often a long and tedious process
- Often not the best technology around by the time it is agreed upon
- IP issues
Why compression?

- Essential in most applications because:
  - Bandwidth is limited
  - Content is bandwidth hungry
- One of the most confusing and hyped components in any multimedia system
- Intimately linked to content representation
- Intimately linked to application requirements
Content representation

- Pixel-based
- Object-based
- Content-based
- Model-based
Application requirements

- Functionalities
- Compression efficiency
- Error resiliency
- Complexity
- Scalability
- Power consumption
- Delay
H.26x approach

- Compression efficiency is very important
- Other functionalities are also important, but not as much as compression efficiency
- A pixel-based approach is good enough to cover all requirements in today's and tomorrow's applications
MPEG-x approach

- Compression efficiency is important
- But other functionalities are just as important
- A pixel-based approach is not always enough to cover well all requirements in today's and tomorrow's applications
Media Coding Standards Evolution
Frame-based Video/Image Coding Standards up to AVC

- JPEG (1988): still pictures
- H.261 (1990): videotelephony
- MPEG-1 (1991): CD-ROM storage
- MPEG-2/H.262 (1993): digital TV
- H.263 (1995): video over PSTN and mobile networks
- MPEG-4 Visual (1998): all applications
- JPEG 2000 (2000): still pictures and video

Standards continuously buy quality and functionality with technological novelty and complexity...
MPEG-1: The First Entertainment Bits (1991!)

- Based on the DCT-motion compensation cocktail for video and perceptual coding for audio
- MPEG-1 is one of the most common AV formats for PC environments
- Video CDs and MP3 (MPEG-1 Audio Layer III) players are big successes
- MPEG-1 content is nowadays everybody's stuff

[Diagram: source and delivery, with a demultiplexer separating video and audio]
MPEG-2: Digital TV (1993!)

- Based on the DCT-motion compensation cocktail for video and perceptual coding for audio (multi-channel)
- Adopted in DVB, ATSC and ISDB digital TV solutions; adopted also for DVD storage
- Hundreds of millions of MPEG-2 set-top boxes in use in the US, Europe and Japan

[Diagram: source and delivery, with a demultiplexer separating video and audio, plus an interaction channel]
MPEG-4: Object-Based Coding Standard

- Adopts the object-based model, giving a semantic value to the data structure
- Integration of natural and synthetic content
- Object-based functionalities, e.g., reuse and manipulation capabilities
- Powerful data model for interaction and personalisation
- Exploitation of synergies, e.g., between video coding, computer vision and computer graphics
Video Coding in MPEG-4

Two Parts of the MPEG-4 standard deal with video coding:

- Part 2: Visual (1998). Specifies several coding tools targeting the efficient and error-resilient coding of video, including arbitrarily shaped video; it also includes coding of 3D faces and bodies.
- Part 10: Advanced Video Coding (AVC) (2003). Specifies more efficient (about 50%) and more resilient video coding tools; this Part has been jointly developed by ISO/IEC MPEG and ITU-T through the Joint Video Team (JVT): H.264/MPEG-4 AVC.

While Part 2 addresses object- and frame-based coding, Part 10 only addresses rectangular frames.
H.264/AVC: Main Functionalities

- Coding efficiency (about 50% gain compared to previous standards):
  - Enhanced motion compensation, multiple reference frames, hierarchical B frames, improved de-blocking filter, smaller blocks for the transform, enhanced entropy coding
- Network friendliness:
  - Enabled through the Network Abstraction Layer (NAL) design
- Error resilience:
  - Data partitioning, flexible macroblock ordering (FMO), slices, redundant slices, resynchronisation markers, multiple reference pictures, parameter sets, etc.
- Temporal scalability:
  - Enabled by flexible reference picture management via hierarchical predictions
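Hierarchical B frames, mentioned above, code the two key pictures of a group first and then recursively fill in the midpoints, so that each B picture predicts from already-coded pictures at coarser temporal levels. A toy sketch of a dyadic coding order (illustrative assumption; real encoders have more freedom in structuring the GOP):

```python
# Sketch (author's assumption, not taken from the slides): a dyadic
# hierarchical B-picture coding order for one GOP. Key pictures 0 and
# gop_size come first; midpoints are then added recursively.
def hierarchical_coding_order(gop_size: int) -> list[int]:
    """Return display indices in coding order for one GOP."""
    order = [0, gop_size]

    def visit(lo: int, hi: int) -> None:
        if hi - lo < 2:
            return                      # no B picture fits between lo and hi
        mid = (lo + hi) // 2
        order.append(mid)               # B picture predicted from lo and hi
        visit(lo, mid)
        visit(mid, hi)

    visit(0, gop_size)
    return order

print(hierarchical_coding_order(8))  # [0, 8, 4, 2, 1, 3, 6, 5, 7]
```

Dropping the most recently added pictures halves the frame rate, which is exactly the temporal scalability enabled by this reference structure.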
Basic Coding Architecture

[Block diagram: input video split into 16x16-pixel macroblocks; coder control; transform, scaling and quantisation of the prediction residual; scaling and inverse transform; intra-frame estimation and prediction; motion estimation and compensation; intra/inter macroblock selection; de-blocking filter; entropy coding of control data, quantised transform coefficients and motion data; output video signal]
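The transform/quantisation loop at the heart of this architecture can be made concrete with a toy 1-D example. This is not the H.264/AVC 4x4 integer transform; it is a plain DCT-II round trip with uniform quantisation, shown only to illustrate where the loss is introduced:

```python
import math

def dct(x):
    """Unnormalised DCT-II of a short sample vector."""
    N = len(x)
    return [sum(x[n] * math.cos(math.pi / N * (n + 0.5) * k) for n in range(N))
            for k in range(N)]

def idct(X):
    """Exact inverse of the DCT-II above (DCT-III with 1/N, 2/N scaling)."""
    N = len(X)
    return [X[0] / N + 2.0 / N * sum(X[k] * math.cos(math.pi / N * (n + 0.5) * k)
                                     for k in range(1, N))
            for n in range(N)]

def quantise(X, step):
    # Uniform scalar quantisation: the only lossy step in the loop.
    return [round(c / step) for c in X]

def dequantise(Q, step):
    return [q * step for q in Q]

signal = [52, 55, 61, 66]            # illustrative pixel values
step = 10                            # illustrative quantisation step
recon = idct(dequantise(quantise(dct(signal), step), step))
# recon is close to signal, but not identical: the quantiser discarded detail.
```

A real encoder computes such residual transforms per block after intra/inter prediction, then entropy-codes the quantised coefficients.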
H.264/AVC Profiles
H.264/AVC: a Success Story

- 3GPP (recommended in rel 6)
- 3GPP2 (optional for streaming service)
- ARIB (Japan mobile segment broadcast)
- ATSC (preliminary adoption for robust-mode back-up channel)
- Blu-ray Disc Association
- DMB (Korea - mandatory)
- DLNA (optional in first version)
- DVB (specified in TS 102 005 and one of two in TS 101 154)
- DVD Forum (mandatory for HD DVD players)
- IETF AVT (RTP payload spec approved as RFC 3984)
- ISMA (mandatory, specified in near-final rel 2.0)
- US DoD MISB (US government preferred codec up to 1080p)
- (and, of course, MPEG and the ITU-T)
Is there a Moore's law of compression?

- Every 4 years, video compression factors have roughly doubled:
  - H.263/MPEG-4 SP is as good as H.261 at half the bitrate
  - H.264/MPEG-4 Part 10 is as good as MPEG-2/MPEG-4 ASP at half the bitrate
- How long before H.264 is replaced by an alternative twice as good?
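A back-of-the-envelope sketch of this "Moore's law of compression": if the compression factor doubles every 4 years, the bitrate needed for the same quality halves on the same schedule. The function and numbers below are illustrative, not measured data:

```python
# Illustrative sketch: bitrate for constant quality under a
# "doubling every 4 years" compression trend. All names assumed.
def relative_bitrate(years_since_baseline: float, doubling_years: float = 4.0) -> float:
    """Bitrate as a fraction of the baseline codec's bitrate, same quality."""
    return 0.5 ** (years_since_baseline / doubling_years)

# Eight years after a baseline: two halvings, i.e. a quarter of the bitrate.
print(relative_bitrate(8))  # 0.25
```

This matches the pairs on the slide: a standard one doubling period ahead delivers the same quality at half the bitrate.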
Scalable Video Coding: Objectives

Scalability is a functionality regarding the useful decoding of parts of the coded bitstream, ideally:

1. while achieving an RD performance at any supported spatial, temporal, or SNR resolution that is comparable to single-layer coding at that particular resolution, and
2. without significantly increasing the decoding complexity.
SVC Applications

- Robust video delivery:
  - Adaptive delivery over error-prone networks and to devices with varying capability
  - Combined with unequal error protection
  - Internet/mobile transmission
- Scalable storage:
  - Scalable export of video content
  - Graceful expiration or deletion
  - Surveillance DVRs and home PVRs
- Enhancement services:
  - Upgrade delivery from 1080i/720p to 1080p
  - DTV broadcasting, optical storage devices
SVC Coding Architecture (2007!)

[Block diagram: three spatial layers, each with hierarchical MCP and intra prediction (texture, motion), progressive SNR refinement texture coding, and base layer coding; spatial decimation between layers; inter-layer prediction of intra, motion and residual data; all layers multiplexed into a scalable bit-stream whose base layer is produced by an H.264/AVC compatible encoder as an H.264/AVC compatible bit-stream]
Spatio-Temporal-Quality Cube

[Figure: the global bit-stream spans a cube of extraction points whose axes are spatial resolution (QCIF, CIF, 4CIF), temporal resolution (7.5, 15, 30, 60 Hz) and bit rate (quality/SNR, from low to high)]
SVC Scalability Types

- Temporal scalability: can typically be achieved without losses in rate-distortion performance.
- Spatial scalability: when applying an optimised SVC encoder control, the bitrate increase relative to non-scalable H.264/AVC coding at the same fidelity can be as low as 10% for dyadic spatial scalability. Results become worse as the spatial resolution of both layers decreases, and improve as the spatial resolution increases.
- SNR/quality scalability: when applying an optimised encoder control, the bitrate increase relative to non-scalable H.264/AVC coding at the same fidelity can be as low as 10% for all supported rate points when spanning a bitrate range with a factor of 2-3 between the lowest and highest supported rate point.
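Dyadic temporal scalability can be pictured as each temporal layer keeping every 2^k-th frame of the full-rate sequence. A minimal sketch (illustrative only; the layer-to-stride mapping is an assumption, not the SVC syntax):

```python
# Sketch (assumption): dyadic temporal layers. Layer `level` keeps every
# frame whose display index is divisible by 2**(max_level - level).
def temporal_layer(frames: list[int], level: int, max_level: int) -> list[int]:
    stride = 2 ** (max_level - level)
    return [f for f in frames if f % stride == 0]

frames = list(range(9))                 # display indices 0..8
print(temporal_layer(frames, 0, 3))     # [0, 8]       -> 1/8 of the frame rate
print(temporal_layer(frames, 1, 3))     # [0, 4, 8]    -> 1/4 of the frame rate
print(temporal_layer(frames, 3, 3))     # full frame rate
```

Because each layer's frames are a superset of the previous layer's, a decoder can drop enhancement packets and still decode a valid lower-rate video, which is why temporal scalability costs essentially nothing in RD performance.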
SVC Performance: Spatial Scalability

- 10-15% gains over simulcasting
- Performs within 10% of single-layer coding
3D Worlds

- 3D experiences may be provided through multi-view video, notably:
  - 3D video (also called stereo), which brings a depth impression of a scene
  - Free viewpoint video (FVV), which allows an interactive selection of the viewpoint and direction within certain ranges
- May require special 3D display technology: many new products announced recently and being exhibited
- New 3D display technology is driving this area: no glasses, multi-person displays, higher display resolutions, avoiding uneasy feelings (headaches, nausea, eye strain, etc.)
- Relevant for broadcast TV, teleconferencing, surveillance, interactive video, cinema, gaming and other immersive video applications
Multiview Video Coding (MVC)

- In addition to exploiting the temporal and spatial redundancy within each view to achieve coding gains, redundancy can also be exploited across the different views.
- The JVT standardisation goal is to reach 50% bitrate savings over independent coding of views at the same quality, by defining another extension to the H.264/AVC standard.
MVC Prediction Structures

- Many prediction structures are possible to exploit inter-camera redundancy: a trade-off in memory, delay, computation and coding efficiency.
- Examples: MPEG-2 Video Multiview profile, MPEG-4 MVC
MVC: Technical Solution

- The current H.264/AVC multiview extension does not require any changes to lower-level syntax:
  - Very compatible with single-layer AVC hardware
- Inter-view prediction:
  - Enabled through the flexible design of decoded reference picture management
  - Allows decoded pictures from other views to be inserted into, and removed from, the reference picture buffer
- Small changes to high-level syntax:
  - E.g., to specify view dependency
Some MVC Performance Results

Anchor is H.264/AVC without hierarchical B pictures; however, simulcast already includes hierarchical B pictures.
State-of-the-Art: H.264/AVC Encoding Architecture

[Block diagram: input video split into 16x16-pixel macroblocks; coder control; transform, scaling and quantisation; embedded decoder with scaling and inverse transform; intra-frame prediction; motion estimation and compensation with intra/inter selection; de-blocking filter; entropy coding of control data, quantised transform coefficients and motion data; output video signal]
The Current Video Coding Paradigm

- Coding efficiency: a smart-dummy relation
- Complexity budget: a rigid master-slave relation
- Error robustness: the snowball effect
- Fitting better one-to-many topologies
Emerging Challenges

- Applications (from down-link to up-link):
  - Wireless digital video cameras
  - Multimedia mobile phones and PDAs
  - Low-power video sensors and surveillance cameras
  - Wireless video teleconferencing systems
- Requirements:
  - Light and flexible distribution of codec complexity
  - Robustness to packet/frame losses
  - High compression efficiency
  - Low latency
- Target:
  - Inter coding efficiency
  - Intra coding complexity (encoder)
  - Intra coding robustness

[Diagram: heavy encoder, transcoding, light decoder]
Distributed Coding of Dependent Sources: The Slepian-Wolf Theorem

- Independent encoding and decoding (no errors): R_X >= H(X), R_Y >= H(Y).
- Distributed encoding and joint decoding (vanishing error probability for long sequences): R_X >= H(X|Y), R_Y >= H(Y|X), R_X + R_Y >= H(X,Y).

[Figure: the achievable rate region in the (R_X, R_Y) plane, bounded by H(X|Y), H(X), H(Y|X) and H(Y), with the boundary line R_X + R_Y = H(X,Y); the shaded region shows the combinations of R_X and R_Y needed to reconstruct X and Y with an arbitrarily small probability of error]
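The Slepian-Wolf bounds can be checked numerically on a toy pair of correlated binary sources. The joint distribution below is an assumed example (Y equals X with probability 0.9), not from the slides:

```python
import math

# Toy check of the Slepian-Wolf bounds for two correlated binary sources.
# Assumed example: X is a fair bit, Y = X with probability 0.9.
joint = {(0, 0): 0.45, (0, 1): 0.05, (1, 0): 0.05, (1, 1): 0.45}

def H(dist):
    """Entropy in bits of a probability table (dict of probabilities)."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

H_XY = H(joint)
H_X = H({x: sum(p for (a, _), p in joint.items() if a == x) for x in (0, 1)})
H_Y = H({y: sum(p for (_, b), p in joint.items() if b == y) for y in (0, 1)})
H_X_given_Y = H_XY - H_Y   # chain rule: H(X|Y) = H(X,Y) - H(Y)
H_Y_given_X = H_XY - H_X

# Corner point of the Slepian-Wolf region: R_X = H(X|Y), R_Y = H(Y).
print(round(H_X_given_Y, 3), round(H_Y, 3), round(H_XY, 3))  # 0.469 1.0 1.469
```

Because H(X|Y) is about 0.47 bits here, X can in principle be compressed to roughly half a bit per symbol without the encoder ever seeing Y, as long as the decoder has Y as side information.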
Lossy Coding with Side Information: The Wyner-Ziv Theorem

[Diagram: source X enters a lossy encoder; a lossy joint decoder uses side information Y to produce the reconstruction X^]

The Wyner-Ziv theorem states that when X and Y are only jointly decoded, there is no coding efficiency loss with respect to the case when they are jointly encoded and decoded, if X and Y are jointly Gaussian and a mean squared error distortion measure is considered. Later, it was proved that only the innovation, X - Y, needs to be Gaussian.
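In the jointly Gaussian case, the Wyner-Ziv rate-distortion function coincides with the conditional rate-distortion function R(D) = max(0, 0.5 log2(sigma^2 / D)), where sigma^2 is the variance of the innovation. A small numeric sketch (function name and test values are illustrative):

```python
import math

# Sketch (assumption): Wyner-Ziv rate for jointly Gaussian X, Y under MSE
# distortion equals the conditional rate-distortion function,
#   R(D) = max(0, 0.5 * log2(var_innovation / D)),
# i.e. side information at the decoder only is as good as at both ends.
def wyner_ziv_rate(var_innovation: float, distortion: float) -> float:
    if distortion >= var_innovation:
        return 0.0          # side information alone already meets the target
    return 0.5 * math.log2(var_innovation / distortion)

# Halving the target distortion costs half a bit per sample.
print(wyner_ziv_rate(4.0, 1.0))  # 1.0
print(wyner_ziv_rate(4.0, 0.5))  # 1.5
```

The "no rate loss" statement in the theorem is exactly that this curve matches the rate needed when the encoder also observes Y.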
Feedback Channel Based DVC Architecture

- Based on a split between Wyner-Ziv (WZ) frames and key frames (KF).
- Key frames are used with a regular (GOP size) or dynamic periodicity.
- Key frames are coded with H.264/AVC Intra.
- Turbo codes or LDPC codes may be used.
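The key-frame/WZ-frame split with a regular GOP periodicity can be sketched as a simple index partition (illustrative only; a real codec would also support the dynamic periodicity mentioned above):

```python
# Sketch (assumption): split a sequence into key frames (conventional
# H.264/AVC Intra coding) and Wyner-Ziv frames, regular GOP periodicity.
def split_frames(num_frames: int, gop_size: int = 2):
    key, wz = [], []
    for i in range(num_frames):
        (key if i % gop_size == 0 else wz).append(i)
    return key, wz

key, wz = split_frames(8, gop_size=2)
print(key)  # [0, 2, 4, 6]  -> intra-coded key frames
print(wz)   # [1, 3, 5, 7]  -> Wyner-Ziv coded frames
```

With GOP size 2, the decoder interpolates side information for each WZ frame from its two neighbouring key frames, then corrects it with parity bits requested over the feedback channel.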
DVC RD Performance

Coastguard QCIF@15Hz, GOP2

VISNET II DVC RD Performance

Foreman QCIF@15Hz, GOP2

VISNET II DVC RD Performance

Soccer QCIF@15Hz, GOP2

VISNET II DVC RD Performance

Hall & Monitor QCIF@15Hz, GOP2
Looking over the Horizon

- From the mid-80s until recently, major video compression gains have been obtained in an almost continuous way: about 50% gains every 5 years.
- Current hybrid coding schemes seem to be close to saturation in terms of RD performance... other approaches may follow.
- Besides compression efficiency, there are other important functionalities and scenarios that still need much attention, notably scalability, multiview, etc.
- DVC is an interesting new way to look at video coding; it is still not clear what will really come out of it.
- New breakthroughs in video coding are not emerging: is there more to learn from the Human Visual System (HVS)?
rugby, 4.7 Mbit/s, AVC

rugby, 4.7 Mbit/s, AVC + texture synthesis
Quality of Experience

- What matters is the impact of the content on the end-user!
- Metrics to estimate the perceived value and impact of the content on the end-user, encompassing:
  - Inherent quality of the signal
  - Quality of interaction
  - Multimodal nature of the quality
  - Context
  - Environment (device, surroundings, etc.)
Quality of Experience

- Quality of Service: the value estimated by the provider about the user's experience
- Quality of Experience: the value (estimated or measured) of the actual user's experience
- Quality of Experience is the dual (and extended) view of the QoS problem:
  - QoS = provider-centric
  - QoE = user-centric
Challenges in visual quality assessment

- Primary focus:
  - Reliable and widely accepted perceived quality metrics for pictures
  - Reliable perceived quality metrics for video
  - Reliable perceived quality metrics without reference or with reduced reference
- Should be extended to:
  - Stereoscopic and multi-view quality metrics
  - 3D quality metrics
  - Multi-modal quality metrics
  - Segmentation and tracking quality metrics