GPU Compute accelerated HEVC decoder on ARM Mali TM -T600 GPUs



Similar documents
High Efficiency Video Coding (HEVC) or H.265 is a next generation video coding standard developed by ITU-T (VCEG) and ISO/IEC (MPEG).

Performance Analysis and Comparison of JM 15.1 and Intel IPP H.264 Encoder and Decoder

An Introduction to Ultra HDTV and HEVC

Thor High Efficiency, Moderate Complexity Video Codec using only RF IPR

White paper. H.264 video compression standard. New possibilities within video surveillance.

Using Mobile Processors for Cost Effective Live Video Streaming to the Internet

WHITE PAPER. H.264/AVC Encode Technology V0.8.0

Video Coding Technologies and Standards: Now and Beyond

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

Video Compression Trends & OUTLOOK. Andreas Pentrelli Solution Consultant

NVIDIA GeForce GTX 580 GPU Datasheet

Alberto Corrales-García, Rafael Rodríguez-Sánchez, José Luis Martínez, Gerardo Fernández-Escribano, José M. Claver and José Luis Sánchez

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201

Multihypothesis Prediction using Decoder Side Motion Vector Derivation in Inter Frame Video Coding

TECHNICAL OVERVIEW OF VP8, AN OPEN SOURCE VIDEO CODEC FOR THE WEB

H.264/MPEG-4 AVC Video Compression Tutorial

HIGH-PERFORMANCE GPU VIDEO ENCODING ABHIJIT PATAIT SR. MANAGER, NVIDIA

Comparison of the Coding Efficiency of Video Coding Standards Including High Efficiency Video Coding (HEVC)

Company and Market Overview

Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

Study and Implementation of Video Compression standards (H.264/AVC, Dirac)

BDTI Solution Certification TM : Benchmarking H.264 Video Decoder Hardware/Software Solutions

Figure 1: Relation between codec, data containers and compression algorithms.

Power Benefits Using Intel Quick Sync Video H.264 Codec With Sorenson Squeeze

4Kp60 H.265/HEVC Glass-to-Glass Real-Time Encoder Reference Design

How To Improve Performance Of H.264/Avc With High Efficiency Video Coding (Hevc)

THE PRIMARY goal of most digital video coding standards

A Survey on ARM Cortex A Processors. Wei Wang Tanima Dey

Taos - A Revolutionary H.264 Video Codec Architecture For 2-Way Video Communications Applications

big.little Technology Moves Towards Fully Heterogeneous Global Task Scheduling Improving Energy Efficiency and Performance in Mobile Devices

Parametric Comparison of H.264 with Existing Video Standards

Rate-Constrained Coder Control and Comparison of Video Coding Standards

FRAUNHOFER INSTITUTE FOR INTEGRATED CIRCUITS IIS AUDIO COMMUNICATION ENGINE RAISING THE BAR IN COMMUNICATION QUALITY

Application Performance Analysis of the Cortex-A9 MPCore

MPEG-H Audio System for Broadcasting

Home and Digital. Jean-Marc Chery Chief Operating Officer

The H.264/MPEG-4 Advanced Video Coding (AVC) Standard

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

H.264/MPEG-4 Advanced Video Coding Alexander Hermans

Digital Audio Compression: Why, What, and How

Using AVC/H.264 and H.265 expertise to boost MPEG-2 efficiency and make the 6-in-6 concept a reality

HIGH PERFORMANCE VIDEO ENCODING WITH NVIDIA GPUS

High Dynamic Range Video The Future of TV Viewing Experience

Evaluating Wavelet Tranforms for Video Conferencing Applications. Second quarter report (Oct Dec, 2008)

IP Video Rendering Basics

THE EMERGING JVT/H.26L VIDEO CODING STANDARD

H 261. Video Compression 1: H 261 Multimedia Systems (Module 4 Lesson 2) H 261 Coding Basics. Sources: Summary:

Sample Project List. Software Reverse Engineering

Accelerating Wavelet-Based Video Coding on Graphics Hardware

DCT-JPEG Image Coding Based on GPU

For Articulation Purpose Only

NVIDIA VIDEO ENCODER 5.0

Video Coding Standards and Scalable Coding

HPC with Multicore and GPUs

Low power GPUs a view from the industry. Edvard Sørgård

Choosing the Right DSP for High-Resolution Imaging in Mobile and Wearable Applications


Architectures and Platforms

VALAR: A BENCHMARK SUITE TO STUDY THE DYNAMIC BEHAVIOR OF HETEROGENEOUS SYSTEMS

COMPUTING. SharpStreamer Platform. 1U Video Transcode Acceleration Appliance

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

H.264 Based Video Conferencing Solution

How To Compare Video Resolution To Video On A Computer Or Tablet Or Ipad Or Ipa Or Ipo Or Ipom Or Iporom Or A Tv Or Ipro Or Ipot Or A Computer (Or A Tv) Or A Webcam Or

A Tutorial on Image/Video Coding Standards

CM0340 SOLNS. Do not turn this page over until instructed to do so by the Senior Invigilator.

A Look at Emerging Standards in Video Security Systems. Chris Adesanya Panasonic Network Systems Company

high-quality surround sound at stereo bit-rates

Graphics Cards and Graphics Processing Units. Ben Johnstone Russ Martin November 15, 2011

Solomon Systech Image Processor for Car Entertainment Application

IMPACT OF COMPRESSION ON THE VIDEO QUALITY

Internet Video Streaming and Cloud-based Multimedia Applications. Outline

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201

Efficient Stream-Reassembling for Video Conferencing Applications using Tiles in HEVC

Overview: Video Coding Standards

White Paper Video and Image Processing Design Using FPGAs

Datasheet EdgeVision

Overview of the Scalable Video Coding Extension of the H.264/AVC Standard

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet

THE High Efficiency Video Coding (HEVC) standard is

Graphical Processing Units to Accelerate Orthorectification, Atmospheric Correction and Transformations for Big Data

GPU Architectures. A CPU Perspective. Data Parallelism: What is it, and how to exploit it? Workload characteristics

A Tutorial On Network Marketing And Video Transoding

White paper. An explanation of video compression techniques.

How To Improve Performance Of The H264 Video Codec On A Video Card With A Motion Estimation Algorithm

VoIP: Enabling Communications for Everything. May,

The H.264/AVC Advanced Video Coding Standard: Overview and Introduction to the Fidelity Range Extensions

Video compression: Performance of available codec software

NTT DOCOMO Technical Journal. Standardization of HEVC Video Coding Scheme Reducing Video Traffic by Half and Perspective for Mobile Services

Multidimensional Transcoding for Adaptive Video Streaming

Voice Communication Package v7.0 of front-end voice processing software technologies General description and technical specification

Proactive Video Assurance through QoE and QoS Correlation

MPEG-4. The new standard for multimedia on the Internet, powered by QuickTime. What Is MPEG-4?

VA (Video Acceleration) API. Jonathan Bian 2009 Linux Plumbers Conference

The MeeGo Multimedia Stack. Dr. Stefan Kost Nokia - The MeeGo Multimedia Stack - CELF Embedded Linux Conference Europe

A Scalable VISC Processor Platform for Modern Client and Cloud Workloads

Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm

X264: A HIGH PERFORMANCE H.264/AVC ENCODER. Loren Merritt and Rahul Vanam*

Advances of Media Technology in Modern Computing

Computer Graphics Hardware An Overview

Transcription:

GPU Compute accelerated HEVC decoder on ARM Mali TM -T600 GPUs

Ittiam Systems Introduction DSP Systems IP Company Multimedia + Communication Systems Multimedia Components, Systems, Hardware Focus on Broadcast, Video Communication, Video Security, Mobile IP Licensing Business Model Founded in 2001 Venture funded Flexible mix of one time fees and royalties for licensing 300+ licensees Worldwide Fortune 100 companies, Tier 1 OEMs Consistently rated as Most Preferred DSP IP Supplier 250 strong Engineering Team World Class Talent Deep Multimedia and end application Expertise 29 patents issued 30+ patents filed World s most preferred DSP IP supplier 2004 2005 2006 DSP Professionals Survey by Forward Concepts 2

Ittiam Multimedia Overview Multimedia Components Audio Codecs Video Codecs/Image Codecs Algorithms for Audio Effects, Acoustics, Imaging ARM CPU, NEON Optimized DSP+HW Accelerators + GPU expertise and capabilities Middleware + SDKs System components Parsers, Creators, Stacks, Subtitles Multimedia Integration Android, Other Frameworks Use Case validation Enhancements to existing Middleware Application Specific SDKs OEM Applications Complete Multimedia Applications Covers major Multimedia Use Cases Camera, Gallery, Editor, Players, Video Editor Production tested Customizable to requirements 4x 3

Ittiam Multimedia Solutions and ARM Strategic Platform Focus on Mobile, Home, Portable segments ARM Connected Community Member Strong Portfolio of IP Expertise in ARM architecture and optimizations for ARM Long Investment Many years of development on ARM Platforms Covering ARM9E, ARM11, Cortex -A8, A9, A15, A5, A7 and NEON TM In house developed reference C models for all IP Efficient, targeted for ARM, validated across multiple generations Partnership Joint Benchmarking of implementations Early Access to Mali/OpenCL information Early involvement on new platforms 4

Ittiam Media Processing Elements Audio Codes Stereo and Multichannel MP12, AAC- LC/HE v1&v2, AC3, DD+ High Quality Resampler Post Processing and Audio Effects Field Proven Acoustics Voice Quality Enhancements with Echo Cancellation/ AEC), Noise Reduction/ANR Equalizer for Microphone Sin & Speaker AGC, AVC, Audio De-Reverb Mic Beam Forming Video Codecs MPEG2, MPEG-4, H.264, HEVC / H.265 Scalable across Multiple ARM Cores Optimized for bandwidth and CPU + NEON Error Resilience for Streaming Use cases In Production Image Processing De-noise, Face detection, Red-eye correction Panorama, HDR, Low Light, 3D B&W, Sepia, Cross Process Exposure, Colours, Geometric, Filters 5

HEVC Overview

HEVC / H.265 Sandard HEVC aka H.265 is a video compression standard, jointly developed by ISO/IEC MPEG and ITU-T VCEG MPEG and VCEG have established a Joint Collaborative Team on Video Coding (JCT-VC) to develop the HEVC standard HEVC is a successor to H.264 standard HEVC can support ultra high resolutions upto 8192 x 4320 pixels HEVC offers substantially higher video compression ratio compared to existing standards

H.265 vs H.264 Tool H.264 H.265 Coding unit 16x16 macroblocks Block coding Structure Coding tree blocks (64x64) Quadtree coding structure Transforms 4x4 and 8x8 4x4, 8x8, 16x16 and 32x32 Inter Prediction 4x4 to 16x16 Symmetric partitions Intra Prediction 9 Modes 35 Modes 4x4 to 64x64 Asymmetric partitions Motion Prediction Spatial Median Advanced Motion Vection Prediction (Spatial + Temporal) Luma motion compensation Chroma motion compensation 6 taps for half-pel positions+ Bilinear filter for qpel positions 2 taps 4 taps 8 taps for half-pixel positions + 7 tap filter for quarter-pel positions Slices Slices for parallel parsing Wavefront parallel processing Tiles and slices for parallel parsing In-loop filters Deblocking Deblocking and SAO

BitRate HEVC compression About 50% compression over H264 for video resolutions of 1080p and above. 30-40% compression over H264 for lower resolutions MPEG-2 35% reduction in bitrate for same PSNR output when compared to H.264 H264/AVC Perceptual video quality is subjective and cannot be measured with PSNR values 1990 2000 2010 H265/HEVC Subjective tests have shown around 50% reduction in bitrate for similar perceptual video quality when compared to H.264

HEVC Applications Near Term Over-the-top(OTT) video services market is growing at a rapid pace, thanks to Netflix, Hulu, YouTube etc., Smarter Phones and Tablets contribute significantly to OTT growth with consumers opting to view videos on-the-go OTT video services are popularly used with in TVs/set-top boxes as well Rapid growth in OTT market chokes the network bandwidth One in five Consumers abandon viewing due to slow feeds, poor quality viewing experience HEVC will enable superior viewing experience with OTT video service

HEVC Applications Long Term Higher quality video in the traditional terrestrial and satellite broadcasts Video recording in cameras and mobile phones, for saving storage space or higher quality Broadcasting 1080p video at 50 or 60 frames per second for the same bandwidth as 1080i (25 or 30 fps) 4K and 8K Ultra-HD broadcasts for theatre-like quality

Need for Software HEVC Decoder HEVC is a newly ratified standard and there is no hardware support in the current generation of Processors (Embedded / Mobile / Applications SoCs) Dedicated HW accelerators for HEVC increases the silicon area and hence the cost significantly Lack of HEVC content makes the early HW implementation risky Software Decoding is simpler and economically viable option for HEVC deployment NOW Handling the HEVC decoder complexity on a wide range of processors with constraints on the power consumption is key challenge for the Software Decoder

Why use GPUs for Video Processing? Decoding of high resolution videos in software involves high computational complexity and will load the CPU enormously GPUs are highly compute capable and power efficient devices Sin CPU Core(s) ARM Cortex with NEON GPUs are generally idle during video playout GPU acceleration will free up the CPU to perform other (system) tasks MALI T600 / OpenCL compliant GPU

HEVC Decoding on Capable GPUs GPUs are massively multithreaded devices capable of handling hundreds or thousands of threads in parallel at any given time Only highly data parallel algorithms of video codec can be efficiently offloaded to the GPU for processing Parsing & Entropy Decode Motion Compensa tion Inverse Quant Intra Prediction Inverse Transform Recon Deblocking & SAO Not suitable for GPU execution Data parallel execution,suitable for GPU execution

Motion Compensation The current picture/frame pixels is predicted from the reference frame s pixels The reference picture can be from past or future The prediction happens on a block-by-block basis Sin And there can be multiple reference frames for each block

Motion Compensation The most compute intensive part of Motion compensation is sub-pixel interpolation Luma 8 or 7 tap filter Chroma 4 tap filter Sub pixel interpolation is data parallel, i.e., interpolation of each block within a frame can happen in parallel and hence suited for GPU computing Sin

Inverse Quantization and Transform Inverse Quantization & Transform The residue value need to be Inverse quantized 2-D Inverse DCT transformations should be performed over the inverse quantized data Recon & InLoop Filters Reconstruction : The output from the Motion compensation and intra prediction should be added with the output from Inverse transform In loop filtering such as Deblocking and SAO filters are applied over reconstructed samples Parsing & Entropy Decode Motion Compensati on Inverse Quant Intra Prediction Sin Inverse Transform Recon Deblocking & SAO

Challenges in CPU+GPU Implementation Efficient Partitioning of work between CPU and GPU The effective FPS of decoder will be the minimum of the FPS achieved by the CPU and GPU for their respective work So the partitioning needs to be efficient so that both of them perform their respective work at almost the same speed(fps) Efficient pipelining data between CPU and GPU The algorithms running on CPU will depend on the output of algorithms from GPU and/or vice versa A good design should make sure neither the CPU nor the GPU spend any time waiting for the output of the other Cache coherency Cache coherency between CPU and GPU data need to ensured.

Benefits of Mali T600 GPU The 128-bit vector processing Presence of GPU cache instead of Local memory Flexible OpenCL workgroup size No divergent threads Unified memory Suits DSP algorithms like Video processing No requirement for data transfers from/to global memory. Can be understood just like a CPU. Works optimizally for a large range of OpenCL workgroup sizes. Multiple block sizes in a Video frame can be handled efficiently. Similar to CPU code, conditional code can be used in OpenCL kernels as well. Different kinds of filter types, filter lengths etc., in video decode can be handled efficiently. CPU and GPU share the same memory. Video YUV buffers are pretty big. There is no need of costly memory transfers of those buffers. MALI GPUs are well suited for Video Acceleration with significant power/performance benefits

Thank You For more information visit www.ittiam.com or contact us at mkt@ittiam.com