HIGH-PERFORMANCE GPU VIDEO ENCODING ABHIJIT PATAIT SR. MANAGER, NVIDIA



Similar documents
AGENDA. Overview GPU Video Encoding NVIDIA Video Encoding Capabilities. Software API Performance & Quality. Kepler vs Maxwell GPU capabilities Roadmap

NVIDIA VIDEO ENCODER 5.0

HIGH PERFORMANCE VIDEO ENCODING WITH NVIDIA GPUS

GRID SDK FAQ. PG January Frequently Asked Questions

CLOUD GAMING WITH NVIDIA GRID TECHNOLOGIES Franck DIARD, Ph.D., SW Chief Software Architect GDC 2014

Cloud Gaming & Application Delivery with NVIDIA GRID Technologies. Franck DIARD, Ph.D. GRID Architect, NVIDIA

REMOTE HIGH FIDELITY VISUALIZATION. May 2015 Jeremy Main, Sr. Solution Architect GRID

Get the Best out of NVIDIA GPUs for 3D Design and Engineering in the Cloud

Amazon EC2 Product Details Page 1 of 5

PNY Professional Solutions NVIDIA GRID - GPU Acceleration for the Cloud

NVIDIA GRID OVERVIEW SERVER POWERED BY NVIDIA GRID. WHY GPUs FOR VIRTUAL DESKTOPS AND APPLICATIONS? WHAT IS A VIRTUAL DESKTOP?

Boundless Security Systems, Inc.

S Hands on Tutorial: Deploying GRID in Citrix and VMware Virtual Desktop Environments

MPEG-4 AVC/H.264 Video Codecs Comparison

Overview. Lecture 1: an introduction to CUDA. Hardware view. Hardware view. hardware view software view CUDA programming

QuickSpecs. NVIDIA Quadro M GB Graphics INTRODUCTION. NVIDIA Quadro M GB Graphics. Overview

NVIDIA Quadro M4000 Sync PNY Part Number: VCQM4000SYNC-PB. User Guide

GTC EXPRESS WEBINAR: GETTING THE MOST OUT OF VMWARE HORIZON VIEW VDGA WITH NVIDIA GRID. Steve Harpster - Solution Architect NALA

PLANNING FOR DENSITY AND PERFORMANCE IN VDI WITH NVIDIA GRID JASON SOUTHERN SENIOR SOLUTIONS ARCHITECT FOR NVIDIA GRID

Using Mobile Processors for Cost Effective Live Video Streaming to the Internet

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

NVIDIA GeForce GTX 580 GPU Datasheet

Wirecast Release Notes

HP Workstations graphics card options

XID ERRORS. vr352 May XID Errors

Alberto Corrales-García, Rafael Rodríguez-Sánchez, José Luis Martínez, Gerardo Fernández-Escribano, José M. Claver and José Luis Sánchez

Graphics Cards and Graphics Processing Units. Ben Johnstone Russ Martin November 15, 2011

NVIDIA GeForce GTX 750 Ti

Vmware Horizon View with Rich Media, Unified Communications and 3D Graphics

CTX OVERVIEW. Ucentrik CTX

This letter contains latest information about the above mentioned software version.

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing

Introduction to GPU hardware and to CUDA

Introduction to GP-GPUs. Advanced Computer Architectures, Cristina Silvano, Politecnico di Milano 1

MPEG-4 AVC/H.264 Video Codecs Comparison

This letter contains latest information about the above mentioned software version.

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

S4726 VIRTUALIZATION AN INTRO TO VIRTUALIZATION. Luke Wignall & Jared Cowart Senior Solution Architects GRID

Low power GPUs a view from the industry. Edvard Sørgård

ADVANTAGES OF AV OVER IP. EMCORE Corporation

COMPUTING. SharpStreamer Platform. 1U Video Transcode Acceleration Appliance

QuickSpecs. NVIDIA Quadro K5200 8GB Graphics INTRODUCTION. NVIDIA Quadro K5200 8GB Graphics. Technical Specifications

4Kp60 H.265/HEVC Glass-to-Glass Real-Time Encoder Reference Design

QuickSpecs. NVIDIA Quadro K1200 4GB Graphics INTRODUCTION PERFORMANCE AND FEATURES. Overview

Cluster Monitoring and Management Tools RAJAT PHULL, NVIDIA SOFTWARE ENGINEER ROB TODD, NVIDIA SOFTWARE ENGINEER

VMware and NVIDIA: Bringing Workstations to the cloud

GPU Accelerated XenDesktop 3D Graphics beyond Designers and Engineers

VIRTU Universal MVP Installation Guide

ANDROID DEVELOPER TOOLS TRAINING GTC Sébastien Dominé, NVIDIA

Comparative Assessment of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders for Low-Delay Video Applications

GPU Computing - CUDA

QuickSpecs. NVIDIA Quadro K5200 8GB Graphics INTRODUCTION. NVIDIA Quadro K5200 8GB Graphics. Overview. NVIDIA Quadro K5200 8GB Graphics J3G90AA

ivms-4500(android) Mobile Client Software User Manual (V1.0)

QuickSpecs. NVIDIA Quadro K2200 4GB Graphics INTRODUCTION. NVIDIA Quadro K2200 4GB Graphics. Technical Specifications

High Efficiency Video Coding (HEVC) or H.265 is a next generation video coding standard developed by ITU-T (VCEG) and ISO/IEC (MPEG).

GPU Compute accelerated HEVC decoder on ARM Mali TM -T600 GPUs

Bosch Video Management System

Performance Testing in Virtualized Environments. Emily Apsey Product Engineer

Red Hat VDI. David Simmons

HP Workstations graphics card options

CSE 237A Final Project Final Report

Video Codec Requirements and Evaluation Methodology

STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING VISUALISATION GPU COMPUTING

QuickSpecs. NVIDIA Quadro K4200 4GB Graphics INTRODUCTION. NVIDIA Quadro K4200 4GB Graphics. Overview

NVIDIA GeForce GT630 DP (2GB) PCIe x16 Card Graphics Card. QuickSpecs. Overview. Models. DA Worldwide Version 4 November 27, 2012 Page 1

NVIDIA CUDA GETTING STARTED GUIDE FOR MICROSOFT WINDOWS

GPU Parallel Computing Architecture and CUDA Programming Model

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Media Cloud Based on Intel Graphics Virtualization Technology (Intel GVT-g) and OpenStack *

Chapter 19 Cloud Computing for Multimedia Services

NVIDIA GeForce Experience

Getting Started with the ZED 2 Introduction... 2

S5445 BUILDING THE BEST USER EXPERIENCE WITH CITRIX XENAPP & NVIDIA GRID THOMAS POPPELGAARD

We are presenting a wavelet based video conferencing system. Openphone. Dirac Wavelet based video codec

Brainlab Node TM Technical Specifications

Remoting Your 3D. Ian Williams Manager, PSG Applied Engineering NVIDIA Corporation.

GPU Hardware and Programming Models. Jeremy Appleyard, September 2015

NVIDIA Tesla Compute Cluster Driver for Windows

Virtual Maintenance Training Deployment The Latest Technology Options

How To Use An Amd Ramfire R7 With A 4Gb Memory Card With A 2Gb Memory Chip With A 3D Graphics Card With An 8Gb Card With 2Gb Graphics Card (With 2D) And A 2D Video Card With

Virtual Desktop VMware View Horizon

Video Compression Trends & OUTLOOK. Andreas Pentrelli Solution Consultant

HP Z Workstations graphics card options

The MOST Affordable HD Video Conferencing. Conferencing for Enterprises, Conferencing for SMBs

Technical Brief. Quadro FX 5600 SDI and Quadro FX 4600 SDI Graphics to SDI Video Output. April 2008 TB _v01

NVIDIA GRID VGPU Steve Harpster SA NALA October 2013

Catalyst Software Suite Version 9.3 Release Notes

Planning Your Installation or Upgrade

Understanding Compression Technologies for HD and Megapixel Surveillance

Apple ProRes. White Paper June 2014

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga

Milestone Solution Partner IT Infrastructure Components Certification Summary

Home Exam 3: Distributed Video Encoding using Dolphin PCI Express Networks. October 20 th 2015

ivms-4500(windows Mobile) Mobile Client Software User Manual Version 1.0

Lecture 1: an introduction to CUDA

Android In The Cloud: A New PaaS Computing Platform

ArcGIS Pro: Virtualizing in Citrix XenApp and XenDesktop. Emily Apsey Performance Engineer

Transcription:

HIGH-PERFORMANCE GPU VIDEO ENCODING ABHIJIT PATAIT SR. MANAGER, NVIDIA

AGENDA GPU Video Encoding Overview NVIDIA Video Encoding Capabilities Kepler, Maxwell Gen 1, Maxwell Gen 2 Software API Performance & Quality Roadmap

WHY GPU VIDEO ENCODING?

BENEFITS Low power Fixed function hardware, free CPU Reduced memory transfers Low latency High performance Higher density Scalability Automatic benefit from improvements in hardware Ease of programming Linux, Windows, C/C++, Application portability

NVIDIA VIDEO ENCODER CAPABILITIES

MAIN FEATURES Feature Benefits H.264 base, main, high profiles Wide range of use-cases H.265/HEVC main profile High performance (4K @ 60 fps) YUV 4:2:0 and 4:4:4 support QP maps 4K encoding in hardware API - NV Encode SDK & GRID SDK Independent of CUDA Lower bitrates at same quality Blazing-speed encoding High quality encoding without chroma subsampling Customizable quality, region of interest encoding High resolution encode Flexible, Win/Linux, DirectX/CUDA Use CUDA and encode simultaneously

FEATURE COMPARISON Kepler Maxwell Gen 1 (GM10x) Maxwell Gen 2 (GM20x) H.264 only H.264 only H.264 and HEVC/H.265 Planar 4:4:4 & proprietary 4:4:4; no lossless encoding ~240 fps 2-pass encoding @ 720p GRID K340/K520, K1/K2, Quadro, Tesla K10/K20 GeForce 2 full-speed encode sessions/system Standard 4:4:4 and H.264 lossless encoding ~500 fps 2-pass encoding @ 720p Maxwell-based GRID & Quadro products GeForce 2 full-speed encode sessions/system Standard 4:4:4 and H.264 lossless encoding ~900 fps 2-pass encoding @ 720p TBA GeForce 2 full-speed encode sessions/system NV Encode SDK 1.0-5.0 (Now) NV Encode SDK 4.0+ (Now) NV Encode SDK 5.0+ (Now) GRID SDK 1.x, 2.2, 2.3 (Now) GRID SDK 3.0+ (Now) In development

WHAT S NEW HARDWARE HEVC 8-bit encoding Main8 profile Optimized for low-latency applications (I and P frames) > 300 fps at very high quality 720p H.264 Improved performance (~80% higher compared to 1st Gen Maxwell) 4:4:4 and lossless

WHAT S NEW - SOFTWARE NVENC SDK 5.0 NVIDIA GPU driver 347.18 and above HEVC Unified API for H.264 and HEVC Linux & Windows Intra refresh, ref-pic invalidation, etc. for H.264 and HEVC Support for all NVENC hardware up to GM20x Adaptive quantization Quality improvements All-new sample applications, including a performance application

SOFTWARE API

NVENC SDK GRID SDK USING NVENC Direct Encode Capture + Encode No capture Transcoding Archiving Video editing CUDA pre-process + encoding Granular encoder settings D3D, CUDA interop Capture + encode Optimized for low-latency apps Capture + CUDA preprocess + encoding Encoder settings optimized for streaming D3D, CUDA interop

DIRECT ENCODE (NVENC SDK) Encoded bitstream Client application NVENC API Initialize, Configure, Encode CUDA Driver NVENC Driver Configure HW HW Encode NVENC firmware + hardware DirectX Driver

CAPTURE & ENCODE (GRID SDK) Encoded Bitstream Client application DX/OGL Present NvFBC/NvIFR Encode NVENC Driver YUV DirectX/OGL Driver Capture NVENC Hardware GPU 3D Engine

NVENC SDK (1/2) Available on NVIDIA developer zone https://developer.nvidia.com/nvidia-video-codec-sdk Current release: 5.0 Interface header, documentation, sample application.dll/.so included in the driver Unified API for Windows and Linux Works on x86/x64 API s, presets, rate control modes for Low-latency streaming Transcoding Video conferencing

NVENC SDK (2/2) Unified API for H.264 and HEVC Flexibility Dynamic resolution/bitrate change Low-level encoder settings Windows, Linux, DirectX, CUDA, OGL (via CUDA) Works on GeForce (2 sessions/system) Error concealment Reference picture invalidation Intra-refresh Greater flexibility for quality/performance trade-off Lossless encoding only in NVENC SDK

GRID SDK ENCODE NDA only older release available on NV developer zone https://developer.nvidia.com/grid-app-game-streaming Current release: 3.1 (Now NDA), 2.3 (Public) Interface header, documentation, sample apps.dll/.so included in the driver Windows and Linux Works on x86/x64 Presets and API s for Remote graphics (Cloud gaming, remote desktop, capture & stream) Optimized for low latency

QUALITY

PSNR (db) H.264 QUALITY 1-PASS ENCODING 46 H.264 quality with 1-pass rate control 44 42 Default 40 38 LL-Default HP HQ BD LL-HQ 36 34 6 8 10 12 15 18 20 bitrate (Mbps)

PSNR (db) H.264 QUALITY 2-PASS ENCODING 46 H.264 quality with 2-pass rate control 44 42 Default 40 38 LL-Default HP HQ BD LL-HQ 36 34 6 8 10 12 15 18 20 bitrate (Mbps)

1-pass 2-pass 1-pass 2-pass 1-pass 2-pass 1-pass 2-pass 1-pass 2-pass 1-pass 2-pass PSNR (db) COMPARISON: 1-PASS VS 2-PASS H.264 quality comparison: 1-pass vs 2-pass 40.5 40 39.5 39 38.5 38 37.5 Default LL-Default HP HQ BD LL-HQ Encoder preset

H.264 HEVC H.264 HEVC H.264 HEVC H.264 HEVC H.264 HEVC H.264 HEVC BITRATE SAVINGS Bitrate savings - Default preset 9.8 Bitrate savings - HQ preset 9.7 8 8 7.8 7.9 6 6 5.8 5.8 4 3.9 39.5 db 41.0 db 42.0 db 39.5 db 41.0 db 42.0 db 33% 25% 18% Bitrate savings 33% 26% 19%

H.264 VS HEVC Courtesy: Vanguard video

H.264 VS HEVC Courtesy: Vanguard video

PERFORMANCE

1 pass 1 pass 2 pass 2 pass 1 pass 1 pass 2 pass 2 pass 1 pass 1 pass 2 pass 2 pass 1 pass 1 pass 2 pass 2 pass Encode FPS H.264 PERFORMANCE GM20X H.264 Performance (1080p) 500 fps 450 fps 400 fps 350 fps 300 fps 250 fps 200 fps 150 fps 100 fps 50 fps fps HP LL-HP HQ LL-HQ Single pass 464 fps 342 fps 291 fps 293 fps Two pass 306 fps 246 fps 171 fps 181 fps 1080p, GM20x

H.264 HEVC H.264 HEVC H.264 HEVC H.264 HEVC Encode FPS H.264/HEVC PERF COMPARISON H.264/HEVC Performance: 2-pass 350 fps 300 fps 250 fps 200 fps 150 fps 100 fps 50 fps fps HP LL-HP HQ LL-HQ H.264 306 fps 246 fps 171 fps 181 fps H.265 220 fps 214 fps 102 fps 153 fps

PERFORMANCE - TREND 1800 fps Performance 1600 fps 1400 fps 1200 fps 1000 fps 800 fps 600 fps 400 fps 200 fps fps Kepler (2011) Maxwell Gen 1 (2013) Maxwell Gen 2 (2014) Future

ROADMAP

ROADMAP Core GPU chip IP Motion estimation only mode 2H2015 SAO, 10/12-bit, HEVC B-frames Lossless/4:4:4 Improved quality for screen content encoding ME performance and quality enhancements Today: 4K@60fps Next: 8K@??

THANK YOU APATAIT@NVIDIA.COM