Advances of Media Technology in Modern Computing

Similar documents
Power Benefits Using Intel Quick Sync Video H.264 Codec With Sorenson Squeeze

Intel Media Server Studio - Metrics Monitor (v1.1.0) Reference Manual

Intel Media SDK Library Distribution and Dispatching Process

Specification Update. January 2014

Enabling Innovation in Mobile User Experience. Bruce Fleming Sr. Principal Engineer Mobile and Communications Group

Douglas Fisher Vice President General Manager, Software and Services Group Intel Corporation

Desktop Reinvented. Lisa Graff Vice President and General Manager Desktop Client Platforms Group

* * * Intel RealSense SDK Architecture

Riding silicon trends into our future

Version Rev. 1.0

NVIDIA GeForce GTX 580 GPU Datasheet

The Transition to PCI Express* for Client SSDs

Intel Data Direct I/O Technology (Intel DDIO): A Primer >

Head-Coupled Perspective

Video Encoding on Intel Atom Processor E38XX Series using Intel EMGD and GStreamer

Application. EDIUS and Intel s Sandy Bridge Technology

Intel Built-in Visuals

Intel Core TM i3 Processor Series Embedded Application Power Guideline Addendum

2013 Intel Corporation

Haswell Cryptographic Performance

Intel Service Assurance Administrator. Product Overview

VNF & Performance: A practical approach

Intel HTML5 Development Environment. Tutorial Test & Submit a Microsoft Windows Phone 8* App (BETA)

Intel Media SDK Features in Microsoft Windows 7* Multi- Monitor Configurations on 2 nd Generation Intel Core Processor-Based Platforms

iscsi Quick-Connect Guide for Red Hat Linux

Intel Identity Protection Technology (IPT)

Vendor Update Intel 49 th IDC HPC User Forum. Mike Lafferty HPC Marketing Intel Americas Corp.

Intel Technical Advisory

Intel Retail Client Manager

Upsurge in Encrypted Traffic Drives Demand for Cost-Efficient SSL Application Delivery

Intel Identity Protection Technology Enabling improved user-friendly strong authentication in VASCO's latest generation solutions

Software Solutions for Multi-Display Setups

Intel Media Server Studio Professional Edition for Windows* Server

PC Solutions That Mean Business

The Case for Rack Scale Architecture

Benefits of Intel Matrix Storage Technology

Intel Platform and Big Data: Making big data work for you.

COLO: COarse-grain LOck-stepping Virtual Machine for Non-stop Service

Creating Overlay Networks Using Intel Ethernet Converged Network Adapters

Intel vpro Technology Module for Microsoft* Windows PowerShell*

Intel HTML5 Development Environment. Tutorial Building an Apple ios* Application Binary

NVIDIA VIDEO ENCODER 5.0

Built for Business. Ready for the Future.

i.mx Applications Processors with Hantro's Multimedia Framework

Whitepaper. NVIDIA Miracast Wireless Display Architecture

Intel Data Migration Software

How To Scale At 14 Nanomnemester

How To Use An Amd Ramfire R7 With A 4Gb Memory Card With A 2Gb Memory Chip With A 3D Graphics Card With An 8Gb Card With 2Gb Graphics Card (With 2D) And A 2D Video Card With

Benchmarking Cloud Storage through a Standard Approach Wang, Yaguang Intel Corporation

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms

Intel Ethernet and Configuring Single Root I/O Virtualization (SR-IOV) on Microsoft* Windows* Server 2012 Hyper-V. Technical Brief v1.

GPU Compute accelerated HEVC decoder on ARM Mali TM -T600 GPUs

Chrome OS*-Based Devices in the Enterprise

Intel HTML5 Development Environment Article Using the App Dev Center

Intel Cyber Security Briefing: Trends, Solutions, and Opportunities. Matthew Rosenquist, Cyber Security Strategist, Intel Corp

Accelerating Business Intelligence with Large-Scale System Memory

COMPUTING. SharpStreamer Platform. 1U Video Transcode Acceleration Appliance

New Dimensions in Configurable Computing at runtime simultaneously allows Big Data and fine Grain HPC

Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study

Introducing the First Datacenter Atom SOC

High Performance Computing and Big Data: The coming wave.

Intel Identity Protection Technology with PKI (Intel IPT with PKI)

COLO: COarse-grain LOck-stepping Virtual Machine for Non-stop Service. Eddie Dong, Tao Hong, Xiaowei Yang

Revision History. Revision Revision History Date

Intel: a Thought Leader Helping IoT Scale Out

COSBench: A benchmark Tool for Cloud Object Storage Services. Jiangang.Duan@intel.com

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Using GStreamer for hardware accelerated video decoding on Intel Atom Processor E6xx series

Dynamic Digital Depth (DDD) and Real-time 2D to 3D conversion on the ARM processor

Intel Retail Client Manager Audience Analytics

Intel 810 and 815 Chipset Family Dynamic Video Memory Technology

Creating Full Screen Applications Across Multiple Displays in Extended Mode

Intel Compute Stick STCK1A32WFC User Guide. Intel Compute Stick STCK1A32WFC

Intel Retail Client Manager

Intel vpro Technology. How To Purchase and Install Symantec* Certificates for Intel AMT Remote Setup and Configuration

Displaying Stereoscopic 3D (S3D) with Intel HD Graphics Processors for Software Developers August 2011

ivms-4200 Client Software Technical Specification v1.02

新 一 代 軟 體 定 義 的 網 路 架 構 Software Defined Networking (SDN) and Network Function Virtualization (NFV)

Dynamic Resolution Rendering

Intel s Next Generation Integrated Graphics Architecture Intel Graphics Media Accelerator X3000 and 3000

Accelerating Business Intelligence with Large-Scale System Memory

Intel Solid-State Drive Pro 2500 Series Opal* Compatibility Guide

Intel HTML5 Development Environment. Article - Native Application Facebook* Integration

HPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK

Accomplish Optimal I/O Performance on SAS 9.3 with

Software Evaluation Guide for Autodesk 3ds Max 2009* and Enemy Territory: Quake Wars* Render a 3D character while playing a game

New System-on-a-Chip (SoC) Intel Celeron and Pentium Processors Balance Cost, Performance, and Power for Everyday Computing

Intel Unite. User Guide

SAPPHIRE TOXIC R9 270X 2GB GDDR5 WITH BOOST

Intel vpro Technology. How To Purchase and Install Go Daddy* Certificates for Intel AMT Remote Setup and Configuration

Intel Small Business Advantage (Intel SBA) Release Notes for OEMs

Media Cloud Based on Intel Graphics Virtualization Technology (Intel GVT-g) and OpenStack *

Intel Perceptual Computing SDK My First C++ Application

Panasonic Video Surveillance Solutions One Partner. One Solution.

How to Configure Intel Ethernet Converged Network Adapter-Enabled Virtual Functions on VMware* ESXi* 5.1

Transcription:

Advances of Media Technology in Modern Computing Dr. Hong Jiang, Intel Fellow, Intel Corporation ICME 2013 Keynote San Jose, California, USA, July 16, 2013 Be Bold. Define the Future of Software.

Legal INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. A "Mission Critical Application" is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS. Intel may make changes to specifications and product descriptions at any time, without notice. All products, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice. Intel processors, chipsets, and desktop boards may contain design defects or errors known as errata, which may cause the product to deviate from published specifications. Current characterized errata are available on request. Any code names featured are used internally within Intel to identify products that are in development and not yet publicly announced for release. Customers, licensees and other third parties are not authorized by Intel to use code names in advertising, promotion or marketing of any product or services and any such use of Intel's internal code names is at the sole risk of the user. Intel product plans in this presentation do not constitute Intel plan of record product roadmaps. Please contact your Intel representative to obtain Intel s current plan of record product roadmaps. Performance claims: Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance Intel, Intel Inside, the Intel logo, Centrino, Intel Core, Intel Atom, Pentium, and Ultrabook are trademarks of Intel Corporation in the United States and other countries 2

Contents History Heterogeneous Computing Architecture Video Codec Video Processing Perceptual Computing Initiatives Summary 3

Contents History Heterogeneous Computing Architecture Video Codec Video Processing Perceptual Computing Initiatives Summary 4

Two decades??? 1993 2003 2013 5

Two decades Video Coding MPEG2: Block motion comp Block DCT AVC/H.264: More complex block coding Loop filter & CABAC HEVC/H.265: More complex block coding More complex loop filter 1993 2003 2013 2X coding efficiency every 10 years 6

Two decades Video Coding MPEG2: Standard Definition (SD) DVD, Broadcast AVC/H.264: High Definition (HD=6xSD) Blu-ray, Internet Streaming HEVC/H.265 (promises): Ultra HD (4K = 24xSD) Cellular Wireless Streaming 1993 2003 2013 Complexity compounded by Resolution & Power 7

From Big Screen to Mobile HD media becomes ubiquitous Optical media led the digital conversion (2006: Blu-ray) Internet streaming is catching up (2009: 720p, 2012: 1080p) All devices are HD capable (2008: DTV/PC, 2010: Smart Phone) Beyond HD 4K Video is coming (2-3 years to reach consumers) 1.1B users 100h/min uploads Today, video file >50% internet traffic In 2016, sum of all video >86% traffic HD is ubiquitous & We Are Not Done Yet 8

Two decades Personal Computing Intel Pentium : Performance spiral MMX/SSE extensions (Desktop PC Era) Intel Centrino : Low power Wi-Fi (Mobile Computing Era) Haswell (4 th Gen Intel Core ): Breakthrough battery life Leading Graphics/Media (Ultra mobility) 1993 2003 2013 Major transformation each decade 9

Two decades Personal Computing Intel Pentium : 0.8 um process 3.1 millions transistors (Desktop PC Era) Intel Centrino : 0.13 um process 77 millions transistors (Mobile Computing Era) Haswell (4 th Gen Intel Core ): 22 nm process ~1.4 billions transistors (Ultra Mobility) 1993 2003 2013 450x more transistors over 20 years! 10

Two decades Personal Computing 1993: Barely getting thumbnail video on PC 2003: Can play DVD; Limited quality. 2013: Multiple HD, 4K playback and encode; High quality HD video processing 1993 2003 2013 Media Technology revolutionized 11

Contents History Heterogeneous Computing Architecture Video Processing Video Codec Perceptual Computing Initiatives Summary 12

Demands and Challenges 13

Two-Prone Solutions Process Technology Moore s Law Architecture Innovation 14

Intel Tick/Tock Development Model 45nm Process Technology 32nm Process Technology 22nm Process Technology Nehalem Westmere Sandy Bridge Ivy Bridge Haswell NEW Intel Microarchitecture (Nehalem) (Nehalem) NEW Intel Microarchitecture (Sandy Bridge) (Sandy Bridge) NEW Intel Microarchitecture (Haswell) TOCK TICK TOCK TICK TOCK Haswell CPU Family 22nm Process Technology Haswell, 4 th Generation Intel Core Processors, builds upon innovations in the previous Core generations Intel Microarchitecture code name Haswell, Ivy Bridge, Sandy Bridge 15

Architecture Concept Trends: Higher capability in a lower power budget Power considerations drive a Fixed Function solution Flexibility considerations requires a Programmable solution EPI 10 nj 1 nj 0.1 nj 0.01 nj Power Efficiency General purpose microprocessors (x86, ARM cores ) Domain specific processors (GPU, DSP ) ASIC, fixed function Blocks EPI: Energy spent per Instruction in nj Max Flexibility Max Power Efficiency 16

17

80x 70x 60x 50x 40x 30x 20x 10x Baseline Increasing Graphics Performance 2 nd Gen Intel Core 2006 2007 2008 2009 2010 2011 2012 2013 Source: Intel. 3DMark06 3rd Gen Intel Core 4 th Gen Intel Core 75X Increase Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance 18

Haswell: Processor Graphics Architecture Video Quality Engine Video Processing Color Processing Multi-Format Codec: Parallel engine High performance Video Decode and Encode Media Optimized Execution Units: Zero overhead thread switching Native media ISA Vector/Matrix oriented operations Next Generation Intel Microarchitecture Code Name Haswell Media Accelerators Higher throughput & low power Retaining flexibility 19

A Scalable Graphics Architecture GT2/GT1 GT3 GT1 Some GT3 sku s come with an 128MB edram, as cache shared with CPU Significant generational EU count growth (Top line from 8, 16 to 40 EU s)

Contents History Heterogeneous Computing Architecture Video Codec Video Processing Perceptual Computing Initiatives Summary 21

Intel Quick Sync Video Video Conversion Record and Upload Stereo 3D Creation HD Video Chat Intel Quick Sync Video is HW-based video codec capability: Break-through performance & quality new User Experiences Many more applications: Wireless Display, Game recording Other names and brands may be claimed as the property of others

Haswell: Quick Sync Video Performance and Power 4x-12x real-time transcode at various quality modes 10-hour video playback time on latest Apple MacBook Air HD Mosaic Intel HD Graphics 4600 AVC Transcode Performance and Power 4K Mosaic Multi-stream 4K decode Intel QSV Disable 48.4 Power (W) 24.74 Intel QSV Enable 4.6x faster speed at 2x Power saving 0.5x power 4.6x Faster over speed SW encoder 60 Speed (FPS) > real-time 4K Encode 280 0 50 100 150 200 250 300 *Measurements based on Intel Demo Clip in Cyberlink Media Espresso Fast Conversion Mode 23

Deployment as Media Servers Example: QuickFire Network 1U userver contains 11 3 rd Gen Intel Core mobile processors Transcoding over 88 1080p30 HD streams per blade! 24

Video Encode: A Balanced Approach Encode Solutions Performance Power Flexibility GPGPU Low High High Traditional Fix Function HW High Low Low Flexible Intel Quick Sync Video High Low Balanced Intel implements a flexible hardware design approach for encoding Hybrid of fix function HW and programmable EU array. Provides balance between performance, power and flexibility Subject to actual implementation Hybrid 2-Stage Video Encoder: ENC Rate Control Motion Estimation Intra Prediction Accelerated by HW VME Mode Decision PAK Full HW fix function pipeline Motion Comp. Intra Prediction Forward Quant Pixel Reconstruction Entropy Coding 25

Encoder Parallelization and Scalability Decoupled Encoding Operations ENC: Multi-threaded in wavefront order macroblocks PAK: Pipelined in raster order macroblocks Multiple level of parallelization Decoding vs. Encoding ENC and PAK ENC: Multiple Macroblocks in Wavefronts ENC/VME HW: Integer vs. Fractional Search

Intel Quick Sync Video Multi-year Improvements Performance: Over 50% hardware CAGR for three generations Quality: Hardware features and algorithm improvements Usability: Fine grained quality vs. performance tradeoff control db 1 0.9 0.8 1.2 db 1 0.7 0.6 0.8 0.5 0.6 0.4 0.3 0.4 0.2 0.1 0 3rd Gen 4th Gen TU1 TU2 TU3 TU4 TU5 TU6 TU7 CQP Mode BD-PSNR 0.2 0 4th Gen 3rd Gen 4th Gen+Lookahead TU1 TU2 TU3 TU4 TU5 TU6 TU7 VBR Mode BD-PSNR Data is captured with internal test app over a big set of test clips. 27

Intel HD Graphics 4000 vs. Intel Iris Pro Graphics 5200 HD Graphics 4000 (3 rd Gen) Handbrake Enabled Iris Pro Graphics 5200 (4 th Gen) Frame 670 of Star Trek Into Darkness movie trailer transcoded in HandBrake QSV beta. Both set to VBR 2Mbps Best Quality setting under HandBrake High Profile preset Same encode quality on Intel 4 th Gen Core Processor with HD 4200/4400/4600, Iris, and Iris Pro. Performance varies on different SKUs Generational quality improvements 28

Intel HD Graphics 4000 vs. Intel Iris Pro Graphics 5200 Handbrake HD Graphics 4000 Enabled Iris Pro Graphics 5200 Frame 670 of Star Trek Into Darkness movie trailer transcoded in HandBrake QSV beta. Both set to VBR 2Mbps Best Quality setting under HandBrake High Profile preset Same encode quality on Intel 4 th Gen Core Processor with HD 4200/4400/4600, Iris, and Iris Pro. Performance varies on different SKUs Generational quality improvements 29

Contents History Heterogeneous Computing Architecture Video Codec Video Processing Perceptual Computing Initiatives Summary 30

Video Processing Pipe Migrated to a dedicated VP pipe Video Quality Engine (VQE) Extensive suite of functions for higher quality video at lower power Denoise Deinterlace Skin-tone Gamut Compression Contrast Saturation ProcAmp Sharpness* Scaler* Gamut Expansion CSC * In Media Sampler 31

De-noise Spatial and Temporal De-noise Filter Global noise level measurement Content-adaptive spatiotemporal filtering of noise Motion history-based blending of spatial and temporal filter results Block Interface Input: YCbCr 420/422 Output: YCbCr 420/422 32

Skin-tone Processing Per-pixel Enhancement of Skin-tone Pixels Reproduce the natural skin colors on the display screen Skin Tone Detection identifies pixels with skin-like colors with per-pixel indicator Skin Tone Enhancement modifies the Saturation and Hue of the skin-tone pixels Skin-tone Definition Skin-tone Detection Skin-tone Indicator Skin-tone Enhancement Block Interface Input: YCbCr444 Output: YCbCr444 with modified CbCr components; Per-pixel skin tone indicator 33

Contrast Enhancement Automatic Contrast Enhancement: Per-pixel mapping of luma to enhance contrast 1. Histogram of luma Y pixel values is generated for the input video frame 2. Piece-Wise Linear Function (PWLF) is generated from luma histogram 3. Pixel values are modified according to the PWLF Histogram Computation PWLF Mapping Block Interface Input: YCbCr 444 Output: YCbCr 444 with modified Y 34

Saturation Enhancement Per-Pixel Saturation Enhancement Utilize 6 basic colors as primaries/anchors (Red, Green, Blue, Magenta, Yellow, Cyan) Adjust colorfulness (saturation) of pixels while maintaining their color (hue) Block Interface Input: YCbCr 444 Output: YCbCr 444 with modified CbCr components 35

Color Correction Display proper colors on display screen 1. Inverse gamma correction via PWLF 2. 3x3 matrix multiplication with input/output offset 3. Forward gamma correction via PWLF Block Interface Input: RGB Output: RGB 36

37

Contents History Heterogeneous Computing Architecture Video Codec Video Processing Perceptual Computing Initiatives Summary 38

Towards Natural, Intuitive and Immersive Human-Computer Interactions Now Near Future The Vision 39

Perceptual Computing Adding Human-like Senses to the Computing Devices 40

Intel Perceptual Computing SDK Download the SDK and order the 3D Camera at intel.com/software/perceptual 41

Intel Perceptual Computing SDK Speech Recognition Facial Analysis Hand Tracking 3D Gestures Object Tracking Easily Implemented by Application Developers for: Games Entertainment Productivity Accessibility Immersive Teleconferencing Education Health Enterprises Retail Industrial Download the SDK and order the 3D Camera at intel.com/software/perceptual 42

Contents History Heterogeneous Computing Architecture Video Processing Video Codec Perceptual Computing Initiatives Summary 43

Consumption Media Usage Outlook Internet streaming video DVD/Blu-ray disc Stereoscopic 3D video Interaction Video and sensory inputs Gesture recognition Augmented reality Creation Photo, video, audio encoding Video encode and transcode User generated contents Perception Face & object detection Scene analysis Perceptual computation 44

Concluding Remarks We are in the era of HD digital media Moore s Law inspires Innovations Heterogeneous computing addresses the HD media demands Media continues to be an exciting fields for years to come 45

46