The All Programmable SoC FPGA for Networking and Computing in Big Data Infrastructure

Similar documents
Extending the Power of FPGAs. Salil Raje, Xilinx

Data Center and Cloud Computing Market Landscape and Challenges

Xeon+FPGA Platform for the Data Center

Data and Control Plane Interconnect solutions for SDN & NFV Networks Raghu Kondapalli August 2014

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

Cloud Data Center Acceleration 2015

Definition of a White Box. Benefits of White Boxes

FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab

Seeking Opportunities for Hardware Acceleration in Big Data Analytics

All Programmable Logic. Hans-Joachim Gelke Institute of Embedded Systems. Zürcher Fachhochschule

7a. System-on-chip design and prototyping platforms

Foundation for High-Performance, Open and Flexible Software and Services in the Carrier Network. Sandeep Shah Director, Systems Architecture EZchip

Accelerating I/O- Intensive Applications in IT Infrastructure with Innodisk FlexiArray Flash Appliance. Alex Ho, Product Manager Innodisk Corporation

Codesign: The World Of Practice

Accelerating the Data Plane With the TILE-Mx Manycore Processor

Intel Network Builders Solution Brief. Intel and ASTRI* Help Mobile Network Operators Support Small Cell Networks

COMPUTING. Centellis Virtualization Platform An open hardware and software platform for implementing virtualized applications

Scaling Cloud-Native Virtualized Network Services with Flash Memory

ICRI-CI Retreat Architecture track

Embedded Systems: map to FPGA, GPU, CPU?

Architectures and Platforms

Eli Levi Eli Levi holds B.Sc.EE from the Technion.Working as field application engineer for Systematics, Specializing in HDL design with MATLAB and

Agenda. Michele Taliercio, Il circuito Integrato, Novembre 2001

Using Network Virtualization to Scale Data Centers

Model-based system-on-chip design on Altera and Xilinx platforms

Conference. Smart Future Networks THE NEXT EVOLUTION OF THE INTERNET FROM INTERNET OF THINGS TO INTERNET OF EVERYTHING

IoT: Smart Vision Leads The Way

Netvisor Software Defined Fabric Architecture

FPGA Acceleration using OpenCL & PCIe Accelerators MEW 25

OpenSoC Fabric: On-Chip Network Generator

FPGAs in Next Generation Wireless Networks

White Paper. Innovate Telecom Services with NFV and SDN

VIRTUALIZING THE EDGE

LSI SAS inside 60% of servers. 21 million LSI SAS & MegaRAID solutions shipped over last 3 years. 9 out of 10 top server vendors use MegaRAID

SOFTWARE DEFINED NETWORKING

What is a System on a Chip?

SOFTWARE-DEFINED: MAKING CLOUDS MORE EFFICIENT. Julian Chesterfield, Director of Emerging Technologies

新 一 代 軟 體 定 義 的 網 路 架 構 Software Defined Networking (SDN) and Network Function Virtualization (NFV)

DesignWare IP for IoT SoC Designs

Networking Virtualization Using FPGAs

Introduction to System-on-Chip

SDN and NFV in the WAN

PCI Express Impact on Storage Architectures and Future Data Centers. Ron Emerick, Oracle Corporation

100 Gigabit Ethernet is Here!

Intel Xeon +FPGA Platform for the Data Center

Different NFV/SDN Solutions for Telecoms and Enterprise Cloud

Data Centric Systems (DCS)

NFV Network and Compute Intensive H/W Acceleration (using SDN/PI forwarding)

RIDE THE SDN AND CLOUD WAVE WITH CONTRAIL

Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting

Router Architectures

Advanced Core Operating System (ACOS): Experience the Performance

Next Generation Operating Systems

A Scalable VISC Processor Platform for Modern Client and Cloud Workloads

Use Cases for the NPS the Revolutionary C-Programmable 7-Layer Network Processor. Sandeep Shah Director, Systems Architecture EZchip

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

Simplifying Big Data Deployments in Cloud Environments with Mellanox Interconnects and QualiSystems Orchestration Solutions

PCI Express Impact on Storage Architectures. Ron Emerick, Sun Microsystems

9/14/ :38

Networking Goes Open-Source. Michael Zimmerman VP Marketing, Tilera

The promise of SDN. EU Future Internet Assembly March 18, Yanick Pouffary Chief Technologist HP Network Services

High-Level Synthesis for FPGA Designs

Scaling Mobile Compute to the Data Center. John Goodacre

Embedded Development Tools

2015 Investor and Analyst Day

Blue Planet. Introduction. Blue Planet Components. Benefits

Unified Computing Systems

High-performance vswitch of the user, by the user, for the user

Optimizing Configuration and Application Mapping for MPSoC Architectures

WiSER: Dynamic Spectrum Access Platform and Infrastructure

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

Home and Digital. Jean-Marc Chery Chief Operating Officer

Architekturen und Einsatz von FPGAs mit integrierten Prozessor Kernen. Hans-Joachim Gelke Institute of Embedded Systems Professur für Mikroelektronik

VP/GM, Data Center Processing Group. Copyright 2014 Cavium Inc.

The 5G Infrastructure Public-Private Partnership

Applied Micro development platform. ZT Systems (ST based) HP Redstone platform. Mitac Dell Copper platform. ARM in Servers

David Lawler Vice President Server, Access & Virtualization Group

PCI Express IO Virtualization Overview

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

Extreme Networks: Building Cloud-Scale Networks Using Open Fabric Architectures A SOLUTION WHITE PAPER

Infrastructure Matters: POWER8 vs. Xeon x86

SDN. What's Software Defined Networking? Angelo Capossele

REAL-TIME STREAMING ANALYTICS DATA IN, ACTION OUT

The Evolution of the Central Office

CARRIER LANDSCAPE FOR SDN NEXT LEVEL OF TELCO INDUSTRILIZATION?

The Open Cloud Near-Term Infrastructure Trends in Cloud Computing

SDN/Virtualization and Cloud Computing

Introduction to Software Defined Networking (SDN) and how it will change the inside of your DataCentre

Microsoft Private Cloud Fast Track

Seven Challenges of Embedded Software Development

HOW SDN AND (NFV) WILL RADICALLY CHANGE DATA CENTRE ARCHITECTURES AND ENABLE NEXT GENERATION CLOUD SERVICES

ONOS [Open Source SDN Network Operating System for Service Provider networks]

Accelerate Cloud Computing with the Xilinx Zynq SoC

MCA Standards For Closely Distributed Multicore

The Evolving Data Center. Past, Present and Future Scott Manson CISCO SYSTEMS

The Role of Virtual Routers In Carrier Networks

Ensuring end-user quality in NFV-based infrastructures

Networking Services Trusted at every level and every phase

Efficient and multi-market embedded processing based on ARM : beyond architecture dilemma

Intel Data Direct I/O Technology (Intel DDIO): A Primer >

Transcription:

The All Programmable SoC FPGA for Networking and Computing in Big Data Infrastructure Ivo Bolsens, Senior Vice President & CTO Page 1

Moore s Law: The Technology Pipeline Page 2

Industry Debates on Transistor Cost Page 3

Design Cost Estimated Chip Design Cost, by Process Node, Worldwide, 2011 Design cost ($M) Mask cost ($M) 28/22-nm 32-nm Embedded software ($M) Yield ramp-up cost ($M) 45-nm 65-nm 90-nm 130-nm 28nm = 2x 45nm cost > $170 M 180-nm ($ Million) 0 45 90 135 180 Page 4

Growing Problems for ASIC & ASSP Offerings >50% of Top 16 ASSP Vendors Losing Money Growing ASSP Gaps Eroding customer confidence in vendors High cost burden from over design for diverse needs No ability to differentiate or customize Source Public reports, Xilinx estimates Page 5

Trend Mobile Infrastructure: Scalable Platforms Capacity Indoor Outdoor Macrocell Macrocell + Active Antennas Microcell Picocell Residential Femto 4-16 Users <100mW Home Enterprise Femto 30-60 Users <250mW Office 30-200 Users <1W Dense Indoor (Malls, Transport Hubs) Urban Infill ~200 Users <1-10W Single Sector Wide Area ~200 Users/ sector 20-100W Multi-Sector Wide Area 2-3x Data Capacity of RRU Coverage Page 6

Trend Services : Different Figures of Merit Latency (ms) Computing GB, Latency (s) Storing GB, BW (mean, var) Streaming GB, BW (mean, var) latency (ms) Gaming Communicating Latency (ms), BW (mean)

Trend Wired Infrastructure: Software Defined Networks Slide credit: From Virtualizing the Net by Jon Turner (2004) Page 8

Software Defined Networking gains industry mindshare The best thing about OpenFlow or SDN, is that it s brought back a new hope to networking Networking is cool again- Jayshree, CEO - Arista Networks

Trend Data Center Infrastructure: Cloud Computing Big Data Increasing Volume, Velocity, and Variety Low power Reduce operation and cooling costs Security Both outside and inside Page 10

Impact of trends (1) Networking New network fabrics Faster, Fatter, and Flatter Software defined networking Software control plane Hardware data plane Content-aware networking Deep packet inspection Enhanced security Page 11

Impact of trends (2) Compute ARM-based microservers Improved performance per watt Hybrid SoC CPU+accelerators+fabric Cost and power reduction Larger memory Hybrid NVRAM and DRAM Latency reduction Page 12

Impact of trends (3) Storage Specialized functions Compression, encryption, memcached Custom SSD controllers Higher performance Reduced latency Data-aware storage Integrated database support Offload from processor Page 13

Future Data Centre Architecture Internet Intelligent Appliances (router, firewall) Intelligence in the network Further convergence Fewer tiers FibreChannel new topologies (torus NIC) Rack TOR TOR Convergence SAN Server Server Server Server Low latency NICs Network attached storage Server Server Direct-attached Direct-attached storage storage Page 14

Core Network Coherent Memory Bus I/O Bus and Fabrics Core Network Generic Data Center QPI AXI ACE PCIe Fabrics DRAM DRAM x86 40/80G NIC SOC Ethernet Switch 100/400G I/O Attach Accel Direct FPGA Attach Storage Storage Ctrl NVM NVM H D H D Network FPGA Attach Storage Storage Ctrl NVM NVM H D H D Memory Bus Coherency Lowest Latency I/O Bus PCIe Core Network Public and Private Ethernet Page 15

Core Network Coherent Memory Bus I/O Bus and Fabrics Core Network The New Data Center DRAM DRAM x86 QPI AXI ACE PCIe Fabrics FPGA NIC 40/80G Networking Security, DPI Low latency switch/bridge Custom fabrics SDN control plane DRAM Zynq SOC ARM Fabric Bridge Node Ctrl DRAM DRAM NVM FPGA FPGA Acc FPGA Acc Ethernet FPGA Switch 100/400G NVM Compute Transcode Search / Database DSP Filters Large Memory Hybrid NVM/DRAM ARM FPGA Storage FPGA Ctrl NVM NVM NVM NVM ARM FPGA NVM NVM H D H D Storage Custom SSD Low latency Memcache appliance Data aware storage Encrypt, Compress Page 16

Trend : More Intelligence in Embedded Systems SMART Data Center Revolution New Opportunities to Control Costs and Increase Strategic Advantage MACHINES THAT UNDERSTAND Smart wireless networks to the rescue Carriers are turning toward more intelligent network management Smart Factories For factory management in the future, it will become essential to strive to implement smart capabilities The Next Big, Digital Economy; Smart Energy The energy market is undergoing a major transformation Page 17 Page 17

Programmable & Smart Across All Markets Wireless Comms Wired Comms Data Center Embedded All Programmable Multiple Spectrums Multiple Standards (LTE, 3G) Multiple Levels of QoS Network Function Virtualization (NFV) Multiple Stds (400Gb etc) Dynamic QoS Provisioning Software Defined Networks (SDN) Multiple Stds (FCoE, iscsi ) Config Storage (SAN, NAS, SSD ) Changing Resolutions (MPixel, Fps) Emerging Video Stds (UHD, 8K/4K) Evolving Video Processing Algorithms Smarter Self Organizing Networks (SON) Cognitive Radio Smart Antenna Context Aware Network Services Self-Healing Networks Video Caching at the Edge Data Pre-Processing & Analytics Virtualized Resource Optimization Intelligent Appliances Object Detection & Analytics Automotive Collision Avoidance Industrial Machine Vision Page 18

Industry Mandates Programmable Imperative Programmable Systems Integration Insatiable Intelligent Bandwidth Page 19

The All Programmable Platform Security : Bit level operations Packet Processing : Wide Datapaths DSP Processing : Pipelined Datapaths Graphics Processing : Parallel Micro-Engines System Management : Finite State machines

The Heterogeneous MPSoC Heterogeneous Connected Scalable Parallel Configurable uengine uengine uengine ARM multicore Accelerator Accelerator Accelerator MicroBlaze micro engines X86 or ARM Platform Host Accelerator Accelerator Accelerator DSP engines Dedicated Accelerators

The Era of Heterogeneous Processing Unit Page 22

The UltraSCALE FPGA SoC Page 23

Programming the Heterogeous SoC Page 24

Virtual Cache Coherency Virtual CPU + FPGA Use Models FPGA 0 Pipelined datapath HDL programmed Control Processor FPGA 1 Pipelined datapath with SW control CPU sets register values CPU Memory 2 CPU + FPGA co-processing FPGA part of explicit address space FPGA CPU $ Memory 3 CPU + FPGA peer processing Cache Coherency FPGA $ Page 25

CPU + FPGA Evolution IO-Connected Coherent Integrated QPI ARM AXI PCIe Page 26

Complexity of building MPSoC systems High-Speed Analog Design High-Speed Interface Design SOC System Assembly DSP Hardware Design Software Development Requires distinct design skills!

Platform IP Integrator Radio pipeline Radio pipeline CPU Build/ Re-Use IP Subsystems Link/Assemble Subsystems Generate SW Drivers

Data Movement Interconnect HW/SW Design Flow CPU FPGA C-compiler Concurrent SW C-synthesis Application SW-drivers Middleware Libraries Platform AXI Hardware Wires CPU Video Codec Encryption Memory LTE Modem Page 29

High Level Synthesis (HLS) Create IP from C/C++/System C algorithm specification Abstract algorithm verification to the specification level Traditional FPGA design experience not required Page 30

Quality From of Results C Algorithm to FPGA Implementation Video frames/second 200 196 150 FPGA resources 100 6 4 50 0 51 DSP C2FPGA 2 0 RTL C2FPGA FPGA: >38 times better performance than DSP video processor QOR: C2FPGA equal to or better than RTL synthesis Ease-of-use: C2FPGA 2x fewer lines of C code than DSP processor Page 31

Programming Heterogeneous Multi-core OpenCL Hardware / Software partitioning & interfacing C Compile / Debug ARM Processor HS-SW Interfacing Domain Specific API C-HLS Accelerator synth FPGA A9 A9 Commercial Software Ecosystem Video codec Encryption Packet Processing FFT Application-Specific Search Page 32

Data Movement Interconnect HW/SW Design Flow: SW Programmer View CPU FPGA C-compiler Concurrent SW C-synthesis Application SW-drivers Middleware Libraries AXI Hardware Wires CPU Video Codec Encryption Memory LTE Modem Page 33 Application Programming

Programmable Platform: CPU + FPGA Peer Processing Core Core Accel Accel Logic L1 Cache L1 Cache L1 Cache L1 Cache Shared L2 Cache Coherency Engine Capabilities Coherent Caches for HW Coherency Engine Over NOC Interconnect Coherent Caches for SW Coherency Management DDR MemCon SRAM I/O Device DMA Coherency Benefits: Peer Processing: Direct Cache-2-Cache data movement Latency: Very low latency access to CPU (FPGA) data Usability: No SW cache flush needed Page 34

Domain Specific Abstractions IP Abstraction IPI Automation

ZED Board ZED Board Zynq Evaluation and Development Kit Low cost Zynq based community board (XC7Z020) Partnership between Avnet, Digilent, Xilinx Digilent will fulfill academic market for Xilinx University Program wwwzedboardorg Open source SW and IP Linux Eclipse based IDE Vivado HLS: C to FPGA Reference designs Page 36

Conclusions New Markets Require Heterogeneous Multi-Core SoC Modern FPGA are All Programmable SoC Software Centric Design Flow Becoming Possible Democratizing SoC Design : Targeted Teaching Platform Page 37