Extending the Power of FPGAs. Salil Raje, Xilinx



Similar documents
Seeking Opportunities for Hardware Acceleration in Big Data Analytics

High-Level Synthesis for FPGA Designs

Eli Levi Eli Levi holds B.Sc.EE from the Technion.Working as field application engineer for Systematics, Specializing in HDL design with MATLAB and

7a. System-on-chip design and prototyping platforms

LogiCORE IP AXI Performance Monitor v2.00.a

Agenda. Michele Taliercio, Il circuito Integrato, Novembre 2001

FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab

Product Development Flow Including Model- Based Design and System-Level Functional Verification

9 REASONS WHY THE VIVADO DESIGN SUITE ACCELERATES DESIGN PRODUCTIVITY

Data Center and Cloud Computing Market Landscape and Challenges

Model-based system-on-chip design on Altera and Xilinx platforms

Rapid System Prototyping with FPGAs

Digitale Signalverarbeitung mit FPGA (DSF) Soft Core Prozessor NIOS II Stand Mai Jens Onno Krah

Zynq-7000 Platform Software Development Using the ARM DS-5 Toolchain Authors: Simon George and Prushothaman Palanichamy

Codesign: The World Of Practice

2015 Investor and Analyst Day

FPGAs in Next Generation Wireless Networks

Jonathan C. Sevy. Software and Systems Engineering Experience

Echtzeittesten mit MathWorks leicht gemacht Simulink Real-Time Tobias Kuschmider Applikationsingenieur

Xeon+FPGA Platform for the Data Center

Architectures and Platforms

ReCoSoC'11 Montpellier, France. Implementation Scenario for Teaching Partial Reconfiguration of FPGA

Networking Virtualization Using FPGAs

System Performance Analysis of an All Programmable SoC

Optimizing Configuration and Application Mapping for MPSoC Architectures

Digital Systems Design! Lecture 1 - Introduction!!

Developing reliable Multi-Core Embedded-Systems with NI Linux Real-Time

Network connectivity controllers

High Performance or Cycle Accuracy?

CFD Implementation with In-Socket FPGA Accelerators

Von der Hardware zur Software in FPGAs mit Embedded Prozessoren. Alexander Hahn Senior Field Application Engineer Lattice Semiconductor

Best Practises for LabVIEW FPGA Design Flow. uk.ni.com ireland.ni.com

AXI Performance Monitor v5.0

Why 25GE is the Best Choice for Data Centers

MAJORS: Computer Engineering, Computer Science, Electrical Engineering

Xilinx Training Course Listing

White Paper 40-nm FPGAs and the Defense Electronic Design Organization

Defining Platform-Based Design. System Definition. Platform Based Design What is it? Platform-Based Design Definitions: Three Perspectives

Quartus II Software Design Series : Foundation. Digitale Signalverarbeitung mit FPGA. Digitale Signalverarbeitung mit FPGA (DSF) Quartus II 1

Reconfigurable Architecture Requirements for Co-Designed Virtual Machines

Building an Embedded Processor System on a Xilinx Zync FPGA (Profiling): A Tutorial

White Paper. S2C Inc Technology Drive, Suite 620 San Jose, CA 95110, USA Tel: Fax:

Ping Pong Game with Touch-screen. March 2012

Any Media Over Any Network. July 14 th, 2015

MPSoC Designs: Driving Memory and Storage Management IP to Critical Importance

Float to Fix conversion

Open Flow Controller and Switch Datasheet

Software-Programmable FPGA IoT Platform. Kam Chuen Mak (Lattice Semiconductor) Andrew Canis (LegUp Computing) July 13, 2016

Hybrid Platform Application in Software Debug

OpenSoC Fabric: On-Chip Network Generator

All Programmable Logic. Hans-Joachim Gelke Institute of Embedded Systems. Zürcher Fachhochschule

Beschleunigen von Algorithmen mit High-Level Synthese auf Xilinx Zynq

FPGA Acceleration using OpenCL & PCIe Accelerators MEW 25

Virtual Platforms Addressing challenges in telecom product development

9/14/ :38

Embedded Development Tools

Five Ways to Build Flexibility into Industrial Applications with FPGAs

RAPID PROTOTYPING OF DIGITAL SYSTEMS Second Edition

Intel Xeon +FPGA Platform for the Data Center

Compiling PCRE to FPGA for Accelerating SNORT IDS

Using Vivado Design Suite with Version Control Systems Author: Jim Wu

Introduction to Digital System Design

BY STEVE BROWN, CADENCE DESIGN SYSTEMS AND MICHEL GENARD, VIRTUTECH

Introduction to Functional Verification. Niels Burkhardt

Custom design services

HARNESS project: Managing Heterogeneous Compute Resources for a Cloud Platform

H MICRO CASE STUDY. Device API + IPC mechanism. Electrical and Functional characterization of HMicro s ECG patch

What is a System on a Chip?

Acceleration for Personalized Medicine Big Data Applications

Going to the wire: The next generation financial risk management platform

Systolic Computing. Fundamentals

FPGA Prototyping Primer

Reconfig'09 Cancun, Mexico

Accelerate Cloud Computing with the Xilinx Zynq SoC

10/100/1000Mbps Ethernet MAC with Protocol Acceleration MAC-NET Core with Avalon Interface

Networking Services Trusted at every level and every phase

Getting Started with Embedded System Development using MicroBlaze processor & Spartan-3A FPGAs. MicroBlaze

WiSER: Dynamic Spectrum Access Platform and Infrastructure

Simplifying Embedded Hardware and Software Development with Targeted Reference Designs

The Internet of Things: Opportunities & Challenges

How To Design An Image Processing System On A Chip

How Router Technology Shapes Inter-Cloud Computing Service Architecture for The Future Internet

Concurrent Hardware/Software Development Platforms Speed System Integration and Bring-Up

Linux. Reverse Debugging. Target Communication Framework. Nexus. Intel Trace Hub GDB. PIL Simulation CONTENTS

Seven Challenges of Embedded Software Development

Electronic system-level development: Finding the right mix of solutions for the right mix of engineers.

Xilinx SDAccel. A Unified Development Environment for Tomorrow s Data Center. By Loring Wirbel Senior Analyst. November

Lesson 7: SYSTEM-ON. SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY. Chapter-1L07: "Embedded Systems - ", Raj Kamal, Publs.: McGraw-Hill Education

Systems on Chip Design

FSMD and Gezel. Jan Madsen

Architekturen und Einsatz von FPGAs mit integrierten Prozessor Kernen. Hans-Joachim Gelke Institute of Embedded Systems Professur für Mikroelektronik

1. PUBLISHABLE SUMMARY

Moving Beyond CPUs in the Cloud: Will FPGAs Sink or Swim?

Microwatt to Megawatt - Transforming Edge to Data Centre Insights

Attention. restricted to Avnet s X-Fest program and Avnet employees. Any use

Embedded Systems Engineering Certificate Program

Ericsson Introduces a Hyperscale Cloud Solution

Cloud-Based Apps Drive the Need for Frequency-Flexible Clock Generators in Converged Data Center Networks

AN FPGA FRAMEWORK SUPPORTING SOFTWARE PROGRAMMABLE RECONFIGURATION AND RAPID DEVELOPMENT OF SDR APPLICATIONS

EVALUATION OF SCHEDULING AND ALLOCATION ALGORITHMS WHILE MAPPING ASSEMBLY CODE ONTO FPGAS

Transcription:

Extending the Power of FPGAs Salil Raje, Xilinx

Extending the Power of FPGAs The Journey has Begun Salil Raje Xilinx Corporate Vice President Software and IP Products Development

Agenda The Evolution of FPGAs and FPGA Programming IP-Centric Design with High Level Languages Software Defined Systems

The Evolution of FPGAs and FPGA Programming

The Evolution of Programmable Devices Logic Cells 1M 3D ICs 10K Programmable SoCs FPGAs 100 PLDs 1985 1995 2005 2015 2025

The Progression of FPGA Design Methodology Logic Software Defined Cells 1M IP-Centric with High-level Languages 10K Schematics RTL Programmable SoCs 3D ICs FPGAs 100 PLDs 1985 1995 2005 2015 2025

The Shift in Developer Personas Application Developer Logic Software Defined Cells 1M Hardware Designer Algorithm Developer Embedded SW Dev IP-Centric with High-level Languages 10K Hardware Designer Schematics RTL Programmable SoCs 3D ICs FPGAs 100 PLDs 1985 1995 2005 2015 2025

IP-Centric Design with High Level Languages IP-Centric Design with High Level Languages

Step 1: Leverage Hard and Soft IP + Embedded Processors Example of Hard IP: Zynq MPSOC Examples of Complex Soft IP AXI-MM AXI-Lite AXI-MM interconnect AXI-Lite interconnect AXI-MM AXI-MM AXI-Lite VDMA Deinterlacer V Scaler H Scaler CSC 422-444 420-422 420-422 Letterboxing AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S router 10x10 AXI4-S OTN Subsystem Video Subsystem HMC Controller Digital Pre-Distortion SmartConnect

Step 2: Develop New IP blocks in C/C++ Algorithmic Specification Micro-architecture Exploration RTL Implementation FPGA Integration Create IP from C/C++/System C algorithm specification Abstract algorithm verification 10,000x faster than RTL sim Traditional FPGA design experience not required

Step 3: Use Automated IP Assembly = IP Assembly Example: Zynq Processor Subsystem + Video Subsystem + 6 IP Blocks 4700 lines of VHDL (top-level connectivity only) Video Processing IP Subsystem

High Level Design Case Study: GainSpeed Venture-backed start-up Products for cable operators to: Meet skyrocketing capacity requirements of streaming video Cost-effectively migrate networks to a software-driven, all-ip architecture Need to be 10x better and 10x cheaper than much larger incumbents Have a much smaller team and need to work smarter 13

Previous Approach to Design 100K+ lines of RTL RTL RTL RTL (VHDL) Test Bench Test Test Bench Bench C) C) (System C) Testbench same as driver code Model Sim Minimal Test Cases Synthesis P&R System Debug (Chipscope) Exhaustive Corner Cases Used Virtex-6 240Ts, targeting 200+ MHz Ran P&R on 100 servers Spent 20% of time designing and 80% making it work Took a team of 10 engineers working for 2 years 14

Current Design Methodology : HLS + IPI Low 1000s lines of C code RTL IP RTL Blocks (VHDL) (C code)) Test Bench Test Test Bench Bench (System C) C) (C code) C Compiler Exhaustive Test Cases HLS IPI Synthesis Kintex 480T + off-the-shelf parts Used HLS to build 80% of the IP Blocks DSP functions, closed-loop timing recovery, DMA engines, etc Fast functional simulation in C P&R Much better coverage achieved earlier Team of 2 people working for 6 months System Debug (Chipscope) System-level Debug 15

Automated IP Assembly Eliminated grunt-work in wiring IP 16

Overall Project Results Elapsed time from project start to running system in lab: 6 months Total number of IP blocks integrated: 30+ Leveraged key IP cores: SRIO, 10G Ethernet MAC, MIG controller, FIR Compiler, Reed-Solomon Design running at 368 MHz in Kintex-7 Enabled co-debug with software developers 17

The Era of Software Defined Systems

Why FPGAs for Software Defined Systems? The Era of Virtualization Reconfigurable computing, storage and networking in the cloud The Thirst for Acceleration Heterogeneous computing Compute-intensive algorithms DNA sequencing Search engines Video processing Encryption/Decryption Packet routing FPGAs and Programmable SoCs: Power-efficient Reconfigurable Massively-Parallel Compute Engines

Query Example of FPGAs as Accelerators Smith-Waterman DNA Sequencing Application Reference Compares Query(N) with Reference(M) genome strings Involves MxN Matrix Computation and Dynamic Programming Maximal parallelism along diagonals Xilinx Virtex-7 690T (reference) Intel Xeon E5-2697 12 core Ratio Virtex-7 vs Intel 12 core Intel Xeon Phi 5110P 60 core Ratio Virtex-7 vs Intel 60 Core GCUPS 7700 1975 390 3000 257 Watts 2800 13000 022 22500 012 GCUPS/Watt 275 015 1810 013 2063

SDSoc: Software Defined SoC Development Applications: Machine Vision Driver Assistance/ADAS Software-Defined Radio (SDR) Wireless Radio Surveillance UAV / Drones Full System Optimizing Compiler ARM Code Main( ) C/C++ Development System-level Profiling Mark C/C++ Functions for Acceleration GCC Connectivity HLS+ SP&R Standard Eclipse IDE Accelerator Func( ) Embedded ARM Processor Subsystem Programmable Logic

SDAccel: Software Defined Algorithm Acceleration Sample Applications: Machine Learning Bioinformatics Graph Processing Stringology Data Analytics Modelling Science Codes Signal Processing Video & Image Processing Software-Defined FPGA Acceleration

Platforms Enable Software Defined FPGA Systems Pre-defined Platform Hardware System Performance Partial Design Board Algorithms Support Reconfig Analysis & Host Software Stack

Summary HW designers: SW developers: C-based IP development + highlevel IP assembly are the next step beyond RTL Software-defined algorithm development + platforms will enable you to exploit the power of FPGAs & SoCs We re making major investments in next generation silicon and tools that will revolutionize FPGA design

Thank You Thank You!