Networking Virtualization Using FPGAs



Similar documents
THe Internet provides a critical infrastructure for a

Scalable Network Virtualization Using FPGAs

Xeon+FPGA Platform for the Data Center

Seeking Opportunities for Hardware Acceleration in Big Data Analytics

Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build

Where IT perceptions are reality. Test Report. OCe14000 Performance. Featuring Emulex OCe14102 Network Adapters Emulex XE100 Offload Engine

A Low Latency Library in FPGA Hardware for High Frequency Trading (HFT)

How Router Technology Shapes Inter-Cloud Computing Service Architecture for The Future Internet

Kirchhoff Institute for Physics Heidelberg

Achieving a High-Performance Virtual Network Infrastructure with PLUMgrid IO Visor & Mellanox ConnectX -3 Pro

FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab

HANIC 100G: Hardware accelerator for 100 Gbps network traffic monitoring

Open Flow Controller and Switch Datasheet

High-performance vswitch of the user, by the user, for the user

Windows Server 2008 R2 Hyper-V Live Migration

VON/K: A Fast Virtual Overlay Network Embedded in KVM Hypervisor for High Performance Computing

Virtual Software Routers: A Performance and Migration Study

Data Center and Cloud Computing Market Landscape and Challenges

Sockets vs. RDMA Interface over 10-Gigabit Networks: An In-depth Analysis of the Memory Traffic Bottleneck

COLO: COarse-grain LOck-stepping Virtual Machine for Non-stop Service

The new frontier of the DATA acquisition using 1 and 10 Gb/s Ethernet links. Filippo Costa on behalf of the ALICE DAQ group

Intel Xeon +FPGA Platform for the Data Center

OpenFlow with Intel Voravit Tanyingyong, Markus Hidell, Peter Sjödin

Scaling in a Hypervisor Environment

Cloud Data Center Acceleration 2015

White Paper FPGA Performance Benchmarking Methodology

Performance Evaluation of Linux Bridge

PCI Express* Ethernet Networking

White Paper Increase Flexibility in Layer 2 Switches by Integrating Ethernet ASSP Functions Into FPGAs

Telecom - The technology behind

Virtual Switching Without a Hypervisor for a More Secure Cloud

基 於 SDN 與 可 程 式 化 硬 體 架 構 之 雲 端 網 路 系 統 交 換 器

7a. System-on-chip design and prototyping platforms

Gigabit Ethernet Design

Windows Server 2008 R2 Hyper-V Live Migration

1000Mbps Ethernet Performance Test Report

The virtualization of SAP environments to accommodate standardization and easier management is gaining momentum in data centers.

Quantifying the Performance Degradation of IPv6 for TCP in Windows and Linux Networking

D1.2 Network Load Balancing

Linux NIC and iscsi Performance over 40GbE

Evolution to Virtual and Highly Scalable Platforms

Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU

Benchmarking Virtual Switches in OPNFV draft-vsperf-bmwg-vswitch-opnfv-00. Maryam Tahhan Al Morton

Virtualised MikroTik

All Programmable Logic. Hans-Joachim Gelke Institute of Embedded Systems. Zürcher Fachhochschule

Full and Para Virtualization

Evaluation Report: Emulex OCe GbE and OCe GbE Adapter Comparison with Intel X710 10GbE and XL710 40GbE Adapters

Distributed Elastic Switch Architecture for efficient Networks-on-FPGAs

CFD Implementation with In-Socket FPGA Accelerators

Performance of Network Virtualization in Cloud Computing Infrastructures: The OpenStack Case.

FPGA-based Multithreading for In-Memory Hash Joins

Evaluation Methodology of Converged Cloud Environments

FPGA-based MapReduce Framework for Machine Learning

Architekturen und Einsatz von FPGAs mit integrierten Prozessor Kernen. Hans-Joachim Gelke Institute of Embedded Systems Professur für Mikroelektronik

Simplify VMware vsphere* 4 Networking with Intel Ethernet 10 Gigabit Server Adapters

QoS & Traffic Management

Performance Comparison of Fujitsu PRIMERGY and PRIMEPOWER Servers

An Oracle White Paper July Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide

LSI MegaRAID CacheCade Performance Evaluation in a Web Server Environment

Network performance in virtual infrastructures

Hardware Performance Optimization and Tuning. Presenter: Tom Arakelian Assistant: Guy Ingalls

High-performance vnic framework for hypervisor-based NFV with userspace vswitch Yoshihiro Nakajima, Hitoshi Masutani, Hirokazu Takahashi NTT Labs.

Leveraging NIC Technology to Improve Network Performance in VMware vsphere

Creating Overlay Networks Using Intel Ethernet Converged Network Adapters

Performance of Software Switching

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies

Virtual Machine Monitors. Dr. Marc E. Fiuczynski Research Scholar Princeton University

PCI Express IO Virtualization Overview

A Reconfigurable and Programmable Gigabit Ethernet Network Interface Card

A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems

LogiCORE IP AXI Performance Monitor v2.00.a

I/O Virtualization Using Mellanox InfiniBand And Channel I/O Virtualization (CIOV) Technology

Broadcom Ethernet Network Controller Enhanced Virtualization Functionality

Next Generation Operating Systems

IMPLEMENTATION OF FPGA CARD IN CONTENT FILTERING SOLUTIONS FOR SECURING COMPUTER NETWORKS. Received May 2010; accepted July 2010

Building a Fast, Virtualized Data Plane with Programmable Hardware

Operating System Support for Multiprocessor Systems-on-Chip

IxChariot Virtualization Performance Test Plan

Understanding Latency in Software Defined Networks

RoCE vs. iwarp Competitive Analysis

How To Compare Performance Of A Router On A Hypervisor On A Linux Virtualbox 2.5 (Xen) To A Virtualbox (Xeen) Xen-Virtualization (X

Managed Virtualized Platforms: From Multicore Nodes to Distributed Cloud Infrastructures

Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand

Virtualization: TCP/IP Performance Management in a Virtualized Environment Orlando Share Session 9308

Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage

Network Virtualization for Large-Scale Data Centers

A Holistic Model of the Energy-Efficiency of Hypervisors

Network Virtualization Model - Planet Lab

Datacenter Operating Systems

Gigabit Ethernet Packet Capture. User s Guide

TCP Offload Engines. As network interconnect speeds advance to Gigabit. Introduction to

OPTIMIZE DMA CONFIGURATION IN ENCRYPTION USE CASE. Guillène Ribière, CEO, System Architect

PCI Express Overview. And, by the way, they need to do it in less time.

Performance of Host Identity Protocol on Nokia Internet Tablet

NAND Flash Architecture and Specification Trends

FPGA area allocation for parallel C applications

Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet. September 2014

Mellanox Cloud and Database Acceleration Solution over Windows Server 2012 SMB Direct

VMware vsphere 4.1 Networking Performance

Challenges in high speed packet processing

Transcription:

Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Massachusetts, Amherst, USA

Outline Network virtualization Advantage of FPGA for virtualization Architecture of FPGA-based virtualization system Partially-reconfigurable implementation Results 2

Virtual Networking Internet growing to include new applications and services Cloud computing, Data center networking Challenges Lack of innovation at network core Fixed internet routers Coupled infrastructure-service provider model Solution Network virtualization Router hardware shared across multiple networks 3

Network Virtualization Many logical networks over a physical infrastructure Virtual nodes B D Shared network resources among multiple virtual networks A C E F Reduces costs Independent routing policies A B C D E F 4

Virtual Router Independent routing policies for each virtual router Key challenges Isolation Performance Flexibility Scalability 5

Traditional Network Virtualization Techniques Software Full virtualization Container virtualization Limitations VM VM Limited performance (~100Mbps) HW Full virtualization Limited isolation ASIC Supercharging PlanetLab Platform 1 Juniper E series Possible Limitations Flexibility Scalability OS instance OS instance HW Container virtualization 1 Supercharging PlanetLab A High Performance, Multi-Application, Overlay Network Platform, J. Turner et al., SIGCOMM 2007 6

Virtualization using FPGAs A novel network virtualization substrate which Uses FPGA to implement high performance virtual routers Introduces scalability through virtual routers in host software Exploits reconfiguration to customize hardware virtual routers 7

System Overview NIC Software Virtual Router Software Virtual Router Software Virtual Router NIC Linux Software bridge Kernel driver PCI SDRAM SDRAM HW VRouter HW VRouter SRAM SRAM PHY 1G Ethernet I/F NetFPGA 8

Architecture 9

Scalable Network Virtualization Approaches for implementing scalable virtual routers Single receiver approach All packets routed through NetFPGA hardware Multi-receiver approach Use a switch to separate packets to NetFPGA and software 10

Virtual Router Customization Multiple virtual routers share the substrate Individual virtual routers may need customization Challenge Minimize the impact on traffic in shared hardware virtual routers during modification 11

Multi Receiver Approach 12

Dynamic Reconfiguration Reconfigure FPGA New HW Virtual Router Sw Virtual Router Address Remap 13

Single receiver approach 14

Partial Reconfiguration Use partial reconfiguration to independently configure virtual routers 15

Partial Reconfiguration Up to 2 partially reconfigurable virtual routers in Virtex 2 Up to 20 partially reconfigurable virtual routers in Virtex 5 16

Experimental Approach Metrics Throughput Latency Source NetFPGA Pktgen/ iperf 1Gbps Virtual Router 1Gbps Sink NetFPGA Pktcap/ iperf Packet generation NetFPGA packet generator iperf Ping utility used for latency measurements Software virtual routers run on 3Ghz AMD X2 6000+ processor, 2GB RAM, Intel E1000 GbE in PCIe slot 17

Dynamic Reconfiguration Overhead Full FPGA reconfiguration Source pings destination through a single h/w virtual router Migrate traffic to software Reconfigure FPGA Migrate traffic to hardware 12 seconds reconfiguration overhead Partial FPGA reconfiguration 0.6 seconds reconfiguration Remaining virtual routers in FPGA remain active 18

Throughput of Single Hardware Virtual Router Hardware virtual router 1 to 2 order better than the software virtual router Consistent forwarding rates vs packet size Throughput (Mbps) 1000 100 10 Hardware virtual router Software virtual router 1 64 128 512 1024 1470 Packet size (bytes) 19

Full FPGA Reconfiguration Two virtual routers (A, B) initially in FPGA During reconfiguration router A migrated to software, the other eliminated After reconfiguration two virtual routers (A, B ) again in FPGA Reduced Throughput 20

Partial FPGA Reconfiguration A remains in hardware and operates at full speed 20X speedup in reconfiguration down time due to partial reconfiguration Sustained Throughput 21

Scalability - Average Throughput of Virtual Routers Aggregate throughput of 1Gbps up to 4 h/w virtual routers Adding software virtual routers causes drop in aggregate throughput 1000 Throughput (Mbps) 100 10 Avg hardware and software Software only 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Virtual routers 22

Scalability Average Latency of Virtual Routers Latency of h/w virtual router substantially better than software virtual router Latency (ms) 1.2 1 0.8 0.6 0.4 0.2 Software only Avg hardware and software 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Virtual routers 23

Throughput Effect of Reconfiguration Frequency Partial reconfiguration beneficial during frequent updates 24

Average throughput for combined hardware/software Rigid placement constraints limit #hardware virtual routers Larger FPGAs sustain high throughputs for more virtual routers 25

Altera DE4 26

Throughput Measurements on DE-4 Reference Router Consistent packet forwarding rates of 1Gbps for all packet sizes (64-1024) bytes. Packet throughput matches NetFPGA reference router in all cases. Packet buffering in on-chip SRAM (300Kbit). 27

Resource Utilization Logic Utilization 14% LUTs 19% Logic registers 9% Memory bits 13% IOs Altera DE4 NetFPGA Used Available % Used Available % Combinat ional LUTs Logic Registers 24,727 182,400 14% 19,783 47,232 41% 34,218 182,400 19% 14,682 47232 31% Memory bits 1,448,61 2 14,625,79 2 9% 24 232 10.3% IOs 118 888 13% 356 692 51% 28

Reconfiguration Time 20X Faster virtual router updates with partial reconfiguration Full Reconfiguration Partial Reconfiguration 29

Resource Usage and Power Consumption Virtex 2 can support 5 h/w virtual routers (without support for software routers) 4 h/w virtual routers (with support for software routers) 2 partially reconfigurable hardware routers Virtex 5 can support up to 32 virtual routers A single h/w virtual router consumes 156mW static power 688mW dynamic power Clock gating saves 10% power 30

Dynamic Virtual Network Allocation Assign virtual networks to software/hardware virtual routers based on bandwidth-resource requirements 15-20% more successful network upgrades with dynamic allocation 31

Conclusion FPGAs are ideal for a variety of networking applications Partial reconfiguration growing in importance Virtual networking is an emerging networking area Combined FPGA and software system gives high performance and scalability A novel heterogeneous network virtualization substrate using FPGAs NetFPGA platform used to implement up to five virtual routers Two order of magnitude throughput increase versus software. Partial FPGA reconfiguration reduces the number of implemented routers but reduces reconfiguration time to 0.6s Allocation algorithm migrates highest throughput routers to hardware 32