Axon: A Flexible Substrate for Source- routed Ethernet. Jeffrey Shafer Brent Stephens Michael Foss Sco6 Rixner Alan L. Cox

Similar documents
VXLAN: Scaling Data Center Capacity. White Paper

Radhika Niranjan Mysore, Andreas Pamboris, Nathan Farrington, Nelson Huang, Pardis Miri, Sivasankar Radhakrishnan, Vikram Subramanya and Amin Vahdat

Portland: how to use the topology feature of the datacenter network to scale routing and forwarding

CS 91: Cloud Systems & Datacenter Networks Networks Background

Ethernet-based Software Defined Network (SDN) Cloud Computing Research Center for Mobile Applications (CCMA), ITRI 雲 端 運 算 行 動 應 用 研 究 中 心

Internet Packets. Forwarding Datagrams

PortLand:! A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric

OVERLAYING VIRTUALIZED LAYER 2 NETWORKS OVER LAYER 3 NETWORKS

Advanced Computer Networks. Datacenter Network Fabric

基 於 SDN 與 可 程 式 化 硬 體 架 構 之 雲 端 網 路 系 統 交 換 器

Network layer: Overview. Network layer functions IP Routing and forwarding

ABSTRACT. Designing Scalable Networks for Future Large Datacenters. Brent Stephens

hp ProLiant network adapter teaming

UPPER LAYER SWITCHING

CS335 Sample Questions for Exam #2

Ethernet-based Software Defined Network (SDN)

TRILL for Service Provider Data Center and IXP. Francois Tallet, Cisco Systems

Objectives. The Role of Redundancy in a Switched Network. Layer 2 Loops. Broadcast Storms. More problems with Layer 2 loops

Transport and Network Layer

20. Switched Local Area Networks

Network Virtualization for Large-Scale Data Centers

VXLAN Bridging & Routing

Definition. A Historical Example

CORPORATE NETWORKING

OpenFlow: Enabling Innovation in Campus Networks

Data Networking and Architecture. Delegates should have some basic knowledge of Internet Protocol and Data Networking principles.

Internet Protocols Fall Lectures 7-8 Andreas Terzis

RESILIENT NETWORK DESIGN

Data Center Network Topologies: FatTree

Extending Networking to Fit the Cloud

IP Networking. Overview. Networks Impact Daily Life. IP Networking - Part 1. How Networks Impact Daily Life. How Networks Impact Daily Life

- Hubs vs. Switches vs. Routers -

ETHERNET WAN ENCRYPTION SOLUTIONS COMPARED

Cisco Data Centre: Introducing Cisco Data Center Networking

vsphere Networking vsphere 6.0 ESXi 6.0 vcenter Server 6.0 EN

Juniper / Cisco Interoperability Tests. August 2014

CCT vs. CCENT Skill Set Comparison

ADVANCED NETWORK CONFIGURATION GUIDE

ANZA Formación en Tecnologías Avanzadas

Chapter 4 Network Layer

Efficient Addressing. Outline. Addressing Subnetting Supernetting CS 640 1

Howstuffworks "How LAN Switches Work" Click here to go back to the normal view!

Network layer" 1DT066! Distributed Information Systems!! Chapter 4" Network Layer!! goals: "

Performance Evaluation of Linux Bridge

What is VLAN Routing?

How To Build A Policy Aware Switching Layer For Data Center Data Center Servers

Floodless in SEATTLE: A Scalable Ethernet Architecture for Large Enterprises

STATE OF THE ART OF DATA CENTRE NETWORK TECHNOLOGIES CASE: COMPARISON BETWEEN ETHERNET FABRIC SOLUTIONS

Introduction to LAN/WAN. Network Layer (part II)

Network Virtualization and Data Center Networks Data Center Virtualization - Basics. Qin Yin Fall Semester 2013

Final for ECE374 05/06/13 Solution!!

Zarząd (7 osób) F inanse (13 osób) M arketing (7 osób) S przedaż (16 osób) K adry (15 osób)

Datagram-based network layer: forwarding; routing. Additional function of VCbased network layer: call setup.

Chapter 3. Enterprise Campus Network Design

Multiple Service Load-Balancing with OpenFlow

VIA CONNECT PRO Deployment Guide

Leased Line + Remote Dial-in connectivity

SSVP SIP School VoIP Professional Certification

Networking Technology Online Course Outline

OpenFlow Based Load Balancing

NComputing L-Series LAN Deployment

vsphere Networking ESXi 5.0 vcenter Server 5.0 EN

How To Make A Vpc More Secure With A Cloud Network Overlay (Network) On A Vlan) On An Openstack Vlan On A Server On A Network On A 2D (Vlan) (Vpn) On Your Vlan

Ethernet Fabric Requirements for FCoE in the Data Center

Fibre Channel over Ethernet in the Data Center: An Introduction

DG Forwarding Algorithm

Switching in an Enterprise Network

IT-AD08: ADD ON DIPLOMA IN COMPUTER NETWORK DESIGN AND INSTALLATION

Quantifying the Performance Degradation of IPv6 for TCP in Windows and Linux Networking

Scaling IP Mul-cast on Datacenter Topologies. Xiaozhou Li Mike Freedman

Torii-HLMAC: Torii-HLMAC: Fat Tree Data Center Architecture Elisa Rojas University of Alcala (Spain)

Module 15: Network Structures

EVOLVING ENTERPRISE NETWORKS WITH SPB-M APPLICATION NOTE

Virtual PortChannels: Building Networks without Spanning Tree Protocol

vsphere Networking vsphere 5.5 ESXi 5.5 vcenter Server 5.5 EN

: Interconnecting Cisco Networking Devices Part 1 v2.0 (ICND1)

Performance Analysis of IPv4 v/s IPv6 in Virtual Environment Using UBUNTU

Outline. VL2: A Scalable and Flexible Data Center Network. Problem. Introduction 11/26/2012

Redundancy and load balancing at L3 in Local Area Networks. Fulvio Risso Politecnico di Torino

Agenda. Distributed System Structures. Why Distributed Systems? Motivation

Intel Advanced Network Services Software Increases Network Reliability, Resilience and Bandwidth

Candidates should attempt FOUR questions. All questions carry 25 marks.

Internet Working 5 th lecture. Chair of Communication Systems Department of Applied Sciences University of Freiburg 2004

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

Operating System Concepts. Operating System 資 訊 工 程 學 系 袁 賢 銘 老 師

COURSE AGENDA. Lessons - CCNA. CCNA & CCNP - Online Course Agenda. Lesson 1: Internetworking. Lesson 2: Fundamentals of Networking

Chapter 1 Reading Organizer

Redundancy and load balancing at L3 in Local Area Networks. Fulvio Risso Politecnico di Torino

CCNA R&S: Introduction to Networks. Chapter 5: Ethernet

Networking Topology For Your System

A New Approach to Developing High-Availability Server

Cisco Certified Network Associate (CCNA) 120 Hours / 12 Months / Self-Paced WIA Fee: $

From Active & Programmable Networks to.. OpenFlow & Software Defined Networks. Prof. C. Tschudin, M. Sifalakis, T. Meyer, M. Monti, S.

Outline. CSc 466/566. Computer Security. 18 : Network Security Introduction. Network Topology. Network Topology. Christian Collberg

CS 457 Lecture 19 Global Internet - BGP. Fall 2011

A Case for Overlays in DCN Virtualization Katherine Barabash, Rami Cohen, David Hadas, Vinit Jain, Renato Recio and Benny Rochwerger IBM

CCNP SWITCH: Implementing High Availability and Redundancy in a Campus Network

Using Network Virtualization to Scale Data Centers

John Ragan Director of Product Management. Billy Wise Communications Specialist

Transcription:

Axon: A Flexible Substrate for Source- routed Ethernet Jeffrey Shafer Brent Stephens Michael Foss Sco6 Rixner Alan L. Cox

2 Ethernet Tradeoffs Strengths Weaknesses Cheap Simple High data rate Ubiquitous Loop- free forwarding topology limits bandwidth Broadcasts and packet flooding for locajon discovery limits scalability Scale up Ethernet to work effecjvely in a modern datacenter?

3 Ethernet in the Datacenter TradiJonal solujon: Small Ethernet LANs + IP routers Increases network complexity Hinders live virtual machine migrajon Recent proposals Many VLAN overlays (see SPAIN) Re- wrijng MAC addresses to add hierarchy (see PortLand or MOOSE) New non- broadcast locajon service (see SEATTLE) ExisJng techniques keep Ethernet frame format 30 years old! Let s replace it!

4 What is the Axon Device? Take a tradi*onal datacenter network Router Border Router Router Internet Router Switch Switch Switch Switch Switch Server Rack Server Rack Server Rack Server Rack Server Rack

5 What is the Axon Device? Border Router Internet Replace all the interior network devices (Ethernet switches, IP routers) Server Rack Server Rack Server Rack Server Rack Server Rack

6 What is the Axon Device? with an arbitrary graph of Axons Border Router Internet Axon Axon Axon Axon Axon Server Rack Server Rack Server Rack Server Rack Server Rack

7 Axon Overview Axons deploy a new datalink- layer protocol: source- routed Ethernet Full path placed in packet header Used internally between Axons (Axon Axon) Standard Ethernet PHYs Axons maintain compacbility with unmodified hosts AbstracJon of a single large subnet TradiJonal Ethernet used externally (Host Axon) Packets are transparently rewri6en by Axons

8 Advantages of Source- routed Ethernet Flexibility in network topology Support arbitrary paths, including loops! In tradijonal Ethernet, STP disables redundant links (cannot carry data) Flexibility in roucng algorithms Shortest- path? CongesJon- aware? Improved scalability Each Axon only stores routes for locally- connected hosts Interior Axons just follow route in packet header In tradijonal switches/routers, a lookup must be performed at every hop along the path

9 Organization 1. Introduc=on 2. Design Overview Source- routed Ethernet CompaCbility with ExisJng Hosts 3. EvaluaJon Hardware Prototype So_ware Simulator

10 Traditional Ethernet Bytes: (6) (6) (2) (Varies) (4) Dest MAC Addr Src MAC Addr Type/ Length Data CRC Forwarding At each hop, must lookup desjnajon address in a forwarding table to obtain output port (CAM lookup) In contrast, source- routed Ethernet has a new header containing the full path list How to obtain transparent compa*bility?

11 Communication Axons present illusion that encre network is simply a large Ethernet segment Host A wishes to communicate with Host Z Host A Axon A Network of Axons Axon Z Host Z

12 Communication Host A issues ARP request to locate Host Z ARP Request Host A Axon A Network of Axons Axon Z Host Z

13 Communication Axon A intercepts broadcast ARP request Axon A begins establishing route with Axon Z Axon Z sends ARP request to Host Z ARP Request Establishing Route ARP Request ARP Request Host A Axon A Network of Axons Axon Z Host Z

14 Communication Host Z responds with ARP reply (captured by Axon Z) Axon Z installs route to Host A Axon A installs route to Host Z Axon A issues ARP reply Establish Route ARP Reply ARP Reply ARP Reply Host A Axon A Network of Axons Axon Z Host Z C A M Z Route C A M A Route

15 Communication Host A sends data to Host Z Axon A looks up route and encapsulates data for transport Source roucng used internally (Axon Axon) Data unpacked for delivery Data Axon Hdr Data Data Host A Axon A Network of Axons Axon Z Host Z C A M Z Route C A M A Route

16 Source- Routed Ethernet Bytes: (0.5) (2) (3) (1 per hop) (1 per hop) (0.5) (Varies) (4) Type Len Hop Counts (Fwd / Rev) Fwd Route Rev Route Unused Data CRC Original Ethernet frame Packet header contains two routes: Forward route from current Axon to desjnajon Grows shorter at each hop Reverse route from current Axon to source Grows longer at each hop Each 1- byte route item specifies an output port Forwarding At each hop, read header to obtain next output port Prepend arrival port to reverse route header Works with standard Ethernet PHYs and MACs by using jumbo frames

17 Route Generation Generate a route on the first ARP for flow Cache at local Axon for subsequent packets Prototype design Central route controller with full topology knowledge Inspired by Ethane and Tesseract projects Could also implement a distributed mechanism RouJng algorithm: Shortest- path or congesjon aware Key point: source roujng allows for arbitrary topologies, arbitrary paths (including loops), and arbitrary roucng algorithms Casado et. al., Ethane: taking control of the enterprise, SIGCOMM 2007 Yan et. al, Tesseract: A 4D network control plane, NSDI 2007

18 Organization 1. Introduc=on 2. Design Overview Source- routed Ethernet Compa=bility with Exis=ng Hosts 3. EvaluaJon Hardware Prototype Measure performance Demonstrate compacbility So_ware Simulator

19 Hardware Prototype Data plane 4- port NetFPGA Custom verilog Packet forwarding and translajon Control plane Intel Atom processor on mini- ITX board Linux + applicajon program

20 Test Networks Line Topology: Host Host Host Axon Axon Axon Host Host Host Ring Topology: (Can t build with conven=onal Ethernet!) Host Host Host Axon Axon Axon Host Host Host

21 Higher Bandwidth Test setup: Used both ring and line topology 1 TCP or UDP flow from each host to a host on a different Axon Measured aggregate bandwidth (Mbit/s) UDP TCP Line Ring Line Ring 2906 5690 2425 3951 Shows bandwidth benefit of using redundant links

22 Lower Latency Measured forwarding latency Axon Axon Axon Host Host Axon 520ns 520ns 720ns Compares favorably against gigabit Ethernet switch 7-28us per hop (varies with packet size) Latency advantage in Axon design Cut- through forwarding instead of store- and- forward Forwarding table lookup only at first hop (to obtain route) TradiJonal Ethernet switches do lookup at every hop

23 Lower Latency in Applications Test setup PostMark benchmark Line topology with Axons or switches Each Axon adds a smaller per- hop latency compared to an Ethernet switch Only first Axon does a route lookup PostMark TransacCons/sec 1400 1200 1000 800 600 400 200 0 Axon Switch 1 2 3 4 5 Number of Hops

24 Host Compatibility Host Host Host Axon Axon Axon Host Host Host Demonstrated compajbility with unmodified hosts Windows, Mac OS X, FreeBSD, Linux, Netgear switch, Cisco IP router, Linksys wireless access point,

25 Organization 1. Introduc=on 2. Design Overview Source- routed Ethernet Compa=bility with Exis=ng Hosts 3. EvaluaJon Hardware Prototype Soaware Simulator Evaluate design at large scales and arbitrary topologies

26 Simulator Custom soaware simulator Simulated Axons, hosts, and links Based on prototype Each simulated Axon runs same control soaware Each simulated host represented by ARP generator ARPs from host trigger route generajon, which is the overhead we are most concerned about

27 Lower Control Overhead Characterize overhead bandwidth used for Axon control Network topology maintenance (discovery and heartbeat messages) Route generajon and disseminajon Simulator Setup Topologies: Torus, Fat tree, Fla6ened- bu6erfly, Random Up to 50,000 hosts and 5,000 Axons Each host generates 10 ARPs/sec (new flows only!) ConservaJve choice compared to peak of 0.5 ARPs/sec reported in Ethane network and LBNL trace

28 Lower Control Overhead Showing torus topology Max Axon Overhead Max link has highest overhead A6ached to central controller 1000 Avg Ethernet Overhead Avg Axon Overhead Average Axon link has less overhead than average Ethernet link ARPs not broadcast Torus is worst case topology for Axons Highest average distance from controller Mbps per Link 100 10 1 0.1 500 5000 50000 Number of Hosts

29 Overhead Comparison Compared against PortLand architecture Fat tree topology Axon host discovery protocol more efficient Very similar average link overhead to Axons once PortLand has warmed up Axon packets are slightly larger due to source routes Mbps per Link 100 0.01 Mysore et al., PortLand: a scalable fault- tolerant layer 2 data center network fabric, SIGCOMM 2009. 10 1 0.1 Max Axon Average Axon Max PortLand (Cold) Avg PortLand (Cold) Max PortLand (Warm) Average PortLand (Warm) 9000 11000 13000 15000 17000 Number of Hosts

30 Flexible Route Selection Implemented weighted shortest path roucng in central controller (similar to SPAIN) Weight is number of flows across a link Disperses flows across many links (congesjon avoidance!) Demonstrated Axon flexibility to easily support alternate route selecjon algorithms Results Average route length increases by 0.1 hops Busiest link (measured by flow count) has the number of flows cut in half! Mudigonda et. al., SPAIN: COTS data- center Ethernet for muljpathing over arbitrary topologies, NSDI 2010.

31 Summary Source- routed Ethernet is flexible Supports arbitrary topologies and roujng algorithms Axons unlock this flexibility for exisjng hosts AbstracJon giant Ethernet segment (flat IP address space) Migrate a VM from any point to any point in the enjre network Transparent packet rewrijng FPGA prototype demonstrated design is simple and pracjcal Simulator demonstrated reasonable control overhead for real- world network sizes Control overhead on a 50,000 host network is only 0.25% of total link bandwidth

32 Questions????

33

34 Lower Control Overhead Showing average overhead for all topologies Torus has highest average distance from controller Thus highest overhead Even the torus was a significant win over convenjonal Ethernet

35 Byte Overhead of Source- Routed Ethernet Payload Bandwidth (Mbps) Ethernet Axon (1 Hop) Axon (10 Hops) Axon (100 Hops) 1000 800 600 400 200 0 0 200 400 600 800 1000 1200 1400 Packet Size (Bytes)

36 Evaluation Memory Requirements How large of a CAM does each Axon need to support all locally- a6ached hosts? Worst- case scenario Axon a6ached to the border router (to reach public Internet) must have routes to all internal hosts with an acjve flow Best- case scenario Core Axons no a6ached hosts at all! Wrote custom trace analyzer to measure re- use distance between messages to the same desjnajon IP address

37 Evaluation Memory Requirements Traces examined LBNL NCAR- I CESCA- I Link connecjng scienjfic ring to public Internet 4k CAM entries sufficient Commercial switches already have 8k+ entries Many datacenter flows will be internal (and thus avoid the worst- case Axon)

38 Axon Compatibility The first Axon (connected to a sending host) has several funcjons Intercept ARP and DHCP packets Transparently rewrite packet from tradijonal to source Ethernet format Interior Axons just follow route in packet header The last Axon (connected to a receiving host) transparently rewrites packet back to tradijonal Ethernet format