Cisco s Massively Scalable Data Center



Similar documents
Cisco s Massively Scalable Data Center. Network Fabric for Warehouse Scale Computer

Scaling 10Gb/s Clustering at Wire-Speed

VMDC 3.0 Design Overview

TRILL for Service Provider Data Center and IXP. Francois Tallet, Cisco Systems

Data Center Switch Fabric Competitive Analysis

Scalable Approaches for Multitenant Cloud Data Centers

Large-Scale Distributed Systems. Datacenter Networks. COMP6511A Spring 2014 HKUST. Lin Gu

Data Center Infrastructure of the future. Alexei Agueev, Systems Engineer

Architecting Low Latency Cloud Networks

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches

Data Center Network Topologies: FatTree

Non-blocking Switching in the Cloud Computing Era

VMware Virtual SAN 6.2 Network Design Guide

OVERLAYING VIRTUALIZED LAYER 2 NETWORKS OVER LAYER 3 NETWORKS

Large Scale Clustering with Voltaire InfiniBand HyperScale Technology

Lecture 7: Data Center Networks"

Intel Ethernet Switch Converged Enhanced Ethernet (CEE) and Datacenter Bridging (DCB) Using Intel Ethernet Switch Family Switches

InfiniBand Clustering

Simplify Your Data Center Network to Improve Performance and Decrease Costs

In-band Network Telemetry (INT) Mukesh Hira, VMware Naga Katta, Princeton University

Portland: how to use the topology feature of the datacenter network to scale routing and forwarding

Brocade Solution for EMC VSPEX Server Virtualization

Cloud Networking: A Novel Network Approach for Cloud Computing Models CQ1 2009

Stretched Active- Active Application Centric Infrastructure (ACI) Fabric

CONGA: Distributed Congestion-Aware Load Balancing for Datacenters

STATE OF THE ART OF DATA CENTRE NETWORK TECHNOLOGIES CASE: COMPARISON BETWEEN ETHERNET FABRIC SOLUTIONS

Voice Over IP. MultiFlow IP Phone # 3071 Subnet # Subnet Mask IP address Telephone.

How Router Technology Shapes Inter-Cloud Computing Service Architecture for The Future Internet

Data Center Fabrics What Really Matters. Ivan Pepelnjak NIL Data Communications

Introducing Basic MPLS Concepts

Brocade Data Center Fabric Architectures

Data Center Network Topologies

Load Balancing in Data Center Networks

Core and Pod Data Center Design

Data Center Use Cases and Trends

Brocade Data Center Fabric Architectures

Avoiding Network Polarization and Increasing Visibility in Cloud Networks Using Broadcom Smart- Hash Technology

Flexible SDN Transport Networks With Optical Circuit Switching

MPLS-TP. Future Ready. Today. Introduction. Connection Oriented Transport

TRILL for Data Center Networks

全 新 企 業 網 路 儲 存 應 用 THE STORAGE NETWORK MATTERS FOR EMC IP STORAGE PLATFORMS

What is VLAN Routing?

SDN IN WAN NETWORK PROGRAMMABILITY THROUGH CENTRALIZED PATH COMPUTATION. 1 st September 2014

Qualifying SDN/OpenFlow Enabled Networks

Cisco Dynamic Workload Scaling Solution

The Software Defined Hybrid Packet Optical Datacenter Network SDN AT LIGHT SPEED TM CALIENT Technologies

Data Center Networking Designing Today s Data Center

CloudEngine Series Data Center Switches

DEMYSTIFYING ROUTING SERVICES IN SOFTWAREDEFINED NETWORKING

Computer Networks COSC 6377

Testing Software Defined Network (SDN) For Data Center and Cloud VERYX TECHNOLOGIES

Network Virtualization for the Enterprise Data Center. Guido Appenzeller Open Networking Summit October 2011

Operating Systems. Cloud Computing and Data Centers

CORD Fabric, Overlay Virtualization, and Service Composition

How to Monitor a FabricPath Network

Pluribus Netvisor Solution Brief

Leveraging Advanced Load Sharing for Scaling Capacity to 100 Gbps and Beyond

PortLand:! A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric

Virtualizing the SAN with Software Defined Storage Networks

SOLARWINDS NETWORK PERFORMANCE MONITOR

SolarWinds Network Performance Monitor

Transform Your Business and Protect Your Cisco Nexus Investment While Adopting Cisco Application Centric Infrastructure

Table of Contents. Cisco How Does Load Balancing Work?

CS6204 Advanced Topics in Networking

Where IT perceptions are reality. Test Report. OCe14000 Performance. Featuring Emulex OCe14102 Network Adapters Emulex XE100 Offload Engine

Virtual PortChannels: Building Networks without Spanning Tree Protocol

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

CS 78 Computer Networks. Internet Protocol (IP) our focus. The Network Layer. Interplay between routing and forwarding

Use Case Brief BUILDING A PRIVATE CLOUD PROVIDING PUBLIC CLOUD FUNCTIONALITY WITHIN THE SAFETY OF YOUR ORGANIZATION

Solving Scale and Mobility in the Data Center A New Simplified Approach

SolarWinds Network Performance Monitor powerful network fault & availabilty management

TRILL Large Layer 2 Network Solution

SDN CENTRALIZED NETWORK COMMAND AND CONTROL

Networking in the Era of Virtualization

Multihoming and Multi-path Routing. CS 7260 Nick Feamster January

Longer is Better? Exploiting Path Diversity in Data Centre Networks

SolarWinds Network Performance Monitor

CloudEngine Series Data Center Switches. Cloud Fabric Data Center Network Solution

Data Center Convergence. Ahmad Zamer, Brocade

Troubleshooting and Maintaining Cisco IP Networks Volume 1

Fibre Channel over Ethernet in the Data Center: An Introduction

A Coordinated. Enterprise Networks Software Defined. and Application Fluent Programmable Networks

Network Contention and Congestion Control: Lustre FineGrained Routing

Network Virtualization and Data Center Networks Data Center Virtualization - Basics. Qin Yin Fall Semester 2013

Ethernet Fabrics: An Architecture for Cloud Networking

Advanced Computer Networks. Datacenter Network Fabric

MPLS/BGP Network Simulation Techniques for Business Enterprise Networks

Deliver the Next Generation Intelligent Datacenter Fabric with the Cisco Nexus 1000V, Citrix NetScaler Application Delivery Controller and Cisco vpath

How To Switch A Layer 1 Matrix Switch On A Network On A Cloud (Network) On A Microsoft Network (Network On A Server) On An Openflow (Network-1) On The Network (Netscout) On Your Network (

VMware and Brocade Network Virtualization Reference Whitepaper

Extreme Networks CoreFlow2 Technology TECHNOLOGY STRATEGY BRIEF

software networking Jithesh TJ, Santhosh Karipur QuEST Global

Software-Defined Networks Powered by VellOS

Routing & Traffic Analysis for Converged Networks. Filling the Layer 3 Gap in VoIP Management

ALCATEL-LUCENT ENTERPRISE DATA CENTER SWITCHING SOLUTION Automation for the next-generation data center

Ethernet Fabric Requirements for FCoE in the Data Center

2013 ONS Tutorial 2: SDN Market Opportunities

Transcription:

Cisco s Massively Scalable Data Center Network Fabric for Warehouse Scale Computer At-A-Glance Datacenter is the Computer MSDC is the Network Cisco s Massively Scalable Data Center (MSDC) is a framework that data center architects can use to build elastic data centers that host a few applications that are distributed across thousands of servers and that scale from department to internet scale audiences Unlike a traditional data center, which hosts applications deployed in silos, the MSDC data center is characterized by a few very large applications that are distributed over geographically-distributed homogenous pools of compute and storage In essence, these data centers behave less like hosting environments and more like highly optimized computers The industry refers to these data centers as warehouse-scale computers (WSC) Cisco s MSDC framework is a blueprint for building the network fabric of these WSCs and is grounded in a reference architecture that Cisco has deployed in a laboratory environment A typical network fabric for WSC provides symmetric bi-section bandwidth between any two nodes that are attached to the fabric It is built using identical switching platforms and is deployed in a Clos topology with variable Clos stages with short spine layer serving as the switch fabric to an up to 2X longer leaf layer serving as the server access layer Theoretically, a three-stage fat tree Clos (with a 16-wide spine) architecture using 32 port switches can connect up to 8192 servers Cisco s MSDC reference architecture is deployed in a three-stage Clos topology using a Nexus 7000 platform used as spine and a Nexus 3000 platform used as leaf Table 1 shows the server host ports realized at different design points in the MSDC reference architecture The following sections discuss the design choices that drove the reference architecture Design Choices Topology and Architecture If we subject a network topology design to the universality theorem (for a given number of switches, the most optimal network exists), we quickly conclude that existing network designs, which are optimized for transporting data into and out of a data center, are not the most optimal By imposing a logical tree topology on a multi-path physical network, existing topologies do not use all the available paths between two end points Cisco MSDC uses a multi-rooted topology, which actively manages the multiple paths available between two end points using Equal Cost Multipathing (ECMP) In addition, the selection of the high port count platform, ie, Nexus 7000, as the spine enables a deployment of folded-clos (or fat-trees) without using additional switching components The key characteristics of the Cisco MSDC topology are: Multiple design and deployment options enabled using variable length spine with ECMP to leverage multiple available paths between leaf and spine Future proofing for higher performance folded Clos network through the selection of high radix (or port count) switches at the spine Reduced network congestion using a large, shared buffer-switching platform for leaf node Network congestion results when multiple packets inside a switch request the same outbound port because of application architecture Lower power consumption using multiple 10Gbps between spine and leaf (in a folded Clos topology) instead of a single 40Gbps link The current power consumption of a 40Gbps optics is more than 10X a single 10Gbps Control Plane Layer 3 Fabric For a given topology, an appropriate control plane is needed to load-balance the traffic and minimize the latency Cisco MSDC uses dynamic Layer 3 protocols such as BGP and OSPF to build the routing table that most efficiently routes a packet from a source to a spine node (the common ancestor of the source and destination) In a Clos topology, multiple paths or ancestors exist between source and destination The selection of ancestor, therefore, is best done at the leaf node That is, a Layer 3 lookup should exist at the leaf node that will result in the selection of the optimal egress port at the leaf The use of both BGP and OSPF protocols in Cisco MSDC control plane was Copyright 2010 Cisco Systems, Inc All rights reserved Cisco, the Cisco logo, Cisco Systems, and the Cisco Systems logo are registered trademarks or trademarks of Cisco Systems, Inc and/or its affiliates in the United States and certain other countries 1 of 6

At-A-Glance Cisco s Massively Scalable Data Center driven by the need to sustain performance as the network scales from a few thousand nodes to a few hundred thousand nodes Using the optimal routing architecture minimizes the congestion in the network by reducing packet collision when multiple packets head for the same egress port The key characteristics of the Cisco MSDC control plane are: Use of BGP to advertise reachability and OSPF to determine the shortest path to it Optimal execution of both protocol stacks to reduce router CPU and memory utilization Layer 3 look up at every hop to optimize the selection of the spine node reducing latency and avoiding congestion It has the additional advantage of not exhausting a scarce resource: the leaf s forwarding table The current architecture serves as a good platform to add future application-specific optimizations through this selection Monitoring and Maintenance Cisco MSDC framework and reference architecture deals with the issue of monitoring, managing, and maintaining a large network through minimal use of SNMP-based infrastructure for monitoring and management The management framework prefers the use of an agentless monitoring system that scales to thousands of nodes The key characteristics of the Cisco MSDC monitoring and maintenance plane are: Use of Ganglia for monitoring host infrastructure through collection of metrics and trends Customization of monitoring using Python extension mechanism to Ganglia Like many other open source and proprietary tools, Ganglia uses RRDTool for graphics Use of Ganglia extension mechanism for in-band plugins to deploy custom python modules to monitor the host environment Use of Nagios to monitor network infrastructure and resource managers Use of Nagios s built-in mechanism to alert on Ganglia metrics Future Programmability and Network Overlays Cisco MSDC architecture does not preclude the future use of an external or built-in controller that could populate the forwarding table at the spine or leaf nodes to introduce adaptive or deterministic load balancing or routing Using the Layer 3 control plane offers the large advantage of reducing the state that a switch holds This offers immense help to an external control by reducing the amount of sync traffic between controller and infrastructure Network overlay initiatives are attempting to remove the one shortcoming of a Layer 3 control plane, which is a Layer 2 extension model for the developers Cisco s MSDC framework can be adapted easily to work with any network overlay technology that crosses the chasm Table 1 Cisco MSDC Topology Options Cisco MSDC Design Options Design Options Small Medium Large Node Nexus 70XX Node Total Switches in Topology 4 + 8 4 + 12 8 + 16 Total Switching Capacity Topology Server Host Port Count at 1:1 Server Host Port Count at 3:1 2 of 6

Cisco s Massively Scalable Data Center At-A-Glance Figure 1 Topology of Cisco MSDC Design Evolution Phase 1 Figure 2 Topology of Cisco MSDC Design Evolution Phase 2 Option 1 N7K/N3064 293246 2x or 1x 10 GE 4x or 1x 10GE IXIA Servers Option 2 N7K/N3064/N3016?/QuickSilver? 293247 Copyright 2010 Cisco Systems, Inc All rights reserved Cisco, the Cisco logo, Cisco Systems, and the Cisco Systems logo are registered trademarks or trademarks of Cisco Systems, Inc and/or its affiliates in the United States and certain other countries 3 of 6

At-A-Glance Cisco s Massively Scalable Data Center Figure 3 Topology of Cisco MSDC Design Evolution Phase 3 Super POD B POD A Option 3 N7K/N3064 293248 Figure 4 Topology of Cisco MSDC Design Evolution Phase 4 Option 4 N7K/N3064/N3048 293249 4 of 6

Cisco s Massively Scalable Data Center At-A-Glance Figure 5 Topology of Cisco MSDC Design Evolution Phase 5 Super POD B POD A Option 5 N7K/N3064/N3048 293250 5 of 6

At-A-Glance Cisco s Massively Scalable Data Center Figure 6 Topology of Cisco MSDC Design Evolution Phase 6 Option 6 N7K/N3064/N3048 Direct 1Gb servers/10gb dual homed 293251 Figure 7 Topology of Cisco MSDC Design Evolution Phase 7 Super POD B POD A 293252 6 of 6