Datacenter Networks Are In My Way



Similar documents
Data Center Infrastructure Innovation

Lecture 7: Data Center Networks"

Cloud Computing Driving Infrastructure Innovation Intel DCSG Distinguished Speaker Series

BURSTING DATA BETWEEN DATA CENTERS CASE FOR TRANSPORT SDN

CS244 Lecture 5 Architecture and Principles

Why Software Defined Networking (SDN)? Boyan Sotirov

Network Virtualization and Data Center Networks Data Center Virtualization - Basics. Qin Yin Fall Semester 2013

THE REVOLUTION TOWARDS SOFTWARE- DEFINED NETWORKING

Data Center Fabrics What Really Matters. Ivan Pepelnjak NIL Data Communications

Open Source Network: Software-Defined Networking (SDN) and OpenFlow

Cloud Computing Is Driving Infrastructure Innovation

Software-Defined Networks Powered by VellOS

Walmart s Data Center. Amadeus Data Center. Google s Data Center. Data Center Evolution 1.0. Data Center Evolution 2.0

Load Balancing Mechanisms in Data Center Networks

SDN/Virtualization and Cloud Computing

Data Center Network Topologies: FatTree

Software Defined Networking and Network Virtualization

Data Center Network Topologies: VL2 (Virtual Layer 2)

OPENFLOW, SDN, OPEN SOURCE AND BARE METAL SWITCHES. Guido Appenzeller (Not representing Anyone)

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

OpenFlow/SDN for IaaS Providers

Panel : Future Data Center Networks

Does SDN accelerate network innovations? Example of Flexible Service Creation

Networking in the Era of Virtualization

SDN CENTRALIZED NETWORK COMMAND AND CONTROL

Advanced Computer Networks. Datacenter Network Fabric

How To Orchestrate The Clouddusing Network With Andn

BROCADE NETWORKING: EXPLORING SOFTWARE-DEFINED NETWORK. Gustavo Barros Systems Engineer Brocade Brasil

Networking Virtualization Using FPGAs

Operating Systems. Cloud Computing and Data Centers

Optimizing Data Center Networks for Cloud Computing

Cloud Optimize Your IT

SDN and Open Ethernet Switches Empower Modern Data Center Networks

Software Defined Networking and Network Virtualization

Software Defined Networking

Software-Defined Networking Architecture Framework for Multi-Tenant Enterprise Cloud Environments

Software Defined Environments

Red Hat Storage Server

The Future of Cloud Networking. Idris T. Vasi

The Software Defined Hybrid Packet Optical Datacenter Network SDN AT LIGHT SPEED TM CALIENT Technologies

Building an Open, Adaptive & Responsive Data Center using OpenDaylight

Data Centers and Cloud Computing. Data Centers

IOS110. Virtualization 5/27/2014 1

LAYER3 HELPS BUILD NEXT GENERATION, HIGH-SPEED, LOW LATENCY, DATA CENTER SOLUTION FOR A LEADING FINANCIAL INSTITUTION IN AFRICA.

Data Center Challenges Building Networks for Agility

Introduction to Network Virtualization in IaaS Cloud. Akane Matsuo, Midokura Japan K.K. LinuxCon Japan 2013 May 31 st, 2013

2013 ONS Tutorial 2: SDN Market Opportunities

基 於 SDN 與 可 程 式 化 硬 體 架 構 之 雲 端 網 路 系 統 交 換 器

Use Cases for the NPS the Revolutionary C-Programmable 7-Layer Network Processor. Sandeep Shah Director, Systems Architecture EZchip

Software Defined Networks

The Role of Virtual Routers In Carrier Networks

Juniper Update Enabling New Network Architectures. Debbie Montano Chief Architect, Gov t, Edu & Medical dmontano@juniper.

Data Center Network Architectures

Last time. Data Center as a Computer. Today. Data Center Construction (and management)

Microsoft s Cloud Networks

Software Defined Networks (SDN)

OpenFlow Technology Investigation Vendors Review on OpenFlow implementation

Flexible SDN Transport Networks With Optical Circuit Switching

A Coordinated. Enterprise Networks Software Defined. and Application Fluent Programmable Networks

(R)Evolution im Software Defined Datacenter Hyper-Converged Infrastructure

Dell Networking ARGOS 24/03/2016. Nicolas Roughol. Networking Sales Engineer. Tel : nicolas_roughol@dell.com

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Topics. Images courtesy of Majd F. Sakr or from Wikipedia unless otherwise noted.

What is SDN? And Why Should I Care? Jim Metzler Vice President Ashton Metzler & Associates

Help! My Big Expensive Router Is Really Expensive!

OpenFlow/SDN activities of NTT Communications

Scalable Network Monitoring with SDN-Based Ethernet Fabrics

Juniper Networks QFabric: Scaling for the Modern Data Center

HAWAII TECH TALK SDN. Paul Deakin Field Systems Engineer

When Does Colocation Become Competitive With The Public Cloud? WHITE PAPER SEPTEMBER 2014

Data Center Infrastructure of the future. Alexei Agueev, Systems Engineer

Architecting Data Center Networks in the era of Big Data and Cloud

SDN AND SECURITY: Why Take Over the Hosts When You Can Take Over the Network

When Does Colocation Become Competitive With The Public Cloud?

SOFTWARE-DEFINED NETWORKING AND OPENFLOW

A Case for Overlays in DCN Virtualization Katherine Barabash, Rami Cohen, David Hadas, Vinit Jain, Renato Recio and Benny Rochwerger IBM

Definition of a White Box. Benefits of White Boxes

Data Center Challenges Building Networks for Agility

Virtualization, SDN and NFV

Windows Server 2003 Migration Guide: Nutanix Webscale Converged Infrastructure Eases Migration

A Presentation at DGI 2014 Government Cloud Computing and Data Center Conference & Expo, Washington, DC. September 18, 2014.

Albert Greenberg Principal Researcher

PortLand:! A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric

White Paper Solarflare High-Performance Computing (HPC) Applications

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

RIDE THE SDN AND CLOUD WAVE WITH CONTRAIL

OpenFlow and Onix. OpenFlow: Enabling Innovation in Campus Networks. The Problem. We also want. How to run experiments in campus networks?

Building a small Data Centre

Software Defined Networking & Openflow

Transcription:

Datacenter Networks Are In My Way Principals of Amazon James Hamilton, 2010.10.28 e: James@amazon.com blog: perspectives.mvdirona.com With Albert Greenberg, Srikanth Kandula, Dave Maltz, Parveen Patel, Sudipta Sengupta, Changhoon Kim, Jagwinder Brar, Justin Pietsch, Tyson Lamoreaux, Dhiren Dedhia, Alan Judge, Dave O'Meara, & Mike Marr

Agenda Datacenter Economics Is Net Gear Really the Problem? Workload Placement Restrictions Hierarchical & Over-Subscribed Net Gear: SUV of the Data Center Mainframe Business Model Manually Configured & Fragile at Scale New Architecture for Networking 2010/10/26 http://perspectives.mvdirona.com 2

Datacenter Economics Assumptions: Facility: ~$72M for 8MW critical power Servers: 46,000 @ $1.45k each Commercial Power: ~$0.07/kWhr Power Usage Effectiveness: 1.45 18% 13% Monthly Costs 4% Servers 57% Networking Equipment Power Distribution & Cooling Power 8% Other Infrastructure 3yr server & 10 yr infrastructure amortization Observations: 31% costs functionally related to power (trending up while server costs down) Networking high at 8% of costs & 19% of total server cost (much more with inter-dc) Source: http://perspectives.mvdirona.com/2010/09/18/overalldatacentercosts.aspx 2010/10/26 http://perspectives.mvdirona.com 3

Is Net Gear Really the Problem? Inside the DC net gear represents only: 8% of the monthly cost 5.2% of the power Improvement needed but not dominant Servers: 64% Power & 57% monthly cost Low server utilization Low: 30% good, 10% common Networking in way of the most vital optimizations Improving server utilization Supporting data intensive analytic workloads Source: http://perspectives.mvdirona.com/2010/09/18/overalldatacentercosts.aspx 2010/10/26 http://perspectives.mvdirona.com 4

Workload placement restrictions Workload placement over-constrained problem Near storage, near app tiers, distant from redundant instances, near customer, same subnet (LB & VM migration restrictions), Goal: all data center locations equidistant High bandwidth between servers anywhere in DC Any workload any place Need to exploit non-correlated growth/shrinkage in workload through dynamic over-provisioning Optimize for server utilization rather than locality We are allowing the network to constrain optimization of the most valuable assets 2010/10/26 http://perspectives.mvdirona.com 5

80 to 240:1 Oversubscription Common Hierarchical & Over-Subscribed Internet Data Center Layer 3 Layer 2 S LB Internet CR Poor net gear price/performance forces oversubscription Oversubscription: Constrains workload placement Support for data intensive workloads poorly MapReduce often moves entire multi-pb dataset during single job MapReduce, HPC, Analysis, MPP database, Conclusion: Need cheap, non-oversubscribed 10Gbps 2010/10/26 http://perspectives.mvdirona.com 6 CR AR AR AR AR S S S S LB S Key: CR = L3 Core Router AR = L3 Access Router S = L2 Switch LB = Load Balancer A = Rack of 20 servers with Top of Rack switch Albert Greenberg Diagram

Net gear: SUV of the data center Net gear incredibly power inefficient Continuing with Juniper EX8216 example: Power consumption: 19.2kW/pair Entire server racks commonly 8kW to 10kW But at 128 ports per switch pair, 150W/port Often used as aggregation switch Assume pair, each with 110 ports down & 40 servers/rack Only: 4.4W/server port in pair configuration Network power consumption is increasing quickly Far from dominant data center issue but still conspicuous consumption 2010/10/26 http://perspectives.mvdirona.com 7

Mainframe Business Model Central Logic Manufacture Proprietary & closely guarded Single source Finished Supply Proprietary & closely guarded Single source System Software Supply Proprietary & closely guarded Single source Application Stack Not supported No programming tools No 3 rd party ecosystem Central Logic Manufacture Standard design (x86) Multiple source AMD, Intel, Via, Finished Supply Standard design Multiple source Dell, SGI, HP, IBM, System Software Supply Linux (many distros/support) Windows & other proprietary offerings Application Stack Public/published APIs High quality prog tools Rich 3 rd party ecosystem Net Equipment Commodity Server Example: Juniper EX 8216 (used in core or aggregation layers) Fully configured list: $716k w/o optics and $908k with optics Solution: Merchant silicon, H/W independence, open source protocol/mgmt stack 2010/10/26 http://perspectives.mvdirona.com 8

Manually Configured & Fragile at Scale Unaffordable, scale-up model yields 2-way redundancy ROC needs more than 2-way Brownout & partial failure common Unhealthy equipment continues to operate & drop packets Complex protocol stacks, proprietary extensions, and proprietary mgmt Norm is error-prone manual configuration Networking distributed management model Complex & slow to converge Central, net & app aware mgmt is practical even in large DCs (50k+ servers) Need application input (QOS, priorities, requirements,.) Scale-up reliability gets expensive faster than reliable Asymptotically approaches unaffordable but never gets to good enough ROC management techniques work best with more than 2-way redundancy 2010/10/26 http://perspectives.mvdirona.com 9

X-ceiver (SerDes) What Enables New Solution Now? General purpose CPU for control plane Switch-on-a-chip ASIC tables pipeline Packet buffer Memory Last year: 24 port 1G, 4 10G 16K IPv4 fwd entries, 2 MB buff This year: 48 port 10G, 16K fwd entries, 4 MB buff Next year: 64 to 96 port 10G ASIC cost roughly constant while port count scales every 18 to 24 months Slide: Albert Greenberg 2010/10/26 http://perspectives.mvdirona.com 10

Software-Defined Networks Application Application Application Application Network Operating System Slide: Nick McKeown 2010/10/26 http://perspectives.mvdirona.com 11

OpenFlow/Onix Metering Routing QOS Monitoring Network Operating System (Onix) Openflow Openflow Openflow Openflow Openflow 2010/10/26 http://perspectives.mvdirona.com 12

Summary Again we learn: scale-up doesn t work single-source, vertically integrated is bad idea Ingredients for solution near: Merchant silicon broadly available Distributed systems techniques Standardized H/W platform layer (OpenFlow) Need an open source protocol & mgmt stack 2010/10/26 http://perspectives.mvdirona.com 13

More Information VL2: A Scalable and Flexible Data Center Network http://research.microsoft.com/pubs/80693/vl2-sigcomm09-final.pdf Cost of a Cloud: Research Problems in Data Center Networks http://ccr.sigcomm.org/online/files/p68-v39n1o-greenberg.pdf PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric http://cseweb.ucsd.edu/~vahdat/papers/portland-sigcomm09.pdf OpenFlow Switch Consortium http://www.openflowswitch.org/ Next Generation Data Center Architecture: Scalability & Commoditization http://research.microsoft.com/en-us/um/people/dmaltz/papers/monsoon-presto08.pdf A Scalable, Commodity Data Center Network http://cseweb.ucsd.edu/~vahdat/papers/sigcomm08.pdf Data Center Switch Architecture in the Age of Merchant Silicone http://www.nathanfarrington.com/pdf/merchant_silicon-hoti09.pdf Berkeley Above the Clouds http://perspectives.mvdirona.com/2009/02/13/berkeleyabovetheclouds.aspx Blog & Email: http://perspectives.mvdirona.com & James@amazon.com 2010/10/26 http://perspectives.mvdirona.com 14