John D. Davis Researcher Microsoft Research Silicon Valley

Similar documents
Data Center Facility Basics

Cloud Computing Is Driving Infrastructure Innovation

Cloud Computing Driving Infrastructure Innovation Intel DCSG Distinguished Speaker Series

Green HPC - Dynamic Power Management in HPC

Data Center Infrastructure Innovation

Green Computing: Datacentres

Green Computing: Datacentres

Server Technology, Inc.

Understanding Date Center. Power Usage Effectiveness

Power Efficiency Metrics for the Top500. Shoaib Kamil and John Shalf CRD/NERSC Lawrence Berkeley National Lab

Datacenter Design Issues

CarbonDecisions. The green data centre. Why becoming a green data centre makes good business sense

Reducing Data Center Energy Consumption

Server Chassis and Triplet Hardware v1.0

BC43: Virtualization and the Green Factor. Ed Harnish

Fabric-defined Datacenters at Cloud-scale. David Gauthier Sr. Director, Datacenter Architecture Microsoft Corporation

Datacenter Efficiency

Microsoft Private Cloud Fast Track Reference Architecture

Motherboard- based Servers versus ATCA- based Servers

Energy Efficiency and Availability Management in Consolidated Data Centers

Consolidating Your Database Infrastructure. Tom Mills Consultant Microsoft Corporation

Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and IBM FlexSystem Enterprise Chassis

Server Consolidation with SQL Server 2008

Solve your IT energy crisis WITH An energy SMArT SoluTIon FroM Dell

EX ECUT IV E ST RAT EG Y BR IE F. Server Design for Microsoft s Cloud Infrastructure. Cloud. Resources

7 Best Practices for Increasing Efficiency, Availability and Capacity. XXXX XXXXXXXX Liebert North America

Power Efficiency Comparison: Cisco UCS 5108 Blade Server Chassis and Dell PowerEdge M1000e Blade Enclosure

opdc Cut your data center OPEX upto 70%

Datacenter Networks Are In My Way

Microsoft s Open CloudServer

Leveraging Renewable Energy in Data Centers: Present and Future

Case study: End-to-end data centre infrastructure management

Five9NS R2100RS Rack Mount Server. Installation Manual

How Microsoft Designs its Cloud-Scale Servers

JNIOR. Overview. Get Connected. Get Results. JNIOR Model 310. JNIOR Model 312. JNIOR Model 314. JNIOR Model 410

Resource Efficient Computing for Warehouse-scale Datacenters

Power efficiency and power management in HP ProLiant servers

Server Platform Optimized for Data Centers

Model Manage Monitor Maximize your Data Center

Cooling Small Server Rooms Can Be. - Jim Magallanes Computer Room Uptime: Uptime Racks:

Liebert GXT MT+ CX UPS, 1000VA VA Intelligent, Reliable UPS Protection. AC Power for Business-Critical Continuity

Electrical Systems. <Presenter>

Increasing Data Center Efficiency through Metering and Monitoring Power Usage

Newsletter 4/2013 Oktober

Server Technology Inc.

Runtime Power Management

Industrial-Grade UPS System Heavy-duty power protection for harsh industrial environments

DECREASING DATACENTER AND OTHER IT ENERGY

TEST CHAPTERS 1 & 2 OPERATING SYSTEMS

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

A SOLAR GUIDE - EVERYTHING YOU NEED TO KNOW

IOS110. Virtualization 5/27/2014 1

SUN ORACLE EXADATA STORAGE SERVER

Dan Reed Multicore and Scalable Computing Strategist Managing Director, Cloud Computing Futures

Data Center Trend: Distributed Power in the White Space

Open Rack Charter. Authors: Gio Coglitore, Pierluigi Sarti, John Kenevey, Pete Bratach

VMware Server 2.0 Essentials. Virtualization Deployment and Management

Solar Photovoltaic Frequently Asked Questions

HP ProLiant power supply technology

PC Personal 120V 600VA 375W Standby UPS with Pure Sine Wave Output, Tower, 6 Outlets

Concepts Introduced in Chapter 6. Warehouse-Scale Computers. Important Design Factors for WSCs. Programming Models for WSCs

January 16-17, 2013 Santa Clara

Optimizing IT costs with Cloud. Tarmo Tikerpäe Datacenter SSP Microsoft Corporation

1.44 kw Programmable DC Power Supplies XLN Series

COOLSIDE box R410A. conditioned server - racks for direct installation into the room INVERTER E C

UCS M-Series Modular Servers

Introducing UNSW s R1 Data Centre

Daker DK 1, 2, 3 kva. Manuel d installation Installation manual. Part. LE05334AC-07/13-01 GF

An Oracle White Paper October Oracle Database Appliance

Measuring Processor Power

SmartPro 230V 1000VA 900W Line-Interactive Sine Wave UPS, SNMP, Webcard, 2U Rack/Tower, USB, DB9 Serial

SUN ORACLE DATABASE MACHINE

DATA CENTER UNIQUE CHARACTERISTICS AND THE ROLE OF MEDIUM VOLTAGE VACUUM CIRCUIT BREAKERS

IT Cost Savings Audit. Information Technology Report. Prepared for XYZ Inc. November 2009

Is Hyperconverged Cost-Competitive with the Cloud?

Data Center Solutions - The Containerized Evolution

Last time. Data Center as a Computer. Today. Data Center Construction (and management)

Server Farms. CIS Emerging Technology Research Paper

Figure 1A: Dell server and accessories Figure 1B: HP server and accessories Figure 1C: IBM server and accessories

HP UPS R1500 Generation 3

Power Benchmarking: A New Methodology for Analyzing Performance by Applying Energy Efficiency Metrics

TechTarget Virtualization Media. E-Guide

QuickSpecs. What's New New GB Pluggable Ultra320 SCSI 15,000 rpm Universal Hard Drive (1") HP SCSI Ultra320 Hard Drive Option Kits (Servers)

LTR Series Uninterruptible Power Systems 700 VA KVA. General Specification

StorTrends 3400 Hardware Guide for Onsite Support

Virtualization and the U2 Databases

Mobile Visual Analytics for Data Center Power and Cooling Management

Data Centre Testing and Commissioning

AIRAH Presentation April 30 th 2014

HPC TCO: Cooling and Computer Room Efficiency

IBM BladeCenter S Big benefits for the small office

QPS,KW- hr,mtbf, T,PUE,IOPS,DB/RH: A Day in a Life of a Datacenter Architect

Data Sheet FUJITSU Server PRIMERGY CX420 S1 Out-of-the-box Dual Node Cluster Server

Convergence of Complexity

SOLARCARE SERIES PRODUCT AND APPLICATION GUIDE

Cloud Servers in the Datacenter: The Evolution of Density-Optimized

SOLAR TECHNOLOGY CHRIS PRICE TECHNICAL SERVICES OFFICER BIMOSE TRIBAL COUNCIL

HP ProLiant DL585 G5 earns #1 virtualization performance record on VMmark Benchmark

Calculating power requirements for HP ProLiant rack-mounted systems

Green IT Solutions From Desktop to Datacenter. Dave Spear Sr. Engagement Manager & Sustainability Advocate dave.spear@microsoft.

Transcription:

John D. Davis Researcher Microsoft Research Silicon Valley

Power & Related Costs Dominate Facility: ~$200M for 15MW facility (15-year amort.) Servers: ~$2k/each, roughly 50,000 (3-year amort.) Average server power draw at 30% utilization: 80% Commercial Power: ~$0.07/KWhr $1,042,440 $1,296,902 Observations: $284,682 Monthly Costs Servers $2,997,090 Power & Cooling Infrastructure Power Other Infrastructure $2.3M/month from charges functionally related to power Power related costs trending flat or up while server costs trending down Details at: http://perspectives.mvdirona.com/2008/11/28/costofpowerinlargescaledatacenters.aspx Courtesy: James Hamilton, ISCA 2009

Understanding the applications Application-driven design Software re-engineering Discovering what matters So many metrics, so little time Energy efficient hardware Low-power processors are NOT the solution Data Center design requires full system engineering

Taking an application-driven approach to DC design Internal: Search, web server, file system, etc. SPEC: CPU2006, Power, and many others Dryad/DryadLinq applications Joulesort TPC-* Re-engineer software Remove/Reduce/Consolidate heartbeats How do we understand what is important?

ETW Framework 100 s of metrics Performance Counters VTUNE (Intel) Code Analyst (AMD) Need visualization tools Find the needle in the haystack Machine learning and other techniques Identify correlations and significant metrics

Performance analysis of distributed systems Modular, extensible, and interactive Courtesy: Mihai Budiu

Measure application energy usage using performance events Source code Workload JouleMeter Optimizations Detailed Energy Profile Designer Insights (Eg. Network is holding up the CPU in method X ) Run-time control (Eg. Tune QoS) Auto-optimize (Eg. Compiler warning: Change line 136 in Program.c to save ) Courtesy: Aman Kansal

Understanding the applications Application-driven design Software re-engineering Discovering what matters So many metrics, so little time Energy efficient hardware Low-power processors are NOT the solution

Building a DC Servers Power distribution Cooling Packaging Networking All of these can be improved to reduce both capital and operating cost.

We currently use commodity servers designed by HP, Rackable, others. Higher quality and reliability than the average PC, but they still operate in the PC ecosystem. IBM doesn t. Why not roll our own?

Minimize SKUs One for computing. Lots of CPU, Lots of memory, relatively few disks. One for storage. Modest CPU, memory, lots of disks. Maybe Flash memory has a home here. What do they look like? Custom motherboards. Redesign the power supply. Commodity disks. Cabling exits the front panel. Error correction where possible Processor dictated by workload data.

Need to minimize conversion steps to minimize losses. Deliver 3-phase AC to the rack Must balance the phases anyway Lower ripple after rectification What voltage? TBD, but probably 12-20 VAC. Select to maximize overall efficiency

Once-through air cooling is possible in some locations. Unfortunately, data centers tend to be built in inhospitable places. Air must be filtered. Designs are not compatible with side-to-side airflow. Cooling towers are well understood technology. And need not be used all the time. Once-through water cooling is attractive. Pump water from a river, use it once, sell the output to farms.

Use a shipping container and build a parking lot instead of a building. Doesn t need to be human-friendly. Might never open it. Assembled at one location, computers and all. A global shipping infrastructure already exists. Sun s version uses a 20-foot box. 40 would be better. Requires only networking, power, and cooled water. Expands as needed, in sensible increments. Rackable has a similar system. So does Google.

Side-to-side airflow is not impeded by the server case. There is no case. With bottom-to-top, servers at the top are hotter. With front-to-back, must provide hot and cold plenums. The server packaging is simplified, since they are not shipped separately. Can incorporate shock mounting at the server, not the rack level. Cables exit at the front, simplifying assembly and service. Most of this also applies to conventional data centers.

A 40 container holds two rows of 16 racks. Each rack holds 40 1U servers, plus network switch. Total container: 1280 servers. If each server draws 200W, the rack is 8KW, the container is 256 KW. A 64-container data center is 16 MW, plus cooling. Contains 82K computers. Each container has independent fire suppression. Reduces insurance cost.

By treating data centers as systems, and doing full-system optimization, we can achieve: More energy efficient systems. Lower cost, both opex and capex. Higher reliability. Incremental scale-out. More rapid innovation as technology improves.

2007 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Commodity hardware is cheaper This is commodity hardware. Even for one center (and we build many). And in the case of the network, it s not cheaper. Large switches command very large margins. Standards are better Yes, but only if they do what you need, at acceptable cost. It requires too many different skills Not as many as you might think. And we would work with engineering/manufacturing partners who would be the ultimate manufacturers. This model has worked before. If this stuff is so great, why aren t others doing it? They are.

James Hamilton ISCA 2009 Keynote: Internet-Scale Service Infrastructure Efficiency http://mvdirona.com/jrh/talksandpapers/jameshamilton_isca2009.pdf Power and Total Power Usage Effectiveness (tpue) http://perspectives.mvdirona.com/2009/06/15/pueandtotalpowerusageefficiencytpue.aspx Berkeley Above the Clouds http://perspectives.mvdirona.com/2009/02/13/berkeleyabovetheclouds.aspx Degraded Operations Mode http://perspectives.mvdirona.com/2008/08/31/degradedoperationsmode.aspx Cost of Power http://perspectives.mvdirona.com/2008/11/28/costofpowerinlargescaledatacenters.aspx http://perspectives.mvdirona.com/2008/12/06/annualfullyburdenedcostofpower.aspx Power Optimization: http://labs.google.com/papers/power_provisioning.pdf Cooperative, Expendable, Microslice Servers http://perspectives.mvdirona.com/2009/01/15/thecaseforlowcostlowpowerservers.aspx Power Proportionality http://www.barroso.org/publications/ieee_computer07.pdf Resource Consumption Shaping: http://perspectives.mvdirona.com/2008/12/17/resourceconsumptionshaping.aspx

2000 2003 2006 Electricity (billion kwh/yr) 60 50 40 30 20 10 0 E. Cost: $4.5b Energy usage growing at 14% yearly Data Center energy (excluding small DC s, office IT equip.) equals Electricity used by the entire U.S transportation manufacturing industry (manufacture of automobiles, aircraft, trucks, and ships)

Courtesy: Mihai Budiu

208V 8% distribution loss.997^3*.94*.99 = 92.2% IT LOAD 115kv 2.5MW Generator ~180 Gallons/hour UPS: Rotary or Battery 13.2kv ~1% loss in switch Gear and conductors 13.2kv 13.2kv 480V 0.3% loss 99.7% efficient 6% loss 94% efficient, >97% available http:// perspectives.mvdirona.com 0.3% loss 99.7% efficient 0.3% loss 99.7% efficient

Need to minimize conversion steps to minimize losses. Power supplies aren t very efficient: 12VDC -> 1VDC point-of-load regulators are ~90%. AC -> 12VDC converters are now 2-stage (power factor correction, inverter). 85% efficient at full load, lower at low load. Can do better. AC transformers are 98% efficient. Two steps needed. Final efficiency, grid to chips/disks: ~80%. UPS and backup generators aren t part of the picture until the grid fails.