Data Mining for Sustainable Data Centers



Similar documents
Data Mining for Modeling Chiller Systems in Data Centers

Automated Synthesis of Sustainable Data Centers

Visual exploration of frequent patterns in multivariate time series

Fine-grained Photovoltaic Output Prediction using a Bayesian Ensemble

Visual Exploration of Frequent Patterns in Multivariate Time Series

Renewable and Cooling Aware Workload Management for Sustainable Data Centers

Data analysis, Visualization and Knowledge Discovery in Sustainable Data Centers

Data centers & energy: Did we get it backwards? Adam Wierman, Caltech

Cost Model for Planning, Development and Operation of a Data Center

Algorithms for sustainable data centers

Stochastic Model Predictive Control for data centers

Energy Efficient Cloud Computing: Challenges and Solutions

supported Application QoS in Shared Resource Pools

Data Center Optimization Methodology to Maximize the Usage of Locally Produced Renewable Energy

Model-Based Approach for Optimizing a Data Center Centralized Cooling System

Energy Smart Data Center Phase-III R&D. Andrés Márquez. Pacific Northwest National Laboratory

Vitocaldens 222-F Compact Gas-Hybrid appliance

Data Center RD&D at EPRI

Solve your IT energy crisis WITH An energy SMArT SoluTIon FroM Dell

Data Center Infrastructure Management: ERP for the Data Center Manager

Control Strategies for Plenum Optimization in Raised Floor Data Centers

Leveraging Thermal Storage to Cut the Electricity Bill for Datacenter Cooling

Enabling an agile Data Centre in a (Fr)agile market

Green HPC - Dynamic Power Management in HPC

Energy Efficient Systems

Bytes and BTUs: Holistic Approaches to Data Center Energy Efficiency. Steve Hammond NREL

environmental sustainability at HP HP Sustainability Helping Norfolk County Council reduce environmental and expand opportunities

ICT and the Green Data Centre

Comparative Analysis of EM Clustering Algorithm and Density Based Clustering Algorithm Using WEKA tool.

THE WORLD S MOST EFFICIENT DATA CENTER

Thermal Management of Datacenter

Page 1. White Paper, December 2012 Gerry Conway. How IT-CMF can increase the Energy Efficiency of Data Centres

Demand Response in Data Centers Feasibility and Proof of Concept Demonstration

DATA CENTER ASSESSMENT SERVICES

DataCenter 2020: first results for energy-optimization at existing data centers

Green ICT: Consistent Actions to Reduce Energy Consumption

Bayesian networks - Time-series models - Apache Spark & Scala

Harmonizing Global Metrics for Data Center Energy Efficiency

Comparing the Carbon Footprints of 11G and 12G Rack Servers from Dell

DATA CENTER COOLING INNOVATIVE COOLING TECHNOLOGIES FOR YOUR DATA CENTER

Setting deadlines and priorities to the tasks to improve energy efficiency in cloud computing

The CEETHERM Data Center Laboratory

How To Schedule Cloud Computing On A Data Center

Smarter Data Centers. Integrated Data Center Service Management

Design Best Practices for Data Centers

Energy Efficient Thermal Management of Data Centers

FloVENT. Optimizing Data Center Cooling by Simulation. Mechanical Analysis

Direct Fresh Air Free Cooling of Data Centres

Energy management White paper. Greening the data center with IBM Tivoli software: an integrated approach to managing energy.

Trends in Data Center Design ASHRAE Leads the Way to Large Energy Savings

Modular Data Centre. Flexible, scalable high performance modular data centre solutions. Data Centre Solutions...energy efficient by design

Data Center & IT Infrastructure Optimization. Trends & Best Practices. Mickey Iqbal - IBM Distinguished Engineer. IBM Global Technology Services

Dynamic Virtual Machine Allocation in Cloud Server Facility Systems with Renewable Energy Sources

Challenges In Intelligent Management Of Power And Cooling Towards Sustainable Data Centre

ENERGY EFFICIENT AND REDUCTION OF POWER COST IN GEOGRAPHICALLY DISTRIBUTED DATA CARDS

Paul Oliver Sales Director. Paul Oliver, Airedale Tom Absalom, JCA

Understanding Power Usage Effectiveness (PUE) & Data Center Infrastructure Management (DCIM)

Measuring Power in your Data Center: The Roadmap to your PUE and Carbon Footprint

High-density modular data center making cloud easier. Frank Zheng Huawei

OPTIMIZATION OF DATA CENTER CHILLED WATER COOLING SYSTEM ACCORDING TO ANNUAL POWER CONSUMPTION CRITERION

Energy Efficient Data Centre at Imperial College. M. Okan Kibaroglu IT Production Services Manager Imperial College London.

Effect of Rack Server Population on Temperatures in Data Centers

7 Best Practices for Increasing Efficiency, Availability and Capacity. XXXX XXXXXXXX Liebert North America

Clustering Data Streams

Guideline for Water and Energy Considerations During Federal Data Center Consolidations

Convergence is accelerating the path to the New Style of Business

Information Management course

Data Center Infrastructure Management: How to Optimize Data Center Operations

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Georges Da Costa IRIT / Toulouse University GreenDays@Lille Green Data Centers: The European Sky

End-to-end management

ConSil: Low-cost Thermal Mapping of Data Centers

How High Temperature Data Centers & Intel Technologies save Energy, Money, Water and Greenhouse Gas Emissions

Dallas/Fort Worth International Airport District Energy Plant Upgrades Project Making More with Less Sustainable Communities Conference Dallas, TX

Transcription:

Data Mining for Sustainable Data Centers Manish Marwah Senior Research Scientist Hewlett Packard Laboratories manish.marwah@hp.com 2009 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice

Overview Sustainability and Data Centers Data mining applications Chiller operation characterization PV prediction Anomaly detection 2

Motivation Industry challenge: Create technologies, IT infrastructure and business models for the low-carbon economy IT industry 2% Total carbon emissions Aviation 2% The footprint of IT will need to be reduced quite significantly in a low-carbon economy. 3 3 June 2013

Sustainability sustainable development is development that meets the needs of the present without compromising the ability of future generations to meet their own needs the Brundtland Commission of the United Nations, 1987 4 3 June 2013

Sustainability What do I mean by sustainability? Risk: Ecological Damage Social ( People ) Sustainable Risk: Commercially unfeasibility Economic ( Profit ) Environmental ( Planet ) Risk: Limited Adoption 5 4 June 2013 Figure Credit: A. Agogino, UC Berkeley

Environmental Sustainability Impact factors (e.g., carbon, water, toxicity, etc.) Life Cycle View 6 3 June 2013

Handheld Notebook Desktop Blade Server Data Center Fraction of Lifecycle Energy Sustainable Data Centers Lifecycle Assessment 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% Operational Embedded Ref: IEEE Computer 2009 Results are illustrative only. Actual footprint may differ. 7 3 June 2013

Cloud Data Center Supply and Demand Side Q data center Cooling Power Warm Water Air Mixture In Cooling Tower loop W p Switch Gear Makeup Water Return Water Q Cond UPS PDU Computing CHILLER Q Evap W p Chilled Water loop 8 Copyright 2011 Hewlett-Packard Company Chandrakant D. Patel; chandrakant.patel@hp.com Data Center

Power(kW) power (KW) Supply and Demand in a Data Center IT Services PV Onsite Power Grid Biogas 1.2 1 0.8 PV Supply 1 0.5 critical workload non-critical workload cooling power 0.6 0.4 0.2 0 6 12 18 24 Time(hour) 0 6 12 18 24 time (hour) Ecosystem of Clients Mechanical Cooling Supply Local Cooling Grid Demand IT Infrastructure Outside Air 9 U Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Ground Loops

Sustainable Operation and Management of Chillers using Temporal Data Mining (KDD 09) 10 Data Centers Cooling Infrastructure Problem Statement Prior Work Our Approach Symbolic representation Event encoding Motif mining Sustainability characterization Experimental Results Summary

Data Center Cooling Infrastructure Consumes from 1/3 up to 1/2 of total power consumption Cooling Towers Water Return (T in ) Chiller Unit Water Supply (T out ) 11 Computer room air-conditioner (CRAC)

Ensemble of Chillers Challenging to operate efficiently Complex physical system Dynamic Heterogeneous Inter-dependencies Many constraints Accurate models not available Rapid cycles undesirable reduce lifespan Domain experts determine settings based on heuristics Can it be automated through a datadriven approach? Chiller Ensemble Which unit to turn ON/OFF? At what utilization? How to handle increase/decrease in cooling load? 12

Problem Statement Given the following chiller time series utilization levels power consumption cooling loads Is it possible to determine which operational settings are more energy efficient? And then use this information to advise data center facility operators 13

Some Terminology IT cooling load Chiller utilization Chiller power consumption Coefficient of performance (COP) Cooling Load Power consumption 14

Prior Work Classical approaches to model time series data Principal component analysis Discrete Fourier transforms Discrete representations: SAX [Keogh et al.] Motifs: Repeating subsequences [Yankov et al.] 15

Our approach Goal: Sustainability characterization of multi- variate time series data Chiller utilization data Four Main Steps Symbolic representation Event encoding Motif mining Sustainability Characterization Other discrete data sources can be integrated Multivariate Time Series Data Cluster Analysis Event Encoding Frequent Motif Mining Sustainability characterization of frequent motifs Symbolic representation Transition-event sequence Frequent motifs 16

Clustering Individual vector: Utilization across all chiller units Raw Data: Sequence of such vectors Perform k-means clustering Use cluster labels to encode multi-variate time series 17

Some Definitions 18 Event Sequence Episode ( E, t ),( E 1 2, t 2 E i = Event type t i = Time of occurrence Ordered collection of events occurring together Episode occurrence Events same ordering as episode in the data. Motifs Frequently occurring episodes ),...,( E N, t 1 N ( A,1),( B,3),( D,4),( C,6),( A,12),( E,14),( B,15),( D,17),( C,20),( A,21) A B <(A,1), (B,3), (D,4), (C,6), (E,12), (A,14), (B,15), (C,17)> ) C

Redescribing time series data Perform run-length encoding: Note transitions from one symbol to another Higher level of abstraction Transition events 19

Level-wise (Apriori-based) motif mining Candidate generation followed by counting 20 3 June 2013

Multi-variate time-series Methodology Summary Vector representation Clustering Discrete representation of chiller ensemble time-series aabbbbbaaaxaaacccccaaaaabbbbbbaaeaaaaaacccccbggaaa Transition Encoding aabbbbbaaaxaaacccccaaaaabbbbbbaaeaaaaaacccccbggaaa Frequent Episode Mining Occurrence #1 Occurrence #2 Motif ab->ba->ac 21

Sustainability characterization of Motifs Average motif COP (coefficient of performance) Indicates cooling efficiency of a chiller unit COP = IT Cooling Load Power consumed Frequency of oscillations of a motif Impacts chiller lifespan Normalized number of mean-crossings 22

Experimental Results Data From HP R&D data center in Bangalore 70,000 sq ft 2000 racks of IT equipments Ensemble of five chiller units 3 air cooled chillers 2 water cooled chillers 480 hours of data July 2 7, Nov 27 30, Dec 16 26, 2008 22 motifs found in the data 23

A Motif Detailed Example (1/3) Time Series Symbol seq: 11,11,11,8,8,8,8,10,10,10,10,10,10,10,10,10,10,10,10,10,10,10,13,14,12,12,11,11,11 Encoded seq: 11-8,6637 8-10,6641 10-13,6656 13-14,6657 14-12,6658 12-11,6660 4 15 1 1 2 Transition Motif: [ 11-8, 8-10, 10-13, 13-14, 14-12, 12-11 ] Inter-transition gap constraint = 20 min 24

A Motif Detailed Example (2/3) Time Series Symbol seq: 11,11,11,8,8,8,10,10,10,10,10,10,10,10,10,10,10,10,10,13,13,14,12,12,11,11,11 Encoded seq: 11-8,6698 8-10,6701 10-13,6714 13-14,6716 14-12,6717 12-11,6719 3 13 2 1 2 Transition Motif: [ 11-8, 8-10, 10-13, 13-14, 14-12, 12-11 ] Inter-transition gap constraint = 20 min 25

A Motif Detailed Example (3/3) Time Series Symbol seq: 11,11,11,8,8,8,10,10,10,10,10,10,10,10,10,10,10,10,10,10,13,14,12,12,12,11,11,11 Encoded seq: 11-8,6758 8-10,6761 10-13,6775 13-14,6776 14-12,6777 12-11,6780 3 14 1 1 3 Transition Motif: [ 11-8, 8-10, 10-13, 13-14, 14-12, 12-11 ] Inter-transition gap constraint = 20 min 26

Two Interesting Motifs Chiller C1 18% 34% C2 49% 11% C3 44% 0% C4 0% 66% C5 0% 0% C1, C2, C3 Air cooled C4, C5 Water cooled Time (min) Motif 8 Motif 5 Motif 8 Motif 5 COP 4.87 5.40 Units operating 3 air-cooled 2 air-cooled, 1 water cooled 27

Potential Savings Annual saving from operating in Motif 5 instead of Motif 8 Cost savings = $40,000 (~10%) Carbon footprint savings = 287,328 kg of CO 2 28

Summary Data center chillers consume substantial power Ensemble of chillers part of data center cooling infrastructure are challenging to operate energy efficiently Mine and characterize motifs Symbolic representation Event encoding Motif mining Sustainability characterization Demonstrated our approach on data from a real data center indicates significant potential energy savings 29

Chiller COP The Net-Zero Energy Data Center Implementation in Palo Alto PV micro grid Data center Outside air Cooling infrastructure power demand Load (kw) Outside Air Temperature ( C) Data center supply side Data center demand side 30 Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

power (KW) power (KW) Power (kw) Power(kW) Net-Zero Energy Methodology and Integration Prediction 1.2 1 PV Supply Planning Supply-side Prediction Renewable power prediction Cooling capacity prediction 0.8 0.6 0.4 0.2 0 0.35 0.3 0.25 0.2 0.15 6 12 18 24 Time(hour) Outside Air Cooling Available Capacity DC Operation Objectives: Net-zero energy operation Maximize use of renewable energy Minimize dependability on Grid IT Demand Prediction 0.1 0 5 10 15 20 Time (hour) 1 0.5 critical workload non-critical workload cooling power IT Workload Planning Integrate Supply and Demand Side IT Demand Shaping Power capping 0 6 12 18 24 time (hour) Verification and Reporting Measurement Verification 1.2 0.8 0.6 0.4 0.2 Execution 1 0 critical workload non-critical workload cooling power renewable supply 6 12 18 24 time (hour) Dynamic IT Provisioning Ref: Z. Liu, Copyright Y. Chen, 2012 C. Bash, Hewlett-Packard A. Wierman, Development D. Gmach, 31 Company, Z. Wang, L.P. M. Marwah, The information C. Hyser, contained "Renewable herein is and subject Cooling to change Aware without notice. Workload Management for Sustainable Data Centers", ACM SIGMETRICS/Performance, June 11-15 2012, London, UK. Dynamic Cooling Provisioning

CPU Demand (number of CPUs) Power (kw) PV Power (kw) Prediction: Summary PV Supply Prediction Search for most similar days in the recent past Hourly generation estimated from corresponding hours of similar days Ref: P. Chakraborty, M. Marwah, M. Arlitt, N. Ramakrishnan, Fine-grained Photovoltaic Output Prediction Using a Bayesian Ensemble, in Proceedings of the 26th Conference on Artificial Intelligence (AAAI'12), Toronto, Canada, July 2012 Cooling Capacity Prediction End-to-End Energy Modeling 1 0.8 0.6 0.4 0.2 0 0.35 0.3 0.25 0.2 actual power predicted power 5 10 15 20 time (hour) Outside Air Cooling Available Capacity Ref: Breen, T.J. et. al. From Chip to Cooling Tower Data Center Modeling: Validation of Multi-Scale Energy Management Model, Proceedings of Itherm, June 2012 IT Workload Prediction 0.15 0.1 0 5 10 15 20 Time (hour) 20 15 predicted workload actual workload Perform a periodicity analysis (e.g., Fast Fourier Transform) Use an auto-regressive model to predict workload from historical data 10 5 32 Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. 0 6 12 18 24 time (hour)

Power(kW) Fine grained PV Prediction using Bayesian Ensemble Motivation Integration of renewable sources is an important goal of the smart grid effort PV output is variable and intermittent Knowledge of future PV output enables demand-side management and shaping in data centers Problem addressed 1.2 1 0.8 PV Supply Predict PV output for the next day 0.6 0.4 0.2 Data 0 6 12 18 24 Time(hour) Historical PV output data for about 9 months from the HPL Palo Alto site Weather data (AAAI 2012)

Fine grained PV Prediction using Bayesian Ensemble Approach Extract daily profiles from training data Use ensemble of predictors Naïve Bayes K-NN Motif based Perform Bayesian model averaging Results (AAAI 2012)

Fine grained PV Prediction using Bayesian Ensemble Results Error by weather condition Actual versus predicted (AAAI 2012)

power (KW) power (KW) Planning: Supply-Side Aware IT Workload Planning Planning Flow Demand Shaping electricity price renewable supply cooling capacity efficiency energy storage IT workload & SLAs goals 1 0.5 Demand critical workload non-critical workload cooling power demand shaping 1.2 1 0.8 0.6 0.4 Optimal Net Zero Plan critical workload non-critical workload cooling power renewable supply 0.2 Workload Planning 0 6 12 18 24 time (hour) 0 6 12 18 24 time (hour) A detailed workload scheduling and capacity allocation plan Workload scheduling plan IT resource and power capacity allocation Cooling micro-grid capacity allocation Overall demand is shaped according to input constraints and operation objectives Satisfy critical workload resource requirements Ref: Z. Liu, Y. Chen, C. Bash, A. Wierman, D. Gmach, Z. Wang, M. Marwah, C. Hyser, "Renewable and Cooling Aware Workload Management for Sustainable Data Centers", ACM SIGMETRICS/Performance, June 11-15 2012, London, UK. 36 Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Power and Workload Visualization medium low Before optimization After optimization

Some other projects Anomaly detection (SensorKDD 2010) Energy Disaggregation (SDM 2011, AAAI 2013) Automating Life Cycle Assessment (IEEE Computer 2011) Building Energy Management (BuildSys 2011) 38 3 June 2013

Anomalous Thermal Behavior Detection using PCA Example: Event (Anomaly) Detection Period of increased energy consumption (17 % increase) Rack D5 Switch turned on Normal energy consumption Network Switch Start: 2009-09-28 16:44:34 End: 2009-09-28 23:58:34 39 2009 (SensorKDD 2010)

Energy Disaggregation 40 2009

Proposed Variant of Factorial HMM s (SDM 2011) 41 2009

References P. Chakraborty, M. Marwah, M. Arlitt, and N. Ramakrishnan. Fine-grained Photovoltaic Output Prediction using a Bayesian Ensemble, in Proceedings of the 26th Conference on Artificial Intelligence (AAAI'12), Toronto, Canada, 7 pages, July 2012, To appear. Z. Liu, Y. Chen, C. Bash, A. Wierman, D. Gmach, Z. Wang, M. Marwah, C. Hyser, "Renewable and Cooling Aware Workload Management for Sustainable Data Centers", ACM SIGMETRICS/Performance, June 11-15 2012, London, UK. Manish Marwah, Amip Shah, Cullen Bash, Chandrakant Patel, Naren Ramakrishnan, "Using Data Mining to Help Design Sustainable Products," IEEE Computer, August 2011 Hyungsul Kim, Manish Marwah, Martin Arlitt, Geoff Lyon and Jiawei Han, "Unsupervised Disaggregation of Low Frequency Power Measurements", SIAM International Conference on Data Mining (SDM 11), Mesa, Arizona, April 28-30, 2011. Gowtham Bellala, Manish Marwah, Martin Arlitt, Geoff Lyon, Cullen Bash, "Towards an understanding of campus-scale power consumption." In ACM BuildSys, November 1, 2011, Seattle, WA. Manish Marwah, Ratnesh Sharma, Wilfredo Lugo, Lola Bautista, "Anomalous Thermal Behavior Detection in Data Centers using Hierarchical PCA," in SensorKDD in conjunction with KDD 2010. D. Patnaik, M. Marwah, Sharma, Ramakrishna, "Sustainable Operation and Management of Data Center Chillers using Temporal Data Mining," In ACM KDD, June 27 - July 1, 2009, Paris, France. Amip Shah, Tom Christian, Chandrakant D. Patel, Cullen Bash, Ratnesh K. Sharma: Assessing ICT's Environmental Impact. IEEE Computer 42(7): 91-93, July 2009.