Energy efficiency in HPC




Energy efficiency in HPC: a new trend? A software approach to save power while still increasing the number or the size of scientific studies. 19 November 2012

The EDF Group in brief. A GLOBAL LEADER IN ELECTRICITY: 37.7 million customers; 156,168 employees worldwide; €65.3 billion in sales; 87% carbon-free generation.

EDF R&D, key figures: 2000 people; 370 doctors; 220 doctoral students; 200 research fellows from universities and other higher education establishments; €518 M budget in 2011; 90% success rate for annual contracts signed with EDF Group entities and business units; 15 departments; 12 common laboratories; 7 research centers, of which 3 in France, 1 in Germany, 1 in the UK, 1 in Poland and 1 in China; 500 major research projects per year.

RESEARCH PROGRAMMES: A BIG USE OF COMPUTING RESOURCES. Specific R&D for all Group activities: generation, energy management, sales, electrical grids, renewable energies. Covering all programmes: information technology.

INFORMATION TECHNOLOGIES. Improving the performance of operations through advanced simulation technologies. Coming up with new business opportunities through innovative uses of new ICTs. Simulate, compute, test: 600 people working on numerical simulation; Ivanoé, an HPC cluster delivering 200,000 billion operations per second; augmented reality.

Computing power for EDF R&D. In 8 years, the computing power at EDF R&D was multiplied by 2000: 2002, Sinetics, 100 Gflops; 2011, Ivanoé, 200 Tflops. 3 computers in the latest TOP500 list (Nov. 2012): Frontier2 (IBM Blue Gene/P), 328th with 100 teraflops; Ivanoé, a conventional cluster, 149th with 200 teraflops; Zumbrota (IBM Blue Gene/Q), 32nd with 690 teraflops.
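The factor of 2000 follows directly from the two figures quoted on the slide, and the same arithmetic links the "200,000 billion operations per second" figure to 200 Tflops. A minimal check in Python: the variable names are illustrative, and the per-year factor assumes steady exponential growth over the 8 years stated, which the slides do not claim.

```python
# Unit sizes in floating-point operations per second.
GFLOPS = 10**9
TFLOPS = 10**12

power_2002 = 100 * GFLOPS   # figure quoted for 2002
power_2011 = 200 * TFLOPS   # figure quoted for Ivanoé in 2011
years = 8                   # duration as stated on the slide

# Overall growth factor between the two figures.
growth_factor = power_2011 // power_2002

# Ivanoé's peak expressed in billions of operations per second.
billions_of_ops = power_2011 // 10**9

# Assumption: constant yearly multiplier over the stated period.
annual_factor = growth_factor ** (1 / years)

print(f"growth factor: x{growth_factor}")
print(f"Ivanoé: {billions_of_ops:,} billion operations per second")
print(f"equivalent yearly multiplier: x{annual_factor:.2f}")
```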

Performance: what is that? Reducing the computing time; obtaining more precise results; increasing the size of the field of study or the number of simulations; allowing multi-physics coupling.

The project: funded by ANR. It involves 3 partners: Kerlabs, INRIA, EDF. Aim of the project: design and implement a cluster management system that takes energy consumption into account.

EDF's role. Experimentally validate SNOOZE: an open source framework developed by Eugen Feller at INRIA; it manages virtual machines in private clouds; it is energy efficient. Provide hardware infrastructure: HPSLab, the High Performance Simulation Laboratory.

HPSLab: what are we talking about? An experimental platform, or a platform for experiments! Various technologies: processors, networks. Easy administration: most maintenance actions are made through web interfaces. Fully equipped for electricity consumption measurements.

Internship's objective: implement a test protocol; compare different use cases; analyse diverse execution conditions. How? Refining the data collected; choosing the right execution parameters; monitoring the platform; having a user-friendly interface. Also: analyse VM impact with SNOOZE; compare to other monitoring projects.

What has been done so far? State of the art in clusters (types of servers, types of sensors, network interfaces); monitoring system (scripts to get the data and plot it, web interface, comparison to Ganglia); first tests with Hadoop. Photo credit: EDF MEDIATHEQUE / SARGOS ALEXANDRE.
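The slides mention scripts that collect sensor data on a platform equipped for electricity consumption measurements, but do not show them. As a rough sketch of how such readings might be turned into an energy figure, the Python below integrates timestamped power samples with the trapezoidal rule. The sample format (seconds, watts) and the function names are assumptions made for illustration, not the project's actual scripts.

```python
from typing import Sequence, Tuple

# Hypothetical sample format: (timestamp in seconds, power in watts).
Sample = Tuple[float, float]


def energy_wh(samples: Sequence[Sample]) -> float:
    """Integrate power over time (trapezoidal rule) and return watt-hours."""
    joules = 0.0
    for (t0, p0), (t1, p1) in zip(samples, samples[1:]):
        if t1 < t0:
            raise ValueError("samples must be sorted by timestamp")
        # Area of the trapezoid between two consecutive readings.
        joules += 0.5 * (p0 + p1) * (t1 - t0)
    return joules / 3600.0  # 1 Wh = 3600 J


def average_power_w(samples: Sequence[Sample]) -> float:
    """Mean power over the measured interval, in watts."""
    duration_s = samples[-1][0] - samples[0][0]
    if duration_s <= 0:
        raise ValueError("need at least two samples spanning a positive duration")
    return energy_wh(samples) * 3600.0 / duration_s


# Illustrative readings: a node drawing a constant 100 W for one hour.
readings = [(0.0, 100.0), (1800.0, 100.0), (3600.0, 100.0)]
print(energy_wh(readings), "Wh")
print(average_power_w(readings), "W")
```

Comparing the energy of two runs of the same job, rather than their instantaneous power, accounts for the fact that a slower run at lower power can still consume more overall.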

Monitoring system architecture

Web Interface

Hadoop. What is it? An open source framework, written in Java, under the Apache license, to store and process large amounts of data. How does it work? MapReduce: the programming model. HDFS: the distributed file system.
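To make the MapReduce programming model concrete, here is a minimal word-count sketch in plain Python. It does not use the Hadoop API and runs in a single process; the function names are illustrative. It only mirrors the three conceptual steps: map each input record to key/value pairs, group the values by key, and reduce each group to a result.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: collapse the values of one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


print(word_count(["big data", "big cluster"]))
```

In a real Hadoop job the map and reduce functions run on many nodes in parallel, and the input and output live in HDFS; the framework performs the grouping step between them.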

Test with Hadoop: EDF's results and Ganglia's results [figures]


What is left to do? Experimentally validate SNOOZE: use the framework in the HPSLab cluster; test it with multiple use cases; analyse the results.

Aims for the future: become a working tool for EDF's researchers; integrate this project into future production clusters. Photo credit: EDF MEDIATHEQUE / DIAS JEAN-LIONEL.

Links http://www.edf.com http://ecograppe.inria.fr http://snooze.inria.fr http://hadoop.apache.org http://ganglia.sourceforge.net/