Deploying Clusters at Electricité de France. Jean-Yves Berthou



Similar documents
High Performance Computing. Course Notes HPC Fundamentals

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Clusters: Mainstream Technology for CAE

AuditMatic Enterprise Edition Installation Specifications

Introduction to Cluster Computing

Contents. Chapter 1. Introduction

Performance Monitoring of Parallel Scientific Applications

Kerrighed / XtreemOS cluster flavour

Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters

Energy efficiency in HPC :

Chapter 1: Introduction. What is an Operating System?

HP OpenView Tiering Matrix For HP and Partner use Page 1 of 5 Updated: 19 December 2005

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales

P013 INTRODUCING A NEW GENERATION OF RESERVOIR SIMULATION SOFTWARE

PRIMERGY server-based High Performance Computing solutions

A Comparison of VMware and {Virtual Server}

Improved LS-DYNA Performance on Sun Servers

Comparison of Recent Trends in Sustainable Energy Development in Japan, U.K., Germany and France

SERVER CLUSTERING TECHNOLOGY & CONCEPT

MOSIX: High performance Linux farm

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

Virtualization. Types of Interfaces

PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY

Load-Balancing for a Real-Time System Based on Asymmetric Multi-Processing

Master in Nuclear Engineering M2

Overlapping Data Transfer With Application Execution on Clusters

Performance of Host Identity Protocol on Nokia Internet Tablet

Accelerating Simulation & Analysis with Hybrid GPU Parallelization and Cloud Computing

IOS110. Virtualization 5/27/2014 1

Product Brief. DC-Protect. Content based backup and recovery solution. By DATACENTERTECHNOLOGIES

Lecture 2 Cloud Computing & Virtualization. Cloud Application Development (SE808, School of Software, Sun Yat-Sen University) Yabo (Arber) Xu

Final Report. Cluster Scheduling. Submitted by: Priti Lohani

nanohub.org An Overview of Virtualization Techniques

Chapter 18: Database System Architectures. Centralized Systems

System Requirements. Version 8.2 November 23, For the most recent version of this document, visit our documentation website.

Simplest Scalable Architecture

Distributed File System Performance. Milind Saraph / Rich Sudlow Office of Information Technologies University of Notre Dame

High Performance Computing in CST STUDIO SUITE

LBM BASED FLOW SIMULATION USING GPU COMPUTING PROCESSOR

Recent Advances in HPC for Structural Mechanics Simulations

Is Virtualization Killing SSI Research?

Comparing Free Virtualization Products

Microsoft Compute Clusters in High Performance Technical Computing. Björn Tromsdorf, HPC Product Manager, Microsoft Corporation

Nuclear Education & Training in France. Support to Newcomer and Expanding Countries

salome-platform.org SALOME7 THE OPEN SOURCE INTEGRATION PLATFORM FOR NUMERICAL SIMULATION

Distributed applications monitoring at system and network level

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid

Load Balancing on a Non-dedicated Heterogeneous Network of Workstations

RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: COMPETITIVE FEATURES

Leveraging Windows HPC Server for Cluster Computing with Abaqus FEA

Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014

HPC performance applications on Virtual Clusters

MIKE by DHI 2014 e sviluppi futuri

System Requirements Version 8.0 July 25, 2013

WEB COMPAS MINIMUM HOSTING REQUIREMENTS

Evolution of the Data Center

How to control Resource allocation on pseries multi MCM system

How To Calculate Electricity Load Data

Multicore Parallel Computing with OpenMP

Satish Mohan. Head Engineering. AMD Developer Conference, Bangalore

Server Virtualization with VMWare

LSKA 2010 Survey Report Job Scheduler

MEGWARE HPC Cluster am LRZ eine mehr als 12-jährige Zusammenarbeit. Prof. Dieter Kranzlmüller (LRZ)

Uses for Virtual Machines. Virtual Machines. There are several uses for virtual machines:

IT Infrastructure and Platforms

Grid Scheduling Dictionary of Terms and Keywords

Basics of Virtualisation

Approaches for Cloud and Mobile Computing

CS550. Distributed Operating Systems (Advanced Operating Systems) Instructor: Xian-He Sun

Centralized Systems. A Centralized Computer System. Chapter 18: Database System Architectures

A Theory of the Spatial Computational Domain

How Cineca supports IT

Recommended hardware system configurations for ANSYS users

OIS. Update on Windows 7 at CERN & Remote Desktop Gateway. Operating Systems & Information Services CERN IT-OIS

Operating System Multilevel Load Balancing

Enterprise Planning Large Scale ARGUS Enterprise /29/2015 ARGUS Software An Altus Group Company

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

Cluster Computing at HRI

Capacity Plan. Template. Version X.x October 11, 2012

Virtualization of CBORD Odyssey PCS and Micros 3700 servers. The CBORD Group, Inc. January 13, 2007

Transcription:

Electricit é Deploying Clusters at Workshop Operating Systems, Tools and Methods for High Performance Computing on Linux Clusters Jean-Yves Berthou Head of the Applied Scientific Computing Group EDF R&D October, 7 2003 Clamart EDF R&D Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-1

Outline EDF Group s R&D Scientific Computing at EDF R&D Cluster Computing at EDF R&D Cluster technology at EDF : perspectives Concluding remarks Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-2

EDF Group s R&D 2570 employees Key figures 2/3 researchers and executives 96 teaching researchers 55 doctorates Participation in 70 European projects 4 main research sites Clamart (France) Karlsruhe (Germany) Chatou (France) Les Renardières (France) One branch in California (USA) Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-3

EDF Group s R&D Commercial development Electricity generation Power networks Transmission network infrastructures Transmission network development System operation and control Distribution networks and facilities Nuclear power Fossil-fired power Hydro power Renewable energies Forecast optimisation and management of the Company s generation assets Cross-functional fields Information technologies The environment Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-4 AC

Why is the R&D Division working in the nuclear power field? Competitiveness To keep maintenance expenses down To improve generation performance To improve the present availability of generating facilities Lifetime of power plants To improve the lifetime of critical components and the knowledge of ageing mechanisms To optimise the management of a unit s life cycle Downstream part of the cycle and future of nuclear waste Reactors of the future AS Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-5

Results and projects concerning research on hydro power and the other renewable energies Some examples Forecast studies of the profitability of offshore wind power farms (project) Construction of a demonstration building combining the use of renewable energies and the power network (project) A new numerical method for complex hydraulic flows AX Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-6

Scientific Computing at EDF R&D Large number of disciplinary applications : Thermomechanic : ASTER Thermohydraulic : NEPTUNE, SATURNE, THYC Neutronic Diffusion : DESCARTES, COCCINELLE Molecular Dynamic : REVE, SINERGY/PERFECT Global Power Plant functioning : LEGO(ENEL), CATHARE, SCAR simulator Financial Mathematics : Value at risk computation, spot price model, Energy Derivatives Visual Pricing Code coupling : 1-3 new coupled applications each year (30 coupled applications currently) Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-7

Scientific Computing at EDF R&D Small History of Computing Facilities at EDF R&D Until the end of 1990 : desktop computers for small studies centralized computers for large studies Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-8

Scientific Computing at EDF R&D Small History of Computing Facilities at EDF R&D AIST Project 1999 : computing power adapted to each needs, no more EDF R&D computer center : Desktop workstation (SUN, HP, ) Departmental/Project computer (SUN SMP, SGI SMP, HP SMP, COMPAQ MPP, Fujitsu VPP) HPC machines : cooperation with CEA CCR => PC CLUSTER : a possible project target machine Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-9

Cluster technology at EDF R&D CALIBRE Project 1999-2002 Initial goal : Spreading PC Cluster technology at EDF Objectives : Study of the technical feasibility Developing expertise Developing tools adapted to users needs Building a target architecture for internal EDF projects Building a service offer with the Direction du Système d Information et de l Informatique of EDF-GDF (EDF DSII) Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-10

Cluster technology at EDF R&D CALIBRE Project 2000-2002 : experimental results REVE project : Simulation of the irradiation damage (pressure vessel steels) CYRANO3 code : simulation of fuel rod thermomechanical behaviour ECOSS code : studying the Flashing phenomenon, vaporisation of a liquid due to depressurization Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-11

REVE project: REacteur Virtuel d Etude Applications (MMC) : Simulation of the irradiation damage (pressure vessel steels) Services : tools available on a HPC computer Cascade de déplacement dans Fer (20 kev) Experimental results : DYMOKA (EDF code ), Molecular dynamics using empirical interatomic potentials (EAM) Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-12

REVE project: REacteur Virtuel d Etude DYMOKA Nombre Performance (coût/atome/pas (µs)) de CPU Cray T3E IBM SP3 Compaq SC232 Cluster PC 1 128 38,2 16,6 23,4 2 63 (100%) 20,4 (94%) 8,3 (100%) [1] 4 32,5 (100%) 10,7 (89%) 4,2 (99%) [1] 8 16,5 (101%) 6,6 (72%) 2,97 (70%) [2] 16 8,4 (95%) N/A 1,57 (66%) [4] 10,4 (80%) [2] 5,3 (78%) [2] 2,9 (72%) [4] 1,84 (57%) [8] 12,9 (90%) [1] 6,9 (85%) [2] 3,7 (79%) [4] 12,0 (97%) [2] 6,3 (93%) [4] 3,3 (89%) [8] 2,16 (67%) [8] Légende : Temps (Efficacité) [Nombre de noeuds]

CYRANO3 code Applications (MMC) : simulation of fuel rod thermomechanical behaviour The French safety authorities check the integrity of fuel rods against a mechanical criterion Thousand of scénarios studied in parallel Sequential throughput under UNIX Solveur EF 1D

CYRANO3 code Use-case : 30 and 90 scénarii Target : 4000 scénarii Very high imbalance in execution time : 30s to 350s Machine Temps Speedup Sun Ultra 2/200 6 heures 1 Pentium III 800 28 min 45 s 12,5 Cluster 16 PIII 800 5 min 29 s 65

Cluster technology at EDF R&D CALIBRE Project 1999-2002 : industrial results LINUX is part of the supported OS at EDF/GDF Development of a EDF distribution Specialization of EDF distribution for particular project needs End of 2002 : 5 clusters at EDF R&D installed 2003 : dissemination of cluster technology outside EDF R&D (Study engineering structures) Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-16

Cluster technology at EDF R&D CALIBRE Project 1999-2002 : research results PhD EDF/IRISA/Claude Bernard Univ (Lyon) : Global scheduling in the Gobelins system Kerrighed (PARIS Project) : Providing a Single System Image for clusters : make usable CPUs, memories, devices and disks as a global resource Execution platform for sequential and parallel applications (shared memory or message passing) Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-17

Cluster technology at EDF R&D CALIBRE Project 1999-2002 : side effects Introduction to LINUX as a workstation : PC LINUX : a alternative to proprietary workstation supported at EDF 180 engineers at EDF R&D, 120 engineers at RTE 2003-2004 : 400 engineers at EDF R&D, 150 engineers at SEPTEN, PC LINUX/VMWARE : 1 PC for 2 machines Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-18

Cluster technology at EDF R&D CALIBRE Project 1999-2002 : side effects Introduction to Open Source culture : Open source software : more and more used EDF proprietary software are/became free : Code_Aster (10/2001) : 1 million code, tens of developpers (www.code-aster.org) P@L/SALOME (2001-2005) : generic software component based architecture, 150 eng.year (EDF : 50 eng.year), (www.opencascade.org/salome/salome.html) CALIBRE Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-19

Cluster technology at EDF : perspectives CALIBRE2 Project 2003-2004 : research objectives Kerrighed on industrial applications Enterprise Grid for Scientific Computing : aggregating enterprise computing power and Data and make them usable as a global resource Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-20

Cluster technology at EDF : perspectives OSIS Project 2003-2005 : industrial objectives Linux on workstation and clusters part of the enterprise technical referential Organizing support and administration of Linux solutions Support to the reorganization of Scientific Computing infrastructure for EDF divisions Training of administrators and users Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-21

Cluster technology at EDF : perspectives NAIST Project 2003-2005 Reorganization of Scientific Computing infrastructure of SEPTEN (engineering division) Deploying LINUX Workstation and Cluster for the SEPTEN EDF Engineering division Cartography of scientific codes and planning of LINUX platforms porting Training of administrators and users Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-22

Cluster technology at EDF : perspectives AIST Project 2003-2005 : first conclusions Speed-up of parametric studies : between 12 and 15 on the test cluster (16 processors) Some examples : 100 CATHARE2 parametric simulations : 100 days on existing Sun, 2 days on the test cluster New fuel management studies : Target : reduce elapse time from 18 months to 3 months (parametric studies) Expected gain : 3Meuros/year each time 20% of study elapse time is saved Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-23

Cluster technology at EDF : perspectives NAIST Project 2003-2005 Targets and planning : 2003, needs expression and tests : 10 workstations, 1 cluster 2004-2005 : deployment phase Porting of scientific codes (100 codes) 100 workstations, 3 clusters Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-24

Concluding remarks Clusters are now proved industrial target machines Clusters are part of a continuum of computing power : between workstation and HPC computers Offer a solution independent of strategy vendors : the evolution of such computing facilities does not depend of vendors roadmap Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-25

Concluding remarks EDF produce and maintain an EDF Linux Distribution available from workstations to clusters Use of standard Linux kernel and non proprietary software suite guaranty independence from any vendors : portability is improved BUT, how to combine : 1. one Linux distribution for Workstations and clusters AND, 2. possibility to by and use market clusters? Linux Cluster Workshop, EDF/INRIA/ORAP, Clamart, October 2003-26