News about HPC and Clouds @ Inria

Claude Kirchner, Advisor to the President
24/11/2014

Inria Research Centres
- Inria LILLE Nord Europe
- Inria PARIS - Rocquencourt
- Inria NANCY Grand Est
- Inria SACLAY Île-de-France
- Inria RENNES Bretagne Atlantique
- Inria BORDEAUX Sud-Ouest
- Inria GRENOBLE Rhône-Alpes
- Inria SOPHIA ANTIPOLIS Méditerranée

Inria Strategic Plan: 5-year strategy

1. Sciences helpful for humans, society and knowledge
- Humans as such: health and well-being
- Humans and their environments: from the individuals to the society, from the habitat to the planet
- Humans and knowledge: emergence of knowledge, scientific mediation and education

2. Key scientific topics at the heart of our sciences
- Computing the Future: models, software and digital systems
- Mastering complexity: data, networks and flows
- Interacting with the real and virtual worlds: interactions, uses and machine learning

Scientific challenges at the heart of our sciences

Computing the Future: models, software and digital systems
Mastering complexity: data, networks and flows
- Designing multiscale models integrating uncertainties
- Building very large digital systems, possibly embedded, and systems of systems
- Programming very large software under reliability, safety and security constraints
- Processing data deluges into trustable knowledge libraries
- Generalized, safe cyber-communication, preserving privacy

Interacting with the real and virtual worlds: interactions, uses and machine learning
- Unsupervised machine learning
- Interaction between humans and their digital environments

Project-teams involved in HPC

Project-teams:
- KERDATA: data storage
- SAGE: scientific computing
- ALF: multicore/GPU
- DOLPHIN: combinatorial problems
- SIMPAF: fluid simulation
- ALPINE: algorithms & tools for numerical simulation
- ALGORILLE: algorithms & models
- CAMUS: automatic parallelization
- BACCHUS: tools for numerical algo.
- HIEPACS: scientific computing
- RUNTIME: communication/threads
- CEPAGE: task management
- MAGIQUE-3D: seismic simulation
- CONCHA: combustion simulation
- AVALON: middleware & programming
- CORSE: compilation and run-time
- MESCAL: models & tools
- MOAIS: scheduling & programming
- ROMA: resource optimization
- NACHOS: scientific computing
- OASIS: programming
- OPALE: scientific computing

Research centres:
- Inria Paris Rocquencourt
- Inria Rennes Bretagne Atlantique
- Inria Lille Nord Europe
- Inria Nancy Grand Est
- Inria Saclay Île-de-France
- Inria Bordeaux Sud-Ouest
- Inria Grenoble Rhône-Alpes
- Inria Sophia Antipolis Méditerranée

Project-teams involved in Clouds

Project-teams:
- KERDATA: data storage
- MYRIADS: autonomous dist. syst.
- ASCOLA: languages and virt.
- REGAL: large scale dist. systems
- ALGORILLE: algorithms & models
- CEPAGE: task management
- AVALON: middleware & programming
- MESCAL: models & tools
- OASIS: programming
- ZENITH: scientific data management

Research centres:
- Inria Paris Rocquencourt
- Inria Rennes Bretagne Atlantique
- Inria Lille Nord Europe
- Inria Nancy Grand Est
- Inria Saclay Île-de-France
- Inria Grenoble Rhône-Alpes
- Inria Bordeaux Sud-Ouest
- Inria Sophia Antipolis Méditerranée

Initiatives to support HPC/Clouds strategy within Inria

Inria Project Labs (IPL)
- Enable the launch of ambitious projects of the strategic plan
- Promote an interdisciplinary approach
- Mobilize the expertise of Inria researchers around key challenges
- 4 years duration
- ~1.5 million euros over 4 years
- Examples: C2S@Exa; Hemera; Fusion, Multicore

Other (GIS, PIA/ANR)
- GRID5000
- ELCI

Access to Machines
- Curie (GENCI / CEA)
- Fermi (PRACE)
- GRID5000
- Blue Waters
- Bull prototypes
- PlaFRIM (Bordeaux)

IPL: C2S@Exa, Computers and Computational Sciences at Exascale
Contact: Stephane.Lanteri@inria.fr

Numerical simulation for high performance massively parallel architectures
- Numerical linear algebra
- Numerical schemes for PDE models
- Optimization of performance of numerical solvers
- Programming models
- Resilience for exascale computing

Multidisciplinary approach in between applied mathematics and computer science: from generic building-blocks to large-scale applications
- Nuclear energy production (fusion): MHD computation with the JOREK simulation software (with CEA)
- Radioactive waste management scenario: environmental applications (with ANDRA)

GRID 5000

Testbed for research on distributed systems
- Born from the observation that we need a better and larger testbed
- High Performance Computing, Grids, Peer-to-peer systems, Cloud computing
- A complete access to the nodes' hardware in an exclusive mode (from one node to the whole infrastructure): Hardware as a Service
- RIaaS: Real Infrastructure as a Service!?

History, a community effort
- 2003: Project started (ACI GRID)
- 2005: Opened to users

Funding
- Inria, CNRS, and many local entities (regions, universities)

One rule: only for research on distributed systems, no production usage
- Free nodes during daytime to prepare experiments
- Large-scale experiments during nights and week-ends (no long jobs)

Current Status (Sept. 2014 data)
- 10 sites (1 outside France)
- Dedicated 10 Gbps backbone provided by Renater (French NREN)
- 24 clusters, 1006 nodes, 8014 cores
- Diverse technologies
  - Intel (65%), AMD (35%)
  - CPUs from one to 12 cores
  - Ethernet 1G, 10G, Infiniband {S, D, Q}DR
  - Two GPU clusters
  - 2 Xeon Phi
  - 2 data clusters (3-5 disks/node)
- More than 500 users per year
- Hardware renewed regularly

Some recent experiments over Grid 5000

Energy monitoring and management
- Evaluation of Green Strategies for Energy-Aware Framework in Large Scale Distributed Systems
- Evaluation of different watt-meters
- Estimation of energy consumption with or without application expertise

Cloud Computing and virtualization
- Sky computing between France and US using Hadoop
- Virtual machines deployment and migration (up to 10240 VMs)
- Experiments using major CloudKits (Nimbus, OpenStack) and VM stacks (Xen, KVM)

High Performance Computing
- Replay of the Curie computer traces for resource management systems over an emulated environment
- Comparison of component based approaches and MPI/threads applications

Big data management
- Optimization of MapReduce frameworks with high performance data management systems
- High performance data movements over multicore machines validated with a climate simulation application

ELCI: software environment for computation-intensive applications
- Develop numerical simulation for HPC
- New generation of SW stack: supercomputer control, numerical solvers, prog. & exec. environment
- Validation: better scalability, resilience, security, modularity, abstraction and interactivity of applications
- Consortium: Bull, CEA, Inria, SAFRAN, CERFACS, CORIA, CENAERO, ONERA, Univ. of Versailles, Kitware, AlgoTech
- 36 months (started on 09/2014)
- 150 PY

Inria@ELCI

1. Software environment
   1. Intra-node optimization (extending Linux Kernel)
   2. Cluster management (batch scheduler)
   3. Runtime systems (process placement)
   4. Energy efficiency (multi-criteria algorithms)
   5. Resilience (replication, impact on energy, etc.)
   6. Software integration
2. Solver and numerical methods
   1. Mesh tools (mesh refinement, mesh allocation)
3. Programming methods, tools, code optimization
   1. Tools and prog. models (StarPU/Xkaapi with MPC)
   2. High level models for productivity (components)

Inria@ELCI: project-teams involved
- Myriads (energy)
- Runtime (prog. models, batch scheduling, runtime systems)
- RealOpt (resilience)
- Bacchus (meshes)
- Roma (resilience, energy)
- Avalon (components, energy)
- Moais (prog. models)

Vision

A real opportunity to get some of the very best international actors of HPC to work together:
- Make the JLESC a success at least as excellent as the JLPC!
- Continue and develop the strong collaboration with NCSA and Argonne
- Develop strong existing and new relationships with BSC, Jülich and Riken
- Develop connections between JLESC and other HPC / Cloud Inria involvements, e.g. Chameleon / GRID5000, C2S@Exa

Have a fruitful workshop!

More technical information:
- Hélène Kirchner, Director of International Relations
- Yves Robert, JLESC Executive Director for Inria
- Frédéric Desprez, Deputy Scientific Director, Networks, Systems and Services and Distributed Computing, Frederic.Desprez@inria.fr
- Jean Roman, Deputy Scientific Director, Applied Mathematics, Computing, and Simulation, Jean.Roman@inria.fr

Inria strategy in HPC/Clouds

Inria is among the HPC leaders in Europe
- Long history of research around distributed systems, HPC, Grids, virtualized environments and Big Data
- Multidisciplinary research culture
- Building and using exploration tools (e.g. massively parallel machines since 1987, large scale testbeds such as Grid 5000)

National initiatives
- Collaboration with Bull on supercomputer design: ELCI, LECO
- Strategic partnership with EDF, TOTAL, EADS-ASTRIUM on numerical simulation
- Joint laboratory with CERFACS on robust scalable sparse linear solvers
- Collaboration with CEA on key system software (Kadeploy) for supercomputers
- Participation in French strategic committees on HPC: ORAP, TER@TEC
- Shareholder of GENCI and governance of Maison de la Simulation with CEA and CNRS

Inria strategy in HPC/Clouds (continued)

European
- PRACE-1IP/2-IP/3-IP (within GENCI)
- EESI & EESI2 (Exascale initiatives)
- ETP4HPC
- FP7 ICT, Challenge 1: Pervasive and Trusted Network and Service Infrastructures (XtreemOS, Contrail, BonFire, Fed4Fire)
- H2020

International
- Joint laboratory on petascale computing and now JLESC
- Inria@SiliconValley: Stanford and Berkeley Universities
- HOSCAR with Brazil, CNPq
- Standardization (DMTF, OGF/OCCI)