PARIS*: Programming parallel and distributed systems for large scale numerical simulation applications. Christine Morin IRISA/INRIA



1 PARIS*: Programming parallel and distributed systems for large scale numerical simulation applications
Kerrighed, Vigne
Christine Morin, IRISA/INRIA
* Common project with CNRS, ENS-Cachan, INRIA, INSA, Université de Rennes 1

2 Members of PARIS Project (Sept. 05)
Scientific leader: T. Priol (DR INRIA)
Researchers: F. André (Prof IFSIC), G. Antoniu (CR INRIA), J-P. Banâtre (Prof IFSIC), M. Bertier (MdC INSA), L. Bougé (Prof ENS), Y. Jégou (CR INRIA), A-M. Kermarrec (DR INRIA), C. Morin (DR INRIA), J.L. Pazat (MdC INSA), C. Perez (CR INRIA)
Post-docs: A. Viana, A. Ribes, G. Vallée
Engineers: D. Margery (IR INRIA), P. Morillon (IE IFSIC)
Engineers: P. Gallard (DGA), V. Lefèvre (G5K - IFSIC), R. Lottiaux (DGA), G. Mornet (G5K - PRIR), P. Palosaari (CoreGRID), J. Parpaillon (Ing. Associé)
PhD candidates: H-L. Bouziane (INRIA 2), J. Buisson (MENRT 3), Y. Busnel (ENS 1), L. Cudennec (INRIA-Région 1), M. Fertré (MENRT 1), M. Jan (MENRT 3), E. Jeanvoine (CIFRE 2), S. Lacour (INRIA 3), E. Le Merrer (CIFRE FT), S. Monnet (INRIA-Région 3), Y. Radenac (MENRT 3), E. Riviere (MENRT 2), L. Rilling (ENS 3)

3 Studied Systems
Clusters: a set of interconnected PCs used as a single computing resource
Grid: a set of resources (processor, memory, disk, ...) interconnected via the Internet
P2P systems: a dynamic distributed system without any global state

4 Research Directions
Single system image operating system
  Problem: clusters are difficult to program and use
  Challenge: giving the illusion that a cluster is a single machine
Component based middleware
  Problem: code coupling applications are complex
  Challenge: how to facilitate the design of such applications while providing high performance?
Advanced programming models
  Problem: current programming models are not adequate for highly dynamic systems
  Challenge: how to express computing and coordination in such an environment?
Data sharing service
  Problem: data sharing in large scale grids
  Challenge: sharing mutable data
P2P systems
  Problem: mastering and optimizing a P2P system
  Challenge: characterizing a P2P system and searching for relevant information
Experimental grid platform
  Problem: need to experiment to validate our research results
  Challenge: building a reconfigurable grid

5 Grid 5000 Experimental Platform
Contribution to the construction of Grid sites, 5000 processors
Rennes site
  500 processors (PowerPC, Xeon, Opteron)
  Dual processor nodes
Participants
  Researcher: Y. Jégou
  Engineers: V. Lefèvre, D. Margery, P. Morillon, G. Mornet
Grid-5000 Rennes

6 International Collaborations (funded)
Cluster OS: University of Ulm (Germany), ORNL (USA), Rutgers University (USA)
Grids: Pisa University (Italy), SNU (Korea)
Large scale data management: UIUC (USA)
P2P systems: Vrije Universiteit (The Netherlands)

7 OS for Clusters and Grids
Kerrighed: Single System Image (SSI) operating system for high performance computing on clusters
Vigne: operating system to ease the use and programming of grids

8 Kerrighed
Objectives
  Virtual shared memory multiprocessor
  Global and transparent resource management
  Tolerating node failures transparently for the applications
  High performance
Approach
  Design of distributed OS mechanisms within an existing OS (Linux)

9 Kerrighed Achievements
Customizable, efficient, full SSI operating system for high performance computing on clusters
  Small clusters (up to 256 nodes)
Advanced research prototype
  Integration of the work of 3 Ph.D. students (R. Lottiaux (2001), Geoffroy Vallée (2004), P. Gallard (2004))
  Robust prototype able to execute real applications provided by EDF R&D and DGA
Open source software
  Stable version (K V1.0.2) based on Linux
  Demo LiveCD based on Knoppix
  kerrighed.users@irisa.fr
  Integrated in OSCAR: ssi-oscar.irisa.fr
OSCAR is a snapshot of methods for building, programming, and using clusters. It consists of a fully integrated and easy to install software bundle designed for high performance cluster computing.

10 Efficient Operating System
Comparison with other SSI systems for clusters
  OpenSSI, openMosix
  Results published in CC-GRID 2005
  Internship of Benoît Boissinot
Efficient communication system
  Highly reactive communication system to support Kerrighed distributed operating system services
  Compatibility of Kerrighed with efficient communication drivers used by HPC applications (such as GM for Myrinet)

11 Collaborations
EDF R&D (since 2000): PhD and post-doc grants
DGA ( ): funding for research engineers
ORNL (S. Scott): integration of Kerrighed in OSCAR
University of Ulm (M. Schöttner): fault tolerant SDSM
Rutgers University (L. Iftode): high availability
Invited researchers: R. Badrinath (IIT Kharagpur), Isaac Scherson (UCI)

12 Current Research Directions
Fault tolerance
  Large scale parallel application checkpointing
  System initiated checkpoints
  Checkpointing grid applications
  Master & PhD thesis of Matthieu Fertré
High availability
  Current work of Pascal Gallard and Renaud Lottiaux
  Tolerating hot node addition and eviction
Phenix
  Investigating the backdoor approach in the context of Kerrighed
  Master thesis of Benoît Boissinot
[Figure: Application / SSI cluster OS / Node 1, Node 2, Node 3]

13 Technology Transfer
KerLabs
  Start-up founded by Pascal Gallard and Renaud Lottiaux
Software suite based on Kerrighed technologies
  EasyAdmin: global cluster management
  EasyCheckpoint: checkpoint/restart of parallel applications
  EasyRun: application deployment & scheduling on clusters
  EasyCluster: the whole Kerrighed SSI solution
Optimized support for high performance networking technologies
Open source model

14 Vigne: a Grid OS
Design and implementation of a Grid OS to ease the use and programming of very large grids
Highly decentralized system
  Algorithms based on local knowledge
Self-healing system
  Dealing with multiple quasi-simultaneous reconfigurations
Single System Image
Flexible system

15 Vigne
Infrastructure based on decentralized overlays
  Structured and unstructured overlays
Application manager for reliable application execution
Resource discovery & allocation service
  On-going work (PhD thesis of Emmanuel Jeanvoine)
Volatile data sharing service
  PhD thesis of Louis Rilling
Complex application deployment
  PhD thesis of Boris Daix (co-advised with Christian Pérez), to start at the beginning of 2006

16 Collaborations
EDF R&D: PhD grants

17 Future Work
XtreemOS Project
  Integrated Project (FP6 - Call 5), under evaluation
  Goal: Building and Promoting a Linux-based Operating System to Support Virtual Organisations for Next Generation Grids
  18 partners: academic & industrial partners, 8 countries (including China)

18 XtreemOS Main Objectives
We will design, implement, evaluate and distribute an open source Grid OS with native support for virtual organizations.
Development of a Grid Operating System
  Enhance Linux to support VO across multiple administrative domains
  Manage very large and ...
  Self-organizing and self-healing system
  Available on PC, SMP, clusters, PDA and mobile phones
XtreemOS software: 3 flavours
  Standard flavour for PC
  Federation flavour based on Kerrighed
  SD flavour for small devices
XtreemOS software will make VO management easy for administrators and make work within VOs easy, secure and efficient.
Experimentation and evaluation with a comprehensive set of real use cases provided by ISVs and end users
Integration in well-known Linux distributions
  Mandriva, Red Flag Linux
Building a reference open source Grid OS
[Figure: Application and Appli boxes, Middleware, XtreemOS, Linux Computer nodes]

19 Talks
Kerrighed: Pascal Gallard, Matthieu Fertré
Vigne: Emmanuel Jeanvoine
JuxMem: Sébastien Monnet

Kerrighed / XtreemOS cluster flavour

Kerrighed / XtreemOS cluster flavour Kerrighed / XtreemOS cluster flavour Jean Parpaillon Reisensburg Castle Günzburg, Germany July 5-9, 2010 July 6th, 2010 Kerrighed - XtreemOS cluster flavour 1 Summary Kerlabs Context Kerrighed Project

More information

Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters

Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters Geoffroy Vallée, Renaud

More information

Is Virtualization Killing SSI Research?

Is Virtualization Killing SSI Research? Is Virtualization Killing SSI Research? Jérôme Gallard, Geoffroy Vallée, Adrien Lèbre, Christine Morin, Pascal Gallard and Stephen L. Scott Aug, 26th Outline Context Cluster BS/SSI Virtualization Combining

More information

P U B L I C A T I O N I N T E R N E 1656 OPENMOSIX, OPENSSI AND KERRIGHED: A COMPARATIVE STUDY

P U B L I C A T I O N I N T E R N E 1656 OPENMOSIX, OPENSSI AND KERRIGHED: A COMPARATIVE STUDY I R I P U B L I C A T I O N I N T E R N E 656 N o S INSTITUT DE RECHERCHE EN INFORMATIQUE ET SYSTÈMES ALÉATOIRES A OPENMOSIX, OPENSSI AND KERRIGHED: A COMPARATIVE STUDY RENAUD LOTTIAUX, BENOIT BOISSINOT,

More information

GAMoSe: An Accurate Monitoring Service For Grid Applications

GAMoSe: An Accurate Monitoring Service For Grid Applications GAMoSe: An Accurate ing Service For Grid Applications Thomas Ropars, Emmanuel Jeanvoine, Christine Morin # IRISA/Paris Research group, Université de Rennes 1, EDF R&D, # INRIA {Thomas.Ropars,Emmanuel.Jeanvoine,Christine.Morin}@irisa.fr

More information

P U B L I C A T I O N I N T E R N E 1669 CAPABILITIES FOR PER PROCESS TUNING OF DISTRIBUTED OPERATING SYSTEMS

P U B L I C A T I O N I N T E R N E 1669 CAPABILITIES FOR PER PROCESS TUNING OF DISTRIBUTED OPERATING SYSTEMS I R I P U B L I C A T I O N I N T E R N E 1669 N o S INSTITUT DE RECHERCHE EN INFORMATIQUE ET SYSTÈMES ALÉATOIRES A CAPABILITIES FOR PER PROCESS TUNING OF DISTRIBUTED OPERATING SYSTEMS DAVID MARGERY, RENAUD

More information

P U B L I C A T I O N I N T E R N E 1704 SSI-OSCAR: A CLUSTER DISTRIBUTION FOR HIGH PERFORMANCE COMPUTING USING A SINGLE SYSTEM IMAGE

P U B L I C A T I O N I N T E R N E 1704 SSI-OSCAR: A CLUSTER DISTRIBUTION FOR HIGH PERFORMANCE COMPUTING USING A SINGLE SYSTEM IMAGE I R I P U B L I C A T I O N I N T E R N E 1704 N o S INSTITUT DE RECHERCHE EN INFORMATIQUE ET SYSTÈMES ALÉATOIRES A SSI-OSCAR: A CLUSTER DISTRIBUTION FOR HIGH PERFORMANCE COMPUTING USING A SINGLE SYSTEM

More information

XtreemOS : des grilles aux nuages informatiques

XtreemOS : des grilles aux nuages informatiques XtremOS tutorial on security XtreemOS : des grilles aux nuages informatiques Christine Morin Myriads research team INRIA Rennes-Bretagne Atlantique XtreemOS scientific coordinator Séminaire IN Tech - Virtualisation

More information

Distributed System Monitoring and Failure Diagnosis using Cooperative Virtual Backdoors

Distributed System Monitoring and Failure Diagnosis using Cooperative Virtual Backdoors Distributed System Monitoring and Failure Diagnosis using Cooperative Virtual Backdoors Benoit Boissinot E.N.S Lyon directed by Christine Morin IRISA/INRIA Rennes Liviu Iftode Rutgers University Phenix

More information

Is Virtualization Killing SSI Research?

Is Virtualization Killing SSI Research? Is Virtualization Killing SSI Research? Jérôme Gallard Paris Project-Team Dinard November 2007 Supervisor : Christine Morin Co-supervisor: Adrien Lèbre My subject! ;) Reliability and performance of execution

More information

A Monitoring Tool to Manage the Dynamic Resource Requirements of a Grid Data Sharing Service

A Monitoring Tool to Manage the Dynamic Resource Requirements of a Grid Data Sharing Service A Monitoring Tool to Manage the Dynamic Resource Requirements of a Grid Data Sharing Service Voichiţa Almăşan Voichita.Almasan@irisa.fr Supervisors : Gabriel Antoniu, Luc Bougé {Gabriel.Antoniu,Luc.Bouge}@irisa.fr

More information

Distributed Operating Systems. Cluster Systems

Distributed Operating Systems. Cluster Systems Distributed Operating Systems Cluster Systems Ewa Niewiadomska-Szynkiewicz ens@ia.pw.edu.pl Institute of Control and Computation Engineering Warsaw University of Technology E&IT Department, WUT 1 1. Cluster

More information

The Advantages and Disadvantages of a Standard Data Storage System

The Advantages and Disadvantages of a Standard Data Storage System Databases: towards performance and scalability Bibliographical study Silviu-Marius Moldovan mariusmoldovan@gmail.com Supervisors: Gabriel Antoniu, Luc Bougé {Gabriel.Antoniu,Luc.Bouge}@irisa.fr INSA, IFSIC,

More information

Security of Information Systems hosted in Clouds: SLA Definition and Enforcement in a Dynamic Environment

Security of Information Systems hosted in Clouds: SLA Definition and Enforcement in a Dynamic Environment Security of Information Systems hosted in Clouds: SLA Definition and Enforcement in a Dynamic Environment Christine Morin Inria Joint work with Louis Rilling (DGA-MI), Anna Giannakou (Inria), Jean-Louis

More information

XtreemOS and Cloud Computing Alvaro Arenas E-Science Centre Science and Technologies Facilities Council, UK XtreemOS in a Nutshell An open source Linux-based Grid Operating System with native VO support

More information

Deliverable D2.2. Resource Management Systems for Distributed High Performance Computing

Deliverable D2.2. Resource Management Systems for Distributed High Performance Computing Resource Management Systems for Distributed High Performance Computing VERSION Version 1.1 DATE February 4, 2011 EDITORIAL MANAGER Yann Radenac (Yann.Radenac@inria.fr) AUTHORS STAFF Yann Radenac IRISA/INRIA,

More information

Work in Progress on Cloud Computing in Myriads Team and Contrail European Project Christine Morin, Inria

Work in Progress on Cloud Computing in Myriads Team and Contrail European Project Christine Morin, Inria Potential collaboration talk Work in Progress on Cloud Computing in Myriads Team and Contrail European Project Christine Morin, Inria Design and implementation of autonomous distributed systems Internet

More information

Architectural Review of Load Balancing Single System Image

Architectural Review of Load Balancing Single System Image Journal of Computer Science 4 (9): 752-761, 2008 ISSN 1549-3636 2008 Science Publications Architectural Review of Load Balancing Single System Image Bestoun S. Ahmed, Khairulmizam Samsudin and Abdul Rahman

More information

Simple Introduction to Clusters

Simple Introduction to Clusters Simple Introduction to Clusters Cluster Concepts Cluster is a widely used term meaning independent computers combined into a unified system through software and networking. At the most fundamental level,

More information

Energy efficiency in HPC :

Energy efficiency in HPC : Energy efficiency in HPC : A new trend? A software approach to save power but still increase the number or the size of scientific studies! 19 Novembre 2012 The EDF Group in brief A GLOBAL LEADER IN ELECTRICITY

More information

Cellular Computing on a Linux Cluster

Cellular Computing on a Linux Cluster Cellular Computing on a Linux Cluster Alexei Agueev, Bernd Däne, Wolfgang Fengler TU Ilmenau, Department of Computer Architecture Topics 1. Cellular Computing 2. The Experiment 3. Experimental Results

More information

Kerrighed: use cases. Cyril Brulebois. Kerrighed. Kerlabs

Kerrighed: use cases. Cyril Brulebois. Kerrighed. Kerlabs Kerrighed: use cases Cyril Brulebois cyril.brulebois@kerlabs.com Kerrighed http://www.kerrighed.org/ Kerlabs http://www.kerlabs.com/ 1 / 23 Introducing Kerrighed What s Kerrighed? Single-System Image (SSI)

More information

Deploying Clusters at Electricité de France. Jean-Yves Berthou

Deploying Clusters at Electricité de France. Jean-Yves Berthou Electricit é Deploying Clusters at Workshop Operating Systems, Tools and Methods for High Performance Computing on Linux Clusters Jean-Yves Berthou Head of the Applied Scientific Computing Group EDF R&D

More information

Efficient Load Balancing using VM Migration by QEMU-KVM

Efficient Load Balancing using VM Migration by QEMU-KVM International Journal of Computer Science and Telecommunications [Volume 5, Issue 8, August 2014] 49 ISSN 2047-3338 Efficient Load Balancing using VM Migration by QEMU-KVM Sharang Telkikar 1, Shreyas Talele

More information

Ressources management and runtime environments in the exascale computing era

Ressources management and runtime environments in the exascale computing era Ressources management and runtime environments in the exascale computing era Guillaume Huard MOAIS and MESCAL INRIA Projects CNRS LIG Laboratory Grenoble University, France Guillaume Huard MOAIS and MESCAL

More information

Flauncher and DVMS Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically

Flauncher and DVMS Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically Flauncher and Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically Daniel Balouek, Adrien Lèbre, Flavien Quesnel To cite this version: Daniel Balouek,

More information

PhantomOS: A Next Generation Grid Operating System

PhantomOS: A Next Generation Grid Operating System PhantomOS: A Next Generation Grid Operating System Irfan Habib 2, Kamran Soomro 2, Ashiq Anjum 1, Richard McClatchey 1, Arshad Ali 2, Peter Bloodsworth 1 1 CCS Research Centre, University of the West of

More information

Box Leangsuksun+ * Thammasat University, Patumtani, Thailand # Oak Ridge National Laboratory, Oak Ridge, TN, USA + Louisiana Tech University, Ruston,

Box Leangsuksun+ * Thammasat University, Patumtani, Thailand # Oak Ridge National Laboratory, Oak Ridge, TN, USA + Louisiana Tech University, Ruston, N. Saragol * Hong Ong# Box Leangsuksun+ K. Chanchio* * Thammasat University, Patumtani, Thailand # Oak Ridge National Laboratory, Oak Ridge, TN, USA + Louisiana Tech University, Ruston, LA, USA Introduction

More information

EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations

EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations DSS EIT ICT Labs Master Programme Distributed System and Services (Cloud Computing) The programme in Distributed Systems and Services focuses on

More information

Multi-core Curriculum Development at Georgia Tech: Experience and Future Steps

Multi-core Curriculum Development at Georgia Tech: Experience and Future Steps Multi-core Curriculum Development at Georgia Tech: Experience and Future Steps Ada Gavrilovska, Hsien-Hsin-Lee, Karsten Schwan, Sudha Yalamanchili, Matt Wolf CERCS Georgia Institute of Technology Background

More information

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Client/Server Computing Distributed Processing, Client/Server, and Clusters Client/Server Computing Distributed Processing, Client/Server, and Clusters Chapter 13 Client machines are generally single-user PCs or workstations that provide a highly userfriendly interface to the

More information

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes Anthony Kenisky, VP of North America Sales About Appro Over 20 Years of Experience 1991 2000 OEM Server Manufacturer 2001-2007

More information

nanohub.org An Overview of Virtualization Techniques

nanohub.org An Overview of Virtualization Techniques An Overview of Virtualization Techniques Renato Figueiredo Advanced Computing and Information Systems (ACIS) Electrical and Computer Engineering University of Florida NCN/NMI Team 2/3/2006 1 Outline Resource

More information

Scheduling and Resource Management in Computational Mini-Grids

Scheduling and Resource Management in Computational Mini-Grids Scheduling and Resource Management in Computational Mini-Grids July 1, 2002 Project Description The concept of grid computing is becoming a more and more important one in the high performance computing

More information

Large Scale Management of Virtual Machines Cooperative and Reactive Scheduling in Large-Scale Virtualized Platforms

Large Scale Management of Virtual Machines Cooperative and Reactive Scheduling in Large-Scale Virtualized Platforms Large Scale Management of Virtual Machines Cooperative and Reactive Scheduling in Large-Scale Virtualized Platforms Adrien Lèbre EPI ASCOLA / HEMERA Flavien Quesnel, Phd Candidate February 2013 System

More information

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical Identify a problem Review approaches to the problem Propose a novel approach to the problem Define, design, prototype an implementation to evaluate your approach Could be a real system, simulation and/or

More information

MPI / ClusterTools Update and Plans

MPI / ClusterTools Update and Plans HPC Technical Training Seminar July 7, 2008 October 26, 2007 2 nd HLRS Parallel Tools Workshop Sun HPC ClusterTools 7+: A Binary Distribution of Open MPI MPI / ClusterTools Update and Plans Len Wisniewski

More information

Using the Windows Cluster

Using the Windows Cluster Using the Windows Cluster Christian Terboven terboven@rz.rwth aachen.de Center for Computing and Communication RWTH Aachen University Windows HPC 2008 (II) September 17, RWTH Aachen Agenda o Windows Cluster

More information

Windows Compute Cluster Server 2003. Miron Krokhmal CTO

Windows Compute Cluster Server 2003. Miron Krokhmal CTO Windows Compute Cluster Server 2003 Miron Krokhmal CTO Agenda The Windows compute cluster architecture o Hardware and software requirements o Supported network topologies o Deployment strategies, including

More information

1 Bull, 2011 Bull Extreme Computing

1 Bull, 2011 Bull Extreme Computing 1 Bull, 2011 Bull Extreme Computing Table of Contents HPC Overview. Cluster Overview. FLOPS. 2 Bull, 2011 Bull Extreme Computing HPC Overview Ares, Gerardo, HPC Team HPC concepts HPC: High Performance

More information

Self-Adapting Load Balancing for DNS

Self-Adapting Load Balancing for DNS Self-Adapting Load Balancing for DNS Jo rg Jung, Simon Kiertscher, Sebastian Menski, and Bettina Schnor University of Potsdam Institute of Computer Science Operating Systems and Distributed Systems Before

More information

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN 1 PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN Introduction What is cluster computing? Classification of Cluster Computing Technologies: Beowulf cluster Construction

More information

Linux clustering. Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University

Linux clustering. Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University Linux clustering Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University PII 4-node clusters started in 1999 PIII 16 node cluster purchased in 2001. Plan for grid For test base HKBU -

More information

Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed

Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed Sébastien Badia, Alexandra Carpen-Amarie, Adrien Lèbre, Lucas Nussbaum Grid 5000 S. Badia, A. Carpen-Amarie, A. Lèbre, L. Nussbaum

More information

High Performance Computing. Course Notes 2007-2008. HPC Fundamentals

High Performance Computing. Course Notes 2007-2008. HPC Fundamentals High Performance Computing Course Notes 2007-2008 2008 HPC Fundamentals Introduction What is High Performance Computing (HPC)? Difficult to define - it s a moving target. Later 1980s, a supercomputer performs

More information

Computing in High- Energy-Physics: How Virtualization meets the Grid

Computing in High- Energy-Physics: How Virtualization meets the Grid Computing in High- Energy-Physics: How Virtualization meets the Grid Yves Kemp Institut für Experimentelle Kernphysik Universität Karlsruhe Yves Kemp Barcelona, 10/23/2006 Outline: Problems encountered

More information

Clusters: Mainstream Technology for CAE

Clusters: Mainstream Technology for CAE Clusters: Mainstream Technology for CAE Alanna Dwyer HPC Division, HP Linux and Clusters Sparked a Revolution in High Performance Computing! Supercomputing performance now affordable and accessible Linux

More information

Cluster, Grid, Cloud Concepts

Cluster, Grid, Cloud Concepts Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of

More information

Fig. 3. PostgreSQL subsystems

Fig. 3. PostgreSQL subsystems Development of a Parallel DBMS on the Basis of PostgreSQL C. S. Pan kvapen@gmail.com South Ural State University Abstract. The paper describes the architecture and the design of PargreSQL parallel database

More information

LinuxWorld Conference & Expo Server Farms and XML Web Services

LinuxWorld Conference & Expo Server Farms and XML Web Services LinuxWorld Conference & Expo Server Farms and XML Web Services Jorgen Thelin, CapeConnect Chief Architect PJ Murray, Product Manager Cape Clear Software Objectives What aspects must a developer be aware

More information

Virtualization for Cloud Computing

Virtualization for Cloud Computing Virtualization for Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF CLOUD COMPUTING On demand provision of computational resources

More information

Software services competence in research and development activities at PSNC. Cezary Mazurek PSNC, Poland

Software services competence in research and development activities at PSNC. Cezary Mazurek PSNC, Poland Software services competence in research and development activities at PSNC Cezary Mazurek PSNC, Poland Workshop on Actions for Better Participation of New Member States to FP7-ICT Timişoara, 18/19-03-2010

More information

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC HPC Architecture End to End Alexandre Chauvin Agenda HPC Software Stack Visualization National Scientific Center 2 Agenda HPC Software Stack Alexandre Chauvin Typical HPC Software Stack Externes LAN Typical

More information

Towards the Magic Green Broker Jean-Louis Pazat IRISA 1/29. Jean-Louis Pazat. IRISA/INSA Rennes, FRANCE MYRIADS Project Team

Towards the Magic Green Broker Jean-Louis Pazat IRISA 1/29. Jean-Louis Pazat. IRISA/INSA Rennes, FRANCE MYRIADS Project Team Towards the Magic Green Broker Jean-Louis Pazat IRISA 1/29 Jean-Louis Pazat IRISA/INSA Rennes, FRANCE MYRIADS Project Team Towards the Magic Green Broker Jean-Louis Pazat IRISA 2/29 OUTLINE Clouds and

More information

Virtualization with Windows

Virtualization with Windows Virtualization with Windows at CERN Juraj Sucik, Emmanuel Ormancey Internet Services Group Agenda Current status of IT-IS group virtualization service Server Self Service New virtualization features in

More information

Virtual machine interface. Operating system. Physical machine interface

Virtual machine interface. Operating system. Physical machine interface Software Concepts User applications Operating system Hardware Virtual machine interface Physical machine interface Operating system: Interface between users and hardware Implements a virtual machine that

More information

Virtualization of a Cluster Batch System

Virtualization of a Cluster Batch System Virtualization of a Cluster Batch System Christian Baun, Volker Büge, Benjamin Klein, Jens Mielke, Oliver Oberst and Armin Scheurer Die Kooperation von Cluster Batch System Batch system accepts computational

More information

Software Distributed Shared Memory Scalability and New Applications

Software Distributed Shared Memory Scalability and New Applications Software Distributed Shared Memory Scalability and New Applications Mats Brorsson Department of Information Technology, Lund University P.O. Box 118, S-221 00 LUND, Sweden email: Mats.Brorsson@it.lth.se

More information

HPC performance applications on Virtual Clusters

HPC performance applications on Virtual Clusters Panagiotis Kritikakos EPCC, School of Physics & Astronomy, University of Edinburgh, Scotland - UK pkritika@epcc.ed.ac.uk 4 th IC-SCCE, Athens 7 th July 2010 This work investigates the performance of (Java)

More information

Using an MPI Cluster in the Control of a Mobile Robots System

Using an MPI Cluster in the Control of a Mobile Robots System Using an MPI Cluster in the Control of a Mobile Robots System Mohamed Salim LMIMOUNI, Saïd BENAISSA, Hicham MEDROMI, Adil SAYOUTI Equipe Architectures des Systèmes (EAS), Laboratoire d Informatique, Systèmes

More information

University of Huddersfield Repository

University of Huddersfield Repository University of Huddersfield Repository Gubb, David, Holmes, Violeta, Kureshi, Ibad, Liang, Shuo and James, Yvonne Implementing a Condor pool using a Green-IT policy Original Citation Gubb, David, Holmes,

More information

Storage Virtualization from clusters to grid

Storage Virtualization from clusters to grid Seanodes presents Storage Virtualization from clusters to grid Rennes 4th october 2007 Agenda Seanodes Presentation Overview of storage virtualization in clusters Seanodes cluster virtualization, with

More information
