OS/Run'me and Execu'on Time Produc'vity
|
|
|
- Beverley Norris
- 10 years ago
- Views:
Transcription
1 OS/Run'me and Execu'on Time Produc'vity Ron Brightwell, Technical Manager Scalable System SoAware Department Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy s National Nuclear Security Administration under contract DE-AC04-94AL85000.
2 Applica'on Developers Are Evil Ron: I have a lightweight opera'ng system that will significantly improve performance and scalability App developers: Changing our makefiles is really hard We really don t want to use a cross compiler We don t want to make a plamorm- or hardware- specific op'miza'ons Any change has to improve performance on all machines Performance portability is cri'cal It s too much work to re- qualify the code
3 Five Years Later App developers: There s a new thing we really want to use It s a custom piece of hardware That requires a cross compiler And its own programming language That we had to change our makefiles to use That isn t portable And we had to restructure our applica'on for it But it really increases performance Ron: What?! (OK, maybe the poten'al performance increase is not iden'cal)
4 Hardware Designers are Evil HW designers: Here s a great new piece of hardware Here s the list of 22 errata for it You system soaware folks will have to deal with some of those And we had to take out that <memory bus locking> feature the previous version of your OS relied upon Ron: Sigh. Great. Thanks.
5 Some Years Later HW designers: We have more transistors than we know what do with Co- design sounds like a great idea We re going to ask the applica'on developers what hardware features they think they might need Ron: What?!
6 What OS/Run'me Design Choices Impact Produc'vity?
7 Tension in Dealing with Transparency Both defini'ons of transparency Magic happens Visibility into lower layers Abstract the hardware Focus on interface rather than mechanism But provide direct access when desired Provide a portable programming interface for system services But maintain performance and scalability Expose locality without exposing locality Caches were supposed to be invisible
8 Lightweight OS Approach Provide determinis'c performance OS resource usage is fixed Resource management is explicit How much memory is available? Applica'on has more control over its resources Eliminate unused func'onality Demand paging Page pinning Implement desired OS policy rather than enable fixing wrong OS policy Page pinning First touch Smaller code base increases reliability Compa'bility with glibc and Linux toolchain is important
9 Scaling Issues Sandia philosophy has been to fail fast Iden'fy scalability issues early by ini'ally secng resources low Avoid providing band aids Counter to what most vendors want to do Enable scaling from laptop to extreme- scale Ini'al MPP users focused on large systems Ideally provide the same soaware environment But focus should be on scaling down rather than scaling up Consider scaling aspect of system calls and services Exploring virtualiza'on as a poten'al mechanism Performance portability Across all systems or across extreme- scale systems? Should be a secondary goal in the face of fundamental challenges
10 Run'me Systems Approach has largely been sta'c Give resources to applica'on and get out of the way Taking on the complexity of dynamic resource management Introspec'on and adap'vity Significantly more complexity to manage Trying to do what humans have only been mildly successful in doing Need ways for applica'ons and compilers to constrain and guide adap'vity op'ons and mechanisms Power/energy management is a new challenge Applica'on no longer has complete control over resources Run'mes are expected to be performance portable too
11 Building Custom Opera'ng/Run'me Systems
12 Lightweight Kernel Influences Lightweight OS Small collec'on of apps Single programming model Single architecture Single usage model Small set of shared services No history Puma/Cougar MPI Distributed memory Space- shared Parallel file system Batch scheduler
13 Applica'on Composi'on Will Be Increasingly Important at Extreme- Scale More complex workflows are driving need for advanced OS services and capability Exascale applica'ons will con'nue to evolve beyond a space- shared batch scheduled approach HPC applica'on developers are employing ad- hoc solu'ons Interfaces and tools like mmap, ptrace, python for coupling codes and sharing data Tools stress OS func'onality because of these legacy APIs and services More afen'on needed on how mul'ple applica'ons are composed Several use cases Ensemble calcula'ons for uncertainty quan'fica'on Mul'- {material, physics, scale} simula'ons In- situ analysis Graph analy'cs Performance and correctness tools Requirements are driven by applica'ons Not necessarily by parallel programming model Somewhat insulated from hardware advancements
14 Composi'on in Hobbes Logical Structure (logical enclaves) Component C Component A Component B Global OS Mapping Physical Structure (physical enclaves) Enclave 2 Enclave 1
15 Lightweight Virtualiza'on is a Key Differen'ator Kifen lightweight kernel Small, highly reliable code base Focused on scalable HPC applica'ons Low noise Small memory footprint Applica'on- based resource management Being enhanced to support dynamic adap've run'me systems Support for Run'me Interface to OS (RIOS) as part XPRESS project Palacios lightweight hypervisor OS- independent virtual machine monitor Can be combined with Kifen or Linux Full system virtualiza'on Guest OS does not need to be modified Supports running mul'ple guests concurrently Passthrough resource par''oning Extensive configurability Low noise
16 Hobbes Node Virtualiza'on Layer Virtualiza'on layer provides light- weight OS func'onality Trust is limited to the light- weight OS/virtualiza'on layer Enclave and Global OS policies implemented directly on the virtualiza'on layer OS OS Enclave & Global OS Components HPC Application & runtime Application & runtime Application & runtime Light-Weight Kernel/Virtualization Layer
17 Power/Energy Thoughts Need three orders of magnitude reduc'on Largest impact must come from hardware Double digit percentage improvement from soaware may be best case Will be harder to maintain given hardware improvements Most explicit data movement has been addressed Not much OS/R can do about implicit data movement like register spill Applica'ons becoming more dynamic But bulk synchronous simplifies resource management Other power- aware soaware environments aren t sophis'cated Turn devices off when not being used
So#ware Tools and Techniques for HPC, Clouds, and Server- Class SoCs Ron Brightwell
So#ware Tools and Techniques for HPC, Clouds, and Server- Class SoCs Ron Brightwell R&D Manager, Scalable System So#ware Department Sandia National Laboratories is a multi-program laboratory managed and
Achieving Performance Isolation with Lightweight Co-Kernels
Achieving Performance Isolation with Lightweight Co-Kernels Jiannan Ouyang, Brian Kocoloski, John Lange The Prognostic Lab @ University of Pittsburgh Kevin Pedretti Sandia National Laboratories HPDC 2015
Data Center Evolu.on and the Cloud. Paul A. Strassmann George Mason University November 5, 2008, 7:20 to 10:00 PM
Data Center Evolu.on and the Cloud Paul A. Strassmann George Mason University November 5, 2008, 7:20 to 10:00 PM 1 Hardware Evolu.on 2 Where is hardware going? x86 con(nues to move upstream Massive compute
BSC vision on Big Data and extreme scale computing
BSC vision on Big Data and extreme scale computing Jesus Labarta, Eduard Ayguade,, Fabrizio Gagliardi, Rosa M. Badia, Toni Cortes, Jordi Torres, Adrian Cristal, Osman Unsal, David Carrera, Yolanda Becerra,
Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step. Arbela Technologies
Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step Arbela Technologies Why Upgrade? What to do? How to do it? Tools and templates Agenda Sure Step 2012 Ax2012 Upgrade specific steps Checklist
Using RDBMS, NoSQL or Hadoop?
Using RDBMS, NoSQL or Hadoop? DOAG Conference 2015 Jean- Pierre Dijcks Big Data Product Management Server Technologies Copyright 2014 Oracle and/or its affiliates. All rights reserved. Data Ingest 2 Ingest
Asynchronous Dynamic Load Balancing (ADLB)
Asynchronous Dynamic Load Balancing (ADLB) A high-level, non-general-purpose, but easy-to-use programming model and portable library for task parallelism Rusty Lusk Mathema.cs and Computer Science Division
Overview on Modern Accelerators and Programming Paradigms Ivan Giro7o [email protected]
Overview on Modern Accelerators and Programming Paradigms Ivan Giro7o [email protected] Informa(on & Communica(on Technology Sec(on (ICTS) Interna(onal Centre for Theore(cal Physics (ICTP) Mul(ple Socket
Data Management in the Cloud: Limitations and Opportunities. Annies Ductan
Data Management in the Cloud: Limitations and Opportunities Annies Ductan Discussion Outline: Introduc)on Overview Vision of Cloud Compu8ng Managing Data in The Cloud Cloud Characteris8cs Data Management
How To Protect Virtualized Data From Security Threats
S24 Virtualiza.on Security from the Auditor Perspec.ve Rob Clyde, CEO, Adap.ve Compu.ng; former CTO, Symantec David Lu, Senior Product Manager, Trend Micro Hemma Prafullchandra, CTO/SVP Products, HyTrust
Full and Para Virtualization
Full and Para Virtualization Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF x86 Hardware Virtualization The x86 architecture offers four levels
Introduction to Embedded Systems. Software Update Problem
Introduction to Embedded Systems CS/ECE 6780/5780 Al Davis logistics minor Today s topics: more software development issues 1 CS 5780 Software Update Problem Lab machines work let us know if they don t
Portable, Scalable, and High-Performance I/O Forwarding on Massively Parallel Systems. Jason Cope [email protected]
Portable, Scalable, and High-Performance I/O Forwarding on Massively Parallel Systems Jason Cope [email protected] Computation and I/O Performance Imbalance Leadership class computa:onal scale: >100,000
A Case Study - Scaling Legacy Code on Next Generation Platforms
Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 00 (2015) 000 000 www.elsevier.com/locate/procedia 24th International Meshing Roundtable (IMR24) A Case Study - Scaling Legacy
Decomposition into Parts. Software Engineering, Lecture 4. Data and Function Cohesion. Allocation of Functions and Data. Component Interfaces
Software Engineering, Lecture 4 Decomposition into suitable parts Cross cutting concerns Design patterns I will also give an example scenario that you are supposed to analyse and make synthesis from The
Web Application Platform for Sandia
Web Application Platform for Sandia Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for
Project Overview. Collabora'on Mee'ng with Op'mis, 20-21 Sept. 2011, Rome
Project Overview Collabora'on Mee'ng with Op'mis, 20-21 Sept. 2011, Rome Cloud-TM at a glance "#$%&'$()!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!"#$%&!"'!()*+!!!!!!!!!!!!!!!!!!!,-./01234156!("*+!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!&7"7#7"7!("*+!!!!!!!!!!!!!!!!!!!89:!;62!("$+!
Performance Monitoring of Parallel Scientific Applications
Performance Monitoring of Parallel Scientific Applications Abstract. David Skinner National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory This paper introduces an infrastructure
Infiniband Monitoring of HPC Clusters using LDMS (Lightweight Distributed Monitoring System)
OpenFabrics Software User Group Workshop Infiniband Monitoring of HPC Clusters using LDMS (Lightweight Distributed Monitoring System) Christopher Beggio Sandia National Laboratories Sandia National Laboratories
Data Centric Systems (DCS)
Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems
Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS
Copyright 2014 Splunk Inc. Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS Dritan Bi=ncka BD Solu=ons Architecture Disclaimer During the course of this presenta=on, we may make forward looking statements
benefit of virtualiza/on? Virtualiza/on An interpreter may not work! Requirements for Virtualiza/on 1/06/15 Which of the following is not a poten/al
1/06/15 Benefits of virtualiza/on Virtualiza/on Which of the following is not a poten/al benefit of virtualiza/on? A. cost effec/ve B. applica/on migra/on is easy C. improve applica/on performance D. run
PARALLELS CLOUD SERVER
PARALLELS CLOUD SERVER An Introduction to Operating System Virtualization and Parallels Cloud Server 1 Table of Contents Introduction... 3 Hardware Virtualization... 3 Operating System Virtualization...
Cloud Server. Parallels. An Introduction to Operating System Virtualization and Parallels Cloud Server. White Paper. www.parallels.
Parallels Cloud Server White Paper An Introduction to Operating System Virtualization and Parallels Cloud Server www.parallels.com Table of Contents Introduction... 3 Hardware Virtualization... 3 Operating
Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association
Making Multicore Work and Measuring its Benefits Markus Levy, president EEMBC and Multicore Association Agenda Why Multicore? Standards and issues in the multicore community What is Multicore Association?
Introduc8on to Apache Spark
Introduc8on to Apache Spark Jordan Volz, Systems Engineer @ Cloudera 1 Analyzing Data on Large Data Sets Python, R, etc. are popular tools among data scien8sts/analysts, sta8s8cians, etc. Why are these
The Flink Big Data Analytics Platform. Marton Balassi, Gyula Fora" {mbalassi, gyfora}@apache.org
The Flink Big Data Analytics Platform Marton Balassi, Gyula Fora" {mbalassi, gyfora}@apache.org What is Apache Flink? Open Source Started in 2009 by the Berlin-based database research groups In the Apache
CS 4604: Introduc0on to Database Management Systems
CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #1: Introduc/on Many slides based on material by Profs. Murali, Ramakrishnan and Faloutsos Course Informa0on Instructor B.
Ibis: Scaling Python Analy=cs on Hadoop and Impala
Ibis: Scaling Python Analy=cs on Hadoop and Impala Wes McKinney, Budapest BI Forum 2015-10- 14 @wesmckinn 1 Me R&D at Cloudera Serial creator of structured data tools / user interfaces Mathema=cian MIT
Protec'ng Informa'on Assets - Week 8 - Business Continuity and Disaster Recovery Planning. MIS 5206 Protec/ng Informa/on Assets Greg Senko
Protec'ng Informa'on Assets - Week 8 - Business Continuity and Disaster Recovery Planning MIS5206 Week 8 In the News Readings In Class Case Study BCP/DRP Test Taking Tip Quiz In the News Discuss items
An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style
An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style Agenda A quick look at ManageEngine Tradi/onal Traffic Analysis Techniques & Tools Changing face of Network
Cloud Compu)ng: Overview & challenges. Aminata A. Garba
Cloud Compu)ng: Overview & challenges Aminata A. Garba Outline I. Introduc*on II. Virtualiza*on III. Resources Op*miza*on VI. Challenges 2 A Historical Note 1960, the idea of organizing computa)on as a
10- High Performance Compu5ng
10- High Performance Compu5ng (Herramientas Computacionales Avanzadas para la Inves6gación Aplicada) Rafael Palacios, Fernando de Cuadra MRE Contents Implemen8ng computa8onal tools 1. High Performance
Introduction to Virtual Machines
Introduction to Virtual Machines Introduction Abstraction and interfaces Virtualization Computer system architecture Process virtual machines System virtual machines 1 Abstraction Mechanism to manage complexity
Solution Guide Parallels Virtualization for Linux
Solution Guide Parallels Virtualization for Linux Overview Created in 1991, Linux was designed to be UNIX-compatible software that was composed entirely of open source or free software components. Linux
Clusters in the Cloud
Clusters in the Cloud Dr. Paul Coddington, Deputy Director Dr. Shunde Zhang, Compu:ng Specialist eresearch SA October 2014 Use Cases Make the cloud easier to use for compute jobs Par:cularly for users
The Virtualization Practice
The Virtualization Practice White Paper: Managing Applications in Docker Containers Bernd Harzog Analyst Virtualization and Cloud Performance Management October 2014 Abstract Docker has captured the attention
Apache Spark and the future of big data applica5ons. Eric Baldeschwieler
Apache Spark and the future of big data applica5ons Eric Baldeschwieler Who is Eric14? Big data veteran (since 1996) Databricks Tech Advisor Twitter handle: @jeric14 Previously CTO/CEO of Hortonworks Yahoo
Cray DVS: Data Virtualization Service
Cray : Data Virtualization Service Stephen Sugiyama and David Wallace, Cray Inc. ABSTRACT: Cray, the Cray Data Virtualization Service, is a new capability being added to the XT software environment with
The Impact of PaaS on Business Transformation
The Impact of PaaS on Business Transformation September 2014 Chris McCarthy Sr. Vice President Information Technology 1 Legacy Technology Silos Opportunities Business units Infrastructure Provisioning
Virtualization. Pradipta De [email protected]
Virtualization Pradipta De [email protected] Today s Topic Virtualization Basics System Virtualization Techniques CSE506: Ext Filesystem 2 Virtualization? A virtual machine (VM) is an emulation
Performance Management in Big Data Applica6ons. Michael Kopp, Technology Strategist @mikopp
Performance Management in Big Data Applica6ons Michael Kopp, Technology Strategist NoSQL: High Volume/Low Latency DBs Web Java Key Challenges 1) Even Distribu6on 2) Correct Schema and Access paperns 3)
SharePoint Capacity Planning Balancing Organiza,onal Requirements with Performance and Cost
SharePoint Capacity Planning Balancing Organiza,onal Requirements with Performance and Cost Kirk Devore / J.D. Wade SharePoint Consultants Horizons Consul;ng Agenda Expecta;ons Defining SharePoint Capacity
Intro to Virtualization
Cloud@Ceid Seminars Intro to Virtualization Christos Alexakos Computer Engineer, MSc, PhD C. Sysadmin at Pattern Recognition Lab 1 st Seminar 19/3/2014 Contents What is virtualization How it works Hypervisor
High Performance Computing. Course Notes 2007-2008. HPC Fundamentals
High Performance Computing Course Notes 2007-2008 2008 HPC Fundamentals Introduction What is High Performance Computing (HPC)? Difficult to define - it s a moving target. Later 1980s, a supercomputer performs
Virtual Machine Monitors. Dr. Marc E. Fiuczynski Research Scholar Princeton University
Virtual Machine Monitors Dr. Marc E. Fiuczynski Research Scholar Princeton University Introduction Have been around since 1960 s on mainframes used for multitasking Good example VM/370 Have resurfaced
Cluster, Grid, Cloud Concepts
Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of
SQream Technologies Ltd - Confiden7al
SQream Technologies Ltd - Confiden7al 1 Ge#ng Big Data Done On a GPU- Based Database Ori Netzer VP Product 26- Mar- 14 Analy7cs Performance - 3 TB, 18 Billion records SQream Database 400x More Cost Efficient!
Cloud Computing and Open Source: Watching Hype meet Reality
Cloud Computing and Open Source: Watching Hype meet Reality Rich Wolski UCSB Computer Science Eucalyptus Systems Inc. May 26, 2011 Exciting Weather Forecasts 99 M 167 M 6.5 M What is a cloud? SLAs Web
Replacing a commercial integration platform with an open source ESB. Magnus Larsson [email protected] Cadec 2010-01- 20
Replacing a commercial integration platform with an open source ESB Magnus Larsson [email protected] Cadec 2010-01- 20 Agenda The customer Phases Problem defini?on Proof of concepts
The Importability of Performance Tools: The Good, the Bad, and the Ugly of PMUs
The Importability of Performance Tools: The Good, the Bad, and the Ugly of PMUs Jeanine Cook Scalable Architectures SNL ABQ Sandia National Laboratories is a multi-program laboratory managed and operated
Cloud Computing through Virtualization and HPC technologies
Cloud Computing through Virtualization and HPC technologies William Lu, Ph.D. 1 Agenda Cloud Computing & HPC A Case of HPC Implementation Application Performance in VM Summary 2 Cloud Computing & HPC HPC
x64 Servers: Do you want 64 or 32 bit apps with that server?
TMurgent Technologies x64 Servers: Do you want 64 or 32 bit apps with that server? White Paper by Tim Mangan TMurgent Technologies February, 2006 Introduction New servers based on what is generally called
Next Generation Operating Systems
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015 The end of CPU scaling Future computing challenges Power efficiency Performance == parallelism Cisco Confidential 2 Paradox of the
BLM 413E - Parallel Programming Lecture 3
BLM 413E - Parallel Programming Lecture 3 FSMVU Bilgisayar Mühendisliği Öğr. Gör. Musa AYDIN 14.10.2015 2015-2016 M.A. 1 Parallel Programming Models Parallel Programming Models Overview There are several
The Benefits of Virtualizing Citrix XenApp with Citrix XenServer
White Paper The Benefits of Virtualizing Citrix XenApp with Citrix XenServer This white paper will discuss how customers can achieve faster deployment, higher reliability, easier management, and reduced
Is Virtualization Killing SSI Research?
Is Virtualization Killing SSI Research? Jérôme Gallard, Geoffroy Vallée, Adrien Lèbre, Christine Morin, Pascal Gallard and Stephen L. Scott Aug, 26th Outline Context Cluster BS/SSI Virtualization Combining
A Data Centric Approach for Modular Assurance. Workshop on Real-time, Embedded and Enterprise-Scale Time-Critical Systems 23 March 2011
A Data Centric Approach for Modular Assurance The Real-Time Middleware Experts Workshop on Real-time, Embedded and Enterprise-Scale Time-Critical Systems 23 March 2011 Gabriela F. Ciocarlie Heidi Schubert
An Open Dynamic Big Data Driven Applica3on System Toolkit
An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University
Cloud Development Strategies
Photos placed in horizontal position with even amount of white space between photos and header Cloud Development Strategies A 2015 NLIT Fueling the Fire in IT presentation John G Miner, Mgr of Enterprise
A Software and Hardware Architecture for a Modular, Portable, Extensible Reliability. Availability and Serviceability System
1 A Software and Hardware Architecture for a Modular, Portable, Extensible Reliability Availability and Serviceability System James H. Laros III, Sandia National Laboratories (USA) [1] Abstract This paper
Understanding and Detec.ng Real- World Performance Bugs
Understanding and Detec.ng Real- World Performance Bugs Gouliang Jin, Linhai Song, Xiaoming Shi, Joel Scherpelz, and Shan Lu Presented by Cindy Rubio- González Feb 10 th, 2015 Mo.va.on Performance bugs
Cloud Based Application Architectures using Smart Computing
Cloud Based Application Architectures using Smart Computing How to Use this Guide Joyent Smart Technology represents a sophisticated evolution in cloud computing infrastructure. Most cloud computing products
White Paper. Anywhere, Any Device File Access with IT in Control. Enterprise File Serving 2.0
White Paper Enterprise File Serving 2.0 Anywhere, Any Device File Access with IT in Control Like it or not, cloud- based file sharing services have opened up a new world of mobile file access and collaborative
2972 Linux Options and Best Practices for Scaleup Virtualization
HP Technology Forum & Expo 2009 Produced in cooperation with: 2972 Linux Options and Best Practices for Scaleup Virtualization Thomas Sjolshagen Linux Product Planner June 17 th, 2009 2009 Hewlett-Packard
COM 444 Cloud Computing
COM 444 Cloud Computing Lec 3: Virtual Machines and Virtualization of Clusters and Datacenters Prof. Dr. Halûk Gümüşkaya [email protected] [email protected] http://www.gumuskaya.com Virtual
