Infrastructure-as-a-Service Cloud Computing for Science
1 Infrastructure-as-a-Service Cloud Computing for Science
October 2009, Banff Centre, Banff, Canada
Kate Keahey, Nimbus project lead
University of Chicago, Argonne National Laboratory
2 Cloud Computing for Science
Environment: complexity, consistency, availability
3 Grid Computing
Assumption: control over the manner in which resources are used stays with the site
[Diagram labels: Site A, Site B, VO-A]
- Site-specific environment and mode of access
- Site-driven prioritization
- But: site control -> rapid adoption
4 Cloud Computing
Change of assumption: control over the resource is turned over to the user
[Diagram labels: Site A, Site B, VO-A]
- Enabling factors: virtualization and isolation
- Challenges our notion of a site
- But: slow adoption
5 Grids to Clouds: a Personal Perspective
[Timeline; the dates did not survive in the transcript. Labels:]
- First STAR production run on EC2
- Xen released
- EC2 goes online
- Nimbus Cloud comes online
- A Case for Grid Computing on VMs
- In-Vigo, VIOLIN, DVEs, Dynamic accounts
- Policy-driven negotiation
- First WSRF Workspace Service release
- EC2 gateway available
- Context Broker release
- Support for EC2 interfaces
6 A Very Quick Introduction to the Nimbus Toolkit: an Infrastructure-as-a-Service Toolkit
7 Nimbus: Cloud Computing Software
Allow providers to build clouds
- Workspace Service: a service providing EC2-like functionality
- WSRF and WS (EC2) interfaces
Allow users to use cloud computing
- Do whatever it takes to enable scientists to use IaaS
- Context Broker: turnkey virtual clusters
- Also: protocol adapters, account managers and scaling tools
Allow developers to experiment with Nimbus
- For research or usability/performance improvements
- Open source, extensible software
- Community extensions and contributions: UVIC (monitoring), IU (EBS, research), Technical University of Vienna (privacy, research)
Nimbus:
8 The Workspace Service
[Diagram label: VWS Service]
9 The Workspace Service
[Diagram label: VWS Service]
- The workspace service publishes information about each workspace
- Users can find out information about their workspace (e.g. what IP the workspace was bound to)
- Users can interact directly with their workspaces the same way they would with a physical machine
10 Cloud Computing Ecosystem
- Appliance Providers: marketplaces, commercial providers, Virtual Organizations
- Appliance management software
- Deployment Orchestrator
- VMM/DataCenter/IaaS, User Environments
- VMM/DataCenter/IaaS, User Environments
11 Turnkey Virtual Clusters
[Diagram: three VMs labelled IP1/HK1, IP2/HK2 and IP3/HK3 connect to the Context Broker; afterwards each VM is shown holding all three IP/HK pairs. One node is labelled MPI.]
- Turnkey, tightly-coupled cluster
- Shared trust/security context
- Shared configuration/context information
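The exchange pictured on this slide can be sketched roughly as follows. This is a minimal illustration only: `ContextBroker` here is a hypothetical in-memory class, not the Nimbus Context Broker API, and reading "HK" as a host key is an assumption. Each VM reports its own IP/HK pair, and once every expected member has reported, any VM can fetch the complete roster.

```python
from dataclasses import dataclass, field


@dataclass
class ContextBroker:
    """Hypothetical in-memory stand-in for a context broker (not the Nimbus API)."""
    expected: int                                # number of VMs in the virtual cluster
    roster: dict = field(default_factory=dict)   # maps IP -> host key (assumed meaning of HK)

    def report(self, ip: str, host_key: str) -> None:
        # Each VM contacts the broker after boot and reports its own identity.
        self.roster[ip] = host_key

    def ready(self) -> bool:
        # The shared context is complete once every expected VM has reported.
        return len(self.roster) >= self.expected

    def context(self) -> dict:
        # Hand out the shared context only when it is complete.
        if not self.ready():
            raise RuntimeError("context incomplete: not all VMs have reported")
        return dict(self.roster)


def known_hosts_lines(context: dict) -> list:
    # Render the shared context as one "ip key" line per cluster member.
    return [f"{ip} {key}" for ip, key in sorted(context.items())]


broker = ContextBroker(expected=3)
for ip, key in [("IP1", "HK1"), ("IP2", "HK2"), ("IP3", "HK3")]:
    broker.report(ip, key)
lines = known_hosts_lines(broker.context())
print("\n".join(lines))
```

Withholding the roster until all members have reported is one simple way to give every node the same view of the cluster, which is what a tightly-coupled job such as an MPI run needs before it starts.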
12 Scientific Cloud Resources and Applications
13 Science Clouds
Goals
- Enable experimentation with IaaS
- Evolve software in response to user needs
- Exploration of cloud interoperability issues
Participants
- University of Chicago (since 03/08), University of Florida (05/08, access via VPN), Purdue (09/08)
- International collaborators
- Using EC2 for large runs
Science Clouds Marketplace: OSG cluster, Hadoop, etc.
100s of users, many diverse projects ranging across science, CS research, build&test, education, etc.
Come and run:
14 STAR experiment
Work by Jerome Lauret, Leve Hajdu, Lidia Didenko (BNL), Doug Olson (LBNL)
- STAR: a nuclear physics experiment at Brookhaven National Laboratory
- Studies fundamental properties of nuclear matter
- Problems: complexity, consistency, availability
15 STAR Virtual Clusters
Virtual resources
- A virtual OSG STAR cluster: OSG head (gridmapfiles, host certificates, NFS, Torque), workers: SL4 + STAR
- One-click virtual cluster deployment via Nimbus Context Broker
From Science Clouds to EC2 runs
Running production codes since 2007
The Quark Matter run: producing just-in-time results for a conference:
16 STAR Quark Matter Run
[Diagram labels: Gateway/Context Broker, Infrastructure-as-a-Service]
17 Priceless?
Compute costs: $5,[…] […]s over ~10 days
- Instances, 32-bit, 1.7 GB memory:
- EC2 default: 1 EC2 CPU unit
- High-CPU Medium Instances: 5 EC2 CPU units (2 cores)
- ~36,000 compute hours total
Data transfer costs: $[…]
- Small I/O needs: moved <1TB of data over duration
Storage costs: $4.69
- Images only, all data transferred at run-time
Producing the result before the deadline: $5,771.37
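The figures that survive on this slide allow two quick sanity checks, sketched below. The variable names are illustrative. The compute and data-transfer amounts are incomplete in the transcript, so only their sum can be derived, and the per-hour figure rests on the slide's approximate "~36,000 compute hours".

```python
total_cost = 5771.37      # total from the slide, in USD
storage_cost = 4.69       # storage cost from the slide, in USD
compute_hours = 36_000    # "~36,000 compute hours total" (approximate)

# Compute and data transfer together account for everything except storage.
compute_plus_transfer = round(total_cost - storage_cost, 2)

# Rough all-in cost per compute hour, using the approximate hour count.
cost_per_hour = total_cost / compute_hours

print(f"compute + transfer: ${compute_plus_transfer:,.2f}")
print(f"approx. cost per compute hour: ${cost_per_hour:.2f}")
```

On these numbers storage is a negligible share of the bill, which is consistent with the slide's note that only images were stored and all data was transferred at run time.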
18 Modeling the Progression of Epidemics
Work by Ron Price and others, Public Health Informatics, University of Utah
- Can we use clouds to acquire on-demand resources for modeling the progression of epidemics?
- What is the efficiency of simulations in the cloud?
- Compare execution on: a physical machine; 10 VMs on the cloud
- The Nimbus cloud only
- 2.5 hrs versus 17 minutes
- Speedup = […] times faster
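The speedup value itself is missing from the transcript, but it can be estimated from the two run times that remain. The sketch below assumes that 2.5 hours is the single physical machine and 17 minutes is the 10-VM cloud run, which is the most natural reading of the slide but is not stated outright. Parallel efficiency is computed as speedup divided by the number of VMs.

```python
single_machine_minutes = 2.5 * 60   # assumed: 2.5 hours on one physical machine
cloud_minutes = 17                  # assumed: 17 minutes on 10 VMs
num_vms = 10

# Speedup is the ratio of the slower run time to the faster one.
speedup = single_machine_minutes / cloud_minutes

# Parallel efficiency: how much of the ideal 10x speedup was achieved.
efficiency = speedup / num_vms

print(f"speedup: {speedup:.2f}x, parallel efficiency: {efficiency:.0%}")
```

Under that reading, the run is a little under nine times faster on ten VMs, i.e. close to but short of linear scaling.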
19 A Large Ion Collider Experiment (ALICE)
Work by Artem Harutyunyan and Predrag Buncic, CERN
- Heavy ion simulations at CERN
- Problem: integrate elastic computing into current infrastructure
- Collaboration with CernVM project
- Elastically extend the ALICE testbed to accommodate more computing
20 Elastic Provisioning for ALICE HEP
[Diagram labels: ALICE queue, queue sensor, AliEn, Context Broker, Infrastructure-as-a-Service]
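The diagram suggests a control loop in which a queue sensor watches the ALICE job queue and asks the IaaS layer for more or fewer worker VMs. The sketch below shows one possible scaling policy; the function names, the `jobs_per_vm` ratio and the `max_vms` cap are all hypothetical and are not taken from the talk or from AliEn.

```python
import math


def desired_vms(queued_jobs: int, jobs_per_vm: int, max_vms: int) -> int:
    """Worker VMs wanted for the current queue depth (hypothetical policy)."""
    if queued_jobs <= 0:
        return 0
    # Enough VMs to cover the queue, capped at the provisioning limit.
    return min(max_vms, math.ceil(queued_jobs / jobs_per_vm))


def scaling_action(queued_jobs: int, running_vms: int,
                   jobs_per_vm: int = 4, max_vms: int = 20) -> int:
    """Positive result: launch that many VMs. Negative: terminate that many."""
    return desired_vms(queued_jobs, jobs_per_vm, max_vms) - running_vms


# One sensor reading: 10 queued jobs and no workers yet.
print(scaling_action(queued_jobs=10, running_vms=0))
```

A real sensor would call something like this periodically and pass the result to the provisioning layer; damping rules such as minimum VM lifetime are left out to keep the sketch short.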
21 Elastically Provisioned Resources
- CHEP09 paper, Harutyunyan et al.
- Elastic resource base: OOI, ATLAS, ElasticSite, and others
22 Sky Computing
Change of assumption: we can now trust remote resources
[Diagram labels: Site A, Site B, VO-A]
- Enabling factors: cloud computing and virtual networks
- Instead of a bunch of disconnected domains, one domain overlapping the Internet
- Network leases for a fully controlled environment
23 Sky Computing Environment
Work by A. Matsunaga, M. Tsugawa, University of Florida
[Diagram labels: U of Chicago, U of Florida, Purdue; three ViNE routers]
Creating a seamless environment in a distributed domain
24 Hadoop in the Science Clouds
[Diagram labels: U of Chicago, U of Florida, Purdue, Hadoop cloud]
Papers:
- CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications, by A. Matsunaga, M. Tsugawa and J. Fortes. eScience
- Sky Computing, by K. Keahey, A. Matsunaga, M. Tsugawa, J. Fortes, to appear in IEEE Internet Computing, September 2009
25 Canadian Involvement
Two CANARIE Network Enabled Platforms projects in development using Nimbus
- High Energy Physics Legacy Data Project (started this month): allows the preservation of analysis environments along with the data, to ensure that the data will continue to be analyzable in the future.
- CANFAR: Canadian Astronomy Network for Astronomical Research: allows astronomers to do their analysis in their own custom environments. Supports multiple telescopes.
Come check it out at the NEP Exhibition, Wednesday 18:00, during the reception
Ian Gable
26 Parting Thoughts
IaaS cloud computing is science-driven
- Scientific applications are successfully using the existing infrastructure for production runs
- Promising new model for the future
We are just at the very beginning of the cloud revolution
- Cloud computing is not done
- Significant challenges in building ecosystem, security, usage, price-performance, etc.
- Lots of work to do!