Cloud Computing with Nimbus



OGF25/EGEE User Forum Catania, Italy 2 March 2009

Cloud Computing with Nimbus
February 2009
Kate Keahey (keahey@mcs.anl.gov)
University of Chicago, Argonne National Laboratory

Cloud Computing
  elasticity
  computing on demand
  capital expense
  operational expense
  SaaS: Software-as-a-Service
  PaaS: Platform-as-a-Service
  IaaS: Infrastructure-as-a-Service
8/25/2009 The Nimbus Toolkit: http//.globus.org 2

The Quest Begins
  Code complexity
  Resource control

Workspaces
  Dynamically provisioned environments
    Environment control
    Resource control
  Implementations
    Via leasing hardware platforms: reimaging, configuration management, dynamic accounts
    Via virtualization: VM deployment

A Brief History of Nimbus
  [Timeline figure; axis marked 2003, 2006, 2009]
  Events shown above the axis:
    First STAR production run on EC2
    Xen released
    EC2 goes online
    Nimbus Cloud comes online
  Events shown below the axis:
    Research on agreement-based services
    First WSRF Workspace Service release
    EC2 gateway available
    Support for EC2 interfaces
    Context Broker release

Nimbus Overview
  Goal: cloud computing for science
    Open source, extensible IaaS implementation
    A platform for experimentation with features for scientific needs and interoperability
    Set up private clouds (privacy, expense considerations)
  Workspace Service
  Orchestration tools
    Focus on end-to-end picture
    Context Broker, gateway
  http://.globus.org/

The Workspace Service
  [Diagram: VWS Service]

The Workspace Service
  [Diagram: VWS Service, Trusted Computing Base (TCB)]
  The service publishes information on each workspace
  Users can query this information to find out things about their workspace (e.g. what IP the workspace was bound to)
  Users can interact directly with their workspaces the same way they would with a physical machine

Workspace Service Interfaces and Clients
  Web Services based
  Web Service Resource Framework (WSRF)
    GT-based
  Elastic Compute Cloud (EC2)
    Supported: ec2-describe-images, ec2-run-instances, ec2-describe-instances, ec2-terminate-instances, ec2-reboot-instances, ec2-add-keypair, ec2-delete-keypair
    Unsupported: availability zones, security groups, elastic IP assignment, REST
    Used alongside WSRF interfaces
      E.g., the University of Chicago cloud allows you to connect via the cloud client or via the EC2 client
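The supported and unsupported lists on this slide can be captured as a small lookup, which is handy when scripting against a Nimbus cloud through the EC2 front end. This is only a sketch: the command names are copied from the slide, while the function name `is_supported` and the constant names are hypothetical and not part of Nimbus or the EC2 tools.

```python
# Command names below are taken from the slide; everything else is illustrative.
SUPPORTED_EC2_COMMANDS = frozenset({
    "ec2-describe-images",
    "ec2-run-instances",
    "ec2-describe-instances",
    "ec2-terminate-instances",
    "ec2-reboot-instances",
    "ec2-add-keypair",
    "ec2-delete-keypair",
})

# Features the slide lists as unsupported by the EC2 front end.
UNSUPPORTED_EC2_FEATURES = frozenset({
    "availability zones",
    "security groups",
    "elastic IP assignment",
    "REST",
})


def is_supported(command: str) -> bool:
    """Return True if the command appears in the slide's supported list."""
    return command.strip().lower() in SUPPORTED_EC2_COMMANDS
```

A script could call `is_supported("ec2-run-instances")` before invoking the command and fall back to the WSRF cloud client otherwise.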

Security
  GSI authentication and authorization
    PKI credential required
    Works with Grid proxies
    VOMS, Shibboleth (via GridShib), custom PDPs
  Secure access to VMs
    EC2 key generation or accessed from .ssh
  Validating images and image data
    Collaboration with Vienna University of Technology
    Paper: Descher et al., Retaining Data Control in Infrastructure Clouds

Networking
  Network configuration
    External: public IPs or private IPs (via VPN)
    Internal: private network via a local cluster network
  Each VM can specify multiple NICs mixing private and public networks (WSRF only)
    E.g., cluster worker nodes on a private network, head node on both public and private networks
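The head/worker example on this slide can be written down as a small data model. The sketch below is not the Nimbus request format; the class names `Nic` and `VmSpec`, the interface names `eth0`/`eth1`, and the helper functions are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple


@dataclass(frozen=True)
class Nic:
    name: str      # hypothetical interface name, e.g. "eth0"
    network: str   # "public" or "private"


@dataclass(frozen=True)
class VmSpec:
    role: str
    nics: Tuple[Nic, ...]


def virtual_cluster(workers: int) -> List[VmSpec]:
    """Build one head node on both networks plus private-only workers."""
    if workers < 0:
        raise ValueError("workers must be non-negative")
    head = VmSpec("head", (Nic("eth0", "public"), Nic("eth1", "private")))
    worker_specs = [
        VmSpec(f"worker{i}", (Nic("eth0", "private"),))
        for i in range(1, workers + 1)
    ]
    return [head] + worker_specs


def networks(vm: VmSpec) -> Set[str]:
    """Return the set of networks a VM is attached to."""
    return {nic.network for nic in vm.nics}
```

With `virtual_cluster(3)`, only the head node is reachable from outside, and every node shares the private network.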

The Back Story
  [Diagram: VWS Service, Trusted Computing Base (TCB)]
  Workspace WSRF front-end that allows clients to deploy and manage virtual workspaces
  Workspace back-end:
    Resource manager for a pool of physical nodes
    Deploys and manages Workspaces on the nodes
    Each node must have a VMM (Xen) installed, as well as the control program that manages individual nodes

Workspace Components
  [Diagram labels: EC2, WSRF, service, resource manager, pilot, control, client]

Cloud Closure
  [Diagram labels: EC2, WSRF, storage service, service, resource manager, pilot, control, cloud client, client]

The IaaS Gateway
  [Diagram labels: EC2, WSRF, storage service, service, IaaS gateway, resource manager, pilot, control, EC2, potentially other providers, cloud client, client]
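One way to picture the gateway's role is as an overflow path: requests are served from the local resource pool first, and whatever does not fit is forwarded to an external provider such as EC2. The sketch below only illustrates that idea under stated assumptions; the function `place` and its return keys are hypothetical, and the real gateway's placement policy is not described on the slide.

```python
from typing import Dict


def place(requested: int, local_free: int) -> Dict[str, int]:
    """Split a request between the local pool and the gateway.

    Fills local capacity first and sends the remainder through the
    gateway. This is an assumed policy for illustration only.
    """
    if requested < 0 or local_free < 0:
        raise ValueError("counts must be non-negative")
    local = min(requested, local_free)
    return {"local": local, "gateway": requested - local}
```

For example, asking for 20 VMs when 16 local slots are free would keep 16 local and pass 4 through the gateway.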

Virtual Clusters
  Tightly-coupled clusters
    MPI
  [Diagram: cluster nodes labeled HK1, IP2, HK2, IP3, HK3]
  What makes a cluster a cluster?
    Shared trust/security context
    Shared configuration/context information
  Reciprocal exchange of information: networking and security

Context Broker
  [Diagram: nodes labeled with IP2, IP3 and HK1, HK2, HK3 pass these values through the Context Broker until each node holds the full set]
  Parameterizable appliance
  Context information exchange
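The exchange pictured on this slide can be sketched as a small in-memory simulation: each node reports its own values, and once every node has reported, each can retrieve the values of its peers. This is not the Context Broker's actual protocol or API. The class and function names are hypothetical, and reading the IP/HK labels as IP addresses and SSH host keys is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class NodeContext:
    name: str
    ip: str        # assumed meaning of the "IP" labels
    host_key: str  # assumed meaning of the "HK" labels


class ContextExchange:
    """Collects per-node context and hands each node its peers' context."""

    def __init__(self, expected: int) -> None:
        self.expected = expected
        self._reports: Dict[str, NodeContext] = {}

    def report(self, ctx: NodeContext) -> None:
        # A node contributes its own IP and host key.
        self._reports[ctx.name] = ctx

    def complete(self) -> bool:
        return len(self._reports) == self.expected

    def peers_of(self, name: str) -> List[NodeContext]:
        # Context is released only after every expected node has reported.
        if not self.complete():
            raise RuntimeError("not all nodes have reported yet")
        return [c for n, c in sorted(self._reports.items()) if n != name]


def known_hosts(peers: List[NodeContext]) -> List[str]:
    """Render peers as 'address key' lines, one per peer."""
    return [f"{p.ip} {p.host_key}" for p in peers]
```

Holding back `peers_of` until all reports are in mirrors the reciprocal nature of the exchange: no node can be configured before the whole cluster's information is available.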

Context Broker Goals
  Can work with every appliance
    Appliance schema, can be implemented in terms of many configuration systems
  Can work with every cloud provider
    Simple and minimal conditions on generic context delivery
  Can work across multiple cloud providers, in a distributed environment

Status for Context Broker
  Release history:
    In alpha testing since August 07
    First released July 08 (v 1.3.3)
    Latest update January 09 (v 2.2)
  Used to contextualize 100s of nodes for EC2 HEP STAR runs, Hadoop nodes, HEP Alice nodes
  Contextualized images on marketplace
  Working with rPath to make contextualization easier for the user
  OVF extensions to be submitted to DMTF

End of Nimbus Tour
  [Diagram labels: context broker, EC2, WSRF, storage service, service, IaaS gateway, resource manager, pilot, control, EC2, potentially other providers, context client, cloud client, client]

Science Clouds
  Goals
    Make it easy for scientific projects to experiment with cloud computing
    Can cloud computing be used for science?
  Evolve software in response to the needs of scientific projects
    Start with EC2-like functionality and evolve to serve scientific projects: virtual clusters, diverse resource leases
    Federating clouds: moving between cloud resources in academic and commercial space
  Provide a laboratory for exploration of cloud interoperability issues

Science Cloud Resources
  University of Chicago (Nimbus): first cloud, online since March 4th 2008
    16 nodes of UC TeraPort cluster, public IPs
  University of Florida
    Online since 05/08
    16-32 nodes, access via VPN
  Other Science Clouds
    Masaryk University, Brno, Czech Republic (08/08), Purdue (09/08)
    Installations in progress: IU, Grid5K, Vrije, others
  Using EC2 for overflow
  Minimal governance model
  http://.globus.org/clouds

Who Runs on Nimbus?
  [Chart labels: Hadoop, AliEn, GT-scalability, STAR, Montage workflows, GridFTP testing, -team Testing, OSG, geofest, bioinformatics, Other]
  Project diversity: Science, CS, education, build&test

Hadoop over ManyClouds
  [Diagram: U of Chicago and U of Florida, each with a ViNE router]
  Building clouds on top of clouds
  Virtual : ViNE router + application VMs
  Need access to distributed resources, and high level of privilege to run a ViNE router
  Papers:
    Sky Computing, by K. Keahey, A. Matsunaga, M. Tsugawa, J. Fortes. Submitted to IEEE Internet Computing.
    CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications, by A. Matsunaga, M. Tsugawa and J. Fortes. eScience 2008.

Alice HEP Experiment at CERN
  Collaboration with CERNVM project (CHEP09 paper)

STAR
  STAR: a high-energy physics experiment
  Need resources with the right configuration
    Complex environments
    Consistent environments
  A virtual OSG STAR cluster
    OSG cluster: OSG CE (head node), gridmapfiles, host certificates, NFS, Torque; worker nodes: SL4 + STAR
  Requirements
    One-click virtual cluster deployment
  Science Clouds -> EC2
  From proof-of-concept to production runs
  Work by Jerome Lauret, Doug Olson, Leve Hajdu, Lidia Didenko
  Results to be published at Quark Matter conference and CHEP

Infrastructure Testing
  Motivation
    Test middleware scalability, use of different platforms, etc.
  Workspaces
    Globus 101 and several different environments
  Requirements
    Very short-term but flexible access to diverse platforms
  Work by various members of the Globus community (Tom Howe and John Bresnahan), short-lived communities of one
  Resulted in provisioning a private cloud for Globus

Montage Workflows
  Evaluating a cloud from the user's perspective
  Paper: Exploration of the Applicability of Cloud Computing to Large-Scale Scientific Workflows, C. Hoffa, T. Freeman, G. Mehta, E. Deelman, K. Keahey, SWBES08: Challenging Issues in Workflow Applications

Friends and Family
  Committers: Kate Keahey & Tim Freeman (ANL/UC), Ian Gable (UVIC)
  A lot of help from the community, see: http://.globus.org/people.html
  Collaborations:
    Cumulus: S3 implementation (Globus team)
    EBS implementation with IU
    Appliance management: rPath and Bcfg2 projects
    Virtual network overlays: University of Florida
    Security: Vienna University of Technology

Nimbus Users
  Applications users
    Scientific projects
    Use clouds, Nimbus client side, user guides
  Cloud administrators
    Resource providers
    Install Nimbus, administrator guides
  Communities extending Nimbus
    Extensions for research or usability
    Develop code, extensibility guides

Open Source IaaS Implementations
  OpenNebula
    Open source datacenter implementation
    University of Madrid, I. Llorente & team, 03/2008
  Eucalyptus
    Open source implementation of EC2
    UCSB, R. Wolski & team, 06/2008
  Cloud-enabled Nimrod-G
    Open source implementation of EC2
    Monash University, MeSsAGE Lab, 01/2009
  Industry efforts
    openQRM, Enomalism

Cloud Computing Ecosystem
  Appliance Providers
    marketplaces
    commercial providers
    communities (CERNVM)
  Deployment Orchestrator
    orchestrate the deployment of environments across possibly many cloud providers
  VMM/datacenter/IaaS
  User Environments

IaaS Clouds vs Grids
  Interfaces to resources in the cloud
  Significant new abstraction: provisioning resources and environments/configurations
    Configuration control
    Resource control, better articulated SLAs
  Revolution or evolution?
    We can leverage much of the infrastructure developed in the context of grid computing
    There is new potential to explore

Some Thoughts
  Cloud computing has enormous potential for science
  New roles: appliance providers
    Or maybe not so new
  Interoperability
    Academic vs commercial resources
    Standards: rough consensus & working code
  Importance of open source
    Drive academic needs into the marketplace
    Drive the development of standards
  End-to-end tools
    Combine with what we have
    Explore new potential

Acknowledgements
  NSF CSR Virtual Playgrounds
  NSF SDCI Missing Links
  DOE CEDPS proposal
  Various other collaborations with TeraGrid, OOI, and other communities