How To Build A Cloud Computing System With Nimbus



Similar documents
Nimbus: Cloud Computing with Science

Cloud Computing for Science

Infrastructure-as-a-Service Cloud Computing for Science

Cloud Computing with Nimbus

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Kate Keahey and Tim Freeman

Deploying Business Virtual Appliances on Open Source Cloud Computing

Lecture on Cloud Computing. Course 10. Academic cloud systems

Comparative Study of Eucalyptus, Open Stack and Nimbus

Comparison of Several Cloud Computing Platforms

VM Management for Green Data Centres with the OpenNebula Virtual Infrastructure Engine

Efficient Cloud Management for Parallel Data Processing In Private Cloud

Cloud Infrastructure Pattern

Efficient Data Management Support for Virtualized Service Providers

Solution for private cloud computing

Amazon EC2 Product Details Page 1 of 5

Virtual Machine Management with OpenNebula in the RESERVOIR project

SURFsara HPC Cloud Workshop

Plug-and-play Virtual Appliance Clusters Running Hadoop. Dr. Renato Figueiredo ACIS Lab - University of Florida

A Survey on Cloud Storage Systems

Sky Computing: When Multiple Clouds Become One

Elastic Management of Cluster based Services in the Cloud

A Survey on Open-source Cloud Computing Solutions

From Grid Computing to Cloud Computing & Security Issues in Cloud Computing

SURFsara HPC Cloud Workshop

Proactively Secure Your Cloud Computing Platform

Introduction to Cloud computing. Viet Tran

The OpenNebula Standard-based Open -source Toolkit to Build Cloud Infrastructures

Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed

Getting Started Hacking on OpenNebula

FREE AND OPEN SOURCE SOFTWARE FOR CLOUD COMPUTING SERENA SPINOSO FULVIO VALENZA

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

Design and Building of IaaS Clouds

Cloud Providers, SciCloudand

SERVER 101 COMPUTE MEMORY DISK NETWORK

A STUDY ON OPEN SOURCE CLOUD COMPUTING PLATFORMS

Building a Volunteer Cloud

OpenNebula An Innovative Open Source Toolkit for Building Cloud Solutions

Comparison of Multiple Cloud Frameworks

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

CernVM Online and Cloud Gateway a uniform interface for CernVM contextualization and deployment

Cloud Computing Overview

A Survey on Open-source Cloud Computing Solutions

OGF25/EGEE User Forum Catania, Italy 2 March 2009

CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies. Virtualization of Clusters and Data Centers

Experimental Study of Bidding Strategies for Scientific Workflows using AWS Spot Instances

An Open Source Solution for Virtual Infrastructure Management in Private and Hybrid Clouds

CONDOR CLUSTERS ON EC2

Infrastructure as a Service (IaaS)

HPC performance applications on Virtual Clusters

OpenNebula Leading Innovation in Cloud Computing Management

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

PoS(EGICF12-EMITC2)005

Cloud Models and Platforms

OpenNebula The Open Source Solution for Data Center Virtualization

Cloud Computing: Amazon Web Services

Enabling Technologies for Cloud Computing

Cloud Computing from an Institutional Perspective

Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad

Deployment of Private, Hybrid & Public Clouds with OpenNebula

Open Source Cloud Computing Management with OpenNebula

DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING. Carlos de Alfonso Andrés García Vicente Hernández

Managing a Tier-2 Computer Centre with a Private Cloud Infrastructure

IPOP-TinCan: User-defined IP-over-P2P Virtual Private Networks

Transcription:

Nimbus: Open Source Infrastructure-as-a-Service Cloud Computing Software Workshop on adapting applications and computing services to multi-core and virtualization CERN, June 2009 Kate Keahey keahey@mcs.anl.gov Nimbus project lead University of Chicago Argonne National Laboratory

Cloud Computing for Science Complex environments Need for control

Workspaces Dynamically provisioned environments Environment control Resource control Implementations Via leasing hardware platforms: reimaging, configuration management, dynamic Isolation accounts Via virtualization: VM deployment

A Brief History of Nimbus First STAR production run on EC2 Xen released EC2 goes online Nimbus Cloud comes online 2003 2006 2009 A Case for Grid Computing on VMs In-Vigo, VIOLIN, DVEs, Dynamic accounts Policy-driven negotiation First WSRF Workspace Service release EC2 gateway available Context Broker release Support for EC2 interfaces

Nimbus Goals Allow providers to build clouds Private clouds (privacy, expense considerations) Workspace Service: open source EC2 implementation Allow users to use cloud computing Do whatever it takes to enable scientists to use IaaS Context Broker: turnkey virtual clusters IaaS Gateway: interoperability Allow developers to experiment with Nimbus For research or usability/performance improvements Community extensions and contributions

The Workspace Service VWS Service

The Workspace Service The service publishes information about each Users can find out information about their (e.g. what IP the was bound to) VWS Service Users can interact directly with their s the same way the would with a physical machine.

Workspace Service: Interfaces and Clients Web Services based Web Service Resource Framework (WSRF) WS + state management (WS-Notification) Elastic Computing Cloud (EC2) Compatible with EC2 clients Supported: ec2-describe-images, ec2-run-instances, ec2- describe-instances, ec2-terminate-instances, ec2-rebootinstances, ec2-add-keypair, ec2-delete-keypair Unsupported: availability zones, security groups, elastic IP assignment, REST Protocol adapter, moving towards messaging

Workspace Service: Security GSI authentication and authorization PKI-based VOMS, Shibboleth (via GridShib), custom PDPs Secure access to VMs EC2 key generation or accessed from.ssh Validating images and image data Extensions from Vienna University of Technology Paper: Descher et al., Retaining Data Control in Infrastructure Clouds, ARES (the International Dependability Conference), 2009.

Workspace Service: Networking Network configuration External: public IPs or private IPs (via VPN) Internal: private network via a local cluster network Each VM can specify multiple NICs mixing private and public networks (WSRF only) E.g., cluster worker s on a private network, head on both public and private network

Workspace Components EC2 WSRF service resource manager pilot control OpenNebula Project client commercial providers See papers at: http://.globus.org/papers/index.html Simple Leases with Workspace Pilot (EuroPar08) Combining Batch Execution and Leasing Using Virtual Machines (HPDC08),

Cloud Capabilities EC2 WSRF storage service service resource manager pilot control cloud client client

Cloud Computing Ecosystem Appliance Providers Marketplaces, commercial providers, Virtual Organizations Appliance management software Deployment Orchestrator VMM/DataCenter/IaaS User Environments VMM/DataCenter/IaaS User Environments

Turnkey Virtual Clusters IP1 HK1 IP2 HK2 IP3 HK3 IP1 HK1 IP1 HK1 IP1 HK1 IP2 HK2 IP2 MPI HK2 IP2 HK2 IP3 HK3 IP3 HK3 IP3 HK3 Context Broker Turnkey, tightly-coupled cluster Shared trust/security context Shared configuration/context information

Context Broker Status Releases In alpha since 08/07, first release 06/08, update 01/09 Used to contextualize cluster composed of 100s of virtual s for multiple production apps Contextualized images on marketplace Working with rpath to make contextualization easier for the user Paper: Keahey&Freeman, Contextualization: Providing One-Click Virtual Clusters, escience 2008

End of Nimbus Tour context broker EC2 WSRF storage service service resource manager pilot control context client cloud client client commercial providers

Nimbus: Extensions and Collaborations Nimbus core team: UC/ANL: Kate Keahey, Tim Freeman, David LaBissoniere UVIC: Ian Gable & team (Nimbus monitoring) Cumulus: Raj Kettimuthu and John Bresnahan (ANL/UC) EBS: Marlon Pierce, Xiaoming Gao, Mike Lowe (IU) ViNe: Mauricio Tsugawa, Jose Fortes (UFL) Others: OpenNebula project (University of Madrid) Descher et al (Technical U of Vienna): privacy extensions Congratulations to the EBS team for winning best poster award at TG09!

Science Clouds Goals Enable experimentation with IaaS Evolve software in response to user needs Exploration of cloud interoperability issues Participants University of Chicago (since 03/08, 16 s), University of Florida (05/08, 16-32 s, access via VPN), Masaryk University, Brno, Czech Republic (08/08), Wispy @ Purdue (09/08) In progress: Grid5K, IU, Vrije Using EC2 for large runs Science Clouds Marketplace: OSG cluster, Hadoop, etc. Come and run: http://.globus.org/clouds

Who Runs on Nimbus? 100+ DNs projects ranging across Science, CS, education, build&test

STAR experiment STAR: a nuclear physics experiment at Brookhaven National Laboratory Studies fundamental properties of nuclear matter Problem: computations require complex and consistently configured environments that are hard to find in existing grids

STAR Virtual Clusters Virtual resources A virtual OSG STAR cluster: OSG head (gridmapfiles, host certificates, NFS, Torque), worker s: SL4 + STAR One-click virtual cluster deployment via Nimbus Context Broker From Science Clouds to EC2 runs Running production codes since 2007 Work by Jerome Lauret, Leve Hajdu, Lidia Didenko (BNL), Doug Olson (LBNL) The Quark Matter run: producing just-in-time results for a conference: http://www.isgtw.org/?pid=1001735

STAR Quark Matter Run Gateway/ Context Broker Infrastructure-as-a-Service

STAR Quark Matter Run (2) Application stats: Processed 1.2 M events Moved ~1TB of data over duration (small I/O needs) Run facts: 300+ s, total of ~36,000 hours Instances, 32-bit, 1.7 GB memory: Cost: EC2 default: 1 EC2 CPU unit High-CPU Medium Instances: 5 EC2 CPU units (2 cores) Comp: ~ $6,000: ~ $1,7 K (default) + ~ $3,9K (medium) Data: ~ $150

Elastic Provisioning for AliEn Challenge: integrate elastic computing into existing infrastructure for the ALICE experiment Collaboration with CernVM project Work by Artem Harutyunyan

Elastic Provisioning for AliEn Information about jobs and resources Deployment decision (policy) Context Broker AliEn Infrastructure-as-a-Service

Elastic Provisioning for AliEn CHEP09 paper, Harutyunyan et al. Similar ongoing work on extensions to GANGA (ATLAS) and extensions to local schedulers

Sky Computing Environment U of Chicago U of Florida ViNE router ViNE router ViNE router Purdue Work by A. Matsunaga, M. Tsugawa, University of Florida

Hadoop in the Science Clouds U of Chicago U of Florida Hadoop cloud Purdue Papers: Sky Computing, by K. Keahey, A. Matsunaga, M. Tsugawa, J. Fortes. Submitted to IEEE Internet Computing. CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications by A. Matsunaga, M. Tsugawa and J. Fortes. escience 2008.

Parting Thoughts IaaS cloud computing is science-driven Scientific applications are successfully using the existing infrastructure for production runs Many more could be using it, but challenges exist Project for the next few years: solve them!