OSG All Hands Meeting

Similar documents
Cloud Computing with Nimbus

Nimbus: Cloud Computing with Science

Infrastructure-as-a-Service Cloud Computing for Science

How To Build A Cloud Computing System With Nimbus

Cloud Computing for Science

a Cloud Computing for Science

Cloud Computing with Nimbus

SC12 Cloud Compu,ng for Science Tutorial: Introduc,on to Infrastructure Clouds

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Kate Keahey and Tim Freeman

Evalua&ng Streaming Strategies for Event Processing across Infrastructure Clouds (joint work)

Infrastructure Outsourcing in Mul2- Cloud Environment

salsadpi: a dynamic provisioning interface for IaaS cloud

Infrastructure Clouds for Science and Education: Platform Tools

SC12 Cloud Compu,ng for Science Tutorial: Cloud Challenges

The idea of using remote resources. Sky Computing

Dynamic Resource Distribution Across Clouds

Deploying Business Virtual Appliances on Open Source Cloud Computing

Efficient Cloud Management for Parallel Data Processing In Private Cloud

Efficient Data Management Support for Virtualized Service Providers

Pierre Riteau University of Chicago

Clouds for research computing

Plug-and-play Virtual Appliance Clusters Running Hadoop. Dr. Renato Figueiredo ACIS Lab - University of Florida

Elastic Site. Using Clouds to Elastically Extend Site Resources

Sky Computing: When Multiple Clouds Become One

Adding IaaS Clouds to the ATLAS Computing Grid

Comparative Study of Eucalyptus, Open Stack and Nimbus

Automating Application Deployment in Infrastructure Clouds

Dynamic Extension of a Virtualized Cluster by using Cloud Resources CHEP 2012

DEVELOPMENT OF TWO MODELS OF DYNAMIC INTEGRATION OF CLOUD COMPUTING RESOURCES WITH ALIEN GRID

Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice

Infrastructure Outsourcing in Multi-Cloud Environment

Rebalancing in a Multi-Cloud Environment

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Simulation and user analysis of BaBar data in a distributed cloud

Towards a New Model for the Infrastructure Grid

Modalis. Des grilles aux clouds : Hélène Renard, I3S - UMR 6070 UNSA / CNRS - nouveaux problèmes, nouvelles solutions

Solution for private cloud computing

Supporting Cloud Computing with the Virtual Block Store System

Amazon EC2 XenApp Scalability Analysis

Virtual Machine Management with OpenNebula in the RESERVOIR project

Capacity Leasing in Cloud Systems using the OpenNebula Engine

On the Use of Cloud Computing for Scientific Workflows

Distributed IaaS Clouds and 100G Networking for HEP applications

Exploiting Virtualization and Cloud Computing in ATLAS

The Application of Cloud Computing to Scientific Workflows: A Study of Cost and Performance

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform

Integrating Clouds and Cyberinfrastructure: Research Challenges

A Service for Data-Intensive Computations on Virtual Clusters

Dynamic Provisioning of Virtual Clusters for Grid Computing

Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus

Evaluation of Nagios for Real-time Cloud Virtual Machine Monitoring

4 4 # #6 ) 5 #

Auto-Scaling Model for Cloud Computing System

Développement logiciel pour le Cloud (TLC)

ECE6130 Grid and Cloud Computing

SURFsara HPC Cloud Workshop

Enabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014

Research computing in a distributed cloud environment

PoS(EGICF12-EMITC2)005

Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

An Efficient Use of Virtualization in Grid/Cloud Environments. Supervised by: Elisa Heymann Miquel A. Senar

CSE 599c Scientific Data Management. Magdalena Balazinska and Bill Howe Spring 2010 Lecture 3 Science in the Cloud

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

Environments, Services and Network Management for Green Clouds

CONDOR CLUSTERS ON EC2

Development of Bio-Cloud Service for Genomic Analysis Based on Virtual

POSIX and Object Distributed Storage Systems

Cloud Computing. Adam Barker

A batch system for HEP applications on a distributed IaaS cloud

Building a Volunteer Cloud

VM Management for Green Data Centres with the OpenNebula Virtual Infrastructure Engine

Introduction to Cloud Computing

Towards Model-Defined Cloud of Clouds

SURFsara HPC Cloud Workshop

How To Create A Cloud Based System For Aaas (Networking)

Improving MapReduce Performance in Heterogeneous Environments

FREE AND OPEN SOURCE SOFTWARE FOR CLOUD COMPUTING SERENA SPINOSO FULVIO VALENZA


The application of cloud computing to scientific workflows: a study of cost and performance

Repoman: A Simple RESTful X.509 Virtual Machine Image Repository. Roger Impey

OpenNebula An Innovative Open Source Toolkit for Building Cloud Solutions

Dutch HPC Cloud: flexible HPC for high productivity in science & business

Solution for private cloud computing

Data Semantics Aware Cloud for High Performance Analytics

Experiences with Eucalyptus: Deploying an Open Source Cloud

Enabling Execution of Service Workflows in Grid/Cloud Hybrid Systems

A Very Brief Introduction To Cloud Computing. Jens Vöckler, Gideon Juve, Ewa Deelman, G. Bruce Berriman

Cloud Computing and E-Commerce

A Cost-Evaluation of MapReduce Applications in the Cloud

Data Centers and Cloud Computing

Secure Hybrid Cloud Infrastructure for Scien5fic Applica5ons

StACC: St Andrews Cloud Computing Co laboratory. A Performance Comparison of Clouds. Amazon EC2 and Ubuntu Enterprise Cloud

Building a Private Cloud with Eucalyptus

Administrative Issues

CHAPTER 8 CLOUD COMPUTING

Assembling Cloud Infrastructures with Eucalyptus

Data intensive high energy physics analysis in a distributed cloud

Introduction to Cloud computing. Viet Tran

Comparison of Multiple Cloud Frameworks

Transcription:

Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey keahey@mcs.anl.gov Nimbus Project University of Chicago Argonne National Laboratory

Nimbus: Cloud Computing for Science Allow providers to build clouds Workspace Service: a service providing EC2-like functionality WSRF-style and EC2-style (both WS and REST) interfaces Support for Xen and KVM Allow users to use cloud computing Do whatever it takes to enable scientists to use IaaS Context Broker: turnkey virtual clusters Currently investigating scaling tools Allow developers to experiment with Nimbus For research or usability/performance improvements Open source, extensible software Community extensions and contributions: UVIC (monitoring), IU (EBS, research), Technical University of Vienna (privacy, research)

The Workspace Service VWS Service

The Workspace Service The workspace service publishes information about each workspace Users can find out information about their workspace (e.g. what IP the workspace was bound to) VWS Service Users can interact directly with their workspaces the same way the would with a physical machine.

Turnkey Virtual Clusters Turnkey, tightly-coupled cluster Shared trust/security context Shared configuration/context information Context Broker goals Every appliance Every cloud provider Multiple distributed cloud providers Used to contextualize 100s of virtual s for EC2 HEP STAR runs, Hadoop s, HEP Alice s Working with rpath on developing appliances, standardization

Nimbus CloudKit Nimbus: Science Clouds: www.scienceclouds.org GitHub for code management Latest release is 2.4 (02/02/10)

Science in the Clouds: Update on Applications Working with Nimbus

STAR experiment Work by Jerome Lauret, Leve Hajdu, Lidia Didenko (BNL), Doug Olson (LBNL) STAR: a nuclear physics experiment at Brookhaven National Laboratory Studies fundamental properties of nuclear matter Problems: Complexity Consistency Availability

STAR Virtual Clusters Virtual resources A virtual OSG STAR cluster: OSG head (gridmapfiles, host certificates, NFS, Torque), worker s: SL4 + STAR One-click virtual cluster deployment via Nimbus Context Broker From Science Clouds to EC2 runs Running production codes since 2007 The Quark Matter run: producing just-in-time results for a conference: http://www.isgtw.org/?pid=1001735

Priceless? Compute costs: $ 5,630.30 300+ s over ~10 days, Instances, 32-bit, 1.7 GB memory: EC2 default: 1 EC2 CPU unit High-CPU Medium Instances: 5 EC2 CPU units (2 cores) ~36,000 compute hours total Data transfer costs: $ 136.38 Small I/O needs : moved <1TB of data over duration Storage costs: $ 4.69 Images only, all data transferred at run-time Producing the result before the deadline $ 5,771.37

Modeling the Progression of Epidemics Work by Ron Price and others, Public Health Informatics, University of Utah Can we use clouds to acquire on-demand resources for modeling the progression of epidemics? What is the efficiency of simulations in the cloud? Compare execution on: a physical machine 10 VMs on the cloud The Nimbus cloud only 2.5 hrs versus 17 minutes Speedup = 8.81 9 times faster

Ocean Observatory Initiative - Highly Available Services - Rapidly provision resources - Scale to demand 12

OOI Architecture VM VM VM (Deployable Unit) (Deployable Unit) Application Software EPU Worker EPU Worker EPU HA Service (OOI Application) EPU Worker 13

Elastically Provisioned Resources CHEP 2009 paper, Harutyunyan et al., Collab. with CernVM CCGrid 2010 paper, Marshall et al., Elastic Sites

Hadoop in the Science Clouds U of Chicago U of Florida Hadoop cloud Purdue Papers: CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications by A. Matsunaga, M. Tsugawa and J. Fortes. escience 2008. Sky Computing, by K. Keahey, A. Matsunaga, M. Tsugawa, J. Fortes, to appear in IEEE Internet Computing, September 2009

Keeping the Blood Flowing Work by Karnidakis&Sherwin and Dong&Karonis (Brown, Imperial College, NIU) Arterial Tree Models The NECTAR project Simula9ng the blood flow in ALL major arteries of the human. Running across TeraGrid Inves9ga9ng cloud compu9ng Environments Network Efficiency

Nimbus for FutureGrid FutureGrid: experimental testbed Infrastructure- as- a- Service for FutureGrid Dynamic cloud deployment Improved accoun9ng, administra9on, user experience Con9nue to provide IaaS cycles for science Encourage new and experimental uses of the infrastructure Con9nue reaching out to scien9sts in the US and collaborators abroad Develop new capabili9es Automate building of sky compu9ng scenarios Provide addi9onal tools on the cloud federa9on level

Nimbus: Friends and Family Nimbus core team: UC/ANL: Kate Keahey, Tim Freeman, David LaBissoniere, John Bresnahan UVIC: Ian Gable & team: Patrick Armstrong, Adam Bishop, Mike Paterson, Duncan Penfold-Brown UCSD: Alex Clemesha Contributors: http:///about/people/ Other efforts: ViNe: Mauricio Tsugawa, Jose Fortes (UFL)