Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus



Similar documents
Performance measurement of a private Cloud in the OpenCirrus Testbed

Assembling Cloud Infrastructures with Eucalyptus

Building a Private Cloud with Eucalyptus

Open Cirrus: Towards an Open Source Cloud Stack

Private Clouds with Open Source

S06: Open-Source Stack for Cloud Computing

Fundamentals of Cloud Computing

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad

Neptune. A Domain Specific Language for Deploying HPC Software on Cloud Platforms. Chris Bunch Navraj Chohan Chandra Krintz Khawaja Shams

Building Out Your Cloud-Ready Solutions. Clark D. Richey, Jr., Principal Technologist, DoD

How To Compare Cloud Computing To Cloud Platforms And Cloud Computing

The KOALA Cloud Management Service

Analysis and Research of Cloud Computing System to Comparison of Several Cloud Computing Platforms

wu.cloud: Insights Gained from Operating a Private Cloud System

Efficient Cloud Management for Parallel Data Processing In Private Cloud

Comparison of Open Source Cloud System for Small and Medium Sized Enterprises

9/26/2011. What is Virtualization? What are the different types of virtualization.

Cloud computing - Architecting in the cloud

Variations in Performance and Scalability when Migrating n-tier Applications to Different Clouds

Sistemi Operativi e Reti. Cloud Computing

From Internet Data Centers to Data Centers in the Cloud

Cloud Computing Now and the Future Development of the IaaS

Emerging Technology for the Next Decade

opening the clouds qualitative overview of the state-of-the-art open source cloud management platforms. ACM, middleware 2009 conference

Introduction to Cloud Computing

2) Xen Hypervisor 3) UEC

Cloud Computing: a Perspective Study

CLOUD COMPUTING. When It's smarter to rent than to buy

Cloud Computing Submitted By : Fahim Ilyas ( ) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

Business applications:

Cloud Computing An Introduction

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc.

Cloud Computing: Making the right choices

Enabling Technologies for Distributed and Cloud Computing

Amazon EC2 XenApp Scalability Analysis

Grid Computing Vs. Cloud Computing

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc.

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Use of Hadoop File System for Nuclear Physics Analyses in STAR

Introduction to Cloud computing. Viet Tran

SLA Driven Load Balancing For Web Applications in Cloud Computing Environment

Cloud Computing Technology

StACC: St Andrews Cloud Computing Co laboratory. A Performance Comparison of Clouds. Amazon EC2 and Ubuntu Enterprise Cloud

LOGO Resource Management for Cloud Computing

Li Sheng. Nowadays, with the booming development of network-based computing, more and more

Modeling Public Pensions with Mathematica and Python II

I/O Virtualization Bottlenecks in Cloud Computing Today

LSKA 2010 Survey Report I Device Drivers & Cloud Computing

Data Centers and Cloud Computing

appscale: open-source platform-level cloud computing

What Is It? Business Architecture Research Challenges Bibliography. Cloud Computing. Research Challenges Overview. Carlos Eduardo Moreira dos Santos

Putchong Uthayopas, Kasetsart University

Final Project Proposal. CSCI.6500 Distributed Computing over the Internet

Enabling Technologies for Distributed Computing

FREE AND OPEN SOURCE SOFTWARE FOR CLOUD COMPUTING SERENA SPINOSO FULVIO VALENZA

Resource Scalability for Efficient Parallel Processing in Cloud

Auto-Scaling Model for Cloud Computing System

How To Understand Cloud Computing

Cluster, Grid, Cloud Concepts

SERVER 101 COMPUTE MEMORY DISK NETWORK

CHAPTER 8 CLOUD COMPUTING

CloudFTP: A free Storage Cloud

Lecture 02a Cloud Computing I

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

Mobile Cloud Computing T Open Source IaaS

CloudSim: A Toolkit for Modeling and Simulation of Cloud Computing Environments and Evaluation of Resource Provisioning Algorithms

Comparing Open Source Private Cloud (IaaS) Platforms

DISTRIBUTED SYSTEMS AND CLOUD COMPUTING. A Comparative Study

Comparison of Several Cloud Computing Platforms

Introduction to Cloud Computing

CS 695 Topics in Virtualization and Cloud Computing. Introduction

Computing Service Provision in P2P Clouds

Private Cloud in Educational Institutions: An Implementation using UEC

Performance Management for Cloudbased STC 2012

A Performance Analysis of Distributed Indexing using Terrier

Cornell University Center for Advanced Computing

CS 695 Topics in Virtualization and Cloud Computing and Storage Systems. Introduction

THE EUCALYPTUS OPEN-SOURCE PRIVATE CLOUD

How To Understand Cloud Computing

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Deploying Business Virtual Appliances on Open Source Cloud Computing

Architectural Implications of Cloud Computing

Infrastructure as a Service (IaaS)

Cloud Computing. Adam Barker

Cloud Computing Training

Cloud Infrastructure Pattern

Transcription:

Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus International Symposium on Grid Computing 2009 (Taipei) Christian Baun The cooperation of and Universität Karlsruhe (TH)

Agenda Definiton of Cloud-Computing Clouds vs. Grids Types of Cloud Services The OpenCirrus TM project Hadoop Eucalyptus AppScale 2 Christian Baun ISGC 2009 (Taipei) April 23th 2009

No Cloud Talk without Cloud Definitions Cloud Computing is on-demand access to virtualized IT resources that are sourced inside or outside of a data center, scalable, shared by others, simple to use, paid for via subscription or as you go and accessible over the web. Dr. Behrend Freese (Zimory GmbH) Cloud Services are the consumer and business products, services and solutions that are delivered and consumed in realtime over the internet. IDC - Analyze the Future A computing Cloud is a set of network enabled on demand IT services, scalable and QoS guaranteed, which could be accessed in a simple and pervasive way. Dr. Marcel Kunze (SCC/KIT) 3 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Clouds vs. Grids: A Comparison Objective Infrastructure Cloud Computing Provide desired computing platform via network enabled services One or few data centers, heterogeneous/homogeneous resource under central control, Industry and Business Grid Computing Resource sharing Job execution Geographically distributed, heterogeneous resource, no central control, VO Research and academic organization Application Suited for generic applications Special application domains like High Energy Physics Business Model Commercial: Pay-as-you-go Publicly funded: Use for free Middleware User interface Operational Model Proprietary, several reference implementations exist (e.g. Amazon) Easy to use/deploy, no complex user interface required Industrialization of IT Fully automated Services Well developed, maintained and documented Difficult use and deployment Need new user interface, e.g., commands, APIs, SDKs, services Mostly Manufacture Handcrafted Services QoS Possible Little support On-demand provisioning Yes No 4 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Three Major Types of Cloud Services SaaS: Provides enterprise quality software (complete applications) PaaS: Appears as one single large computer and makes it simple to scale from a single server to many No need to worry about the operating system or other foundational software IaaS: Abstracts away the hardware (servers, network, ) and allows to run virtual instances of servers without ever touching a piece of the hardware 5 Christian Baun ISGC 2009 (Taipei) April 23th 2009

OpenCirrus In the Press 6 Christian Baun ISGC 2009 (Taipei) April 23th 2009

OpenCirrus Cloud Computing Research Testbed An open, internet-scale global testbed for cloud computing research Data center management & cloud services Systems level research Application level research Structure: a loose federation Sponsors: HP Labs, Intel Research, Yahoo! Partners: University of Illinois at Urbana- Champaign (UIUC), Singapore Infocomm Development Authority (IDA), KIT Great opportunity for cloud R&D http://opencirrus.org 7 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Where are the OpenCirrus sites? Six sites initially: Sites distributed world-wide: HP Research, Yahoo!, UIUC, Intel Research Pittsburgh, KIT, Singapore IDA 1000-4000 processor cores per site KIT-Site available in Summer 2009 3300 Nehalem cores, 10TB memory, 192TB hard disk storage HP Yahoo! UIUC Intel KIT IDA 8 Christian Baun ISGC 2009 (Taipei) April 23th 2009

OpenCirrus TM - Physical Resource Sets (PRS) PRS service goals Provide mini-datacenters to researchers Isolate experiments from each other PRS service approach Allocate sets of physical co-located nodes, isolated inside VLANs using existing software Utah Emulab - Network Emulation Testbed HP Opsware - Server provisioning, configuration and management Start simple, add features as we go Basis to implement Virtual Resource Sets (VRS) Hardware as a Service (HaaS) 9 Christian Baun ISGC 2009 (Taipei) April 23th 2009

OpenCirrus TM - Virtual Resource Sets (VRS) Basic idea: Abstract from physical resources by the introduction of a virtualization layer Concept applies to all IT aspects: CPU, storage, networks and applications, Main advantages Implement IT services exactly fitting customers varying needs Deploy IT services on demand Automated resource management Easily guarantee service levels Live migration of services Reduce both: Capital Expenditures and Operational Expenditures Infrastructure as a Service (IaaS) Implement Compute and Storage Services De-facto standard: Amazon Web Services interface 10 Christian Baun ISGC 2009 (Taipei) April 23th 2009

OpenCirrus TM Blueprint 11 Christian Baun ISGC 2009 (Taipei) April 23th 2009

How is OpenCirrus different from other testbeds? OpenCirrus TM supports both system- and applicationlevel research n/a at Google/IBM and EC2/S3 OpenCirrus TM researchers will have complete access to the underlying hardware and software platform. OpenCirrus TM allows Intel platform features that support Cloud computing to be exposed, and exploited. e.g. Intel Data Center Management Interface (DCMI) Map-Reduce apps Hadoop Virtual machines Google/IBM cluster Map-Reduce apps Hadoop Cluster mgmt software Virtual or physical machines Open Cirrus cluster Can be modified by users Cannot be modified by users Cloud apps and services Can be modified by users 12 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Programming the Cloud: Hadoop http://hadoop.apache.org An open-source Java framework developed by the Apache Software Foundation and sponsored by Yahoo! http://wiki.apache.org/hadoop/projectdescription intent is to reproduce the proprietary software infrastructure developed by Google Provides a parallel programming model (MapReduce), a distributed file system (inspired by Google File System), and a parallel database http://code.google.com/edu/parallel/mapreduce-tutorial.html MapReduce is a software framework that supports distributed computing on large data sets. With MapReduce petabyte of data can be sorted in only a few hours 13 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Commercial Cloud Offerings (Small Excerpt) Problem: Commercial offers are proprietary and usually not open for Cloud systems research and development! 14 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Eucalyptus http://eucalyptus.cs.ucsb.edu Open-Source software infrastructure for implementing Cloud computing on clusters from UC Santa Barbara EUCALYPTUS - Elastic Utility Computing Architecture for Linking Your Programs To Useful Systems Implements Infrastructure as a Service (IaaS) gives the user the ability to run and control entire virtual machine instances (Xen, KVM) deployed across a variety of physical resources Interface compatible with Amazon EC2 Includes Walrus, a basic implementation of Amazon S3 interface Potential to interact with the same tools, known to work with Amazon EC2 and S3 Eucalyptus is an important step to establish an open Cloud computing infrastructure standard 15 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Eucalyptus http://eucalyptus.cs.ucsb.edu Schedules the distribution of virtual machines to the NC. Collects (free) resource information. Collects resource information from the CC. Operates like a meta-scheduler in the Cloud. Runs on every node in the Cloud. Xen- Hypervisor running. Provides Information about free resources to the CC. Source: R.Wolski 16 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Eucalyptus R&D Cloud Installation at SCC/KIT R&D Cloud I 2x IBM Blade LS20 Dual Core Opteron (2,4GHz) 4GB RAM 2x IBM Blade HS21 Dual Core Xeon (2,33GHz) 16GB RAM R&D Cloud II 5x HP Blade ProLiant BL2x220c Each Blade: 2 Server Nodes 2x Intel Quad-Core Xeon (2,33GHz) 16GB RAM OpenCirrus site at KIT in summer 2009 17 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Comparing Storage Performance between S3 and Eucalyptus WOW!? Sequential Output Per-Character: file is written using putc() Block: file is written using write() Rewrite: read() and write() 18 Christian Baun ISGC 2009 (Taipei) April 23th 2009 Sequential Input Per-Character: file is read using getc() Blockwise: file is written using read()

Realistic values The RAM of the Eucalyptus Node Controller was reduced to overcome memory caching The storage performance of Eucalyptus depends on the available storage sub-system Write performance of Eucalyptus is faster. Because of the close distance?! 19 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Performance of Random Seeks and File Creation Random seeks and file creation with Eucalyptus is faster Because of the close distance?! 20 Christian Baun ISGC 2009 (Taipei) April 23th 2009

AppScale http://appscale.cs.ucsb.edu Open-source implementation of the Google AppEngine Cloud computing interface from UC Santa Barbara AppScale executes automatically and transparently over Cloud infrastructures such as Eucalyptus, the open-source implementation of the Amazon Web Services interfaces AppScale provides a Platform-as-a-Service (PaaS) Cloud infrastructure that allows users to deploy, test, debug, measure, and monitor Google AppEngine applications prior to deployment on Google's proprietary resources Source: Navraj Chohan 21 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Plans for the Future CernVM Integration of CernVM Virtual Software Appliance from CERN Offers demand-driven and user friendly creation of virtual machines for various operating systems and applications Improvements in Usability Customization of popular EC2/S3 tools for using with Eucalyptus e.g. ElasticFox, S3Fox, ElasticDrive, S3tools Transferring Grid services into the Cloud g-eclipse User-friendly graphical client for dealing with Grids: glite, GRIA, GT2, GT4 Supports Cloud Infrastructures (S3, EC2) Has to be adapted for Eucalyptus http://www.geclipse.eu 22 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Summary Cloud computing is the next big thing Flexible and elastic resource provisioning Economy of scale makes it attractive Move from manufacture towards industrialization of IT (Everything as a Service) OpenCirrus TM offers interesting R&D opportunities Cloud systems and application development Accepting research proposals soon 23 Christian Baun ISGC 2009 (Taipei) April 23th 2009

Thank you for your attention 24 Christian Baun ISGC 2009 (Taipei) April 23th 2009