Cloud Design and Implementation. Cheng Li MPI-SWS Nov 9 th, 2010



Similar documents
LSKA 2010 Survey Report I Device Drivers & Cloud Computing

Cloud Computing in Distributed System

Cloud Computing Trends

THE EUCALYPTUS OPEN-SOURCE PRIVATE CLOUD

FREE AND OPEN SOURCE SOFTWARE FOR CLOUD COMPUTING SERENA SPINOSO FULVIO VALENZA

FleSSR Project: Installing Eucalyptus Open Source Cloud Solution at Oxford e- Research Centre

2) Xen Hypervisor 3) UEC

Amazon EC2 Product Details Page 1 of 5

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

Sistemi Operativi e Reti. Cloud Computing

Putchong Uthayopas, Kasetsart University

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Network Infrastructure Services CS848 Project

What Is It? Business Architecture Research Challenges Bibliography. Cloud Computing. Research Challenges Overview. Carlos Eduardo Moreira dos Santos

High Performance Computing Cloud Computing. Dr. Rami YARED

Cloud Computing and Amazon Web Services

Public Cloud Offerings and Private Cloud Options. Week 2 Lecture 4. M. Ali Babar

Unit 10b: Introduction to Cloud Computing

Public Clouds. Krishnan Subramanian Analyst & Researcher Krishworld.com. A whitepaper sponsored by Trend Micro Inc.

References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline

Introduction to Database Systems CSE 444

Cloud Computing Now and the Future Development of the IaaS

Network performance in virtual infrastructures

Cloud Infrastructure Pattern

CLOUD COMPUTING. When It's smarter to rent than to buy

Scalable Architecture on Amazon AWS Cloud

Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad

Data Centers and Cloud Computing. Data Centers

Efficient Cloud Management for Parallel Data Processing In Private Cloud

PRIVACY PRESERVATION ALGORITHM USING EFFECTIVE DATA LOOKUP ORGANIZATION FOR STORAGE CLOUDS

What is Cloud Computing? Why call it Cloud Computing?

DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING. Carlos de Alfonso Andrés García Vicente Hernández

A STUDY ON CLOUD COMPUTING IN DISTRIBUTED SYSTEMS

How To Understand Cloud Computing

International Journal of Engineering Research & Management Technology

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

How To Understand Cloud Computing

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc.

Cloud Models and Platforms

Cloud Performance Considerations

Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

Chapter 19 Cloud Computing for Multimedia Services

BlobSeer: Towards efficient data storage management on large-scale, distributed systems

Open Cirrus: Towards an Open Source Cloud Stack

Introduction to OpenStack

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

Infrastructure as a Service (IaaS)

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts

Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers

Lecture 02a Cloud Computing I

When talking about hosting

Last time. Data Center as a Computer. Today. Data Center Construction (and management)

Cloud Computing and Amazon Web Services. CJUG March, 2009 Tom Malaher

Technical Overview Simple, Scalable, Object Storage Software

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings

RIDE THE SDN AND CLOUD WAVE WITH CONTRAIL

Distribution transparency. Degree of transparency. Openness of distributed systems

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

ACCESS DEFINED CLOUD BASED MULTIVARIANT RESPONSE SYSTEM WITH CACHING

CLOUD DATABASE DATABASE AS A SERVICE

ECE6130 Grid and Cloud Computing

Future of Cloud Computing. Irena Bojanova, Ph.D. UMUC, NIST

Scaling in the Cloud with AWS. By: Eli White (CTO & mojolive) eliw.com - mojolive.com

Deploying complex applications to Google Cloud. Olia Kerzhner

SURFsara HPC Cloud Workshop

Overview: X5 Generation Database Machines

Cloud Computing. Lecture 24 Cloud Platform Comparison

Li Sheng. Nowadays, with the booming development of network-based computing, more and more

A Brief Overview. Delivering Windows Azure Services on Windows Server. Enabling Service Providers

Software-Defined Networks Powered by VellOS

SURFsara HPC Cloud Workshop

Hypertable Architecture Overview

AN IMPLEMENTATION OF E- LEARNING SYSTEM IN PRIVATE CLOUD

CS 6343: CLOUD COMPUTING Term Project

Permanent Link:

SLA Driven Load Balancing For Web Applications in Cloud Computing Environment

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloud Computing Overview

A Cost-Evaluation of MapReduce Applications in the Cloud

Service Catalogue. virtual services, real results

Ch. 4 - Topics of Discussion

Comparative Study of Eucalyptus, Open Stack and Nimbus

Private Distributed Cloud Deployment in a Limited Networking Environment

A Survey on Cloud Storage Systems

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

The Panoptix Building Efficiency Solution: Ensuring a Secure Delivery of Building Efficiency

Improving MapReduce Performance in Heterogeneous Environments

Today: Data Centers & Cloud Computing" Data Centers"

Session 3. the Cloud Stack, SaaS, PaaS, IaaS

Cloud Computing Architectures and Design Issues

Cloud Computing Architecture: A Survey

Alfresco Enterprise on AWS: Reference Architecture

How Solace Message Routers Reduce the Cost of IT Infrastructure

Above the clouds: A Berkeley View of Cloud Computing

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14

Networking Issues For Big Data

Transcription:

Cloud Design and Implementation Cheng Li MPI-SWS Nov 9 th, 2010 1

Modern Computing CPU, Mem, Disk Academic computation Chemistry, Biology Large Data Set Analysis Online service Shopping Website Collaborative jobs Google Docs Online Games 2

Resource Demands Online Service Workload The resource demands are changing significantly over time. 3

Resource Demands Con t Academic Computation Resource Usage The resource demands are not constant, instead periodical and diverse. 4

Resource Provision Keep their own systems High investments Hardware & software Significant maintenance cost Power & Cooling Technical supports Upgrading Hard to adapt to elastic demands Scale up or down Unnecessary to keep it always on 5

Resource Provision Con t Ideal Economic Model Elastic Provision Affordable Price Robust Easy to use Buy whenever you want Pay as you go? 6

Resource Provision Con t Ideal Economic Model Elastic Provision Affordable Price Robust Easy to use Buy whenever you want Pay as you go Cloud Providers dedicating to offer computing resources 7

Outline Motivation Introduction to Cloud Computing Design & Challenges Implementation Eucalyptus Yahoo! Open Questions & Discussion Conclusion 8

Cloud Computing The applications delivered as services over the Internet e.g., Google search, Docs, Amazon Web Services The hardware and systems software in the datacenters that provide those services e.g., Network, disk, memory, CPU e.g., Platforms 9

Cloud Computing Con t Cloud Customer Cloud Provider Request Submission Resource Assignment Offering services Cloud Infrastructures 10

Amazon Cloud Service Elastic Compute Cloud (EC2) Standard Instances Small (Default) Large Extra Large Linux/UNIX Usage $0.085 per hour $0.34 per hour $0.68 per hour Windows Usage $0.12 per hour $0.48 per hour $0.96 per hour Simple Storage Service (S3) Tier Price Tier Price First 1 TB / month of Storage Used $0.140 per GB Next 49 TB / month of Storage Used $0.125 per GB Next 450 TB / month of Storage Used $0.110 per GB Next 500 TB / month of Storage Used $0.095 per GB 11

Outline Motivation Introduction to Cloud Computing Design & Challenges Implementation Eucalyptus Yahoo! Open Questions & Discussion Conclusion 12

Design Goals Multi-tenancy Hardware and software shared Isolation problem Scalability Precisely estimates demands Quickly adapts to changes Scale up or down Cloud Growth 13

Design Goals Con t Load and Tenant Balancing Almost equal load distribution Avoid overload Take advantage of any available resource Availability Failures are common Outage leads to loss of money Robust to tolerate failures 14

Design Goals Con t Easy-of-use Low admin efforts Simple Interface (API) Transparency (Challenge) User friendly Close to user (e.g., geo-located datacenters) Short instructions Low latency 15

Modules of Cloud External Interface User & Admin access to their systems Resource manager Allocate as required Free & Clean up Operation node Computing unit Storage Node Dedicated to store data 16

Modules of Cloud Monitor Inspect the whole system Handle node failures Network Layer Interconnectivity in one service Isolation between services Load Balancer Security Guarantee Access Control Malicious attack 17

Modules of Cloud Con t 18

Outline Motivation Introduction to Cloud Computing Design & Challenges Implementation Eucalyptus Yahoo! Open Questions & Discussion Conclusion 19

Eucalyptus An open-source cloud system design 20

Node Controller 21

Node Controller Hosted at each physical node Serve upcoming requests Functionalities Tasks Create a Virtual Machine Instance (VM) for services Terminate VM if required Inspect VM Discover physical resources (cores, memory) API runinstance() terminateinstance() describeinstance() describeresource() 22

Cluster Controller Cluster definition 23

Cluster Controller Manage a set of NCs Functionalities similar to NC, but plural runinstances terminateinstances describeinstances 24

Network Layer Network = Physical network + Virtual network 25

Virtual Network Overlay Named network sets Service owns its own network Network contains a set of VM instances Network among VM instances VMs interconnectivity Exchange or share information Isolation Avoid interferences between two networks Performance Close to Physical connections 26

Virtual Interfaces VM interface connects to a named network Disjoint sets of VMs belong to different network - isolation 27

Communication across networks Cluster Controller acts as a router 28

Storage Controller (Walrus) Users Stream data into/out of the cloud directly via Walrus Manipulate data through communications between VM instances and Walrus Cloud System Host root file system Store VM images for initialization 29

Cloud Controller 30

Cloud Controller Con t Super administrator managing all states in the cloud, and exposing users: Interface to control their VMs Assure persistent meta data System data: VM images User data: password Resource service process High-level commands to allocate or free 31

Cloud Controller Con t 32

Cloud Controller Con t 33

Cloud Controller Con t 34

User Interfaces Similar to EC2 & S3 Simple Object Access Protocol (SOAP) Use XML to express actions Query API Only GET and POST Embedded expressive queries with Parameters Representational State Transfer (REST) Based on HTTP protocol GET, DELETE, PUT, POST 35

Outline Motivation Introduction to Cloud Computing Design & Challenges Implementation Eucalyptus Yahoo! Open Questions & Discussion Conclusion 36

Yahoo! Private Cloud CPU-intensive Data-intensive 37

Yahoo! Private Cloud Con t Core Service Batch process jobs Operational Storage Resource Provisioning Messaging Broker Pub/Sub message pattern Edge services Decomposition & aggregation Routing Caching 38

Yahoo! Private Cloud Apps Hadoop/MapReduce Word count Page ranking Inverted index Sherpa Massive data store Schema-less data structure Widely used by Yahoo! Web services 39

Outline Motivation Introduction to Cloud Computing Design & Challenges Implementation Eucalyptus Yahoo! Open Questions & Discussion Conclusion 40

Open Questions: Automated Service Provisioning Predict future demands Avoid outage caused by peak load [INRIA, Tech Report 10] Scalability Easy to add more resources Energy saving Smart design [Bruce Maggs, SIGCOMM 09] Traffic management [James Willett, Invitation talk 09] 41

Open Questions Con t: Quality of Service (QoS) Meets SLA for more applications [QoS-Aware Clouds, Cloud Futures 10] Security Malicious attacks Blog: http://cloudsecurity.trendmicro.com/ Transparency User Privacy Data Confidentiality and Integrity [SPORC, OSDI 10] 42

Outline Motivation Introduction to Cloud Computing Design & Challenges Implementation Eucalyptus Yahoo! Open Questions & Discussion Conclusion 43

Conclusion Why we need Cloud Computing? What is Cloud Computing? What are the components? How to build a Cloud Computing System? What questions remain? 44

References [1] M. et al., Above the clouds: A Berkeley View of Cloud Computing. Technical Report, UC Berkeley, 2009. [2] D. Nurmi et al., The Eucalyptus Open-source Cloud-computing System. In IEEE CCGRID 2009. [3] Brian F. Cooper et al., Building a Cloud for Yahoo!. In IEEE ICDE 2009. [4] Robert L. Grossman et al., On the Varieties of Clouds for Data Intensive Computing, In IEEE ICDE 2009 [5] James Murty, Programming Amazon Web Services: S3, EC2, SQS, FPS, and SimpleDB, O Reilly Media, 2008 [6] Qi Zhang et.al. Cloud Computing: state-of-the-art and research challenges, In Journal of Internet Services and Applications, 2010. 45

The end! Thanks! Questions? 46