Automated Control in a Cloud Computing Infrastructure



Similar documents
CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Data Management in the Cloud. Zhen Shi

Deployment Options for Microsoft Hyper-V Server

Cloud Computing. Chapter 1 Introducing Cloud Computing

Reference Model for Cloud Applications CONSIDERATIONS FOR SW VENDORS BUILDING A SAAS SOLUTION

Windows Server 2003 Migration Guide: Nutanix Webscale Converged Infrastructure Eases Migration

Cloud Computing Disaster Recovery (DR)

Storage and Disaster Recovery

Apache Hadoop. Alexandru Costan

Elastic Management of Cluster based Services in the Cloud

Fault-Tolerant Computer System Design ECE 695/CS 590. Putting it All Together

Preparing Your IT for the Holidays. A quick start guide to take your e-commerce to the Cloud

Cloud Computing. Chapter 1 Introducing Cloud Computing

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

Using OSGi as a Cloud Platform

ArcGIS for Server in the Amazon Cloud. Michele Lundeen Esri

COMPUTING SCIENCE. Designing a Distributed File-Processing System Using Amazon Web Services. Students of MSc SDIA TECHNICAL REPORT SERIES

Design for Failure High Availability Architectures using AWS

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

PaaS Cloud Migration Migration Process, Architecture Problems and Solutions. Claus Pahl and Huanhuan Xiong

Cloud and Virtualization to Support Grid Infrastructures

Xen Virtualization: Xen (source) and XenServer

Build Platform as a Service (PaaS) with SUSE Studio, WSO2 Middleware, and EC2 Chris Haddad

Technical. Overview. ~ a ~ irods version 4.x

EWeb: Highly Scalable Client Transparent Fault Tolerant System for Cloud based Web Applications

CA ARCserve Replication and High Availability Deployment Options for Hyper-V

Introduction to AWS Economics

Amazon Relational Database Service (RDS)

How Routine Data Center Operations Put Your HA/DR Plans at Risk

Today: Data Centers & Cloud Computing" Data Centers"

Moving to the Cloud. Sam Hornstein Jetline Jason Nokes President, Distributor Central Garrett Ausfeldt Starline

Amazon Compute - EC2 and Related Services

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

Cloud computing - Architecting in the cloud

ArcGIS for Server: In the Cloud

High Availability of VistA EHR in Cloud. ViSolve Inc. White Paper February

Protecting your SQL database with Hybrid Cloud Backup and Recovery. Session Code CL02

Cloud Computing An Introduction

Cloud Computing Architecture

The Private Cloud Your Controlled Access Infrastructure

HRG Assessment: Stratus everrun Enterprise

Data Centers and Cloud Computing. Data Centers

Administering a Microsoft SQL Server 2000 Database

CLEVER: a CLoud-Enabled Virtual EnviRonment

OVERVIEW Cloud Deployment Services

Everything You Need To Know About Cloud Computing

High Availability and Clustering

Managing and Maintaining Windows Server 2008 Servers

The Inside Scoop on Hadoop

CA API Management SaaS

Introduction to Database as a Service

Postgres Plus Cloud Database!

Cloud Service SLA Declaration

Microsoft s Advantages and Goals for Hyper-V for Server 2016

PipeCloud : Using Causality to Overcome Speed-of-Light Delays in Cloud-Based Disaster Recovery. Razvan Ghitulete Vrije Universiteit

DISASTER RECOVERY WITH AWS

Building Platform as a Service for Scientific Applications

Drupal in the Cloud. Scaling with Drupal and Amazon Web Services. Northern Virginia Drupal Meetup

Scalable Application. Mikalai Alimenkou

Infrastructure as a Service (IaaS)

Cloud Service Model. Selecting a cloud service model. Different cloud service models within the enterprise

AVLOR SERVER CLOUD RECOVERY

Building an AWS-Compatible Hybrid Cloud with OpenStack

Based on Geo Clustering for SUSE Linux Enterprise Server High Availability Extension

The Service, The Cloud & The Method: The Connection Points

Data Centers and Cloud Computing

Divy Agrawal and Amr El Abbadi Department of Computer Science University of California at Santa Barbara

THE FOUR STEP METHOD OF CLOUD SERVICE LEVEL AGREEMENTS

DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING. Carlos de Alfonso Andrés García Vicente Hernández

SCALABILITY AND AVAILABILITY

A Service for Data-Intensive Computations on Virtual Clusters

CiteSeer x in the Cloud

In Memory Accelerator for MongoDB

Mobile Admin Real-time Dashboard and Notification System

EMERGING CLOUD COMPUTING

Cloud Computing. Chapter 1 Introducing Cloud Computing

Tunebot in the Cloud. Arefin Huq 18 Mar 2010

Case Study. Migration to Cloud and SaaS. InVision Software AG, Germany. Enterprise-wide Workforce Management Solution

Providing Load Balancing and Fault Tolerance in the OSGi Service Platform

Technology Enablement

Solutions for Disaster Recovery Using Grass Valley Integrated Playout Systems

VM Management for Green Data Centres with the OpenNebula Virtual Infrastructure Engine

Ironfan Your Foundation for Flexible Big Data Infrastructure

Cloud.. Migration? Bursting? Orchestration? Vincent Lavergne SED EMEA, South Gary Newe Sr SEM EMEA, UKISA

The Pros and Cons of Erasure Coding & Replication vs. RAID in Next-Gen Storage Platforms. Abhijith Shenoy Engineer, Hedvig Inc.

Cloud Courses Description

Connect Converge / Converged Infrastructure

Transcription:

D u k e S y s t e m s Automated Control in a Cloud Computing Infrastructure Harold Lim Duke University

What is Cloud Computing? General Overview

What is Cloud Computing? 2 Varieties of cloud computing Software as a Service Infrastructure as a Service Clouds are an important (and different) context for autonomic control. Elastic Provisioning for variable load Performance Fault Tolerance Interactions among services/ modules/tiers Reflective Control

Motivation Consider an abstraction of a cloud computing infrastructure service. Two kinds of entities: providers and consumers (guests).

Motivation Opportunity for guests to monitor deployed application and modulate resources automatically (through slice controllers). Resource efficient (minimize cost) Acceptable quality of service

Motivation Automated control in a cloud computing infrastructure Resources are elastic, but still needs to decide how, where and when to use the resources. Can be different from controlling other computing systems. Prior works have studied how to control services (which might also run on cloud platform). Have to consider how to adapt these control loops to the cloud computing context.

What is the Right Control Architecture? Autonomic computing for other systems, such as Data Centers: Centralized Control. Has all knowledge and power. Ability to control is unconstrained.

Decoupled Control In cloud computing, nobody knows and control everything. Decoupling is fundamental to the architecture. Want the cloud to be a general platform for all kinds of (diverse) guests. Guests want their services to run on any cloud without having to know about the cloud's internals.

Decoupled Control Each entity has an independent controller. How do these controllers interact? Limited view of the other entities.

Service Interface Constraints Guest controller can only control the resources through a service interface. Providers discretize (coarse) resources into pre-defined sizes. e.g., Amazon EC2 Unavailability of certain type of actuators. (e.g., hypervisor-level) We introduce a new policy, called proportional thresholding [ACDC09]. Dampen control loop. Convert target value to dynamic target range.

Cloud Provider Issues What is the right mechanism and policy for arbitration of requests? If resources are congested, how does a cloud platform pick which request to grant? Provider is a general platform and does not know the particulars of a guest service.

Cloud Provider Issues ORCA- service-oriented infrastructure that provides resource leasing abstractions. Separates mechanism and policy. Resource provider can modify the arbitration policy. (pluggable) Guests use the ticket mechanism to submit requests. Tickets contain guest-configurable properties.

Other Issues Stateful services / services with persistent data Inertia and Lag How to take advantage of elasticity of resources, when services add delay/lag before allocated resources is ready to contribute? Delay before new node is ready. (migration, replication of data) Need to compensate for this delay. Proposed Solution: Cool Standby

Cool Standby Periodically sync data with backup nodes. When controller needs additional nodes, it simply has to turn on the nodes. Reduces delay time, since backup nodes already have most of the data.

Other Issues What is the right control architecture for complex guest services? Composing multiple cloud services. Can use decoupled control (further decoupling of the slice controller). Decompose controller into local controllers. + Easier to model because each controller only needs to know a piece of the overall service. + Parts are pluggable/ can easily change one of the controllers/services. - May not be optimal - Emergent misbehaviors

Thank You.