Service Performance Aspects for Cloud Service Level Agreements



Similar documents
Eucalyptus-Based. GSAW 2010 Working Group Session 11D. Nehal Desai

Performance Management for Cloudbased STC 2012

Performance Management for Cloud-based Applications STC 2012

Comparison of Request Admission Based Performance Isolation Approaches in Multi-tenant SaaS Applications

IaaS Federation. Contrail project. IaaS Federation! Objectives and Challenges! & SLA management in Federations 5/23/11

SLA BASED SERVICE BROKERING IN INTERCLOUD ENVIRONMENTS

Software-Defined Networks Powered by VellOS

Journey to the Private Cloud. Key Enabling Technologies

Security & Trust in the Cloud

Predictable Data Centers

Cloud Computing Standards: Overview and ITU-T positioning

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Novel Network Computing Paradigms (I)

and Deployment Roadmap for Satellite Ground Systems

Sacha Dubois RED HAT TRENDS AND TECHNOLOGY PATH TO AN OPEN HYBRID CLOUD AND DEVELOPER AGILITY. Solution Architect Infrastructure

Cloud computing: the state of the art and challenges. Jānis Kampars Riga Technical University

APPLICATION DELIVERY IN OPENSTACK WITH AVI NETWORKS

The Cisco Powered Network Cloud: An Exciting Managed Services Opportunity

Mobile Cloud Networking FP7 European Project: Radio Access Network as a Service

Today: Data Centers & Cloud Computing" Data Centers"

Hosting Performance-Sensitive Applications in the Cloud

Planning the Migration of Enterprise Applications to the Cloud

Traditional Disaster Recovery versus Cloud based DR

Achieving a High-Performance Virtual Network Infrastructure with PLUMgrid IO Visor & Mellanox ConnectX -3 Pro

Who moved my cloud? Part I: Introduction to Private, Public and Hybrid clouds and smooth migration

Minder. simplifying IT. All-in-one solution to monitor Network, Server, Application & Log Data

Secure Cloud-Ready Data Centers Juniper Networks

Accelerate the Performance of Virtualized Databases Using PernixData FVP Software

SLA-driven Dynamic Resource Provisioning for Service Provider in Cloud Computing

Intel Service Assurance Administrator. Product Overview

APPLICATION DELIVERY IN OPENSTACK WITH AVI NETWORKS CLOUD APPLICATION DELIVERY PLATFORM

AEIJST - June Vol 3 - Issue 6 ISSN Cloud Broker. * Prasanna Kumar ** Shalini N M *** Sowmya R **** V Ashalatha

Oracle Applications and Cloud Computing - Future Direction

Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise

How To Understand The Value Of Cloud Computing For An Enterprise Company

Energetic Resource Allocation Framework Using Virtualization in Cloud

Software Define Storage (SDs) and its application to an Openstack Software Defined Infrastructure (SDi) implementation

Cloud Federations in Contrail

Extreme Networks: Building Cloud-Scale Networks Using Open Fabric Architectures A SOLUTION WHITE PAPER

1.1.1 Introduction to Cloud Computing

Public Cloud Workshop Offerings

A Strawman Model. NIST Cloud Computing Reference Architecture and Taxonomy Working Group. January 3, 2011

CloudLink - The On-Ramp to the Cloud Security, Management and Performance Optimization for Multi-Tenant Private and Public Clouds

PEPPERDATA IN MULTI-TENANT ENVIRONMENTS

ITU- T Focus Group Cloud Compu2ng

Dr. Jesus Luna Garcia

Choosing Between Commodity and Enterprise Cloud

Migrating Scientific Applications from Grid and Cluster Computing into the Cloud Issues & Challenges

Leveraging the Cloud. September 22, Digital Government Institute Cloud-Enabled Government Conference Washington, DC

STRATEGIC WHITE PAPER. The next step in server virtualization: How containers are changing the cloud and application landscape

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

Monitoring Cloud Applications. Amit Pathak

HP Converged Cloud Cloud Platform Overview. Shane Pearson Vice President, Portfolio & Product Management

Rise of RaaS: the Resource-as-a-Service Cloud

Making the Business and IT Case for Dedicated Hosting

Lecture on Cloud Computing. Pricing, SLA, business models

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

Federal Aviation Administration. efast. Cloud Computing Services. 25 October Federal Aviation Administration

Enterprises Seek The Benefits Of Hybrid Cloud, And Work To Overcome The Challenges

How OpenFlow -Based SDN Transforms Private Cloud. ONF Solution Brief November 27, 2012

Infrastructure as a Service (IaaS)

Cloud Computing An Introduction

Dynamic Resource Allocation in Software Defined and Virtual Networks: A Comparative Analysis

Consumption IT. Michael Shepherd Business Development Manager. Cisco Public Sector May 1 st 2014

ITL BULLETIN FOR JUNE 2012 CLOUD COMPUTING: A REVIEW OF FEATURES, BENEFITS, AND RISKS, AND RECOMMENDATIONS FOR SECURE, EFFICIENT IMPLEMENTATIONS

Understanding Virtualization and Cloud in the Enterprise

OpenStack Private Cloud

Successfully Deploying Globalized Applications Requires Application Delivery Controllers

WHITE PAPER Empowering Application Workloads Migration to Cloud Services

Interoperable Clouds

Network Infrastructure Services CS848 Project

Enterprise Architecture Review Checklist

Multifaceted Resource Management for Dealing with Heterogeneous Workloads in Virtualized Data Centers

Elevating Data Center Performance Management

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

Why Use OPTIMIS? Build and Run Services in the Most Suitable Cloud Venues. May 2013

HPC ON WALL ST OPENSTACK AND BIG DATA. Brent Holden Chief Field Architect, Eastern US April 2014

The Benefits of POWER7+ and PowerVM over Intel and an x86 Hypervisor

HP Cloud technologies

Transcription:

Service Performance Aspects for Cloud Service Level Agreements Dr. Craig A. Lee, Senior Scientist, lee@aero.org Computer Systems Research Department The Aerospace Corporation NITRD SLA Workshop Arlington, VA, August 6, 2014 The Aerospace Corporation 2014

Why SLAs? Some applications will be performance-critical or performance-sensitive "Best effort" cloud resources may not suffice to meet mission requirements Some applications will have dynamic requirements Some apps will have varying demands surge -- at unpredictable times Surge traditionally addressed by over-provisioning with dedicated hardware Dedicated system was sized for the worst-case, rather than the average case Drove acquisition costs and operation costs for the entire life of the system Example: Satellite ground systems This is antithetical to cloud computing Multi-tenant environment where utilization and costs can be better managed Hence, the goal is to provide the user with a reasonable expectation that performance requirements will be met, through mechanisms that are reasonable for the provider to implement and support for multiple apps These cannot simply be contractual SLAs These must be capabilities that a provider may provide and a user may use to keep applications in spec Dynamic, machine-enforceable SLAs Work in this area has already been done in the Grid community 2

Grid Computing: Big Science Collaboration on a Global Scale 3 EU EGI, NSF XSEDE, DOE OSG, (WLCG Dashboard website: http://dashb-earth.cern.ch)

WS-Agreement, GFD.192 Defines term language and protocol for advertising service provider capabilities, creating agreements based on offers, monitoring compliance, and penalties/rewards for non-compliance RESTful implementations exist 4

WS-AgreementNegotiation, GFD.193 Defines an offer/counter-offer model for dynamic exchange of information between a negotiation initiator and responder Rounds of negotiation modeled as a rooted tree States: Advisory, Solicited, Acceptable, Rejected Layered model separates functions and implementations 5

How to Use this in a Cloud Environment? WS-Agreement and WS-AgreementNegotiation are parameter (term language) and cloud agnostic Many SLA metrics possible: Memory/Disk (space: bytes) Throughput (rate: x/sec) Bandwidth (rate: bytes/sec) Latency (time: t) Time-to-Solution (time: t) Availability (time ratio: percentage) Application-level requirements must be mapped to infrastructure-level requirements This will be application-specific To have "teeth", an SLA must be monitored and enforced WS-Agreement and WS-AgreementNegotiation are only the front-end of the SLA process 6

Basic SLA Functions an Autonomic Control Cycle Admission Control Mapping of app-level requirements to infrastructure-level metrics WS-Agreement and WS-Agreement Negotiation Term language needed Monitoring - Metrics Collection Where: host OS/hypervisor, guest OS, application-level When: upstream vs. downstream SLA Evaluation Hysteresis Statistical methods, e.g., Median Absolute Deviation, Interquartile Range, Iterative Local Regression SLA Enforcement -- Violation Response Throttling Load migration process, VM, container migration all possible Elasticity -- on-demand resources SLA re-negotiation 7

Server Load Types for SLA Management Spare additional Cap. surge w/ capacity preemptible back-fill Type IIIa Type IIIa Type IIIb Type IIIb Type I Best Effort Type II Strict Throttling Type III Live Migration w/ spare w/ back-fill capacity Type IV Preemptible Type V Idle 8 Lee and Sill, A Design Space for Dynamic Service Level Agreements in OpenStack Journal of Cloud Computing: Advances, Systems and Applications, to appear, 2014.

An SLA Architecture for OpenStack 9 Lee and Sill, A Design Space for Dynamic Service Level Agreements in OpenStack Journal of Cloud Computing: Advances, Systems and Applications, to appear, 2014.

Summary, Findings, Conclusions & Comments Lots of Development & Testing needed What are the simplest SLA mechanisms that "scratch the itch" for the most users? Contractual SLAs vs. machine-enforceable SLAs Cloud performance SLAs vs. network SLAs (SDNs?) Lighter weight alternatives to VM migration Process migration; Container-based virtualization -- Docker Capacity Planning & Management How to estimate query requirements, load demand, time-to-completion How to support reasonable load requirements to produce reasonable behavior How to manage sets of users such that no one user is disruptive Cyber-security Implications As clouds become larger and more widely used, there will be more automated tools, i.e., autonomic behaviors Autonomic agents become a threat surface -- compromising an agent that controls system behavior would have broad impact Leverage/harmonize existing SLA work OGF WS-Agreement, WS-Agreement Negotiation TeleManagement Forum (TMF) Distributed Management Task Force (DMTF), <humor> And don t forget the WS-Disagreement protocol (WS-NO), GFD.199, ;-) Most negotiations fail anyway WS-Disagreement save vast amounts of time and money by immediately going to the Disagree state and staying there </humor> 10