Network Infrastructure Services CS848 Project



Similar documents
How To Provide Qos Based Routing In The Internet

Integrated Service (IntServ) versus Differentiated Service (Diffserv)

18: Enhanced Quality of Service

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

CS640: Introduction to Computer Networks. Why a New Service Model? Utility curve Elastic traffic. Aditya Akella. Lecture 20 QoS

Optimizing Shared Resource Contention in HPC Clusters

CS/ECE 438: Communication Networks. Internet QoS. Syed Faisal Hasan, PhD (Research Scholar Information Trust Institute) Visiting Lecturer ECE

Scalable Architecture on Amazon AWS Cloud

Monitoring Cloud Applications. Amit Pathak

Internet Quality of Service

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

King Fahd University of Petroleum & Minerals Computer Engineering g Dept

A Review on Quality of Service Architectures for Internet Network Service Provider (INSP)

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Cloud Design and Implementation. Cheng Li MPI-SWS Nov 9 th, 2010

Quality of Service (QoS) EECS 122: Introduction to Computer Networks Resource Management and QoS. What s the Problem?

QoS for (Web) Applications Velocity EU 2011

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

5 Performance Management for Web Services. Rolf Stadler School of Electrical Engineering KTH Royal Institute of Technology.

Evaluation Methodology of Converged Cloud Environments

Network performance in virtual infrastructures

Internet services pricing under usagebased cost allocation: Congestion dependence

Running SAP Solutions in the Cloud How to Handle Sizing and Performance Challenges. William Adams SAP AG

Testing & Assuring Mobile End User Experience Before Production. Neotys

Zadara Storage Cloud A

Cloud computing - Architecting in the cloud

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

Deep Dive: Maximizing EC2 & EBS Performance

benchmarking Amazon EC2 for high-performance scientific computing

Performance And Scalability In Oracle9i And SQL Server 2000

Chapter 19 Cloud Computing for Multimedia Services

Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers

EXPERIMENTAL STUDY FOR QUALITY OF SERVICE IN VOICE OVER IP

Cluster Computing. ! Fault tolerance. ! Stateless. ! Throughput. ! Stateful. ! Response time. Architectures. Stateless vs. Stateful.

Evolution of OpenCache: an OpenSource Virtual Content Distribution Network (vcdn) Platform

CS 268: Lecture 13. QoS: DiffServ and IntServ

Migration Scenario: Migrating Batch Processes to the AWS Cloud

Accelerating Web-Based SQL Server Applications with SafePeak Plug and Play Dynamic Database Caching

International Journal of Engineering Research & Management Technology

Characterizing Task Usage Shapes in Google s Compute Clusters

Amazon EC2 Product Details Page 1 of 5

A Generic Auto-Provisioning Framework for Cloud Databases

Amazon Cloud Storage Options

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

Technology Enablement

Cloud Computing For Bioinformatics

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Resource Utilization of Middleware Components in Embedded Systems

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

A programming model in Cloud: MapReduce

Application Performance Testing Basics

Performance Evaluation of AODV, OLSR Routing Protocol in VOIP Over Ad Hoc

White Paper. Cloud Native Advantage: Multi-Tenant, Shared Container PaaS. Version 1.1 (June 19, 2012)

Who moved my cloud? Part II: What s behind the cloud vendors AWS, GCE and Azure?

Quality of Service for IP Videoconferencing Engineering White Paper

References. Introduction to Database Systems CSE 444. Motivation. Basic Features. Outline: Database in the Cloud. Outline

Introduction to Database Systems CSE 444

Preparing Your IT for the Holidays. A quick start guide to take your e-commerce to the Cloud

Maintaining Non-Stop Services with Multi Layer Monitoring

How swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda

Cloud Computing. Lecture 24 Cloud Platform Comparison

QoS in VoIP. Rahul Singhai Parijat Garg

Concepts Introduced in Chapter 6. Warehouse-Scale Computers. Important Design Factors for WSCs. Programming Models for WSCs

Performance and scalability of a large OLTP workload

4 Internet QoS Management

White Paper. Optimizing the Performance Of MySQL Cluster

Amazon Web Services. Elastic Compute Cloud (EC2) and more...

Cloud Computing: Meet the Players. Performance Analysis of Cloud Providers

Software-Defined Networks Powered by VellOS

WORKFLOW ENGINE FOR CLOUDS

Measurement of IP Transport Parameters for IP Telephony

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand

Rackspace Cloud Databases and Container-based Virtualization

Chapter 7: Distributed Systems: Warehouse-Scale Computing. Fall 2011 Jussi Kangasharju

Figure 1. The cloud scales: Amazon EC2 growth [2].

Question: 3 When using Application Intelligence, Server Time may be defined as.

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

Multimedia Requirements. Multimedia and Networks. Quality of Service

Requirements of Voice in an IP Internetwork

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Performance Modeling and Analysis of a Database Server with Write-Heavy Workload

LBPerf: An Open Toolkit to Empirically Evaluate the Quality of Service of Middleware Load Balancing Services

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

SCALABILITY IN THE CLOUD

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

How AWS Pricing Works May 2015

Transcription:

Quality of Service Guarantees for Cloud Services CS848 Project presentation by Alexey Karyakin David R. Cheriton School of Computer Science University of Waterloo March 2010

Outline 1. Performance of cloud services (Amazon EC2, Amazon S3) 2. Current QoS frameworks (media streaming, network communication) 3. QoS in databases and other IT applications 4. QoS contract specification 5. Enforcing QoS requirements

Implications of cloud computing on performance expectations The cost of cloud services is claimed to be lower due to extensive sharing of resources; However, a higher degree of sharing would automatically mean less guarantees of performance; Since performance expectations of users may be affected by the resource sharing, explicit quality of service requirements could be used;

The types of CPU usage pricing in Amazon AWS 1. Paid per machine-hour - Amazon EC2, Elastic MapReduce, Relational Database Services (MySQL) - On-Demand, Spot, Reservation pricing 2. Paid per request - Amazon S3, SQS 3. Paid per actual CPU consumption - Amazon SimpleDB According to Amazon SLA, the provider can throttle any user's activity at any time

Research of actual cloud performance 1. Simson L. Garfinkel, Technical Report TR-08-07: An Evaluation of Amazon s Grid Computing Services: EC2, S3 and SQS, Harvard University, 2007 2. Edward Walker. Benchmarking Amazon EC2. In Proceedings of USENIX'2008, Volume: 33, Issue: 5, pages 18-23, October 2008.

Garfinkel: variance of read throughput from EC2

Garfinkel: variance of S3 throughput from EC2

Garfinkel: some conclusion 1. Response time stability: between 10% to 20% requests suffer 5x worse performance; 2. For small requests (1B), response time for successive requests to the same bucket is independent, that mean that in case of delay an application could issue another requests that may complete sooner; 3. That is not the case for larger requests (100MB): several requests are likely to complete at the same time; 4. No legal guarantees of service; Amazon has a right to terminate the service any time for any reason 5. AWS is generally fast, responsive, and very reliable, availability is excellent

Walker: NAS parallel benchmark 1. The benchmark containing 8 test programs simulating schientific computations, e.g. fluid dynamics; 2. OpenMPI used for interconnect API 3. The configurations for the test are the same for a local cluster and EC2 cluster

Walker: single node result

Walker: cluster results

Walker: some conclusion 1. The benchmark is a parallel scientific benchmark with high sensitivity to interconnect latency so great degradation in the cluster test comparing to Infiniband is expected; 2. In the single node test the degradation is moderate;

Existing QoS frameworks: media streaming 1. End-to-end (intergation principle) 2. Declarative 3. Types of requirements: - flow synchronization an analog to consistency - flow performance - level of service (deterministic, predictive, best-effort) - management policy (tradeoff between media quality and bandwidth) - cost of service 4. Mechanisms to achieve QoS: - QoS mapping to underlying services (thread priorities, etc) - admission testing - resource reservation protocol

Existing QoS frameworks: network communication 1. Approaches: - intserv: reservation of resources, RSVP (RFC 2205) - diffserv: priorities based on the type of traffic, implemented as Differentiated Services Field (DS Field) in the IPv4 and IPv6 Headers (RFC 2474) 2. QoS isn't widely used: only used in some enterprise networks, not on Internet, supported by some routers 3. Problems with internet QoS: - hard to support reservation in high-volume routers for thousands of concurrent sessions - encryped traffic cannot be inspected - ability to abuse QoS policies by both users and providers 4. It is argued that increasing internet bandwidth is a better way to meet users' quality expectations instead of employing some form of QoS

Types of quality (QoS dimensions) 1. Performance: response time, mean throughput 2. Quality of result set: data freshness, correctness, amount of data returned 3. Economic factors of query execution especially important for a scalable utility service Those requirements are often contradictory.

Real-time DB approach A query must complete within a specified time. Otherwise, the query can be canceled or downgraded to low priority execution. Implemented using resource reservation and priorities. Much research exists on real-time databases, including distributed ones. Some interesting problems include scheduling, concurrency control and priority inversion avoidance. Real-time database approach is usually not considered for utility service providers as it is in conflict with the concept of resource sharing in order to minimize costs.

SLA approach Service Level Agreement (SLA) specifies an averaged restrictions on QoS performance metrics, over a (relatively) long time interval. Quality requirements can be specified for a class of queries or for each individual query. The goal is to maximize the percentage of queries that fulfill the constraints.

Economic models of query execution Historically, many factors are included in query cost estimation: - CPU cycles - disk I/O, etc, etc Economic models can combine those factors, along with query QoS requirement into single model. The model defines Wealth value, corresponding to the importance of the workload. Resource brokers sell resources to Consumers to maximize its profits. Consumers compete for resources on a auction, trying to use maximize utility/cost ratio. SLA can be integrated into the Wealth definition. Economic models naturally fit into cloud computing infrastructure as using resources really costs money.

Strategies to meet SLA requirements 1. Adaptive query execution - selecting one of many sub-plans during query execution based on its tradeoffs and the current execution monitoring data 2. Dynamic resource allocation (e.g. using feedback control loop) - most commonly, CPU time and memory (caches, buffer pool) is managed - resource allocation can be dynamically split between classes of queries. Particularly, many publications discuss optimal buffer pool allocation between parts of workload. 3. Admission control - Multi-Programming Limit (MPL) 4. Migration of query execution or data between physical resources

Fin