The Performance and Cost Variability of Amazon EC2



Similar documents
Economic Cloud Computing What to keep in mind when using the Cloud...

Amazon Elastic Compute Cloud Getting Started Guide. My experience

Cloud Computing and Amazon Web Services

How AWS Pricing Works May 2015

How AWS Pricing Works

Cloud Computing For Bioinformatics

AWS Account Setup and Services Overview

Amazon EC2 Product Details Page 1 of 5

Introduction to Cloud Computing on Amazon Web Services (AWS) with focus on EC2 and S3. Horst Lueck

Technology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study

AWS Performance Tuning

Scalable Application. Mikalai Alimenkou

Intro to AWS: Storage Services

Using ArcGIS for Server in the Amazon Cloud

Amazon Web Services. Elastic Compute Cloud (EC2) and more...

Comparing Cloud Computing Resources for Model Calibration with PEST

Cloud Computing. Alex Crawford Ben Johnstone

ColdFusion 10 in the Amazon AWS Cloud. Sven Ramuschkat tecracer GmbH

ur skills.com

AWS Storage: Minimizing Costs While Retaining Functionality

How To Choose Between A Relational Database Service From Aws.Com

National Center for Education Statistics. Amazon Hosted ESRI ArcGIS Servers Project Final Report

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts

Amazon AWS in.net. Presented by: Scott Reed

Amazon Web Services Student Tutorial

Cloud Computing with Amazon Web Services and the DevOps Methodology.

Running R from Amazon's Elastic Compute Cloud

Using ArcGIS for Server in the Amazon Cloud

Data Sharing Options for Scientific Workflows on Amazon EC2

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Amazon Relational Database Service. User Guide API Version

An Esri White Paper January 2011 Estimating the Cost of a GIS in the Amazon Cloud

OTM in the Cloud. Ryan Haney

Hosting Requirements Smarter Balanced Assessment Consortium Contract 11 Test Delivery System. American Institutes for Research

Cloud Computing and E-Commerce

Amazon Elastic Beanstalk

The Cost of the Cloud. Steve Saporta CTO, SwipeToSpin Mar 20, 2015

Cloud Performance Benchmark Series

CS / Cloud Computing. Recitation 2 September 2 & 4, 2014

Hosting Requirements Smarter Balanced Assessment Consortium Contract 11 Test Delivery System. American Institutes for Research

Estimating the Cost of a GIS in the Amazon Cloud. An Esri White Paper August 2012

Amazon CloudWatch to monitor cloud resource usage

MoMoD-Cloud Movie Mobile on Demand by Cloud

Scaling in the Cloud with AWS. By: Eli White (CTO & mojolive) eliw.com - mojolive.com

Développement logiciel pour le Cloud (TLC)

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

Cloud Server as IaaS Pricing

EEDC. Scalability Study of web apps in AWS. Execution Environments for Distributed Computing

Part 1: Price Comparison Among The 10 Top Iaas Providers

Cloud Server as IaaS Pricing

TECHNOLOGY WHITE PAPER Jan 2016

Cloud Based Tes,ng & Capacity Planning (CloudPerf)

Unisys Cost Schedule. Unisys Cost Schedule. Page 1

Shadi Khalifa Database Systems Laboratory (DSL)

!"#$%&'()*'+),-./)0' 9##+':,%-.;),0'

PUBLIC CLOUD USAGE TRENDS

AlphaTrust PRONTO - Hardware Requirements

Running Oracle on the Amazon Cloud

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)

Deep Dive: Maximizing EC2 & EBS Performance

What is Cloud Computing? Why call it Cloud Computing?

Tunebot in the Cloud. Arefin Huq 18 Mar 2010

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Cloud Compu)ng. [Stephan Bergemann, Björn Bi2ns] IP 2011, Virrat

Cloud Computing: Amazon Web Services

Build Your Own Performance Test Lab in the Cloud. Leslie Segal Testware Associate, Inc.

DataCenter optimization for Cloud Computing

Pierre Riteau University of Chicago

Public Cloud Infrastructure - The Advantages and Disadvantages

Building a Private Cloud with Eucalyptus

Drupal in the Cloud. by Azhan Founder/Director S & A Solutions

THE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES

This computer will be on independent from the computer you access it from (and also cost money as long as it s on )

Cloud Computing on Amazon's EC2

The Cloud as a Computing Platform: Options for the Enterprise

CENIC Private Cloud Pilot Using Amazon Reserved Instances (RIs) September 9, 2011

Scalable Architecture on Amazon AWS Cloud

Resource Sizing: Spotfire for AWS

Designing Apps for Amazon Web Services

Cloud Computing. Adam Barker

Administrative Issues

Logentries Insights: The State of Log Management & Analytics for AWS

Scalable Services for Digital Preservation

TECHNOLOGY WHITE PAPER Jun 2012

Part II: What s behind the cloud vendors AWS, Azure and GCP?

Public Cloud Offerings and Private Cloud Options. Week 2 Lecture 4. M. Ali Babar

FortiGate Amazon Machine Image (AMI) Selection Guide for Amazon EC2

Very Large Enterprise Network, Deployment, Users

Cloud Performance Benchmark Series

COMPUTER MEASUREMENT GROUP - India Hyderabad Chapter. Strategies to Optimize Cloud Costs By Cloud Performance Monitoring

Thing Big: How to Scale Your Own Internet of Things.

ArcGIS for Server: In the Cloud

PV213 Enterprise Information Systems in Practice 07 - Architecture of the EIS in the cloud

Neptune. A Domain Specific Language for Deploying HPC Software on Cloud Platforms. Chris Bunch Navraj Chohan Chandra Krintz Khawaja Shams

CLOUD BENCHMARK ROUND 1

Cloud n Service Presentation. NTT Communications Corporation Cloud Services

Amazon Cloud Storage Options

Reducing Your AWS Spend by 40%: Best Practices Mark Lavi, Technology June 2016

GeoCloud Project Report GEOSS Clearinghouse

Transcription:

The Performance and Cost Variability of Amazon EC2 Gary A. McGilvary edinburgh data.intensive research 1

OUTLINE 1. Introduction 2. Cost Variations 3. Performance Variations Motivation Amazon EC2 and SPRINT End-User Location Data Transfer Usage Charges Cloud Load Underutilization Instance Processors 4. Conclusions/Future Work 2

INTRODUCTION Investigate the variability of cost and performance - Optimal cloud configuration - Cloud platform selection - Cloud vs HPC - Business impact

AMAZON EC2 Amazon s Elastic Compute Cloud Compute Size Memory Storage Compute Cores I/O Cost $ Small 1.7 GB 160 GB 1 CU 1 Moderate 0.085 Large 7.5 GB 850 GB 4 CU s 2 High 0.34 XLarge 15 GB 1690 GB 8 CU s 4 High 0.68 EC2 Compute Unit 1 EC2 CU: 1.0-1.2 GHz Xeon 2007 processor

AMAZON EC2 Amazon s Elastic Compute Cloud Network - Regions

AMAZON EC2 Amazon s Elastic Compute Cloud Network - Availability Zones IDT RDT AZDT

SPRINT Simple Parallel R INTerface - provides parallel functions of R Functions: - pcor: parallel correlation (memory/compute-intensive) pcor(t (x, y = x) - pmaxt: parallel permutation test (compute-intensive) pmaxt(x, classlabel, B=150000)

OUTLINE 1. Introduction 2. Cost Variations Motivation Amazon EC2 and SPRINT End-User Location Data Transfer Usage Charges 3. Performance Variations 4. Conclusions/Future Work 8

COST VARIABILITY Cost vs User Location: Instance Location: - US East Region - us-east-1b Submit from: Thailand and UK Experiment Copy of SPRINT pcor data SPRINT package installation from EC2 repository Execution Results: Invoice and Usage report

Amazon Web Services Billing Statement: February 1 - February 28, 2011 Date Printed: February 23, 2011 Amazon Elastic Compute Cloud US East (Northern Virginia) Region Amazon EC2 running Linux/UNIX $0.085 per Small Instance (m1.small) about:blank Name: Gary McGilvary Email: gary.mcgilvary@ed.ac.uk Account Number: 0864-1647-1363 Cost instance-hour vs (or User partial hour) Location: Amazon EC2 EBS COST VARIABILITY Totals 10 Hrs 0.85 $0.00 per GB-month of provisioned 0.024 GB-Mo 0.00 storage under monthly free tier Amazon Web $0.00 Services per 1 million I/O requests under 5,422 IOs Name: Gary McGilvary 0.00 Billing Statement: monthly February free tier 1 - February 28, 2011 Email: gary.mcgilvary@ed.ac.uk Date Printed: $0.00 February per 10,000 23, 2011 gets (when loading a 2,048 Requests Account Number: 0864-1647-1363 0.00 snapshot) under monthly free tier Amazon CloudWatch $0.015 per monitored instance-hour (or 5 Hrs Totals 0.08 Amazon Elastic partial Compute hour) Cloud US East (Northern Virginia) Region» 0.93 Amazon Amazon Simple EC2 Notification running Linux/UNIX Service US East (Northern $0.085 Virginia) per Small Region Instance (m1.small) First 10 Hrs 0.85 instance-hour 100,000 Amazon (or partial SNS hour) API 36 Requests 0.00 Amazon Requests EC2 EBSper month are free $0.00 per GB-month of provisioned» 0.00 Amazon Simple 0.024 GB-Mo 0.00 storage Storage under Service monthly free tier US Standard Region $0.00 per 1 million I/O requests under $0.00 5,422 IOs 0.00 monthly per request free tier- PUT, COPY, POST, $0.00 or per LIST 10,000 requests gets (when under loading the a 17 Requests 0.00 2,048 Requests 0.00 monthly snapshot) global under free monthly tier free tier Amazon CloudWatch» 0.00 Amazon Virtual $0.015 Private per monitored Cloud instance-hour (or 5 Hrs 0.08 partial hour)» 0.00 AWS Data Transfer (excluding Amazon CloudFront)» 0.93 Amazon Simple $0.000 Notification per GB - data Service transfer in under US East (Northern Virginia) Region 0.040 GB 0.00 monthly global free tier $0.000 First 100,000 per GB Amazon - data transfer SNS API out under 36 Requests Requests per month are free 0.004 GB 0.00 the monthly global free tier $0.010 per GB - regional data transfer -» 0.00 Amazon Simple in/out/between Storage Service EC2 Avail Zones or US Standard Region 0.511 GB 0.01 when using public/elastic IP addresses or $0.00 ELBper request - PUT, COPY, POST, or LIST requests under the 17 Requests 0.00 0.01 Taxes monthly global free tier Estimated Taxes VAT Registration» 0.00 0.19 (Due Amazon March 1, Virtual 2011) Private Cloud» 0.00 Total Charges due on March 1, 2011 $1.13 AWS Data Transfer (excluding Amazon CloudFront) $0.000 per GB - data transfer in under 0.040 GB 0.00 the monthly global free tier $0.000 per GB - data transfer out under 0.004 GB 0.00 the monthly global free tier $0.010 per GB - regional data transfer - 23/02/2011 12:33 <?xml version="1.0" encoding="utf-8"?> <ServiceUsage> <OperationUsage> <ServiceName>AmazonEC2</ServiceName> <OperationName>RunInstances</OperationName> <UsageType>DataTransfer-In-Bytes</UsageType> <StartTime>02/24/11 12:00:00</StartTime> <EndTime>02/24/11 13:00:00</EndTime> <UsageValue>4253394</UsageValue> </OperationUsage> <OperationUsage> <ServiceName>AmazonEC2</ServiceName> <OperationName>GetMetricStatistics</OperationName> <UsageType>Calls</UsageType> <StartTime>02/24/11 12:00:00</StartTime> <EndTime>02/24/11 13:00:00</EndTime> <UsageValue>20</UsageValue> </OperationUsage> <OperationUsage> <ServiceName>AmazonEC2</ServiceName> <OperationName>RunInstances</OperationName> <UsageType>BoxUsage</UsageType> <StartTime>02/24/11 12:00:00</StartTime> <EndTime>02/24/11 13:00:00</EndTime> <UsageValue>1</UsageValue> </OperationUsage> <OperationUsage> <ServiceName>AmazonEC2</ServiceName> <OperationName>InterZone-Out</OperationName> <UsageType>DataTransfer-Regional-Bytes</UsageType> <StartTime>02/24/11 12:00:00</StartTime> <EndTime>02/24/11 13:00:00</EndTime> <UsageValue>42708</UsageValue> </OperationUsage> Page 1 of 1

COST VARIABILITY Cost vs User Location: Difference in taxation levels - Scale with use and expensive for prolonged time periods Difference in resource usages - dependent on location or cloud load? Consequences of user location: - Businesses/Individuals in a tax free zone will benefit - reduction in performance?

COST VARIABILITY Variability of Data Transfer Usage Charges Run SPRINT s pcor function multiple times Small Ubuntu instance Regional Data Transfer (RDT) Transferred 84.3 MB s from EC2 Ubuntu Repository to the instance 12

CLOUD CORRECTNESS Variability of Data Transfer Usage Charges Regional Data Transfer (RDT) Transferred 84.3 MB s from EC2 Ubuntu Repository to the instance Regional%Data%Transfer%In% MB's%Consumed% 90" 80" 70" 60" 50" 40" 30" 20" 10" 0" Usage"Report" tcpdump" 1" 2" 3" 4" 5" 6" 7" 8" 9" 10" 11" 12" Experiment%

COST VARIABILITY Variability of Data Transfer Usage Charges 90" Usage"Report" tcpdump" Consequences 80" of incorrect data usage recording: MB's%Consumed% 70" 60" 50" 40" 30" Regional%Data%Transfer%In% - some free data transfer! - substantial savings for prolonged use! - GB s can go unrecorded on Azure 20" 10" 0" 1" 2" 3" 4" 5" 6" 7" 8" 9" 10" 11" 12" Experiment% 13

OUTLINE 1. Introduction 2. Cost Variations 3. Performance Variations Motivation Amazon EC2 and SPRINT End-User Location Data Transfer Usage Charges Cloud Load Underutilization Instance Processors 4. Conclusions/Future Work 14

CLOUD LOAD Cloud Load vs Time of Day? Improving High-Performance Computations on Clouds Through Resource Underutilization 15

UNDERUTILISATION Reserving more resources while using a small % of each - optimum configuration! SPRINT s pcor and pmaxt functions and EC2 Large instances 2 GHz 2 GHz R R 2 GHz 2 GHz VS R R R R R R

UNDERUTILISATION Case Study: SPRINT

UNDERUTILISATION Consequences of Underutilization: - Reserving more resources will increase costs - cost vs performance? - Paper: Improving High-Performance Computations on Clouds Through Resource Underutilization - Underutilization improves the expected execution time by two orders of magnitude -... it is more than 3 times cheaper to use 50% of the resources than 100% - Geared towards finding the optimal cloud configuration - Could potentially save businesses/individuals a substantial amount of time and money How do we determine the utilisation rate, and hence optimal configuration for a job?

INSTANCE PROCESSORS Instances deployed on varying processors Availability Zone: us-east-1d Instance Type: Large Remember: EC2 Compute Units (1.0-1.2 GHz, Xeon 2007) - Large instance = 2 EC2 CU s per core (2cores), 4.0-4.8 GHz - Large instance has 2 processors Processor Type Min Usage Max Usage Intel Xeon E5507 2.27 GHz ( x2-4.54 GHz) Intel Xeon E5645 2.4 GHz (x2-4.8 GHz) Intel Xeon E5430 2.66 GHz (x2-5.32 GHz) 88.1% 100% 83.3% 100% 75.1% 90.22%

INSTANCE PROCESSORS 1 core: 2-GHz 2 cores: 4-GHz 20

CONCLUSIONS Dependent on user location, costs may differ as well as resources used EC2 s data transfer usage mechanism may be incorrect at times - cloud load? Application performance can vary significantly (execution times) Underutilization can increase performance - in every case? - utilization rate? The underlying instance processors affect performance - correct EC2 Compute Units specified? Optimal cloud configuration == increased performance == lowest cost 21

THANK YOU! Questions? gary.mcgilvary@ed.ac.uk 22