Volunteer Computing, Grid Computing and Cloud Computing: Opportunities for Synergy. Derrick Kondo INRIA, France

Similar documents
Cost-Benefit Analysis of Cloud Computing versus Desktop Grids

Volunteer Computing and Cloud Computing: Opportunities for Synergy

Cost-Benefit Analysis of Cloud Computing versus Desktop Grids. By : Paritosh Heera( MT )

How To Compare Amazon Ec2 To A Supercomputer For Scientific Applications

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts

Technology and Cost Considerations for Cloud Deployment: Amazon Elastic Compute Cloud (EC2) Case Study

Code and Process Migration! Motivation!

Amazon Web Services 100 Success Secrets

Cloud Computing Trends

Cloud Computing and Amazon Web Services

wu.cloud: Insights Gained from Operating a Private Cloud System

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Amazon EC2 Product Details Page 1 of 5

Distribution transparency. Degree of transparency. Openness of distributed systems

bigdata Managing Scale in Ontological Systems

SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES

Keywords: Cloudsim, MIPS, Gridlet, Virtual machine, Data center, Simulation, SaaS, PaaS, IaaS, VM. Introduction

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

IMCM: A Flexible Fine-Grained Adaptive Framework for Parallel Mobile Hybrid Cloud Applications

Big Data and Cloud Computing for GHRSST

What is Cloud Computing? Tackling the Challenges of Big Data. Tackling The Challenges of Big Data. Matei Zaharia. Matei Zaharia. Big Data Collection

Cloud computing - Architecting in the cloud

Solution for private cloud computing

Cloud Design and Implementation. Cheng Li MPI-SWS Nov 9 th, 2010

Introduction to AWS Economics

Navigating Among the Clouds. Evaluating Public, Private and Hybrid Cloud Computing Approaches

This presentation covers virtual application shared services supplied with IBM Workload Deployer version 3.1.

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

Cloud Computing Now and the Future Development of the IaaS

Network & HEP Computing in China. Gongxing SUN CJK Workshop & CFI

Grid Computing Vs. Cloud Computing

Concepts Introduced in Chapter 6. Warehouse-Scale Computers. Important Design Factors for WSCs. Programming Models for WSCs

Hadoop Cluster Applications

Sriram Krishnan, Ph.D.

Four Orders of Magnitude: Running Large Scale Accumulo Clusters. Aaron Cordova Accumulo Summit, June 2014

Cloud JPL Science Data Systems

The Hidden Extras. The Pricing Scheme of Cloud Computing. Stephane Rufer

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Cloud Computing. Adam Barker

Cloud Computing For Bioinformatics

There Are Clouds In Your Future. Jeff Barr Amazon Web (Twitter)

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

Maginatics Cloud Storage Platform for Elastic NAS Workloads

STeP-IN SUMMIT June 2014 at Bangalore, Hyderabad, Pune - INDIA. Mobile Performance Testing

Oracle Applications and Cloud Computing - Future Direction

Cluster, Grid, Cloud Concepts

Development at the Speed and Scale of Google. Ashish Kumar Engineering Tools

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

Manjrasoft Market Oriented Cloud Computing Platform

Cloud Computing through Virtualization and HPC technologies

Cloud Computing Submitted By : Fahim Ilyas ( ) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

CHAPTER 8 CLOUD COMPUTING

Cloud Computing and Advanced Relationship Analytics

DLT Solutions and Amazon Web Services

System Models for Distributed and Cloud Computing

A Cloud WHERE PHYSICAL ARE TOGETHER AT LAST

The Definitive Guide to Cloud Acceleration

High Performance Computing OpenStack Options. September 22, 2015

HyperQ DR Replication White Paper. The Easy Way to Protect Your Data

Using Proxies to Accelerate Cloud Applications

International Journal of Engineering Research & Management Technology

Towards Predictable Datacenter Networks

Cloud computing. Examples

Introduction to Cloud Computing

StorReduce Technical White Paper Cloud-based Data Deduplication

Amazon Cloud Storage Options

PERFORMANCE ANALYSIS OF PaaS CLOUD COMPUTING SYSTEM

Cloud Computing. Summary

Grid Computing vs Cloud

VMware VDR and Cloud Storage: A Winning Backup/DR Combination

Introduction to Cloud Computing

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Manjrasoft Market Oriented Cloud Computing Platform

Cloud Computing Is In Your Future

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Big data management with IBM General Parallel File System

Elastic Management of Cluster based Services in the Cloud

Multilevel Communication Aware Approach for Load Balancing

C a r l G o e t h a l s T e r r e m a r k E u r o p e. C a r l. g o e t h a l t e r r e m a r k. c o m

Transcription:

Volunteer Computing, Grid Computing and Cloud Computing: Opportunities for Synergy Derrick Kondo INRIA, France

Outline Cloud Grid Volunteer Computing

Cloud Background Vision Hide complexity of hardware and software management from a user by offering computing as a service Benefits Pay as you go Scale up, down dynamically No hardware management, less software management

Platform Performance vs. Costs high Supercomputer Grid Cluster Cost low low Cloud Performance What is the relationship? BOINC BOINC + Cloud? high

Clouds versus Volunteer Computing Performance tradeoffs Monetary tradeoffs Client hosting Server hosting

Method Use real performance measurements Exported BOINC project data Use real costs Large/small BOINC projects (SETI@home / XtremLab) Amazon Elastic Computing Cloud (EC2)

Stages of Project & Application Platform Construction Application Deployment Application Execution Application Completion

Platform Construction Application Deployment Application Execution Application Completion How long before I get X TeraFLOPS? Strategy: Add to BOINC project list Press releases Forum Announcements Google Ad Sense Respond to users (leverage volunteers) Number of cloud nodes 10 6 10 5 10 4 Can get over 20 TeraFLOPS within 6 months 10 3 0 5 10 15 20 25 30 Months for registration 10 3 10 2 10 1 TeraFLOPS

Platform Construction Application Deployment Application Execution Application Completion How long to deploy my batch of tasks needing fast response time? Strategy: Specify lower latency bounds [Heien et al.] 7,82/9*,34.4:*,.;*:3)4,6<!" '!" &!"!!" "!"!!.!"".4=6>6!""".4=6>6!"""".4=6>6 For 1000 tasks, ~10 minutes with 10 5 hosts!" #!" $!" % ()*+,-./0.1/2)34,,-.3/5,6.

Platform Construction Application Deployment Application Execution Application Completion How many volunteer nodes are equivalent to 1 cloud node? 2.8 active volunteer hosts per 1 cloud node. (Total performance still orders of magnitude better) Strategy: Use statistical prediction of availability

Platform Construction Application Deployment Application Execution Application Completion How long should I wait for task completion? Strategy: See BOINC Catalog for typical deadlines and compute/comm/mem ratios. Median project latency bound: 9 days for 3.7 hour work unit (on 3GHz host). Ratio of lat. bound / exec time > 5. Good success rates: 96.1% of WCG tasks met out of 227,000 tasks

Monetary Tradeoffs Client hosting on cloud Not worth it and never will Server hosting on the cloud Possible solution

Monthly Project Costs Text 1K Students 1K 1K 1K

EC2 Pricing

What about Client Hosting on the Cloud?

Cost of Clouds versus BOINC When is BOINC more cost effective than Clouds? 7,8./9,)/:;<( #!!## #!!#$ #!!#% #!!#' F#'!'/G*HI8.,,)/8*3,5 2'J&/-H*I3/8*3,56 '"&$/G*HI8.,,)/8*3,5 2#&#$/-H*I3/8*3,56 =.),>;0? @ABCDE*>, A7$ / #!!#" After ~13 days #!!#& /! " #! #" $! $" %! ()*+,-./01,/230456

Cost of Clouds versus BOINC (2) 8 x 104 What are total costs over time? Total cost 7 6 5 4 3 2 1 Within <3 days, BOINC is cheaper EC2 100n EC2 1000n EC2 10000n XtremLab SETI@home 0 0 5 10 15 20 Platform age (days)

Cost of Clouds versus BOINC (3) How many months of BOINC can X months of Cloud buy me? Months with VC 3500 3000 2500 2000 1500 1000 Xtremlab / 1000n EC2 SETI@home / 1000n EC2 Xtremlab / 10000n EC2 SETI@home / 10000n EC2 12 months of Cloud can pay for > 125 BOINC years. BOINC way more sustainable. 500 0 0 5 10 15 20 25 Months with EC2

Equivalent Clouds given BOINC Costs Given BOINC costs, what size Cloud can I buy? 1.75 TeraFLOPS, 7 TeraBytes 2 orders of magnitude lower than BOINC.75 TeraFLOPS, 3 TeraBytes

What about Server Hosting on the Cloud?

Number of Hosts over Time Load variation exits w/ publicity, projects run out of work, etc. Clouds take care of server management. Potential to exploit clouds.

Server Costs on a Cloud How much to host BOINC server on cloud? 1.7-16.5 times cheaper to host on Cloud, but bandwidth is expensive. (versus 43K +12K/month) (versus {4,1}K + {5,1}K/month)

Cloud Resources with Given Budget How big of a server can I support with given budget? Many project servers are sustainable on the cloud 4 Mbps, 2 TB 12 Mbps, 2 TB

Summary of Cloud Versus Volunteer Computing Performance tradeoffs 20 BOINC TeraFLOPS within 6 months Monetary tradeoffs Client hosting After 13 days, BOINC more cost effective Server hosting BOINC server on cloud is cost-effective Best for small-medium sized projects Savings of at least 40% http://mescal.imag.fr/membres/derrick.kondo/cloud_calc.xlsx http://boinc.berkeley.edu/trac/wiki/cloudserver

Outline Cloud Grid Volunteer Computing

Volunteer Computing (Compared to Grids) User Not computer savy Cause errors (accidentaly) (or bring benefits) Shuts BOINC client machine on/off Doesn t have or wants to avoid root access Software Heterogeneous OS (most are Windows) Resource contention Firewalls Hardware Heterogeneous size and type Low bandwidth network, firewalls

Synergy Reasons Ways Use storage as cache to put data closer to end nodes Why Grids should use volunteer computing Most Grids jobs are embarrassingly parallel Order of magnitudes more compute power and storage and more pervasive Overloaded during peak periods Why volunteer computing should use Grids Utilization is still low Results are deterministic (relatively) Performance is deterministic (relatively) Use storage as archival Rerun computation on the fly Offload all runnable jobs to volunteer computing system New hybrid algorithms for non-uniform platforms: allow for global search on volunteers and refined local on Grids Increased compute/storage power Check results (spot checking w/ replication) Replication with reliable performance for failed jobs Support fast turnaround jobs

Bridging Volunteer Computing and Grids Standard API Servers that act as bridge 3 central points of failure 3 job queues: high latency Scaling issues: 1 bridge for M Grids and N BOINC projects Pilot jobs VM for system checkpointing, security, and linux support

Outline Cloud Grid Volunteer Computing

Grids versus Clouds Clouds built for web services Grids built for data-intensive applications Petabytes of data Files used by groups of users

Issues with Cloud data management [Iamnitchi08] Current cost model One size fits all (same true for comp) Data durability: existence Data availability: accessibility Access performance E.g. 1: archiving: need durability E.g. 2: caching: need availability, acces perf.

Summary Cloud vs Volunteer Computing Server hosting is cost-effective Volunteer Computing vs Grid Pilot jobs seem like the ideal approach Cloud vs Grid Clouds must provide different costs models for different qualities of service

Current and Future Work Bring Volunteer Computing to the Cloud C3: Cloud Computational Co-op RightScale Grant NFSC-ANR Grants between China and France Bring the Cloud to Volunteer Computing CloudComputing@home Bridging Grids and BOINC EDGES project (Peter Kacsuk, SZTAKI, Hungary)