Cloud Computing Summary and Preparation for Examination



Similar documents
MTAT Basics of Cloud Computing (3 ECTS) Satish Srirama

Large-scale Data Processing on the Cloud

Mobile & Cloud Computing: Research Challenges. Satish Srirama satish.srirama@ut.ee

Sriram Krishnan, Ph.D.

Scalable Architecture on Amazon AWS Cloud

Developing MapReduce Programs

What is cloud computing?

Cloud Computing: Making the right choices

Jeffrey D. Ullman slides. MapReduce for data intensive computing

Introduction to Cloud Computing

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Introduction to Hadoop

Cloud computing - Architecting in the cloud

Introduction to Cloud Computing

Session 3. the Cloud Stack, SaaS, PaaS, IaaS

Viswanath Nandigam Sriram Krishnan Chaitan Baru

Cloud Computing Technology

How To Compare Cloud Computing To Cloud Platforms And Cloud Computing

ur skills.com

Chapter 2 Cloud Computing

Introduction to Hadoop

Building Out Your Cloud-Ready Solutions. Clark D. Richey, Jr., Principal Technologist, DoD

More AWS and Cloud-based Research at Mobile & Cloud Lab

EEDC. Scalability Study of web apps in AWS. Execution Environments for Distributed Computing

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Lets SAAS-ify that Desktop Application

Challenges for Data Driven Systems

Platforms in the Cloud

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop

CLOUD COMPUTING USING HADOOP TECHNOLOGY

Cloud Computing an introduction

Cloud 101. Mike Gangl, Caltech/JPL, 2015 California Institute of Technology. Government sponsorship acknowledged

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

Cloud Computing. Summary

What Is It? Business Architecture Research Challenges Bibliography. Cloud Computing. Research Challenges Overview. Carlos Eduardo Moreira dos Santos

A programming model in Cloud: MapReduce

Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware

24/11/14. During this course. Internet is everywhere. Frequency barrier hit. Management costs increase. Advanced Distributed Systems Cloud Computing

Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad

Cloud Computing Submitted By : Fahim Ilyas ( ) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

JBoss & Infinispan open source data grids for the cloud era

Big Data and Apache Hadoop s MapReduce

Challenges for cloud software engineering

Cloud Computing and Big Data Analytics for Teaching & Research

Cloud Computing Flying High (or not) Ben Roper IT Director City of College Station

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Users VM A A A. Application. Compute/Storage/Network. VM Virtual Machine. On-Premises Data Center

Software as a Service (SaaS) and Platform as a Service (PaaS) (ENCS 691K Chapter 1)

Scaling in the Cloud with AWS. By: Eli White (CTO & mojolive) eliw.com - mojolive.com

OTM in the Cloud. Ryan Haney

Amazon Web Services. Elastic Compute Cloud (EC2) and more...

A.Prof. Dr. Markus Hagenbuchner CSCI319 A Brief Introduction to Cloud Computing. CSCI319 Page: 1


Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University

Above the Clouds A Berkeley View of Cloud Computing

How To Understand Cloud Computing

Written examination in Cloud Computing

CLOUD COMPUTING. Dana Petcu West University of Timisoara

Cloud Providers, SciCloudand

Cloud Computing Training

CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

HYBRID CLOUD SUPPORT FOR LARGE SCALE ANALYTICS AND WEB PROCESSING. Navraj Chohan, Anand Gupta, Chris Bunch, Kowshik Prakasam, and Chandra Krintz

Sistemi Operativi e Reti. Cloud Computing

Drive new Revenue With PaaS/IaaS. Ruslan Synytsky CTO, Jelastic

A Cost-Evaluation of MapReduce Applications in the Cloud

Cloud Computing. Adam Barker

Certified Cloud Computing Professional VS-1067

The Cloud as a Computing Platform: Options for the Enterprise

PaaS - Platform as a Service Google App Engine

CLOUD COMPUTING. When It's smarter to rent than to buy

Course Design Document: IS429: Cloud Computing and SaaS Solutions. Version 1.0

Cloud Computing using MapReduce, Hadoop, Spark

Cloud Computing An Elephant In The Dark

Public Cloud Offerings and Private Cloud Options. Week 2 Lecture 4. M. Ali Babar

Cloud Computing Services and its Application

Ø Teaching Evaluations. q Open March 3 through 16. Ø Final Exam. q Thursday, March 19, 4-7PM. Ø 2 flavors: q Public Cloud, available to public

Last time. Today. IaaS Providers. Amazon Web Services, overview

Cloud computing. Examples

An Introduction to Cloud Computing Concepts

Large-Scale Web Applications

How To Understand Cloud Computing

Security and Privacy in Cloud Computing

CHAPTER 8 CLOUD COMPUTING

Cloud Hosting. QCLUG presentation - Aaron Johnson. Amazon AWS Heroku OpenShift

Scalable Application. Mikalai Alimenkou

Parallel Databases. Parallel Architectures. Parallelism Terminology 1/4/2015. Increase performance by performing operations in parallel

DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING. Carlos de Alfonso Andrés García Vicente Hernández

From Internet Data Centers to Data Centers in the Cloud

CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)

ArcGIS for Server: In the Cloud

Transcription:

Basics of Cloud Computing Lecture 8 Cloud Computing Summary and Preparation for Examination Satish Srirama

Outline Quick recap of what we have learnt as part of this course How to prepare for the examination 2/18

What is Cloud Computing? Computing as a utility Consumers pay based on their usage Cloud Computing characteristics Illusion of infinite resources No up-front cost Fine-grained billing (e.g. hourly) Gartner: Cloud computing is a style of computing where massively scalable IT-related capabilities are provided as a service across the Internet to multiple external customers 3/18

Cloud Computing -Services Software as a Service SaaS A way to access applications hosted on the web through your web browser Platform as a Service PaaS Provides a computing platform and a solution stack (e.g. LAMP) as a service Infrastructure as a Service IaaS Use of commodity computers, distributed across Internet, to perform parallel processing, distributed storage, indexing and mining of data Virtualization SaaS Facebook, Flikr,Myspace.com, Google maps API, Gmail PaaS Google App Engine, Force.com, Hadoop, Azure, Heroku, etc IaaS Amazon EC2, Rackspace, GoGrid, SciCloud, etc. Level of Abstraction 4/18

Cloud Computing -Themes Massively scalable On-demand & dynamic Only use what you need -Elastic No upfront commitments, use on short term basis Accessible via Internet, location independent Transparent Complexity concealed from users, virtualized, abstracted Service oriented Easy to use Service Level Agreements 5/18

Cloud Computing Progress Short term and long term implications of cloud Economics of cloud users and cloud providers Challenges and opportunities offered by cloud computing 6/18

Cloud Providers Amazon Web Services EC2, S3, EBS, Elastic Load Balancing, Amazon Auto Scale, Amazon CloudWatch, IAM, Elastic Beanstalk, CloudFormation, Simple Workflow Service, Elastic MapReduce Private Cloud enabling technologies Eucalyptus OpenStack Worked with SciCloud PaaS Google AppEngine Windows Azure 7/18

Scaling Applications on the Cloud Two basic models of scaling Vertical scaling, aka Scale-up Horizontal scaling, aka Scale-out Scaling Enterprise Applications in the Cloud Load balancing Types and algorithms Autoscaling 8/18

Economics of Cloud Providers Cloud Computing providers bring a shift from high reliability/availability servers to commodity servers At least one failure per day in large datacenter Why? Significant economic incentives much lower per-server cost Caveat: User software has to adapt to failures Very hard problem! Solution: Replicate data and computation MapReduce & Distributed File System 9/18

MapReduce Programmers specify two functions: map(k, v) <k, v >* reduce(k, v ) <k, v >* All values with the same key are reduced together The execution framework handles everything else Not quite usually, programmers also specify: partition(k, number of parnnons) parnnon for k Often a simple hash of the key, e.g., hash(k ) mod n Divides up key space for parallel reduce operations combine(k, v ) <k, v >* Mini-reducers that run in memory after the map phase Used as an optimization to reduce network traffic 10/18

Hadoop Processing Model Create or allocate a cluster Put data onto the file system (HDFS) Data is split into blocks Replicated and stored in the cluster Run your job Copy Map code to the allocated nodes Move computation to data, not data to computation Gather output of Map, sort and partition on key Run Reduce tasks Results are stored in the HDFS 11/18

MapReduce Examples Distributed Grep Count of URL Access Frequency Reverse Web-Link Graph Inverted Index Distributed Sort 12/18

Synchronization in Hadoop Approach 1: turn synchronization into an ordering problem Sort keys into correct order of computation Partition key space so that each reducer gets the appropriate set of partial results Hold state in reducer across multiple key-value pairs to perform computation Illustrated by the pairs approach in calculating conditional probability of words Approach 2: construct data structures that bring the pieces together Each reducer receives all the data it needs to complete the computation Illustrated by the stripes approach 13/18

Research @ Mobile & Cloud Lab Scientific computing on the cloud Classification on how the algorithms can be adapted to MR Limitations of Hadoop MapReduce Alternatives Mobile Cloud Mobile Cloud Binding Models 14/18

This week in lab Complete the work from previous labs 15/18

Examination Exam I : Monday, 04.05.2015, 10:00 12:00 in room J. Liivi2-122 Exam II : Tuesday, 05.05.2015, 10:00 12:00 in room J. Liivi2-202 16/18

Preparation for Examination One of the earlier exam paper is kept online Slides are mostly self sufficient References are mentioned for further reading Mainly focus at keywords Let us discuss some questions 17/18

References Basics of Cloud Computing - Lectures https://courses.cs.ut.ee/2014/cloud/spring/mai n/lectures 18/18