Grid and Cloud Computing. María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es



Similar documents
Grid Computing vs Cloud

Cluster, Grid, Cloud Concepts

Data Centers and Cloud Computing

LOGO Resource Management for Cloud Computing

Introduction to grid technologies, parallel and cloud computing. Alaa Osama Allam Saida Saad Mohamed Mohamed Ibrahim Gaber

Grid Computing With FreeBSD

Introduction to Cloud Computing

IOS110. Virtualization 5/27/2014 1

Cloud computing: the state of the art and challenges. Jānis Kampars Riga Technical University

Emerging Technology for the Next Decade

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

Clouds vs Grids KHALID ELGAZZAR GOODWIN 531

Cloud Computing with Red Hat Solutions. Sivaram Shunmugam Red Hat Asia Pacific Pte Ltd.

Cloud Computing #6 - Virtualization

Data Centers and Cloud Computing. Data Centers

Aneka: A Software Platform for.net-based Cloud Computing

DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING. Carlos de Alfonso Andrés García Vicente Hernández

Introduction to Cloud Computing

Cloud Computing Architecture: A Survey

Uses for Virtual Machines. Virtual Machines. There are several uses for virtual machines:

Cloud Computing Submitted By : Fahim Ilyas ( ) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

Cloud and Virtualization to Support Grid Infrastructures

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

Grid Computing Vs. Cloud Computing

Cloud Computing. Chapter 1 Introducing Cloud Computing

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

An approach to grid scheduling by using Condor-G Matchmaking mechanism

Virtualization. Dr. Yingwu Zhu

Cloud Computing an introduction

Business applications:

Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad

Cloud Computing. Chapter 1 Introducing Cloud Computing

Cloud Computing Trends

9/26/2011. What is Virtualization? What are the different types of virtualization.

Distributed and Cloud Computing

CS 695 Topics in Virtualization and Cloud Computing and Storage Systems. Introduction

Cloud Courses Description

Cloud Computing For Distributed University Campus: A Prototype Suggestion

DISTRIBUTED COMPUTER SYSTEMS CLOUD COMPUTING INTRODUCTION

Virtualization and the U2 Databases

2) Xen Hypervisor 3) UEC

Manjrasoft Market Oriented Cloud Computing Platform

A Gentle Introduction to Cloud Computing

Sistemi Operativi e Reti. Cloud Computing

Toward a Unified Ontology of Cloud Computing

Analysis and Research of Cloud Computing System to Comparison of Several Cloud Computing Platforms

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

What Is It? Business Architecture Research Challenges Bibliography. Cloud Computing. Research Challenges Overview. Carlos Eduardo Moreira dos Santos

CLEVER: a CLoud-Enabled Virtual EnviRonment

CHAPTER 8 CLOUD COMPUTING

Grid Scheduling Architectures with Globus GridWay and Sun Grid Engine

Deploying Business Virtual Appliances on Open Source Cloud Computing

Virtualization. Jukka K. Nurminen

INTRODUCTION TO CLOUD COMPUTING CEN483 PARALLEL AND DISTRIBUTED SYSTEMS

How To Understand Cloud Computing

Virtual Machines.

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

How To Compare Clouds And Grids Side By Side

OpenNebula Leading Innovation in Cloud Computing Management

Architectural Implications of Cloud Computing

Introduction to Engineering Using Robotics Experiments Lecture 18 Cloud Computing

How To Understand Cloud Computing

Virtualization. Types of Interfaces

Cloud Computing and Big Data What Technical Writers Need to Know

Assignment # 1 (Cloud Computing Security)

Cloud Infrastructure Pattern

Full and Para Virtualization

Manjrasoft Market Oriented Cloud Computing Platform

Making a Smooth Transition to a Hybrid Cloud with Microsoft Cloud OS

New Data Center architecture

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

CS 695 Topics in Virtualization and Cloud Computing. Introduction

Cloud Computing Overview

Geoff Raines Cloud Engineer

Unit 10b: Introduction to Cloud Computing

21/09/11. Introduction to Cloud Computing. First: do not be scared! Request for contributors. ToDO list. Revision history

Cloud Computing. Dipl.-Wirt.-Inform. Robert Neumann

Cloud Computing. Adam Barker

Lecture 2 Cloud Computing & Virtualization. Cloud Application Development (SE808, School of Software, Sun Yat-Sen University) Yabo (Arber) Xu

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

A cure for Virtual Insanity: A vendor-neutral introduction to virtualization without the hype

Cloud Models and Platforms

COS 318: Operating Systems. Virtual Machine Monitors

Cloud Computing Technology

Lecture 02a Cloud Computing I

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

Private Clouds with Open Source

Infrastructure as a Service (IaaS)

Cloud computing - Architecting in the cloud

Transcription:

Grid and Cloud Computing María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es

Outline Challenges not yet solved in computing Grid computing Cloud computing References 2

Challenges not yet solved in computing 3

Grand Challenge Applications Aerospace: Life sciences: E-commerce: Earth sciences: Biology: 4

Possible Scenarios A biochemist exploits 10,000 computers to screen 100,000 compounds in an hour 1,000 physicists worldwide pool resources for petaop analyses of petabytes of data Civil engineers collaborate to design, execute, & analyze shake table experiments Climate scientists visualize, annotate, & analyze terabyte simulation datasets An emergency response team couples real time data, weather model, population data Source: Slides The Challenges of Grid Computing Ian Foster 5

Means of Solving the Problem Cluster Computing Intranet Computing Cloud Computing Grid Computing

Grid Computing 7

Grid Computing Grid Computing is based on the philosophy of information and electricity sharing, allowing us to access to another kind of heterogeneous and geographically separated resources Grid provides the sharing of: Computational resources Storage elements Specific applications Equipment Other Thus, Grid is based on: Internet protocols Ideas of parallel and distributed computing 8

A Three Point Checklist A Grid is a system that... 1)...coordinates resources that are not subject to a centralized control... 2)...using standard, open, general-purpose protocols and interfaces... 3)...to deliver nontrivial qualities of services. Ian Foster What is the Grid? A Three Point Checklist (2002) 9

The Grid Scenario Flexible, secure, coordinated resource sharing among individuals and institutions Enable communities (virtual organizations) to share geographically distributed resources in order to achieve a common goal In applications which cannot be solved by resources of an only institution Or the results can be achieved faster and/or cheaper 10

Idiosyncrasy of the scenario Dynamic virtual organizations A set of individual and/or institutions which share rules Large or small Static or dynamic Kind of resources Heterogeneous resources Computers, storage, sensors, networks, etc. Coordinated problem solving Distribution Collaboration Trust, policies, negotiation, payment Challenges Security Authentication Authorization Resource access Resource discovery Scheduling Data management 11

Grid Architecture Application High level Middleware EDG, Crossgrid Low level Middleware Globus, Unicore, Legion Operating systems Unix, Linux, Windows Hardware Source: Grid Café: What is the Grid? http://gridcafe.web.cern.ch 12

Basic pillars 13

Need of security Distributed resources No centralized control Different resource providers Each resource provider uses different security policies 14

Security in Grid Generic Security Services (GSS) Authentication, delegation, integrity and confidentiality Public Key Infrastructure (PKI) with X.509 certificates Kerberos Secure Socket Layer (SSL) Grid Security Infrastructure (GSI) Delegation Single Sign-On Proxy certificates 15

Certificate request A user asks for a certificate to a Certification Authority (CA) The CA checks the user identity Then, the CA signs the request, creating a certificate, and returning it to the user Certificates can be cancelled Certificate Revocation List (CRL) The aim of the certificates is described in the certificate policy (CP) 16

Information Systems Provide information on: The Grid itself The user may query about the status and performance of the Grid Grid applications Register and monitor resources Standardization is required to interoperate among different grids projects Globus: MDS (Monitoring and Discovery Service) European Data Grid: R-GMA (Relational Grid Monitoring Architecture) UNICORE: Incarnation Database (IDB) 17

Data Grid Set of storage resources and data retrieval components which allows applications to access data by means of special software mechanisms Data grid problems: Data location Replication I/O performance 18

Data Transfer GridFTP: Protocol to data transfer in a secure way in a grid environment Extends FTP protocol Use Grid Security Infrastructure (GSI) Several storage systems provide GridFTP interfaces: Castor EDG s SRM Reliable File Transfer (RFT): Grid Service which provides interfaces to manage and monitor file transfers by using GridFTP servers 19

Data replication Due to the complexity of a grid environment, the existence of file replicas could be advisable Need of identifying and locating replicas Replica Location Service (RLS): a Grid Service for registering data replicas and later discovering Mappings between logic and fisical identifiers Database for metadata 20

Resource management system Resource Management includes the efficient use of computing and storage resources Processor time Memory Storage Network User-transparent Interacts with the rest of Grid components 21

Job submission UI Data Localization Replica Catalogue Job In/Output Local storage In/Output Workload Manager Job Computing Element Status CE status Inform. Service Grid enabled data transfers/ accesses SE status Storage Element 22

Job queue managers Condor-G: Condor High-Throughput Computing Project http://www.cs.wisc.edu Portable Batch System (PBS) Sun Grid Engine (SGE) 23

mix-and-match Object-oriented Grid Computing approaches Internet-WWW Problem Solving Approach Web-based technologies Market/Computational Economy 24

Grid Projects Globus: http://www.globus.org EGEE: http://www.eu-egee.org/ TeraGrid: http://www.teragrid.org CrossGrid: http://www.crossgrid.org EU-DataGrid: http://www.eu-datagrid.org IrisGrid: http://www.rediris.es/irisgrid mygrid: http://www.mygrid.org.uk/ 25

Cloud Computing 26

Cloud Computing Future scenario: No computing on local computers Third-party compute and storage facilities Cloud Computing: A large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed computing power, storage, platforms, and services are delivered on demand to external customers over the Internet * Is it just a fashion name for grid computing? 27

Cloud computing history Related paradigms Grid and utility computing Software as a Service (SaaS) Earlier antecedents 1961, John McCarthy Computation delivered as public utility 1969, J.C.R. Licklider, ARPANET: Idea of an intergalactic computer network: Access programs and data at any site, from anywhere Source: A history of cloud computing, Arif Mohamed, March 2009 28

Cloud computing history 1999, Salesforce.com Delivering enterprise applications via a website 2002, Amazon web services Suite of cloud-based services including storage and computation 2006, Amazon provided EC2 (Elastic Computing Cloud) Source: A history of cloud computing, Arif Mohamed, March 2009 29

Cloud computing layers Software as a Service (SaaS). Example: Salesforce.com Platform as a Service (PaaS) Example: Microsoft Azure Infrastructure as a Service (IaaS) Example: Amazon Web Services 30

Cloud Deployment Models Private cloud Managed by an organization Community cloud Shared by several organizations Intended to one community Public cloud General public Owned by an organization selling cloud services Hybrid cloud Composed by two or more clouds 31

Cloud Taxonomy 32

Cloud computing characteristics Elasticity: Resource allocation can be increased or decreased according to the demand Scalability: the cloud scales according to the demand Self-service provisioning: Cloud customers accesing cloud services Standardized interfaces: Standard APIs Billing service: A pay-as-you-go model 33

Cloud computing characteristics Large scale Economies of scale Service oriented architecture Stateless Low coupled Modular Semantically interoperable Virtualization It enables elasticity Cost savings Autonomic computing Self-configuring Self-healing Self-optimizing Self-protecting 34

Virtualization Virtual machine: An efficient isolated duplicate of a real machine Gerald J. Popek and Robert P. Goldberg (1974). "Formal Requirements for Virtualizable Third Generation Architectures". Communications of the ACM 17 (7): 412 421 Properties: Origins: CP-40 (IBM 1967) Equivalence: identical behavior Resource control: complete control of the virtualized resources Efficiency: A dominant fraction of machine instructions must be executed without VM intervention Current processing capacity relieves the inefficiency of VM Some processors support virtualization Some important definitions: Host machine: hardware that runs the virtual machine software Host operating system: operating system that runs the virtual machine software Hypervisor: software layer that provides the virtualization Guest system: operating system 35

Virtualization Kinds of VM Process VM: It supports a single process. It provides a platform-independent environment abstracting underlying hardware and/or operating systems System VM: It supports the running of a OS. Hypervisor enables the sharing of resources among several VMs 2 kinds: I (Native VM): Hypervisor running on HW II (Non native VM): Hypervisor running on host OS 36

Native VM (Hypervisor architecture) Proceso 1 Proceso 2 Proceso n Proceso 1 Proceso 2 Proceso n Proceso 1 Proceso 2 Proceso n Llamada a SO alojado SO 1 SO 2 SO m Instrucción E/S de MV MV 1 MV 2 Hipervisor MV m Activación hipervisor Instrucción E/S de HW Hardware 37

Non Native VM (Hosted architecture) Proceso 1 Proceso 2 Proceso n Proceso 1 Proceso 2 Proceso n Llamada a SO alojado Proceso 1 Proceso 2 Proceso n SO 1 MV 1 Hipervisor SO m MV m Instrucción E/S de MV Activación hipervisor Llamada a SO anfitrión SO anfitrión Instrucción E/S de HW Hardware 38

Process VM Proceso MV 1 SO Hardware Proceso MV 2 39 Proceso 1 Proceso 2 Proceso n

Virtualization approaches Full virtualization Unmodified guest operating system Nondisruptive migration to virtualized environments Example: Vmware, a combination of direct execution and binary translation techniques to achieve full virtualization of an x86 system Paravirtualization Modified guest operating system Advantages: No need for binary translation Potential performance advantages for specific workloads requiring modified operating system kernels Problem: It is impossible to modify closed source operating systems (e.g., Microsoft Windows) Example: Xen (open- source) Hardware virtualization support Virtualization extensions to the x86 architecture by Intel (Intel VT) and AMD (AMD- V) New processor instructions to assist virtualization software First-generation hardware: CPU virtualization only Later generations are expected to include memory and I/O virtualization as well Multicore processors also promote the adoption of virtualization This approach reduces the need to paravirtualize guest operating systems 40

Virtualization advantages Cost savings Operational efficiency Flexibility Coexistence of several OS OS debugging Run legacy systems Backup machines 41

Virtualization Examples: VMWare Xen Sun xvm Microsoft Virtual PC Microsoft Virtual Server VM from IBM 42

Autonomic Computing Decreasing the complexity of the environment in order to enhance its performance Based on biological systems, more specifically on the nervous system. Multiple unconscious tasks: Check blood pressure Adjust body temperature 43

Autonomic levels 44

Autonomic features Self-Configuring: Automatic adaptation to dynamic environments Self-Healing: Discovering, diagnosing and reacting to failures according to specific policies Self-Optimizing: Monitoring resources and making decisions according to monitored data Self-Protecting: Detecting and identifying attacks against the system and acting in these situations 45

Advantages of cloud computing Lower computer and software costs Enhanced software updates Unlimited storage capacity 46

Disadvantages of cloud computing Requires a high-speed internet connection Security and confiability of data Not solved yet the execution of HPC apps in cloud computing Interoperability between cloud based systems 47

Grid Computing vs Cloud Computing Business model: Grid computing: project-oriented, in which it is possible to spend an amount of service units, generally CPU hours Example: TeraGrid, proposals for the increasement of computational power Cloud computing: customers pay providers on a consumption basis (such as electricity) Example: EC2 from Amazon (instance-hour consumed), S3 from Amazon (GB-Month of storage) 48

Grid Computing vs Cloud Computing Architecture: Grid Computing Cloud Computing Application Application Collective Platform Resource Connectivity Unified Resource Fabric Fabric 49

Grid Computing vs Cloud Computing Resource management: Grid computing: Batch-based computing model: Use of LRM (local resource managers), such as Condor, PGS or Sun Grid Engine. Data model: location transparency, use of a distributed metadata catalog. Data storage usually depends on a shared file system (PVFS, Lustre). Virtualization is not so important, although there are some initiatives Widely use of Ganglia as monitoring system Cloud computing: Computing model: Resources in the cloud shared by all users. More number of users. Data model: Google s MapReduce system running on top of the Google File system (Replicated chunks of data) Virtualization is key in Cloud Computing Difficult to obtain a high level of detail in monitoring. In the future, clouds will be self-maintained. 50

Grid Computing vs Cloud Computing Programming model: Grid computing: MPICH-G2 GridRPC Workflow systems WSRF Cloud computing: MapReduce model: Map : Applying a specific operation to a set of items, obtaining a new set of items Reduce : Aggregation on a set of items Hadoop: Open source implementation of the MapReduce model Scripting (Java Script, PHP, Python) 51

Grid Computing vs Cloud Computing Security model: Grid computing: GSI Cloud computing: Simpler model and less secure Use of SSL and Web forms A challenge not solved in clouds 52

Grid Computing vs Cloud Computing They share many goals They are different in many aspects But, they are complementary [Cloud computing] is indeed evolved out of Grid Computing and relies on Grid Computing as its backbone and infrastructure support. * 53

References 54

References The Anatomy of the Grid: Enabling Scalable Virtual Organizations. I. Foster, C. Kesselman, S. Tuecke, International J. Supercomputer Applications, 15(3), 2001. The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration. I. Foster, C. Kesselman, J. Nick, S. Tuecke, Open Grid Service Infrastructure WG, Global Grid Forum, June 22, 2002 The Globus Project, http://www.globus.org 55

References Modeling Stateful Resources with Web Services, http://www.globus.org/wsrf The WS-Resource Framework, http://www.globus.org/wsrf [*] "Cloud Computing and Grid Computing 360- Degree Compared," I. Foster, Y. Zhao, I. Raicu, S. Lu, Grid Computing Environments Workshop, 2008. GCE '08, vol., no., pp.1-10, 12-16 Nov. 2008 A history of cloud computing, Arif Mohamed, March 2009 56

References Understanding Full Virtualization, Paravirtualization, and Hardware Assist Technical Paper of Vmware http://www.vmware.com/files/pdf/vmware_paravirtualization.pdf Nimbus project: http://www.nimbusproject.org OpenNebula.org: http://ww.opennebula.org 57

Grid and Cloud Computing María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es