GridWay: Open Source Meta-scheduling Technology for Grid Computing

Similar documents

The GridWay Meta-Scheduler

Grid Scheduling Architectures with Globus GridWay and Sun Grid Engine

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

Cloud and Virtualization to Support Grid Infrastructures

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

An approach to grid scheduling by using Condor-G Matchmaking mechanism

TUTORIAL. Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler,

Interoperability between Sun Grid Engine and the Windows Compute Cluster

CSF4:A WSRF Compliant Meta-Scheduler

Virtual Machine Management with OpenNebula in the RESERVOIR project

GRID COMPUTING Techniques and Applications BARRY WILKINSON

Cloud Computing. Lecture 5 Grid Case Studies

Sun Grid Engine, a new scheduler for EGEE

VM Management for Green Data Centres with the OpenNebula Virtual Infrastructure Engine

Sun Grid Engine, a new scheduler for EGEE middleware

GT 6.0 GRAM5 Key Concepts

LSKA 2010 Survey Report Job Scheduler

Grid Computing With FreeBSD

The ENEA-EGEE site: Access to non-standard platforms

Mitglied der Helmholtz-Gemeinschaft. System monitoring with LLview and the Parallel Tools Platform

XSEDE Service Provider Software and Services Baseline. September 24, 2015 Version 1.2

GRMS Features and Benefits

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

Elastic Management of Cluster based Services in the Cloud

The OpenNebula Standard-based Open -source Toolkit to Build Cloud Infrastructures

AstroGrid-D WG 5: Resource Management for Grid Jobs

Building Platform as a Service for Scientific Applications

Cluster, Grid, Cloud Concepts

HPC Cloud Computing with OpenNebula

ARC Computing Element

Grid4Utility: A Grid Infrastructure for Utility Computing TIN

Anwendungsintegration und Workflows mit UNICORE 6

Grid Sun Carlo Nardone. Technical Systems Ambassador GSO Client Solutions

CHAPTER 4 PROPOSED GRID NETWORK MONITORING ARCHITECTURE AND SYSTEM DESIGN

Speeding up MATLAB and Simulink Applications

Enabling Technologies for Cloud Computing

Grid Activities in Poland

A High Performance Computing Scheduling and Resource Management Primer

OpenNebula An Innovative Open Source Toolkit for Building Cloud Solutions

Generic Grid Computing Tools for Resource and Project Management

OpenNebula Cloud Case Studies

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand

Design and Building of IaaS Clouds

A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment

Scientific and Technical Applications as a Service in the Cloud

Recommendations for Static Firewall Configuration in D-Grid

Roberto Barbera. Centralized bookkeeping and monitoring in ALICE

How To Compare Clouds And Grids Side By Side

NorduGrid ARC Tutorial

The glite File Transfer Service

Cloud Computing from an Institutional Perspective

Interoperability in Grid Computing

Development of parallel codes using PL-Grid infrastructure.

HPC and Grid Concepts

Processing big data by WS- PGRADE/gUSE and Data Avenue

Clouds vs Grids KHALID ELGAZZAR GOODWIN 531

Transcription:

: Open Source Meta-scheduling Technology for Grid Computing Ruben S. Montero dsa-research.org Open Source Grid & Cluster Oakland CA, May 2008

Contents Introduction What is? Architecture & Components Scheduling Policies Examples of Grid Deployments WSRF Interface for Sun Grid Engine Integration

Introduction Resource selection: Where do I execute my job? Resource preparation: What do I need? Job submission: How do I submit my job? Job monitoring: How is my job doing? Job migration: Is there any better resource? Job termination: How do I get my output?

Introduction Meta-scheduler: Job to resource (other schedulers) matching (execution management). Goal: Optimize the performance according to a given metric (performance model): Global Throughput Resource usage Application (ALS) Stand-alone, HPC, HTC and self-adaptive User usage Grid characteristics Heterogeneity (job requirements) Dynamism (high fault rate, load, availability, price) Site autonomy

What is? The meta-scheduler is a scheduler virtualization layer on top of basic services (GRAM, MDS & GridFTP) For the user A LRM-like environment for submitting, monitoring, and controlling jobs For the developer An standard-base development framework for Grid Applications For the sysadmin A policy-driven job scheduler User-side Grid Accounting For the Grid architect / solution provider A modular component to use different infrastructures A key component to deploy different Grids

What is? Projects Incubator Projects Gridshib DDM LRMA GRAADS Falkon and many many more!

Architecture Application-Infrastructure decoupling DRMAA.C,.java.C,.java Services $> CLI Grid Meta- Grid Middleware Applications Scheduler LRM-like Command Line Interface OGF DRMAA C & JAVA Bindings JSDL (Posix & HTC profiles) Array jobs, DAG workflows and MPI jobs Advanced (Grid-aware) scheduling policies Fault detection & recovery Straightforward deployment (basic services) -based infrastructures Component to deploy different Grids PBS SGE Infrastructure highly dynamic & heterogeneous high fault rate

Components DRMAA library Core Job Pool Host Pool CLI Request Manager Dispatch Manager Job Job Submission Submission Job Job Monitoring Monitoring Job Job Control Control Job Job Migration Migration Scheduler Transfer Manager GridFTP RFT Job Preparation Job Termination Job Migration File Transfer Services Execution Manager pre-ws GRAM WS GRAM Execution Services MDS2 Information Manager MDS2 GLUE MDS4 Resource Discovery Resource Monitoring Information Services

Scheduling Policies Resource Policies Rank Expressions Fixed Priority User Usage History Failure Rate Grid Scheduling = Job + Resource Policies Job Policies Pending Jobs Matching Resources for each job (user) Fixed Priority Urgent Jobs User Share Deadline Waiting Time

Characteristics Enterprise Grids Small scale infrastructures (campus/enterprise) with one meta-scheduler instance Resources within the same administration domain that may be running different LRMS and be geographically distributed Goal & Benefits Integrate heterogeneous systems Improve return of IT investment Performance/Usage maximization

Enterprise Grids Architecture Examples Applications Users Middleware DRMAA interface Portal Command Line Interface One meta-scheduler Grid-wide policies European Space Astronomy Center Data Analysis from space missions DRMAA UABGrid, University of Alabama Bioinformatics applications SGE Cluster Infrastructure PBS Cluster LSF Cluster

Characteristics Partner Grids Large scale infrastructures with one or several meta-schedulers Resources belong to different administrative domains Goal & Benefits Large-scale, secure and reliable sharing of resources Support collaborative projects Access to higher computing power to satisfy peak demands

Architecture Partner Grids Examples Applications Users (Virtual) Organization Users DRMAA interface Science Gateways Multiple metaschedulers (V)Organization-wide policies EGEE-II glite-lhc interoperability Virtual Organizations Fusion: Massive Ray Tracing Biomed: CD-HIT (Worflow) SGE Cluster PBS Cluster LSF Cluster Middleware AstroGrid-D, German Astronomy Community Grid Supercomputing resources Astronomy-specific resources GRAM interface Infrastructure Multiple Admin. Domains Multiple Organizations

Interoperability Users Different Middlewares (e.g. WS and pre-ws) Different Data/Execution architectures Different Information models Integration through adapters Global DN s /WS /WS glite glite /WS /WS SGE Cluster PBS Cluster PBS Cluster SGE Cluster PBS Cluster SGE Cluster

WSRF Interface for Utility Computing Consumer/Provider relationships Applications GRAM RFT MDS Middleware Drivers GLOBUS GRID Standard functionality Abstraction of the underlying DRMS Implemented as GRAM jobmanager Virtualizes a whole Grid Enterprise Grid Partner Grid Other utilities...

WSRF Interface for Utility Computing Users Access to different infrastructures with the same adapters EGEE managed as other resource Applications Delegate identity/ VO certificates In-house/provider gateway Access through legacy applications, portals... glite glite Middleware SGE Cluster PBS Cluster PBS Cluster SGE Cluster Infrastructure Regional infrastructure

Transfer Queues Communicate LRM systems with meta-schedulers (the other way around) Users keep using the same interface, even applications (e.g. DRMAA, site scripts...) Users SGE Cluster Local Resources Job submitted to the cluster but executed in the Grid Transfer Queue Grid Infrastructure (any type) SGE Cluster PBS Cluster LSF Cluster

THANK YOU FOR YOUR ATTENTION!!! Want to try it... TUTORIAL 11 (Thursday 13:30 15:00) The Team Ignacio M. Llorente Javier Fontan Ruben S. Montero Jose Herrera Eduardo Huedo Cuesta Tino Vazquez Jose Luis Vazquez-Poletti