GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources



Similar documents
A Survey Study on Monitoring Service for Grid

An approach to grid scheduling by using Condor-G Matchmaking mechanism

Grid Scheduling Dictionary of Terms and Keywords

Cluster, Grid, Cloud Concepts

Introduction to Cloud Computing

Lesson 4 Web Service Interface Definition (Part I)

Virtual machine interface. Operating system. Physical machine interface

locuz.com HPC App Portal V2.0 DATASHEET

Architectures for Big Data Analytics A database perspective

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Tier Architectures. Kathleen Durant CS 3200

Distribution transparency. Degree of transparency. Openness of distributed systems

Chapter 2: Remote Procedure Call (RPC)

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

Parallel Computing with R using GridRPC

Resource Management on Computational Grids

Designing a Windows Server 2008 Applications Infrastructure

CLOUD COMPUTING. When It's smarter to rent than to buy

LSKA 2010 Survey Report Job Scheduler

Manjrasoft Market Oriented Cloud Computing Platform

Planning the Migration of Enterprise Applications to the Cloud

Manjrasoft Market Oriented Cloud Computing Platform

COMP5426 Parallel and Distributed Computing. Distributed Systems: Client/Server and Clusters

SOFT 437. Software Performance Analysis. Ch 5:Web Applications and Other Distributed Systems

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

Information Processing, Big Data, and the Cloud

Status and Evolution of ATLAS Workload Management System PanDA

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Grid Computing Vs. Cloud Computing

Mobile Software Agents: an Overview

Middleware- Driven Mobile Applications

How To Understand The Concept Of A Distributed System

:Introducing Star-P. The Open Platform for Parallel Application Development. Yoel Jacobsen E&M Computing LTD

Principles and characteristics of distributed systems and environments

Provisioning and Resource Management at Large Scale (Kadeploy and OAR)

Sriram Krishnan, Ph.D.

Apache Hama Design Document v0.6

SAS 9.4 Intelligence Platform

Client/Server and Distributed Computing

EMC VPLEX FAMILY. Continuous Availability and Data Mobility Within and Across Data Centers

EMC VPLEX FAMILY. Continuous Availability and data Mobility Within and Across Data Centers

High Performance Cluster Support for NLB on Window

Designing and Building Applications for Extreme Scale Systems CS598 William Gropp

Lecture 26 Enterprise Internet Computing 1. Enterprise computing 2. Enterprise Internet computing 3. Natures of enterprise computing 4.

Dynamic Resource allocation in Cloud

KBase and Globus Online Nexus. Shreyas Cholia NERSC/LBL

Portable Scale-Out Benchmarks for MySQL. MySQL User Conference 2008 Robert Hodges CTO Continuent, Inc.

Vortex White Paper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

LinuxWorld Conference & Expo Server Farms and XML Web Services

Module 12: Microsoft Windows 2000 Clustering. Contents Overview 1 Clustering Business Scenarios 2 Testing Tools 4 Lab Scenario 6 Review 8

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

SIPAC. Signals and Data Identification, Processing, Analysis, and Classification

A Brief Introduction to Apache Tez

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

Distributed Systems Architectures

Agent Languages. Overview. Requirements. Java. Tcl/Tk. Telescript. Evaluation. Artificial Intelligence Intelligent Agents

Grid Computing vs Cloud

SiteCelerate white paper

LEARNING SOLUTIONS website milner.com/learning phone

Installation Guide NetIQ AppManager

Fast Innovation requires Fast IT

Big data management with IBM General Parallel File System

Introduction to grid technologies, parallel and cloud computing. Alaa Osama Allam Saida Saad Mohamed Mohamed Ibrahim Gaber

What s new with IBM Tivoli Workload automation?

VMware vsphere: [V5.5] Admin Training

Final Project Proposal. CSCI.6500 Distributed Computing over the Internet

Resource Utilization of Middleware Components in Embedded Systems

CS 3530 Operating Systems. L02 OS Intro Part 1 Dr. Ken Hoganson

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga

Distributed and Cloud Computing

DB2 Connect for NT and the Microsoft Windows NT Load Balancing Service

TEST AUTOMATION FRAMEWORK

Cloud Based Distributed Databases: The Future Ahead

Petascale Software Challenges. William Gropp

Large scale processing using Hadoop. Ján Vaňo

CMB 207 1I Citrix XenApp and XenDesktop Fast Track

Service-Oriented Architecture and Software Engineering

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Software design (Cont.)

Distributed Systems Lecture 1 1

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

Architectural Patterns. Layers: Pattern. Architectural Pattern Examples. Layer 3. Component 3.1. Layer 2. Component 2.1 Component 2.2.

Windows Server Virtualization An Overview

Transcription:

GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources Jack Dongarra University of Tennessee and Oak Ridge National Laboratory 2/25/2006 1 Overview Grid/NetSolve Grid enabled software Allows easy access to remote resources No magic Someone has to write a program The program may run on a parallel computer NetSolve is a tool for distributed computing 2 1

The Grid 3 What is NetSolve? -server RPC-like system Designed for ease-of-use Interactions mediated by an agent e.g. scheduling, tracking, fault tolerance Dynamic service bindings does not need to have stubs for the services that it wishes to use Multiple clients C, Fortran, Matlab, Java, Mathematica, Octave Extended to support GridRPC API Part of GGF working group defining a standard API 4 2

University of Tennessee s NetSolve Grid Enabled NetSolve is an example of a Grid based hardware/software/data server. Based on a Remote Procedure Call model but with resource discovery, dynamic problem solving capabilities, load balancing, fault tolerance asynchronicity, security, Easy-of-use paramount Its about providing transparent access to resources. Legacy codes easily wrapped into services 5 GridSolve Architecture Agent Resource discovery Scheduling Load balancing Fault tolerance request server list cluster ` data result cluster cluster [x,y,z,info] = gridsolve( dgesv, A, B) cluster Can be from Matlab, C, Fortran, Python, Java, Mathematica, Excel, 6 3

NetSolve Function Based Interface. program embeds call from NetSolve s API to access additional resources. Interface available to C, Fortran, Matlab, and Mathematica. Opaque networking interactions. NetSolve can be invoked using a variety of methods: blocking, nonblocking, task farms, 7 NetSolve Intuitive and easy to use. Matlab Matrix multiply e.g.: A = matmul(b, C); A = netsolve( matmul, B, C); Possible parallelisms hidden. 8 4

NetSolve i. makes request to agent. ii. Agent returns list of servers. iii. tries first one to solve problem. 9 NetSolve Agent Agent Name server for the NetSolve system. Information Service client users and administrators can query the hardware and software services available. Resource scheduler maintains both static and dynamic information regarding the NetSolve server components to use for the allocation of resources 10 5

NetSolve Agent Agent Resource Scheduling (cont d): CPU Performance. Network bandwidth, latency. workload. Problem size/algorithm complexity. Calculates a Time to Compute. for each appropriate server. Notifies client of most appropriate server. 11 Basic Usage Scenarios Grid based numerical library routines User doesn t have to have software library on their machine, LAPACK, SuperLU, ScaLAPACK, PETSc, ARPACK, Task farming applications Pleasantly parallel execution eg Parameter studies Scavenge cycles Remote application execution Complete applications with user specifying input parameters and receiving output Blue Collar Grid Based Computing Does not require deep knowledge of network programming Level of expressiveness right for many users User can set things up, no su required In use today, up to 130 servers on the experimental grid Can plug into Globus, Condor, NINF, 12 6

Task Farming - Multiple Requests To Single Problem A Solution: Many calls to netslnb( ); /* non-blocking */ Farming Solution: Single call to netsolve_farm( ); Request iterates over an array of input parameters. Adaptive scheduling algorithm. Useful for parameter sweeping, and independently parallel applications. 13 Proxies Hide Parallelism Agent NetSolve NetSolve NetSolve System LFC (LAPACK for Clusters), Condor, ScaLAPACK, etc. User maybe unaware of parallel processing 14 7

GridSolve Usage with VGrADS Simple-to-use access to complicated software libraries, with no knowledge of grid based computing. Selection of best machines in your grid to service user request Portability Non-portable calls can be run from a client using RPC like mechanisms as long there is a server provisioned with the code Legacy codes easily wrapped into services Plug into VGrADS Framework Using the vges for resource selection and launching of application: Integrated performance information Integrated monitoring Fault prediction Integrating the software and resource information repositories 15 Virtual Grid Execution System (vges( vges) A Virtual Grid (VG) takes Virtual Grid Execution Shared heterogeneous resources System (vges) implements VG Scalable information service VG Definition Language and provides (vgdl) An hierarchy of applicationdefined aggregations (e.g. VG Find And Bind (vgfab) ClusterOf) with constraints (e.g. VG Monitor (vgmon) processor type) and rankings VG Application Launch (VgLAUNCH+DVCW) VG Resource Info (vgagent) vgdl Description Virtual Grid Successfully Bound Candidates Grid Resource Universe VG VG VG Application Application vgdl VG vges APIs vgfab vgmon DVCW vgagent vglaunch Information Services Resource Managers 16 Grid Resources 8

VGrADS/GridSolve Architecture [x,y,z,info] = gridsolve( solver, A, B) ` Agent request info Data sent & app startedstart server result register vgdl Virtual Grid Transfer query software location Software Repository Process Process Killedrestarted Service Catalog 17 Data Persistence Chain together a sequence of NetSolve requests. Analyze parameters to determine data dependencies. Essentially a DAG is created where nodes represent computational modules and arcs represent data flow. Transmit superset of all input/output parameters and make persistent near server(s) for duration of sequence execution. Schedule individual request modules for execution. 18 9

Data Persistence (cont d) command1(a, sequence(a, B, B) E) result C input A, command2(a, intermediate output C) C result D intermediate output D, input E command3(d, E) netsl_begin_sequence( ); netsl( command1, netsl( command1, A, A, B, B, C); C); netsl( command2, netsl( command2, A, A, C, C, D); D); netsl( command3, netsl( command3, D, D, E, E, F); F); netsl_end_sequence(c, D); result F 19 Current SInRG Infrastructure Industry Partners: Microsoft, Sun, Dell, Cisco, Foundry, Dolphin, Myracom Federated Ownership: CS, Chem Eng., Medical School, Computational Ecology, El. Eng. Real applications, middleware development, logistical networking 20 10

NetSolve- Things Not Touched On Integration with other NMI tools Globus, Condor, Network Weather Service Security Using Kerberos V5 for authentication. Monitor NetSolve Network Track and monitor usage Fault Tolerance Local / Global Configurations Dynamic Nature of s Automated Adaptive Algorithm Selection Dynamic determine the best algorithm based on system status and nature of user problem NetSolve evolving into GridRPC Being worked on under GGF with joint with NINF 21 Software at: http://icl.cs.utk.edu/netsolve/ NetSolve Team Sudesh Agrawal Don Fike Eric Meek Keith Seymour Zhiao Shi Asim YarKhan 22 11