TUTORIAL. Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler,

Size: px
Start display at page:

Download "TUTORIAL. Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler, b.tweddell}@fz-juelich."

Transcription

1 TUTORIAL Rebecca Breu, Bastian Demuth, André Giesler, Bastian Tweddell (FZ Jülich) {r.breu, b.demuth, a.giesler, September 2006

2 Outline Motivation & History Production UNICORE 5 Features Recent Developments Architecture UNICORE 6 Grid Stards Architecture Summary 2

3 Motivation: Why UNICORE? Scientists need computational storage related resources 3

4 Motivation: Why UNICORE? Supercomputers are managed by Resource Management s (RMSs) that hle the scheduling But: There are many RMSs available A proprietary way of job submitting IBM Loadleveler llsubmit Torque Resource Manager qsub Resource Management Different job description languages (# of nodes, memory requirements ) 4

5 Motivation: Why UNICORE? Solution: Grid UNICORE Define job workflows in abstract manner Immediate portability of job definitions for other systems with other architectures No learn overhead if a new RMS is used Applications across multiple supercomputers/clusters going meta 5

6 Initial UNICORE developments UNICORE 08/ /1999 UNICORE Plus 01/ /2002 EUROGRID 11/ /2004 GRIP 01/ /2004 OpenMolGRID 09/ /2005 6

7 UNICORE in (mostly) European Projects OpenMolGRID GRIDSTART GRIP UniGrids DEISA VIOLA Chemomentum edeisa A-WARE OMII-Europe EGEE-II DGI

8 UNICORE Features 8

9 Recent Developments OGSA-based WSRF-compliant UNICORE 6 alpha Interactive access UNICORE-SSH plugin High-level API for programming Grids DRMAA-based access to RMSs Comfortable configuration tool UNIX-like comm line interface 9

10 UNICORE Live CD Complete out-of-the-box usage of UNICORE 5 Bootable Linux OS with UNICORE 5 pre-installed Does not harm your system sbox scenario For testing, evaluating, AVAILABLE HERE AND AS ISO-IMAGE ON SOURCEFORGE 10

11 UNICORE Architecture Overview Client opt. Firewall SSL Multi-Site Jobs opt. Firewall Authentication opt. Firewall NJS Authorization NJS Abstract NJS Authorization Incarnation Non- Abstract RMS Disc RMS Disc RMS Disc Vsite Usite Vsite Vsite Usite 11

12 UNICORE Architecture Overview Workflow-Engine Client Resource Management Job-Monitoring SSL opt. Firewall File Transfer User Management Application Support Authentication Multi-Site Jobs opt. Firewall opt. Firewall NJS Authorization NJS Abstract NJS Authorization Incarnation Non- Abstract RMS Disc RMS Disc RMS Disc Vsite Usite Vsite Vsite Usite 12

13 UNICORE Architecture Overview Client opt. Firewall SSL Multi-Site Jobs opt. Firewall Authentication opt. Firewall NJS Authorization NJS Abstract NJS Authorization Incarnation RMS Disc Vsite Non- similar to Globus jobmanager Abstract fork LoadLeveler, RMS Disc (Open)PBS(Pro), RMS Disc CCS, LSF, NQE/NQS,... Vsite Vsite Usite CONDOR, GT Usite

14 UNICORE Architecture Overview Client opt. Firewall SSL Multi-Site Jobs opt. Firewall Authentication opt. Firewall NJS Authorization NJS Abstract NJS Authorization Incarnation similar to /etc/grid-security/grid-mapfile Non- Abstract RMS Disc RMS Disc RMS Disc Vsite Usite Vsite Vsite Usite 14

15 UNICORE Architecture Overview Client opt. Firewall SSL Multi-Site Jobs opt. Firewall Authentication opt. Firewall NJS Authorization NJS Abstract NJS Authorization Incarnation Non- Abstract RMS Disc RMS Disc RMS Disc Vsite Usite Vsite Vsite Usite 15

16 UNICORE Client Job Preparation Usites Workflow Management Job Monitoring Vsites 16

17 UNICORE Architecture: Client UNICORE Client Definition of abstract Jobs (not machine-specific) client tier Usite A UNICORE CLI Abstract Jobs AJO UPL over SSL Scientific Applications Usite C Creation of complex workflows (e.g.multi-site jobs for geographically dispersed supercomputers) optional firewall authentication server tier Vsite X Multi-site Jobs Vsite Y Network Job Supervisor authorization incarnation server tier Non-abstract Jobs Target Interface submit / or execution target system tier Vsite Z Resource Management scheduling resource management Usite B 17

18 UNICORE Architecture: Client UNICORE Client Client extensions via application-specific plugins client tier Usite A server tier Vsite X UNICORE CLI Abstract Jobs AJO UPL over SSL Multi-site Jobs Scientific Applications Usite C optional firewall authentication Vsite Y Various plugins have been developed over the last years: Car-Parrinello Molecular Dynamics Plugin (speed up quantum chemical computations) Fluent, Gaussian, MSC Nastran Plugins OpenMolGrid Plugins (Molecular Science) Network Job Supervisor authorization incarnation PovRay Plugin server tier Non-abstract Jobs Target Interface submit / or execution target system tier Vsite Z Resource Management scheduling resource management Usite B 18

19 UNICORE Architecture: Client UNICORE Client client tier Usite A UNICORE CLI Abstract Jobs AJO UPL over SSL Scientific Applications Usite C optional firewall server tier Vsite X Multi-site Jobs authentication Vsite Y server tier Network Job Supervisor authorization incarnation Non-abstract Jobs target system tier Vsite Z Target Interface Resource Management submit / or execution scheduling resource management Usite B Based upon the Arcon Client API (Java-API) 19

20 UNICORE Architecture: UNICORE Client client tier Usite A UNICORE CLI Abstract Jobs AJO UPL over SSL Scientific Applications Usite C Single entry-point for UNICORE Sites (Usites) Contains n Virtual Sites (Vsites) (e.g. provides access to n supercomputers) server tier Vsite X Multi-site Jobs Network Job Supervisor optional firewall authentication Vsite Y authorization incarnation Only ONE OPEN PORT in firewall Authentication of users via X.509 certificates Perfect base to group resources access within Virtual Organizations (VOs) server tier Non-abstract Jobs target system tier Target Interface Resource Management submit / or execution scheduling resource management Virtual Organization G A T E W A Y Vsite Z Usite B 20

21 UNICORE Architecture: NJS UNICORE Client Network Job Supervisor (NJS) client tier UNICORE CLI Scientific Applications Analyse defined Workflows (workflow engine) Usite A Abstract Jobs AJO UPL over SSL Usite C Multi-site, multi-step Jobs Directed Acyclic Graphs optional firewall authentication server tier Vsite X Multi-site Jobs Vsite Y Submit of sub-jobs to other sites over server tier Vsite Z Non-abstract Jobs target system tier Target Interface Network Job Supervisor Resource Management authorization incarnation submit / or execution scheduling resource management Usite B Job Incarnation via Incarnation Database () translate abstract-jobs into machine-specific jobs E.g. location of preinstalled software packages Authorization via Unicore User Database () turn X.509 user certs to users xlogin of machine The complete certificate is checked (not only DN ) 21

22 UNICORE Architecture: UNICORE Client Target Interface client tier Usite A UNICORE CLI Abstract Jobs AJO UPL over SSL Scientific Applications Usite C Represents a real Target (e.g. a supercomputer or storage server) server tier Vsite X server tier Non-abstract Jobs Target Interface Multi-site Jobs Network Job Supervisor optional firewall authentication Vsite Y authorization incarnation submit / or execution Interaction with Resource Management s Torque, PBS Pro, Load Leveler, (UNIX fork), Job submit to Resource Management Scheduling via Resource Management Retrieve job results job output Manage workspace target system tier Vsite Z Resource Management scheduling resource management Usite B 22

23 UNICORE Architecture: UNICORE Client UNICORE CLI Scientific Applications client tier Usite A server tier Vsite X server tier... NoBatch Abstract Jobs AJO UPL over SSL Multi-site Jobs Network Job Supervisor optional firewall authentication Text-based NJS- protocol Target Interfaces PBS (Linux) NQS (IRIX) PBS Pro (HP-UX) LSF (SGI) LoadLeveler (AIX) CCS perl Usite C Vsite Y authorization incarnation submit / or execution Target Interface Each supported RMS is used in conjunction with a target system-specific impl. in perl Perl implementations are stateless daemons Statically configured before startup (RMS admin) Various implementations PBS (Linux), LSF (SGI), PBS-Pro (HP- UX), NQS (IRIX), LoadLeveler (AIX), target system tier Vsite Z Resource Management Comms Resource Management scheduling resource management Usite B 23

24 Framework using DRMAA Based upon Open Source SUN Grid Engine DRMAA impl. Java Bindings of DRMAA specification are used Java-Based Target Interface Core Only one layer of UNICORE is exchanged protocols still the same Platform & RMS comm independent DRMAA - API only provides Interfaces for job submission & management Additional DRMAA Extensions for file transfer Transfer over Unicore Protocol Layer (UPL) Optional transfer over GridFTP UNICORE 5/6 24

25 Initial Interoperability UNICORE & Globus 2.4 Internet UNICORE Network Job Supervisor (NJS) VO=DGI UNICORE DRMAA-based Java Target Interface () Core DRMAA-compliant Globus GRAM Client GridFTP Client Uspace UNICORE-Client Compute Resources GRAM Gatekeeper Globus 2 MDS Stardization necessary Proprietary Protocols Resource Management Firewall UNICORE AJO/UPL, Globus RSL GRAM Job- Manager GridFTP Server 25

26 UNICORE 6 Next generation of UNICORE Based on OGSA (Open Grid s Architecture) Compliant with WS-RF (Web s Resource Framework) Thus interoperability with other Grid middlewares (e.g. Globus 4) Available as alpha release on Sourceforge Beta release in July 2007, final release in end

27 OGSA WS-RF OGSA (Open Grid s Architecture): All components of a grid (storage, compute nodes, ) are represented by Grid s One implementation of OGSA: WS-RF (Web s Resource Framework) s are stardized, they know how they can interact Interoperability over a heterogeneous network 27

28 WS-RF (Web s Resource Framework) Web s themselves are stateless: they retain no data between invocations But: Grid s need to keep track of available storage CPUs, submitted running jobs, WS-RF defines a stard for stateful Web s Web s communicate with Resource s which store data Clients talking to a Web s have to specify the Resource s to be used 28

29 UNICORE 6 Architecture Client GridBean API s APIS GridBean Security Registry Higher Level GridBean Target Factory Atomic s Target Job Mgmt. Storage Mgmt. File Transfer Target Factory Atomic s Target Job Mgmt. Storage Mgmt. File Transfer Higher Level GridBean UNICORE 6.0 Globus, OGSA-based Middleware RMS Disc RMS Disc 29

30 UniGrids Atomic s Interoperability between OGSA based Grid systems Cross authentication for UNICORE Globus Target Factory UNICORE Grid Client Globus Grid Portal UniGrids Atomic s Target Job Management Storage Management OGSA Server File Transfer Enable cross Grid resource brokering Workflows over different Grid s 30

31 Grid Programming Environment (GPE) Clients Interoperable client framework Expert Client with full access to the Grid Lightweight client with application specific interface Portal Client which can be integrated in UPortal, GridSphere, Jetspeed, Expert Client Application Client Portal Client 31

32 Portable Clients GridBeans GPE Clients work on top of UNICOR 6, GTK4 or ChinaGrid Only requirement: Atomic interfaces GridBeans as client plugins are portable without modifications Client API hides underlying infrastructure details 32

33 Open Source under BSD license Ready-to-use for research industry Strong security intuitive GUI Easy installation configuration 33