The ENEA-EGEE site: Access to non-standard platforms



Similar documents
AFS in a GRID context

Cloud Computing in ENEA-GRID: Virtual Machines, Roaming Profile, and Online Storage

CNR-INFM DEMOCRITOS and SISSA elab Trieste

The Grid-it: the Italian Grid Production infrastructure

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

The GENIUS Grid Portal

Grid Scheduling Architectures with Globus GridWay and Sun Grid Engine

Roberto Barbera. Centralized bookkeeping and monitoring in ALICE

Status and Integration of AP2 Monitoring and Online Steering

Analisi di un servizio SRM: StoRM

Sun Grid Engine, a new scheduler for EGEE

CMS Dashboard of Grid Activity

Computing in High- Energy-Physics: How Virtualization meets the Grid

Cluster, Grid, Cloud Concepts

Grid Computing With FreeBSD

GridWay: Open Source Meta-scheduling Technology for Grid Computing

Regional SEE-GRID-SCI Training for Site Administrators Institute of Physics Belgrade March 5-6, 2009

Recommendations for Static Firewall Configuration in D-Grid

Parallel Processing using the LOTUS cluster

IGI Portal architecture and interaction with a CA- online

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

The GRID and the Linux Farm at the RCF

GigaSpaces XAP 10.0 Administration Training ADMINISTRATION, MONITORING AND TROUBLESHOOTING GIGASPACES XAP DISTRIBUTED SYSTEMS

The CMS analysis chain in a distributed environment

An approach to grid scheduling by using Condor-G Matchmaking mechanism

Scheduling in SAS 9.3

SEE-GRID-SCI. SEE-GRID-SCI USER FORUM 2009 Turkey, Istanbul December, 2009

24/08/2004. Introductory User Guide

ARC Computing Element

Parallel Computing using MATLAB Distributed Compute Server ZORRO HPC

Sun Grid Engine, a new scheduler for EGEE middleware

locuz.com HPC App Portal V2.0 DATASHEET

Alternative models to distribute VO specific software to WLCG sites: a prototype set up at PIC

MATLAB Distributed Computing Server System Administrator's Guide

SAS Grid: Grid Scheduling Policy and Resource Allocation Adam H. Diaz, IBM Platform Computing, Research Triangle Park, NC

The glite File Transfer Service

CERN local High Availability solutions and experiences. Thorsten Kleinwort CERN IT/FIO WLCG Tier 2 workshop CERN

Grid Scheduling Dictionary of Terms and Keywords

Running Jobs with Platform LSF. Platform LSF Version 8.0 June 2011

NorduGrid ARC Tutorial

XSEDE Service Provider Software and Services Baseline. September 24, 2015 Version 1.2

MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper

vsphere Upgrade vsphere 6.0 EN

automates system administration for homogeneous and heterogeneous networks

Grids Computing and Collaboration

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

Automated deployment of virtualization-based research models of distributed computer systems

The glite Workload Management System

Grid Computing in SAS 9.4 Third Edition

EUROPEAN MIDDLEWARE INITIATIVE

Virtualization of a Cluster Batch System

A Survey Study on Monitoring Service for Grid

Interoperability between Sun Grid Engine and the Windows Compute Cluster

RWTH GPU Cluster. Sandra Wienke November Rechen- und Kommunikationszentrum (RZ) Fotos: Christian Iwainsky

Interwise Connect. Working with Reverse Proxy Version 7.x

Introduction to Running Computations on the High Performance Clusters at the Center for Computational Research

System Requirements Table of contents

Cloud Computing through Virtualization and HPC technologies

Scheduling in SAS 9.4 Second Edition

MATLAB Distributed Computing Server System Administrator s Guide. R2013b

Instruments in Grid: the New Instrument Element

Anwendungsintegration und Workflows mit UNICORE 6

Using NeSI HPC Resources. NeSI Computational Science Team

HPC and Grid Concepts

THE CCLRC DATA PORTAL

Transcription:

V INFNGrid Workshop Padova, Italy December 18-20 2006 The ENEA-EGEE site: Access to non-standard platforms C. Sciò**, G. Bracco, P. D'Angelo, L. Giammarino*, S.Migliori, A. Quintiliani, F. Simoni, S. Podda ENEA INFO, C.R. ENEA Frascati V. E. Fermi 45 Frascati ROMA (Italy) scio@frascati.enea.it (*) CASPUR, Roma Italy, (**) Esse3Esse, Roma, Italy

Outline ENEA-GRID Implementation of ENEA-INFO EGEE site: Access to AIX resources Middleware integration: gssklog and lcmaps Status and Discussion Documentation Work in progress...

ENEA [Italian National Agency for New Technologies, Energy and Environment] 12 Research sites and a Central Computer and Network Service (ENEA-INFO) with 6 computer centres managing multi-platform resources for serial & parallel computation and graphical post processing.

ENEA GRID mission [started 1999]: provide a unified user environment and an homogeneous access method for all ENEA researchers, irrespective of their location. optimize the utilization of the available resources GRID functionalities (unique authentication, authorization, resource access and resource discovery) are provided using mature, multi-platform components: Distributed File System: OpenAFS Resource Manager: LSF Multicluster [www.platform.com] Unified user interface: Java & Citrix Technologies These components constitute the ENEA-GRID Middleware. ENEA GRID

ENEA EGEE site implementation (1) ENEA has implemented an EGEE site as a funded partner in the project Our implementation requirements: provide access to the main ENEA-GRID resourses EGEE users => mapped to some standard ENEA-GRID AFS users Access to AIX platform => NO Middleware on Worker Nodes OpenAFS: minimize local installation LSF resources must be used whenever possible Batch submission: bsub Information: lsload, bhosts, lshosts, bqueues,... Prompt job execution: lsrun Batch queues dedicated to EGEE GRID users

ENEA EGEE site implementation (2) Implementation architecture: UI & SE: Linux, Standard EGEE configuration CE: Linux, modifications to implement AFS pool-account A Transmitter linux element with WN packages installed WN: any platform/os with support for AFS/LSF share user homes with CE and Transmitter using AFS delegate all grid commands concerning file transfer to/from GRID to the Transmitter by means of the LSF lsrun command. AFS security and quota management guarantee reliability

ENEA EGEE site implementation (3) How: Any WN middleware command concerning file transfer is wrapped with a lsrun script. The wrapped WN middleware is located in AFS. The relevant PATHs in the environment of the EGEE job on the WN are modified so that the wrapped middleware is used LSF job-starter script for the EGEE dedicated queues e.g. /afs/enea.it/grd_egee/system/eneaaddon/opt/globus/bin/globus-url-copy globus-url-copy -> wrapper wrapper: lsrun -m transmitter globus-url-copy $* EGEE GRID users require an AFS token: gssklogd modified to be compatible with EGEE middleware EGEE middleware on the CE must properly call gssklog.

EGEE Standard site layout Resource Broker User Interface Storage Element Computing Element: accepts jobs sent by the RB and submits them to the local batch system (LSF) Worker Nodes: they perform the computation and the WN middleware sends back job results to RB User Interface LSF Computing Element Worker Node Worker Node Worker Node gridftp WAN Storage Element DMZ LAN ENEA-GRID

EGEE ENEA site implementation (4) Resource Broker The Transmitter Linux WN has two functions: 1) A complete linux WN where wrapped commands are executed. 2) A proxy system to sends back job results to RB. e.g. /afs/enea.it/.../globus-url-copy-> wrapper wrapper: lsrun -m transmitter globus-url-copy $* Computing Element Linux LSF AFS Transmitter Linux WN WAN DMZ LAN e.g. gridftp inside lsrun wrappers ENEA-GRID

EGEE ENEA site implementation (5) Resource Broker GRID users are AFS users so a modified gssklogd server is required to obtain an AFS token from the X509 certificate. LSF Computing Element Linux Transmitter Linux WN WAN DMZ LAN server gssklogd AFS ENEA-GRID

Gssklog package has been developed by Doug Engert (ANL) to obtain an AFS token using a GSS [Generic Security Service] implementation. It has been used together with Globus Toolkit GSSAPI to obtain an AFS token from a X509 certificate.ftp://achilles.ctd.anl.gov/pub/dee The package include a daemon: [gssklogd] and a client [gssklog]. gssklogd has been modified to provide tokens for pool account users on the basis of the content of the VO information contained in the user certificate. The daemon requires now also the VO certificate. The afsgrid-mapfile syntax has been extended, e.g. for ENEA VO: "/VO=enea".enea/020/ gssklogd/gssklog The client gssklog provides always the principal name: gssklog -server server_address -principal enea004

lcmaps_afs LCAS [Local Centre Authorization Service] and LCMAPS [Local Credential Mapping Service] are the EGEE services that make the connection between GRID users certificates and local UNIX userids on the local site. LCMAPS [Local Credential Mapping Service] is a modular service. A lcmaps_afs module exists with the purpose to obtain an AFS tokens for the local mapped user using gssklog but the standard module is not compatible with VOMS pool account A modified module has been prepared to achieve compatibility: gssklog is invoked always with -principal parameter

EGEE ENEA site implementation (6) Resource Broker GRID users are AFS users so a modified gssklogd server is required to obtain an AFS token from the X509 certificate. AFS can be used to provide to the gssklogd daemon the information about VO and CA These information are automatically updated by the CE VO & CA informations server gssklogd LSF Computing Element Linux AFS Transmitter Linux WN WAN DMZ LAN ENEA-GRID

Status and discussion ENEA Linux site: Glite 3.0 certified CE/SE: egce.frascati.enea.it / egse.frascati.enea.it WN: 14 Linux P4, 1.8 Ghz, 1 GB ENEA AIX site: Glite 3.0 certification in progress CE/SE: egceaix.frascati.enea.it / egseaix.frascati.enea.it WN: 96 cpu SP4, 1.9 Ghz BENEFITS of the gateway approach: Any Platform/OS with AFS/LSF LIMITS of the gateway approach Not a completely standard site [but EGEE Certification job runs well] GRID API are not available [ How to Flag it?] Some WN monitoring components are unavailable OS flag not yet implemented [VO tag defined, but not sufficient!]

Documentation An EGEE Technical Note (EGEE-TR-2006-006) is available : Title: "AFS Pool Account Users - GSSKLOG and LCMAPS extension to support AFS users as EGEE pool account users" [http://www.afs.enea.it/project/eneaegee/] The patched source of lcmaps_afs module [lcmaps_afs.c] is available and submitted to lcmaps package developers. A patch to the standard gssklog-0.11 package is available. The patch to gssklog package has been presented to the AFS Best Practices Workshop 2006 [http://pmw.org/afsbpw06] and submitted to Doug Engert.

CE Glite Work in progress CE already installed. Porting of the solution adopted in LCG CE not yet completed MPI Job support ENEA is strongly interested in transparent MPI support by EGEE middleware MPI support must take advantage of the benefits of a shared file system for user HOMEs (as OpenAFS) Access to other platform: ALTIX clusters (32 cpu; 32 Gb shared memory) Tests succesfully performed also on: MacOSX, Irix