The EGEE Middleware Architecture

Similar documents
Plateforme de Calcul pour les Sciences du Vivant. SRB & glite. V. Breton.

File Transfer Software and Service SC3

The glite File Transfer Service

WP5 Progress Report. John Gordon EDG Conference Barcelona, May escience Centre

Analisi di un servizio SRM: StoRM

An approach to grid scheduling by using Condor-G Matchmaking mechanism

IT/CZ cluster meeting, Milano, September 23 24, CESNET Report. Aleš Křenek et al.

DSA1.4 R EPORT ON IMPLEMENTATION OF MONITORING AND OPERATIONAL SUPPORT SYSTEM. Activity: SA1. Partner(s): EENet, NICPB. Lead Partner: EENet

CMS Dashboard of Grid Activity

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

The glite File Transfer Service

Experiences with the GLUE information schema in the LCG/EGEE production Grid

Integrating a heterogeneous and shared Linux cluster into grids

A Uniform Job Monitoring Service in Multiple Job Universes

glibrary: Digital Asset Management System for the Grid

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

ARDA Experiment Dashboard

CERN local High Availability solutions and experiences. Thorsten Kleinwort CERN IT/FIO WLCG Tier 2 workshop CERN

Client/Server Grid applications to manage complex workflows

LCG POOL, Distributed Database Deployment and Oracle

Virtual Clusters as a New Service of MetaCentrum, the Czech NGI

The GridWay Meta-Scheduler

The dcache Storage Element

LHC schedule: what does it imply for SRM deployment? CERN, July 2007

Managed GRID or why NFSv4.1 is not enough. Tigran Mkrtchyan for dcache Team

Michael Thomas, Dorian Kcira California Institute of Technology. CMS Offline & Computing Week

OSG PUBLIC STORAGE. Tanya Levshina

Bob Jones Technical Director

dcache - Managed Storage - LCG Storage Element - HSM optimizer Patrick Fuhrmann, DESY for the dcache Team

Grid Scheduling Architectures with Globus GridWay and Sun Grid Engine

Software Process. Oliver Keeble SA3 EGEE-II final EU Review CERN EGEE and glite are registered trademarks

The Grid-it: the Italian Grid Production infrastructure

Global Grid User Support - GGUS - in the LCG & EGEE environment

The EU DataGrid Data Management

The CMS analysis chain in a distributed environment

Data management in EGEE

Improvement Options for LHC Mass Storage and Data Management

Grid Computing in Aachen

Report from SARA/NIKHEF T1 and associated T2s

Data Management. Issues

Computing in High- Energy-Physics: How Virtualization meets the Grid

Status and Evolution of ATLAS Workload Management System PanDA

The EDGeS project receives Community research funding

Database Services for CERN

Advanced School in High Performance and GRID Computing November 2008

GridWay: Open Source Meta-scheduling Technology for Grid Computing

Introduction to Grid computing

Resume. Wenjing. Date of birth: June 11th, 1982 Nationality: Chinese Phone number: Cell phone:

The Big-Data Cloud. Patrick Fuhrmann. On behave of the project team. The BIG DATA Cloud 8 th dcache Workshop, DESY Patrick Fuhrmann 15 May

Cloud Computing. Lecture 5 Grid Case Studies

Marco Mantegazza WebSphere Client Technical Professional Team IBM Software Group. Virtualization and Cloud

The ENEA gateway approach providing EGEE/gLite access to unsupported platforms and operating systems

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. Global Grid User Support - GGUS - within the LCG & EGEE environment

Service Challenge Tests of the LCG Grid

Batch and Cloud overview. Andrew McNab University of Manchester GridPP and LHCb

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical

Regional SEE-GRID-SCI Training for Site Administrators Institute of Physics Belgrade March 5-6, 2009

Instruments in Grid: the New Instrument Element

Peer-to-peer cooperative scheduling architecture for National Grid Infrastructure

DSA1.5 U SER SUPPORT SYSTEM

HADOOP, a newly emerged Java-based software framework, Hadoop Distributed File System for the Grid

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. dcache Introduction

dcache, a managed storage in grid

Windows Server 2008 R2 Hyper-V Server and Windows Server 8 Beta Hyper-V

Science Days Stefan Freitag. 03. November Robotics Research Institute Dortmund University of Technology. Cloud Computing in D-Grid

The Italian Grid Infrastructure (IGI) CGW08 Cracow. Mirco Mazzucato Italian Grid Infrastructure Coordinator INFN CNAF Director

DataNet Flexible Metadata Overlay over File Resources

16th International Conference on Control Systems and Computer Science (CSCS16 07)

Grid Computing: A Ten Years Look Back. María S. Pérez Facultad de Informática Universidad Politécnica de Madrid mperez@fi.upm.es

Cluster, Grid, Cloud Concepts

AN APPROACH TO DEVELOPING BUSINESS PROCESSES WITH WEB SERVICES IN GRID

PRACE WP4 Distributed Systems Management. Riccardo Murri, CSCS Swiss National Supercomputing Centre

Chapter 12 Distributed Storage

(RH 7.3, gcc ,VDT 1.1.6, EDG 1.4.3, GLUE, RLS) Tokyo BNL TAIWAN RAL 20/03/ /03/2003 CERN 15/03/ /03/2003 FNAL 10/04/2003 CNAF

Michał Jankowski Maciej Brzeźniak PSNC

Update on Network Performance Monitoring

SPACI & EGEE LCG on IA64

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

The ENEA-EGEE site: Access to non-standard platforms

IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand

DCMS Tier 2/3 prototype infrastructure

EGEE a worldwide Grid infrastructure

For large geographically dispersed companies, data grids offer an ingenious new model to economically share computing power and storage resources

Managing a Tier-2 Computer Centre with a Private Cloud Infrastructure

towards EGI Operations Portal

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

EUROPEAN MIDDLEWARE INITIATIVE

CONDOR And The GRID. By Karthik Ram Venkataramani Department of Computer Science University at Buffalo

Distributed Database Access in the LHC Computing Grid with CORAL

ViSION Status Update. Dan Savu Stefan Stancu. D. Savu - CERN openlab

Tier0 plans and security and backup policy proposals

A Web-based Portal to Access and Manage WNoDeS Virtualized Cloud Resources

GRID COMPUTING Techniques and Applications BARRY WILKINSON

Status and Integration of AP2 Monitoring and Online Steering

Recommendations for Static Firewall Configuration in D-Grid

Global Grid User Support - GGUS - start up schedule

Welcome to the User Support for EGEE Task Force Meeting

Authorization Strategies for Virtualized Environments in Grid Computing Systems

Data Grid Storage Systems - An Analysis of Security

Transcription:

LCG PEB 31 August 2004 www.eu-egee.org The EGEE Middleware Architecture Erwin Laure EGEE Deputy Middleware Manager EGEE is a project funded by the European Union under contract INFSO-RI-508833

Contents The glite components Architectural design Current Implementations From Prototype to Release Current prototype status Integration and testing Release plan Relationship to LCG-2 LCG PEB, 31-August-2004-2

Architecture Guiding Principles Lightweight (existing) services Easily and quickly deployable Use existing services where possible as basis for re-engineering Interoperability Allow for multiple implementations Resilience and Fault Tolerance Co-existence with deployed infrastructure Run as an application (e.g. on LCG-2; Grid3) Reduce requirements on site components Basically globus and SRM Co-existence (and convergence) with LCG-2 and Grid3 are essential for the EGEE Grid service Service oriented approach WSRF still being standardized No mature WSRF implementations exist to date, no clear picture about the impact of WSRF hence: start with plain WS WSRF compliance is not an immediate goal, but we follow the WSRF evolution WS-I compliance is important LCG PEB, 31-August-2004-3

glite Services Service decomposition based on ARDA blueprint LCG PEB, 31-August-2004-4

WMS LCG PEB, 31-August-2004-5

CE Works in push and pull mode Site policy enforcement CEA Computing Element Acceptance JC Job Controller MON Monitoring LRMS Local Resource Management System LCG PEB, 31-August-2004-6

Data Management Scheduled data transfers (like jobs) SRM based storage Reliable file transfer LCG PEB, 31-August-2004-7

Storage Element Interfaces SRM interface Management and control SRM (with possible evolution) Control Posix-like File I/O File Access Open, read, write Not real posix (like rfio) User POSIX API File I/O SRM interface rfio dcap chirp aio Castor dcache NeST Disk LCG PEB, 31-August-2004-8

Storage Element Strategic SE High QoS: reliable, safe.. Has usually an MSS Place to keep important data Needs people to keep running Heavyweight QoS Tactical SE Volatile, lightweight space Enables sites to participate in an opportunistic manner Best effort Collaboration with LCG Deployment strategic tactical Portability LCG PEB, 31-August-2004-9

Catalogs File Catalog Filesystem-like view on logical file names Replica Catalog Keep track of replicas of the same file Metadata Metadata Catalog (Meta Data Catalog) Attributes of files on the logical level Boundary between generic middleware and application layer LFN GUID Master SURL File Catalog SURL Replica Catalog SURL LCG PEB, 31-August-2004-10

Security PEP and PDP will in most cases be part of the secured service, rather than separated services LCG PEB, 31-August-2004-11

Information and Monitoring Exploit GGF GMA architecture Simple producer/consumer model LCG PEB, 31-August-2004-12

Current Implementations WMS AliEn TaskQueue EDG WMS (plus new TaskQueue and Information Supermarket) EDG L&B CE Globus Gatekeeper CondorG (CondorC) Pull component AliEn CE EGEE MON (tbi by IT/CZ cluster) LCG PEB, 31-August-2004-13

Current Implementations Cont d SE External SRM implementations dcache, Castor, LCG disk pool manager (tbi) AliEn aio (re-factored to glite- I/O)( Catalogs AliEn FileCatalog RLS (globus and EDG) Combined Catalog Interface (tbi) Data Scheduling Stork Phedex (concepts) Data mgmt interface and VO data scheduler (tbi) Data Transfer GridFTP Metadata Catalog Simple interface defined Assumption that it will be managed by experiments Information & Monitoring R-GMA For application monitoring the aim is to plug in other systems as well. LCG PEB, 31-August-2004-14

Currents Implementations Cont d Security VOMS as Attribute Authority and VO mgmt myproxy as proxy store gridmapfile and GSI security as enforcement Plan is to move to more finegrained authorization (e.g. ACLs) Plans with globus to provide a set-uid service on CE User Interface AliEn shell CLIs and APIs GAS Move to autogenerated APIs from WSDL New development Package manager Explore existing solutions Accounting EDG HLR LCG PEB, 31-August-2004-15

Deployment considerations Interoperability and co-existence Exploit different service implementations E.g. Castor and dcache SRM implementations Require minimal support from development environment Sites required to run globus and SRM (might not be required for tactical storage) Flexible service deployment Multiple services running on the same physical machine (if possible) Platform support Goal is to have portable middleware Building & Integration on RHEL 3 and windows Initial testing (at least 3 sites) using different Linux flavors (including free distributions) Service autonomy User may talk to services directly or through other services (like access service) Open source software license Based on EDG license LCG PEB, 31-August-2004-16

From Prototype to Release Prototype setup at 2 sites (CERN & Wisconsin) ~45 users registered 2 nd release mid August Many bugfixes New functionalities More nodes (more powerful) being added at CERN EDG WMS being added at CNAF Being ported to SLC3 (by end of September) Second VO being set up (core services at Wisconsin) Will use globus RLS LCG PEB, 31-August-2004-17

From Prototype to Release Continuous integration system set up First results being put forward to testing team SLC3 version of prototype middleware provided SCM convergence being reached SCM plan: https://edms.cern.ch/document/446241 Common service configuration being worked out Deployment scenarios being worked out Testing sites (RAL, NIKHEF, CERN) up and running Focus on prototype installation and testing >50 bugs confirmed Plans being made for focused testing of glite components for preproduction service Test plan being defined Based upon architecture document, release plan, and NA4 requirements https://edms.cern.ch/document/477697/ LCG PEB, 31-August-2004-18

Release Plan Incremental Releases Being tracked at weekly basis https://edms.cern.ch/document/468699 Priorities for testing and integration defined Goal is to have core services by the end of this year LCG PEB, 31-August-2004-19

LCG glite and LCG-2 LCG-2 focus on production, large-scale data handling The service for the 2004 data challenges Provides experience on operating and managing a global grid service Development programme driven by data challenge experience Data handling Strengthening the infrastructure Operation, VO management Evolves to LCG-3 as components progressively replaced with new middleware -- target is to minimise the discontinuities of migration to the new generation Aim for migration plan by end of year LCG-2 (=EGEE-0) 2004 2005 prototyping prototyping product product LCG-3 EGEE-1 glite focus on analysis Developed by EGEE project in collaboration with VDT (US) LHC applications and users closely involved in prototyping & development (ARDA project) Short development cycles Co-existence with LCG-2 Profit as far as possible from LCG- 2 infrastructure, experience Ease deployment avoid separate hardware As far as possible - completed components integrated in LCG-2 improved testing, easier displacement of LCG-2 les robertson - cern-it-20 LCG PEB, 31-August-2004-20

Pre-Production Service Independent components can move from prototype to SA1 for deployment on pre-production service after testing Rule of thumb: release plan + 1 month First components (tentatively end of September): CE Resource Broker File I/O LCG PEB, 31-August-2004-21

Summary Next generation middleware being designed and assembled Prototype first tangible outcome BUT this is a PROTOTYPE not a release! Architectural and design work well advanced Architecture document sent to EU Subject to changes Design document (draft) exists: https://edms.cern.ch/document/487871/ Input from Experiments solicited (by mid September; document needs to be sent to EU end of September) Incremental changes to prototype Feedback from applications and operations essential! Detailed release plan worked out https://edms.cern.ch/document/468699 First components for pre-production service during autumn Continuous integration and testing scheme defined and adopted Technology Risk Will WS allow for all upcoming requirements? Divergence to standards LCG PEB, 31-August-2004-22

Links JRA1 homepage http://egee-jra1.web.cern.ch/egee-jra1/ Architecture document https://edms.cern.ch/document/476451/ Release plan https://edms.cern.ch/document/468699 Prototype installation http://egee-jra1.web.cern.ch/egee-jra1/prototype/testbed.htm Test plan https://edms.cern.ch/document/477697/ Design document https://edms.cern.ch/document/487871/ LCG PEB, 31-August-2004-23

Backup Slides LCG PEB, 31-August-2004-24

Grid Login LCG PEB, 31-August-2004-25

Generic Production Job LCG PEB, 31-August-2004-26

Generic Analysis Job LCG PEB, 31-August-2004-27