D-Grid Grid and e-science in Germany



Similar documents
Web Service Based Data Management for Grid Applications

The Helmholtz Alliance for Astroparticle Physics

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

nestor - Network of Expertise in Long-Term Storage and Long-Term availability of Digital Resources in Germany

Sustainability in Grid-Computing Christian Baun. Die Kooperation von Forschungszentrum Karlsruhe GmbH und Universität Karlsruhe (TH)

How To Understand And Understand Rwth Aachen University

Fundamental Concepts and Models

Internet of Things at Work Plug-and-play for industrial Automation

Forschungszentrum Karlsruhe in der Helmholtz - Gemeinschaft. Holger Marten. Holger. Marten at iwr. fzk. de

Solutions for energy efficient buildings and cities

How can the Future Internet enable Smart Energy?

Bob Jones Technical Director

(Possible) HEP Use Case for NDN. Phil DeMar; Wenji Wu NDNComm (UCLA) Sept. 28, 2015

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley

Experiences with MPTCP in an intercontinental multipathed OpenFlow network

for High Performance Computing

Energy research at KIT

Big Data Services at DKRZ

Deutsches Forschungsnetz

Capturing provenance information with a workflow monitoring extension for the Kieker framework

MASHUPS FOR THE INTERNET OF THINGS

The Integrated Clinical Pathways -Approach Current Requirements to the Knowledge Management in Health Information Systems

HPC technology and future architecture

2 nd Israeli-German Conference on the Commercialization of Future Technologies. Organic Photovoltaics turn a great idea into a great company

Workprogramme

ICT Solutions for Intelligent Mobility

Development of Tool Extensions with MOFLON

Master of Entrepreneurship (M.A.) at SRH University Berlin. Program Overview. Prof. Dr. Ronald Glasberg WS 2012/13 STATE APPROVED UNIVERSITY

irods and Metadata survey Version 0.1 Date March Abhijeet Kodgire 25th

Enterprise Application Integration (EAI) Techniques

Oracle WebLogic Foundation of Oracle Fusion Middleware. Lawrence Manickam Toyork Systems Inc

How To Develop A Business Model For Big Data Driven Innovation

ICT 9: Tools and Methods for Software Development

Scientific and Technical Applications as a Service in the Cloud

Data Semantics Aware Cloud for High Performance Analytics

Resource Management on Computational Grids

UniGR Workshop: Big Data «The challenge of visualizing big data»

Security and Compliance in Clouds

A Reference Architecture for Self-organizing Service-oriented Computing

Opportunities and Challenges in Software Engineering for the Next Generation Automotive

Electrical Engineering and Information Technology Department

Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery

Emergency Light House, a Help Point in the Darkness: Disaster Beacons

StratusLab project. Standards, Interoperability and Asset Exploitation. Vangelis Floros, GRNET

On-demand Provisioning of Workflow Middleware and Services An Overview

Integration strategy

THE CCLRC DATA PORTAL

A Service for Data-Intensive Computations on Virtual Clusters

GYAN VIHAR SCHOOL OF ENGINEERING & TECHNOLOGY M. TECH. CSE (2 YEARS PROGRAM)

Status and Integration of AP2 Monitoring and Online Steering

The NERC DataGrid (NDG)

Julia Fischer, Stefan Naumann, Markus Dick {-.-, s.naumann, m.dick} (at) umwelt-campus.de

EDG Project: Database Management Services

Cluster, Grid, Cloud Concepts

Review of Environmental Informatics Education in Germany and selected European Universities 2

Integration of Sustainable Approaches in the Building Design Process

Transcription:

D-Grid Grid and e-science in Germany Michael Ernst

Orientation Drivers of e-science development are advanced user groups Program primarily oriented towards science, with active participation of commercial sector Fundamental goals are Development of e-science services for the research sector Open to any approach Perspective for future usage Criteria of success Scientific power of Communities Ability to add new Communities Robustness and flexibility of technical approach

Phase 1 (2005 2006/7) Process Establishment of Community-Grids and integrated Middleware-Platform Call 1 picks up work from D-Grid initiative and marks start of this program Therefore bidding process with special emphasis More calls planned to further support this development Phase 2 (2006/7 2009) Development of science-oriented services and self sufficient infrastructure with production quality

Supported will be 3-5 Community specific Grids (CG) Primarily applications that include substantial Grid- and e-science specific R&D tasks Development of application and community specific Middleware and network based services wrt a system solution Application independent solutions Application as proof-of-concept no support for development of applications Duration 3 years, starting on 1 January 2005

Supported will be Grid-Middleware-Integration-Platform (GMIP) Platform based on existing components and a base architecture Extension by integration of Middleware components developed by CGs Some development in tight collaboration with CGs Version control, quality assurance, documentation, operations models Only one IP

LoI 27 total (23 * CGs, 1 * GMIP, 3 *?) Total cost ~ Euro 90 Mio. Request for funding ~ Euro 50 Mio. Communities Astro- and particle physics: 3 Computational Science and Engineering: 6 Medical and Bioinformatics: 3 Climate and earth science: 2 HPC: 1 Information management: 4

Schedule Community Grids and Grid Middleware Integration Platform Applications until 22 October 2004 First response expected by mid November Review Contribution to generic solution Importance of application domain Contribution to grid-based infrastructure Previous work, matching funds, potential for future usage Planned start of all projects: 1 January 2005

HEP CP LoI Theme: Community-Grid für die Simulation und verteilte Datenanalyse in der Teilchenphysik Particle, Nuclear, Astro, Theory Huge amount of data growing exponentially requires new Computing Models and new methods to realize a suitable infrastructure Initial Grid experience with simulation Important to work on improvements wrt Resource Scheduling and Monitoring for collaborative data analysis Data Management based on Metadata Catalogs Integration of collaborating instituts Request for funding: Euro 3 Mio. (~ 46 FTE-years) Duration: 3 years

HEP CP Partners DESY, FHG-IPSI, FZK, GSI, Hochschule Niederrhein, HUB, LMU Muenchen, NIC, Uni- Dortmund (CEI, Physik), Uni-Freiburg, Uni- Mainz, Uni-Wuppertal, TU Dresden, ZAM/FZ Juelich, ZIB Konsortialfuehrer: Matthias Kasemann, DESY

Grid-specific Tasks Data Management: Development of a generic, scalable Storage Element based on standard interfaces Optimized Job Scheduling in data intensive applications: Coscheduling of compute and storage resources Datamining: semantic catalogs (e.g. XML based) as web services Integration of all German Resources of collaboration Experiments in one or more production grids based on the same Middleware Installation and Management: improve usability for institutes (reduce effort) User Support, Monitoring, Bookkeeping: based on efficient job monitoring automatic user support (failure analysis, resubmission strategies, user friendly information systems) Data Analysis in a Distributed Environment: application specific scheduling based on service requirements and availability of data and computing resources, data access and data transfer

Grid-specific Tasks Data Management: Development of a generic, scalable Storage Element based on standard interfaces Optimized Job Scheduling in data intensive applications: Coscheduling of compute and storage resources Datamining: semantic catalogs (e.g. XML based) as web services Integration of all German Resources of collaboration Experiments in one or more production grids based on the same Middleware Installation and Management: improve usability for institutes (reduce effort) User Support, Monitoring, Bookkeeping: based on efficient job monitoring automatic user support (failure analysis, resubmission strategies, user friendly information systems) Data Analysis in a Distributed Environment: application specific scheduling based on service requirements and availability of data and computing resources, data access and data transfer

organized in 3 Work Packages WP1: Data Management WP2: User Support WP3: Data Analysis