The NERC DataGrid (NDG)

Roy Lowry, on behalf of the NDG, BADC and BODC
Ray Cramer, Marta Gutierrez, Kerstin Kleese Van Dam, Venkatasiva Kondapalli, Susan Latham, Bryan Lawrence, Kevin O'Neill, Ag Stephens, Andrew Woolf
British Oceanographic Data Centre
http://www.bodc.ac.uk

Outline
- NDG Aims and Metadata Taxonomy
- Demonstration of NDG Discovery Service
- NDG Security Model
- Project Status

Timelines
2002: e-Science arrives at NERC
- Legacy systems with many files and existing access and authorisation systems that cannot easily be replaced
- Complex existing DISCOVERY metadata systems; no USE metadata
- Discovery based on Z39.50 (which never seems to work)
- Utilisation based on file retrieval
2004: NERC DataGrid ready to move forward
- New metadata systems describe data as well as datasets
- OAI-based harvesting supports scalable FAST data discovery
- New authorisation systems under development
2005: Moving towards utilisation based on metadata, on-demand server-side behaviours, grid-based back-end parallelisation, etc.
- NDG release rollout commences

Problem to be Addressed by NERC DataGrid
[Figure labels: British Atmospheric Data Centre; Simulations; British Oceanographic Data Centre; Assimilation]
http://ndg.nerc.ac.uk

NERC DataGrid Overview
[Architecture diagram. Labels: Data Sources (Satellite, Supercomputer, Research Group); tape robot; Online Data; XML database and NDG Wrapper at BADC, at BODC and at a research group; NERC Grid; ESG (& other) Applications; NDG Web Portal; Software Agent; Grid User; Internet User; Internet Link; Wider Internet]

NDG Metadata Taxonomy

NDG Metadata Taxonomy Key Points
- A is Use metadata built on GML, branded Climate Science Modelling Language (CSML)
- B is a browsable network of Discovery metadata, branded Metadata Objects for Links in Environmental Science (MOLES)
- D records are conventional dataset Discovery records, currently GCMD DIF (but could be any suitable format, such as an ISO 19115 profile)

NDG Metadata Architecture
- Service-based model
- Clear separation between discovery and use
- Discovery service is standards compliant and interoperable

NDG Metadata Vocabularies
- Controlled vocabularies form an important part of NDG metadata
- Schemas support multiple vocabularies and can therefore include internal maps
- BODC are developing vanilla web service vocabulary support for NDG (and the services will be public)
- A lot of work is required to rationalise vocabulary requirements across the atmospheric and oceanographic domains

NDG Discovery Service
- Data Providers each build a MOLES repository
- Discovery records (DIFs) are generated by XQuery and XSLT and posted in a public OAI repository
- NDG harvests the DIFs to build a central repository, which is queried by discovery web services
- The Portal is one possible interface to these services, but they could equally well be used by software agents
- Data Providers with a stock of DIFs can post them without using MOLES, for light participation
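The harvesting step can be sketched as follows. This is a minimal illustration, not NDG code: it assumes the public OAI repository speaks OAI-PMH 2.0, and the base URL, the `dif` metadata prefix, the `urn:example:dif` namespace and the sample response are all hypothetical. No network access is needed; the sample response stands in for what a provider might return.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

OAI_NS = "http://www.openarchives.org/OAI/2.0/"  # OAI-PMH 2.0 namespace


def list_records_url(base_url, metadata_prefix, resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return f"{base_url}?{urlencode(params)}"


def parse_list_records(xml_text):
    """Return ([(identifier, metadata payload element)], next token or None)."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter(f"{{{OAI_NS}}}record"):
        ident = rec.findtext(f"{{{OAI_NS}}}header/{{{OAI_NS}}}identifier")
        metadata = rec.find(f"{{{OAI_NS}}}metadata")
        payload = metadata[0] if metadata is not None and len(metadata) else None
        records.append((ident, payload))
    token = root.findtext(f".//{{{OAI_NS}}}resumptionToken")
    return records, (token.strip() if token and token.strip() else None)


# Hypothetical response from a data provider's OAI repository.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:dataset-1</identifier></header>
      <metadata>
        <DIF xmlns="urn:example:dif"><Entry_ID>dataset-1</Entry_ID></DIF>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, next_token = parse_list_records(SAMPLE)
```

A harvester would loop, requesting `list_records_url(base, "dif", next_token)` until no resumption token is returned, and store each payload in the central repository.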

NDG Discovery Service
Up to three types of service are available for each dataset returned:
- Metadata browse (access to MOLES repository)
- Data service (access to data through CSML)
- Local service (anything the data host can deliver through a URL)
It is also possible to display the DIF in HTML (human-readable) or XML (machine-readable)
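One way to picture a search result carrying up to three services is a record with three optional endpoints. The sketch below is illustrative only; the class name, field names and URL are hypothetical and do not come from the NDG software.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DiscoveryResult:
    """One dataset returned by a discovery query (hypothetical structure)."""
    dataset_id: str
    browse_url: Optional[str] = None  # metadata browse (MOLES repository)
    data_url: Optional[str] = None    # data service (access through CSML)
    local_url: Optional[str] = None   # local service (any URL the host offers)

    def available_services(self) -> List[str]:
        """Names of the services this dataset actually offers."""
        candidates = [
            ("browse", self.browse_url),
            ("data", self.data_url),
            ("local", self.local_url),
        ]
        return [name for name, url in candidates if url]


result = DiscoveryResult("dataset-1", data_url="http://example.org/data/dataset-1")
services = result.available_services()
```

A portal or a software agent could use `available_services()` to decide which links to offer for each hit.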

NDG Data Service

NDG Browse Service

Example Local Service

NDG Security
- Certificate based: encrypted credentials are passed between user and gatekeeper

Authorisation
Role-based access:

    <dataset>
      <host>badc.nerc.ac.uk</host>
      <name>ukmo-obs</name>
      <access-requires>researcher</access-requires>
      <access-requires>ukmo-obs</access-requires>
      <processing-requires>nerc</processing-requires>
    </dataset>

Note on the slide: a signed conditions-of-use form exists for this dataset.

Key concept: only hosts that trust each other share data, even within a larger virtual organisation. For example, at BADC:

    <trusted>
      <bodc>
        <host>ndg.bodc.nerc.ac.uk</host>
        <attribute remotename="nerc">nerc</attribute>
        <attribute remotename="ashoe">ashoe</attribute>
        <attribute remotename="staff">nerc</attribute>
        <other>bodc</other>
      </bodc>
    </trusted>
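The role-based check and the trusted-host attribute mapping can be sketched in a few lines. This is an illustration under stated assumptions, not the NDG implementation: the function names are hypothetical, the `<other>` element is ignored because its meaning is not given here, and the sketch assumes that a user must hold every role listed in `<access-requires>`.

```python
import xml.etree.ElementTree as ET

DATASET_XML = """<dataset>
  <host>badc.nerc.ac.uk</host>
  <name>ukmo-obs</name>
  <access-requires>researcher</access-requires>
  <access-requires>ukmo-obs</access-requires>
  <processing-requires>nerc</processing-requires>
</dataset>"""

TRUSTED_XML = """<trusted>
  <bodc>
    <host>ndg.bodc.nerc.ac.uk</host>
    <attribute remotename="nerc">nerc</attribute>
    <attribute remotename="ashoe">ashoe</attribute>
    <attribute remotename="staff">nerc</attribute>
    <other>bodc</other>
  </bodc>
</trusted>"""


def load_trust_map(trusted_xml):
    """Map each trusted host to {remote attribute name: local role}."""
    trust = {}
    for site in ET.fromstring(trusted_xml):
        host = site.findtext("host", default="").strip()
        trust[host] = {
            attr.get("remotename"): attr.text.strip()
            for attr in site.findall("attribute")
        }
    return trust


def local_roles(trust, remote_host, remote_attributes):
    """Translate attributes asserted by a remote host into local roles."""
    mapping = trust.get(remote_host)
    if mapping is None:
        return set()  # untrusted host: none of its attributes carry over
    return {mapping[a] for a in remote_attributes if a in mapping}


def may_access(dataset_xml, roles):
    """Assumption: every <access-requires> role must be held."""
    dataset = ET.fromstring(dataset_xml)
    required = {e.text.strip() for e in dataset.findall("access-requires")}
    return required <= set(roles)


trust = load_trust_map(TRUSTED_XML)
bodc_staff_roles = local_roles(trust, "ndg.bodc.nerc.ac.uk", ["staff", "ashoe"])
```

With this mapping, a BODC user presenting the remote attribute `staff` is treated locally as holding `nerc`, while attributes from a host that is not in the trust list map to nothing.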

Current Work
- Further service design underway, but implementation details not yet obvious (e.g. GT4 etc.)
- Deployment of CSML to describe observational and model data
- Building security infrastructure
- Ongoing MOLES development and population for:
  - Oceanographic data
  - Atmospheric chemistry data
  - Numerical modelling data
  - Remote sensing data

Where are we?
Release 0.1 (1st March 2005)
- Discovery Service
- Data extractor service
- NASA Ames Python API
- Data Provider tools and documentation
- CSML documentation and schema
- MOLES guide and schema
- MOLES to DIF XQueries and XSLT
- OAI guide
Release 0.2 (23rd May 2005)
- Support for DIF 9 (ISO-compliant)
- Support for MOLES 1.02
- Improved tooling (automation for operational use)
- Improved documentation, including new:
  - Data Providers Guide
  - Guide to using the Discovery Service as a web service
  - Guide on utilising the eXist XML database
- Improved and extended Discovery Service content

Where are we?
Release 0.3 (6th June 2005)
- Security added
- Support for DIF 9 extensions for model data
- Dublin Core supported as additional discovery format
Release 0.4 (July 2005)
- Document handling library in Python and Java
- Data-provider database package
- WMS/WCS interface to NetCDF
- CSML-based data delivery
- Vocabulary server
- Secure access guide
Release 1.0 (due September 2005)
- Pre-operational release
Second (operational) phase of project funded from October 2005 to September 2007