The NERC DataGrid (NDG)
|
|
|
- Victoria Holmes
- 10 years ago
- Views:
Transcription
1 The NERC DataGrid (NDG) Roy Lowry on behalf of the NDG, BADC and BODC. Ray Cramer, Marta Gutierrez, Kerstin Kleese Van Dam, Venkatasiva Kondapalli, Susan Latham, Bryan Lawrence, Kevin O Neill, Ag Stephens, Andrew Woolf British Oceanographic Data Centre
2 Outline NDG Aims and Metadata Taxonomy Demonstration of NDG Discovery Service NDG Security Model Project Status
3 Timelines 2002: e-science arrives at NERC: Legacy Systems with many files and existing access and authorisation systems that cannot easily be replaced. Complex existing DISCOVERY metadata systems. No USE metadata Discovery based on Z39.50 (which never seems to work) Utilisation based on file retrieval. 2004: NERC DataGrid ready to move forward New metadata systems describe data as well as datasets. OAI based harvesting supports scalable FAST data discovery. New authorisation systems under development. 2005: Moving towards utilisation based on metadata, on demand server side behaviours, grid-based back end parallelisation etc NDG release rollout commences
4 Problem to be Addressed by NERC DataGrid British Atmospheric Data Centre Simulations British Oceanographic Data Centre Assimilation
5 NERC DataGrid Overview Internet Link tape robot Online Data Online Data Online Data XML database BADC NDG Wrapper Software Agent Grid User XML database BODC NDG Wrapper XML database Group NDG Wrapper Wider Internet NERC Grid ESG (&other) Applications NDG Web Portal Internet Link Satellite Supercomputer Research Group Data Sources Wider Internet Internet User XML database
6 NDG Metadata Taxonomy
7 NDG Metadata Taxonomy Key Points A is Use metadata built on GML, branded Climate System Modelling Language (CSML) B is a browsable network of Discovery metadata, branded Metadata Objects for Links in Environmental Science (MOLES) D records are conventional dataset Discovery records, currently GCMD DIF (but could be any suitable format such as ISO19115 profile)
8 NDG Metadata Architecture Service based model: clear separation between discovery and use discovery service standards compliant and interoperable
9 NDG Metadata Vocabularies Controlled vocabularies form an important part of NDG metadata Schemas support multiple vocabularies and can therefore include internal maps BODC are developing vanilla web service vocabulary support for NDG (and services will be public) A lot of work is required to rationalise vocabulary requirements across atmospheric and oceanographic domains
10 NDG Discovery Service Data Providers each build a MOLES repository Discovery records (DIFs) generated by X-Query and XSLT and posted in a public OAI repository NDG harvests DIFs to build a central repository, which is queried by discovery web services Portal is one possible interface to these services, but they could equally well be used by software agents Data Providers with a stock of DIFs can post them without using MOLES for light participation
11 NDG Discovery Service
12 NDG Discovery Service
13 NDG Discovery Service
14 NDG Discovery Service Up to three types of service available for each dataset returned Metadata browse (access to MOLES repository) Data service (access to data through CSML) Local service (anything the data host can deliver through a URL) Also possible to display DIF in HTML (human-readable) or XML (machinereadable)
15 NDG Discovery Service
16 NDG Data Service
17 NDG Browse Service
18 Example Local Service
19 NDG Security Certificate based, pass encrypted credentials between user and gatekeeper.
20 Authorisation Role-based access: <dataset> <host> badc.nerc.ac.uk </host> <name>ukmo-obs </name> Signed conditions of use form exists for this dataset <access-requires> researcher <access-requires> <access-requires> ukmo-obs </access-requires> <processing-requires> nerc </processing-requires> </dataset> Key concept: Only hosts that trust each other share data, even within a larger virtual organisation: e.g. at BADC: <trusted> <bodc> <host>ndg.bodc.nerc.ac.uk</host> <attribute remotename= nerc > nerc </attribute> <attribute remotename= ashoe > ashoe </attribute> <attribute remotename= staff > nerc </attribute> <other> bodc </other> </bodc> </trusted>
21 Current Work Further service design underway but implementation details not yet obvious (e.g. GT4 etc). Deployment of CSML to describe observational and model data Building security infrastructure Ongoing MOLES development and population for: Oceanographic data Atmospheric Chemistry data Numerical Modelling data Remote Sensing Data
22 Where are we? Release 0.1 (1 st March 2005) Discovery Service Data extractor service NASA Ames Python API Data Provider tools and documentation CSML documentation and schema MOLES guide and schema MOLES to DIF XQueries and XSLT OAI guide Release 0.2 (23 rd May 2005) Support for DIF 9 (ISO-compliant) Support for MOLES 1.02 Improved tooling (automation for operational use) Improved documentation including new Data Providers Guide Using Discovery Service as a web service Guide ob utilising exist XML database Improved and extended Discovery Service content
23 Where are we? Release 0.3 (6 th June 2005) Security added Support for DIF 9 extensions for model data Dublin Core supported as additional discovery format Release 0.4 (July 2005) Document handling library in python and java Data-provider database package WMS/WCS interface to NetCDF CSML-based data delivery Vocabulary server Secure access guide Release 1.0 (Due September 2005) Pre-operational release Second (operational) phase of project funded from October 2005 to September 2007
Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan
Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:
Use of ISO standards by NERC (a snapshot!)
Use of ISO standards by NERC (a snapshot!) Dr Andrew Woolf [email protected] STFC Rutherford Appleton Laboratory Outline NERC overview The NERC SDI Metadata Data Services Standards activities UK/EU
THE CCLRC DATA PORTAL
THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: [email protected], [email protected] Abstract: The project aims
Semantic Integration of File-based Data for Grid Services
Semantic Integration of File-based Data for Grid Services Andrew Woolf 1, Ray Cramer 3, Marta Gutierrez 2, Kerstin Kleese van Dam 1, Siva Kondapalli 3, Susan Latham 2, Bryan Lawrence 2, Roy Lowry 3, Kevin
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team Photo courtesy Andrew Mahoney NSF Vision What is AON? a
MyOcean Copernicus Marine Service Architecture and data access Experience
MyOcean Copernicus Marine Service Architecture and data access Experience Sophie Besnard CLS, Toulouse, France February 2015 MyOcean Story MyOcean Challenge & Success MyOcean Service MyOcean System MyOcean
OCLC CONTENTdm and the WorldCat Digital Collection Gateway Overview
OCLC CONTENTdm and the WorldCat Digital Collection Gateway Overview Geri Ingram OCLC Community Manager June 2015 Overview Audience This session is for users library staff, curators, archivists, who are
ECM Governance Policies
ECM Governance Policies Metadata and Information Architecture Policy Document summary Effective date 13 June 2012 Last updated 17 November 2011 Policy owner Library Services, ICTS Approved by Council Reviewed
NERC Data Policy Guidance Notes
NERC Data Policy Guidance Notes Author: Mark Thorley NERC Data Management Coordinator Contents 1. Data covered by the NERC Data Policy 2. Definition of terms a. Environmental data b. Information products
Cite My Data M2M Service Technical Description
Cite My Data M2M Service Technical Description 1 Introduction... 2 2 How Does it Work?... 2 2.1 Integration with the Global DOI System... 2 2.2 Minting DOIs... 2 2.3 DOI Resolution... 3 3 Cite My Data
DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories
DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories MacKenzie Smith, Associate Director for Technology Massachusetts Institute of Technology Libraries, Cambridge,
Metadata Quality Control for Content Migration: The Metadata Migration Project at the University of Houston Libraries
Metadata Quality Control for Content Migration: The Metadata Migration Project at the University of Houston Libraries Andrew Weidner University of Houston, USA [email protected] Annie Wu University of Houston,
Pan-European infrastructure for management of marine and ocean geological and geophysical data
Pan-European infrastructure for management of marine and ocean geological and geophysical data By Dick M.A. Schaap Geo-Seas Technical Coordinator March 2010 Supported by the European Commission FP7 - Research
CERN Document Server
CERN Document Server Document Management System for Grey Literature in Networked Environment Martin Vesely CERN Geneva, Switzerland GL5, December 4-5, 2003 Amsterdam, The Netherlands Overview Searching
technische universiteit eindhoven WIS & Engineering Geert-Jan Houben
WIS & Engineering Geert-Jan Houben Contents Web Information System (WIS) Evolution in Web data WIS Engineering Languages for Web data XML (context only!) RDF XML Querying: XQuery (context only!) RDFS SPARQL
EFFECTIVE STORAGE OF XBRL DOCUMENTS
EFFECTIVE STORAGE OF XBRL DOCUMENTS An Oracle & UBmatrix Whitepaper June 2007 Page 1 Introduction Today s business world requires the ability to report, validate, and analyze business information efficiently,
GLOBAL CONSULTING SERVICES TOOLS FOR WEBMETHODS. 2015 Software AG. All rights reserved. For internal use only
GLOBAL CONSULTING SERVICES TOOLS FOR WEBMETHODS CONSULTING TOOLS VALUE CREATING ADD-ONS REDUCE manual effort time effort risk 6 READY-TO- USE TOOLS MORE COMING SOON SIMPLE PRICING & INSTALLATION INCREASE
Enterprise GIS Solutions to GIS Data Dissemination
Enterprise GIS Solutions to GIS Data Dissemination ESRI International User Conference July 13 17, 2009 Wendy M. Turner Senior GIS Engineer & Program Manager Freedom Consulting Group, LLC Building the Enterprise
MIGRATING DESKTOP AND ROAMING ACCESS. Migrating Desktop and Roaming Access Whitepaper
Migrating Desktop and Roaming Access Whitepaper Poznan Supercomputing and Networking Center Noskowskiego 12/14 61-704 Poznan, POLAND 2004, April white-paper-md-ras.doc 1/11 1 Product overview In this whitepaper
In ediscovery and Litigation Support Repositories MPeterson, June 2009
XAM PRESENTATION (extensible TITLE Access GOES Method) HERE In ediscovery and Litigation Support Repositories MPeterson, June 2009 Contents XAM Introduction XAM Value Propositions XAM Use Cases Digital
The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols
The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols Claudia Nicolai; Imma Subirats; Stephen Katz Food and Agriculture Organization of the United
The Key Elements of Digital Asset Management
The Key Elements of Digital Asset Management The last decade has seen an enormous growth in the amount of digital content, stored on both public and private computer systems. This content ranges from professionally
Jamcracker Web Services. David Orchard Standards Architect
Jamcracker Web Services Web Services Position April 12, 2001 David Orchard Standards Architect 1 Web Services Vision Provide an ecosystem of web services Integrate XML interfaces/web Services together
EDG Project: Database Management Services
EDG Project: Database Management Services Leanne Guy for the EDG Data Management Work Package EDG::WP2 [email protected] http://cern.ch/leanne 17 April 2002 DAI Workshop Presentation 1 Information in
General principles and architecture of Adlib and Adlib API. Petra Otten Manager Customer Support
General principles and architecture of Adlib and Adlib API Petra Otten Manager Customer Support Adlib Database management program, mainly for libraries, museums and archives 1600 customers in app. 30 countries
data.bris: collecting and organising repository metadata, an institutional case study
Describe, disseminate, discover: metadata for effective data citation. DataCite workshop, no.2.. data.bris: collecting and organising repository metadata, an institutional case study David Boyd data.bris
2311A: Advanced Web Application Development using Microsoft ASP.NET Course 2311A Three days Instructor-led
2311A: Advanced Web Application Development using Microsoft ASP.NET Course 2311A Three days Instructor-led Introduction This three-day, instructor-led course provides students with the knowledge and skills
Agents and Web Services
Agents and Web Services ------SENG609.22 Tutorial 1 Dong Liu Abstract: The basics of web services are reviewed in this tutorial. Agents are compared to web services in many aspects, and the impacts of
Cross-domain Identity Management System for Cloud Environment
Cross-domain Identity Management System for Cloud Environment P R E S E N T E D B Y: N A Z I A A K H TA R A I S H A S A J I D M. S O H A I B FA R O O Q I T E A M L E A D : U M M E - H A B I B A T H E S
CDI/THREDDS Interoperability: the SeaDataNet developments. P. Mazzetti 1,2, S. Nativi 1,2, 1. CNR-IMAA; 2. PIN-UNIFI
CDI/THREDDS Interoperability: the SeaDataNet developments P. Mazzetti 1,2, S. Nativi 1,2, 1. CNR-IMAA; 2. PIN-UNIFI Outline Interoperability Issues in SeaDataNet A broker solution for CDI/THREDDS interoperability
2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, [email protected])
CSP CHRONOS Compliance statement for ISO 14721:2003 (Open Archival Information System Reference Model) 2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, [email protected]) The international
Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data
Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 [email protected] San Diego Supercomputer Center
THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy 2015-2018. Page 1 of 8
THE BRITISH LIBRARY Unlocking The Value The British Library s Collection Metadata Strategy 2015-2018 Page 1 of 8 Summary Our vision is that by 2020 the Library s collection metadata assets will be comprehensive,
GIS Data Models for INSPIRE and ELF
GIS Data Models for INSPIRE and ELF Paul Hardy Roberto Lucchi EuroSDR/ELF Copenhagen Data Modelling and Model Driven Implementation of Data Distribution 28 Jan 2015 ArcGIS for INSPIRE Extends ArcGIS for
Release 1. ICAPRG604A Create cloud computing services
Release 1 ICAPRG604A Create cloud computing services ICAPRG604A Create cloud computing services Modification History Release Release 1 Comments This version first released with ICA11 Information and Communications
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
PAPER Data retrieval in the PURE CRIS project at 9 universities
PAPER Data retrieval in the PURE CRIS project at 9 universities A practical approach Paper for the IWIRCRIS workshop in Copenhagen 2007, version 1.0 Author Atira A/S Bo Alrø Product Manager [email protected]
EED Task Order. Contract: NNG10HP02C Contractor: Raytheon Task Type:
EED Task Order Title: Studies CMR Phase 0 No-Cost Extension Task Number: 9 Rev 15 Originator: Marinelli Effective Date: Dec 11, 2013 ESDIS POC: Marinelli Task Estimate Cost and Maximum Available Fee Estimate
Functional Requirements for Digital Asset Management Project version 3.0 11/30/2006
/30/2006 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 30 3 32 33 34 35 36 37 38 39 = required; 2 = optional; 3 = not required functional requirements Discovery tools available to end-users:
Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS
Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS Workshop on the Future of Big Data Management 27-28 June 2013 Philip Kershaw Centre for Environmental Data Archival
Integrating SharePoint Sites within WebSphere Portal
Integrating SharePoint Sites within WebSphere Portal November 2007 Contents Executive Summary 2 Proliferation of SharePoint Sites 2 Silos of Information 2 Security and Compliance 3 Overview: Mainsoft SharePoint
How To Manage Your Digital Assets On A Computer Or Tablet Device
In This Presentation: What are DAMS? Terms Why use DAMS? DAMS vs. CMS How do DAMS work? Key functions of DAMS DAMS and records management DAMS and DIRKS Examples of DAMS Questions Resources What are DAMS?
Big Data at ECMWF Providing access to multi-petabyte datasets Past, present and future
Big Data at ECMWF Providing access to multi-petabyte datasets Past, present and future Baudouin Raoult Principal Software Strategist ECMWF Slide 1 ECMWF An independent intergovernmental organisation established
OCLC CONTENTdm. Geri Ingram Community Manager. Overview. Spring 2015 CONTENTdm User Conference Goucher College Baltimore MD May 27, 2015
OCLC CONTENTdm Overview Spring 2015 CONTENTdm User Conference Goucher College Baltimore MD May 27, 2015 Geri Ingram Community Manager Overview Audience This session is for users library staff, curators,
San Jose State University
San Jose State University Fall 2011 CMPE 272: Enterprise Software Overview Project: Date: 5/9/2011 Under guidance of Professor, Rakesh Ranjan Submitted by, Team Titans Jaydeep Patel (007521007) Zankhana
NHS Education for Scotland Knowledge Services Design and Development Framework
NHS Education for Scotland Knowledge Services Design and Development Framework In support of Invitation to Tender: Technical Development of Technical Development of a Platform supporting Communication,
Get More from Microsoft SharePoint with Oracle Fusion Middleware. An Oracle White Paper January 2008
Get More from Microsoft SharePoint with Oracle Fusion Middleware An Oracle White Paper January 2008 NOTE The following is intended to outline our general product direction. It is intended for information
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager [email protected] Structure! Background and overview! OAIS Model! Why
Filestor Digital Asset Management. The way it works
Filestor Digital Asset Management The way it works Filestor is an Advanced Digital Asset Management System Filestor is far more than a Digital Asset Management System as it has been designed to be flexible
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan
Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova
Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel
The Webcast will begin at 1:00pm EST. www.gig-werks.com
SharePoint 2013 & SharePoint Online Security, Compliance & ediscovery The Webcast will begin at 1:00pm EST Today s Presentation: Introduction & About Gig Werks Gig Werks Experience with SharePoint Office
UNLOCKING XBRL CONTENT
UNLOCKING XBRL CONTENT An effective database solution for storing and accessing XBRL documents An Oracle & UBmatrix Whitepaper September 2009 Oracle Disclaimer The following is intended to outline our
Notes about possible technical criteria for evaluating institutional repository (IR) software
Notes about possible technical criteria for evaluating institutional repository (IR) software Introduction Andy Powell UKOLN, University of Bath December 2005 This document attempts to identify some of
ENTERPRISE CONTENT MANAGEMENT. Trusted by Government Easy to Use Vast Scalability Flexible Deployment Automate Business Processes
ENTERPRISE CONTENT MANAGEMENT Trusted by Government Easy to Use Vast Scalability Flexible Deployment Automate Business Processes ENTERPRISE CONTENT MANAGEMENT. Maintain complete control of the information
Flattening Enterprise Knowledge
Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it
Advanced Web Application Development using Microsoft ASP.NET
Key Data Course #: 2311A Number of Days: 3 Format: Instructor-Led Certification Exams: Exam 70-305: Developing and Implementing Web Applications with Microsoft Visual Basic.NET and Microsoft Visual Studio.NET
GRIP:Creating Interoperability between Grids
GRIP:Creating Interoperability between Grids Philipp Wieder, Dietmar Erwin, Roger Menday Research Centre Jülich EuroGrid Workshop Cracow, October 29, 2003 Contents Motivation Software Base at a Glance
K@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
MatchPoint Technical Features Tutorial 21.11.2013 Colygon AG Version 1.0
MatchPoint Technical Features Tutorial 21.11.2013 Colygon AG Version 1.0 Disclaimer The complete content of this document is subject to the general terms and conditions of Colygon as of April 2011. The
dati.culturaitalia.it a Pilot Project of CulturaItalia dedicated to Linked Open Data
dati.culturaitalia.it a Pilot Project of CulturaItalia dedicated to Linked Open Data www.culturaitalia.it Rosa Caffo, Director of Central Institute for the Union Catalogue of Italian Libraries (MiBACT)
Digital Asset Management Developing your Institutional Repository
Digital Asset Management Developing your Institutional Repository Manny Bekier Director, Biomedical Communications Clinical Instructor, School of Public Health SUNY Downstate Medical Center Why DAM? We
Web Services Strategy
Web Services Strategy Agenda What What are are Web Web Services? Services? Web Web Services Services --The The Technologies Technologies Web Web Services Services Compliments Compliments Overall Overall
A Semantic Search Engine for the Storage Resource Broker
PAPER IDENTIFICATION NUMBER 1 A Semantic Search Engine for the Storage Resource Broker Stephen J. Jeffrey and Jane Hunter Abstract Information discovery is looming as a major challenge with the growth
Practical application of SAS Clinical Data Integration Server for conversion to SDTM data
Paper DM03 Practical application of SAS Clinical Data Integration Server for conversion to SDTM data Peter Van Reusel, Business & Decision Life Sciences, Brussels, Belgium Mark Lambrecht, SAS, Tervuren,
Adlib Internet Server
Adlib Internet Server Software for professional collections management in archives, libraries and museums Comprehensive, Flexible, User-friendly Adlib Internet Server Put your data online, the easy way
Web Service Testing. SOAP-based Web Services. Software Quality Assurance Telerik Software Academy http://academy.telerik.com
Web Service Testing SOAP-based Web Services Software Quality Assurance Telerik Software Academy http://academy.telerik.com The Lectors Snejina Lazarova Product Manager Talent Management System Dimo Mitev
An IDL for Web Services
An IDL for Web Services Interface definitions are needed to allow clients to communicate with web services Interface definitions need to be provided as part of a more general web service description Web
Databases & Data Infrastructure. Kerstin Lehnert
+ Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,
