The GEOmon Distributed DataBase (GDDB): A data discovery and download portal for atmospheric composition data




The GEOmon Distributed DataBase (GDDB)
A data discovery and download portal for atmospheric composition data
http://geomon.nilu.no
Presentation at the 2nd MACC general assembly, October 19th 2010
Aasmund Fahre Vik, afv@nilu.no

Outline
- Background
- GDDB system and contributing databases
- GDDB user interface
- Applicability for MACC purposes
- Future prospects for the system

Background of the work
- Previous attempts to create superdatabases for all types of data had failed
- Experiments with metadatabases were not very useful
- GEOSS 10-year implementation plan: distributed data systems
- European contribution to GEOSS from the GEOmon project (coord.: Philippe Ciais)
  - Organise and harmonise atmospheric composition observations in Europe
  - Organise and manage data through a virtual data centre: creation of GDDB

GEOmon project

GEOmon data management vision (text written 3.5 years ago)
- Interact with GEOmon participants to agree on how to manage data generated through the project (Data Management Committee)
- Build upon existing infrastructures and data flow as far as possible
- Balance data originators' intellectual property rights with openness and transparency; manage protocols for data access rights
- Keep the burden on individual data originators (DOs) as low as possible: how can data reporting be simplified?
- Investigate the use of metadata exchange in developing a distributed data centre
- Different approaches for different data types (multi-dimensional data vs. simple time series)
- An extensive review of data sources and routes of data flow to be conducted, to serve as a basis for the choice of solution
- Special data transfer for NRT data: all GEOmon NRT data routed through a common system
- External interfaces: web portal and machine-to-machine interface

Data flow diagrams: aerosol example

GDDB overall design
[Figure: architecture diagram. Labelled elements: GEOSS services; GEOmon Data Centre web portal; GEOmon RD data archive; EBAS WWW and EBAS data archive; CDB WWW and ESA-CDB data archive; several external data archives.]

GDDB web portal design

Data catalogue: a metadatabase
- Contains records of core metadata for datasets stored in databases elsewhere
- One record for each dataset
  - One dataset is defined as one component from one location
  - A physical data file may therefore contain several datasets
- Information on where the dataset is stored and how/if it may be downloaded through GDDB is available
- Oracle DB
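The record definition on this slide (one dataset = one component from one location) can be illustrated with a minimal Python sketch. The field names, the `records_for_file` helper and the sample values are hypothetical and are not taken from the actual GDDB schema, which the slides do not show.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class CatalogueRecord:
    """One catalogue record: one component measured at one location."""
    component: str
    location: str
    database: str                       # archive where the data are stored
    download_url: Optional[str] = None  # None: not downloadable through the portal


def records_for_file(database: str, location: str, components: List[str],
                     download_url: Optional[str] = None) -> List[CatalogueRecord]:
    """A physical data file with several components yields several records."""
    return [CatalogueRecord(c, location, database, download_url) for c in components]


# Hypothetical file holding two components from one station.
records = records_for_file("ExampleDB", "StationA", ["sulphate", "nitrate"])
for record in records:
    print(record)
```

Keeping the record at the component-and-location level is what allows one file to appear as several independently searchable datasets.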

Data catalogue generator
- A series of Perl scripts that prepare metadata and insert (meta)data into the catalogue
- Different approach for each of the archives linked to GDDB
- Metadata, especially component names and locations, are harmonised, and original naming is converted using a GDDB naming convention
- Exchange of metadata is normally done through simple text files made available by the contributing databases
- Metadata are harvested routinely (cron job) and the data catalogue is updated automatically
- Synchronises with external databases once per day
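The harvesting and harmonisation step described above might look roughly like the following. This is a sketch only: the real generator is written in Perl, and the semicolon-separated file layout, the `NAME_MAP` table and the function names here are assumptions made for illustration, not the GDDB naming convention itself.

```python
import csv
import io

# Hypothetical mapping from archive-specific names to harmonised names.
NAME_MAP = {"so4": "sulphate", "sulfate": "sulphate", "o3": "ozone"}


def harmonise_component(name: str) -> str:
    """Convert an original component name to its harmonised form."""
    key = name.strip().lower()
    return NAME_MAP.get(key, key)  # unknown names pass through, lower-cased


def harvest(text: str, database: str) -> list:
    """Parse a simple metadata text file and return harmonised catalogue rows."""
    reader = csv.DictReader(io.StringIO(text), delimiter=";")
    return [
        {
            "database": database,
            "component": harmonise_component(row["component"]),
            "location": row["location"].strip(),
        }
        for row in reader
    ]


# Hypothetical metadata file as a contributing archive might expose it.
SAMPLE = "component;location\nSO4;StationA\nSulfate;StationB\nNO2;StationA\n"
catalogue = harvest(SAMPLE, "ExampleDB")
for row in catalogue:
    print(row)
```

In a daily cron-driven run, a function like `harvest` would be called once per contributing archive and its output written to the catalogue.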

Contributing databases
- Implemented data connections
  - EMEP, AMAP, Helcom, Osparcom, EUCAARI, EUSAAR, CREATE, HTAP observations, and more
  - GAW-WDCA, GAWSIS-WOUDC, GAWSIS-WDCGG, GAWSIS-WRDC
  - NDACC, Aura Validation Data Centre, Envisat Validation Data Centre, EARLINET
  - Aerocom model median (only aerosol sulphate)
- Yet to be implemented
  - RAMCES (GHG), NRT O3 sondes, more Aerocom

GDDB user interface
Demonstration of: http://geomon.nilu.no
- Two-page system: search and info/download
- Search by Component, Location, Database, Platform, Data type and Matrix
- 4D boundary selection
- Descriptions of terms and usage guides available
- Link to Rapid Delivery Data
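A search that combines a component filter with a 4D boundary selection can be sketched as below. The slide does not define the four dimensions; this sketch assumes latitude, longitude, altitude and time. All class names, field names and catalogue entries are hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Dataset:
    component: str
    database: str
    lat: float
    lon: float
    alt_m: float
    start: date
    end: date


@dataclass(frozen=True)
class Box4D:
    """Assumed 4D boundary: (min, max) pairs for lat, lon, altitude and time."""
    lat: Tuple[float, float]
    lon: Tuple[float, float]
    alt_m: Tuple[float, float]
    time: Tuple[date, date]


def in_box(ds: Dataset, box: Box4D) -> bool:
    """True if the site lies inside the box and the time ranges overlap."""
    return (
        box.lat[0] <= ds.lat <= box.lat[1]
        and box.lon[0] <= ds.lon <= box.lon[1]
        and box.alt_m[0] <= ds.alt_m <= box.alt_m[1]
        and ds.start <= box.time[1]
        and ds.end >= box.time[0]
    )


def search(catalogue: List[Dataset], box: Box4D,
           component: Optional[str] = None) -> List[Dataset]:
    """Filter the catalogue by optional component name and 4D boundary."""
    return [
        ds for ds in catalogue
        if (component is None or ds.component == component) and in_box(ds, box)
    ]


# Hypothetical catalogue entries and search region.
CATALOGUE = [
    Dataset("sulphate", "ExampleDB-1", 58.4, 8.3, 200.0, date(2007, 1, 1), date(2007, 12, 31)),
    Dataset("sulphate", "ExampleDB-2", 10.0, 100.0, 50.0, date(2007, 1, 1), date(2007, 12, 31)),
    Dataset("ozone", "ExampleDB-1", 47.0, 11.0, 3000.0, date(2005, 1, 1), date(2006, 12, 31)),
]
EUROPE_2007 = Box4D(lat=(35.0, 72.0), lon=(-25.0, 45.0), alt_m=(0.0, 5000.0),
                    time=(date(2007, 1, 1), date(2007, 12, 31)))
hits = search(CATALOGUE, EUROPE_2007, component="sulphate")
print([h.database for h in hits])
```

The time test uses interval overlap rather than containment, so a long-running dataset still matches a short query period.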

GDDB info and download page
Demonstration of: http://geomon.nilu.no
- Sorting of metadata results
- Viewing of metadata details
- Access information and login details (for restricted datasets)
- Download of data
  - Sometimes only an http link to the contributing data centre
  - System takes care of all data transfer (http, ftp, web services)
  - Download module works in the background: come back later to retrieve data through a unique URL (dev version only)
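The background download with later retrieval through a unique URL could be modelled along these lines. This is a toy in-process sketch under stated assumptions: the `DownloadQueue` class, the `example.invalid` base URL and the `fetch` callback are all hypothetical, and the slides do not describe how the actual module is built.

```python
import uuid
from concurrent.futures import ThreadPoolExecutor


class DownloadQueue:
    """Runs downloads in the background and hands out a unique retrieval URL."""

    def __init__(self, fetch, base_url="https://example.invalid/retrieve/"):
        self._fetch = fetch        # callable that transfers one dataset
        self._base_url = base_url  # placeholder, not a real endpoint
        self._pool = ThreadPoolExecutor(max_workers=2)
        self._jobs = {}

    def submit(self, dataset_ids):
        """Start a background job and return the URL to come back to."""
        ids = list(dataset_ids)
        token = uuid.uuid4().hex
        self._jobs[token] = self._pool.submit(lambda: [self._fetch(i) for i in ids])
        return self._base_url + token

    def status(self, url):
        """Report 'ready', 'pending' or 'unknown' for a retrieval URL."""
        job = self._jobs.get(url.rsplit("/", 1)[-1])
        if job is None:
            return "unknown"
        return "ready" if job.done() else "pending"

    def retrieve(self, url, timeout=None):
        """Return the downloaded data, waiting up to `timeout` seconds."""
        return self._jobs[url.rsplit("/", 1)[-1]].result(timeout=timeout)


# Stand-in for a real transfer over http, ftp or a web service.
queue = DownloadQueue(lambda dataset_id: f"data for {dataset_id}")
url = queue.submit(["dataset-1", "dataset-2"])
result = queue.retrieve(url, timeout=5)
print(result)
```

The random token in the URL is what lets a user leave the page and return later without holding a session open.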

Applicability for MACC purposes
- GDDB is a powerful data discovery tool!
- An easy way to learn about the existence of observations
- Contains an updated overview of available data from key databases
- Possible to download multiple datasets from several databases simultaneously
- GDDB system is continuously evolving

Applicability for MACC purposes
Possible use cases:
- I am studying a forest fire episode over Europe on August 14-15, 2008 (imaginary event). What data are available to constrain my model?
- I am studying solar proton events in 2007. What stratospheric HNO3 measurements are available?
- Which databases contain aerosol data?
- A group of modellers is comparing sulphate concentrations. We want to download a common reference dataset for 2007.

Future prospects of the GDDB
- GEOmon is funded until April 2011
- The ACTRIS infrastructure project starts in April 2011 and will utilise the GDDB system, which ensures support for five years
- More databases will be added
- Metadata exchange mechanisms will be improved and standardised (ESA DCIO)
- Better support for scripts and automatic operations may be added: something for MACC II?

Thank you for your attention