Data Formats for Long-term Archiving of Climate Model Data at WDC Climate and DKRZ

Size: px
Start display at page:

Download "Data Formats for Long-term Archiving of Climate Model Data at WDC Climate and DKRZ"

Transcription

1 Data Formats for Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager and Jörg Wegner WDC Climate / Max-Planck-Institute for Meteorology, Hamburg MPG e-science Seminar October 25th 26th, 2007 in Berlin, Germany

2 DKRZ: Earth system model development Simulations of past, present and future climate WDC Climate: Long-term data archiving Inter-disciplinary data dissemination

3 Diagram of Climate System

4 Diagram of the Hamburg IPCC- Climate Model ECHAM5/MPI-OM

5 Forcing of Climate Projetions for IPCC AR4

6 Near surface temperature change for the scenarios A1B und B1. Presented is the difference of the 30-year-means minus

7 Comparison of the present-day sea ice cover In March and September (oben) with the climate projection for the scenario A1B (unten) in Additionally the snow over land can be obtained.

8 Spatial resolution of the North Atlantic sector in ECHAM5/MPI-OM

9 Data Volumina in Climate Projections: IPCC AR4: ECHAM5[T63L19]/MPI-OM produces 23 TB/year Climate projection over 240 years ( ): 5,5 TB and appr. 2 months computer run time Future: ECHAM5[T106L31] produces 44 GB/year Climate projection over 240 years ( ): 106 TB, i.e. the complexity is appr. 20 * T63

10 Extrapolated HLRE2 linear archive increase (10 times HLRE) Compute server architectures: C90: Cray C90 / HLRE: NEC SX-6 / MPP: SUN-Cluster / HLRE2: new system (HLRE: Höchstleistungsrechnersystem für die Erdsystemforschung)

11 DKRZ System Diagram GFS lokale lokale Systeme DS CS NW entfernte Systeme

12 HLRE System architecture at DKRZ SX-6 SX-6 IXS SX-6 24 nodes SX-6 SX-6 SX-6 3 * 16/32-48 SX-6 SX-6 SX-6 SX-6 x SX-6 32 SX-6 SX-6 SX-6 X compile user appl x 6 GFS Disk 70 TB x 12 x 48 x 2 x 12 x 112 x 4 x 16 x 12 x 8 x 24 x 32 LAN x 6 x 2 x 12 ApplSrv SUN Az GFS GFS/UVDM UDSN GFS/UVDM UDSN DXUL-DB 8/16 UCFM 2 * 16/32 UCFM Oracle9i archive backup 2 * 8/16 GFS 8/16 x 36 HSM x 6 3 * 4/8 DS test 6 * 4/8 x 20 DBMS UCFM Cache 17 TB x 16 x C x B x 18 T10000 x 8 LTO2 x 2 DBMS Disk 30 TB

13 Data classes Test data from model code development, life cycle: weeks to months Project data from scientific model evaluation and research projects (DKRZ resources at project level), life cycle: 3 5 years Final results as data products for international projects (IPCC) and scientific publications, life cycle: 10 years and longer Data hierarchy levels Temp(orary) scratch discs at compute server Work fixed disc space at project level for evaluation Arch(ive) tape storage space (single copy) with expiration date for project data beyond available disc space Docu(mentation) documented, long-term tape archive (security copy) for data products, focus on interdisciplinary data utilisation, data are fixed and no longer matter of change

14 Tape space distributon to archive classes at DKRZ begin of 2007: part of the work space on tape because GFS too small docu domain consists of WDCC no expiration dates in arch domain, parts of arch domain belongs to docu but not yet documented

15 Data documentation requirements are accomplished by using the WDCC infrastruture CERA-2 metadata model developed in 1999 Catalogue interface: cera.wdc-climate.de Input interface: input.wdc-climate.de CERA-2 metadata content is complete with respect to browse, to discover and to use climate data which are stored in the database system or outside in flat files The WDCC matches international description standards like ISO 19115, Dublin Core or GCMD and is integrated in international data federations Data storage structure assembles storage of climate time series per variable in BLOB data tables. This allows for web-based data catalogue search and data access in small data granules.

16 CERA Data Model Reference Contact Coverage Status Entry Parameter Distribution Data Org Local Adm. Data Access Spatial Reference

17 Coloured columns correspond to BLOB data tables in WDCC. Collections of matrix rows represents storage in model raw data files (complete model output storage time step by storage time step).

18 Data infrastructure integrates data stewardship in the long-term archive Bit-stream preservation Quality assurance Usability enabling

19 Long-term archive data stewardship Bit-stream preservation Secondary tape copies on different tapes and technology at separate location Copy to new tapes after maximum number of tape accesses are reached (Refreshment) Quality assurance Semantic examinations: behavior of a numerical model compared to observations and to other models, part of the scientific evaluation process Syntactic examinations: formal aspects of data archiving and ensurance that data archiving is free of errors as far as possible Consitency between metadata and climate data Completeness of climate data Standard range of values Spatial and temporal data arrangement

20 Long-term archive data stewardship (continued) Usability enabling Complete and searchable documenation of climate data entities (database tables and flat files) in the catalogue system of the WDCC WDCC offers web-based data access to small data granules (individual entries in BLOB DB tables) Archive technology transfer must be downward compatible to keep old data technically readable Data processing tools and data format access libraries must be migrated to new architectures

21 Standard Data Formats (SDFs) at WDC Climate Requirements Self-descriptive (use metadata) Machine independent Should contain compression or packing Benefits SDFs support long-term data preservation SDFs support data exchnage and dissemination SDFs allow for application of standardized data processing tools and packages

22 Data Form a ts at W D C C climate model output GRIB 1 GRIB 2 NetCDF 3.x NetCDF 4.x tools: cdo, cdat, xconv, IDV cdo, cdat, nco, ncl cdat, grads, ncview, G M T convert manipulate visualize

23 GRIB 1 G RIdded Binary -'GRIB' Section 0 -length of message, edition nu m ber Section 1 - product description section Section 2 - grid description section repeated Section 3 - bit map section Section 4 - binary data section -'7777' ds8 55 %grib -ginfo zzz.grb Rec : Position Size : V PDS GDS BMS BDS : Code Level: LType GType 1 : : : : : : : : : : : : : : : : : : : : ds8 56 %grib -gdsinfo zzz.grb Rec : GDS NV PVPL Typ :xsize ysize Lat1 Lon1 Lat2 Lon2 dx dy 1 : : ds8 57 % - co mpressed data -> smallfile size - every 2d field (record) is a G RIB file -> UNIX co m m ands for catenating -library supportfor FORTRAN & C -strong restrictions for header informations - header infor mation coded (num b ers) -need of tables for decoding

24 GRIB 2 General Regulary-distributed Information in Binary form Section 0 -'GRIB'indicator section Section 1 -identification section* Section 2 -localuse section (optional) Section 3 - grid definition section* repeated Section 4 - product definition section* repeated Section 5 - data representation section repeated Section 6 - bit map section Section 7 - data section Section 8 - end section '7777' * Sections 1,3,4 represent the GRIB1 product description section. This splitting, com bined with the option foriterating sections and the concept of templates make the main difference to GRIB1 and keeps GRIB2 very flexible. Concept of templates: You can define templates for grid definition, product definition and data representation by your o wn.

25 GRIB 2exa mple A 500 hpa height field forecasts on a Northern He misphere polar stereographic grid produced by a particular num erical model atforecast hours 12, 24, 36,and 48. These fourfields could be represented by a single GRIB2 message by repeating the sequence of Sections 4to 7 four times, making the appropriate forecast time changes in the Product Definition Section in each iteration of the sequence. Section 0:Indicator Section Section 1:Identification Section Section 2:Local Use Section (optional) Section 3:Grid Definition Section Section 4:Product Definition Section (hour = 12) repetition 1 Section 5:Data Representation Section Section 6:Bit-Map Section Section 7:D ata Sectio n Section 4:Product Definition Section (hour = 24) repetition 2 Section 5:Data Representation Section Section 6:Bit-Map Section Section 7:D ata Sectio n Section 4:Product Definition Section (hour = 36) repetition 3 Section 5:Data Representation Section Section 6:Bit-Map Section Section 7:D ata Sectio n Section 4:Product Definition Section (hour = 48) repetition 4 Section 5:Data Representation Section Section 6:Bit-Map Section Section 7:D ata Sectio n Section 8:End Section Note that since the Grid Definition Section is not repeated, itremains in effectfor allfour forecast hours.

26 NetCDF 3.x NETwork Com mon Data Form - dimensions (1 unlimited possible) - variables & attributes - globalattributes -data netcdf simple_xy { dimensions: x = 6 ; y = 12 ; variables: int data(x,y) ; // global attributes: :C D O = "Climate Data Operators version " ; :source = "E C H A M5.2" ; data: data = 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31,32, 33, 34,35, 36, 37,38, 39, 40, 41,42, 43, 44,45, 46, 47,48, 49, 50,51, 52, 53,54, 55, 56, 57, 58, 59,60, 61, 62,63, 64, 65,66, 67, 68,69, 70, 71 ; } - no co m pression for data -> file size bigger than G RIB1 - data stored n-dim ensional -library supportfor FORTRAN & C -file size => 2 GByte with NetCDF3.6 -no restrictions for headerinfor mations

27 NetCDF 4.x, HDF5 NetCDF-4/HDF5 Format W ith version 4.0 of netc D F, another ne w data for m at was introduced: netcdf-4/hdf5 format. This format is HDF5, with fulluse of the new dimension scales,creation ordering, and other features of HDF5 added in its version release. Multiple unlimited dimensions. Groups to organize data. New types,including com pound types and variable length arrays. ParallelI/O. netcdf4 "exa mple" { group "/" { group "group1" { dataset "set1" { dimension variables data} dataset "set2" { dimension variables data} } group "group2" {... }} netcdf3.x file

28 Tools nco: for NetCDF ncl:for NetC D F3, NetC D F4, G RIB1, GRIB2, HDF4 ncview:for NetCDF eteora.ucsd.edu/~pierce/ncview_ho me_page.html cdo: for GRIB1, NetC DF, ieg, EXTRA, Service w w.m pimet.mpg.de/filead min/software/cdo/ cdat:for GRIB1 (with GrADS controlfile), NetC DF, HDF xconv: for NetCDF, G RIB IDV: for GRIB, NetCDF w w.unidata.ucar.edu/software/ THG HDF5 tools:for HDF w w.hdfgroup.org/products/hdf5_tools/ GrADS: GRIB1 (with controlfile),netcdf3.x ml

Recent Developments at WDC Climate: Limitation of Long-term Archiving at DKRZ

Recent Developments at WDC Climate: Limitation of Long-term Archiving at DKRZ Recent Developments at WDC Climate: Limitation of Long-term Archiving at DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Wolfgang Stahl German Climate Computing

More information

The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network

The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO Joint Meeting Feb. 28th Mar. 4th, 2005, Tokyo Michael Lautenschlager Frank Toussaint, Hans

More information

Big Data Services at DKRZ

Big Data Services at DKRZ Big Data Services at DKRZ Michael Lautenschlager and Colleagues from DKRZ and Scientific Computing Research Group MPI-M Seminar Hamburg, March 31st, 2015 Big Data in Climate Research Big data is an all-encompassing

More information

CMIP5 Data Management CAS2K13

CMIP5 Data Management CAS2K13 CMIP5 Data Management CAS2K13 08. 12. September 2013, Annecy Michael Lautenschlager (DKRZ) With Contributions from ESGF CMIP5 Core Data Centres PCMDI, BADC and DKRZ Status DKRZ Data Archive HLRE-2 archive

More information

Data management and archiving

Data management and archiving Data management and archiving for COP / GOP / D_PHASE 4 th COPS Workshop 25./26.09. 2006 Stuttgart Claudia Wunram Hannes Thiemann 25.-26.09.06 / 1 Data archive Long term data archive for COPS, GOP and

More information

Data-Intensive Science and Scientific Data Infrastructure

Data-Intensive Science and Scientific Data Infrastructure Data-Intensive Science and Scientific Data Infrastructure Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011 Overview Data-intensive science Publishing scientific

More information

Introduction to DKRZ and to the Visualization Server halo

Introduction to DKRZ and to the Visualization Server halo Introduction to DKRZ and to the Visualization Server halo Michael Böttinger Niklas Röber Deutsches Klimarechenzentrum -------------------------------------------------------------- German Climate Computing

More information

Big Data Research at DKRZ

Big Data Research at DKRZ Big Data Research at DKRZ Michael Lautenschlager and Colleagues from DKRZ and Scien:fic Compu:ng Research Group Symposium Big Data in Science Karlsruhe October 7th, 2014 Big Data in Climate Research Big

More information

Nevada NSF EPSCoR Track 1 Data Management Plan

Nevada NSF EPSCoR Track 1 Data Management Plan Nevada NSF EPSCoR Track 1 Data Management Plan August 1, 2011 INTRODUCTION Our data management plan is driven by the overall project goals and aims to ensure that the following are achieved: Assure that

More information

BALTEX II Data Management and Baltic Grid: Status Report

BALTEX II Data Management and Baltic Grid: Status Report BALTEX II Data Management and Baltic Grid: Status Report Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute for Meteorology BALTEX Scientific Steering Group Meeting

More information

WORLD METEOROLOGICAL ORGANIZATION. Introduction to. GRIB Edition1 and GRIB Edition 2

WORLD METEOROLOGICAL ORGANIZATION. Introduction to. GRIB Edition1 and GRIB Edition 2 WORLD METEOROLOGICAL ORGANIZATION Introduction to GRIB Edition1 and GRIB Edition 2 JUNE 2003 GRIB EDITION 1 and GRIB EDITION 2 HISTORY 1. The GRIB code has great advantages in comparison with the traditional

More information

The THREDDS Data Repository: for Long Term Data Storage and Access

The THREDDS Data Repository: for Long Term Data Storage and Access 8B.7 The THREDDS Data Repository: for Long Term Data Storage and Access Anne Wilson, Thomas Baltzer, John Caron Unidata Program Center, UCAR, Boulder, CO 1 INTRODUCTION In order to better manage ever increasing

More information

Interactive Data Visualization with Focus on Climate Research

Interactive Data Visualization with Focus on Climate Research Interactive Data Visualization with Focus on Climate Research Michael Böttinger German Climate Computing Center (DKRZ) 1 Agenda Visualization in HPC Environments Climate System, Climate Models and Climate

More information

CMIP6 Data Management at DKRZ

CMIP6 Data Management at DKRZ CMIP6 Data Management at DKRZ icas2015 Annecy, France on 13 17 September 2015 Michael Lautenschlager Deutsches Klimarechenzentrum (DKRZ) With contributions from ESGF Executive Committee and WGCM Infrastructure

More information

PART 1. Representations of atmospheric phenomena

PART 1. Representations of atmospheric phenomena PART 1 Representations of atmospheric phenomena Atmospheric data meet all of the criteria for big data : they are large (high volume), generated or captured frequently (high velocity), and represent a

More information

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team Photo courtesy Andrew Mahoney NSF Vision What is AON? a

More information

Norwegian Satellite Earth Observation Database for Marine and Polar Research http://normap.nersc.no USE CASES

Norwegian Satellite Earth Observation Database for Marine and Polar Research http://normap.nersc.no USE CASES Norwegian Satellite Earth Observation Database for Marine and Polar Research http://normap.nersc.no USE CASES The NORMAP Project team has prepared this document to present functionality of the NORMAP portal.

More information

Outcomes of the CDS Technical Infrastructure Workshop

Outcomes of the CDS Technical Infrastructure Workshop Outcomes of the CDS Technical Infrastructure Workshop Baudouin Raoult Baudouin.raoult@ecmwf.int Funded by the European Union Implemented by Evaluation & QC function C3S architecture from European commission

More information

DA-09-02a Task Report Data Integration and Analysis System

DA-09-02a Task Report Data Integration and Analysis System Data Integration and Analysis System Report by Toshio Koike, The University of Tokyo, Japan and Rick Lawford, University of Manitoba, Canada ADC Meeting May 2009 Stresa (May. 2009) Sub-task Definition

More information

Concept of long term data archive

Concept of long term data archive Concept of long term data archive for COP / GOP / DPHASE 1 st MAP DPHASE scientific meeting 6.-8. November 2006, Vienna Claudia Wunram Hannes Thiemann cops@zmaw.de 6.-8. Nov 06 / 1 Data archive Long term

More information

FP7 Collaborative Project Grant Agreement 212492

FP7 Collaborative Project Grant Agreement 212492 Instrument: SP1 Cooperation Thematic Priority: Priority Area 1.1.6.3 "Global Change and Ecosystems" FP7 Collaborative Project Grant Agreement 212492 CLARIS LPB and Impact Studies in La Plata Basin DELIVERABLES

More information

Long Term Preservation of Earth Observation Space Data. Preservation Workflow

Long Term Preservation of Earth Observation Space Data. Preservation Workflow Long Term Preservation of Earth Observation Space Data Preservation Workflow CEOS-WGISS Doc. Ref.: CEOS/WGISS/DSIG/PW Data Stewardship Interest Group Date: March 2015 Issue: Version 1.0 Preservation Workflow

More information

Scientific Data Management and Dissemination

Scientific Data Management and Dissemination Federal GIS Conference February 9 10, 2015 Washington, DC Scientific Data Management and Dissemination John Fry Solution Engineer, Esri jfry@esri.com Agenda Background of Scientific Data Management through

More information

The storage features and needs for numerical modeling at ARPA FVG - CRMA

The storage features and needs for numerical modeling at ARPA FVG - CRMA The storage features and needs for numerical modeling at ARPA FVG - CRMA The storage features and needs for numerical modeling at ARPA FVG - CRMA Scientific data management approaches, data analysis and

More information

HYCOM Meeting. Tallahassee, FL

HYCOM Meeting. Tallahassee, FL HYCOM Data Service An overview of the current status and new developments in Data management, software and hardware Ashwanth Srinivasan & Jon Callahan COAPS FSU & PMEL HYCOM Meeting Nov 7-9, 7 2006 Tallahassee,

More information

Integrated Data Viewer (IDV) a visualization framework Yuan Ho Unidata Program Center Boulder, CO Presentation Outline Integrated Data Viewer (IDV) overview The IDV features IDV examples and customized

More information

CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES

CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES CLOUD BASED N-DIMENSIONAL WEATHER FORECAST VISUALIZATION TOOL WITH IMAGE ANALYSIS CAPABILITIES M. Laka-Iñurrategi a, I. Alberdi a, K. Alonso b, M. Quartulli a a Vicomteh-IK4, Mikeletegi pasealekua 57,

More information

Solution Brief: Creating Avid Project Archives

Solution Brief: Creating Avid Project Archives Solution Brief: Creating Avid Project Archives Marquis Project Parking running on a XenData Archive Server provides Fast and Reliable Archiving to LTO or Sony Optical Disc Archive Cartridges Summary Avid

More information

National Snow and Ice Data Center A brief overview and data management projects

National Snow and Ice Data Center A brief overview and data management projects Cooperative Institute for Research in Environmental Sciences University of Colorado National Snow and Ice Data Center A brief overview and data management projects Walt Meier, Mark Parsons, and Mark Serreze

More information

The ORIENTGATE data platform

The ORIENTGATE data platform Seminar on Proposed and Revised set of indicators June 4-5, 2014 - Belgrade (Serbia) The ORIENTGATE data platform WP2, Action 2.4 Alessandra Nuzzo, Sandro Fiore, Giovanni Aloisio Scientific Computing and

More information

A DISTRIBUTED CATALOG AND DATA SERVICES SYSTEM FOR REMOTE SENSING DATA

A DISTRIBUTED CATALOG AND DATA SERVICES SYSTEM FOR REMOTE SENSING DATA A DISTRIBUTED CATALOG AND DATA SERVICES SYSTEM FOR REMOTE SENSING DATA Ramachandran Suresh *, Liping Di *, Kenneth McDonald ** * NASA/RITSS 4500 Forbes Blvd, Lanham, MD 20706, USA suresh@rattler.gsfc.nasa.gov

More information

Integrated Rule-based Data Management System for Genome Sequencing Data

Integrated Rule-based Data Management System for Genome Sequencing Data Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer

More information

How To Build A Supermicro Computer With A 32 Core Power Core (Powerpc) And A 32-Core (Powerpc) (Powerpowerpter) (I386) (Amd) (Microcore) (Supermicro) (

How To Build A Supermicro Computer With A 32 Core Power Core (Powerpc) And A 32-Core (Powerpc) (Powerpowerpter) (I386) (Amd) (Microcore) (Supermicro) ( TECHNICAL GUIDELINES FOR APPLICANTS TO PRACE 7 th CALL (Tier-0) Contributing sites and the corresponding computer systems for this call are: GCS@Jülich, Germany IBM Blue Gene/Q GENCI@CEA, France Bull Bullx

More information

Analysis of Climatic and Environmental Changes Using CLEARS Web-GIS Information-Computational System: Siberia Case Study

Analysis of Climatic and Environmental Changes Using CLEARS Web-GIS Information-Computational System: Siberia Case Study Analysis of Climatic and Environmental Changes Using CLEARS Web-GIS Information-Computational System: Siberia Case Study A G Titov 1,2, E P Gordov 1,2, I G Okladnikov 1,2, T M Shulgina 1 1 Institute of

More information

Best Practices on Standards, Policies and Quality Assurance in Digital Repositories for Long Term Preservation

Best Practices on Standards, Policies and Quality Assurance in Digital Repositories for Long Term Preservation Available online at www.prace-ri.eu Partnership for Advanced Computing in Europe Best Practices on Standards, Policies and Quality Assurance in Digital Repositories for Long Term Preservation Olivier Rouchon,

More information

Primary author: Kaspar, Frank (DWD - Deutscher Wetterdienst), Frank.Kaspar@dwd.de

Primary author: Kaspar, Frank (DWD - Deutscher Wetterdienst), Frank.Kaspar@dwd.de Primary author: Kaspar, Frank (DWD - Deutscher Wetterdienst), Frank.Kaspar@dwd.de Co-authors: Johannes Behrendt (DWD - Deutscher Wetterdienst), Klaus-Jürgen Schreiber (DWD - Deutscher Wetterdienst) Abstract

More information

Satellite Products and Dissemination: Visualization and Data Access

Satellite Products and Dissemination: Visualization and Data Access Satellite Products and Dissemination: Visualization and Data Access Gregory Leptoukh GES DISC, NASA GSFC Dana Ostrenga GES DISC, NASA GSFC Introduction The Goddard Earth Sciences Data and Information Services

More information

NOAA Big Data Project. David Michaud Acting Director, Office of Central Processing Office Monday, August 3, 2015

NOAA Big Data Project. David Michaud Acting Director, Office of Central Processing Office Monday, August 3, 2015 NOAA Big Data Project David Michaud Acting Director, Office of Central Processing Office Monday, August 3, 2015 Central Processing Portfolio Benefits and Scope Central Processing Portfolio Benefits Ensures

More information

Big Data at ECMWF Providing access to multi-petabyte datasets Past, present and future

Big Data at ECMWF Providing access to multi-petabyte datasets Past, present and future Big Data at ECMWF Providing access to multi-petabyte datasets Past, present and future Baudouin Raoult Principal Software Strategist ECMWF Slide 1 ECMWF An independent intergovernmental organisation established

More information

Archiving of Simulations within the NERC Data Management Framework: BADC Policy and Guidelines.

Archiving of Simulations within the NERC Data Management Framework: BADC Policy and Guidelines. Archiving of Simulations within the NERC Data Management Framework: BADC Policy and Guidelines. Introduction 1. Issues associated with archiving information about the environment made by measurement are

More information

ATLAS2000 Atlases of the Future in Internet

ATLAS2000 Atlases of the Future in Internet ATLAS2000 Atlases of the Future in Internet M. Friedrich Institute for Physical Geography, University of Freiburg i.br., Germany (mafri@ipg.uni-freiburg.de) M. Melle Institute for Computer Science, University

More information

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley P1.1 AN INTEGRATED DATA MANAGEMENT, RETRIEVAL AND VISUALIZATION SYSTEM FOR EARTH SCIENCE DATASETS Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University Xu Liang ** University

More information

Information Migration

Information Migration Information Migration Raymond A. Clarke Sr. Enterprise Storage Solutions Specialist, Sun Microsystems - Archive & Backup Solutions SNIA Data Management Forum, Board of Directors 1 SNIA DMFs 100 Year Task

More information

Exascale Storage Systems An Analytical Study of Expenses

Exascale Storage Systems An Analytical Study of Expenses Exascale Storage Systems An Analytical Study of Expenses Julian M. Kunkel 1, Michael Kuhn 2, Thomas Ludwig 1 The computational power and storage capability of supercomputers are growing at a different

More information

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007 Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the

More information

Levels of Archival Stewardship at the NOAA National Oceanographic Data Center: A Conceptual Model 1

Levels of Archival Stewardship at the NOAA National Oceanographic Data Center: A Conceptual Model 1 Levels of Archival Stewardship at the NOAA National Oceanographic Data Center: A Conceptual Model 1 Levels of Archival Stewardship at the NOAA National Oceanographic Data Center: A Conceptual Model Dr.

More information

NASA Earth System Science: Structure and data centers

NASA Earth System Science: Structure and data centers SUPPLEMENT MATERIALS NASA Earth System Science: Structure and data centers NASA http://nasa.gov/ NASA Mission Directorates Aeronautics Research Exploration Systems Science http://nasascience.nasa.gov/

More information

Structure? Integrated Climate Data Center How to use the ICDC? Tools? Data Formats? User

Structure? Integrated Climate Data Center How to use the ICDC? Tools? Data Formats? User Integrated Climate Data Center? Data Formats?? Tools???? visits Structure???? User Contents Which Data Formats do we offer? What is the Structure of our data center? Which Tools do we provide? Our Aims

More information

Guide to the WMO Table Driven Code Form Used for the Representation and Exchange of Regularly Spaced Data In Binary Form: FM 92 GRIB Edition 2

Guide to the WMO Table Driven Code Form Used for the Representation and Exchange of Regularly Spaced Data In Binary Form: FM 92 GRIB Edition 2 Guide to the WMO Table Driven Code Form Used for the Representation and Exchange of Regularly Spaced Data In Binary Form: FM 92 GRIB Edition 2 Layer 1: Basic Aspects of FM 92 GRIB Edition 2 (GRIB2) Layer

More information

Product Brief: XenData X2500 LTO-6 Digital Video Archive System

Product Brief: XenData X2500 LTO-6 Digital Video Archive System Product Brief: XenData X2500 LTO-6 Digital Video Archive System Updated: March 21, 2013 Overview The XenData X2500 system includes XenData6 Workstation software which provides the archive, restore and

More information

How To Write An Nccwsc/Csc Data Management Plan

How To Write An Nccwsc/Csc Data Management Plan Guidance and Requirements for NCCWSC/CSC Plans (Required for NCCWSC and CSC Proposals and Funded Projects) Prepared by the CSC/NCCWSC Working Group Emily Fort, Data and IT Manager for the National Climate

More information

An Introduction to the MTG-IRS Mission

An Introduction to the MTG-IRS Mission An Introduction to the MTG-IRS Mission Stefano Gigli, EUMETSAT IRS-NWC Workshop, Eumetsat HQ, 25-0713 Summary 1. Products and Performance 2. Design Overview 3. L1 Data Organisation 2 Part 1 1. Products

More information

The STAR Algorithm Integration Team (AIT) Research to Operations Process

The STAR Algorithm Integration Team (AIT) Research to Operations Process The STAR Algorithm Integration Team (AIT) Research to Operations Process Presented by Tom King 1 The Problem Scientists write code that needs to go to operations, but:» Many scientists often prefer interpreted

More information

Geospatial Software Solutions for the Environment and Natural Resources

Geospatial Software Solutions for the Environment and Natural Resources Geospatial Software Solutions for the Environment and Natural Resources Manage and Preserve the Environment and its Natural Resources Our environment and the natural resources it provides play a growing

More information

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration Solutions Integrated Storage Appliances Management Optimized Storage & Migration Archive Data Retention & Compliance Services Global Installation & Support SECURING THE FUTURE OF YOUR DATA w w w.q sta

More information

XenData Video Edition. Product Brief:

XenData Video Edition. Product Brief: XenData Video Edition Product Brief: The Video Edition of XenData Archive Series software manages one or more automated data tape libraries on a single Windows 2003 server to create a cost effective digital

More information

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:

More information

J9.6 GIS TOOLS FOR VISUALIZATION AND ANALYSIS OF NEXRAD RADAR (WSR-88D) ARCHIVED DATA AT THE NATIONAL CLIMATIC DATA CENTER

J9.6 GIS TOOLS FOR VISUALIZATION AND ANALYSIS OF NEXRAD RADAR (WSR-88D) ARCHIVED DATA AT THE NATIONAL CLIMATIC DATA CENTER J9.6 GIS TOOLS FOR VISUALIZATION AND ANALYSIS OF RADAR (WSR-88D) ARCHIVED DATA AT THE NATIONAL CLIMATIC DATA CENTER Steve Ansari * STG Incorporated, Asheville, North Carolina Stephen Del Greco NOAA National

More information

POLICY AND GUIDELINES FOR THE MANAGEMENT OF ELECTRONIC RECORDS INCLUDING ELECTRONIC MAIL (E-MAIL) SYSTEMS

POLICY AND GUIDELINES FOR THE MANAGEMENT OF ELECTRONIC RECORDS INCLUDING ELECTRONIC MAIL (E-MAIL) SYSTEMS POLICY AND GUIDELINES FOR THE MANAGEMENT OF ELECTRONIC RECORDS INCLUDING ELECTRONIC MAIL (E-MAIL) SYSTEMS 1. Purpose Establish and clarify a records management policy for municipal officers with respect

More information

<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features

<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features 1 Oracle SQL Developer 3.0: Overview and New Features Sue Harper Senior Principal Product Manager The following is intended to outline our general product direction. It is intended

More information

MADIS-T, Satellite Radiance and Sounding Thread using Virtualization

MADIS-T, Satellite Radiance and Sounding Thread using Virtualization 1 MADIS-T, Satellite Radiance and Sounding Thread using Virtualization Abhishek Agarwal 1, Shahram Tehranian 1, James McNitt 2, Joseph Facundo 2, Anand Swaroop 1 1 Nortel Government Solutions (NGS) 2 National

More information

ETERNUS CS High End Unified Data Protection

ETERNUS CS High End Unified Data Protection ETERNUS CS High End Unified Data Protection Optimized Backup and Archiving with ETERNUS CS High End 0 Data Protection Issues addressed by ETERNUS CS HE 60% of data growth p.a. Rising back-up windows Too

More information

Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS

Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS Workshop on the Future of Big Data Management 27-28 June 2013 Philip Kershaw Centre for Environmental Data Archival

More information

CEDA Storage. Dr Matt Pritchard. Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk

CEDA Storage. Dr Matt Pritchard. Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk CEDA Storage Dr Matt Pritchard Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk How we store our data NAS Technology Backup JASMIN/CEMS CEDA Storage Data stored as files on disk. Data is migrated

More information

PASS4TEST. IT Certification Guaranteed, The Easy Way! http://www.pass4test.com We offer free update service for one year

PASS4TEST. IT Certification Guaranteed, The Easy Way! http://www.pass4test.com We offer free update service for one year PASS4TEST IT Certification Guaranteed, The Easy Way! \ http://www.pass4test.com We offer free update service for one year Exam : 000-061 Title : IBM Dynamic Infrastructure Technical Support Leader Vendors

More information

In the two following sections we separately consider hardware and software requirements. Sometimes, they will be offered for sale as a package.

In the two following sections we separately consider hardware and software requirements. Sometimes, they will be offered for sale as a package. Appendix A COMPUTER HARDWARE AND SOFTWARE In this appendix we discuss some of the issues in choosing hardware and software for image analysis. The purpose is to draw attention to the issues involved rather

More information

Digital Preservation. OAIS Reference Model

Digital Preservation. OAIS Reference Model Digital Preservation OAIS Reference Model Stephan Strodl, Andreas Rauber Institut für Softwaretechnik und Interaktive Systeme TU Wien http://www.ifs.tuwien.ac.at/dp Aim OAIS model Understanding the functionality

More information

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc. Oracle9i Data Warehouse Review Robert F. Edwards Dulcian, Inc. Agenda Oracle9i Server OLAP Server Analytical SQL Data Mining ETL Warehouse Builder 3i Oracle 9i Server Overview 9i Server = Data Warehouse

More information

INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER

INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER Qian Liu Environmental Systems Research Institute 380 New York Street Redlands, CA92373, U.S.A - qliu@esri.com KEY WORDS: OGC, Standard, Interoperability,

More information

Migrating NASA Archives to Disk: Challenges and Opportunities. NASA Langley Research Center Chris Harris June 2, 2015

Migrating NASA Archives to Disk: Challenges and Opportunities. NASA Langley Research Center Chris Harris June 2, 2015 Migrating NASA Archives to Disk: Challenges and Opportunities NASA Langley Research Center Chris Harris June 2, 2015 MSST 2015 Topics ASDC Who we are? What we do? Evolution of storage technologies Why

More information

SQL Server Array Library 2010-11 László Dobos, Alexander S. Szalay

SQL Server Array Library 2010-11 László Dobos, Alexander S. Szalay SQL Server Array Library 2010-11 László Dobos, Alexander S. Szalay The Johns Hopkins University, Department of Physics and Astronomy Eötvös University, Department of Physics of Complex Systems http://voservices.net/sqlarray,

More information

GIS Databases With focused on ArcSDE

GIS Databases With focused on ArcSDE Linköpings universitet / IDA / Div. for human-centered systems GIS Databases With focused on ArcSDE Imad Abugessaisa g-imaab@ida.liu.se 20071004 1 GIS and SDBMS Geographical data is spatial data whose

More information

GPFS und HPSS am HLRS

GPFS und HPSS am HLRS GPFS und HPSS am HLRS Peter W. Haas Archivierung im Bereich Höchstleistungsrechner Swisstopo, Bern 3. Juli 2009 1 High Performance Computing Center Stuttgart Table of Contents 1. What are GPFS and HPSS

More information

BIG DATA What it is and how to use?

BIG DATA What it is and how to use? BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14

More information

OpenDAP configuration course

OpenDAP configuration course July 8, 2009(revision 646) 1 Introduction 2 Overview 3 Architecture 4 Components 5 Hyrax Backend server Backend configuration Run 6 Frontend 7 NetCDF 8 Extra Time schedule 13:00-13:30 Project overview

More information

Long term data Archive Study on new Technologies. GMV, 2011 Property of GMV All rights reserved

Long term data Archive Study on new Technologies. GMV, 2011 Property of GMV All rights reserved Long term data Archive Study on new Technologies Property of GMV All rights reserved Index Introduction Mission & Objectives Survey Results Technological Evaluation Conclusions & Future Work LTDP Page

More information

Reprojecting MODIS Images

Reprojecting MODIS Images Reprojecting MODIS Images Why Reprojection? Reasons why reprojection is desirable: 1. Removes Bowtie Artifacts 2. Allows geographic overlays (e.g. coastline, city locations) 3. Makes pretty pictures for

More information

VIIRS-CrIS mapping. NWP SAF AAPP VIIRS-CrIS Mapping

VIIRS-CrIS mapping. NWP SAF AAPP VIIRS-CrIS Mapping NWP SAF AAPP VIIRS-CrIS Mapping This documentation was developed within the context of the EUMETSAT Satellite Application Facility on Numerical Weather Prediction (NWP SAF), under the Cooperation Agreement

More information

1/20/2016 INTRODUCTION

1/20/2016 INTRODUCTION INTRODUCTION 1 Programming languages have common concepts that are seen in all languages This course will discuss and illustrate these common concepts: Syntax Names Types Semantics Memory Management We

More information

"CLIMATE DATA OPERATORS" AS A USER-FRIENDLY PROCESSING TOOL FOR CMSAF'S SATELLITE-DERIVED CLIMATE MONITORING PRODUCTS

CLIMATE DATA OPERATORS AS A USER-FRIENDLY PROCESSING TOOL FOR CMSAF'S SATELLITE-DERIVED CLIMATE MONITORING PRODUCTS "CLIMATE DATA OPERATORS" AS A USER-FRIENDLY PROCESSING TOOL FOR CMSAF'S SATELLITE-DERIVED CLIMATE MONITORING PRODUCTS Frank Kaspar 1, Uwe Schulzweida 2, Ralf Müller 2 1 Satellite Application Facility on

More information

Two-Phase Data Warehouse Optimized for Data Mining

Two-Phase Data Warehouse Optimized for Data Mining Two-Phase Data Warehouse Optimized for Data Mining Balázs Rácz András Lukács Csaba István Sidló András A. Benczúr Data Mining and Web Search Research Group Computer and Automation Research Institute Hungarian

More information

NASA s Big Data Challenges in Climate Science

NASA s Big Data Challenges in Climate Science NASA s Big Data Challenges in Climate Science Tsengdar Lee, Ph.D. High-end Computing Program Manager NASA Headquarters Presented at IEEE Big Data 2014 Workshop October 29, 2014 1 2 7-km GEOS-5 Nature Run

More information

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler 1) Operating systems a) Windows b) Unix and Linux c) Macintosh 2) Data manipulation tools a) Text Editors b) Spreadsheets

More information

REACCH PNA Data Management Plan

REACCH PNA Data Management Plan REACCH PNA Data Management Plan Regional Approaches to Climate Change (REACCH) For Pacific Northwest Agriculture 875 Perimeter Drive MS 2339 Moscow, ID 83844-2339 http://www.reacchpna.org reacch@uidaho.edu

More information

Chapter 13 File and Database Systems

Chapter 13 File and Database Systems Chapter 13 File and Database Systems Outline 13.1 Introduction 13.2 Data Hierarchy 13.3 Files 13.4 File Systems 13.4.1 Directories 13.4. Metadata 13.4. Mounting 13.5 File Organization 13.6 File Allocation

More information

Chapter 13 File and Database Systems

Chapter 13 File and Database Systems Chapter 13 File and Database Systems Outline 13.1 Introduction 13.2 Data Hierarchy 13.3 Files 13.4 File Systems 13.4.1 Directories 13.4. Metadata 13.4. Mounting 13.5 File Organization 13.6 File Allocation

More information

Many DBA s are being required to support multiple DBMS s on multiple platforms. Many IT shops today are running a combination of Oracle and DB2 which

Many DBA s are being required to support multiple DBMS s on multiple platforms. Many IT shops today are running a combination of Oracle and DB2 which Many DBA s are being required to support multiple DBMS s on multiple platforms. Many IT shops today are running a combination of Oracle and DB2 which is resulting in either having to cross train DBA s

More information

Storage of the Experimental Data at SOLEIL. Computing and Electronics

Storage of the Experimental Data at SOLEIL. Computing and Electronics Storage of the Experimental Data at SOLEIL 1 the SOLEIL infrastructure 2 Experimental Data Storage: Data Hierarchisation Close Data : beamline local access 3 to 4 days min. Recent Data : fast access, low

More information

CommVault Simpana Archive 8.0 Integration Guide

CommVault Simpana Archive 8.0 Integration Guide CommVault Simpana Archive 8.0 Integration Guide Data Domain, Inc. 2421 Mission College Boulevard, Santa Clara, CA 95054 866-WE-DDUPE; 408-980-4800 Version 1.0, Revision B September 2, 2009 Copyright 2009

More information

Hardware Configuration Guide

Hardware Configuration Guide Hardware Configuration Guide Contents Contents... 1 Annotation... 1 Factors to consider... 2 Machine Count... 2 Data Size... 2 Data Size Total... 2 Daily Backup Data Size... 2 Unique Data Percentage...

More information

XenData Archive Series Software Technical Overview

XenData Archive Series Software Technical Overview XenData White Paper XenData Archive Series Software Technical Overview Advanced and Video Editions, Version 4.0 December 2006 XenData Archive Series software manages digital assets on data tape and magnetic

More information

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP) GIS Initiative: Developing an atmospheric data model for GIS Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP) Unidata seminar August 30, 2004 Presentation Outline Overview

More information

D7.7 Data Management Plan (a)

D7.7 Data Management Plan (a) H2020 FETHPC-1-2014 An Exascale Programming, Multi-objective Optimisation and Resilience Management Environment Based on Nested Recursive Parallelism Project Number 671603 D7.7 Data Management Plan (a)

More information

AMLIGHT, Simulation Datasets, and Global Data Sharing

AMLIGHT, Simulation Datasets, and Global Data Sharing AMLIGHT, Simulation Datasets, and Global Data Sharing Jean-Bernard Minster (1,2,4,6), John J. Helly (1,2), Steven M. Day (3,4), Raul Castro Escamilla (5), Philip Maechling (4),Thomas H. Jordan (4), Amit

More information

Database preservation toolkit:

Database preservation toolkit: Nov. 12-14, 2014, Lisbon, Portugal Database preservation toolkit: a flexible tool to normalize and give access to databases DLM Forum: Making the Information Governance Landscape in Europe José Carlos

More information

FEDORA @ AWI Fedora User Meeting Copenhagen, Denmark 28 September, 2005

FEDORA @ AWI Fedora User Meeting Copenhagen, Denmark 28 September, 2005 Photo: L. Tadday FEDORA @ AWI Fedora User Meeting Copenhagen, Denmark 28 September, 2005-1- Ana Ana Macario, Macario, Computer Center Center Alfred Wegener Alfred Wegener Institute Institute, for Polar

More information

Introduction to IODE Data Management. Greg Reed Past Co-Chair IODE

Introduction to IODE Data Management. Greg Reed Past Co-Chair IODE Introduction to IODE Data Management Greg Reed Past Co-Chair IODE Outline Background - Introduction to IOC and IODE - IODE activities Oceanographic data management - End to end data management - Data stewardship

More information

Use of OGC Sensor Web Enablement Standards in the Meteorology Domain. in partnership with

Use of OGC Sensor Web Enablement Standards in the Meteorology Domain. in partnership with Use of OGC Sensor Web Enablement Standards in the Meteorology Domain in partnership with Outline Introduction to OGC Sensor Web Enablement Standards Web services Metadata encodings SWE as front end of

More information

EUMETSAT EO Portal. End User Image Access using OGC WMS/WCS services. EUM/OPS/VWG/10/0095 Issue <1> <14/01/2010> Slide: 1

EUMETSAT EO Portal. End User Image Access using OGC WMS/WCS services. EUM/OPS/VWG/10/0095 Issue <1> <14/01/2010> Slide: 1 EUMETSAT EO Portal End User Image Access using OGC WMS/WCS services Slide: 1 Overview Introduction: status of data access and visualization EUMETSAT datasets Architecture Web Map Service implementation

More information

Integrated Climate Data Center How to use our data center? Integrated Climate Data Center - Remon Sadikni - remon.sadikni@zmaw.de

Integrated Climate Data Center How to use our data center? Integrated Climate Data Center - Remon Sadikni - remon.sadikni@zmaw.de Integrated Climate Data Center How to use our data center? Contents Which Data Formats do we offer? Which Tools do we provide? What is the Structure of our data center? Data Formats Data Formats ASCII

More information