NOAA Environmental Data Management



Similar documents
NOAA Environmental Data Management

Open Data and Big Data at NOAA

NOAA Environmental Data Management Framework

GOSIC NEXRAD NIDIS NOMADS

An Esri White Paper June 2011 ArcGIS for INSPIRE

Global Earth Observation Integrated Data Environment (GEO-IDE) Presentation to the Data Archiving and Access Requirements Working Group (DAARWG)

Department of the Interior Open Data FY14 Plan

Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014

National Data Buoy Center Cooperative Relations

Open Data and Information Sharing Michael Simcock, Chief Data Architect OCIO/EDMO

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team

Business Plan for the. The Geospatial Platform. September 20, 2012 REDACTED

The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA

Data Management in Science and the Legacy of the International Polar Year

NCDC Strategic Vision

How To Write An Nccwsc/Csc Data Management Plan

NCDC's Application of Climate Data to Tourism Business Decision-Making

Plataformas abiertas, estándares, y ArcGIS Online. Michael Gould Global Education Manager, Esri Profesor, Universidad Jaume I, España

Databases & Data Infrastructure. Kerstin Lehnert

National Geospatial Data Policy Procedure for Geospatial Metadata Management

Cloud-based Infrastructures. Serving INSPIRE needs

Public Access Plan. U.S. Department of Energy July 24, 2014 ENERGY.GOV

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

NOAALink Small Business Industry Day. June 2, 2014

OMAO Data Management Roadmap

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)

Federal Cloud Computing Initiative Overview

EXPLORING AND SHARING GEOSPATIAL INFORMATION THROUGH MYGDI EXPLORER

The ORIENTGATE data platform

Big Data to Knowledge (BD2K)

MyOcean Copernicus Marine Service Architecture and data access Experience

GeoCloud Status. Doug Nebert June 2011

UNITED STATES DEPARTMENT OF THE INTERIOR BUREAU OF LAND MANAGEMENT MANUAL TRANSMITTAL SHEET Data Administration and Management (Public)

DIVER (Data Integration Visualization Exploration and Reporting)

AN INTEGRATED SOLUTION FOR MANAGING EXPLORATION DATA

Interagency Science Working Group. National Archives and Records Administration

The USGS Landsat Big Data Challenge

MD imap 2.0 THE NEXT GENERATION OF MARYLAND S ENTERPRISE GIS. Esri MUG Conference Baltimore, MD December 3,

Copernicus Space Component ESA Data Access Overview J. Martin (ESA), R. Knowelden (Airbus D&S)

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

Big Data in OpenTopography

GeoNetwork, The Open Source Solution for the interoperable management of geospatial metadata

The CLASS Cloud Access Pilot

Geospatial Data Archiving

Environmental Data Management:

THREDDS. THematic Real-time Environmental Distributed Data Services. Connecting people, documents and data

Sextant. Spatial Data Infrastructure for Marine Environment. C. Satra Le Bris, E. Quimbert, M. Treguer

Portal for ArcGIS. Satish Sankaran Robert Kircher

CLASS and Enterprise Solutions Rick Vizbulis. CLASS and Enterprise Solutions

Data Management Considerations for the Data Life Cycle

NATIONAL CLIMATE CHANGE & WILDLIFE SCIENCE CENTER & CLIMATE SCIENCE CENTERS DATA MANAGEMENT PLAN GUIDANCE

EEOS Spatial Databases and GIS Applications

USE CASE STUDY. Products Over Data. Department of Agriculture (USDA) Economic Research Service

Spatial Data Infrastructures - trends and impacts on society. Max Craglia Joint Research Centre Digital Earth and Reference Data Unit

Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any

Standardized data sharing through an open-source Spatial Data Infrastructure: the Afromaison project

Enterprise GIS Solutions to GIS Data Dissemination

2009 CAP Grant Kickoff USGS, Reston, VA May 21, 2009

Development and Support for the USGODAE Server

Report of the DTL focus meeting on Life Science Data Repositories

CDI/THREDDS Interoperability: the SeaDataNet developments. P. Mazzetti 1,2, S. Nativi 1,2, 1. CNR-IMAA; 2. PIN-UNIFI

NOAA. Integrated Ocean Observing System (IOOS) Program. Data Integration Framework (DIF) Master Project Plan. (Version 1.0) November 8, 2007

Distributed Computing. Mark Govett Global Systems Division

U.S. Fish & Wildlife Service

Geospatial Platform Support Contract Monthly Update to Coordination Group. James Irvine, PMO Contract Support FGDC OS June 7, 2016

GRIIDC Data Management Plan Version 1.0

The Next Generation Air Transportation System Information Sharing Environment (NISE)

Scientific Data Management and Dissemination

Nevada NSF EPSCoR Track 1 Data Management Plan

Huai-Min Zhang & NOAAGlobalTemp Team

A NEW STRATEGIC DIRECTION FOR NTIS

Data Integration Strategies

Cite My Data M2M Service Technical Description

U.S. Department of Labor Digital Government Strategy (DGS) Milestone #6.3 Improving Digital Services

GetLOD - Linked Open Data and Spatial Data Infrastructures

smespire - Exercises for the Hands-on Training on INSPIRE Network Services April 2014 Jacxsens Paul SADL KU Leuven

Local Loading. The OCUL, Scholars Portal, and Publisher Relationship

Transcription:

NOAA Environmental Management Report to EDM Virtual Workshop 2013 06 25 Jeff de La Beaujardière, PhD NOAA Management Architect jeff.delabeaujardiere@noaa.gov +1 301 713 7175 1

NOAA EDM Framework Management Framework Principles Governance Resources Standards Architecture Assessment Purpose: To organize, guide and support NOAA environmental data management activities. https://www.nosc.noaa.gov/ed MC/framework.php Contributors: Jeff de La Beaujardière (Editor) Anne Ball, NOS CSC Julie Bosch, NCDDC Deirdre Byrne, NODC Leo Carling, OMAO Don Collins, NODC Darien Davis et al., OAR Nancy DeFrancesco Larry Goldberg, NMFS Peter Grimm, CID Ted Habermann, NGDC Karl Hampton, OSPO Steve Hankin, PMEL Scott Hausman, NCDC Michelle Hertzfeld, NESDIS IIAD Christina Horvat, NWS Roy Mendelssohn, NMFS Lewis McCulloch, TPIO Ana Pinheiro Privette, NCDC Nancy Ritchey, NCDC Steve Rutz, NODC Matt Seybold, OSPO Micah Wengren, NOS OCS 2

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment 3

Vision for NOAA Management All NOAA data will be: Discoverable Accessible Usable Preserved for all types of users and applications. 4

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment 5

Recent Federal Initiatives Public Access to Research Results (OSTP memo 2013-02-22) Publications Grantees Citation Research Open Access High Value Earth Obs Business Big Earth Initiative (BEDI) (proposed FY2014) Open Policy (Exec. Order 2013-05-09) Metadata Cataloging Online Services Standard formats 6

Public Access to Research Results (PARR) Publications and Digital must be freely available Embargo period for journal publications Memo from White House Office of Science and Technology Policy (OSTP): "Increasing Access to the Results of Federally Funded Scientific Research" http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf Focus is more on policy than technology Memo issued 2013 Feb 22; draft plans due 2013 Aug 22 NOAA PARR Cmtee established by Research Council to draft plan Co chairs: Jeff DLB (EDMC), Neal Kaske (NOAA Library) Members: Danielle Tillman NRC liaison, Mark Brady NMFS, Paul Comar NOS, Cecile Daniels OMAO, Ingrid Guch NESDIS, Douglas Perry OMAO, Allen Shimada NMFS, Ronald Stouffer OAR, Harry Tabak NWS, Marina Timofeyeva NWS OSTP hosting interagency meetings 7

Open Policy (May 9, 2013) Requires: Use of open standards and machine readable formats Life cycle data management planning Inventory of agency data assets at agency.gov/data Initial deadlines November 9, 2013 NOAA planning begun w/chairs of EA, EDM, GIS, & Web cmtees References: Executive Order "Making Open and Machine Readable the New Default for Government Information" http://www.whitehouse.gov/the press office/2013/05/09/executive order making open and machine readable new default government OMB Open Policy Managing Information as an Asset http://www.whitehouse.gov/sites/default/files/omb/memoranda/2013/m 13 13.pdf Project Open site on GitHub http://project open data.github.io/ 8

Big Earth Initiative (BEDI) proposed multi agency activity coordinated through US Group on Earth Observations (USGEO) Goal: Improve discoverability, accessibility, & usability of data Focus on "high value" datasets,e.g. from: OSTP Earth Observations Assessment USGCRP National Climate Assessment NOAA Observing Systems of Record NOAA activity: FY2014 Funding request in Pres Bud (need Congress approval) Preliminary planning with all NOAA LOs remainder of 2013 TPIO: Project management EDMC, DMIT: Technical coordination & implementation NOSC, CIO Council: Oversight 9

NOAA Environmental Management Committee NOSC NOAA Observing System Council CIO Council Chief Information Officer Council NAO 212 15 "Management of Environmental and Information" EDMC Environmental Management Committee EDMC Procedural Directives DMIT Management Integration Team 10

EDMC Procedural Directives (Environmental Management Committee) Management Planning PD Plan, in advance, how you will preserve, document and distribute your data. Archive Procedure What to archive, how to submit to archive. Documentation How to apply ISO 19115 metadata for discovery, use & understanding. Sharing by NOAA Grantees State in proposal how you will share data. Share within 2 years. Citation Assign persistent identifiers to datasets and encourage citation. Access Establish & improve on line services for data access in prep. 11

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment Resources Budget Project specific FY2014 BEDI? Personnel Training Recognition Authority Other Resources Annual Workshop Teams (e.g. DMIT) EDM Wiki: https://geo ide.noaa.gov/wiki/ 12

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment Standards Metadata ISO 19115/19139 Formats and services Open Policy mandate Vocabularies Quality Control IT Security NAO 212 13 13

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment 14

Avoid Purpose Built Systems Designated Users Custom App #1 Custom App #2 stovepipe #1 stovepipe #2 Project base #1 base #2 15

Services Layer User Tools.gov and Other Portals Decision Support Tools Scientific Software Value- Adding Reseller data services layer Search & Discovery Services Access Services Documentation Compatible Formats and Vocabularies Sources Satellite Radar Buoy Ship Sonar Gauge ROV/UAV Surveys 16

Potential Cloud Deployment Scenario Commercial Cloud Access services Discovery services Utility services Public users NOAA security boundary One-way push Master copy of NOAA Government Cloud Processing Services NOAA Internal customers 17

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment Assessment EDM Dashboard Observing System of Record EDM Assessment Geospatial Systems Assessment 18

EDM Dashboard http://sites.google.com/a/noaa.gov/edm-dashboard/ (internal access only) 19

Obs. System of Record EDM Assessment 20

Management Framework Management Framework Principles Governance Resources Standards Architecture Assessment 21

Planning and Production Activities Management Activities Requirements Definition Planning Development Deployment Operations Collection Processing Quality Control Documentation Cataloging Dissemination Preservation Stewardship Usage Tracking Final Disposition Activities Usage Activities Discovery Reception Understanding Analysis Value Added Products User Feedback Citation Tagging Gap Assessment 22

Planning and Production Activities Requirements Definition Planning Development Deployment Applicability of EDMC Directives DM Planning Operations Management Activities Collection Processing Quality Control Documentation Cataloging Dissemination Preservation Stewardship Usage Tracking Final Disposition Documentation Sharing by Grantees Access Archive Procedure Usage Activities Discovery Reception Understanding Analysis Value Added Products User Feedback Citation Tagging Gap Assessment Citation 23

set Identifier Project Published Paper or other work cites used in & Metadata NOAA Center submitted to assigns ID resolves to links to First NOAA dataset ID assigned by NGDC: 10.7289/V5154F01 (http://dx.doi.org/10.7289/v5154f01) landing page

Timeline OSTP PARR memo Open Policy PARR Plan due noaa.gov/data inventory due BEDI execution? NOAA-wide BEDI pre-planning NOAA DOI license NOAA FY2014 budget 1st NOAA DOI Virtual EDMW EDMC FY2014 plan NOAA DOI license renewal Feb 2013 May June Aug Sep Nov Jan 2014 Feb 25

NOAA data should be: Discoverable Accessible Summary Public Access to Research Results EDMC Procedural Directives Open Policy Big Earth Initiative (BEDI) Usable User Tools Preserved data services layer Sources 26

Backup Slides 27

1 Observing Requirements 2 Management Planning Directive Documentation Directive Archive Procedure Producers 3 generate 4 transmit and Metadata preserve 6 14 NOAA Center refine feedback 13 publish 5 ID 7 Users 10 find get Access and Discovery Services publish 8 establish Tools Citation Directive Access Directive measure 9 Agency Leadership create 11 Result product forecast paper decision policy response ID 12 measure monitor Management Dashboard 28

Provider Roles 1: Obtain. 2: Make your data accessible on-line. Local Service Instance or NOAA Shared Service 3: Document your well. 4: Make your metadata accessible. Metadata (including URL) Metadata Local Catalog or Web-Accessible Folder 5: Tell us where to find your metadata Local Catalog WAF DM Dashboard 29

NOAA Discovery Vision External Catalogs data.gov ONE GEO GCMD GeoPlatform Map Viewer Client Tools Desktop GIS Scientific Software NASA Giovanni NOAA data cataloging Find NOAA data by: Location Time Phenomenon Other characteristics without knowing NOAA organization Local Catalogs & Metadata Folders NCDC NODC NGDC UAF IOOS others... Access Services Sources ROV/UAV Marine Enforcement NOAA Labs Satellite Radar Mammals Buoy Ship Surveys Fish Sonar Gauge Chart Model 30

Tagging Concept Topical Catalogs or Portals sets with a relevant tag are recorded by external catalogs. Deepwater Horizon Response data.gov GEOSS CORE the next crisis Tags are not inserted into metadata records by data providers. Instead, the Catalog adds tags to indicate datasets relevant to a particular purpose. Primary NOAA Catalog(s) Tag base DWH Oil Spill data.gov GEOSS CORE GEOSS StP next crisis and the next metadata record metadata record metadata record metadata record metadata record metadata record metadata record metadata record 31

PARR Target State New or in progress: #2, 3, 4, 5, 6, 12, 16, 17 9 Agency 7 funds Researcher writes creates 1 creates 8 14 assigns ID resolves to 15 11 submitted to Journal 13 manuscript 12 issues accessible after embargo 16 Published Paper 17 cites sent to Library used in 10 & Metadata 2 resolves to 5 landing page submitted to Archive 4 assigns ID 6 links to 3 available through data access service

Obstacles to improving NOAA EDM Changes cost money. Savings from standardization and reuse are not visible until later, or may accrue to later adopters. Standards are complicated, too general, or too specific to a different purpose. NOAA datasets and programs are numerous, heterogenous, mission specific and independent. Operational systems cannot be readily changed. System developers start from scratch rather than first seeking existing solutions. Good DM practices are not part of employee job descriptions or recognition methods. People who approve new projects are not always aware of existing approaches and directives. 33

Session 2 5 Slides 34

Catalog & Search Current state: No complete inventory No single point of entry limited integration with commercial search engines Numerous Portals with overlapping scope & manual population Various catalog implementations Geoportal Server at each Center UAF Catalog NWS WIS GISC DAR 35

Catalog & Search Target state: Agency data inventory per Presidential Executive Order Single point of entry to federated search API Automatic registration with data.gov, GCMD, GEOSS, GCIS, thematic portals Better integration with commercial search engines 36

Catalog & Search Potential next steps: improve/augment Center catalogs federation of existing catalogs agency wide data inventory (noaa.gov/data) schema.org CS/W OpenSearch 37

Access current state: variety of data access methods offered DAP, OGC WMS, WFS, WCS, SOS, Esri, FTP, web links, CLASS API, custom services differing implementations of same service type multiple access points for some datasets, with authoritative source unclear some datasets not accessible online at all no permanent shared hosting capability except archival data from National Centers no CIO approved open source software stack 38

Access target state: all NOAA datasets accessible online via services per Presidential mandates and NOAA policy NOAA data made available to the widest community possible small number of data access service types services as consistent and standardized as possible services that support machine to machine exchange services not biased towards any particular client application shared hosting option 39

Access potential next steps Improve UAF holdings, organization and/or reliability for relevant data types UAF outreach and work with Centers could be expanded to monthly workshops or training sessions, etc. UAF to produce best practices for data providers who want to serve data via TDS UAF to create TDS catalog rubric to describe quality of TDS catalogs UAF to utilized ERDDAP capabilities to server in situ data/dsg feature types Improved links to access data from catalog & h t l 40

Usability Current State Inconsistent approach to metadata missing, incomplete unstructured [text, HTML, PDF] variety of standards FGDC CSDGM standard, ISO 19115/19139, NMFS InPort Partial deployment of metadata metrics not all metadata records are known & scored Variety of output formats Limited use of controlled vocabularies limited browse imagery and on demand visualization 41

Usability Target State Comprehensive metadata based on international standards All available metadata scored All data available in at least one broadlysupported format Common or controlled vocabularies used Web Map Service endpoint for all relevant datasets 42

Usability potential next steps Agree on small # of target formats Complete & updated scoring of metadata Begin establishing Web Map Services Agree on vocabularies 43

Preservation and Citation Current State not all data archived ship data not getting to archive PI/project data at local offices datasets not citable algorithms/code not preserved need to save all intermediate products cannot reprocess 44

Preservation and Citation Target State routine & complete archiving persistent identifiers, with landing pages and citation guidelines all algorithms/code preserved & document on demand reprocessing capability 45

Preservation and Citation potential next steps Continue Citation project Simplify archive submission 46