NATIONAL CLIMATE CHANGE & WILDLIFE SCIENCE CENTER & CLIMATE SCIENCE CENTERS DATA MANAGEMENT PLAN GUIDANCE




Prepared by: NCCWSC/CSC Data Management Working Group, US Geological Survey
February 26, 2013
Version 2.1
More Information: NCCWSC Data Policies
Point of Contact: Emily Fort, Data and IT Coordinator for the NCCWSC, efort@usgs.gov

NCCWSC and CSCs Data Management Guidance

CONTENTS

- Background
- Data Management Plan Process
- Why Are Data Management Plans Required?
- Data Management Plan Description and Guidance
- Data Management Plan
- Data Management Plan Example

BACKGROUND

To provide guidance and support for data sharing, the NCCWSC/CSC Data Management Working Group has developed three policy documents. The first, the NCCWSC/CSC Data Sharing Policy, requires that data and associated products be publicly shared and that a data management plan be developed. The second (this document), the NCCWSC/CSC Data Management Plan Guidance, details what researchers should include in the data management plan that is required as part of the proposal and for funded projects. The third, the NCCWSC/CSC Data Management Manual, gives CSC Data Stewards and researchers funded by the NCCWSC or a CSC information on a variety of topics, including roles and responsibilities, metadata, file formats, and data standards.

Each CSC has a Data Steward who works with funded PIs on the development of the Data Management Plan, answers questions related to data management, and assists with the transition of data and information products into the NCCWSC/CSC repository.

DATA MANAGEMENT PLAN PROCESS

The Data Management Plan (DMP) is prepared in stages. A subset of the DMP is collected during the proposal process. If the proposal is selected for funding, all of the information in the DMP must be completed. The DMP requirements during the proposal stage are limited to the information needed to understand the proposal from a data perspective. The fully completed DMP includes additional details to ensure that data products are consistently formatted and documented during development, and that data deliverables are provided with maximum utility. The DMP will help the CSC Data Steward assist the researcher with data management activities and make the project's data and derived products available after project completion.

The initial version of the DMP is due to the CSC Data Steward within three months of the proposal receiving funding. The DMP should then be reviewed quarterly and updated as the project evolves. The NCCWSC website has the contact information for the Data Stewards. If there are any questions related to the DMP, please contact the appropriate CSC Data Steward.

The DMP includes information about the data used in proposed projects, including:

1) data inputs: existing data collections,
2) data inputs: new collections (e.g., new field data collections, new model output, etc.),
3) data inputs: software or other needs, and
4) data outputs: expected deliverables, datasets, and products.

WHY ARE DATA MANAGEMENT PLANS REQUIRED?

A data management plan:

- Assists with the visibility, reproducibility, and validity of research projects because the data, including the approach and methodology, are well documented and accessible.
- Helps ensure that data and data products are accessible and available for the long term.
- Is consistent with best practices.
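The timeline in the process section above (an initial DMP within three months of funding, followed by quarterly reviews) can be worked through as simple date arithmetic. The following is a minimal sketch only: the helper names `add_months` and `dmp_schedule` are hypothetical, and it assumes that "three months" means calendar months and that quarterly reviews are counted from the initial due date, neither of which the guidance specifies.

```python
import calendar
from datetime import date


def add_months(start: date, months: int) -> date:
    """Return the date `months` calendar months after `start`.

    If the target month is shorter than the start day (e.g. Nov 30 + 3 months),
    the day is clamped to the last day of the target month.
    """
    month_index = start.month - 1 + months
    year = start.year + month_index // 12
    month = month_index % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def dmp_schedule(funding_date: date, review_count: int = 4) -> dict:
    """Sketch a DMP timeline.

    Assumption: the initial DMP is due three calendar months after funding,
    and each quarterly review falls three months after the previous milestone.
    """
    due = add_months(funding_date, 3)
    reviews = [add_months(due, 3 * i) for i in range(1, review_count + 1)]
    return {"initial_dmp_due": due, "quarterly_reviews": reviews}


if __name__ == "__main__":
    schedule = dmp_schedule(date(2013, 2, 26))
    print("Initial DMP due:", schedule["initial_dmp_due"])
    for review in schedule["quarterly_reviews"]:
        print("Review:", review)
```

The actual deadlines for a given project should be confirmed with the appropriate CSC Data Steward.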

DATA MANAGEMENT PLAN DESCRIPTION AND GUIDANCE

Descriptions of the data in the DMP are divided into the following four categories:

1. Data Inputs: Existing Collections;
2. Data Inputs: New Collections (e.g., data collected from the field, new model output);
3. Data Inputs: Software or Other Needs; and
4. Data Outputs: Expected deliverables, datasets, and products.

Information should be provided for each element as appropriate. If a section is not applicable (e.g., no new data are being collected), that should be noted and no additional information is necessary. If information is not known at the level of detail requested, as much information as possible should be provided. For multiple inputs and outputs, the data description table should be copied as many times as needed so that each table represents one input or one output.

The information provided in the DMP will be reviewed as part of the proposal process. When proposals are funded, information in the proposal will be used to create initial records for the project in ScienceBase, the NCCWSC and CSC project tracking tool and data repository. Additionally, a CSC Data Steward will contact the principal investigator of each funded project to assist with the development of the DMP.

DATA MANAGEMENT PLAN

Notes:

1. Fields shaded gray are not required during the proposal stage of the DMP. If a project is funded, all fields are required.
2. Each input, output, or software item should have its own table. Make a copy of the table for each distinct data input, data output, or software item.

Project Title: [Insert Project Title]

Data Inputs: Existing Collections

1 [Name of Collection]

- Describe the information that will be used and the nature and scale (e.g., national, regional, landscape, etc.) of the data. Include a link to the source of the existing data.
- Identify the formats in which the data are maintained and made available.
- Specify the procedures used to evaluate the existing data, including verification, validation, and an assessment of usability.
- Source: Identify the source for the data.
- Describe any data processing steps or provide a scientific workflow you plan to use to manipulate the data, as appropriate.

- Backup & Storage: Describe the approach for backup and storage of the information associated with the research project during the project.
- Volume Estimate: Estimate the volume of information that will be generated: MB, GB, TB, or PB.
- Prior to the completion of the project, specify who should have access to project information/products and what type of access (Public, Read, Write, No Access).
- Identify any limitations on access or reuse (e.g., sensitive data, restricted data, software with license restrictions, etc.) and provide justification for the restriction. Provide a citation or documentation describing the limitations if they are due to policies or legal reasons.
- Identify any fees associated with acquiring the data.
- Provide a citation for the data product. If the data product can be found online, provide a URL.

Data Inputs: New Collections (data that do not currently exist; for example, a new field data collection)

1 [Provide a brief name to describe new data collection]

- Describe the information that will be used and the nature and scale (e.g., national, regional, landscape, etc.) of the data that will be collected.
- Data Management Resources: Describe the proposal resources allocated for data management activities for the new data collected as a level of effort, total dollars allocated, or a percentage of the total project's cost. Resources could include people's time or proposal funding.
- Identify the formats in which the data will be generated, maintained, and made available.
- Describe data processing steps or provide a scientific workflow you plan to use to manipulate the data, as appropriate.
- Protocols: Identify any standard protocols or methodologies that will be used to collect the data, if available.
- Specify the procedures for ensuring data quality.
- Metadata: Identify the metadata standard that will be used to describe the data (FGDC, ISO, EML, etc.).
- Volume Estimate: Estimate the volume of information generated: MB, GB, TB, or PB.
- Backup & Storage: Describe the approach for backup and storage of the information associated with the research project during the project.
- Repository for Data: In addition to the NCCWSC repository (ScienceBase), identify any other repositories where you plan to share your data.
- Prior to the completion of the project, specify who should have access to project information/products and what type of access (Public, Read, Write, No Access).

- Exclusive Use: Project data and associated products should be publicly available at the end of the project. If access needs to be limited for a period of time after project completion, please identify the length of time and the reason for the extension. (The request cannot be for more than two years.)
- Identify any limitations on access or reuse (e.g., sensitive data, restricted data, software with license restrictions, etc.) and provide justification for the restriction. Provide a citation or documentation describing the limitations if they are due to policies or legal reasons.
- Specify how the project's data should be cited.
- Contact: Provide a point (or points) of contact for questions related to the data and associated products (name, email, and phone number).

Software and Other Needs

1 [Name of Software or Other Need]

- Describe any software or other needs that are required for the project. Common software such as Microsoft Office, Adobe, and an internet browser does not need to be listed.
- Identify any limitations on access or reuse that accompany the software or other needed items.
- Identify any fees or other costs associated with acquiring the software or other items.
- Source/Link: Provide a link or a source for the need, if available.

Data Outputs (e.g., Project Deliverables or Products)

1 [Name of Output]

- Describe the data output.
- Data Management Resources: Describe the proposal resources allocated for data management activities for the new data collected as a level of effort, total dollars allocated, or a percentage of the total project's cost. Resources could include people's time or proposal funding.
- Identify the formats in which the data will be generated, maintained, and made available.
- Describe data processing steps or provide a scientific workflow you plan to use to manipulate the data, as appropriate.
- Specify the procedures for ensuring data quality during the project.
- Metadata: Identify the metadata standard that will be used to describe the data and products (FGDC, ISO, EML, etc.).

- Volume Estimate: Estimate the volume of information generated: MB, GB, TB, or PB.
- Backup & Storage: Describe the approach for backup and storage of the information associated with the research project during the project.
- Repository for Data: In addition to the NCCWSC repository (ScienceBase), identify any other repositories where you plan to share your data.
- Prior to the completion of the project, specify who should have access to project information/products and what type of access (Public, Read, Write, No Access).
- Exclusive Use: Project data and associated products should be publicly available at the end of the project. If access needs to be limited for a period of time after project completion, please identify the length of time and the reason for the extension. (The request cannot be for more than two years.)
- Identify any limitations on access or reuse (e.g., sensitive data, restricted data, software with license restrictions, etc.) and provide justification for the restriction. Provide a citation or documentation describing the limitations if they are due to policies or legal reasons.
- Specify how the project's data should be cited.
- Digital Object Identifier (DOI)/Link: Provide a digital object identifier (DOI) or link to the project when it is publicly available.
- Contact: Provide a point (or points) of contact for questions related to the data and associated products (name, email, and phone number).

DATA MANAGEMENT PLAN EXAMPLE

Note: This is an example only.

Project Title: Wyoming Landscape Conservation Initiative Integrated Assessment

Data Inputs: Existing Collections

1 Wyoming Geographic Information Science Center

- This is a collection of over 100 natural resource geospatial datasets.
- Shapefile (vector) and GeoTIFF (raster).
- Reviewed metadata. Checked that calculations were correct. Checked that values were in reasonable ranges.
- Source: Various data contributors, including land management agencies (e.g., BLM), the University of Wyoming, and local state agencies.
- Data download and rasterizing.
- Backup & Storage: Handled by the Wyoming Geographic Information Science Center.

- Volume Estimate: 200 GB
- Data distributed by the Wyoming Geographic Information Science Center.
- Data are publicly available with no restrictions.
- No fees.
- Wyoming GISC Data Server, http://wygl.wygisc.org/dataserver/

2 Bureau of Land Management Disturbance Tracking System

- Disturbance and reclamation areas associated with energy development on Bureau of Land Management areas.
- Shapefiles.
- Source: Wyoming BLM State Office.
- Data download and rasterizing.
- Backup & Storage: Handled by BLM.
- Volume Estimate: 10 GB
- Data distributed by BLM.
- No restrictions.
- No fees.
- Not known.

Data Inputs: New Collections

1 Locations of Yermo xanthocephalus Dorn (Desert Yellowhead)

- Map the locations of desert yellowhead on DOI-managed lands in the state of Wyoming.
- Data Management Resources: 50% of a graduate student's time.
- Gathered in field notebook. Locations entered into ArcGIS.
- Scientific Workflows: None.
- Protocols: Following the California Natural Resource Agency protocols, http://www.dfg.ca.gov/biogeodata/cnddb/pdfs/protocols_for_surveying_and_evaluating_impacts.pdf

- Review of data entered into ESRI. Review of map output to make sure the information is accurate.
- Metadata: ISO 19115
- Volume Estimate: MB
- Backup & Storage: During the project, the information will be stored on a network file drive.
- Repository for Data: ScienceBase only.
- Access should be limited to the project team during the project. At the completion of the project, information can be made publicly available.
- Exclusive Use: No exclusive use period needed.
- No limitation.
- Standards are fine.
- Contact: Bob Smith, bsmith@gov.gov

Software and Other Needs

1 ArcGIS

- Geographic information software for data processing.
- Proprietary/licensed software.
- USGS provides ArcGIS to USGS staff under an enterprise license agreement. All spatial processing can also be carried out using open source geospatial tools.
- Source/Link: http://www.esri.com

2 Python

- Scripting language.
- Open source software.
- No fees.
- Source/Link: http://www.python.org
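Each table asks for a volume estimate expressed in MB, GB, TB, or PB. For data that already sit on disk, one way to arrive at such a figure is to total the file sizes under a project directory. The sketch below is illustrative only: the helper names `format_volume` and `directory_volume` are hypothetical, and the use of decimal units (1 GB = 1000 MB) is an assumption, since the guidance does not say whether decimal or binary units are intended.

```python
from pathlib import Path

# Assumption: decimal (SI) units, where each step is a factor of 1000.
UNITS = ["B", "KB", "MB", "GB", "TB", "PB"]


def format_volume(num_bytes: int) -> str:
    """Express a byte count in the largest unit that keeps the value below 1000."""
    size = float(num_bytes)
    for unit in UNITS:
        if size < 1000 or unit == UNITS[-1]:
            return f"{size:.1f} {unit}"
        size /= 1000


def directory_volume(root: str) -> int:
    """Total size in bytes of all regular files under `root` (searched recursively)."""
    return sum(p.stat().st_size for p in Path(root).rglob("*") if p.is_file())


if __name__ == "__main__":
    # Reports the volume of the current working directory as an example.
    print(format_volume(directory_volume(".")))
```

For new collections and outputs that do not exist yet, the estimate has to come from expected file counts and typical file sizes rather than from a directory scan.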

Data Outputs (e.g., Project Deliverables or Products)

1 WLCI Integrated Assessment

- The Integrated Assessment will result in a set of ESRI grids, shapefiles, and geodatabases that will allow land managers to assess the resource value associated with a cell or polygon. These assessments will cover the WLCI area, the southwest portion of the state of Wyoming.
- Data Management Resources: The project will provide a total of $100K for data management for FY 11/12. This includes data storage, data organization support, integration, and custom presentation utilities, along with data protection, storage, backup, and replication.
- Final products will include GeoTIFFs, shapefiles, and Python scripts (.py files).
- Sets of GIS data collected through collaborative efforts are partitioned into four categories (resource, condition, change agent, and future agent). The datasets are converted to 30 m grids with a common origin and then categorically combined. The values of each component grid are normalized, the combined grid is converted to an index value, and the four categorical grids are combined. Custom Python scripts provide geoprocessing for resource assessment.
- The project provides a quality assurance team to validate all interim and final products. ScienceBase provides periodic checksum validation of all repository items, audits all inputs, and provides reviewable logs.
- Metadata: FGDC
- Volume Estimate: The final volume (source, interim, and final datasets, metadata, and processing scripts) will comprise less than 1 TB.
- Backup & Storage: The project uses ScienceBase for storage, replication, backup, and long-term repository. This includes both interim and final products.
- Repository: ScienceBase will be the repository for data and map information. The USGS Publications Warehouse will hold the publications.
- Prior to the completion of the project, only USGS staff have access to the data. The USGS requires review and approval of a publication or data series prior to wider distribution.
- Exclusive Use: All final and interim project datasets will be available after completion of the data series publication. This will be complete at the conclusion of the project.
- All interim and final products will be released without restriction. Source data contain both proprietary and sensitive data.
- The USGS Library provides citation information.
- Digital Object Identifier (DOI)/Link: The USGS Library provides DOIs/links for project products.
- Contact: Sarah Doe, sdoe@agency.gov
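The example above mentions periodic checksum validation of repository items. A project team can apply the same idea locally before handing data to a repository. The following is a minimal sketch, not a description of how ScienceBase works: the function names are hypothetical, and SHA-256 is an assumed choice of checksum algorithm.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of_file(path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root) -> dict:
    """Map each file's path (relative to `root`) to its SHA-256 checksum."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): sha256_of_file(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_files(root, manifest: dict) -> list:
    """List files that were added, removed, or modified since `manifest` was built."""
    current = build_manifest(root)
    names = set(manifest) | set(current)
    return sorted(n for n in names if manifest.get(n) != current.get(n))


if __name__ == "__main__":
    # Small self-contained demonstration using a temporary directory.
    with tempfile.TemporaryDirectory() as tmp:
        (Path(tmp) / "grid.txt").write_text("1 2 3\n")
        manifest = build_manifest(tmp)
        (Path(tmp) / "grid.txt").write_text("1 2 4\n")  # simulate a silent change
        print(changed_files(tmp, manifest))
```

Storing the manifest alongside the data lets anyone who later downloads the files confirm that they match what the project delivered.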