Geospatial Data Stewardship at an Interdisciplinary Data Center



Similar documents
A GIS helps you answer questions and solve problems by looking at your data in a way that is quickly understood and easily shared.

REACCH PNA Data Management Plan

ES341 Overview of key file formats and file extensions in ArcGIS

Assessing a Scientific Data Center as a Trustworthy Digital Repository

Virginia Commonwealth University Rice Rivers Center Data Management Plan

How To Write An Nccwsc/Csc Data Management Plan

NATIONAL CLIMATE CHANGE & WILDLIFE SCIENCE CENTER & CLIMATE SCIENCE CENTERS DATA MANAGEMENT PLAN GUIDANCE

Research Data Archival Guidelines

There are various ways to find data using the Hennepin County GIS Open Data site:

Geospatial Data Archiving

DATA SHARING AND SPATIAL QUERY

Smithsonian Institution Archives Guidance Update SIA. ELECTRONIC RECORDS Recommendations for Preservation Formats. November 2004 SIA_EREC_04_03

Interagency Science Working Group. National Archives and Records Administration

11.5 E-THESIS SUBMISSION PROCEDURE (RESEARCH DEGREES)

CDI SSF Category 1: Management, Policy and Standards

WFP Liberia Country Office

Introduction to GIS.

INTRODUCTION TO ARCGIS SOFTWARE

GIS Databases With focused on ArcSDE

North Carolina Digital Preservation Policy. April 2014

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire

How To Manage Research Data At Columbia

An Esri White Paper August 2010 Product Library in Esri Aeronautical Solution: Enabling Seamless Product, Data, and Document Management

Enterprise GIS Solutions to GIS Data Dissemination

Lecture 8. Online GIS

Nevada NSF EPSCoR Track 1 Data Management Plan

Guidelines on Information Deliverables for Research Projects in Grand Canyon National Park

WMO Climate Database Management System Evaluation Criteria

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Choosing the right GIS framework for an informed Enterprise Web GIS Solution

Standard 5: Use a consistent data management framework in accordance with internal and partner organization data standards.

J9.6 GIS TOOLS FOR VISUALIZATION AND ANALYSIS OF NEXRAD RADAR (WSR-88D) ARCHIVED DATA AT THE NATIONAL CLIMATIC DATA CENTER

Data access and management

Creating a More Resilient Future. Friday 30 May, 11:00 to 12:30, Rooms S29-31

User s Guide to ArcView 3.3 for Land Use Planners in Puttalam District

The premier software for extracting information from geospatial imagery.

Bradford Scholars Digital Preservation Policy

Organization of VizieR's Catalogs Archival

Project Title: Project PI(s) (who is doing the work; contact Project Coordinator (contact information): information):

The ORIENTGATE data platform

GRADUATE CERTIFICATE PROGRAM

Glossary of terms used in the survey

ArcGIS Data Models Practical Templates for Implementing GIS Projects

Electronic Records Management Guidelines - File Formats

New challenges of water resources management: Title the future role of CHy

Topic: Receiving and Responding to CBP Forms

PEERNET File Conversion Center 6.0

An Esri White Paper October 2010 Esri Production Mapping Product Library: Spatially Enabled Document Management System

Creating Maps in QGIS: A Quick Guide

VGIS HANDBOOK PART 2 - STANDARDS SECTION D METADATA STANDARD

Statement of Qualifications

File Formats. Summary

Environment and Natural Resources Trust Fund 2016 Request for Proposals (RFP)

desert conservation program Data Management Guidelines

Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS.

CIESIN Columbia University

Data-Intensive Science and Scientific Data Infrastructure

How To Write A File System On A Microsoft Office (Windows) (Windows 2.3) (For Windows 2) (Minorode) (Orchestra) (Powerpoint) (Xls) (

Mapping Mashup/Data Integration Development Resources Teaching with Google Earth and Google Ocean Stone Lab August 13, 2010

Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network

NatureServe s Environmental Review Tool

USGS Guidelines for the Preservation of Digital Scientific Data

An Esri White Paper June 2011 ArcGIS for INSPIRE

PRESERVATION NEEDS ASSESSMENT PRESERVATION 101

(Geo)database and Data Management

OCLC Digital Archive Preservation Policy and Supporting Documentation Last Revised: 8 August 2006

Data Publishing Workflows with Dataverse

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)

My Account User Guide. Popfax.com login page. Easy, inexpensive Effective!

ArcGIS. Server. A Complete and Integrated Server GIS

Points to Note. Chinese and English characters shall be coded in ISO/IEC 10646:2011, and the set of Chinese

AHDS Digital Preservation Glossary

DIVER (Data Integration Visualization Exploration and Reporting)

The Spatial Data Standards for Facilities, Infrastructure, and Environment Online (SDSFIE Online) Web Site.

Figure 2: System Flow Diagram for Workflow Management

Transcription:

Geospatial Data Stewardship at an Interdisciplinary Data Center Robert R. Downs, PhD Senior Digital Archivist and Senior Staff Associate Officer or Research Acting Head of Cyberinfrastructure and Informatics Research and Development Center for International Earth Science Information Network (CIESIN) The Earth Institute, Columbia University Prepared for presentation to the Geospatial Summit on Framing a National Preservation and Access Strategy for Geospatial Data November 12-13, 2009 Library of Congress

Geospatial Activities at CIESIN Examples of Current Projects NASA Socioeconomic Data and Applications Center World Data Center for Human Interactions in the Environment Northeast Information Node for the National Biological Information Infrastructure (NBII) Cyberinfrastructure for the Earth Institute (CI4EI) Africa Soil Information Service (AfSIS) Digital Soils Map Project Millennium Villages Polar Information Commons Examples of Previous Projects Climate Change Information Resources for the New York Metropolitan Region (CCIR-NYC) Global Natural Disaster Hotspots Managing and Preserving Geospatial Electronic Records 2

CIESIN Geospatial Data Lifecycle Activities Development of Data Products and Services Data creation, enhancement, and description; map creation; facilitating data analysis Archiving of Data and Research-Related Information Data, data products, scientific reports, maps, guides, methodological descriptions, documentation Data Dissemination Enabling and supporting use of data by interdisciplinary scientific, educational, and decision-making communities Long-Term Archiving Establishing a Long-Term Archive for the NASA Socioeconomic Data and Applications Center (SEDAC) in collaboration with the Columbia University Libraries and the Earth Institute of Columbia University 3

SEDAC Data Workflow 4

Selection Criteria for LTA Data Appraisal Scientific or Historical Value citation, research, and educational use as published in refereed scientific publications/reports from recognized committee of scientists Potential Usability and Use evidence of usability, usefulness, and sufficient usage by the community interested in human dimensions of the environment. Adequate evidence indicate potential for future use justifies costs of long-term archiving Uniqueness of Data (non-redundant stewardship) not being preserved in any form in another archive and is at risk of loss if not accessioned into the Long-Term Archive Relevance to LTA Mission currently endorsed or approved by community interested in human interactions in the environment. For the short-term, relevance includes content germane to SEDAC mission and SEDAC strategic plan Documented for Accessibility completeness and correctness of documentation to facilitate future discovery, access, and use Technological Accessibility (feasibility) received in format meeting technical criteria for the Service Level designated for the resource Legality and Confidentiality unrestricted permissions for preservation and future dissemination. No information that is confidential or prohibited from dissemination Non-Replicability data replication not feasible, excessively costly or prohibitive

Interdisciplinary Data and Use Data Holdings Represent Various Kinds of Data GIS, Maps, and Remote Sensing Data Model data and Programs Demographic and Survey Data Examples of Diverse Publications Citing Data Holdings Agriculture, Ecosystems & Environment; American Journal of Public Health; Atmospheric Research; Biodiversity and Conservation; Biological Conservation; Bioscience; Climatic Change; Conservation Biology; Ecological Economics; Ecological Indicators; Ecology Letters; Ecosystems; Energy Economics; Energy Policy; Environment, Global Environmental Change; Hydrological Processes; Journal of Risk Research; Marine Policy; National Geographic; Nature; Science; Sustainability Science; The Bulletin of the Atomic Scientists; The Journal of Environment Development; The Journal of Geology; The Lancet Oncology; Trends in Parasitology; Water Policy; Water Resources Management 6

Examples of Formats for Archived Data GIS, Remote Sensing, and Map Imagery: ESRI ArcGIS, ArcInfo Workspace, American Standard Code for Information Interchange (ASCII), GRID, ArcIMS XML (axl), ArcView Theme Legend (avl), Shapefile Attribute Format (dbf), Shapefile Projection (prj), Shapefile Spatial Index (sbn and sbx), Shapefile Shape (shp), Header (hdr), Band Interleaved by Line (bil), Tagged Image File and GeoTiff (tif), Graphics Interchange Format (gif), personal Geodatabase (mdb),comma-separated Values (csv), Joint Photographic Experts Group (jpg), Microsoft Excel (xls), Adobe Portable Document Format (pdf), Microsoft Word (doc), Powerpoint (ppt), Hypertext Markup Language (html and html), JavaScript Source Code (js), Extensible Markup Language (xml) Modeling: ASCII Byte (byt), ASCII Output (out), ASCII Data (dat), ASCII Text (txt) Btreive Unformatted (unf), Unix Shell Script (sh), Scene (scen), Initialization (in, ini & init), Model Parameter (par), Configuration (con), Control (ctl), Workbook (wk1), Fortran (for), Executable (run), Gemstone (gs), Word (doc) Tabular: Comma-Separated Values (csv), Excel spreadsheet (xls), Word (doc), Unstructured : Word (doc), Text (txt), Adobe Portable Document Format (pdf), Hypertext Markup Language (html and html), 7

Example Formats of Additional Materials Archived Documentation: Word (doc), Portable Document Format (pdf), Text (txt), Word doc, (pdf) Metadata Extensible Markup Language (xml), Hypertext Markup Language (html) Permissions Portable Document Format (pdf), Text (txt), Also received as email, fax, or signed original Fixity Information: Text (txt), Extensible Markup Language (xml) 8

Archival Evolution Implemented VITAL / Fedora digital repository Currently operating VITAL 3.1.1, Fedora 2.2, VALET 1.1.3 Duplicated in failover system and nightly backup copies Managing Multiple Collections Migrating from traditional digital data archiving procedures to managing collections in digital repository Parallel ingest for selected data sets Migration of selected archived data sets to digital repository Continuing traditional archival operations onsite and offsite

Digital Repository Collections 10

Digital Repository Development Conducting a Self-Assessment of LTA Collection Based on the Trusted Repositories Audit and Certification: Criteria and Checklist (TRAC) Developing and testing online submission services Enabling self-submission by data producers Assessing schemas for provenance of pre-ingest activities (PREMIS Event, DDI Archive) Customizing pre-ingest workflow Establishing processes for online review and approval Enabling access control Defining privileges for collections, objects, and datastreams Developing capabilities to test transfer between repositories Collaborating with Columbia University Libraries digital repository