NASA SPACE SCIENCE DATA COORDINATED ARCHIVE
ARCHIVE PLAN FOR 2014-2015




Ed Grayzeck
NASA Space Science Data Coordinated Archive
Greenbelt, Maryland 20771
2014-03-31

ABSTRACT

This archive plan shows that the NASA Space Science Data Coordinated Archive (NSSDCA) expects to accept ~400 TB of data into the archive in 2014 and ~600 TB in 2015.

1. INTRODUCTION

NSSDCA provides a vital service as NASA's permanent multi-disciplinary Space Science archive. Its curation activities are essential to ensure that space science data will continue to be available and usable into the indefinite future. The need for long-term curation arises because in most cases the full value of any set of data cannot be known in advance. New science discoveries or changes in research and exploration priorities may make older data, seldom noticed before, suddenly highly relevant.

This archive plan summarizes the expected data inflow to NSSDCA (note the Acronym list at the end of this document) for the years 2014-2015. These are estimates for planning purposes, not exact data projections. This plan succeeds earlier plans, each of which covered 3-4 years and was updated at that same interval. With this version we have chosen to plan for two years at a time and update the plan annually, so the estimates should be more accurate and more relevant for planning.

1.1 Levels of Service

NSSDCA accepts and archives data under four levels of service, summarized in Table 1 below. The most familiar is the Permanent Archiving of data, but, as defined in MOUs with various data providers, NSSDCA also provides Backup service, mostly for other Archives. The Analog Archive includes photos, maps, microfilm, microfiche, documents, etc.; some of these are analog copies of digital data and others are supporting metadata. It is included in this list for completeness.

Table 1. NSSDCA Archival Storage Services

Permanent Archive: AIPs
    Preservation of digital data in Archival Information Packages delivered by a data producer or created at NSSDCA. AIPs are re-written to new media within six years. Data is disseminated by NSSDCA if not available through an active archive or per MOU.

Permanent Archive: non-AIP digital data
    Preservation of non-packaged data on various media types. Data will eventually be migrated from legacy media to AIPs, though no media refresh will be made in the meantime. Data is disseminated by NSSDCA if not available through an active archive or per MOU.

Backup
    Storage of digital data at a climate-controlled off-site facility to support another archive's contingency plan per MOU. Data will not be disseminated by NSSDCA.

Analog Archive
    Preservation of analog data on a variety of media with selected refreshment and selected digitization. Selected retention of original analog data after digitization. Data are copied and disseminated by NSSDCA.

Given the prevalence of incoming data from the PDS nodes and subnodes for 2014-15, we have reorganized Table 2 by data contributors rather than by missions as in previous plans.

1.2 Archive Information Packages (AIPs)

As Table 1 shows, NSSDCA's permanent archive consists of digital data stored either as AIPs or as non-AIP data. The non-AIP digital data is stored on off-line media and tracked by the media on which it resides. The portion of the data stored near-line in LTO jukeboxes has been growing since 2000 and includes all new data inflows received via electronic transfer, plus some legacy data collections; it is notable not because of its media, but because those data are stored on LTOs as AIPs.

An Archive Information Package (AIP) is a single file container that holds one or many science data files, a number of attributes about each file that help NSSDCA manage its AIPs, and pointers to all of the supporting documentation, including calibration information. Ideally this is enough information to allow a user to utilize the data independently of the archive and the original producer of the data. No reformatting of the science data files is performed unless record boundaries need to be retained and are not already in the byte stream. Any files that are transformed may be returned to their original state using the NSSDCA-defined attributes. Additionally, AIPs are media independent and platform independent, making AIPs the preferred means of delivery and storage. In the long term, most of the non-AIP data in the permanent archive is planned to be converted to AIPs.

1.3 Active Archives

NASA has established a set of Active Archives, which receive data from missions and provide electronic access to the missions' data, along with documentation and tools for accessing and using the data. NSSDCA's mission is to accept data from the Active Archives, or sometimes directly from missions, and then provide long-term curation of the data. This is a critical service, since the full value of any set of data cannot be known in advance. New science discoveries or changes in research and exploration priorities may make older data, seldom requested, suddenly highly relevant.
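
To make the AIP description in Section 1.2 more concrete, the sketch below models a package as one container holding science data files, per-file attributes, and pointers to supporting documentation. It is only an illustration under stated assumptions: the class names, the attribute names (size_bytes, sha256), the use of a checksum, and the example file and pointer strings are all hypothetical, and none of it represents the actual NSSDCA AIP format, which this plan does not specify.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class FileEntry:
    """One science data file plus management attributes (hypothetical names)."""
    name: str
    payload: bytes
    attributes: dict = field(default_factory=dict)


@dataclass
class ArchivePackage:
    """Illustrative single-container package; not the real NSSDCA AIP layout."""
    package_id: str
    files: list = field(default_factory=list)
    documentation_pointers: list = field(default_factory=list)

    def add_file(self, name: str, payload: bytes) -> None:
        # Store the bytes unchanged and record attributes that help manage them.
        # Size and SHA-256 are assumed attributes chosen for this sketch.
        attrs = {
            "size_bytes": len(payload),
            "sha256": hashlib.sha256(payload).hexdigest(),
        }
        self.files.append(FileEntry(name, payload, attrs))

    def verify(self) -> bool:
        # Check that every stored file still matches its recorded attributes.
        return all(
            len(f.payload) == f.attributes["size_bytes"]
            and hashlib.sha256(f.payload).hexdigest() == f.attributes["sha256"]
            for f in self.files
        )


# Usage with made-up identifiers and contents.
package = ArchivePackage(package_id="example-0001")
package.add_file("spectrum.dat", b"\x00\x01\x02")
package.documentation_pointers.append("calibration-notes (hypothetical pointer)")
package_ok = package.verify()
```

Keeping the attributes alongside each file, rather than in a separate catalog, mirrors the idea that a package should be usable independently of the archive that holds it.
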
2.0 ARCHIVE PLAN

The revised, detailed Archive Plan for NSSDCA for 2014-2015 is given below in Table 2. Table 2 lists the node/archive/mission and the estimated data volume to be delivered each year. Also included are the level of service (Permanent Archive, with or without AIPs, or Backup) defined by MOU for each data collection and the discipline (Astrophysics, Heliophysics, Planetary & Lunar) for each. For archives which require Backup service, the data volumes expected from individual missions are combined and listed in the table by the name of the archive, i.e. HEASARC, IRSA, MAST, PDS, and SPDF.

The totals in Table 2 show that NSSDCA is planning for ~400 TB of data arriving at the archive in 2014 and ~600 TB in 2015. The greatest data deliveries expected are those from the PDS Imaging Node, which is archiving data from the Lunar (LRO) and Mars (MRO) Reconnaissance Orbiters. The summary of the Table 2 entries by level of service and by discipline is given in Tables 3a and 3b, respectively. Clearly, planetary missions dominate; their coming contribution to the NSSDCA is estimated to be over 800 TB in 2014-15.

TABLE 2. Summary of data expected at NSSDCA, 2014-2015.

                  Service                  Expected Data Volume (TB)
Project           Level*     Discipline+   2014      2015      Total

PDS Nodes
  PDS_ATM         A          P             3         4         7
  PDS_GEO         A          P             14        47        61
  PDS_IMG         A          P             230       400       630
  PDS_NAI         A          P             0.5       0.6       1.1
  PDS_PPI         A          P             4         4         8
  PDS_PSI         A          P             60        40        100
  PDS_RINGS       A          P             0.6       0.4       1.0
  PDS_SBN         A          P             1         2         3

Missions
  FERMI           B          A             7         7         14
  RHESSI          B          H             1         1         2
  WIND/WAVES      B          H             <1        <1        <1
  WISE            B          A             0         0         0

Active Archives
  HEASARC         B          A             90        90        180
  IRSA            B          A             0         0         0
  MAST            B          A             <1        <1        <1
  SPDF            B          H             0         0         0

TOTALS                                     411       596       1008

*Service Levels: A = Permanent Archive (AIP or non-AIP); B = Backup.
+Discipline: A = Astrophysics; H = Heliophysics; P = Planetary & Lunar.

Note: the source table labels the volume columns "GB", but the totals match the ~400 TB and ~600 TB figures in the text and the TB figures in Tables 3a and 3b, so the unit is given here as TB.

TABLE 3a

Service Level         TB (2014-2015)
Permanent Archive     811
Backup                196

TABLE 3b

Discipline            TB (2014-2015)
Astrophysics          194
Heliophysics          2
Planetary & Lunar     811
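
The summaries in Tables 3a and 3b can be checked against the Table 2 rows with a short aggregation. The sketch below transcribes the rows in the table's own units and sums them by service level and by discipline. Treating the "<1" entries as 0 is an assumption made here for the arithmetic, and the function and variable names are illustrative only.

```python
# Rows transcribed from Table 2:
# (project, service level, discipline, 2014 volume, 2015 volume).
# Entries printed as "<1" are treated as 0 (an assumption of this sketch).
ROWS = [
    ("PDS_ATM", "A", "P", 3, 4),
    ("PDS_GEO", "A", "P", 14, 47),
    ("PDS_IMG", "A", "P", 230, 400),
    ("PDS_NAI", "A", "P", 0.5, 0.6),
    ("PDS_PPI", "A", "P", 4, 4),
    ("PDS_PSI", "A", "P", 60, 40),
    ("PDS_RINGS", "A", "P", 0.6, 0.4),
    ("PDS_SBN", "A", "P", 1, 2),
    ("FERMI", "B", "A", 7, 7),
    ("RHESSI", "B", "H", 1, 1),
    ("WIND/WAVES", "B", "H", 0, 0),
    ("WISE", "B", "A", 0, 0),
    ("HEASARC", "B", "A", 90, 90),
    ("IRSA", "B", "A", 0, 0),
    ("MAST", "B", "A", 0, 0),
    ("SPDF", "B", "H", 0, 0),
]


def totals_by(rows, key_index):
    """Sum the 2014 and 2015 volumes, grouped by the column at key_index."""
    totals = {}
    for row in rows:
        key = row[key_index]
        totals[key] = totals.get(key, 0.0) + row[3] + row[4]
    return totals


by_service = totals_by(ROWS, 1)      # keys "A" (Permanent Archive), "B" (Backup)
by_discipline = totals_by(ROWS, 2)   # keys "A", "H", "P"
total_2014 = sum(row[3] for row in ROWS)
total_2015 = sum(row[4] for row in ROWS)

for label, value in sorted(by_service.items()):
    print("service level", label, round(value))
for label, value in sorted(by_discipline.items()):
    print("discipline", label, round(value))
print("2014 total", round(total_2014), "2015 total", round(total_2015))
```

Rounded to whole units, these sums agree with the yearly totals in Table 2 and with the entries of Tables 3a and 3b. The printed grand total of 1008 is one unit above the sum of the rounded yearly totals, which is consistent with rounding and with the omitted "<1" entries.
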

Glossary

AIP          Archive Information Package
GB           Gigabyte
HEASARC      High Energy Astrophysics Science Archive Research Center
IRSA         Infrared Science Archive
MAST         Multi-mission Archive at Space Telescope Science Institute
NSSDC        National Space Science Data Center (now NSSDCA)
NSSDCA       NASA Space Science Data Coordinated Archive
PDS          Planetary Data System
PDS_ATM      PDS Atmospheres Node
PDS_GEO      PDS Geosciences Node
PDS_IMG      PDS Imaging Node
PDS_NAI      PDS Navigation and Ancillary Information Facility
PDS_PPI      PDS Planetary Plasma Interactions Node
PDS_PSI      PDS Planetary Science Institute (sub-node of Small Bodies)
PDS_RINGS    PDS Rings Node
PDS_SBN      PDS Small Bodies Node
RHESSI       Reuven Ramaty High Energy Solar Spectroscopic Imager
SPDF         Space Physics Data Facility
TB           Terabyte
WIND         NASA Spacecraft to study solar Wind (not an acronym)
WAVES        Plasma Waves instrument on WIND (not an acronym)
WISE         Wide-field Infrared Survey Explorer