Transfer Guidance Revision Project: Identifying Fit-for-Purpose Formats. Donald Chalfant Kevin DeVorsey Louise Guenther Kathryn Nielsen



Similar documents
Preservation Handbook

Standards Development. PROS 14/00x Specification 3: Long term preservation formats

Bradford Scholars Digital Preservation Policy

TEXT FILES. Format Description / Properties Usage and Archival Recommendations

Digital Preservation Guidance Note: Selecting File Formats for Long-Term Preservation

File Formats. Summary

Electronic Records Management Guidelines - File Formats

TROPICAL DATA HUB. Best Practice Guidelines for Research Data Management at JCU

College Archives Digital Preservation Policy. Created: October 2007 Last Updated: December 2012

How Xena performs file format identification

Smithsonian Institution Archives Guidance Update SIA. ELECTRONIC RECORDS Recommendations for Preservation Formats. November 2004 SIA_EREC_04_03

11.5 E-THESIS SUBMISSION PROCEDURE (RESEARCH DEGREES)

Digital Archiving at the National Archives of Australia: Putting Principles into Practice

Preservation Handbook

The Benefit of Experience: the first four years of digital archiving at the National Archives of Australia

Suitable file formats for transfer of digital records to The National Archives

Guidelines on Information Deliverables for Research Projects in Grand Canyon National Park

Introduction. 1. Name of your organisation: 2. Country (of your organisation): Page 2

TROLLing File Format Essentials

Digital Preservation Support Levels Support level 1: All reasonable actions to maintain usability will be taken. Actions may include migration,

Introduction to Research Data Management. Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016

Computers Are Your Future Eleventh Edition Chapter 5: Application Software: Tools for Productivity

Checklist and guidance for a Data Management Plan

It FITS the Cultural Heritage! Formats for Preservation: From Spatial Data to Cultural Resources

AHDS Digital Preservation Glossary

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire

Digital Formats: Factors for Sustainability, Functionality, and Quality

encoding compression encryption

Automated Workflow for the Ingest and Preservation of Electronic Journals. Evan Owens Chief Technology Officer Portico

Preserving music from LPs and tapes

Report of the Ad Hoc Committee for Development of a Standardized Tool for Encoding Archival Finding Aids

Digital Preservation. Guidance Note: Graphics File Formats

Digital Formats for Library of Congress Collections

AXF Archive exchange Format: Interchange & Interoperability for Operational Storage and Long-Term Preservation

Automatic Metadata Generation Use Cases

METADATA STANDARDS AND GUIDELINES RELEVANT TO DIGITAL AUDIO

Preferred formats. September 2015, version 3.0 >>>

ECM Governance Policies

From Questions to Answers: Outcomes from the Big Data Project

PRESERVATION NEEDS ASSESSMENT PRESERVATION 101

LIBER Case Study: Factors for Enabling Sharing and Reuse of Research Data Library and Archive Services at the University of Vienna

UK Data Archive Preservation Policy

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

MANAGING QUALITATIVE DATA

ASG-DIGITAL ARCHIVE FOR MEDIA AND ENTERTAINMENT

Geospatial Data Stewardship at an Interdisciplinary Data Center

Newspaper Digitization Brief Background

Innovative Seed Grant Data Management Plans. February 17, 2015

Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data

Overview of NDNP Technical Specifications

Scanning and Tossing. Requirements for Scanning and the Destruction of Paper Based Records

INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER

ERMS Solution BUILT ON SHAREPOINT 2013

Fetch TV My Media Hub Quick Start Guide For USB Devices

Preservation Handbook

DATA MANAGEMENT FOR QUALITATIVE DATA USING NVIVO9

Digital Preservation in Theory and in Practice

PERFORMANCE ANALYSIS OF VIDEO FORMATS ENCODING IN CLOUD ENVIRONMENT

Archiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie.

Archive Storage Technologies Supporting Information Lifecycle Management

Digitisation Disposal Policy Toolkit

Managing research data and Horizon 2020

Long-term data storage in the media and entertainment industry: StrongBox LTFS NAS archive delivers 84% reduction in TCO

Interagency Science Working Group. National Archives and Records Administration

Adobe CQ Digital Asset Management

Data: Small and Big Data files and formats

Intro to Data Management. Chris Jordan Data Management and Collections Group Texas Advanced Computing Center

HearForm10. Office Management Software. Electronic Medical Records

Digital Asset Management. An Oracle White Paper Updated April 2007

Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014

Best practices for producing quality digital video files

Digital Dunhuang: : A Case Study for Digital Preservation and Digital Asset Management

Administrative Office of the Courts

Version 1.0 MCGILL UNIVERSITY SCANNING STANDARDS FOR ADMINISTRATIVE RECORDS. Summary

In addition, a decision should be made about the date range of the documents to be scanned. There are a number of options:

Where to use open source

How to use Adobe Media Encoder CS6

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway

In ediscovery and Litigation Support Repositories MPeterson, June 2009

How To Scan A Document

Digital Continuity Plan

There are many useful and free tools available for working with digital materials. Here are some I use...

Setting up Web Material. An introduction

Research Summary: Enterprise Content Management

Introduction to BrightSign, BrightAuthor, and BrightSign Network (BSN)

Managing Digital Assets for Profit

6. FINDINGS AND SUGGESTIONS

USGS Guidelines for the Preservation of Digital Scientific Data

Zarządzanie sieciami telekomunikacyjnymi

How To Write A File System On A Microsoft Office (Windows) (Windows 2.3) (For Windows 2) (Minorode) (Orchestra) (Powerpoint) (Xls) (

Digital Asset Management with Free and Open Tools

North Carolina Digital Preservation Policy. April 2014

Today s topics. Digital Computers. More on binary. Binary Digits (Bits)

Preserving French Scientific data

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley

Video Reformatting Equipment and Workflow Steps

Carol Palmer, Principal Product Manager, Oracle Corporation

On completion of your Research Degree, you are required to submit an electronic copy of your thesis in USIR.

Guidance. Data Management. Rural Economy and Land Use Programme Data Support Service

Transcription:

Transfer Guidance Revision Project: Identifying Fit-for-Purpose Formats Donald Chalfant Kevin DeVorsey Louise Guenther Kathryn Nielsen

Presentation Overview What We ve Learned: Ten years with the existing transfer guidance for federal agencies What are our goals? Technical White Papers. A new look. Metadata for transfer. Q&A 12/19/2012 Slide 2

Current Guidance (developed between 2002-2004): Reflect NARA s capabilities at the time. 12/19/2012 Slide 3

Current Guidance: limited scope that does not address all record/format types. 12/19/2012 Slide 4

Issues with current guidance products: Demonstrate a preference for standards based formats that agencies do not always use. Require that agencies transform to acceptable formats prior to transfer. Have proven an obstacle to NARA and agencies. 12/19/2012 Slide 5

Goals for revised guidance: Provide clear, concise, and consistent direction to agencies regarding formats that are acceptable for use when transferring records to NARA. Develop a flexible and extensible framework that can adapt to future needs. Balance preference for open formats with the business needs of agencies. 12/19/2012 Slide 6

Goals for revised guidance cont.: Match record behavior and performance to the correct file formats. Adapt to the change from tapes to ERA. Support a life-cycle approach from creation through to researcher access. 12/19/2012 Slide 7

Goals for revised guidance cont.: Expand the types of formats that NARA accepts. Acknowledge formats that are ubiquitous in the market place. Minimize the need for agencies to transform records prior to transfer. 12/19/2012 Slide 8

Where are we now? Revision project is almost ready for internal review. We ve identified the categories and file formats that agencies are using today. We ve conducted analysis of about 50 formats. We are compiling the results in the form of a revised guidance product. We are developing minimal metadata guidance. 12/19/2012 Slide 9

A Change in Approach We will identify formats that are: Preferred Acceptable Acceptable for Imminent Transfer (to sunset previously acceptable formats) Not Acceptable (this will be rare) 12/19/2012 Slide 10

Categories of E-records Digital Still Images Digital Moving Images Digital Audio Text Geospatial Records CAD Structured Data E-mail Web & Social Media 12/19/2012 Slide 11

Measuring sustainability*: Disclosure: the degree to which complete specifications and technical integrity tools exist. Adoption: the degree to which the format is used by creators, disseminators, or users. Transparency: the degree to which the digital representation is open to direct analysis with basic tools, including human readability using a text-only editor. Self-documentation: formats that contain all the metadata needed to render the data as usable information. External dependencies: refers to the degree to which a format depends on particular hardware, operating system, or software for rendering or use. Impact of patents: Patents related to a digital format may inhibit the ability of archival institutions to sustain content in that format. Technical protection mechanisms: To preserve digital content and provide service to users and designated communities decades hence, NARA must be able to replicate the content on new media, migrate and normalize it in the face of changing technology, and disseminate it to researchers. *adapted from http://www.digitalpreservation.gov/formats/ 12/19/2012 Slide 12

Measuring Sustainability. Analyzing format resources. ISO/ANSI and other specifications Corporate sites LC Format Sustainability of Digital Formats TNA-UK PRONOM Format Registry CDL/LC UDFR Wikipedia 12/19/2012 Slide 13

Determining Fit for Purpose Formats. The conventional wisdom is not always correct. Open formats may not always be the best choice. Proprietary formats aren t always so bad. 12/19/2012 Slide 14

12/19/2012 Slide 15

12/19/2012 Slide 16

Preferred Acceptable Imminent Transfer Structured Data ASCII Text CSV, Comma Separated Values (RFC 4180) XML, Extensible Markup Language DBF, dbase Table File Format HDF5, Hierarchical Data Format Version 5 CDF, Common Data Format EBCDIC Digital Audio WAVE_LPCM_BWF, Broadcast WAVE Audio File Format WAVE, WAVE Audio File Format" FLAC_1_1_2, FLAC (Free Lossless Audio Codec), Version 1.1.2 Ogg, Ogg File Format MP3_ENC, MP3 Audio Encoding (MPEG Layer III Audio Encoding) AIFF, Audio Interchange File Format Digital Still Images TIFF (Tagged Image File Format) GIF 89a Format BMP Format JPEG 2000 Part 1 Format (JP2) DNG Format JFIF/JPEG Format TARGA Format 12/19/2012 Slide 17

Metadata for Transfer The types of electronic records in use have changed and so have our needs. There are more metadata rich formats. We currently ask for indexes and system documentation. More and more this exists as metadata that we can use to help automate processes. 12/19/2012 Slide 18

What are you using? We would like to work with you to identify: Existing standards that you are using. Your capabilities to export metadata that you use to search and retrieve e-records. A simplistic set of metadata appropriate for transfer. 12/19/2012 Slide 19

Kevin De Vorsey Electronic Records Format Specialist Policy Analysis and Enforcement Office of the Chief Records Officer Agency Services National Archives and Records Administration kevin.devorsey@nara.gov 212-401-1631 12/19/2012 20