Smithsonian Institution Archives Email Preservation Tools



Similar documents
THE COLLABORATIVE ELECTRONIC RECORDS PROJECT SUMMARY

EMCAP Pilot User Guide For Microsoft Outlook 2003

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Electronic Records Management Strategy

Long-term archiving and preservation planning

Institute for Advanced Study Shelby White and Leon Levy Archives Center

Archiving/Retention Frequently Asked Questions (FAQ)

Novell GroupWise Microsoft Exchange/Outlook (PST)

Chapter WAC PRESERVATION OF ELECTRONIC PUBLIC RECORDS

The State Records Act and Electronic Records. Kris Stenson Electronic Records Archivist Illinois State Archives

Electronic documents questionnaire

POLICY AND GUIDELINES FOR THE MANAGEMENT OF ELECTRONIC RECORDS INCLUDING ELECTRONIC MAIL ( ) SYSTEMS

Microsoft Exchange/Outlook (PST) Office 365

Interagency Science Working Group. National Archives and Records Administration

Information Management Advice 18 - Managing records in business systems: Overview

Current and Archiving Practices in the Enterprise an Osterman Research research summary

2.2 INFORMATION SERVICES Documentation of computer services, computer system management, and computer network management.

Student Office 365 Outlook Web App OWA Quick Guide. Getting you up to speed quickly.

USGS Guidelines for the Preservation of Digital Scientific Data

Scotland s Commissioner for Children and Young People Records Management Policy

System Requirements for Archiving Electronic Records PROS 99/007 Specification 1. Public Record Office Victoria

SEVENTH FRAMEWORK PROGRAMME THEME ICT Digital libraries and technology-enhanced learning

Lotus Notes (NSF) Microsoft Exchange/Outlook (PST)

Information Management Advice 18 - Managing records in business systems Part 1: Checklist for decommissioning business systems

Chapter 5: The DAITSS Archiving Process

Integration of Records Management and Digital Archiving : What Can We Do Today? Ann Keen, Preservica Nov

Electronic Recordkeeping

THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy Page 1 of 8

E-Discovery Technology Considerations

MIGRATING YOUR GROUPWISE ARCHIVE TO OUTLOOK

GFI White Paper: GFI FaxMaker and HIPAA compliance

Why archiving erecords influences the creation of erecords. Martin Stürzlinger scopepartner Vienna, Austria

STEPPING UP TO THE ELECTRONIC ARCHIVING CHALLENGE: OCLC S ROLE. Andrea Keyhani Director, Licensing & Publisher Relations

TITUS Data Security for Cloud Identify and Control Sensitive Data Sent to the Cloud

Smithsonian Institution Archives Guidance Update SIA. ELECTRONIC RECORDS Responsible Recordkeeping: Records. March 2007 SIA_EREC_03_07

Streamlining the drug development lifecycle with Adobe LiveCycle enterprise solutions

Harnessing Media Micro-Services for Stewardship of Digital Assets at CUNY TV. NDSR Project:

Why Upgrade to RightFax 10.5?

How To Preserve Records In Mississippi

Social Media Case Studies: Archiving Social Media: Mesolithic Online Resources (Mesolithic Miscellany and Mesolithic Research Forum).

In ediscovery and Litigation Support Repositories MPeterson, June 2009

Archival Data Format Requirements

Smithsonian Institution Archives Guidance Update SIA. ELECTRONIC RECORDS Recommendations for Preservation Formats. November 2004 SIA_EREC_04_03

Department of Veterans Affairs VA Directive 6311 VA E-DISCOVERY

What s New in Exchange 2010?

Microsoft Exchange 2013 Ultimate Bootcamp Your pathway to becoming a GREAT Exchange Administrator

Multiple Digital Content Types in a Single Collection. Dina Sokolova and Jane Gorjevsky, Columbia University

RecordPoint Overview

SharePoint 2013: Key feature improvements

ILM et Archivage Les solutions IBM

Affordable Digital Preservation for Libraries and Museums

Novell GroupWise Microsoft Exchange/Outlook (PST)

Challenges in digital preservation: Relational databases

Comprehensive Agentless Cloud Backup and Recovery Software for the Enterprise

System Requirements for Netmail Archive

Saskatchewan Records Management Guidelines

IceWarp to IceWarp Server Migration

Table of Contents. Chapter No. 1. Introduction Objective Use Compliance Definitions Roles and Responsibilities 2

CIP s Open Data & Data Management Guidelines and Procedures

SharePoint User Manual

TECHNICAL REFERENCE GUIDE

B a r r a c u d a M e s s a g e A r c h i v e r O u t l o o k A d d - I n U s e r G u i d e. V e r si on 3. 1

Protecting Business Information With A SharePoint Data Governance Model. TITUS White Paper

2007 to 2010 SharePoint Migration - Take Time to Reorganize

How to successfully migrate from GoogleApps

IBM DB2 CommonStore for Lotus Domino, Version 8.3

Record Management in SharePoint

Brent R. Holladay Chief Deputy, Information Resources Lake County Clerk of Courts

Ex Libris Rosetta: A Digital Preservation System Product Description

Transcription:

Smithsonian Institution Archives Email Preservation Tools What they are, how they work and their operational context Riccardo Ferrante Director of Digital Services @raferrante siarchives.si.edu @SmithsonianArch Archiving Email Symposium Library of Congress Washington, DC June 2, 2015 Lynda Schmitz Fuhrig Electronic Records Archivist @LyndaLSF

Our Starting Point Late 1980s ELM (ELectronic Mail)

Our Starting Point Since then an incomplete listing

Our Starting Point Since then an incomplete listing

Our Starting Point Since then an incomplete listing

Our Starting Point Since then an incomplete listing

Our Starting Point Since then an incomplete listing

Our Starting Point Since then an incomplete listing

Our Starting Point Institutional email management strategies, circa 2004: SI Archives issued guidance part of new employee orientation Email typically considered like voicemail IT deleted oldest emails on large accounts Email migration by special request only when switching from GroupWise to MS Outlook Some of the Smithsonian s email platforms over the past 35 years. Email archives the responsibility of the person No established mechanism for identifying and capturing historically valuable emails

Our Starting Point Late 1980s the date of some of the oldest email in the Archives collections. At the time, the Smithsonian and many universities were using ELM and PINE. We do not refuse deposits of email on the basis of format. Therefore we must use an approach that accommodates a variety of formats. Some of the Smithsonian s email platforms over the past 35 years. Accessioned email comes from a variety of formats including Elm, Pine, Eudora, ccmail, Lotus Notes, Novell GroupWise and most recently Microsoft Outlook.

The Collaborative Electronic Records Project (CERP) With these concerns in mind, the Archives teamed with the Rockefeller Archive Center in a three year project to explore the issue, develop guidance and workflows for acquiring and preserving email. By the end of the project, CERP had also developed a software tool for preserving email in XML using a schema developed in collaboration with the NC/KY/PA Electronic Mail Initiative project (EMCAP). Some of the Smithsonian s email platforms over the past 35 years.

Objective: Durable Access Preservation using nonproprietary, standards based formats Asset manageability Retention of context Retention of complete messages Methodology with a low resource requirement

Our Approach Since 2008, using the CERP methodology & tools Focus on, and actively capturing key organization record creators Accounts, not individual emails Easier manageability at this level

SIP to AIP CERP Model 1) Convert the collection to.mbox files, if not already in this format. 2) Use the CERP parser to convert.mbox files into an XML preservation file. 3) Assemble all components (metadata, SIP, preservation XML, and log files) into an AIP SIP AIP

CERP Parser

Preservation and Access Preservation output through Email Account XML Schema (EAXS) Human and machine readable but not pretty

Beyond CERP

DArcMail (Digital Archive Mail)

DArcMailXML

Encoded attachments

Lifecycle coverage from DArcMail Suite Processing Preservation Everything apart from AIP assembly Reference Access (offline) Follows CERP methodology

Where this fits in the larger SIA context Acquisition Focused on capturing senior officials, notable persons on departure Follows standard electronic records procedures Accept unsolicited email records regardless of format Work with OCIO to acquire the accounts Appraisal Everything received is preserved Weeding is the responsibility of the account owner Finalized based on subject sender listing Now possible within DArcMail Arrangement, Description and AIP preparation Subject/Sender log is a finding aid supplement Full account structure is embedded in the preserved account s XML. Now possible within DArcMail Reference Online collection level description Dark archive Restricted for a 15 year minimum

Where CERP fits with other tools CERP Parser handles 3 stages. DArcMail Suite adds a UI for archival processing and works in Python. Can be used as part of a composite suite of tools for a full lifecycle solution. Affordable (open source) and accessible solution for preservation.