Information Migration



Similar documents
Best Practices for Long-Term Retention & Preservation. Michael Peterson, Strategic Research Corp. Gary Zasman, Network Appliance

In ediscovery and Litigation Support Repositories MPeterson, June 2009

Overview of Storage and Data Management Industry Trends in Long Term Information Retention and Preservation

Storage Technology. Standards Trends

Creating a Catalog for ILM Services. Bob Mister Rogers, Application Matrix Paul Field, Independent Consultant Terry Yoshii, Intel

Moving Data and Distributing Data

LONG TERM RETENTION OF BIG DATA

Data Integrity Means and Practices

AHDS Digital Preservation Glossary

North Carolina Digital Preservation Policy. April 2014

AXF Archive exchange Format: Interchange & Interoperability for Operational Storage and Long-Term Preservation

Digital Preservation. OAIS Reference Model

<Insert Picture Here> Cloud Archive Trends and Challenges PASIG Winter 2012

Interagency Science Working Group. National Archives and Records Administration

Archive exchange Format AXF

Long term retention and archiving the challenges and the solution

Archive and Preservation in the Cloud - Business Case, Challenges and Best Practices. Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP

2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (

Cloud Archive & Long Term Preservation Challenges and Best Practices

ILM: Tiered Services & The Need For Classification

Management: A Guide For Harvard Administrators

Cloud Archiving. Paul Field Consultant

The Secret Sauce of ILM The ILM Assessment Core. Bob Rogers, Application Matrix

in the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP

IT Forum UW-Madison Records Management Program. UW Archives and Records Management

Archiving and Backup - The Basics

State of Michigan Records Management Services. Frequently Asked Questions About E mail Retention

Long Term Record Retention and XAM

State Records Office Guideline. Management of Digital Records

1. Redistributions of documents, or parts of documents, must retain the SWGIT cover page containing the disclaimer.

Managing Data Storage in the Public Cloud. October 2009

The evolution of data archiving

Intelligent Archiving for Managing Unstructured Information. Victor Law Director, Specialist Sales, Asia Pacific & Japan Information Management Group

Object Oriented Storage and the End of File-Level Restores

Why archiving erecords influences the creation of erecords. Martin Stürzlinger scopepartner Vienna, Austria

POLICY AND GUIDELINES FOR THE MANAGEMENT OF ELECTRONIC RECORDS INCLUDING ELECTRONIC MAIL ( ) SYSTEMS

ILM et Archivage Les solutions IBM

TERRITORY RECORDS OFFICE BUSINESS SYSTEMS AND DIGITAL RECORDKEEPING FUNCTIONALITY ASSESSMENT TOOL

Considerations for Management of Laboratory Data

Accelerating HIPAA Compliance with EMC Healthcare Solutions

COMPLIANCE PRACTICES:

Agenda. Overview Configuring the database for basic Backup and Recovery Backing up your database Restore and Recovery Operations Managing your backups

Data Sheet: Archiving Symantec Enterprise Vault Discovery Accelerator Accelerate e-discovery and simplify review

Information-centric Policy Management. Edgar StPierre, EMC

Selecting an Enterprise Records Management system enabling best practice records management. Business white paper

Management of Records

Enterprise Key Management: A Strategic Approach ENTERPRISE KEY MANAGEMENT A SRATEGIC APPROACH. White Paper February

HOLLYWOOD POST ALLIANCE TECHNOLOGY RETREAT FEBRUARY 13, 2015 S. MERRILL WEISS / MERRILL WEISS GROUP LLC CHAIR, SMPTE AXF WORKING GROUP

How To Manage An Electronic Discovery Project

Media Cloud Building Practicalities

Strategies and New Technology for Long Term Preservation of Big Data

Record Retention and Digital Asset Management Tim Shinkle Perpetual Logic, LLC

Information Management Advice 18 - Managing records in business systems: Overview

Creating the ultimate digital insurance

Solving the long term archiving challenges with IBM System Storage Archive Manager Solutions

Dionseq Uatummy Odolorem Vel

AIIM & ASSUREON AN ASSUREON BRIEF

2010: A Space Management Odyssey

Restoration Technologies. Mike Fishman / EMC Corp.

High-Level Storage and Data Management Trends & Observations

Big Data Analytics Service Definition G-Cloud 7

Miguel Ortiz, Sr. Systems Engineer. Globanet

ForgetIT. Grant Agreement No Deliverable D7.1

Using EMC SourceOne Management in IBM Lotus Notes/Domino Environments

How To Use The Hitachi Content Archive Platform

Carestream Information Management Solutions. Managing the explosion in patient information

Archival Data Format Requirements

2011 FileTek, Inc. All rights reserved. 1 QUESTION

PTAB Test Audit Report for. National Space Science Data Center (NSSDC) Prepared by

EMC arhiviranje. Lilijana Pelko Primož Golob. Sarajevo, Copyright 2008 EMC Corporation. All rights reserved.

Brown County Information Technology Aberdeen, SD. Request for Proposals For Document Management Solution. Proposals Deadline: Submit proposals to:

Achieving a Step Change in Digital Preservation Capability

IBM Infrastructure for Long Term Digital Archiving

Mayur Dewaikar Sr. Product Manager Information Management Group Symantec Corporation

Deleting Electronic Records Setting Yourself Up for Success. Pilar C. McAdam, CRM, ERMm Partner, Information Governance

Discover A New Path For Your Healthcare Data and Storage

Mapping the Technical Dependencies of Information Assets

6. FINDINGS AND SUGGESTIONS

White paper. Storing More Intelligently: Tiered Storage Solutions for Security Data

Best Archiving Practice Guidance

Chapter WAC PRESERVATION OF ELECTRONIC PUBLIC RECORDS

SharePoint Archive Rules Options

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

Introduction to Optical Archiving Library Solution for Long-term Data Retention

P u b l i c a t i o n N u m b e r : W P R e v. A

Oracle Recovery Manager

Simplify IT and Reduce Costs with Automated Data and Document Archiving

Enforce Governance, Risk, and Compliance Programs for Database Data

Queensland recordkeeping metadata standard and guideline

Checklist and guidance for a Data Management Plan

Defensible Disposition Strategies for Disposing of Structured Data - etrash

SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY

NORTH CAROLINA STATE ARCHIVES. Preservation Storage in Real Life

Strategic archiving. Using information lifecycle management to archive data more efficiently and comply with new regulations

Vodacom Managed Hosted Backups

Protect Data... in the Cloud

DATA ARCHIVING. The first Step toward Managing the Information Lifecycle. Best practices for SAP ILM to improve performance, compliance and cost

Digital Preservation in Theory and in Practice

Addressing Archiving and Discovery with Microsoft Exchange Server 2010

NSW Government Standard Approach to Information Architecture. December 2013 v.1.0

Transcription:

Information Migration Raymond A. Clarke Sr. Enterprise Storage Solutions Specialist, Sun Microsystems - Archive & Backup Solutions SNIA Data Management Forum, Board of Directors 1

SNIA DMFs 100 Year Task Force Study Objectives Study the business and operational requirements for longterm digital information retention in the data center Goal: Determine the requirements for long-term digital information retention in the data center. These requirements are needed to frame the definition of best practices and solutions to the retention and preservation problems unique to large, scalable data centers Research Hypothesis: Practitioner s experiences with terabyte-size archival systems are adequate to define the business and operating requirements for petabyte-size information repositories in the data center 2 Sun Microsystems, Inc.- Page 2

Key Concern Logical and physical migration do not scale cost-effectively Only operating standard today is to migrate information physically (to new media) every three to five years and logically (to new formats) before the applications and readers die and become obsolete (every 5-10 years) A never ending, costly cycle of migration Practitioners are struggling to keep up with migration requirements. Only 30% claimed to be doing physical migration correctly on disk & none on tape or optical. Only 20% claimed they were confident in their ability to logically migrate some of the data. Information is at risk long-term! Sun Microsystems, Inc.- Page 3

Key Findings The problems of logical and physical retention > Practitioners are struggling information is at risk long-term > Problems are real and generally understood Long-term generally means over 10-15 years. > IT can manage to migrate and retain readability for about this long. For longer periods, processes begin failing, become too costly, and the volume of information becomes overwhelming. Long-term retention requirements are real. > Over 80% of organizations reporting have a need to retain information over 50 years and 68% report a need of over 100 years. This is the problem with 'Digital Archive', you are not thinking long enough into the future. (Source: Respondent) Sun Microsystems, Inc.- Page 4

Requirements for Long-Term Retention Accommodate Drivers > Legal risk > Compliance requirements > Business risk > Security risk > Preserving history > Organizations > Institutions Universities Libraries Archives > Nations Overcome Inhibitors/Barriers > Executive mgmt commitment > Maintaining readability > Logical & physical migration > Collaboration between information owners and administrators > Cost and complexity > Professional status It's very scary to me that the administration is so cavalier about business records. (Source: Respondent) Sun Microsystems, Inc.- Page 5

Requirements for Long-Term Retention Practitioner's Needs > Solve logical and physical migration > Solve technology obsolescence > Improve business commitment > Reduce operating cost > Better management tools > Better collaboration Distribution of state on disk must match the ongoing business value of the data automatically. If not, it's an unsolvable problem, since humans cannot keep up with the data onslaught. (Source: Respondent) Technology Issues > Solve logical and physical migration challenges > Solve the scaling problem to keep up with the growing volume > Information classification > Include provenance & metadata > Dealing with growth & technology change/obsolescence > Include databases and email > Include legacy Information > Better discovery and deletion Sun Microsystems, Inc.- Page 6

Task Force's Recommendations Response to business drivers > Solutions & best practices that are developed have to be compatible with requirements for compliance, discovery, integrity, privacy, protection, etc. Inhibitors to overcome > The three technical inhibitors (migration, cost, & complexity) are essential elements of proper solutions Target solving the top storagerelated problems > Physical & logical Migration > Integrating meta-data > Reducing management cost and reduce operating costs through automation > Keep information available, discoverable, protected, private & secure > Integrate with XAM, ILM, and other existing standards & practices Sun Microsystems, Inc.- Page 7

Long-Term Retention Reference Model Glossary (in review) Physical Migration > A virtualized, federated information repository in which self-healing eliminates need for physical migration > Add all required services (deduplication, hash-based unique naming, location independence, encryption > Meta-data: thru XAM Logical Migration > SIRF: Self-contained Information Retention Format > a container based on OAIS' Archival Information Package integrated with XAM > Through XAM applications can write archival formats containing metadata, information, and a reader. > XAM encourages support Best Practices Sun Microsystems, Inc.- Page 8

Self-Contained Information Retention Format (SIRF) Sun Microsystems, Inc.- Page 9

Requirements for the Data Layer Format (1/2) Media agnostic > Tape, disk, future media Vendor and Platform agnostic Self-describing Support self-contained data > Include means to represent internal links and cross references Performance > Needs to have good performance even for large datasets, including text and binaries > Enable parallel reads and writes Sun Microsystems, Inc.- Page 10

Requirements for the Data Layer Format (2/2) Interoperability > Need to be able to migrate data between different systems without loss of data > Can be interpreted in the future Extensible > Additional information which may be added in the future > Vendor specific extensions Cost > Free parsers Readable by both humans and machines > Ability to do offline inspection Support additional functions on the data > compression, encryption, cryptography Sun Microsystems, Inc.- Page 11

OAIS AIP Logical Structure (ISO 14721:2002) Content Data Object - the raw data that is the focus of the preservation. Representation Information the information required to interpret the raw data to its designated community. Reference globally unique and persistent identifiers for the content information. Provenance the history and the origin of the content information and any changes that may have taken place since it was originated, and who has had custody of it since it was originated. Context documents reason for creation of the content information and relationship to its environment. Fixity a demonstration that the particular content information has not been altered in an undocumented manner. SNIA Data Management Forum Sun Microsystems, Inc.- Page 12

Proposal SIRF is a proposal for an open logical format standard based on marrying the OAIS AIP with XAM. The data format will include OAIS concepts > RepInfo, reference, provenance, fixity, context, etc. Add inter-link and external-link mechanism Include TOC that points to the various AIPs on the media VTL could be a natural translation point and search staging area for off line data Sun Microsystems, Inc.- Page 13

Example Helpful Practices Classify your information (into a few common buckets) Set retention periods and delete expired information Free up space, only store what is required Include your databases Control the number of copies for protection and operational recovery and their locations Set policies for audits and perform them Measure and improve Sun Microsystems, Inc.- Page 14

Thank You for Your Time and THANK YOU! Attention Raymond.Clarke@sun.com (212) 558-9321 Raymond.Clarke@Sun.com (212) 558-9321