Long Term Record Retention and XAM




Long Term Record Retention and XAM
Wayne M. Adams, Chair Emeritus, SNIA
www.snia.org

Agenda
- Market Trends and Drivers
- SNIA Survey
- SNIA XAM Standard
- SNIA Meta Data Work
- Summary

Information Challenge
- Escalating Storage Costs
  - Data: 70% annual increase in data volumes
  - Digital Proliferation: 92% of information is digitally created, with only 30% being repurposed
  - Cost: 80% of IT budget is consumed by maintenance
- Increasing Scrutiny and Risk
  - Risk: 80% of information is unstructured, with no organizational control
  - Litigation: Nearly 90% of U.S. companies are engaged in some type of litigation
  - Fines: Millions of dollars in fines for inadequate record keeping
Sources: IDC, Gartner Group, AIIM, Meta Group

Data Growth and More Growth
Many end users report information growth rates of 50 to 80 percent per year. According to the March 2008 IDC white paper, The Diverse and Exploding Digital Universe, the digital universe in 2007, at 281 exabytes (281 billion gigabytes), was 10 percent bigger than previously thought. By 2011, the digital universe will be 10 times larger than in 2006. The resizing is the result of faster growth in cameras and digital TV shipments, and of a better understanding of information replication.
More findings from this study are:
- 70 percent increase in data volumes
- 92 percent of information is digitally created, with only 30 percent repurposed
- 80 percent of IT budget is consumed by maintenance
- 80 percent of information is unstructured, with no organizational control
- Nearly 90 percent of U.S. companies are engaged in some sort of litigation and are subject to millions of dollars in fines for inadequate record-keeping
In 2011, the amount of digital information produced in the year should equal almost 1,800 exabytes, or 10 times that produced in 2006. Over 95 percent of the digital universe is unstructured data. Even while image files grow to multi-megabyte size as a result of better camera resolution, the exponential growth of sensors, RFID (radio frequency ID) tags, and packets created by IP voice phone calls is streaming trillions of smaller signals, some just 128 bits, into the digital universe.
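The growth figures quoted above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming steady compound growth between 2006 and 2011 (an assumption of this example, not a claim made by the IDC paper); the function name is illustrative.

```python
def implied_annual_growth(multiple: float, years: int) -> float:
    """Annual growth rate that yields `multiple`-fold growth over `years` years."""
    return multiple ** (1.0 / years) - 1.0


# Tenfold growth between 2006 and 2011 (5 years), as quoted from the IDC paper.
rate = implied_annual_growth(10.0, 5)
print(f"Implied compound annual growth: {rate:.1%}")

# 2006 volume implied by "almost 1,800 exabytes in 2011, or 10 times 2006".
size_2006_eb = 1800 / 10
print(f"Implied 2006 digital universe: {size_2006_eb:.0f} exabytes")

# Growth from the implied 2006 figure to the quoted 2007 figure of 281 exabytes.
growth_2006_to_2007 = 281 / size_2006_eb - 1.0
print(f"Implied 2006 to 2007 growth: {growth_2006_to_2007:.1%}")
```

Under that assumption, both the five-year and the single-year figures work out to a little under 60 percent per year, which sits inside the 50 to 80 percent range that end users report.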

Information-Archiving Challenges: IT Perspective
- Backup: My backup challenges are as critical as my archive needs, if not more so
- Availability: I need the information when I need it
- Governance and Compliance: More work, and a big liability if not addressed properly
- Consolidation: I want one solution for all of my archiving requirements, but the vendors don't offer it
- Costs: I need to drive down costs. My information is growing at greater than 50 percent per year, but I can't afford a 50 percent increase in costs
- Disaster Recovery: I need to protect these critical assets just like everything else

A New Kind of Old Data: Fixed Content
- Music and Video
- Archived Email
- Medical Imaging
- Check Processing and Imaging
- Call Center Voice Recording
- IM Chat Sessions
- Video Surveillance
- Digital Photos

What is Fixed Content?
- A type of data classification that indicates the bits are no longer changing
- Classifying this way enables storage systems to meet the requirements of this type of data
- Most data is created fixed: photos, videos, published/emailed documents, etc.
- 70-90% of data becomes fixed at some point
- Even transactional data becomes fixed, typically within a week
- Fixed content data is growing at 90% year over year

SNIA 100 Year Archive Requirements Survey

What is expected of storage and storage applications?
Different groups have different expectations, and XAM sits between them.
- Application Vendors want to:
  - Annotate data with associated metadata
  - Indicate basic storage management policies
  - Speak the same language to all types of devices
  - Manage billions if not trillions of records
- End Users want:
  - Choices between application vendors
  - Choices between storage vendors
  - Easy migration between vendors/technology
  - Compliance, scalability, performance, $/GB, low total cost of ownership (TCO)
  - To store billions if not trillions of records
- Storage Vendors want:
  - Application support for their products
  - To efficiently store application data and metadata
  - Integrated storage management capabilities
  - To manage billions if not trillions of records

What is XAM?
- XAM is an Application Programming Interface (API) designed to address the storage needs of storage vendors, application vendors, and end users.
- XAM was designed to enable the interoperability, storage transparency, and automation of policy-driven, ILM-based practices for data.
- XAM interfaces were designed with the knowledge that this data must be stored for long periods of time and with information assurance (security).
Why an API?
- As an API, it separates the access method from the underlying storage (block storage, file systems, or whatever).
- This allows end users to choose applications and storage systems separately, and avoid vendor lock-in to either.
- It allows end users to decouple storage provisioning from storage location, enabling them to scale their systems as required without lengthy updates to application references.

What does the XAM architecture look like?
[Diagram: Application -> XAM interface -> XAM library -> VIM interface -> one VIM library per storage system: Reference VIM library (File System), Vendor VIM library 1 (Storage System 1), Vendor VIM library 2 (Storage System 2), ..., Vendor VIM library N (Storage System N). The Application can also call an Extension interface backed by an Extension library.]
The XAM standard specifies:
- A standard library containing the API for applications
- An interface for vendors to provide plugins to the standard library. These plugins are called Vendor Interface Modules (VIMs)
- A reference VIM that allows applications to exercise the XAM functionality
Vendors provide:
- A VIM for each unique storage system
- Optionally, extension libraries containing new functionality. Because these are built on top of the XAM interface, they are usable on all XAM storage systems
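The layering in this architecture (application, standard library, one plugin per storage system) can be sketched in a few lines. This is an illustration only: the class and method names below (VendorInterfaceModule, InMemoryVIM, StorageLibrary, register_vim) are hypothetical and are not the XAM API.

```python
from typing import Dict, Protocol


class VendorInterfaceModule(Protocol):
    """Hypothetical stand-in for a VIM: one plugin per storage system."""

    def store(self, data: bytes) -> str: ...

    def retrieve(self, object_id: str) -> bytes: ...


class InMemoryVIM:
    """Toy plugin backed by a dict, playing the role of a reference VIM."""

    def __init__(self, prefix: str) -> None:
        self._prefix = prefix
        self._objects: Dict[str, bytes] = {}

    def store(self, data: bytes) -> str:
        object_id = f"{self._prefix}-{len(self._objects)}"
        self._objects[object_id] = data
        return object_id

    def retrieve(self, object_id: str) -> bytes:
        return self._objects[object_id]


class StorageLibrary:
    """Toy standard library: applications call it, and it delegates to a plugin."""

    def __init__(self) -> None:
        self._vims: Dict[str, VendorInterfaceModule] = {}

    def register_vim(self, system_name: str, vim: VendorInterfaceModule) -> None:
        self._vims[system_name] = vim

    def store(self, system_name: str, data: bytes) -> str:
        return self._vims[system_name].store(data)

    def retrieve(self, system_name: str, object_id: str) -> bytes:
        return self._vims[system_name].retrieve(object_id)


library = StorageLibrary()
library.register_vim("reference", InMemoryVIM("ref"))
library.register_vim("vendor1", InMemoryVIM("v1"))

form_id = library.store("vendor1", b"application form")
print(library.retrieve("vendor1", form_id))
```

The application code only ever touches StorageLibrary, so adding a storage system means registering another plugin rather than changing the application.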

XAM Features You Should Care About
- XAM supports interoperability between multi-vendor storage devices
- XAM enables retention management & regulatory compliance support
- XAM enables object search and e-discovery through extensive metadata capabilities
- XAM supports security policies
  - Access rights to the XAM object are contained inside the XAM object
- XAM provides object location independence
  - XAM data is location independent & XAM identifiers are unique across all XAM-conformant storage systems

XAM Enables Compliance & Retention Management
- Regulatory Compliance
  - XAM supports tamper-proof storage
  - Regulatory retention information (retention, disposition) is embedded inside XAM objects
- Retention Management
  - Removes the IT management effort & cost overhead of managing retention of objects
  - Reduces regulatory risk of keeping information too long or not long enough
"All large corporations must have a strategy for dealing with reactive discovery now and a long term plan for dealing with proactive information management." Debra Logan, Industry Analyst, October 2007
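To make "retention information embedded inside the object" concrete, here is a minimal sketch in which each object carries its own retention period and a deletion check reads it from the object itself. The names (RetainedObject, can_delete) and the day-based policy are assumptions of this example, not XAM field names or XAM behavior.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class RetainedObject:
    """Hypothetical object that carries its own retention policy."""

    content: bytes
    stored_on: date
    retention_days: int

    def retention_ends(self) -> date:
        return self.stored_on + timedelta(days=self.retention_days)


def can_delete(obj: RetainedObject, today: date) -> bool:
    """Deletion is allowed only once the embedded retention period has passed."""
    return today >= obj.retention_ends()


record = RetainedObject(b"trade confirmation", date(2008, 1, 1), retention_days=365)
print(can_delete(record, date(2008, 6, 1)))  # still under retention
print(can_delete(record, date(2009, 1, 1)))  # retention period has passed
```

Because the policy travels with the object, the check does not depend on a separate retention database staying in sync with the storage system.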

The Importance of XAM Metadata
- An object without metadata is like a can without a label
- The label contains critical information about the contents of the can
- The XAM API supports an interface to search metadata fields
[Figure: Before XAM / After XAM]
- Embedded metadata simplifies the storage & management of data for application vendors
- Embedded metadata & XAM search extends application capabilities for end users
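The idea of searching on the "label" rather than on the contents can be sketched as follows. The object list, field names, and the search function are all hypothetical and stand in for whatever query interface a real system exposes; this is not the XAM query syntax.

```python
from typing import Dict, List

# Hypothetical stored objects: each carries its own metadata "label".
objects: List[Dict[str, str]] = [
    {"id": "obj-1", "type": "xray", "patient": "A-100", "year": "2007"},
    {"id": "obj-2", "type": "email", "from": "analyst", "year": "2008"},
    {"id": "obj-3", "type": "xray", "patient": "B-200", "year": "2008"},
]


def search(items: List[Dict[str, str]], **criteria: str) -> List[str]:
    """Return the ids of objects whose metadata matches every given field."""
    return [
        item["id"]
        for item in items
        if all(item.get(field) == value for field, value in criteria.items())
    ]


print(search(objects, type="xray"))
print(search(objects, type="xray", year="2008"))
```

The search never opens the object contents, which is the point of the can-and-label analogy: the label alone is enough to find the right can.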

Storage Vendor Independence
- Storage Interoperability
  - Applications / end users can select whatever XAM-conformant storage device(s) they prefer
  - End users can use multiple XAM-conformant storage devices simultaneously
- XAM Object Import and Export
  - XAM supports the ability to migrate data between XAM-conformant back-end storage systems
  - Prevents vendor lock-in
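Export and import between back ends can be pictured as copying each object, with its metadata and an unchanged identifier, from one store to another. The sketch below assumes a toy in-memory store; ToyStore, export_object, import_object, and migrate are hypothetical names, and the tuple used as the export format is illustrative rather than the XAM export format.

```python
from typing import Dict, List, Tuple

# An exported object is modelled as (content, metadata) for illustration.
ExportedObject = Tuple[bytes, Dict[str, str]]


class ToyStore:
    """Hypothetical storage system keyed by a store-independent identifier."""

    def __init__(self) -> None:
        self._objects: Dict[str, ExportedObject] = {}

    def put(self, object_id: str, content: bytes, metadata: Dict[str, str]) -> None:
        self._objects[object_id] = (content, dict(metadata))

    def export_object(self, object_id: str) -> ExportedObject:
        content, metadata = self._objects[object_id]
        return content, dict(metadata)

    def import_object(self, object_id: str, exported: ExportedObject) -> None:
        content, metadata = exported
        self.put(object_id, content, metadata)

    def ids(self) -> List[str]:
        return sorted(self._objects)


def migrate(source: ToyStore, target: ToyStore) -> int:
    """Copy every object to the target, keeping identifiers unchanged."""
    for object_id in source.ids():
        target.import_object(object_id, source.export_object(object_id))
    return len(source.ids())


old_store, new_store = ToyStore(), ToyStore()
old_store.put("id-001", b"passport scan", {"doc.type": "passport"})
moved = migrate(old_store, new_store)
print(moved, new_store.export_object("id-001"))
```

Since the identifier survives the move, an application that stored "id-001" can keep using the same reference against the new back end.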

Use of MetaData Standards
- Email Service: writes content and annotates it with metadata, in this case: to, from, roles, subject, and number of attachments. Metadata accompanies content.
- Email object stored by XAM SDK:
    com.acme.email.from = bugs bunny
    com.acme.email.from.role = analyst
    com.acme.email.to = daffy duck
    com.acme.email.to.role = trader
    com.acme.email.subj = what's up doc?
    com.acme.email.numattach = 2
    { Email contents }
    { Attachment #1 }
    { Attachment #2 }
- Email Analysis Program: can access Email metadata and, without the help of the Email Service, analyze whether the sender is allowed to send to the recipient. For example, a stock analyst may not be allowed to send information to a trader.
- XAM specifies how metadata is represented, but not the actual metadata field names and values. Further work is needed to standardize metadata names and allowed values for application domains like Email, Health, and Document Management.
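The analysis step described above can be sketched as a check that reads only the stored metadata. The field names are the ones shown on the slide; the rule table (FORBIDDEN_ROLE_PAIRS) and the function name are hypothetical and stand in for whatever compliance policy an organization actually applies.

```python
# Metadata fields as shown on the slide.
email_metadata = {
    "com.acme.email.from": "bugs bunny",
    "com.acme.email.from.role": "analyst",
    "com.acme.email.to": "daffy duck",
    "com.acme.email.to.role": "trader",
    "com.acme.email.subj": "what's up doc?",
    "com.acme.email.numattach": "2",
}

# Hypothetical policy: (sender role, recipient role) pairs that must not communicate.
FORBIDDEN_ROLE_PAIRS = {("analyst", "trader")}


def is_allowed(metadata: dict) -> bool:
    """Check sender and recipient roles using metadata only, not the message body."""
    pair = (
        metadata["com.acme.email.from.role"],
        metadata["com.acme.email.to.role"],
    )
    return pair not in FORBIDDEN_ROLE_PAIRS


print(is_allowed(email_metadata))
```

The check never parses the email or calls the email service, which is only possible because the roles were written as metadata when the object was stored.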

ECM Application Use Case
- ECM Records & Documents Application port to XAM
- Full lifecycle management for enterprise content utilizing XAM
- Application forms are routed through XAM to the Storage Platform Vendor 1
- Passports are routed through XAM to the Storage Platform Vendor 2 and Vendor 3
[Figure labels: Application forms, Passports]

Information independence for applications and storage: XAM makes this possible
- As seen at SNW 2007 and 2008
- First multi-vendor demonstration based on XAM
[Diagram, applications above the XAM Interface and storage systems below it]
- Applications (commercially available and custom): Records & Documents (Vignette), Disk Extender (EMC), RIM4DB/Outerbay (HP), Photo Editor (Sun)
- Storage systems: HP RISS, EMC Centera, Sun
- Contributed utilities: XSET Browser, XAM Query Tool

MetaData for Self Describing & Contained Data

Requirements to be addressed by SD-SCDF

Summary
- Data growth continues
- Fixed Content represents one category of the data, but a large portion of the growth
- New technology and standards are desired and required to manage the growth in the context of meeting business, retention, and compliance needs
- Requirements for Fixed Content Data Management and Archiving/Archive Management overlap, but each also has requirements exclusive to it
  - Accordingly, XAM and SD-SCDF are complementary
  - Each can be used exclusively or in conjunction with the other
- Timetable for XAM
  - V1.0 Specification: now
  - V1.0 SDK: Q3 2008
  - V1.0 commercial offerings: Q4 2008 and throughout 2009 and 2010
  - V1.0 becomes an ANSI Standard in 2009, ISO in late 2009 or 2010
- Timetable for SD-SCDF
  - Draft specification: stay tuned

For Additional Information
- SNIA XAM Initiative
  - XAMI home page - http://www.snia.org/forums/xam/
  - XAM API specs - http://www.snia.org/forums/xam/specs
  - XAM SDK - http://www.snia.org/forums/xam/ tbd
  - XAM Demo - http://www.snia.org/forums/xam/flshdemo/1282_snia_xam.htm
- SNIA Data Management Forum (DMF)
  - DMF home page - http://www.snia.org/forums/dmf/
  - DMF Long Term Archive - http://www.snia.org/forums/dmf/programs/ltacsi/
  - DMF 100 Year Survey - http://www.snia.org/forums/dmf/knowledge/100yratf_archive-Requirements-Survey_20070619.pdf
- SNIA - http://www.snia.org

Questions?
Thank you!