A SMARTER WAY TO SELECT A DATA EXTRACTION METHOD FOR LEGACY DATA

Similar documents
5 WAYS STRUCTURED ARCHIVING DELIVERS ENTERPRISE ADVANTAGE

SAVE MONEY AND IMPROVE PERFORMANCE BY MIGRATING DATA TO A CENTRAL ARCHIVE

ECM Migration Without Disrupting Your Business: Seven Steps to Effectively Move Your Documents

A 15-Minute Guide to 15-MINUTE GUIDE

DATA ARCHIVING. The first Step toward Managing the Information Lifecycle. Best practices for SAP ILM to improve performance, compliance and cost

Analance Data Integration Technical Whitepaper

What we do? Our services include:

Informatica Application Information Lifecycle Management

TRADITIONAL ERP ERP FOR ECOMMERCE?

Real World Strategies for Migrating and Decommissioning Legacy Applications

Leverage the IBM Tivoli advantages in storage management

Top 5 reasons to choose HP Information Archiving

Discover A New Path For Your Healthcare Data and Storage

Considerations for Management of Laboratory Data

RECORDS MANAGEMENT SERVICES. Cost-Effective, Legally Defensible Records Management

Analance Data Integration Technical Whitepaper

The Smart Archive strategy from IBM

What you need to know about cloud backup: your guide to cost, security and flexibility.

EMC PERSPECTIVE EMC SourceOne Management

Informatica Application Information Lifecycle Management

Building a Scalable Big Data Infrastructure for Dynamic Workflows

The Stacks Approach. Why It s Time to Start Thinking About Enterprise Technology in Stacks

Reduce your data storage footprint and tame the information explosion

Combining the power of content and process with the right content management solution. IBM Information Management software

InforCloudSuite. Business. Overview INFOR CLOUDSUITE BUSINESS 1

Start Now with Information Governance

Top 5 reasons to choose HP Information Archiving

RECORDS MANAGEMENT RECORDS MANAGEMENT SERVICES. Cost-Effective, Legally Defensible Records Management

BUILDING A SCALABLE BIG DATA INFRASTRUCTURE FOR DYNAMIC WORKFLOWS

UNDERSTAND YOUR CLIENTS BETTER WITH DATA How Data-Driven Decision Making Improves the Way Advisors Do Business

Business white paper. Lower risk and cost with proactive information governance

Guide to the Flash Storage Revolution

Brochure. ECM without borders. HP Enterprise Content Management (ECM)

Taking Control of Spend Data Management and Analytics Without Bothering IT

IMPLEMENTING A SECURITY ANALYTICS ARCHITECTURE

Transform records management

WHITE PAPER. Realizing the Value of Unified Communications

EXPLORATION TECHNOLOGY REQUIRES A RADICAL CHANGE IN DATA ANALYSIS

Avoiding Turbulence in the Cloud:

How To Know If Your Archive Is Ready To Be Used For Business

EMC DOCUMENTUM CONTENT ENABLED EMR Enhance the value of your EMR investment by accessing the complete patient record.

WHITE PAPER: How to get more out of your. telepresence installation with unified interoperable video conferencing technology

A VERITAS PERSPECTIVE: Maximize Agility, Minimize Risk In The Multi-Vendor Hybrid Cloud

Lexmark Enterprise Software. Transforming customer engagement

Realizing the Benefits of Data Modernization

Unstructured data in the enterprise

TECHNOLOGY VALUE MATRIX FIRST HALF 2014 ECM

What to Look for When Selecting a Master Data Management Solution

Why Data Management Matters Right Now

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise

EMC Isilon: Data Lake 2.0

Object Storage A Dell Point of View

Simplify IT and Reduce Costs with Automated Data and Document Archiving

OVERCOME REGULATORY DATA RETENTION CHALLENGES WITH COMPLIANCE ARCHIVING

How To Get More Out Of The Cloud

Achieving a Step Change in Digital Preservation Capability

Delivering Smart Answers!

Information Governance in the Cloud

EMC CAPTIVA SOLUTIONS FOR HEALTHCARE

SAP NetWeaver Information Lifecycle Management

White paper. Storing More Intelligently: Tiered Storage Solutions for Security Data

An Enterprise Resource Planning Solution for Mill Products Companies

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

Build an effective data integration strategy to drive innovation

Data Sheet: Archiving Symantec Enterprise Vault Store, Manage, and Discover Critical Business Information

GE Healthcare. Centricity PACS and PACS-IW with Universal Viewer* Where it all comes together

Symantec Enterprise Vault for Microsoft Exchange

The reality of cloud. Go beyond the hype and make a better choice. t e sales@365itms.co.uk.

Now Leverage Big Data for Successful Customer Engagements

Seven Essential Strategies for Effective Archiving

A Successful SharePoint Migration: Best Practices Before, During and After Your Migration

POWERFUL ACCESSIBILITY. A SINGLE WORKSPACE. A MYRIAD OF WORKFLOW BENEFITS. Vue PACS. Radiology

CA Message Manager. Benefits. Overview. CA Advantage

Defensible Disposition Strategies for Disposing of Structured Data - etrash

#MMTM15 #INFOARCHIVE #EMCWORLD 1

Can CA Information Governance help us protect and manage our information throughout its life cycle and reduce our risk exposure?

Integrating data in the Information System An Open Source approach

CommVault Simpana 9 Information Governance

WHITE PAPER Practical Information Governance: Balancing Cost, Risk, and Productivity

8 POINT PLAN TO ELIMINATE PST FILES

Using EMC SourceOne Management in IBM Lotus Notes/Domino Environments

Symantec Enterprise Vault

A Guide to Hybrid Cloud An inside-out approach for extending your data center to the cloud

ARCHIVING: A BUYER S CHECKLIST

Long-term data storage in the media and entertainment industry: StrongBox LTFS NAS archive delivers 84% reduction in TCO

EMC SourceOne Management and ediscovery Overview

Taking the Fast Track to Enterprise Search and ediscovery

White. Paper. Extracting the Value of Big Data with HP StoreAll Storage and Autonomy. December 2012

Innovative Approach to Enterprise Modernization Getting it Right with Data

CrossPoint for Managed Collaboration and Data Quality Analytics

Data Sheet: Archiving Symantec Enterprise Vault for Microsoft Exchange Store, Manage, and Discover Critical Business Information

3 MUST-HAVES IN PUBLIC SECTOR INFORMATION GOVERNANCE

How to Unlock Agility by Backing up to, from, and in the Cloud

Make technology your business advantage

How CFOs and their teams are supercharging financial reporting

IBM Unstructured Data Identification & Management An on ramp to reducing information costs and risk

Best Practices for Data Archiving...1 Prudent Policies...2 Data Vaulting Policies...2 Defining the Data Life Cycle...3 Keeping Data Current...

10 Biggest Causes of Data Management Overlooked by an Overload

Hitachi Cloud Service for Content Archiving. Delivered by Hitachi Data Systems

Optimize SAP Performance

Transcription:

A SMARTER WAY TO SELECT A DATA EXTRACTION METHOD FOR LEGACY DATA EXECUTIVE SUMMARY Deciding how to archive data within legacy applications is a difficult choice for IT organizations, especially considering the huge growth in data and the expenses associated with maintaining legacy applications. Add to that regulatory, governance and legal discovery requirements, and it s easy to understand how vexing this has become for organizations. That s why selecting the most appropriate extraction method for legacy data is such an important requirement. Choosing from among such diverse approaches as table archiving, data record archiving, file archiving and hybrid record archiving means taking into account not only the organization s current needs, but also trying to predict its future requirements certainly no easy task. What is clear, however, is that this is a choice that must be made because organizations will continue to struggle with too many systems, aging applications and rising costs to retain and produce data. Fortunately, there are innovative archiving solutions that support a full range of structured and unstructured data in a cohesive, efficient platform that works seamlessly with any data extraction method. Read this white paper to learn more about archiving solutions that make it easier for organizations to select the best data extraction method for their legacy data, and in so doing, ensure the most flexible approach to creating an enterprise-wide, standards-based, infrastructure-agnostic archiving platform.

TABLE OF CONTENTS EXECUTIVE SUMMARY... 1 WHAT TO LOOK FOR IN AN ARCHIVING PLATFORM... 4 HOW EMC INFOARCHIVE HELPS... 4 CONCLUSION... 5 2

One of the biggest and perhaps one of the most complex challenges faced by today s IT organizations centers on archiving data created by, and residing within, legacy applications. Although many enterprises would like to address the twin problems of rapid data growth and the expense associated with maintaining legacy applications, IT departments ultimately must confront the vexing problem of how best to extract data from legacy systems. Making that choice, however, requires considerable thought about the organization s current and future needs for that data. The decision is also made more difficult by figuring out how to deal with such issues as compliance; changing business processes and workflows; growing e-discovery demands; and the mounting cost and complexity of data storage, archiving and recovery. Selecting the best data extraction method typically means evaluating and choosing from among four different approaches: 1. Table archiving, typically used when an organization does not need to maintain the relationships between tables. It is well suited for the decommissioning of structured data. 2. Data record archiving, where data is presented as a single record. It is typically employed for compliance requirements; unstructured data is not a factor. 3. File archiving, which allows unstructured content to be archived. While this approach doesn t necessarily work well for data mining use cases, it is good in circumstances where the internal data structure isn t known but where important data can already be produced. 4. Hybrid record archiving, which unites structured and unstructured content into a single record. It is ideal for compliance requirements and complex business records, letting business users view essential unstructured data as well as its related structured data in a single business object that allows for big data mining. Selecting from among these four approaches isn t a simple task, because different circumstances even within the same organization can tilt the scale toward different methods. But one thing is certain: This is a choice that must be made, because organizations continue to wrestle with too many systems and huge growth in data volumes. Access to essential information must be maintained even when legacy applications are being retired even when data sits in a state of disuse. According to the Data Management Institute, 40% of data on enterprise systems is inert, while another 15% is either orphaned or being used inappropriately on IT infrastructure. 1 This growing complexity, combined with the heightened sense of urgency IT departments feel to preserve access to important data on legacy systems, has a major implication: Your extraction methodology needs to support archiving of a full range of records, both structured and unstructured, in a single, unified archive. But that s not all: You need an archiving platform that works seamlessly and securely with any or all of these four data extraction methods. In EMC s InfoArchive archiving platform, data handling is simplified by the utilization of Archival Information Units (AIUs), which allow individual data objects to be stored and extracted. Multiple AIUs with similar properties, such as retention, create an Archival Information Package (AIP). 1 Avoiding the Data Apocalypse, SearchStorage.com, 2014 3

WHAT TO LOOK FOR IN AN ARCHIVING PLATFORM Because organizations are dynamic and fluid, it makes sense to avoid being locked into an archiving platform that doesn t provide maximum choice and flexibility in extraction methods. Other factors, such as re-architecting data warehouses, changing database management applications or even undergoing a corporate merger or acquisition can result in the need to use different data extraction procedures and technologies. As a result, your archiving platform must include a number of key features and functions in order to support any of the four extraction methods. These include: Single unified repository. Regardless of the relevant extraction method, it becomes easier, more efficient and more reliable if you are able to pull data from a single, enterprise-wide repository. By having a single location that pulls together data from myriad applications, warehouses and databases, it becomes a much cleaner process. This is particularly useful in scenarios where organizations decide they d like to decommission legacy applications because of the cost and time required to support aging and even archaic systems, but must maintain access to key data for governance, compliance and e-discovery needs. Support for both structured and unstructured data. Although unstructured data now makes up about 80-90% of all new data, a huge amount of legacy data locked in old systems is in a structured format. Organizations can t afford to be limited in their ability to extract either structured or unstructured data by the characteristics of a particular data extraction method. Compliance-centric design. Look for a platform with a wide range of compliance-aware functionality, such as date- and event-based retention; centralized retention policy management; single and multiple retention policies; inheritance of retention policies based on archived data s classification; support for RSA Data Protection Manager for security and encryption; and long-term retention of regulated information. Enterprise scalability. Data volumes are expected to grow at double-digit rates for the foreseeable future, and your archiving platform must keep pace. Enterprise scalability must come not only in higher capacities and more intelligent/automated data tiering policies, but also must ensure high availability, high-iops performance and low latency. Preserved data integrity. Regardless of the method used, IT and business executives must have supreme confidence in the integrity of all extracted data. The archiving platform must have safeguards to ensure that integrity so all users of the extracted data are seeing precisely the same information at all times. Management facility and low administration footprint to reduce impact on IT. Policy management for archiving and its related functions such as data tiering and storage management needs to be an integrated, automated deliverable for the archiving platform across all extraction methods. Policies and automation help ease management headaches and reduce the need for manual search, extraction and reporting by IT professionals. HOW EMC INFOARCHIVE HELPS EMC InfoArchive is a versatile, flexible and robust archiving platform. It supports all data extraction technologies and approaches, and provides organizations with an enterprisewide, standards-based, infrastructure-agnostic platform for archiving. 4

As an integrated product suite, InfoArchive is designed for data integrity, compliance, high availability, security, scalability and automation for both structured and unstructured data. Among InfoArchive s key capabilities are: No dependencies on original application. Support for wide range of data retention policies. Assurance of data quality. Support for industry standards (OAIS) for long-term retention and easy access. Consolidated repository for ingestion and retention of all data types. Data security and encryption. Scalability to hundreds of billions of records. Single view of data and content. InfoArchive s design ensures that authorized users both inside and outside the enterprise can easily, quickly, reliably and securely access archived information, and that IT departments can use whatever extraction method is most appropriate for the particular use case. Users can search concurrently for data across multiple datasets without experiencing unacceptable latency or worrying about inconsistent data integrity from search to search. As a result, InfoArchive helps organizations reduce the cost and complexity of data extraction and archiving while ensuring compliance and enabling smart, non-disruptive application decommissioning. A final element of the added value that InfoArchive brings is an expert coalition of technology and service partners that offer different skills and experience to organizations looking to get the most out of InfoArchive. The seven core organizations that make up the InfoArchive Consortium have a range of specialties, all of them crucial to helping companies adopt next-generation unified archiving based on EMC InfoArchive. One of those members Revolution Data Systems (RDS) is an expert in hybrid record data migration. RDS offers the expertise and ETL tools to manage archiving from legacy systems. By utilizing the RDS team, organizations can migrate data and documents from homegrown and third-party databases, ERP systems, Web applications and flat files. RDS data migration services facilitate the often complex task of deciding the best extraction method for any given legacy system. They also ensure that your data regardless of complexity is archived properly, allowing seamless access to your legacy data. CONCLUSION As data volumes grow unstructured data in particular IT organizations increasingly have to make tough decisions about which data to keep available on primary storage, which to move to secondary storage for archiving purposes and which to get rid of entirely. But compliance, governance, analytics and e-discovery are just some of the reasons that organizations must always maintain that data somewhere and in an easily accessible state even as legacy applications are being decommissioned. IT organizations have four different data extraction methods to choose from, and it s not at all unusual for organizations to need to use multiple methods depending upon the use case or workload. As a result, organizations need to adopt an enterprise-wide archiving platform that supports any and all extraction methods for both structured and unstructured data. 5

InfoArchive is an ideal data archiving platform because it provides organizations with maximum flexibility in selecting the right data extraction method for their needs. InfoArchive enables a single, unified repository designed for enterprise scalability, security, data integrity and fast, reliable extraction. With the technical and business leadership of EMC and the unique capabilities of InfoArchive Consortium members like Revolution Data Systems, InfoArchive gives IT organizations the ability to archive all types of information, achieve controlled access to archive data and mitigate the need for dependencies on the originating application. CONTACT US For more information on InfoArchive or Revolution Data Systems enterprise content management tools and services, please visit www.emc.com/ content-management/ infoarchive or http://www. revolutiondatasystems. com/.