Deduplication s Role in Disaster Recovery. Thomas Rivera, SEPATON

Similar documents
Deduplication s Role in Disaster Recovery. Thomas Rivera, SEPATON

Deduplication s Role in Disaster Recovery. Gene Nagle, EXAR Thomas Rivera, SEPATON

UNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard

UNDERSTANDING DATA DEDUPLICATION. Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s.

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

ADVANCED DEDUPLICATION CONCEPTS. Larry Freeman, NetApp Inc Tom Pearce, Four-Colour IT Solutions

15 Best Practices For Comparing Data Protection With Cloud Media Server

in Transition to the Cloud

The Business Value of Data Deduplication DDSR SIG

PROTECTING DATA IN THE BIG DATA WORLD. Thomas Rivera, Hitachi Data Systems

Thomas Rivera / Hitachi Data Systems

Restoration Technologies. Mike Fishman / EMC Corp.

How To Create A Cloud Backup Service

Thomas Rivera / Hitachi Data Systems. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

Creating a Catalog for ILM Services. Bob Mister Rogers, Application Matrix Paul Field, Independent Consultant Terry Yoshii, Intel

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Today s Agile, Complex and Heterogeneous Data Centers

Introduction to Data Protection: Backup to Tape, Disk and Beyond

Understanding Enterprise NAS

Backup to Tape, Disk and Beyond. Jason Iehl, NetApp

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp

WAN Optimization and Thin Client: Complementary or Competitive Application Delivery Methods? Josh Tseng, Riverbed

3Gen Data Deduplication Technical

Efficient Backup with Data Deduplication Which Strategy is Right for You?

Server and Storage Virtualization with IP Storage. David Dale, NetApp

Actifio Big Data Director. Virtual Data Pipeline for Unstructured Data

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

Protect Microsoft Exchange databases, achieve long-term data retention

Business Benefits of Data Footprint Reduction

ARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER. Tony Walker, Dell, Inc. Molly Rector, Spectra Logic

How To Protect Data On Network Attached Storage (Nas) From Disaster

How To Migrate To A Network (Wan) From A Server To A Server (Wlan)

Building the Business Case for the Cloud

How to Cost Effectively Retain Reference Data for Analytics and Big Data. Molly Rector, EVP Product Management & WW Marketing, Spectra Logic

EMC Disk Library with EMC Data Domain Deployment Scenario

Trends in Application Recovery. Andreas Schwegmann, HP

Energy Efficient Storage - Multi- Tier Strategies For Retaining Data

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation (Author and Presenter)

Data Center Transformation. Russ Fellows, Managing Partner Evaluator Group Inc.

Sales Tool. Summary DXi Sales Messages November NOVEMBER ST00431-v06

Cloud OS Vision. Modern platform for the world s apps

Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving

Cloud Archiving. Paul Field Consultant

Avamar. Technology Overview

Using Data De-duplication to Drastically Cut Costs

Cloud Archive & Long Term Preservation Challenges and Best Practices

Next Generation Backup Solutions

Using HP StoreOnce Backup systems for Oracle database backups

The Role of WAN Optimization in Cloud Infrastructures

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman

Technology Insight Series

Business Life Insurance

Future-Proofed Backup For A Virtualized World!

Best Practices in Healthcare IT Disaster Recovery Planning

EMC DATA DOMAIN OPERATING SYSTEM

EMC DATA DOMAIN OPERATING SYSTEM

REDUCE COSTS AND COMPLEXITY WITH BACKUP-FREE STORAGE NICK JARVIS, DIRECTOR, FILE, CONTENT AND CLOUD SOLUTIONS VERTICALS AMERICAS

EMC AVAMAR. a reason for Cloud. Deduplication backup software Replication for Disaster Recovery

Active Archive - Data Protection for the Modern Data Center. Molly Rector, Spectra Logic Dr. Rainer Pollak, DataGlobal

EMC Data Domain Boost for Oracle Recovery Manager (RMAN)

WAN Optimization and Cloud Computing. Josh Tseng, Riverbed

EMC DATA PROTECTION. Backup ed Archivio su cui fare affidamento

Fundamental Approaches to WAN Optimization. Josh Tseng, Riverbed

Tiered Data Protection Strategy Data Deduplication. Thomas Störr Sales Director Central Europe November 8, 2007

Archive and Preservation in the Cloud - Business Case, Challenges and Best Practices. Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP

Redefining Oracle Database Management

Long term retention and archiving the challenges and the solution

Understanding EMC Avamar with EMC Data Protection Advisor

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

Backup, Recovery & Archiving. Choosing a data protection strategy that best suits your IT requirements and business needs.

CISCO WIDE AREA APPLICATION SERVICES (WAAS) OPTIMIZATIONS FOR EMC AVAMAR

White. Paper. Improving Backup Effectiveness and Cost-Efficiency with Deduplication. October, 2010

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

Backup and Recovery Redesign with Deduplication

Leveraging the Cloud for Data Protection and Disaster Recovery

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

EMC DATA DOMAIN PRODUCT OvERvIEW

Next Generation Cloud Backup with IT Convergence Encore!

Business-Centric Storage FUJITSU Storage ETERNUS CS800 Data Protection Appliance

Protecting enterprise servers with StoreOnce and CommVault Simpana

Business-centric Storage FUJITSU Storage ETERNUS CS800 Data Protection Appliance

Native Data Protection with SimpliVity. Solution Brief

Cloud-integrated Storage What & Why

What You Need to Know NOW about Next Generation Data Protection. Kenny Wong Senior Consultant June 2015

Maximize Your Virtual Environment Investment with EMC Avamar. Rob Emsley Senior Director, Product Marketing

Real-time Compression: Achieving storage efficiency throughout the data lifecycle

Cloud File Services: October 1, 2014

Overcoming Backup & Recovery Challenges in Enterprise VMware Environments

Demystifying Deduplication for Backup with the Dell DR4000

EMC BACKUP-AS-A-SERVICE

Brochure. Data Protector 9: Nine reasons to upgrade

EMC AVAMAR. Deduplication backup software and system. Copyright 2012 EMC Corporation. All rights reserved.

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

EMC Data Domain Boost for Oracle Recovery Manager (RMAN)

Turnkey Deduplication Solution for the Enterprise

Protect Data... in the Cloud

Don t be duped by dedupe - Modern Data Deduplication with Arcserve UDP

EMC PERSPECTIVE. An EMC Perspective on Data De-Duplication for Backup

Data Domain Overview. Jason Schaaf Senior Account Executive. Troy Schuler Systems Engineer. Copyright 2009 EMC Corporation. All rights reserved.

Transcription:

Thomas Rivera, SEPATON

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2

About the SNIA DPCO This tutorial has been developed, reviewed and approved by members of the Data Protection and Capacity Optimization (DPCO) Committee The mission of the DPCO is to foster the growth and success of the market for data protection and capacity optimization technologies 2010 goals include educating the vendor and user communities, market outreach, and advocacy and support of any technical work associated with data protection and capacity optimization 3

Abstract Data deduplication can be applied to the replication of data for disaster recovery (DR) projects, since deduplication significantly reduces the amount of bandwidth required to replicate data. This technical session will: Review data deduplication concepts Cover the impact of deduplication on WAN replication Discuss deduplication effects on meeting SLAs for DR 4

Space Reduction Terminology SNIA definitions: Data Deduplication is the replacement of multiple copies of data - at variable levels of granularity - with references to a shared copy in order to save storage space and/or bandwidth Single Instance Storage is a form of data deduplication that operates at a granularity of an entire file or data object Subfile Data Deduplication is a form of data deduplication that operates at a finer granularity than an entire file or data object Compression is the encoding of data to reduce its storage requirement - compressed data can also be deduplicated 5

Data Deduplication Simplified = New unique data = Repeat data 6

Data Deduplication Simplified = New unique data = Repeat data = Pointer to unique data segment 7

Data Deduplication Simplified Dump #1 Dump #2 Dump #3 Dump #4 = New unique data = Repeat data = Pointer to unique data segment Check out SNIA Tutorial: Understanding Data Deduplication 8

Definitions Replication is (in this context): The transport of data between primary and secondary sites There are multiple Use Case scenarios, which we will cover later Disaster Recovery is: The recovery of data, access to data, and associated processing through a comprehensive process of setting up a redundant site (equipment & work space) with recovery of operational data to continue business operations after a loss of use of all or part of a data center 9

Data Reduction Becomes Ubiquitous Use Cases Expand: Backup Archive Primary data Techniques: Compression Single-instance store Deduplication Deduplication will be widely available in 2012 for blocks & files, and deployable in application software, middleware, operating systems, appliances & storage arrays. LAN/SAN/WAN By 2014, some form of primary data reduction, such as compression and/or deduplication, will be used for at least 20% of all enterprise workloads, up from the low single digits in 2009. (2010) 10

Data Deduplication Benefits Data deduplication can help organizations: Satisfy ROI/TCO requirements Manage data growth costs Increase efficiency of storage and backup Reduce overall expenditure on storage Reduce network bandwidth Reduce operational costs including: Infrastructure costs requiring space, power and cooling Reduce administrative costs 11

Challenges of Enterprise Scale DR Data volumes too large for timely replication Bandwidth constraints / costs Physical Tapes Highly Manual, Exceeding backup windowsrisk Prone Satisfying RPO/RTO metrics Added complexity Unreliable Tape Replication These challenges result in: Costly admin. Risk of human error Risk of tape damage Risk of data loss Not meeting SLAs (backup & recovery) Added complexity (cost $$ for admin, HW/SW, etc.) 12

Challenges of Enterprise Scale DR IT Budgets Storage as a % of IT Budgets Data Growth Cost of Storage Mgmt as a % of Storage IT Challenges Explosion of online data Infrastructure complexity Inflexible architectures Simplifying the storage infrastructure Antiquated recovery infrastructure Increase staff productivity Meeting SLAs within restricted budgets (2009) 13

Increasing Cost & Risk: Trends Data growth BC & DR requirements (SLAs) Regulatory requirements Power, space limits 14

Increasing Cost & Risk: Technical Challenges Performance Capacity optimization Linear scalability Advanced automation Expertise Service 15

Increasing Cost & Risk: Business Impact Capital expense (space) OpEx (power/cooling costs/labor) Risk (data loss) 16

Meeting SLAs Rapid Data Growth 00:00:00 24 x 7 SLAs for BC/DR High CAGR Increased backup costs Downtime costs RTO / RPO Space/Power Limitations Regulatory Requirements Data center footprint Power costs Online retention Added complexity 17

Lower TCO & Better ROI Lower acquisition cost Scalability Reduce Capital Expense Reduce Operating Expense Non disruptive Less labor: Automation Less power & space Avoid Costs Frees IT staff time More data per FTE No human error 18

Deduplication Controls Growth Deduplication ratio typically improves over time Logical Storage Savings Dedupe Storage Time 19

Deduplication Ratio: Depends on Use Case, Time Primary storage has less duplicate data Periodic archives have moderate duplicate data Repeated backups have significant duplicate data Logical Capacity Dedupe Primary Dedupe Archive Dedupe Backup Time 20

Deduplication Implementations Application-specific protocol CIFS, NFS, FC, iscsi, VTL WAN Deduplicated Replication Agent or Component Gateway Virtual Appliance Appliance Storage System Grid Storage Many options to choose from Decision will be influenced by the project goals: SLAs for data backup and recovery, regulations, etc. 21

Dedupe in DR: What Really Matters? Focus on your service level agreements (SLAs) Needs to meet allotted time for replication Needs to meet allotted time for restore Is it necessary to dedupe all data? Regulated data may require unique rules Not all data deduplicates effectively Can the dedupe solution scale to meet your needs? Needs to scale in capacity & performance Different dedupe approaches yield different reduction ratios CapEx & OpEx savings can be higher (one system vs. multiple) 22

Dedupe in DR: Automation Benefits Automation Simplifies the offsite process Minimize risk of data loss/data theft Leverage existing bandwidth Main Data Center DR Site WAN Deduplicated Data Physical Tape Creation 23

Dedupe in DR: Network Efficiency Network Efficiency Deduplication dramatically reduces bandwidth usage Main Data Center DR Site WAN Deduplicated Data 24

Dedupe in DR: Risk Reduction Risk Reduction Human error Regulatory noncompliance Improve data access reliability Main Data Center DR Site WAN Deduplicated Data 25

Dedupe in DR: Cost Savings Cost Savings Reduced manual media handling Reduce tape archival services Minimize data loss Main Data Center DR Site WAN Deduplicated Data 26

Dedupe in DR: Requirements Main Data Center DR Site WAN Deduplicated Data Replicate large data volumes Send only changed data over the network Perform fast data restores from remote site Provide control over replication/restoration process Provide resiliency / high availability 27

Use Model: HQ to DR Location Headquarters Data Center DR Site WAN Deduplicated Data Data is deduplicated & replicated to a DR site Recover operations in the event of data becoming unavailable at main data center 28

Use Model: Data Center to Data Center Data Center 1 Data Center 2 WAN Deduplicated Data Data is deduplicated & replicated bi-directionally between two production data centers Each data center acting as a DR site for the other 29

Use Model: Edge to Core DR Regional Data Centers Core DR Center WAN Deduplicated Data Data is deduped & replicated from multiple regional data centers to a main DR center Core DR center acting as a DR site for all production data centers 30

Deduplication: Potential Issues Be aware of the challenges May decrease data ingestion performance Can negatively impact restore performance May not scale in performance May not scale in capacity May not offer resiliency/ha features Encrypted data limits deduplication Choices exist that trade between strengths and weaknesses Easy to under-estimate the bandwidth required Changed data size replication window = data rate needed 31

Review Using deduplication in DR can help organizations: Satisfy ROI/TCO requirements Manage data growth Increase efficiency of storage and backup Reduce overall cost of storage Reduce required network bandwidth Reduce operational costs including: Infrastructure costs requiring space, power and cooling Movement toward a greener data center Reduce administrative costs 32

Which of the following technologies will most affect your storage infrastructure during the next three years? (2009) 33

Summary Multiple elements to consider when evaluating deduplication technologies for DR projects: CPU Utilization and/or Power Consumption Restore Performance of Deduped Data WAN Efficiency of Deduped Data Replication Scalability of Deduped Data Resiliancy/HA of Deduplication Solution 34

Summary (cont.) There is no right solution for everyone! The appropriate solution will vary by environment and requirements Determine service Level objectives (RTO/RPO) first - before selecting and implementing technology Work with trusted advisors to assess your environment and recommend appropriate solutions 35

Q&A / Feedback Please send any questions or comments on this presentation to SNIA: trackdatamgmt@snia.org Matthew Brisse David Chapa Don Deel Mike Dutch Larry Freeman David Hill Bernd Henning Many thanks to the following individuals for their contributions to this tutorial. Judy Leach Gene Nagle Richard Reitmeyer Thomas Rivera Tom Sas Gideon Senderov - SNIA Education Committee It s easy to get involved with the DPCO! - Find a passion - Join a committee - Gain knowledge & influence - Make a difference www.snia.org/dpco 36