The Business Value of Data Deduplication DDSR SIG



Similar documents
UNDERSTANDING DATA DEDUPLICATION. Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s.

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

UNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard

ADVANCED DEDUPLICATION CONCEPTS. Larry Freeman, NetApp Inc Tom Pearce, Four-Colour IT Solutions

Deduplication Demystified: How to determine the right approach for your business

Thomas Rivera / Hitachi Data Systems

Tiered Data Protection Strategy Data Deduplication. Thomas Störr Sales Director Central Europe November 8, 2007

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

3Gen Data Deduplication Technical

Data Deduplication HTBackup

Backup to Tape, Disk and Beyond. Jason Iehl, NetApp

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Efficient Backup with Data Deduplication Which Strategy is Right for You?

Deduplication has been around for several

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

Business Benefits of Data Footprint Reduction

Introduction to Data Protection: Backup to Tape, Disk and Beyond

Energy Efficient Storage - Multi- Tier Strategies For Retaining Data

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation (Author and Presenter)

WHITE PAPER. DATA DEDUPLICATION BACKGROUND: A Technical White Paper

Data Deduplication Background: A Technical White Paper

Restoration Technologies. Mike Fishman / EMC Corp.

Data Deduplication in Tivoli Storage Manager. Andrzej Bugowski Spała

Cloud OS Vision. Modern platform for the world s apps

Long term retention and archiving the challenges and the solution

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Protect Microsoft Exchange databases, achieve long-term data retention

WHITE PAPER Data Deduplication for Backup: Accelerating Efficiency and Driving Down IT Costs

CEMEX en Concreto con EMC. Jose Luis Bedolla EMC Corporation Back Up Recovery and Archiving

Redefining Backup for VMware Environment. Copyright 2009 EMC Corporation. All rights reserved.

EMC Disk Library with EMC Data Domain Deployment Scenario

Overcoming Backup & Recovery Challenges in Enterprise VMware Environments

Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving

Real-time Compression: Achieving storage efficiency throughout the data lifecycle

Reducing Data Center Power Consumption Through Efficient Storage

DeltaStor Data Deduplication: A Technical Review

Get Success in Passing Your Certification Exam at first attempt!

HP StoreOnce D2D. Understanding the challenges associated with NetApp s deduplication. Business white paper

Data Domain Overview. Jason Schaaf Senior Account Executive. Troy Schuler Systems Engineer. Copyright 2009 EMC Corporation. All rights reserved.

Unitrends Recovery-Series: Addressing Enterprise-Class Data Protection

LEVERAGING EMC SOURCEONE AND EMC DATA DOMAIN FOR ENTERPRISE ARCHIVING AUGUST 2011

Data deduplication is more than just a BUZZ word

STORAGE. Buying Guide: TARGET DATA DEDUPLICATION BACKUP SYSTEMS. inside

EMC PERSPECTIVE. An EMC Perspective on Data De-Duplication for Backup

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman

Backup and Recovery Redesign with Deduplication

Data Deduplication and Tivoli Storage Manager

A Comparative TCO Study: VTLs and Physical Tape. With a Focus on Deduplication and LTO-5 Technology

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

Protecting Information in a Smarter Data Center with the Performance of Flash

DPAD Introduction. EMC Data Protection and Availability Division. Copyright 2011 EMC Corporation. All rights reserved.

June Blade.org 2009 ALL RIGHTS RESERVED

How To Protect Data On Network Attached Storage (Nas) From Disaster

Take Advantage of Data De-duplication for VMware Backup

REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE

EMC AVAMAR. Deduplication backup software and system ESSENTIALS DRAWBACKS OF CONVENTIONAL BACKUP AND RECOVERY

EMC BACKUP MEETS BIG DATA

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

LDA, the new family of Lortu Data Appliances

Avamar. Technology Overview

Redefining Oracle Database Management

Checklist and Tips to Choosing the Right Backup Strategy

Doc. Code. OceanStor VTL6900 Technical White Paper. Issue 1.1. Date Huawei Technologies Co., Ltd.

REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE

W H I T E P A P E R R e a l i z i n g t h e B e n e f i t s o f Deduplication in a Backup and Restore System

Next Generation Backup Solutions

WHITE PAPER Backup and Recovery: Accelerating Efficiency and Driving Down IT Costs Using Data Deduplication

Don t be duped by dedupe - Modern Data Deduplication with Arcserve UDP

EMC DATA DOMAIN PRODUCT OvERvIEW

StoneFly SCVM TM for ESXi

HP StoreOnce: reinventing data deduplication

DEDUPLICATION BASICS

Turnkey Deduplication Solution for the Enterprise

EMC DATA DOMAIN OVERVIEW. Copyright 2011 EMC Corporation. All rights reserved.

EMC BACKUP AND RECOVERY SOLUTIONS

Dell Data Protection. Marek Istok Ŋ Dell Slovakia

Selling Compellent NAS: File & Block Level in the Same System Chad Thibodeau

EMC AVAMAR. Deduplication backup software and system ESSENTIALS DRAWBACKS OF CONVENTIONAL BACKUP AND RECOVERY

Data Deduplication: An Essential Component of your Data Protection Strategy

Barracuda Backup Deduplication. White Paper

White Paper. Experiencing Data De-Duplication: Improving Efficiency and Reducing Capacity Requirements

EMC DATA PROTECTION. Backup ed Archivio su cui fare affidamento

EMC NETWORKER AND DATADOMAIN

EMC DATA DOMAIN OPERATING SYSTEM

Transcription:

The Business Value of Deduplication DDSR SIG

Abstract The purpose of this presentation is to provide a base level of understanding with regards to data deduplication and its business benefits. Consideration will be given to current storage pain points and business goals. This session will focus on the benefits derived from data deduplication including cost savings, storage efficiency, and reduced storage administration. Example use cases will explore considerations related to backup and recovery, replication, archival, primary storage applications, and network bandwidth reduction. 2

Agenda Storage Pain Points Storage Business Goals Deduplication Explained Deduplication Benefits Deduplication Implementations Deduplication Savings Ratios Summary 3

Storage Pain Points Rising storage costs due to unchecked data growth Exponential data growth and shrinking backup windows While dealing with same staffing levels Longer data retention on disk required for SLA s and regulatory requirements Limited space, power, cooling, budget deduplication technology can dramatically reduce physical disk capacity requirements while still meeting data storage requirements Unchecked Growth Backup Copies Archival Copies DR Copies Primary Deduplication 4

Storage Business Goals Deduplication can help organizations: Satisfy ROI/TCO requirements Manage data growth Increase efficiency of storage and backup Reduce overall cost of storage Reduce network bandwidth Reduce operational costs including: Infrastructure costs required space, power and cooling Movement toward a greener data center carbon footprint friendly Reduce administrative costs 5

Deduplication Effect on Green Technologies 10 TB - Green technologies use less raw capacity to store and use the same data set - Power consumption falls accordingly Archive 5 TB 1 TB Backup RAID10 RAID10 Archive Backup RAID DP RAIDDP Archive Backup RAID DP RAIDDP Archive Backup RAID DP RAIDDP Archive Backup RAID DP RAIDDP Archive Backup RAID DP RAIDDP RAID 5/6 Thin Provisioning Multi- Use Backups Virtual Clones Deduplication & Compression Source: SNIA Green Storage Initiative 6

Deduplication Explained Find duplicates at some level, substitute pointers to a single shared copy Block or sub-file based Not to be confused with SIS, which is content or name based Streaming (in-band) and offline techniques Savings increase with number of copies found Up to 95% savings, depending on dataset Source: SNIA Green Storage Initiative 7

Deduplication Savings Calculation Every TB of disks you don t buy saves you CAPEX for the equipment CAPEX for the footprint CAPEX and OPEX for power conditioning OPEX for the power to spin the drives OPEX for cooling OPEX for storage management Source: SNIA Green Storage Initiative 8

Deduplication Benefits Reduced floor space requirements Reduced power & cooling costs Reduced capital costs Reduced operational costs Longer retention of data on disk Source: Understanding Deduplication Ratios DDSR White Paper 9

Deduplication Implementations Vendors today provide deduplication solutions for nearly every point at which data is stored or transmitted The decision as to where to deduplicate is determined only by which problem you are trying to solve Remote Office WAN Optimization s Center D D WAN D D D D D D NAS, SAN, VTL, Integrated Replication Server Agents Software D Storage System Gateway Deduplication Points 10

Example Design Approach Center Storage System General purpose storage system that is capable of deduplicating data Gateway Hardware capable of deduplicating data on external storage hardware Hardware capable of deduplicating and storing data in an appliance dedicated for this purpose Storage System Gateway Center D D D Refer to DPI Buyer s Guide For Vendor Descriptions 11

Example Design Approach Remote Office Remote Office Server D D D Agents Software Hardware capable of deduplicating and storing data in an appliance dedicated for this purpose Agents Client-side software capable of deduplicating data for a specific application Software Server-side software capable of storing deduplicated data from application agents Refer to DPI Buyer s Guide For Vendor Descriptions 12

Example Design Approach WAN Optimization WAN Optimization Many deduplication solutions are also capable of transmitting deduplicated data from point to point, resulting in reduced network traffic and inherited deduplication at the destination Remote Office WAN Optimization s Center D D WAN D D D D D D NAS, SAN, VTL, Server Agents Software Integrated Replication D Deduplication Points Storage System Gateway Refer to DPI Buyer s Guide For Vendor Descriptions 13

Deduplication Savings Ratios Savings ratios reflect design tradeoffs involving performance and space reduction Factors associated with higher data deduplication ratios created by users Low change rates Reference data and inactive data Applications with lower data transfer rates Use of full backups Longer retention of deduplicated data Wider scope of data deduplication Continuous business process improvement Smaller segment size Variable-length segment size Content-aware Temporal data deduplication Factors associated with lower data deduplication ratios captured from mother nature High change rates Active data Applications with higher data transfer rates Use of incremental backups Shorter retention of deduplicated data Narrower scope of data deduplication Business as usual operational procedures Larger segment size Fixed-length segment size Content-agnostic Spatial data deduplication 14

Deduplication Considerations How Deduplication Hash-based Delta-based Forward Reference Content-Agnostic Reverse Reference Content-Aware Where Source Target When Inline Post processing What Spatial Temporal Global Local Resiliency RAID RAIN Benefits TCO Power Cooling Footprint Labor Retention 15

For More Information Deduplication Technical Considerations This technical session will address the nuances of data deduplication and the various design approaches available today. Space savings technologies including data deduplication are used to dramatically improve storage efficiency. The session will address the question of what data deduplication is, how it is performed, and the architectural choices available today. It will address various design approaches including fixed length and variable length segmentation, inline and post processing, source and target, availability and integrity of deduplicated data, and complementary use of removable media. It will also explore the factors affecting space reduction ratios relative to specific capacity optimization techniques. Check out SNIA Tutorial: Deduplication Technical Considerations 16

Summary Deduplication reduces physical data storage requirements by removing duplicate data Deduplication also allows data to be retained on disk for longer periods, using less physical space Deduplication involves tradeoffs relating to degree of physical space reduction and performance considerations Thoughtful deployment of deduplication allows users to benefit in many ways, including overall data center greening 17

Q&A / Feedback Please send any questions or comments on this presentation to ddsr-sig@snia.org Many thanks to the following individuals for their contributions to this presentation Deduplication and Space Reduction SIG Matt Brisse Mike Dutch Larry Freeman Devin Hamilton Shane Jackson Gideon Senderov Tom Sas 18

Call To Action Join the SNIA DDSR SIG!! Join the DDSR SIG! Make an impact in helping solve today s data storage challenges Ensure your voice is heard as the industry reaches consensus on space reduction terminology and best practices Help define promote deduplication and space reduction benefits industry-wide For more information and membership, contact the DDSR chair: at ddsr-sig-chair@snia.org Visit the DDSR team at: If a SNIA member : https://www.snia.org/apps/org/workgroup/ddsr-sig If not a SNIA member: http://www.snia.org/member_com/join/