deduplication s Drives Efficient Backup and Recovery

Similar documents
Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

How To Protect Data On Network Attached Storage (Nas) From Disaster

CISCO WIDE AREA APPLICATION SERVICES (WAAS) OPTIMIZATIONS FOR EMC AVAMAR

Efficient Backup with Data Deduplication Which Strategy is Right for You?

15-MINUTE GUIDE. SMARTER BACKUP Transform your future

EMC PERSPECTIVE. An EMC Perspective on Data De-Duplication for Backup

EMC DATA DOMAIN OPERATING SYSTEM

EMC DATA DOMAIN OPERATING SYSTEM

WHITEPAPER. 7 Reasons Why Businesses are Shifting to Cloud Backup

Business-Centric Storage FUJITSU Storage ETERNUS CS800 Data Protection Appliance

VMware vsphere Data Protection

Turnkey Deduplication Solution for the Enterprise

NetApp Syncsort Integrated Backup

How To Backup With Ec Avamar

Get Success in Passing Your Certification Exam at first attempt!

Mayur Dewaikar Sr. Product Manager Information Management Group Symantec Corporation

DPAD Introduction. EMC Data Protection and Availability Division. Copyright 2011 EMC Corporation. All rights reserved.

EMC Data Domain Boost for Oracle Recovery Manager (RMAN)

Business-centric Storage FUJITSU Storage ETERNUS CS800 Data Protection Appliance

Checklist and Tips to Choosing the Right Backup Strategy

Business Benefits of Data Footprint Reduction

Cloud, Appliance, or Software? How to Decide Which Backup Solution Is Best for Your Small or Midsize Organization.

Backup and Recovery Redesign with Deduplication

EMC BACKUP MEETS BIG DATA

WHITE PAPER Data Deduplication for Backup: Accelerating Efficiency and Driving Down IT Costs

Avamar. Technology Overview

Overcoming Backup & Recovery Challenges in Enterprise VMware Environments

Energy Efficient Storage - Multi- Tier Strategies For Retaining Data

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Deduplication and Beyond: Optimizing Performance for Backup and Recovery

EMC NETWORKER AND DATADOMAIN

I T T R A N S F O R M A T I O N A N D T H E C H A N G I N G D A T A C E N T E R

Bringing the edge to the data center a data protection strategy for small and midsize companies with remote offices. Business white paper

EMC AVAMAR. Deduplication backup software and system ESSENTIALS DRAWBACKS OF CONVENTIONAL BACKUP AND RECOVERY

SYMANTEC NETBACKUP APPLIANCE FAMILY OVERVIEW BROCHURE. When you can do it simply, you can do it all.

Symantec NetBackup PureDisk Optimizing Backups with Deduplication for Remote Offices, Data Center and Virtual Machines

I D C T E C H N O L O G Y S P O T L I G H T

Redefining Backup for VMware Environment. Copyright 2009 EMC Corporation. All rights reserved.

Understanding EMC Avamar with EMC Data Protection Advisor

Total Cost of Ownership Analysis

WHITE PAPER Backup and Recovery: Accelerating Efficiency and Driving Down IT Costs Using Data Deduplication

Protect Microsoft Exchange databases, achieve long-term data retention

Take Advantage of Data De-duplication for VMware Backup

White. Paper. Addressing NAS Backup and Recovery Challenges. February 2012

Virtual Machine Protection with Symantec NetBackup 7

Introduction. Silverton Consulting, Inc. StorInt Briefing

Dell PowerVault DL2200 & BE 2010 Power Suite. Owen Que. Channel Systems Consultant Dell

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Optimizing Backup and Data Protection in Virtualized Environments. January 2009

EMC AVAMAR. Deduplication backup software and system. Copyright 2012 EMC Corporation. All rights reserved.

Integrating a Multi-tiered Deduplication Approach to Simplify Enterprise-wide Backup & Recovery

Tiered Data Protection Strategy Data Deduplication. Thomas Störr Sales Director Central Europe November 8, 2007

Global Headquarters: 5 Speen Street Framingham, MA USA P F

VMware vsphere Data Protection

EMC Data Domain Boost for Oracle Recovery Manager (RMAN)

Field Audit Report. Asigra. Hybrid Cloud Backup and Recovery Solutions. May, By Brian Garrett with Tony Palmer

A CBTS White Paper. Offsite Backup. David Imhoff Product Manager, CBTS 4/22/2012

Maximize Your Virtual Environment Investment with EMC Avamar. Rob Emsley Senior Director, Product Marketing

Dell PowerVault DL Backup to Disk Appliance Powered by CommVault. Centralized data management for remote and branch office (Robo) environments

Complete Storage and Data Protection Architecture for VMware vsphere

WHITE PAPER. Storage Savings Analysis: Storage Savings with Deduplication and Acronis Backup & Recovery 10

WHITE PAPER. The Double-Edged Sword of Virtualization:

5 KEY BACKUP FEATURES TO ENSURE A SUCCESSFUL BACKUP REDESIGN

EMC DATA DOMAIN PRODUCT OvERvIEW

Backup and Recovery for VMware Using EMC Next-Generation Backup Solutions

VMware vsphere Data Protection 5.8 TECHNICAL OVERVIEW REVISED AUGUST 2014

Understanding EMC Avamar with EMC Data Protection Advisor

Optimizing Information Management in the Cloud

Data Protection Report 2008 Best Practices in Data Backup & Recovery

Backup and Recovery for SAP Environments using EMC Avamar 7

Oracle Data Protection Concepts

A Business Case for Disk Based Data Protection

EMC DATA DOMAIN OVERVIEW. Copyright 2011 EMC Corporation. All rights reserved.

We take care of backup and recovery so you can take care of your business. INTRODUCING: HOSTED BACKUP

Combining Onsite and Cloud Backup

Solution Overview: Data Protection Archiving, Backup, and Recovery Unified Information Management for Complex Windows Environments

Real-time Compression: Achieving storage efficiency throughout the data lifecycle

EMC Integrated Infrastructure for VMware

Symantec NetBackup 7.5 for VMware

PASS4TEST 専 門 IT 認 証 試 験 問 題 集 提 供 者

DEFINING THE RIGH DATA PROTECTION STRATEGY

Deduplication has been around for several

Barracuda Backup for Managed Services Providers Barracuda makes it easy and profitable. White Paper

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Cisco WAAS for Isilon IQ

White Paper FASTFILE / Page 1

Manufacturers Need More Than Just Backup... But they don t need to spend more! axcient.com

The CIO Guide to Virtual Server Data Protection

Cloud Based Disaster Recovery and Technologies Driving it Janson B. Hoambrecker

CEMEX en Concreto con EMC. Jose Luis Bedolla EMC Corporation Back Up Recovery and Archiving

Transcription:

tech dossier Deduplication Drives Efficient Backup and Recovery inside: Rethinking backup and recovery improving remote site backups optimizing storage strategies deduplication s business imperatives With growing data storage needs and virtualization technologies, more IT organizations are redesigning their backup and recovery strategies to accommodate the increasing demand for storage. In today s global marketplace, companies must make datadriven decisions to stay one step ahead of the competition. Information has become essential to launching strategic initiatives as well as maintaining smooth day-to-day operations, so companies must ensure their IT infrastructures are prepared to meet current and future storage needs. Equally important, and a key component to these storage infrastructures, is a well-designed backup and recovery architecture that ensures the continuous availability and integrity of the data that fuels the company.

Yet the amount of corporate data that must be retained continues to grow, and experts say there is no end in sight. Analyst firm Enterprise Strategy Group (ESG) estimates that the amount of information stored in databases alone is growing at a rate of 25 percent annually, while the growth of unstructured data is an estimated two to three times that figure. In addition, data retention policies that are being dictated by corporate and regulatory mandates are requiring companies to store more data for longer periods of time than ever before. Another contributing factor to the growth of corporate data is the trend toward server virtualization. The key benefit of virtualization is that organizations can help address initial or immediate capacity constraints. However, they don t address the continuous need to maximize the utilization of resources. Together, these factors are forcing organizations to consider redesigning their backup and recovery strategies to better accommodate current and future storage needs. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Rethinking backup and recovery Corporate IT departments are looking to redesign their backup architectures to replace less efficient, less reliable and more complex legacy backup system approaches with technology that reaches high levels of effectiveness and simplicity. Rearchitected backup and recovery systems that feature storage optimization technologies allow organizations to achieve these goals by reducing the amount of data stored in order to control operating expenses, limit total cost of ownership, shorten backup windows and speed recovery times. Incorporating these techniques into backup and recovery plans represents an elegant way to do more with less. And, with the emergence of more efficient backup and recovery technologies, organizations are free to choose the right tool for the task at hand. Tape s role is changing, says Greg Schulz, founder of The Server and StorageIO Group, an IT consulting firm based in Stillwater, Minn. No longer best suited for daily backups, tape is shifting to archiving and preserving data for the long term. Disk is handling day-to-day or weekly backups because it s good for rapid restore and rapid access. If you couple disk with storage-efficiency technologies, such as deduplication, to reduce an organization s data footprint, you can store more data on the disk and economically shift archiving to tape. Data deduplication for backup and recovery has emerged as one of the storage-efficiency technologies that delivers the most impact, and organizations are realizing the critical role that data deduplication plays in a backup and recovery strategy. According to a recent study by research firm TheInfoPro, the number of Fortune 1000 companies that are implementing backup data reduction/ deduplication has grown significantly over the past few years, from 9 percent in the fourth quarter of 2007 to more than 46 percent in the second quarter of 2010. Companies are making data deduplication a top storage initiative because the technique addresses many of the pain points cited by storage professionals with their existing infrastructure, including performing backup management and administration tasks, managing storage growth and complexity, managing archiving, maintaining compliance, and dealing with application recovery and backup retention, the firm says. Tech Dossier Deduplication Drives Efficient Backup and Recovery 2

>>>> Improving Remote Site Backups Brown-Forman Corp., a producer and marketer of premium spirits and wines based in Louisville, Ky., was having trouble keeping up with its backup goals for the company s 30-plus remote sites located throughout the U.S. and in 24 countries. Using tape systems that were directly attached to servers in remote offices, Brown-Forman s administrative staff at each location had to run backups Monday through Friday, and success rates weren t very high, says Greg Tinnell, storage management manager at the company. With tape backup, we had very little control over how well data at our remote sites was being protected, Tinnell says. Remote office personnel would forget to swap tapes, or be out of the office sick or on vacation. Also, site data started outgrowing the capacity of a single tape, forcing them to exclude data. And if there was a hardware problem with the tape system, it could take two or three days to fix, bringing down backup success rates even more, he says. Brown-Forman decided to centralize the management of backup and recovery for remote sites, and implemented EMC s Avamar. By identifying redundant data at the source, Avamar was able to reduce the company s daily backup data by 500 times before it crossed the network to be stored on disk, Tinnell says. The patented deduplication technology in EMC Avamar is what really sold us, he says. All of our sites have limited bandwidth so the less data we needed to send over the network, the better. Now all the deduplicated data is sent across the WAN and backed up to a central Avamar server at the company s headquarters in Louisville. What pushed us toward Avamar was the fact that the deduplication was actually handled at the source, instead of bringing all the data back to our central location, Tinnell adds. It has increased our success rates, and it has given us immediate off-site storage something we never accomplished in our history of having servers in the remote offices. To be able to recover from a disaster has been the largest benefit. In addition to improving the backup process, implementing Avamar has made life easier on administrative personnel in Brown-Forman s remote offices. Avamar saves the offices a lot of time, he says. Whenever we get a restore request we go to the Avamar product and kick off the restore, which goes across the WAN without any user intervention at all. The company now has a 97 percent success rate with backup processes. With data growth and server virtualization continuing to fuel the adoption of backup data reduction technologies, it s clear that the vast majority of both Fortune 1000 and mid-size enterprises now view deduplication as an essential element in their storage infrastructures, says Marco Coulter, Managing Director of Storage Research for IT research and advisory firm TheInfoPro. Even with more and more storage vendors offering deduplication in an effort to capitalize on the significant market opportunity associated with backup redesign, our latest research shows that EMC continues to extend its lead in the backup data reduction deduplication space. Eyeing efficiency Corp Data deduplication removes the inefficiencies found in typical data protection operations, where backup applications store multiple copies of the same file even when only a small portion of that file has changed. Traditional approaches to data protection result in the storage of dozens of copies of the same data. What pushed us toward EMC Avamar was the fact that the deduplication was actually handled at the source, instead of bringing all the data back to our central location. Greg Tinnell, Brown-Forman Corp. Tech Dossier Deduplication Drives Efficient Backup and Recovery 3

POLL What are your tape plans? Use only tape Use tape for retention and disk for backups Eliminate tape Take the poll In addition, organizations end up sending multiple copies of the same data across the LAN or WAN, consuming bandwidth and increasing the amount of time it takes to complete the backup, which requires more storage capacity. By removing redundancy from files and only transmitting or storing what is unique, data deduplication reduces the amount of information that must be backed up, allowing enterprises to drive efficiency into their backup and recovery architectures by maximizing their backup capacity and limiting the amount of data that must cross the network. Data deduplication allows you to keep more data online and available, explains Mark Twomey, EMEA Technical Consultant with EMC s Backup and Recovery Solutions Division. It reduces the amount of data to be stored, without reducing the data available. Backup deduplication scans the data, comparing against what is already stored. If the data is found to be unique that is, it is not currently stored in the system then it is transmitted and stored. If data is found to be a duplicate of what has already been stored, it is not stored again. This process of eliminating redundant data produces the following benefits: Lower storage costs >> Since less backup capacity is required because less data is stored, data deduplication can significantly reduce storage requirements. Companies have reported realizing up to 98 percent reductions in cumulative backend storage while keeping their backup data online. Shortened backup and recovery times >> Because data deduplication results in less information that must be stored, backup windows shrink and recovery times are reduced. As a result, replication to disaster recovery sites is also significantly faster as less data is transmitted but all data is available. Improved bandwidth efficiency >> With less data to send across the LAN or WAN for backup, companies can reduce network traffic, leading to lower bandwidth costs and better network response times. Less data to back up from virtualized servers >> According to ESG, while virtualization allows companies to run multiple servers on a single piece of hardware to drive up utilization, more than one-third of organizations that have implemented server virtualization have seen an increase in the amount of data that must be backed up. This is because each virtual machine s disk image contains the OS, applications and data, creating a high degree of redundant information on a physical server. Data deduplication significantly reduces the amount of data backed up from virtualized servers. Maintained data integrity >> By keeping backup data online and accessible, deduplication storage systems can perform regular checks against all the backup data stored in them to ensure end-to-end data integrity throughout the system. This process is automatic, regular and does not require operator intervention. Support for green IT initiatives >> By reducing the amount of backup data, deduplication allows for organizations to consolidate their backup equipment, lowering the costs required to power IT equipment and freeing up data center space. Easier transition from tape to disk >> By driving down the total cost of ownership for disk-based backup by removing duplicate data, the technology helps facilitate the move away from tape. ESG research shows that nearly 50 percent of onsite backup data will be stored on disk by the end of 2010, up from 26 percent in 2007. The cost-saving benefits offered by data deduplication are demonstrable. Research from IDC shows that the average company can save up to $1.5 million annually by implementing this technology, which allows IT organizations to move away from less-efficient, less-reliable and more complex legacy backup system designs. The vast majority of deduplication savings comes from reducing backup costs (76 percent), while increased staff productivity (15 percent) and improved storage management (9 percent) also play a role, according to IDC. Additionally, the more a company uses the technology deployed to reduce backup data, the more benefits they will see, explains Schulz. Tech Dossier Deduplication Drives Efficient Backup and Recovery 4

The more data stored, the greater the data deduplication ratios, he says. When the technology is able to scan larger blocks of information, more duplicates can be discovered, resulting in a greater reduction of data. And users don t need to be concerned about reconstructing backed-up files, because in the storage system, the segmenting of files is hidden from users and applications, so the whole file is still readable after having been written. EMC s solutions EMC offers a variety of approaches to data deduplication for backup and recovery, enabling customers to choose the best fit for their environment whether they are undergoing a complete overhaul of their backup architecture with data deduplication at the center, or adding the technology to existing environments to gain greater efficiency and cost savings. podcast: Optimizing Storage Strategies How to efficiently re-architect backup and recovery systems with deduplication technologies. www.computerworld.com/html/assets/emc/ optimizing-storage-strategies.html Deduplication technology can accelerate backup efficiency and drive down IT costs. Laura DuBois, IDC EMC s Avamar backup solution performs global deduplication of data at the source, producing fast, secure backup and recovery across the enterprise, including VMware environments, remote offices and Network Attached Storage NDMP backup. By minimizing redundant data at the source before it is sent across the LAN or WAN, Avamar reduces bandwidth consumption while reigning in backup storage growth. Avamar performs full backups that can be recovered in one step, verifies backup data recoverability, and can encrypt data in flight and on disk for security. The Data Domain product line from EMC offers high-speed, inline data deduplication that works with POLL Have you adopted deduplication into your backup environment? Yes Not yet, but soon No plans Take the Poll existing backup infrastructures. By performing deduplication at the storage target before the data is written to disk, the technology can consolidate storage for many different backup sources and help avoid creating disparate islands of backup. Deduplicated data can be stored on site, or automatically replicated over the WAN to a remote site for disaster recovery purposes. Regardless of the approach, data deduplication helps companies eliminate the storage, integrity and offsite pain points associated with current backup and recovery architectures. Backup windows and recovery times are shortened, backups are proven to be more reliable, the amount of data produced by server virtualization is contained, electronic copies of data can be securely stored offsite or at branch locations, and overall backup and recovery infrastructure costs are reduced. Deduplication technology can accelerate backup efficiency and drive down IT costs, according to the IDC white paper Backup and Recovery: Accelerating Efficiency and Driving Down IT Costs Using Data Deduplication, written by analysts Laura DuBois and Robert Amatruda. Firms are deploying different types of deduplication-enabled solutions to address a myriad of cost and operational challenges with the growing volume of backup data. IDC finds that deduplication is a core, must-have feature for a variety of storage solutions to address these challenges. n >>>>>>>>>>>>>>>>> For more information on backup and recovery, visit us at www.emc.com Tech Dossier Deduplication Drives Efficient Backup and Recovery 5

WhitePaper additional reading Deduplication sbusinessimperatives PrioritizinganInvestmentNow ByBrianBabineauandDavidA.Chapa December,2010 Introduction Everyyear,ESGsurveysseniorITexecutivestounderstandspendingprioritiesfortheupcoming12 18months. Tworecurringinvestmentthemeshavebeen improvingbackupanddisasterrecovery and managingdata growth and2010wasnotanydifferent.facingseeminglynever endingdatagrowth,thequestioneveryit executiveshouldbeaskingis Whattechnologycanbestaddressthesepriorities? Thesimpleansweris deduplication,atechnologythatreducestheamountofthedatatobemanaged,protected,andultimatelystored. Becausededuplicationcurbsdatagrowth,manydownstreamIToperationaltasksandexpenses suchas replicationfordisasterrecoveryorretentionforcompliancepurposes becomeeasierandmorecosteffective. Thesearebenefitsthatcannotbeignored,especiallywhenESGresearchalsohighlightsthattheeasiestwayto rationalizeatechnologyinvestmentistoproveitwillresultinareductioninoperatingexpenses. 1 Deduplicationtechnologiesmanifestinseveralsolutionsinthemarketplace,mostnotablyinbackupandarchive environments.thisiswhereapreponderanceofredundantdataresides.ratherthansavingandcopyingthesame dataoverandoveragain,itistimetoprioritizeaprojectthatactuallyreducesdataandfacilitatesconsolidation. DataGrowthConundrum RelentlessInformationGrowth Therearetwoprimarycontributorstoinformationgrowth: organic or netnew informationanddatathatis retainedforaspecificreason.accordingtoallofesg smeasures,bothtypesofdatacontinuetogrowatarapid pace.forexample,inarecentesgreport,40%ofrespondentsindicatedthattheirprimarye mailstoragecapacity wasincreasingbyatleast20%perannum. 2 Archivecapacity asinformationretainedforbusinessreference, compliance,andlegalpurposes willtop300exabytesby2015. 3 Figure1.TotalWorldwideDigitalArchiveCapacity,2010 2015 350,000 300,000 TotalWorldwideDigitalArchiveCapacity,AllContentTypes,2010 2015(Petabytes) 56%CAGR 302,995 250,000 200,000 197,234 150,000 100,000 50,000 33,217 51,992 79,151 123,157 0 2010 2011 2012 2013 2014 2015 Source:EnterpriseStrategyGroup,2010. 1 Source:ESGResearchReport,2010ITSpendingIntentionsSurvey,January2010. 2 Source:ESGResearchReport,E mailarchivingmarkettrends,may2010. 3 Source:ESGResearchReport,DigitalArchiveMarketForecast2010 2015,July2010. 6

InformationCosts WhitePaperDeduplication sbusinessimperatives2 ITexecutivesmustfactorannualdatagrowthratesintoallareasoftheirbudgets.Withmoreinformationbeing created,moreprimarystoragecapacityisrequired.theincreaseincapacityrequirementsmayaffecttheprimary storagesystems footprintinthedatacenterandpotentiallyrequireadditionalfloorspace.further,storage operatingcostssuchaspowerandcoolingrequirements,additionalnetworkinginfrastructure,redundancy components,andresourcemanagementsoftwarelicensingwillalsogrowasstorageisadded. Anincreaseinprimarystoragecapacity,inturn,triggersincreasesinsecondarystoragecapacity(diskand/ortape), mediamanagementservers,backupsoftwarelicensing,backupreportingsoftwarelicensing,andoffsitemedia expenses.dependingonhoweffectiveanorganizationisatbackingupdata,thecapitaloutlaycanbe3 4times greaterthantheprimaryenvironmentduetoallthecopiesandreplicasbeingcreated.organizationsalsohaveto takeintoaccountoperatingexpensesrelatedtothebackupenvironment:esgestimatesthat,onaverage,only42% ofdataprotectionbudgetsarespentonhardwareandsoftware;therestisspentonstaffandotheroperatingcosts (tapetransportation,etc.). 4 Anorganizationwithseveralremoteandbranchoffices(ROBOs)shouldaccountforprimarystorageanddata protectioncapitalandoperatingcoststhatoccuroutsideofthedatacenter.theseexpensesoftengooverlooked becausetheyarenotpartofacentralitbudget.whenaggregated,theycanrepresentasubstantialportionof overallitspend. DataProtection scomplicity TheMultiplierEffect Primarydatagrowthisexpensive,butthebiggestcontributorstothe costofinformation arethecopiesmadefor dataprotectionpurposes.whenesgaskednearly500itdecisionmakersabouttheirgreatestdataprotection challenges,thetopresponsewas keepingpacewiththecapacityofdatatoprotect. 5 Thiscanbeinterpretedin twodifferentdimensions,thefirstofwhichissimplytheabilitytogetbackupsandreplicascompletedwithina givenwindow.asmentioned,thisisnoeasyfeatgivenexpecteddatagrowth. Anorganizationmustalsobeabletomanageallofitsinformationandrestoreafile,application,orentiresystem whenneeded.theseactivitiesbecomecumbersomeduetothebackupanddisasterrecoverypoliciesmany organizationshaveinplacetoday:rightnow,itmakesacopyofavolume,lun,orfileatoneormorepointsintime duringthedayandthensavesthecopylocallyforoperationalrecoveryandatanoffsitelocationfordisaster recovery(dr).butdataprotectionoperationsareofteninefficient backupapplicationsmakemanycopiesofthe same(orslightlymodified)filewhenonlyasmallamountofthedatawithinthefilehasactuallychanged.dozensof copiesofthesamedatamaybemadeandstoredforlengthyperiodsoftime evenwhenthefileisnotchangingor haslostitsvaluetotheorganization.considerthefollowingexample: Afileiscreatedandbackedupthesameday. Thefileiscontinuallyupdatedandbackedupincrementallyoverthecourseofaweek. Thefileisthene mailedtoagroupofpeopleandbackedupanewaspartofthee mailapplicationbackup. Oneormoreoftherecipientsmodifiesthefileslightly(changesthedateonthecoverpageofa presentation,forexample)anditisbackedupagaininthenextincrementalbackup. Acopyofthefileismadeunderanewnameandisselectedforbackupagain. Inthemeantime,everyon premisescopyofabackupisreplicatedoffsite,doublingthecopyinstances. 4 Source:ESGResearchReport,2010DataProtectionTrends,April2010. 5 Ibid. 7

WhitePaperDeduplication sbusinessimperatives3 Inthisscenario,itiseasytoseetheinefficiencyinmanybackupprocesses.HighlyredundantbackupfilesclogLANs, WANs,andSANsandconsumeon andoff premisesstoragecapacity.itisalsoverydifficultforanadministratorto keeptrackofthemostrecentcopyandfinditquicklyifneeded. CompoundingtheProblem Insomeinstances,organizationsareaddingtothedataprotectioncapacityglutbyimplementingnewtechnologies tosolveotherit relatedproblems.oneprimeexampleisservervirtualizationinitiatives.thesesolutionsallow customerstorunmultipleserversonasinglepieceofhardware,whichdrivesuputilization.however,ongoingesg LabtestsconfirmthatVMwareimplementationsresultinasubstantialamountofredundantdatasinceeachvirtual machinesharescommonconfigurationfiles. ADifficultBalancingAct Asdatagrowsandregulatorymandatesdictatelongerretentionperiods,theamountofdataundermanagement mayexceedthetimeallocatedforbackup.inanefforttoreducebackuptimes,itorganizationsaredeployingdisk intheirbackupprocessesatanincreasedrate.however,esgfoundthatthecostofstoragesystemsisanothertop concern,creatingaconundrumforitorganizationstryingtoreduceexpenses.itexecutivesmaysimplylookatthe acquisitioncostofadisksystemtobeusedforbackupanddecideagainsttheinvestmentbecauseitisnotinthe budget.suchadecisionmaynotbethebestanswerasitstillneedstoprovideadequatedataprotectionservice levelsdespitedatagrowth. ControllingSecondaryStorageCosts Datadeduplicationidentifiesandeliminatesredundantdata.Itcanbeperformedatthefile,block,orbytelevel. Theopportunitytofindandeliminateredundancybecomesgreaterwithmoregranularexamination.Insecondary storageprocesses,suchasbackup,dataisinitiallyseededonthesecondarystoragedeviceandallsubsequently writtendataisexaminedforredundancy.redundantdataisnotstoredtwice;instead,apointertothestored duplicatedataiswritten,takingupsignificantlylessspace. Regardlessoftheimplementationmethod,deduplicationdeliversmeasurableresults.Oneofitskeymeasuresis thedegreeofcapacityreduction,or reductionratio. A 10x, 10:1 or 10times reductionindicatesthatan organizationwasabletoreducethesizeof,forexample,a500gbbackuptojust50gb.theseresultsarereal.ina recentstudy,56%ofcurrentdatadeduplicationuserssurveyedreporteda10 20xreductionand11%reported morethana20xreduction. 6 Deduplicationratioswillvarybasedonthetypeofdata,frequencyoffullbackups,how frequentlythedatachanges,retention,inter fileandinter applicationredundancy,typeofdeduplication(localor global),anddeduplicationalgorithms. 6 Source:ESGResearchReport,2010DataProtectionTrends,April2010. 8

WhitePaperDeduplication sbusinessimperatives4 Figure2.DataReductionRatiosofCurrentDataDeduplicationUsers Onaverage,whatdegreeofcapacityreductionhasyourorganizationexperiencedby usingdatadeduplicationtechnology?(percentofrespondents,n=140) Morethan20x reduction;11% Don tknow;5% Lessthan10x reduction;29% 10xto20xreduction; 56% Source:EnterpriseStrategyGroup,2010. Better,LowerCostDataProtection Deduplicationchangestheeconomicsofdisk baseddataprotection.first,itmakesthetransitionfromtape to disk basedprotectionmorepalatableasitdrivesthetotalcostofownershipofdisk basedbackupclosertothatof atape basedstrategy.capitalcostsavingsassociatedwithreplacingatape basedapproachmayencompasstape infrastructure(hardwareandsoftwarelicenses),tapemediaacquisition,anddisasterrecoverycosts.second, deduplicationoptimizesdisk basedbackupenvironmentsascompaniescanreplicatemoredatafordisaster recoverymoreefficiently.withduplicatedataremoved,companiesdonothavetobuyasmuchdiskcapacityfor theremotesiteandthereplicationprocesswillnotrequireassignificantanetworkbandwidthinfrastructure.a moreefficient,costeffectivereplicationinfrastructurealsoaffordstheopportunitytocopyrobodatatoacentral locationratherthanexecutebackupswithineachoffice. Thereplicationadvantagecreatesopportunitiesforbetterservicelevelsacrossanentireapplicationportfolio. AccordingtoESGresearch,53%and20%oforganizationssaidthattheycouldtolerateanhourofdowntimeorless fortheirmissionandbusinesscriticalapplications,respectively,beforeexperiencinganegativebusinessimpact suchasrevenuelossorproductivitydisruption.thebottomlineisthatapplicationdowntimecostscanbe astronomicalanddeduplicationgivescustomersmorewaystoprotectmoreoftheirsystems. 7 DataProtectionFlexibility Thereductioninbackupdataasaresultofdeduplicationofferstheluxuryofcapacity.Somemaychoosetoreduce thenumberofbackupsystemstheyhaveintheirenvironments.othersmayopttostoremoredataonlinefor fasterrecoveries.accordingtorecentesgsurveyresults,overtwo thirdsoforganizationssaidtheyretaindataon diskforamonthorlongerbeforemovingittotapeorexpiringit.thissuggeststhatorganizationswanttomake moreinformationaccessible deduplicationsimplyallowsthemtodoitmorecostefficiently.andstillother organizationsmaywanttocreatemorebackups(morecopiescreatedduringtheday)toreducetheriskofdata loss.alloptionsaremorefeasiblebecausededuplicationenablesmoredatatobestoredinasmallerphysical footprint,reducingtheoverallcostofonlinebackupsystems. 7 Source:ESGResearchReport,2010DataProtectionTrends,April2010. 9

WhitePaperDeduplication sbusinessimperatives5 SupportforMultipleITInitiatives ITiscontinuouslylookingforwaystoimproveresourceutilization,driveefficiency,andgeneratebetterservice levels.manyofthemeasurablebenefitsofdeduplicationassistwiththeseobjectives,withmuchofthepositive impacttakingplaceinthestorageenvironment.forexample,loweringcapacityrequirementscanimpact sustainabilityefforts.aspreviouslydiscussed,capacityoptimizationcanpostponeadditionalcapacitypurchasesas wellasreducepowerconsumptionanddatacenterfloorspacerequirements.inthecaseoftapeelimination,the associatedfacilityandenvironmentalcostsofthetapeinfrastructure especiallyatmultiplerobos maycreate negligiblepowerandcoolingsavingsversusadisk basedbackupsystemwithdeduplication. AnotherareawheredatadeduplicationsupportsITinitiativesisindatacenterconsolidation.Thetechnology reducesthenumberofstoragesystemsneededtosupportbackupanddisasterrecoveryandhelpsmitigatethe needforitoperationsatrobos.italsooptimizesservervirtualizationdeploymentsasiteliminatesmuchofthe downsideassociatedwiththeseprojects:virtualmachinediskimagescontainhighlyredundantdataandincrease storagecapacityrequirements.throughservervirtualization,customerscanreducethenumberofserversintheir environmentsandthroughdeduplication,theycanreducetheirstoragecapacityneeds. TheBiggerTruth Thebenefitsofdeduplicationarereflectedinitsrampantadoption.Ina2008ESGsurvey,11%ofrespondentswere usingthetechnologyinsomeportionoftheirenvironments. 8 Thisnumbermorethantripledin2010. 9 Thereason forsuchextremeuptakeisthatdeduplicationisoneofafewitsolutionsthatcutscostsquicklyandimproves servicelevels. SeniorITexecutivesconsistentlytalkaboutthechallengesofdatagrowth,dataprotection,anddisasterrecovery especiallyasitpertainstotheoperationalburdenitplacesontheirteams.theycan,however,joinnearly40%of theirpeersandstartusingdeduplicationtoreaptherewardsthatresultfromcopying,managing,andstoringless data. 20AsylumStreet Milford,MA01757 Tel:508.482.0188Fax:508.482.0218 www.enterprisestrategygroup.com 8 Source:ESGResearchReport,DataProtectionMarketTrends,February2008. 9 Source:ESGResearchReport,2010DataProtectionTrends,April2010. 10