ESG REPORT. Data Deduplication Diversity: Evaluating Software- vs. Hardware-Based Approaches. By Lauren Whitehouse. April, 2009
Table of Contents

Introduction
External Forces Contribute to IT Challenges
Data Deduplication
Deduplication in Secondary Storage Processes
How Deduplication Occurs
Where Deduplication Occurs
When Deduplication Occurs
Deduplication Domains
Considerations for Evaluating Deduplication
Ease of Deployment and Use
Performance
Scalability
Manageability
Offsite Copies
Summary

All trademark names are property of their respective companies. Information contained in this publication has been obtained by sources The Enterprise Strategy Group (ESG) considers to be reliable but is not warranted by ESG. This publication may contain opinions of ESG, which are subject to change from time to time. This publication is copyrighted by The Enterprise Strategy Group, Inc. Any reproduction or redistribution of this publication, in whole or in part, whether in hard-copy format, electronically, or otherwise to persons not authorized to receive it, without the express consent of the Enterprise Strategy Group, Inc., is in violation of U.S. copyright law and will be subject to an action for civil damages and, if applicable, criminal prosecution. Should you have any questions, please contact ESG Client Relations at (508)
Introduction

Deduplication dramatically improves the value proposition of disk-based data protection as it eliminates the redundancy typically seen in secondary storage processes. The use of deduplication will drive further backup-to-disk adoption and deliver associated performance and reliability benefits. Selecting a strategy for data deduplication requires consideration of several factors in order to avoid surprises later. Clearly understanding how deduplication works, especially in conjunction with other requirements such as performance, ease of use, and offsite copy creation, should go a long way toward selecting and designing a solution that delivers maximum business, operational, and financial benefits.

Deduplication has been more popularly deployed in target storage hardware, including virtual tape libraries (VTLs) and storage appliances; however, recent introductions by backup vendors have shifted the spotlight. How do hardware and software approaches differ? What should be considered when evaluating solutions?

External Forces Contribute to IT Challenges

There are a number of external forces working against IT organizations today, including data growth, compliance, and economic difficulties. These realities are impacting not only data protection processes and secondary storage environments, but data center environmentals such as power, cooling, and floor space, as well as bandwidth between primary and secondary recovery sites.

Relentless information growth is necessitating greater investments in IT infrastructure. ESG estimates that database data is growing at 25% per annum, with unstructured data increasing at two to three times that rate.[1] Data protection processes, such as backup and replication, compound capacity growth since multiple copies of primary data are made for operational and disaster recovery.
ESG research respondents cited data protection as the application that will be most responsible for storage growth over the next 24 months (see Figure 1).[2]

FIGURE 1. DATA PROTECTION'S COMPLICITY IN STORAGE GROWTH
Survey question: Which types of business applications do you believe will be most responsible for your organization's storage growth over the next 24 months? (Percent of respondents, N=504, multiple responses accepted)
Response categories shown: Backup & recovery / other data protection applications; Database / OLTP; Financials / ERP; Business intelligence / data warehouse / OLAP.
Source: Enterprise Strategy Group

[1] Source: ESG Research Report, Database Archiving Survey, December
[2] Source: ESG Research Report, Enterprise Storage Survey, December
The need to retain information for longer periods of time on accessible media, such as disk, for compliance, eDiscovery, and business intelligence purposes also contributes to capacity overabundance and stress on the data protection infrastructure. Moreover, in an effort to improve the performance and reliability of backup and recovery operations, organizations have been increasingly using disk as both the initial and final resting place of backup copies. While capital investments in disk can increase costs, disk-based data protection can contribute to lower operational expenses and improvements in backup and recovery service level agreements (SLAs).

In the midst of global financial turmoil, IT organizations are highly motivated to reduce costs and optimize efficiency, but not at the expense of introducing risk or impacting value. Technologies that create efficiency and deliver rapid ROI without sacrificing organizations' other goals (meeting backup and recovery SLAs, for example) are those being considered and deployed. One such technology is data deduplication.

Data Deduplication

Data deduplication identifies and eliminates redundancy, minimizing bandwidth and storage capacity requirements. Deduplication, while not new, has gained even greater popularity today: the increased use of disk in backup and recovery and data protection's aforementioned contribution to storage capacity growth make deduplication attractive. In fact, ESG research found that data reduction tops the list of respondents' top five storage priorities over the next 24 months (see Figure 2).[3]

FIGURE 2. TOP STORAGE-RELATED INITIATIVES BY ENTERPRISE ORGANIZATIONS CITING COST REDUCTION AS A MAJOR FACTOR IMPACTING STORAGE SPENDING
Survey question: What do you believe will be the most important storage-related initiatives undertaken by your organization over the next 24 months?
(Percent of respondents, multiple responses accepted)
Response categories shown: "Data reduction" technologies that reduce overall storage capacity; Storage virtualization; More power-efficient storage hardware that runs cooler and/or uses less energy; Tiered storage; Remote data protection solutions for remote/branch offices; Storage encryption solution.
Series shown: Organizations citing cost reduction initiatives as a major factor impacting storage spending (N=225); All other organizations (N=279).
Source: Enterprise Strategy Group, 2008

[3] Source: ESG Research Report, Enterprise Storage Survey, December

Deduplication in backup processes ensures that only unique data is stored and duplicate data is not. Initially, data is backed up to the storage device and all subsequently written data is examined for redundancy, with only
unique data being written to storage. When duplicate data is found, only a pointer linked to the original unique piece of data is stored. This pointer consumes significantly less space than storing the whole item multiple times.

The effectiveness of deduplication is often expressed as a reduction ratio denoting the ratio of protected capacity to the actual physical capacity stored. A 10:1 ratio means that 10 times more data is protected than the physical space required to store it and a 20:1 ratio means that 20 times more data can be protected. Factoring in data growth and retention, and assuming deduplication ratios in the 20:1 range, 2 TB of storage capacity could protect up to 40 TB of retained backup data.

The benefit of storing less data is obvious when it comes to reducing storage requirements and saving money. Another way of looking at it is that storing data more efficiently on disk allows for either longer retention periods or the recapture of disk and the ability to extend disk-based data protection to more workloads, contributing to improvements in recovery objectives. Efficient storage of data on disk can be a catalyst to reduce or eliminate tape media.

When it comes to selecting deduplication technology, many factors should be considered. After surveying organizations using or considering using deduplication, ESG research found that, not surprisingly, the cost of the solution was the most frequently cited factor (although savings garnered from capacity reduction often overcome financial objections to deploying deduplication). Otherwise, the data suggests that ease of deployment and ease of use, as well as the impact on backup/recovery performance, were important considerations, more so than technical implementations such as inline or post-process approaches or the deduplication ratio (see Figure 3).[4]

FIGURE 3.
CONSIDERATIONS FOR SELECTING DATA DEDUPLICATION TECHNOLOGY
Survey question: Which of the following considerations would you say were most important in your organization's evaluation and selection of data de-duplication technology? (Percent of respondents, N=58, multiple responses accepted)
Response categories shown: Cost of solution; Integration with existing backup processes; Impact on backup/recovery performance; Ease of implementation/use; Scalability of solution; De-duplication ratio (degree of storage capacity reduction); Where de-duplication occurs (i.e., in-line vs. post-process); Vendor service and support; Granularity of de-duplication (file-level vs. sub-file-level); Experience of vendor in backup implementation; Existing relationship with vendor; Don't know.
Source: Enterprise Strategy Group

[4] Source: ESG Research Report, Data Protection Market Trends, January
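The reduction-ratio arithmetic described in the Data Deduplication section (for example, 2 TB of physical capacity protecting up to 40 TB at 20:1) can be sketched in a few lines of Python. This is only an illustrative calculation; the function names are hypothetical and the ratio is treated as a single constant, which real workloads will not deliver uniformly.

```python
def protected_capacity_tb(physical_tb: float, ratio: float) -> float:
    """Protected (logical) capacity that fits in physical_tb at a ratio:1 reduction."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return physical_tb * ratio


def space_saved_fraction(ratio: float) -> float:
    """Fraction of physical capacity avoided at a ratio:1 reduction (1 - 1/ratio)."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return 1.0 - 1.0 / ratio


if __name__ == "__main__":
    # 2 TB of disk at an assumed 20:1 ratio
    print(protected_capacity_tb(2.0, 20.0))
    # Savings at 10:1 versus 20:1
    print(space_saved_fraction(10.0), space_saved_fraction(20.0))
```

Note that moving from 10:1 to 20:1 doubles the protected capacity but raises the space-saved fraction only from 90% to 95%, which is one reason the ratio alone is a weak basis for comparing solutions.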
Data deduplication is a ground-breaking technology that changes the economics of disk-based backup and recovery, so the decision to adopt it should be easy. However, organizations must familiarize themselves with the many facets of deduplication solutions and consider them prior to purchase; this can make the evaluation and selection of a solution more complex.

Deduplication in Secondary Storage Processes

Deduplication is a feature of both software- and hardware-based data protection solutions. Vendors offering this feature have taken different approaches to how, where, and when deduplication occurs and possess varying limitations in the scope of deduplication.

How Deduplication Occurs

Deduplication solutions either have knowledge about the data in the backup stream or they don't. Those that do are content-aware: they can look at patterns in the data stream (the bytes that make up a file) and determine the optimal segment boundaries, which maximizes the likelihood of identifying duplicates. Backup software understands the content, whereas target-side deduplication solutions typically do not. Targets simply receive a stream of data from the backup application. Those target-side deduplication devices that are content-aware typically have to extract the metadata associated with the backup and reverse engineer the backup stream to understand its contents.

Hash-Based Algorithms

Deduplication solutions may depend on a hash algorithm to determine redundancy. Traditionally, hash algorithms were used to compare data read vs. data written by performing a calculation on a chunk of the data. If the result is identical, then the data read is the same as the data written. The concept has been applied to identifying unique data as a method of fingerprinting chunks of data. The concept, as applied to backup, is that multiple segments of the backup data stream are fingerprinted. The ID of each chunk of incoming data is compared against the central index.
Unique IDs are stored in the index and unique data is written to disk. Any duplicates are discarded and a pointer to the existing data is stored instead. Considerations with hash-based methods are the size and location of the index. Hash comparisons made with a memory-resident index will be considerably faster than with a disk-based one. The index may be kept in RAM, but its size may be constrained by the memory limitations of the solution. An index stored on disk could grow large; however, disk seeks may impact performance. These factors, therefore, may impact the capacity of storage contained in a single system.

Delta Differencing

Another deduplication approach is delta differencing. With this method, there is a level of content awareness. This means that backup streams can be compared from one to the next, i.e., the backup performed today is compared to yesterday's backup, an approach often taken by content-aware solutions. Only the new or changed blocks or bytes (differences) are stored. Old or recurring blocks or bytes are discarded. This approach may be faster than hash-based approaches, but cannot deduplicate across backup streams from different backup applications.

Pattern Matching

Other vendors use different approaches for finding duplicate data. Pattern matching doesn't rely on hashing; instead, this technique uses an advanced pattern recognition and differencing algorithm to find and keep track of duplicates. Like delta differencing, this approach examines the incoming data stream to see if similar data was received in the past. However, that similar data is further scrutinized to find any differences and only the unique bytes are saved. This method may be faster than traditional hashing methods because it is less CPU- and memory-intensive. The size of the index is smaller than with traditional hashing methods, often resulting in greater levels of performance and scalability.
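As a rough illustration of the hash-based approach described above, the following Python sketch fingerprints chunks of an incoming stream, stores each unique chunk once, and records a pointer (the fingerprint) for every chunk. It is a simplification under stated assumptions, not any vendor's implementation: the class name DedupStore is hypothetical, chunks are fixed-size rather than content-aware, SHA-256 is an arbitrary choice of hash, and an in-memory dictionary stands in for both the index and the disk.

```python
import hashlib
from typing import Dict, List


class DedupStore:
    """Toy hash-based deduplication store (fixed-size chunks, in-memory index)."""

    def __init__(self, chunk_size: int = 4096) -> None:
        self.chunk_size = chunk_size
        # fingerprint -> unique chunk; stands in for the index plus disk storage
        self.index: Dict[str, bytes] = {}

    def write(self, stream: bytes) -> List[str]:
        """Ingest a backup stream; return its recipe (ordered list of pointers)."""
        recipe = []
        for offset in range(0, len(stream), self.chunk_size):
            chunk = stream[offset:offset + self.chunk_size]
            fingerprint = hashlib.sha256(chunk).hexdigest()
            if fingerprint not in self.index:
                # Unique data: store it once
                self.index[fingerprint] = chunk
            # Duplicate or not, only the pointer goes into the recipe
            recipe.append(fingerprint)
        return recipe

    def read(self, recipe: List[str]) -> bytes:
        """Rebuild the original stream by following the pointers."""
        return b"".join(self.index[fp] for fp in recipe)

    def stored_bytes(self) -> int:
        """Physical bytes held after deduplication."""
        return sum(len(chunk) for chunk in self.index.values())


if __name__ == "__main__":
    store = DedupStore(chunk_size=4)
    recipe = store.write(b"AAAABBBBAAAA")
    print(len(recipe), store.stored_bytes())
    print(store.read(recipe))
```

The dictionary here grows with the number of unique chunks, which is the index size and location concern raised in the text: a real system has to decide how much of that index can stay in memory.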
Where Deduplication Occurs

In data protection, deduplication can occur at one or more places in the data path: at the system being backed up (source-side deduplication), the backup media server (proxy deduplication), or the destination storage device (target-side deduplication).

Some backup applications deduplicate at the data source via client agent technology. In this case, client software running on an application server identifies and transfers only unique data to the backup media server and target storage device, providing greater network efficiency. Other backup software solutions deduplicate the backup stream at the backup server, removing any performance burden from production application servers. Further, some distribute the deduplication process throughout the data path, performing hashing at the client and deduplication at the media server. It will be important to understand if and/or how deduplication solutions optimize performance and distribute the deduplication workload in software-based approaches.

Deduplicating data after it has passed through the media server is referred to as target-side deduplication. This approach typically leverages powerful purpose-built storage appliances to accommodate processing of the entire (non-deduplicated) backup load either pre- or post-ingestion.

There are pros and cons to every approach, and selecting one over another depends on multiple factors, such as performance requirements, flexibility, scalability, and cost. One of the drawbacks of a software approach is that adopting that feature could require a switch or upgrade in backup application or client agents. However, software-based deduplication may offer more flexibility, especially for disk vendor selection. As a built-in feature of backup, a big benefit may be no added cost. Performance may or may not be a drawback.
This will depend on the characteristics of the hardware where deduplication takes place, whether or not deduplication processing is distributed, and the aforementioned method for identifying duplicates.

Performance is often considered less of an issue with hardware-based deduplication as it typically leverages powerful purpose-built storage appliances. The trade-offs could be flexibility (in disk vendor) and scalability, depending on the solution. The key is finding a solution that allows for capacity and performance growth without necessitating a forklift upgrade. This could mean a single highly scalable system or multiple systems that are managed and monitored from a single management interface.

When Deduplication Occurs

Deduplication can occur before data is written to disk (inline processing) or after it is written to disk (post-processing). Inline approaches inspect and deduplicate data at the source, at the media server, or upon ingest at the disk. The trade-offs with this approach are related to performance, which depends on a few factors such as how duplicates are identified, the granularity of deduplication, how the deduplication processing workload is distributed, network performance, and more. An inline approach may be preferred for workloads if replication to an offsite location is needed immediately.

Post-process deduplication will write the backup image to disk before initiating deduplication, which allows the backup to complete at full disk performance. Oftentimes, the trade-off with this approach is the amount of disk capacity required for the solution, as disk capacity will be required to temporarily store the backup stream plus the deduplicated backup. Some post-process solutions perform deduplication on a job-by-job basis, acting on data as it arrives and releasing the space once the deduplication process is completed for that job, while others deduplicate as data is ingested, minimizing the need for a landing area.
A post-process approach may be preferred if the workload includes a lot of new data, if the backup window is small, or if replicating data to an offsite location can afford some lag time. A few vendors offer both inline and post-process options on a per-job basis, which offers additional flexibility. This makes it possible to customize deduplication strategies for specific workloads.
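The capacity trade-off between inline and post-process deduplication described above can be approximated with a back-of-the-envelope sketch. The Python functions below are hypothetical and make strong simplifying assumptions: one constant reduction ratio for all jobs, jobs processed one at a time, and a job-by-job landing area bounded by the largest single job. Treat the results as rough upper bounds rather than sizing guidance.

```python
from typing import Sequence


def inline_peak_tb(job_sizes_tb: Sequence[float], ratio: float) -> float:
    """Inline: only deduplicated data is ever written to disk."""
    return sum(job_sizes_tb) / ratio


def post_process_peak_tb(
    job_sizes_tb: Sequence[float],
    ratio: float,
    per_job_release: bool = False,
) -> float:
    """Post-process: landing area for raw backups plus the deduplicated store.

    per_job_release=False assumes the whole backup set lands before any
    space is reclaimed; True assumes space is released after each job.
    """
    deduplicated = sum(job_sizes_tb) / ratio
    landing = max(job_sizes_tb) if per_job_release else sum(job_sizes_tb)
    return landing + deduplicated


if __name__ == "__main__":
    jobs = [4.0, 6.0]   # nightly backup jobs in TB (made-up figures)
    ratio = 10.0        # assumed 10:1 reduction
    print(inline_peak_tb(jobs, ratio))
    print(post_process_peak_tb(jobs, ratio))
    print(post_process_peak_tb(jobs, ratio, per_job_release=True))
```

Running the same calculation with an organization's own job sizes and an observed ratio from testing gives a first estimate of how much landing space a post-process solution would need.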
Deduplication Domains

The deduplication domain refers to the realm of data used for subsequent comparisons when identifying duplicates. Local deduplication only compares data against other data passing through the same system. Most target-side deduplication solutions fall into this category. The good news is that this approach is more field-proven. The bad news is that local deduplication is often the consequence of scalability limitations.

Conversely, global deduplication makes comparisons within and across systems. This capability is more often seen in software-based and grid-architecture approaches, but may also be supported for target deduplication systems that replicate in a hub-and-spoke fashion (with global deduplication occurring at the hub). Global deduplication can result in higher deduplication ratios as data is deduplicated within and across backup sources, and greater economies of scale with respect to operational overhead and capital costs.

Another aspect of deduplication domain is the storage tiers where deduplication can be applied. Target-side deduplication solutions are limited to disk-based storage, while backup software with media management capabilities may extend to the tape tier, too. The ability to move data in the compressed state from disk to tape introduces capacity savings for long-term archiving.

Considerations for Evaluating Deduplication

Ease of Deployment and Use

As previously noted, ESG research found that the ability to integrate with existing backup processes and overall ease of use are of greater importance to users than more specific technical considerations. If a deduplication solution is not easy to manage and does not benignly integrate with existing data protection processes, even the best-performing product with the latest whiz-bang features will be a non-starter.

Software incorporating deduplication is either going to be the easiest to deploy and use or be the most disruptive.
This will depend on whether a switch from incumbent software is required and whether or not disk is already incorporated in the backup process.

Hardware-based deduplication solutions have garnered popularity as they are easy to deploy and less disruptive to existing backup environments and processes. Most vendors' solutions are delivered as target storage systems, appearing as a file server over Ethernet or as a VTL over Fibre Channel. They offer a plug-and-play experience and don't require client software. Compatibility with existing backup software depends on the deduplication approach, since content-aware target-side solutions require some development efforts for each backup application supported. One of the drawbacks to this approach is that the quantity of backup applications compatible with a target deduplication solution may be limited and the vendor may be slow to add support.

Performance

To understand the optimal deduplication strategy, organizations need to examine backup data sets (size, frequency, criticality, and whether or not deduplication makes sense) to determine a deduplication strategy and how it impacts the overall performance of backup. Providing policy-based deduplication (the ability to turn it on or off depending on the workload and its requirements) gives the flexibility to enable deduplication for data sets with lower backup performance requirements or high data redundancy, and disable deduplication for data sets with high backup performance requirements or little data redundancy. Policy-based deduplication also extends to determining whether inline or post-processing implementations occur. Post-process deduplication approaches have less impact on backup windows, while an inline method could introduce some impact on performance.

Deduplication solutions require regular housekeeping operations called cleaning or garbage collection. This process reorganizes stored data, reclaiming space freed by expired data and consolidating free capacity.
Housekeeping operations could impact performance, so it is important to understand if and how the process can
be scheduled to avoid peak backup windows. Enabling housekeeping during an evaluation will provide a real-world simulation of system performance. When evaluating deduplication solutions, it's important to test over an extended period of time: a minimum of one to two weeks or several backup cycles.

Recovery performance is equally important. With deduplication making it much more economical to store backup data sets for longer periods of time, it is more likely that data will be restored from deduplicated data. It will, therefore, be important to test how a deduplication engine performs in several recovery scenarios, especially for data stored over a longer period of time, to judge the potential impact of deduplication in the environment.

Scalability

Data deduplication should mitigate the need to expand storage capacity. However, it is still important to understand what the upper threshold of capacity is for the solution and, when additional capacity is required, how easy or difficult it is to augment it. For example, can the solution's repository expand on a per-system basis or will a device upgrade require a new system to be deployed and data to be migrated? The worst-case scenario is for the IT organization to manage an ever-growing number of independent silos.

Manageability

Manageability is a key concern that is often overlooked. Backup is typically managed from the backup application. Configuring policy settings, monitoring operations, and reporting results and statistics are centralized. Adding target-side deduplication creates another point of management where deduplication-specific policies must be set. While target-side deduplication is simple to implement, what is the management impact as backup capacity grows and the environment scales? Fewer silos mean fewer points of management; therefore, what are the long-term prospects for managing the backup environment with multiple target deduplication systems?
Does each device have to be managed individually? Can data be deduplicated across target devices or is each a silo? Centralized management of policies simplifies administration, decreases complexity, and reduces operational costs. Backup software with deduplication consolidates policy management and may provide better visibility of operations.

Offsite Copies

Typically, disk-based backup with deduplication is a replacement for tape-based backup. If that's the case, then how can backup sets be moved offsite for DR purposes? Hardware-based deduplication solutions often offer device-to-device remote replication. While there may be an added cost for acquiring and deploying a second system at a remote location, doing so will provide a safeguard in the event that the primary site (or the backup set managed at that site) is unavailable. Some backup applications can also replicate data from site to site. It's important to understand if the data replicated between sites by either hardware or software solutions maintains the data's deduplicated state to optimize bandwidth.

For many environments, physical tape creation is still necessary to fulfill DR and retention requirements. Most backup software solutions and some VTLs with deduplication offer the added capability of creating physical tape media. Most solutions supporting tape must reinflate data prior to it being copied to tape, eliminating the benefits of deduplication. One backup solution does offer the ability to move deduplicated data from disk to tape, minimizing the number of tapes required to store data for long-term archiving. When a recovery is required, tape-based deduplicated data must be copied back to disk, and will then be available to the end-user or application.
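To make the bandwidth point about replication concrete, the sketch below shows one way a site-to-site copy could preserve the deduplicated state: chunk fingerprints are checked against the remote site's index and only chunks the remote lacks are transmitted. This is a hypothetical Python illustration, not a description of any product's replication protocol; the function names, the use of SHA-256, and the dictionary standing in for the remote index are all assumptions.

```python
import hashlib
from typing import Dict, Iterable


def fingerprint(chunk: bytes) -> str:
    """Identify a chunk by its SHA-256 digest (assumed hash choice)."""
    return hashlib.sha256(chunk).hexdigest()


def replicate(chunks: Iterable[bytes], remote_index: Dict[str, bytes]) -> int:
    """Copy chunks to the remote site, skipping any it already holds.

    Returns the number of payload bytes that had to cross the link.
    """
    bytes_sent = 0
    for chunk in chunks:
        fp = fingerprint(chunk)
        if fp not in remote_index:
            # Only unseen data is transmitted and stored remotely
            remote_index[fp] = chunk
            bytes_sent += len(chunk)
    return bytes_sent


if __name__ == "__main__":
    remote: Dict[str, bytes] = {}
    # First backup: the repeated chunk is sent only once
    print(replicate([b"aaaa", b"bbbb", b"aaaa"], remote))
    # Next backup: only the new chunk is sent
    print(replicate([b"aaaa", b"cccc"], remote))
```

A solution that reinflates data before replication would instead send every chunk in full, so comparing bytes sent against logical backup size during testing is one way to check which behavior a product exhibits.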
Summary

Before seeking out specific vendors, it is most important to understand the organization's deduplication needs to ensure the right fit for the environment and ease of integration. This process includes some capacity planning to make sure that the pursued solution will have some longevity and that capacity scaling needs are well understood before purchase.

Once a vendor short list is determined, vet the company and its deduplication product. Seek out references, understand how many active deployments of the technology are in place, leverage the vendor's ROI model, compare the results versus competing solutions, and get a glimpse into the product roadmap. It is important to understand the vendor's business success and long-term viability, its support capability, how well it communicates with you, and what other services or products it could offer to you today and over time.

Next, test, test, test. Beginning with installation and configuration, test viable solutions using real data based on policies in place in the current data protection environment. Record deduplication ratio results, backup and recovery performance (single stream and aggregate performance), and replication performance over an extended period of time (a week or two at a minimum). These tests should include deleting and expiring data, as well as simulating expected change rates. If applicable, test the physical tape creation process. Finally, simulate system failure to test resiliency.

Choosing a deduplication strategy is not a simple task. Technology maturity varies considerably and the vendor landscape is in flux. As solutions are considered, cut through the hyperbole by requesting real-world references and proof points to vendor claims. Test backup and, importantly, restore performance. Thorough due diligence up front may stave off surprises later.

20 Asylum Street, Milford, MA
More informationTurnkey Deduplication Solution for the Enterprise
Symantec NetBackup 5000 Appliance Turnkey Deduplication Solution for the Enterprise Mayur Dewaikar Sr. Product Manager, Information Management Group White Paper: A Deduplication Appliance Solution for
More informationBackup and Recovery: The Benefits of Multiple Deduplication Policies
Backup and Recovery: The Benefits of Multiple Deduplication Policies NOTICE This White Paper may contain proprietary information protected by copyright. Information in this White Paper is subject to change
More informationDeduplication Demystified: How to determine the right approach for your business
Deduplication Demystified: How to determine the right approach for your business Presented by Charles Keiper Senior Product Manager, Data Protection Quest Software Session Objective: To answer burning
More informationWhite. Paper. Benefiting from Server Virtualization. Beyond Initial Workload Consolidation. June, 2010
White Paper Benefiting from Server Virtualization Beyond Initial Workload Consolidation By Mark Bowker June, 2010 This ESG White Paper was commissioned by VMware and is distributed under license from ESG.
More informationWHITE PAPER Improving Storage Efficiencies with Data Deduplication and Compression
WHITE PAPER Improving Storage Efficiencies with Data Deduplication and Compression Sponsored by: Oracle Steven Scully May 2010 Benjamin Woo IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA
More informationChoosing an Enterprise-Class Deduplication Technology
WHITE PAPER Choosing an Enterprise-Class Deduplication Technology 10 Key Questions to Ask Your Deduplication Vendor 400 Nickerson Road, Marlborough, MA 01752 P: 866.Sepaton or 508.490.7900 F: 508.490.7908
More informationEMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst
White Paper EMC s Enterprise Hadoop Solution Isilon Scale-out NAS and Greenplum HD By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst February 2012 This ESG White Paper was commissioned
More informationEMC PERSPECTIVE. An EMC Perspective on Data De-Duplication for Backup
EMC PERSPECTIVE An EMC Perspective on Data De-Duplication for Backup Abstract This paper explores the factors that are driving the need for de-duplication and the benefits of data de-duplication as a feature
More informationProtect Data... in the Cloud
QUASICOM Private Cloud Backups with ExaGrid Deduplication Disk Arrays Martin Lui Senior Solution Consultant Quasicom Systems Limited Protect Data...... in the Cloud 1 Mobile Computing Users work with their
More informationAccelerating Backup/Restore with the Virtual Tape Library Configuration That Fits Your Environment
Accelerating Backup/Restore with the Virtual Tape Library Configuration That Fits Your Environment A WHITE PAPER Abstract: Since VTL uses disk to back up data, it eliminates the media and mechanical errors
More informationThe Shift Toward Data Protection Appliances
Research Report Abstract: The Shift Toward Data Protection Appliances By Jason Buffington, Senior Analyst and Bill Lundell, Senior Research Analyst With Jennifer Gahm, Senior Project Manager March 2015
More informationEvaluation Guide. Software vs. Appliance Deduplication
Evaluation Guide Software vs. Appliance Deduplication Table of Contents Introduction... 2 Data Deduplication Overview... 3 Backup Requirements... 6 Backup Application Client Side Deduplication... 7 Backup
More informationBusiness Benefits of Data Footprint Reduction
Business Benefits of Data Footprint Reduction Why and how reducing your data footprint provides a positive benefit to your business and application service objectives By Greg Schulz Founder and Senior
More informationLDA, the new family of Lortu Data Appliances
LDA, the new family of Lortu Data Appliances Based on Lortu Byte-Level Deduplication Technology February, 2011 Copyright Lortu Software, S.L. 2011 1 Index Executive Summary 3 Lortu deduplication technology
More informationRiverbed Whitewater/Amazon Glacier ROI for Backup and Archiving
Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving November, 2013 Saqib Jang Abstract This white paper demonstrates how to increase profitability by reducing the operating costs of backup
More informationTop Ten Questions. to Ask Your Primary Storage Provider About Their Data Efficiency. May 2014. Copyright 2014 Permabit Technology Corporation
Top Ten Questions to Ask Your Primary Storage Provider About Their Data Efficiency May 2014 Copyright 2014 Permabit Technology Corporation Introduction The value of data efficiency technologies, namely
More informationSTORAGE. Buying Guide: TARGET DATA DEDUPLICATION BACKUP SYSTEMS. inside
Managing the information that drives the enterprise STORAGE Buying Guide: DEDUPLICATION inside What you need to know about target data deduplication Special factors to consider One key difference among
More informationWhite. Paper. The SMB Market is Ready for Data Encryption. January, 2011
White Paper The SMB Market is Ready for Data Encryption By Mark Peters January, 2011 This ESG White Paper was commissioned by Tandberg Data and is distributed under license from ESG. 2011, Enterprise Strategy
More informationData De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication
Data De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication Table of Contents Introduction... 3 Shortest Possible Backup Window... 3 Instant
More informationWHITE PAPER Data Deduplication for Backup: Accelerating Efficiency and Driving Down IT Costs
WHITE PAPER Data Deduplication for Backup: Accelerating Efficiency and Driving Down IT Costs Sponsored by: EMC Corporation Laura DuBois May 2009 EXECUTIVE SUMMARY Global Headquarters: 5 Speen Street Framingham,
More informationUNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON
UNDERSTANDING DATA DEDUPLICATION Thomas Rivera SEPATON SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individual members may use this material
More informationSymantec NetBackup PureDisk Optimizing Backups with Deduplication for Remote Offices, Data Center and Virtual Machines
Optimizing Backups with Deduplication for Remote Offices, Data Center and Virtual Machines Mayur Dewaikar Sr. Product Manager Information Management Group White Paper: Symantec NetBackup PureDisk Symantec
More informationDXi Accent Technical Background
TECHNOLOGY BRIEF NOTICE This Technology Brief contains information protected by copyright. Information in this Technology Brief is subject to change without notice and does not represent a commitment on
More informationWhite. Paper. HP StoreOnce Is Better Together with HP Data Protector 7. December 2012
White Paper HP StoreOnce Is Better Together with HP Data Protector 7 By Jason Buffington, Senior Analyst December 2012 This ESG White Paper was commissioned by HP and is distributed under license from
More informationWHITE PAPER. How Deduplication Benefits Companies of All Sizes An Acronis White Paper
How Deduplication Benefits Companies of All Sizes An Acronis White Paper Copyright Acronis, Inc., 2000 2009 Table of contents Executive Summary... 3 What is deduplication?... 4 File-level deduplication
More informationEMC DATA DOMAIN EXTENDED RETENTION SOFTWARE: MEETING NEEDS FOR LONG-TERM RETENTION OF BACKUP DATA ON EMC DATA DOMAIN SYSTEMS
SOLUTION PROFILE EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE: MEETING NEEDS FOR LONG-TERM RETENTION OF BACKUP DATA ON EMC DATA DOMAIN SYSTEMS MAY 2012 Backups are essential for short-term data recovery
More informationUNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard
UNDERSTANDING DATA DEDUPLICATION Tom Sas Hewlett-Packard SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individual members may use this material
More informationUsing HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup
Technical white paper Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup Table of contents Executive summary... 2 Introduction... 2 What is NDMP?... 2 Technology overview... 3 HP
More informationEMC Disk Library with EMC Data Domain Deployment Scenario
EMC Disk Library with EMC Data Domain Deployment Scenario Best Practices Planning Abstract This white paper is an overview of the EMC Disk Library with EMC Data Domain deduplication storage system deployment
More informationData Deduplication in Tivoli Storage Manager. Andrzej Bugowski 19-05-2011 Spała
Data Deduplication in Tivoli Storage Manager Andrzej Bugowski 19-05-2011 Spała Agenda Tivoli Storage, IBM Software Group Deduplication concepts Data deduplication in TSM 6.1 Planning for data deduplication
More informationDeduplication has been around for several
Demystifying Deduplication By Joe Colucci Kay Benaroch Deduplication holds the promise of efficient storage and bandwidth utilization, accelerated backup and recovery, reduced costs, and more. Understanding
More informationThe Data Center of the Future
2010, Enterprise Strategy Group, Inc. All Rights Reserved White Paper The Data Center of the Future By Mark Bowker and Lauren Whitehouse March, 2010 This ESG White Paper was commissioned by Veeam and is
More informationW H I T E P A P E R R e a l i z i n g t h e B e n e f i t s o f Deduplication in a Backup and Restore System
W H I T E P A P E R R e a l i z i n g t h e B e n e f i t s o f Deduplication in a Backup and Restore System Sponsored by: HP Noemi Greyzdorf November 2008 Robert Amatruda INTRODUCTION Global Headquarters:
More informationA Business Case for Disk Based Data Protection
Mosaic Technology s IT Director s Series: A Business Case for Disk Based Data Protection presented by Mosaic Technology Mosaic Technology Corporation * Salem, NH (603) 898-5966 * Bellevue, WA (425) 462-5004
More informationFuture-Proofed Backup For A Virtualized World!
! Future-Proofed Backup For A Virtualized World! Prepared by: Colm Keegan, Senior Analyst! Prepared: January 2014 Future-Proofed Backup For A Virtualized World Like death and taxes, growing backup windows
More informationLab Validation Report
Lab Validation Report EMC Avamar 6.0 Next Generation Backup & Recovery By Vinny Choinski with David Chapa June 2011 Lab Validation: EMC Avamar 6.0 2 Contents Introduction... 3 Background... 3 EMC Avamar
More informationCONSOLIDATE MORE: HIGH- PERFORMANCE PRIMARY DEDUPLICATION IN THE AGE OF ABUNDANT CAPACITY
CONSOLIDATE MORE: HIGH- PERFORMANCE PRIMARY DEDUPLICATION IN THE AGE OF ABUNDANT CAPACITY YONG KIM, TECHNICAL DIRECTOR, AMERICAS FILE AND CONTENT SOLUTIONS WEBTECH EDUCATIONAL SERIES CONSOLIDATE MORE:
More informationExaGrid - A Backup and Data Deduplication appliance
Detailed Product Description ExaGrid Backup Storage Appliances with Deduplication 2014 ExaGrid Systems, Inc. All rights reserved. Table of Contents Executive Summary...2 ExaGrid Basic Concept...2 ExaGrid
More informationUNDERSTANDING DATA DEDUPLICATION. Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s.
UNDERSTANDING DATA DEDUPLICATION Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s. SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individual
More informationOvercoming Backup & Recovery Challenges in Enterprise VMware Environments
Overcoming Backup & Recovery Challenges in Enterprise VMware Environments Daniel Budiansky Enterprise Applications Technologist Data Domain Dan Lewis Manager, Network Services USC Marshall School of Business
More informationLeveraging EMC Deduplication Solutions for Backup, Recovery, and Long-term Information Retention
White Paper Leveraging EMC Deduplication Solutions for Backup, Recovery, and Long-term Information Retention By Brian Babineau and David A. Chapa January, 2011 This ESG White Paper was commissioned by
More informationReal-time Compression: Achieving storage efficiency throughout the data lifecycle
Real-time Compression: Achieving storage efficiency throughout the data lifecycle By Deni Connor, founding analyst Patrick Corrigan, senior analyst July 2011 F or many companies the growth in the volume
More informationWhite. Paper. Addressing NAS Backup and Recovery Challenges. February 2012
White Paper Addressing NAS Backup and Recovery Challenges By Terri McClure and Jason Buffington February 2012 This ESG White Paper was commissioned by EMC and is distributed under license from ESG. 2012,
More informationResearch Report. Abstract: Trends in Data Protection Modernization. August 2012
Research Report Abstract: Trends in Data Protection Modernization By Jason Buffington, Senior Analyst, and Bill Lundell, Senior Research Analyst August 2012 Introduction Research Objectives Research Report:
More informationWHY DO I NEED FALCONSTOR OPTIMIZED BACKUP & DEDUPLICATION?
WHAT IS FALCONSTOR? FalconStor Optimized Backup and Deduplication is the industry s market-leading virtual tape and LAN-based deduplication solution, unmatched in performance and scalability. With virtual
More informationHow To Make A Backup System More Efficient
Identifying the Hidden Risk of Data De-duplication: How the HYDRAstor Solution Proactively Solves the Problem October, 2006 Introduction Data de-duplication has recently gained significant industry attention,
More informationWHITE PAPER. Effectiveness of Variable-block vs Fixedblock Deduplication on Data Reduction: A Technical Analysis
WHITE PAPER Effectiveness of Variable-block vs Fixedblock Deduplication on Data Reduction: A Technical Analysis CONTENTS Executive Summary... 3 Fixed vs. Variable-block Deduplication... 3 Test Configuration...
More informationHP Data Protector software and HP StoreOnce backup systems for federated deduplication and flexible deployment
Brochure Maximize storage efficiency across the enterprise HP Data Protector software and HP StoreOnce backup systems for federated deduplication and flexible deployment Maximize storage efficiency across
More informationSales Tool. Summary DXi Sales Messages November 2009 6 NOVEMBER 2009. ST00431-v06
Summary DXi Sales Messages November 2009 6 NOVEMBER 2009 ST00431-v06 Notice This Sales Tool contains proprietary information protected by copyright. Information in this Sales Tool is subject to change
More informationArchive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration
Solutions Integrated Storage Appliances Management Optimized Storage & Migration Archive Data Retention & Compliance Services Global Installation & Support SECURING THE FUTURE OF YOUR DATA w w w.q sta
More informationProtecting Information in a Smarter Data Center with the Performance of Flash
89 Fifth Avenue, 7th Floor New York, NY 10003 www.theedison.com 212.367.7400 Protecting Information in a Smarter Data Center with the Performance of Flash IBM FlashSystem and IBM ProtecTIER Printed in
More informationEight Considerations for Evaluating Disk-Based Backup Solutions
Eight Considerations for Evaluating Disk-Based Backup Solutions 1 Introduction The movement from tape-based to disk-based backup is well underway. Disk eliminates all the problems of tape backup. Backing
More informationBy Jason Buffington, Senior Analyst, and Monya Keane, Research Analyst
White Paper How to Accelerate IT Resiliency Through Virtualization By Jason Buffington, Senior Analyst, and Monya Keane, Research Analyst August 2015 This ESG White Paper was commissioned by Axcient and
More informationPrimaryIO Application Performance Acceleration Date: July 2015 Author: Tony Palmer, Senior Lab Analyst
ESG Lab Spotlight PrimaryIO Application Performance Acceleration Date: July 215 Author: Tony Palmer, Senior Lab Analyst Abstract: PrimaryIO Application Performance Acceleration (APA) is designed to provide
More informationUsing HP StoreOnce Backup systems for Oracle database backups
Technical white paper Using HP StoreOnce Backup systems for Oracle database backups Table of contents Introduction 2 Technology overview 2 HP StoreOnce Backup systems key features and benefits 2 HP StoreOnce
More informationEnterprise Backup and Restore technology and solutions
Enterprise Backup and Restore technology and solutions LESSON VII Veselin Petrunov Backup and Restore team / Deep Technical Support HP Bulgaria Global Delivery Hub Global Operations Center November, 2013
More informationBackup and Archiving Convergence Trends
Research Report Abstract: Backup and Archiving Convergence Trends By Jason Buffington, Senior Analyst and Bill Lundell, Senior Research Analyst With Jennifer Gahm, Senior Project Manager April 2014 Introduction
More informationEfficient Backup with Data Deduplication Which Strategy is Right for You?
Efficient Backup with Data Deduplication Which Strategy is Right for You? Rob Emsley Senior Director, Product Marketing CPU Utilization CPU Utilization Exabytes Why So Much Interest in Data Deduplication?
More informationNext Generation NAS: A market perspective on the recently introduced Snap Server 500 Series
Next Generation NAS: A market perspective on the recently introduced Snap Server 500 Series ABSTRACT In February 2006, Snap Server announced the 500 Series of Networked Attached Storage systems as well
More informationSymantec NetBackup 5220
A single-vendor enterprise backup appliance that installs in minutes Data Sheet: Data Protection Overview is a single-vendor enterprise backup appliance that installs in minutes, with expandable storage
More informationEMC DATA DOMAIN OPERATING SYSTEM
EMC DATA DOMAIN OPERATING SYSTEM Powering EMC Protection Storage ESSENTIALS High-Speed, Scalable Deduplication Up to 58.7 TB/hr performance Reduces requirements for backup storage by 10 to 30x and archive
More informationDell PowerVault DL Backup to Disk Appliance Powered by CommVault. Centralized data management for remote and branch office (Robo) environments
Dell PowerVault DL Backup to Disk Appliance Powered by CommVault Centralized data management for remote and branch office (Robo) environments Contents Executive summary Return on investment of centralizing
More informationEMC DATA DOMAIN OPERATING SYSTEM
ESSENTIALS HIGH-SPEED, SCALABLE DEDUPLICATION Up to 58.7 TB/hr performance Reduces protection storage requirements by 10 to 30x CPU-centric scalability DATA INVULNERABILITY ARCHITECTURE Inline write/read
More informationHow to Get Started With Data
E-Guide How to Get Started With Data Deduplication Data deduplication has certainly generated quite a buzz among storage professionals in the UK, and while there s a lot of curiosity and interest, many
More informationVMware vsphere Data Protection
FREQUENTLY ASKED QUESTIONS VMware vsphere Data Protection vsphere Data Protection Advanced Overview Q. What is VMware vsphere Data Protection Advanced? A. VMware vsphere Data Protection Advanced is a backup
More informationWhite. Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014
White Paper EMC Isilon: A Scalable Storage Platform for Big Data By Nik Rouda, Senior Analyst and Terri McClure, Senior Analyst April 2014 This ESG White Paper was commissioned by EMC Isilon and is distributed
More informationLab Benchmark Testing Report. Joint Solution: Syncsort Backup Express (BEX) and NetApp Deduplication. Comparative Data Reduction Tests
Lab Report Lab Benchmark Testing Report Joint Solution: Syncsort Backup Express (BEX) and NetApp Deduplication Comparative Reduction Tests www.scasicomp.com www.syncsort.com www.netapp.com 2009 Syncsort
More informationLab Validation Report
Lab Validation Report HP Data Protector and Deduplication Solutions Scalability and Performance from the Core to the Edge By Tony Palmer June 2010 Lab Validation: HP Data Protector and Deduplication Solutions
More informationWHITE PAPER. DATA DEDUPLICATION BACKGROUND: A Technical White Paper
WHITE PAPER DATA DEDUPLICATION BACKGROUND: A Technical White Paper CONTENTS Data Deduplication Multiple Data Sets from a Common Storage Pool.......................3 Fixed-Length Blocks vs. Variable-Length
More informationData Deduplication Background: A Technical White Paper
Data Deduplication Background: A Technical White Paper NOTICE This White Paper may contain proprietary information protected by copyright. Information in this White Paper is subject to change without notice
More informationData Protection Report 2008 Best Practices in Data Backup & Recovery
Data Protection Report 2008 Best Practices in Data Backup & Recovery Prepared for: Executive Summary Data is growing at an incredible rate. As a result, the demands of data protection increase as well.
More informationOptimizing Backup and Data Protection in Virtualized Environments. January 2009
Optimizing Backup and Data Protection in Virtualized Environments January 2009 Introduction The promise of maximizing IT investments while minimizing complexity has resulted in widespread adoption of server
More informationWhite Paper. Recovery-focused Data Protection: Research Shows Your Future Depends On It
White Paper : Research Shows Your Future Depends On It By Heidi Biggar Storage Analyst, Data Protection Enterprise Strategy Group January, 2007 Copyright 2007. The Enterprise Strategy Group, Inc. All Rights
More informationHP StoreOnce: reinventing data deduplication
HP : reinventing data deduplication Reduce the impact of explosive data growth with HP StorageWorks D2D Backup Systems Technical white paper Table of contents Executive summary... 2 Introduction to data
More informationTandberg Data AccuVault RDX
Tandberg Data AccuVault RDX Binary Testing conducts an independent evaluation and performance test of Tandberg Data s latest small business backup appliance. Data backup is essential to their survival
More informationThings You Need to Know About Cloud Backup
Things You Need to Know About Cloud Backup Over the last decade, cloud backup, recovery and restore (BURR) options have emerged as a secure, cost-effective and reliable method of safeguarding the increasing
More informationData Deduplication and Tivoli Storage Manager
Data Deduplication and Tivoli Storage Manager Dave Cannon Tivoli Storage Manager rchitect Oxford University TSM Symposium September 2007 Disclaimer This presentation describes potential future enhancements
More informationEMC DATA DOMAIN OVERVIEW. Copyright 2011 EMC Corporation. All rights reserved.
EMC DATA DOMAIN OVERVIEW 1 2 With Data Domain Deduplication Storage Systems, You Can WAN Retain longer Keep backups onsite longer with less disk for fast, reliable restores, and eliminate the use of tape
More information