DEDUPLICATION BASICS




DEDUPE BASICS
WHAT IS DEDUPLICATION?
METHODS OF DEDUPLICATION
A DEDUPE EXAMPLE
HOW DO DISASTER RECOVERY & ARCHIVING FIT IN?
DEDUPLICATION FOR EVERY BUDGET: QUANTUM DXi4000 and vmPRO 4000
DEDUPLICATION FOR EVERY BUDGET: QUANTUM NDX-8 and NDX-12
DEDUPLICATION FOR EVERY BUDGET: QUANTUM RDX

DEDUPE BASICS

Data deduplication is a breakthrough technology that has changed the way data is protected in businesses of all sizes. It can reduce the amount of backup data by up to 90%, which allows smaller-capacity backup devices and makes disk-based backup more affordable. Additionally, deduplication dramatically reduces network traffic, enabling simple, cost-effective replication (copying of data) to remote sites for disaster recovery. The end result is a comprehensive backup and disaster recovery solution that saves money and takes less time to manage, while also providing faster and easier data restoration.

IN THE NEXT FEW PAGES, you'll see how deduplication works, how it fits into your existing data protection infrastructure, how it works with replication and disaster recovery, and a sampling of market-leading products in various price ranges that take advantage of this technology.

[Figure: primary backup and replication, each holding deduplicated data]

WHAT IS DEDUPLICATION?

The least efficient method of backup is repeatedly copying the data from your primary disk drives to backup disk drives. To save disk space, and therefore cost, you might choose to overwrite the previous backups, but if you need an earlier version of a file (such as before you inadvertently deleted the most important part), it will be gone.

Standard backup software helps somewhat by saving only new files or files that have changed since previous backups (called an incremental backup or single-instance storage), but that is also inefficient: if you change a single word in a document, the entire document is still backed up again.

DATA DEDUPLICATION takes a different approach. The software looks inside the files and stores only the unique file segments, called blocks, that aren't already backed up. This greatly reduces storage requirements compared with conventional backup processes.

Imagine five identical PowerPoint files stored on different computers. Deduplication will recognize the common content and back up only one copy. Now imagine that each person changed the first page of the presentation to include, for example, their name and title. Deduplication software will look into the components that make up each file and copy only the unique content.

When you want to retrieve the backed-up file as it looked on a certain date, the software intelligently reconstructs it, putting all the changes in the right places, and presents you with the version you requested. To you, it looks like the system backed up the complete file each time, but in reality it backed up the file only once and then added a few pieces here and there. As a result, a great deal of disk space is saved.

[Figure: data stored for 5 copies of the same file, compared with data stored for 5 files with unique title pages]
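The block-level idea described above can be sketched in a few lines of Python. This is a minimal illustration, not how any particular product works: the fixed 4 KiB block size, the `BlockStore` class, and the `make_deck` helper are all assumptions made for the example, and real systems often use variable-size blocks.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed fixed block size, for illustration only


class BlockStore:
    """Toy deduplicating store: each unique block is kept once, keyed by its hash."""

    def __init__(self):
        self.blocks = {}   # SHA-256 digest -> block bytes
        self.recipes = {}  # backup name -> ordered list of digests

    def backup(self, name, data):
        """Split data into blocks and store only blocks not seen before."""
        recipe = []
        for offset in range(0, len(data), BLOCK_SIZE):
            block = data[offset:offset + BLOCK_SIZE]
            digest = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(digest, block)
            recipe.append(digest)
        self.recipes[name] = recipe

    def restore(self, name):
        """Rebuild the complete file from its ordered list of block hashes."""
        return b"".join(self.blocks[d] for d in self.recipes[name])

    def stored_bytes(self):
        return sum(len(b) for b in self.blocks.values())


def make_deck(owner):
    """Hypothetical presentation: a personalized first block plus shared content."""
    title = owner.encode().ljust(BLOCK_SIZE, b" ")
    shared = b"".join(bytes([i]) * BLOCK_SIZE for i in range(1, 10))
    return title + shared


store = BlockStore()
decks = {name: make_deck(name) for name in ["ana", "ben", "cho", "dev", "eli"]}
for name, deck in decks.items():
    store.backup(name, deck)

raw_bytes = sum(len(d) for d in decks.values())
print("bytes presented for backup:", raw_bytes)
print("bytes actually stored:     ", store.stored_bytes())
```

Each of the five files restores in full, yet the shared content is stored only once; only the five personalized first blocks add to the total.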

METHODS OF DEDUPLICATION

There are several approaches to deduplication, which can be combined in various ways.

SOFTWARE-BASED
Deduplication software runs either on backup clients or on media servers. Software-based dedupe reduces network traffic; however, this approach can tie up system resources and typically has slower performance than purpose-built dedupe appliances.

APPLIANCE-BASED
Backup data is sent to a purpose-built device, such as Quantum's DXi, and deduplication occurs there, using hardware and built-in software designed to optimize the deduplication process. This generally gives higher performance, and it doesn't increase the load on backup clients or media servers. In addition, appliances are compatible with existing backup applications, so a company can standardize software across all backup systems, including both disk and tape.

FILE-BASED
Smaller-scale deduplication systems tend to be file-based and perform the action on each file separately. This reduces the processing power requirements, but doesn't save as much disk space as more sophisticated systems. In the PowerPoint example from the previous section, file-based deduplication would have stored each personalized presentation in its entirety, because it recognizes duplicate files but does not look inside the files for duplicate content.

GLOBAL DEDUPLICATION
A common pool of file components is shared across machines and networks. The smallest components across this common pool are compared and stored only once, thus providing the greatest savings.
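To make the difference between file-based and block-level deduplication concrete, the following Python sketch compares the bytes each approach would store for the same set of files. It is an illustration under stated assumptions: the 4 KiB block size, the function names, and the synthetic file contents are invented for the example, and the `files` list stands in for a common pool gathered from several machines.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed block size, for illustration only


def file_level_size(files):
    """Bytes stored when only whole-file duplicates are eliminated."""
    unique = {hashlib.sha256(data).digest(): len(data) for data in files}
    return sum(unique.values())


def block_level_size(files):
    """Bytes stored when duplicate blocks inside files are also eliminated."""
    unique = {}
    for data in files:
        for offset in range(0, len(data), BLOCK_SIZE):
            block = data[offset:offset + BLOCK_SIZE]
            unique[hashlib.sha256(block).digest()] = len(block)
    return sum(unique.values())


# Nine blocks of content shared by every file.
body = b"".join(bytes([i]) * BLOCK_SIZE for i in range(1, 10))

# Five byte-for-byte identical files, and five files differing only in block one.
identical = [bytes(BLOCK_SIZE) + body] * 5
personalized = [bytes([65 + n]) * BLOCK_SIZE + body for n in range(5)]

for label, files in [("identical", identical), ("personalized", personalized)]:
    print(label, file_level_size(files), block_level_size(files))
```

For identical files the two approaches store the same amount. Once each file has a unique first block, the file-level approach stores all five files in full, while the block-level approach stores the shared content once plus the five unique blocks.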

A DEDUPE EXAMPLE

As with any other technology, it's important to understand the business case for adopting deduplication. Backing up to disk is appealing, but with today's exponential data growth, it can quickly become very expensive. That's where data deduplication pays its way. If you can avoid storing redundant data while still ensuring data integrity, your system has a huge advantage. You save not only on hardware acquisition costs, but also on the ongoing costs of managing and maintaining these resources, perhaps at multiple locations.

The advantage is even greater in virtual environments. Deduplication rates may climb as high as 100:1 because duplicate files, cloned machines and test machines significantly increase the amount of redundant data across the infrastructure.

AS AN EXAMPLE OF THE BENEFITS PROVIDED BY DEDUPLICATION, assume a company's data set begins at 3 terabytes and grows 40% a year. Management wants backups done daily, retaining the most recent data for 30 days and monthly backups for three years. By the end of this period, deduplication will have reduced the required disk space more than eightfold...

Retaining Monthly Backups for 3 Years

                                                               Year 1   Year 2   Year 3
Data to protect, TB (40% annual growth)                           3.0      4.2      5.9
Disk capacity used, TB (normal 2:1 backup compression)            5.6     17.1     42.5
Deduplicated backups, TB (2:1 first backup, 20:1 subsequent)      2.1      3.2      4.8

...a powerful way to control data growth and save money.
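The arithmetic behind the example can be checked with a short Python snippet. It uses only the figures quoted above; it does not try to reproduce the retention model that produced the capacity numbers, which the booklet does not spell out.

```python
# Figures quoted in the example above (all in TB).
data_to_protect = [3.0, 4.2, 5.9]
disk_used = [5.6, 17.1, 42.5]    # conventional backups, 2:1 compression
deduplicated = [2.1, 3.2, 4.8]   # 2:1 first backup, 20:1 subsequent

# 40% annual growth from a 3 TB starting point reproduces the first row.
projected = [round(3.0 * 1.4 ** year, 1) for year in range(3)]

# Reduction factor: conventional capacity divided by deduplicated capacity.
reduction = [round(c / d, 1) for c, d in zip(disk_used, deduplicated)]

rows = zip(data_to_protect, projected, reduction)
for year, (quoted, p, r) in enumerate(rows, start=1):
    print(f"Year {year}: quoted {quoted} TB, projected {p} TB, "
          f"{r}x less disk with deduplication")
```

The year 3 ratio comes out near 8.9, consistent with the "more than eightfold" claim.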

HOW DO DISASTER RECOVERY & ARCHIVING FIT IN?

As we have seen, deduplication of your important data makes backup storage much more cost-effective. But no backup system is complete without off-site storage, which can combine disaster recovery with long-term archiving. This is typically accomplished by replication, which sends a copy of the backup data over a wide-area network (WAN) to another site, or by transporting removable media, such as tape or removable disks. These approaches can also be combined, depending upon your company's particular needs.

Replication is the most convenient and efficient solution for disaster recovery, since it is usually automated. Before deduplication, bandwidth consumption made replication impractical, but now that only a small fraction of your raw data needs to be sent, replication is cost-effective and enjoys widespread adoption. This means you can have an off-site image of all your backup data every day.

Removable disk or tape is the more traditional method of disaster recovery protection and continues to be the primary method for long-term data retention, or archiving. Here, the physical media is taken to an off-site location on a regular basis, such as weekly. After a predetermined period of time, when sufficient new copies have been made, the media is brought back to the original site and reused (called a rotation cycle). Media is not rotated when the data is intended to be archived for long-term storage, such as for compliance with government regulations.

Tape can also play a role in the replication approach, where data that has been replicated to a disk array or data protection appliance is ultimately offloaded to a tape library for low-cost, long-term storage. This process, too, can be automated.
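Why deduplication makes replication affordable can be sketched as follows. This is a simplified Python model, not a description of any product's replication protocol: the `replicate` function and the in-memory dictionaries standing in for the primary and remote sites are assumptions made for illustration, and a real system would also exchange block hashes over the WAN.

```python
import hashlib


def digest(block):
    return hashlib.sha256(block).hexdigest()


def replicate(source_blocks, target_blocks):
    """Copy blocks the target lacks; return the number of bytes sent.

    Both arguments are dicts mapping a block hash to block bytes
    (a hypothetical in-memory stand-in for two deduplicating sites).
    """
    sent = 0
    for key, block in source_blocks.items():
        if key not in target_blocks:
            target_blocks[key] = block
            sent += len(block)
    return sent


# Day 1: the primary site holds 100 unique blocks of 4 KiB each.
day1 = [i.to_bytes(2, "big") * 2048 for i in range(100)]
primary = {digest(b): b for b in day1}
remote = {}
first_sync = replicate(primary, remote)

# Day 2: only 3 new unique blocks appear at the primary site.
day2 = [i.to_bytes(2, "big") * 2048 for i in range(100, 103)]
primary.update({digest(b): b for b in day2})
second_sync = replicate(primary, remote)

print("bytes sent on day 1:", first_sync)
print("bytes sent on day 2:", second_sync)
```

After the initial copy, each daily replication sends only the blocks the remote site has not seen before, which is a small fraction of the full data set.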

DEDUPLICATION FOR EVERY BUDGET
Quantum DXi4000 and vmPRO 4000

The most powerful products for small and mid-sized businesses are the DXi4000 and the vmPRO 4000 Series backup appliances. They range from 2TB to 12TB of native capacity, with the 4601 models offering Capacity-on-Demand (COD) scalability from 4TB to 12TB by simply activating a software license. Using similar architectures, these appliances combine high performance and affordability. They are easy to install and integrate with all leading backup software. They can also be linked to larger DXi units at the data center, to centralize data protection from branch offices and provide easy archiving to tape libraries.

The vmPRO 4000 offers a complete data protection solution for virtual machines, including both backup software and storage to retain months of data. The software runs on its own virtual appliance, eliminating the need for agents or separate physical servers, and includes a native, file-system view of all backup data.

DXi4000
- DXi4510: storage capacity of 2TB
- DXi4601: scalable capacities from 4-12TB
- The DXi4601 offers Capacity-on-Demand (COD) scalability from 4 to 12TB by simply activating a license key: scalability with no additional hardware to purchase or install
- Provides up to 1.7TB/hour performance, twice that of leading competitors
- Reduces typical disk capacity needs by 90% or more through deduplication
- Integrates smoothly with all leading backup applications, including NetBackup and Backup Exec

vmPRO 4000
- vmPRO 4510: storage capacity of 2TB
- vmPRO 4601: scalable capacities from 4-12TB
- Backup software and DXi4000 Series storage in one integrated solution optimized for SMBs and remote offices, at an affordable price
- The vmPRO 4601 scales from 4 to 12TB by simply activating a license key, with no additional hardware to purchase or install
- The vmPRO software is a complete data protection solution that simplifies the management of VM data protection while increasing performance of backups and restores
- Virtual appliance-based backup approach eliminates agents and extra servers
- The vmPRO software reduces network traffic and CPU workload by up to 75%, increasing system performance

DEDUPLICATION FOR EVERY BUDGET
Quantum NDX-8 and NDX-12 NAS Storage

The Quantum NDX Series of network-attached storage (NAS) appliances for backup, disaster recovery and primary storage offers a complete storage and data protection solution for small and medium businesses.

The NDX-8d and NDX-12d Data Protection Appliances use client-side deduplication to hold 5x more data than non-deduplicated NAS, while reducing network traffic by 90%. Operations are centrally managed through the NAS, and there are no software agents to install on clients being backed up (agentless). The NDX-8d and NDX-12d back up Windows systems including desktops, servers, virtual machines, Exchange, SQL Server and SharePoint.

The Quantum NDX-8 and NDX-12 NAS are designed for primary storage and boast more powerful processors and RAM than competitors, while featuring enterprise-class hard drives for ultimate speed and RAID reliability. NDX offers true business-class storage that fits the budget of small and medium businesses.

NDX-8d and NDX-12d Data Protection Appliance with Deduplication
- Protects two years of data or more, while maintaining greater history than tape or other disk
- Controls data growth, reducing the amount of data stored and network traffic by up to 90% through deduplication
- Centrally managed from the NAS unit through a simple, easy-to-use interface
- Comes standard with easy-to-use DATASTOR Shield deduplication software
- Agentless backup: no software to install on the clients
- Designed for Windows environments

NDX-8 and NDX-12 NAS Storage
- Business-class storage for networked environments
- Use as a file server or as iSCSI direct-attached storage
- Faster, more reliable with Intel Dual Core i3 3.3GHz processor, enterprise-class HDDs and 4GB of RAM
- Compatible, consistent Windows storage server operating system
- Available in a tower or 1U rackmount configuration

DEDUPLICATION FOR EVERY BUDGET
Quantum RDX

The Quantum RDX and RDX 8000 disk-based removable storage systems are entry-level data protection solutions for small and mid-sized businesses. They combine the best features of disk (fast access and fast restores) with the best features of tape (removability for disaster recovery) and an easy-to-use backup process.

The single-cartridge standalone RDX includes full-featured backup software that integrates data deduplication for a single server, while the RDX 8000 is an eight-slot library that optionally includes an enterprise-level version of the same software, with agentless, client-side deduplication to minimize network traffic. Both are plug-and-play, so you can be up and running in minutes. Using removable, industry-standard RDX cartridges with interchangeable capacities (currently up to 1TB native capacity per cartridge before deduplication), the RDX systems make it easy to take data off-site and to scale up as your data grows.

RDX 8000
- 8-slot disk-based library that stores up to 8TB, like a tape autoloader but using RDX removable disk cartridges instead
- Take cartridges off-site for disaster recovery
- Available with easy-to-use DATASTOR Shield deduplication software, reducing cartridge use by 2/3 (also available without this software)
- Simple network-attached iSCSI connectivity
- Ultimate scalability: uses any RDX cartridge from 160GB to 1TB and is compatible with future capacities, increasing capacity instantly without having to buy more hardware

LEARN MORE

For more information about data deduplication technology and data protection in general, consult a knowledgeable reseller in your area. He or she can analyze your backup needs and configure the right system for your company. For more details about the Quantum deduplication products mentioned in this booklet, visit www.quantum.com and click on the Products tab.

ABOUT QUANTUM

Quantum is a proven global expert in Data Protection and Big Data management, providing specialized storage solutions for physical, virtual and cloud environments. From small businesses to major enterprises, more than 50,000 customers trust Quantum to help maximize the value of their data by protecting and preserving it over its entire lifecycle. With Quantum, customers can Be Certain they're able to adapt in a changing world: keeping more data longer, bridging from today to tomorrow, and reducing costs. See how at www.quantum.com/becertain.

In North America, you can also talk with a product representative in the Quantum Call Center, toll-free at 888-738-1533.

© 2012 Quantum Corporation. All rights reserved. Quantum, the Quantum logo, DXi, Scalar, StorNext and Vision are registered trademarks of Quantum Corporation and its affiliates in the United States and/or other countries. All other trademarks are the property of their respective owners.

BE CERTAIN ST00899A-v01 August 2012