Seriously: Tape Only Backup Systems are Dead, Dead, Dead!



Similar documents
How to Get Started With Data

STORAGE. Buying Guide: TARGET DATA DEDUPLICATION BACKUP SYSTEMS. inside

Backup Software Data Deduplication: What you need to know. Presented by W. Curtis Preston Executive Editor & Independent Backup Expert

Data deduplication technology: A guide to data deduping and backup

Using Virtual Tape Libraries with Veritas NetBackup Software

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

Dell Data Protection. Marek Istok Ŋ Dell Slovakia

WHITE PAPER: customize. Best Practice for NDMP Backup Veritas NetBackup. Paul Cummings. January Confidence in a connected world.

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

STORAGE SOURCE DATA DEDUPLICATION PRODUCTS. Buying Guide: inside

Backup and Recovery. W. Curtis Preston O'REILLY' Beijing Cambridge Farnham Köln Paris Sebastopol Taipei Tokyo

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

STRATEGIC PLANNING ASSUMPTION(S)

Data Compression and Deduplication. LOC Cisco Systems, Inc. All rights reserved.

Redefining Backup for VMware Environment. Copyright 2009 EMC Corporation. All rights reserved.

VERITAS NetBackup 6.0 Enterprise Server INNOVATIVE DATA PROTECTION DATASHEET. Product Highlights

Backup and Disaster Recovery Planning On a Budget. Presented by: Najam Saeed Lisa Ulrich

Платформа NetBackup 7.6. What's new in NetBackup 7.6? 1

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

HP StoreOnce D2D. Understanding the challenges associated with NetApp s deduplication. Business white paper

Disk-based Backup and Restore Reaching the Next Level

DPAD Introduction. EMC Data Protection and Availability Division. Copyright 2011 EMC Corporation. All rights reserved.

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

UN 4013 V - Virtual Tape Libraries solutions update...

CA ARCserve Family r15

Understanding the HP Data Deduplication Strategy

Brian LaGoe, Systems Administrator Benjamin Jellema, Systems Administrator Eastern Michigan University

Veritas NetBackup 6.0 Server Now from Symantec

Designing a Backup Architecture That Actually Works

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Data Sheet: Data Protection Veritas NetBackup 6.5 NetBackup Enterprise Server- Next Generation Data Protection

Protect Data... in the Cloud

Effective Planning and Use of TSM V6 Deduplication

Enterprise Class Storage System, Unified NAS. Enterprise Tape Library & Virtual Tape Library. SAN Fabric Management along with SAN Switches

EMC AVAMAR. Deduplication backup software and system. Copyright 2012 EMC Corporation. All rights reserved.

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

Name Description Included in

Data Reduction: Deduplication and Compression. Danny Harnik IBM Haifa Research Labs

Maximizing Deduplication ROI in a NetBackup Environment

EMC Data de-duplication not ONLY for IBM i

HP OpenView Storage Data Protector

DeltaStor Data Deduplication: A Technical Review

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

Symantec NetBackup for NDMP Administrator's Guide

Efficient Backup with Data Deduplication Which Strategy is Right for You?

The Archival Upheaval Petabyte Pandemonium Developing Your Game Plan Fred Moore President

EMC DATA DOMAIN OVERVIEW. Copyright 2011 EMC Corporation. All rights reserved.

Data Deduplication in Tivoli Storage Manager. Andrzej Bugowski Spała

Symantec Backup Appliances

Tiered Data Protection Strategy Data Deduplication. Thomas Störr Sales Director Central Europe November 8, 2007

Open Source Storage Solution - ILM

HP Data Protector Software Advanced Backup to Disk Integration with Virtual Tape Libraries

SEP AG. SEP sesam extend your infrastructure business with a heterogeneous backup solution. Josef Kastl Tasa Italia Srl Josef.kastl@tasaitalia.

Aspirus Enterprise Backup Assessment and Implementation of Avamar and NetWorker

Effective Planning and Use of IBM Tivoli Storage Manager V6 and V7 Deduplication

How To Make A Backup System More Efficient

The Modern Virtualized Data Center

The Remote Data Backup & Restore Service from

09'Linux Plumbers Conference

Eliminating Backup System Bottlenecks: Taking Your Existing Backup System to the Next Level. Jacob Farmer, CTO, Cambridge Computer

The IT budget. 30% Free for investments. 70% Maintain existing IT. Typical share of IT budgets

TSM Family Capacity Pricing Special Bid Offering IBM Corporation

NETAPP SYNCSORT INTEGRATED BACKUP. Technical Overview. Peter Eicher Syncsort Product Management

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Trends in Enterprise Backup Deduplication

Maximize Your Virtual Environment Investment with EMC Avamar. Rob Emsley Senior Director, Product Marketing

Microsoft SQL Server 2005 on Windows Server 2003

Introduction to Data Protection: Backup to Tape, Disk and Beyond

Symantec NetBackup Enterprise Server and Server x Hardware Compatibility List

Storage Backup and Disaster Recovery: Using New Technology to Develop Best Practices

Optimizing Backup & Recovery Performance with Distributed Deduplication

Keys to optimizing your backup environment: Legato NetWorker

Using HP StoreOnce Backup systems for Oracle database backups

Secure Your Business with EVault Cloud-Connected Solutions

Turnkey Deduplication Solution for the Enterprise

Symantec NetBackup 7.1 What s New and Version Comparison Matrix

Symantec NetBackup Getting Started Guide. Release 7.1

Dell NetVault Backup Technical Overview

Deduplication Demystified: How to determine the right approach for your business

HP StoreOnce & Deduplication Solutions Zdenek Duchoň Pre-sales consultant

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

Top Ten Questions. to Ask Your Primary Storage Provider About Their Data Efficiency. May Copyright 2014 Permabit Technology Corporation

Protecting Information in a Smarter Data Center with the Performance of Flash

DEDUPLICATION BASICS

Backup of NAS devices with Avamar

Easy to Use Advanced Data Protection NetVault: Backup The Power of Simplicity.

Real-time Compression: Achieving storage efficiency throughout the data lifecycle

Virtual Tape Systems for IBM Mainframes A comparative analysis

Transcription:

Seriously: Tape Only Backup Systems are Dead, Dead, Dead!

Agenda Overview Tape backup rule #1 So what s the problem? Intelligent disk targets Disk-based backup software

Overview We re still talking disk to disk to tape, or D2D2T Not yet getting rid of tape, only placing disk between it and the backup system However, some environments have completely gotten rid of tape

Tape backup s rule #1 Rule #1: Match the speed of the network to the speed of the tape Every modern tape drive has a minimum speed at which it can reliably write data A streaming tape drive cannot write slower than it s minimum speed! If it looks like it is, trust me, it s not. It s faking it.

Variable speed tape drives Variable speed tape drives can step down to keep up with slower speeds, but they still have a minimum speed LTO-3: 27 MB/s to 80 MB/s TS1120: 40 MB/s to 105 MB/s T10000: 50 MB/s to 120 MB/s Times your compression ratio! Despite vendor claims, these drives have not eliminated shoe-shining

So what s the problem? 300 250 200 150 DLT Network LTO STK IBM to here? 100 50 from here 0 How do you get 1993 1996 2001 2003 2006 TBD

Disk: Infinitely Variable It ll take 100s of slow or fast backups all at once without multiplexing You can then Dump them to tape at its max speed Replicate them to another location

What are the options? Products Disk-as-disk SAN disk arrays NAS filers Disk-as-tape Virtual tape drives Virtual tape libraries Virtual tape cartridges

Filesystem or virtual tape? Disk-as-disk Disk-as-tape Pay for disk, possibly volume mgr & filesystem Pay for disk & value of VTL software Possibly pay per GB to backup s/w vendor Pay per slot, drive, or GB to backup s/w vendor Provisioning/sharing issues No provisioning/sharing issues Speed of filesystem Speed of mult. raw disks Fragmentation Usually no fragmentation Some backup s/w not fully filesystem aware If not already copying tapes, will need to figure out how All backup s/w tape aware Should be faster Can continue not copying tapes if you use an integrated VTL

Standalone or Integrated VTL? Standalone Integrated Sits next to your physical tape library (PTL) Sits in front of your physical tape library (PTL) Pretends to be another tape library Backup to virtual tape Pretends to be your PTL Creates virtual tapes to match physical tapes in PTL Use dupe/clone software to copy virtual tape to physical tape Backup s/w will always know status of all tapes May take longer to make physical tapes Ejecting virtual tape creates physical tape Certain conditions may require manual intervention Don t break relationship btw backup s/w and real tapes HSM style VTLs Tape stacking

Single node or clustered? VTLs tend to be single nodes that range from 100-800 MB/s If you need more throughput, you buy another node Clustered VTLs allow you to expand capacity or throughput by adding additional data movers, but manage as a single VTL

De-duplication Examines and eliminates redundant blocks of data from any source Best thing since sliced disks Reduces effective cost of disk by 10:1 or more All major VTL vendors releasing de-dupe products right now All products are not the same

Fingerprinting Lining up incoming data with existing data to get the best match. Think real fingerprints The more content aware a de-dupe app is, the better this phase will go

Redundancy Identification SHA-1: 160 bit hash MD5: 126 bit hash Custom 16 bit hash Other hashes Bit-level compare Most products can use two methods

Forward vs reverse referencing Reverse If you see a new block that you ve already seen before, don t write the new block Write a pointer to the old block Forward Write all new blocks, then look at old blocks If an old block matches a new block, make the old block a pointer to the new block The jury hasn t even left yet

In-band or out-of band In-band de-dupes in RAM, never writes redundant data to disk Requires quick processing to minimize impact on incoming backup performance Easier on the disk Requires less I/O Possible hash lookup Possible read of matched data Out-of-band writes original data to disk, reads it later and de-dupes it Requires more I/O Initial write Read to calc hash Read to lookup hash Possible read of matched data Write pointer or not Can divide and conquer a single Write new data large backup btw CPUs Cannot divide and conquer an individual backup May limit speed of some backups Supposed to never slow down any backups. Some are concerned that all the I/O may do this anyway

Clustering VTL clustering and de-dupe go hand in hand Two VTLs with different de-dupe databases are not the same as one cluster Difference between hundreds of MB/s and thousands of MB/s

Do it right from the start Backup systems designed to go to disk De-duplication backup software Snapshots & replication Continuous data protection Open source tools BackupPC Rdiff-backup Rsnapshot

Vendors Vendor GA Target or Source Forward or Reverse In/out of Band Primary Check Secondary Check ADIC Now Target Reverse In Custom Custom Data Domain Now Target Reverse In SHA-1 Custom Diligent Now Target Reverse In Custom Bit-level EMC/AVAMAR Now Source Reverse In SHA-1 None Falconstor Q1 07 Target Reverse Out SHA-1 Opt. MD5 NetApp Now Target Reverse Out Custom Bit-level SEPATON Now Target Forward Out Custom Bit-level Symantec Puredisk Now Source Reverse In SHA-1 None

Seriously Buy my book! 750 pages dedicated to free & open source backup AMANDA, BackupPC, Bacula, rdiff-backup, rsnapshot Oracle, DB2, Sybase, SQL Server, Exchange, mysql, PostgreSQL AIX, HP-UX, Solaris, Linux, Windows, MacOS bare metal recovery Available Dec 20!