Data deduplication is more than just a BUZZ word Per Larsen Principal Systems Engineer
Mr. Hansen DATA BUDGET RECOVERY & DATACENTER GROWTH PRESSURE DISCOVERY REVOLUTION More Storage Longer Backups Smaller Budgets Less Staff Fewer Projects Missed SLAs Wasted IT time Legal Fee & Fines Virtualization Backup Re-design Tape to Disk
Mr. Jensen What data can be deduplicated? What does dedupe rate mean? Where to run deduplication? How should I get started? How does deduplication affect disaster recovery?
What is Deduplication? How does it work? What is the difference between file and block level? File Level Block Level 4 Technology Day 2010 -Data deduplicationis more than just a BUZZ word 4
What Data Can Be Deduplicated? Backup Speed in % Encrypt Lotus Notes Exchange SAP Files vstorage MS SQL Oracle Compress Deduplication rate in % Technology Day 2010 -Data deduplicationis more than just a BUZZ word
What Does dedupe rate Mean? Data deduplication ratio over a particular time period The number of bytes input to a data deduplication process divided by the number of bytes output.
What Does dedupe rate Mean? Space Reduction Ratio Space Reduction Percentage 2:1 1/2 = 50% 5:1 4/5 = 80% 10:1 9/10 = 90% 20:1 19/20 = 95% 100:1 99/100 = 99% 500:1 499/500 = 99.8% Comparing ratios is problematic Broad set of assumptions implicit in their calculation
Introduction to NetBackup Deduplication
Deduplication Implementation in NetBackup Client Dedupe Media Server Dedupe Storage Pool Dedupe Target Appliance Dedupe Technology Day 2010 -Data deduplicationis more than just a BUZZ word
NetBackup to Target Dedupe Devices Deduplicate at the appliance Benefits include: Easy setup, no change to backup environment Centralized policy management and replication control Disadvantages: Workload of a traditional backup until backup data hits appliance no infrastructure savings Cost Target Dedupe Devices Client Media Server OST OST Appliance = Deduplication Engine
NetBackup Deduplication at the Media Server Deduplicate at the media Benefits include: Duplicate data eliminated at Media Server Built-In to NetBackup No client impact Disadvantages Workload of a traditional backup until data reaches Media Server Additional CPU load on Media Server Media Server Client Media Server Commodity NAS/DAS/SAN Storage = Deduplication Engine
NetBackup Deduplication at the Source Deduplicated at the source/client Benefits include: Built-In to NetBackup Reduced WAN/LAN bandwidth impact Comprehensive application and platform support Ideal for most application and file/folder backups Disadvantages May not be ideal for datasets with high change rate Additional load on the Client Client - Source Client Media Server = Deduplication Engine
Backup Deduplication Closer to the Source is Best Example: Moving 1TB of Data w/ 90% DedupePotential NetBackup Target Media Server Deduplication Good Transfer 1TB Transfer 1TB Transfer 0.1TB Appliance Target Deduplication Appliance Storage Target Dedupe OST Better Media Server Dedupe Transfer 1TB Transfer 0.1TB Symantec Off Host Dedupe Storage Up to 196TB 100% Data in Motion 10% DedupeData in Motion Best Client Dedupe Transfer 0.1TB Transfer 0.1TB Local Dedupe Storage Up to 32TB Lower Resource Consumption
NetBackup Appliances
What is NetBackup Appliances? An Appliance from Symantec that Allows for a Simplified and Faster Deployment of NetBackupDeduplicationand New NetBackupMedia Servers Complete Easy Scalable Reliable Symantec provides software, hardware, and support Complete dedupe solution source or target Single vendor for support and service Easy ordering and deployment 32TB usable dedupe capacity per node 192TB usable global dedupe capacity per setup Disk based solution, Raid 6 configuration Redundant fans, power supply System availability > 99.95% Symantec NetBackup 5000 Overview 15
NetBackup 5000 / 5020 Appliance PDDO in a box Standard NBU Clients NBU Media Server(s) Standard NBU Clients
NetBackup 5200 / 5220 Appliance Media Server in a box Standard NBU Clients NBU 5200 Media Server NBU 5200 Media Server Standard NBU Clients
Data Protection Challenges from a Disaster perspective How can NetBackup Deduplication help?
Architecture Model Remote/Branch Office backup Media server at remote site = Faster local recovery Centralize and simplify backup management NBU Clients NBU Media Server Replicate to the DC Main DC site Location with Centralized Administration Site A Tape Site B NBU Media Server NBU Clients Backup over WAN No local Infrastructure NetBackup Master OpsCenter server NBU Media Server Site C Replicate to the DR site DR Site (Replication)
What Sets NetBackup Deduplication Apart Built-In, easy to use No expensive hardware Lower TCO than Appliances Proven Product Line, Proven Technology Investment Protection Works out of the box. No additional setup required Use commodity hardware of choice Up to 55% less than appliance solutions (assumes replication) Built into NetBackup, based on PureDisk Dedupe Technology Dedupe value wherever needed- Source, Media Server, Appliance 20
Backup Re-architecture Strategic Objectives Eliminate tape as a transported media Establish always on WAN connectivity to recovery sites Improve RTO/RPO narrow the gap of tier 1 disk replication Provide always ready disaster recovery infrastructure Provide operational consistency even in case of disasters The Three Key Facilitators Data Deduplication technology coupled with highly efficient Data Replication 3-site DR model Primary, Secondary/Recovery, Data Bunker Affordable WAN expansion / optimization
Solution implementation plan Site B Short retention on disk Direct tape creation Long retention on tape No tape vaulting Media server Backup Copy 3 Backup Copy 4 Media server Backup Copy 1 Opt-dupe Replication Opt-dupe Replication Storage Pool s Opt-dupe Replication Media server Backup Copy 6 Site A Storage Pool Backup Copy 5 Site C - bunker Backup Copy 2
Summary -One size does not fit all Customers often need more than 1 approach to deduplication Client / Source Deduplication Media Server Deduplication (Inline) Target Appliance Deduplication (Post process) Each use case is different Each customers data is different Each customers approach towards Data Protection is different The Symantec deduplication strategy: Provide deduplication where customers need it
Thank you! Per Larsen per_larsen@symantec.com Copyright 2010 Symantec Corporation. All rights reserved.symantec and the Symantec Logo are trademarks or registered trademarks of Symantec Corporation or its affiliates in the U.S. and other countries. Other names may be trademarks of their respective owners. This document is provided for informational purposes only and is not intended as advertising. All warranties relating to the information in this document, either express or implied, are disclaimed to the maximum extent allowed by law. The information in this document is subject to change without notice.
Sponsors 25