EMC BACKUP AND RECOVERY SOLUTIONS
Backup to the future
BRS PARTNER UPDATE
Sofia, March 14th, 2011
horia.constantinescu@emc.com
dumitru.taraianu@emc.com
Agenda
- EMC backup and recovery solutions
  - Backup Recovery Systems (BRS) division profile
  - The transition from tape to disk
  - Backup and recovery in the enterprise
- Enterprise backup and recovery
  - EMC NetWorker
- Deduplication: Enabling next-generation backup
  - EMC Avamar
  - EMC Data Domain
EMC Backup Recovery Systems Division
- Division HQ: Santa Clara, CA
- 10 R&D locations
- 2,000 employees
- Data protection storage systems
- More than 60,000 systems installed
- More than 45,000 customers
- More than 15,000 PB under protection worldwide
- Global sales, support, and services
- Approximately 6,000 channel partners
EMC Backup and Recovery Market Position
- Avamar: #1 deduplication backup software worldwide; 8,000 installations; 4,400 customers
- Data Domain: #1 deduplication storage worldwide; 12,000 installations; 5,100 customers
- Disk Library: #1 virtual tape library (VTL) worldwide; >$1B in sales
- NetWorker: top three enterprise backup software; 30,000 customers
Backup and Recovery Architectures: In Transition from Tape to Disk
[Diagram: backup/recovery architecture]
- Stages, left to right: Application (DB, Home); Backup Clients; Backup/Media Manager; Onsite Backup Storage; Disaster Recovery Storage
- Conventional (tape-centric): NetWorker software backing up to tape, disk, or a VTL library, with tape or VTL/tape for disaster recovery
- Transformational (disk-centric): NetWorker software with Data Domain deduplication storage; Avamar deduplication backup software and system
- Data protection management: Data Protection Advisor software, on premise and off premise
Backup and Recovery: Hot Spot in the Enterprise
- Secular shift from tape-centric to disk/network-centric approaches
- Enabler: massive data deduplication and compression techniques
- Unabated growth of enterprise data overwhelms legacy infrastructure
- Server virtualization is another catalyst
- Deduplication helps optimize many of these initiatives
[Chart: F1000: What are your top storage pain points?]
Source: TheInfoPro, Wave 14 Storage Study, Q2 2010, published August 19, 2010; n=166 (8/11/10 F1000 sample). Note that due to multiple responses per interview, total exceeds 100%.
EMC NETWORKER
NetWorker
[Diagram: the backup/recovery architecture from the earlier "In Transition from Tape to Disk" slide, repeated to show where NetWorker fits. NetWorker software appears as the backup/media manager in both the conventional (tape-centric) path, backing up to tape, disk, or a VTL library, and the transformational (disk-centric) path, with Data Domain deduplication storage.]
NetWorker Backup and Recovery Software
- Unified backup software
  - Common platform
  - Backup to disk
  - Backup to tape
  - Snapshot management
  - Replication management
- Integrated deduplication support
  - Integrated Avamar client services
  - Data Domain Boost integration
- Simplified, centralized management
- Broad, heterogeneous platform support
- Enterprise-wide deployment experience
  - Mid-market to enterprise
  - Small to very, very large
Centralized Management
[Diagram: NetWorker at the center, surrounded by the areas it manages]
- Application support: Microsoft, Oracle, SAP
- Deduplication: Data Domain, Avamar
- Virtualization
- File systems and server recovery
- Remote and branch offices
- EMC storage platforms: VNX Family, Symmetrix
- Other labels: Tape, Cloud, Disk Library Family, Centera
NetWorker Differentiation
- Centralized control for all backup requirements
- Seamless integration with the industry's two leading deduplication solutions
- Advanced application and virtual environment support
- Reliable recoverability
NetWorker Licensing
- Traditional: licenses needed for each client, application, SAN, tape, features, etc.
- Capacity: 1 license per NetWorker datazone (backup environment), calculated per front-end TB
  - All features and clients/modules are included
  - Exceptions: Documentum module, PowerSnap module, SnapImage module
EMC AVAMAR AND EMC DATA DOMAIN
Enabling next-generation data protection with deduplication
EMC Avamar and EMC Data Domain
Retain, replicate, recover
- Data Domain Deduplication Storage Systems: Deduplicate everything without changing anything. Simplify backup, archiving, and disaster recovery with easy integration across workloads, infrastructures, and backup software.
- Avamar Deduplication Backup Software: Never back up the same data twice. Revolutionize your backup by moving less data to solve your toughest VMware, NAS, remote office, and desktop/laptop backup challenges.
Data Reduction/Deduplication: F1000
The in-use rating for EMC is now over three times that of its nearest competitor.
Source: TheInfoPro, Wave 14 Storage Study, Q2 2010, published August 19, 2010; n=146 (7/6/10 F1000 sample)
Deduplication Impact on Data Size
Deduplication: 10 to 30 times less data stored versus fulls plus incrementals with typical retention policies
[Chart: data stored versus weeks in use (1 to 20), comparing deduplication storage with traditional storage]
Data Deduplication: Technology Overview
Store more backups in a smaller footprint

Segments in each backup (diagram):
  Friday full backup          A B C D A E F G
  Mon incremental             A B H
  Tues incremental            C B I
  Weds incremental            E G J
  Thurs incremental           A C K
  Second Friday full backup   B C D E F L G H
  Unique segments stored      A B C D E F G H I J K L

Backup                   Logical   Estimated data reduction   Physical
Friday full              1 TB      2-4x                       250 GB
Monday incremental       100 GB    7-10x                      10 GB
Tuesday incremental      100 GB    7-10x                      10 GB
Wednesday incremental    100 GB    7-10x                      10 GB
Thursday incremental     100 GB    7-10x                      10 GB
Second Friday full       1 TB      50-60x                     18 GB
TOTAL                    2.4 TB    7.8x                       308 GB
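The totals on this slide follow from simple arithmetic on the per-backup figures. A minimal sketch, assuming decimal units (1 TB = 1,000 GB); the variable names are illustrative and not part of any EMC tool:

```python
# Logical (pre-deduplication) and physical (stored) sizes in GB, read off the slide.
# Assumption: 1 TB = 1,000 GB.
backups = [
    ("Friday full", 1000, 250),
    ("Monday incremental", 100, 10),
    ("Tuesday incremental", 100, 10),
    ("Wednesday incremental", 100, 10),
    ("Thursday incremental", 100, 10),
    ("Second Friday full", 1000, 18),
]

total_logical_gb = sum(logical for _, logical, _ in backups)
total_physical_gb = sum(physical for _, _, physical in backups)
reduction = total_logical_gb / total_physical_gb

print(f"Logical:   {total_logical_gb / 1000:.1f} TB")
print(f"Physical:  {total_physical_gb} GB")
print(f"Reduction: {reduction:.1f}x")
```

The second full backup contributes most of the gain: almost all of its segments are already stored, so 1 TB of logical data adds only 18 GB.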
Retain: Store More for Longer with Less
Over one year of retention in 3U of Data Domain deduplication storage

Backup       Date       Cumulative logical   Estimated data reduction   Physical
First full              1 TB                 4x                         250 GB
Week 1       April 7    2.4 TB               8x                         308 GB
Week 2       April 14   3.8 TB               10x                        366 GB
Week 3       April 21   5.2 TB               12x                        424 GB
Month 1      April 28   6.6 TB               14x                        482 GB
Month 2      May 31     12.2 TB              17x                        714 GB
Month 3      June 30    17.8 TB              19x                        946 GB
Month 4      July 31    23.4 TB              20x                        1,178 GB
TOTAL                   23.4 TB              20x                        1,178 GB
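The retention figures grow by a constant amount per week, which explains why the reduction ratio keeps climbing. A minimal sketch under stated assumptions: 1 TB = 1,000 GB, and the weekly increments (1.4 TB logical, 58 GB physical) are inferred from the differences between rows rather than stated on the slide.

```python
# Starting point and weekly growth, in GB. The weekly values are inferred from
# the table (2.4 TB - 1 TB and 308 GB - 250 GB); they are not quoted by the source.
FIRST_FULL_LOGICAL_GB = 1000
FIRST_FULL_PHYSICAL_GB = 250
WEEKLY_LOGICAL_GB = 1400   # four 100 GB incrementals plus one 1 TB full
WEEKLY_PHYSICAL_GB = 58    # four 10 GB incrementals plus 18 GB for the full

def cumulative(weeks):
    """Return (logical GB, physical GB, reduction ratio) after the given number of weeks."""
    logical = FIRST_FULL_LOGICAL_GB + weeks * WEEKLY_LOGICAL_GB
    physical = FIRST_FULL_PHYSICAL_GB + weeks * WEEKLY_PHYSICAL_GB
    return logical, physical, logical / physical

for weeks in (1, 2, 3, 4, 8, 12, 16):
    logical, physical, ratio = cumulative(weeks)
    print(f"week {weeks:2d}: {logical / 1000:5.1f} TB logical, "
          f"{physical:5d} GB physical, {ratio:.0f}x")
```

Weeks 1, 2, 3, 4, 8, 12, and 16 correspond to the April 7 through July 31 rows. The ratio rises because logical data grows by 1.4 TB per week while physical storage grows by only 58 GB.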
It's Not All Deduplication Out There

Approach                             Technique                 Typical reduction
Regular storage array                None                      1:1
Whitespace reduction                 LZ compression            ~2:1
File level                           Single instance storage   ~3:1
Fixed blocks, snapshots              Fixed block               ~3:1
Backup target, variable segment      Variable segment          ~20:1

Deduplication significantly reduces:
- Replication WAN bandwidth
- Power
- Heat
- Cooling
- Management
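One reason variable segments can deduplicate better than fixed blocks is that a small insertion shifts every fixed block boundary, while content-defined boundaries move with the data. The sketch below is a generic illustration of content-defined chunking, not Data Domain's algorithm; the window size, mask, block size, and use of `zlib.adler32` are all assumptions chosen for brevity.

```python
import hashlib
import random
import zlib

WINDOW = 16   # bytes examined when deciding where to cut (illustrative)
MASK = 0xFF   # cut when the low 8 bits of the window checksum are zero
BLOCK = 256   # fixed block size, roughly the average variable segment size

def fixed_blocks(data):
    """Split at fixed byte offsets."""
    return [data[i:i + BLOCK] for i in range(0, len(data), BLOCK)]

def variable_segments(data):
    """Cut wherever the checksum of the preceding WINDOW bytes matches a pattern,
    so boundaries depend on content rather than on byte offsets."""
    segments, start = [], 0
    for end in range(WINDOW, len(data) + 1):
        if zlib.adler32(data[end - WINDOW:end]) & MASK == 0:
            segments.append(data[start:end])
            start = end
    if start < len(data):
        segments.append(data[start:])
    return segments

def fingerprints(chunks):
    return {hashlib.sha256(c).hexdigest() for c in chunks}

rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(20_000))
shifted = b"X" + original   # the same data with one byte inserted at the front

fixed_shared = len(fingerprints(fixed_blocks(original))
                   & fingerprints(fixed_blocks(shifted)))
variable_shared = len(fingerprints(variable_segments(original))
                      & fingerprints(variable_segments(shifted)))

print("fixed blocks shared:     ", fixed_shared)
print("variable segments shared:", variable_shared)
```

After the one-byte insertion, the fixed blocks no longer line up with anything already stored, whereas every variable segment after the first boundary is unchanged and would deduplicate against the earlier copy.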
Deduplication Enables Next-Generation Storage Architectures
- Storage 1.0: primary disk, tape. When did you implement this? What made you evolve?
- Storage 2.0: primary disk, SATA, tape. Why did you add SATA? What did you learn?
- Storage 3.0: primary, deduplicated SATA, tape. Backup/recover plus archive from disk (shrink primary); tape: monthly.
- Storage 4.0: flash, deduplicated SATA. Flash for primary; everything else to deduplicate.
[Diagram: before and after views of the SATA tier with deduplication in Storage 3.0 and 4.0]
EMC AVAMAR
Backup and Recovery Architectures: In Transition from Tape to Disk
[Diagram: the backup/recovery architecture from the earlier slide of the same title, repeated to show where Avamar fits. Avamar deduplication backup software and system sits in the transformational (disk-centric) path, alongside NetWorker with Data Domain deduplication storage; Data Protection Advisor software provides data protection management on premise and off premise.]
Avamar
Deduplication backup software and system
- End-to-end, software/hardware solution
  - Integrated system for simple, predictable results
- Client-side, global deduplication; within and across clients
  - Improves backup window, less network load
  - Backup process minimizes data sent and stored
  - Reduces network and virtual infrastructure stress
- Full backups, every time: one-step recovery
  - Higher backup success rate and reliability
  - Increased ROI, lower TCO, less risk
- Integrated high availability and reliability
  - RAIN (redundant array of independent nodes) architecture for high availability and fault tolerance
  - Recoverability verified daily
  - Disaster recovery through replication
- Flexible deployment options
  - Avamar Data Store
  - Avamar Virtual Edition
  - Agent-only for remote office/branch office (ROBO)
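Client-side, global deduplication means each client fingerprints its own data and sends only the pieces that no client has stored before. A minimal sketch of that idea, assuming tiny fixed-size chunks and SHA-256 fingerprints for readability; `BackupServer`, `client_backup`, and `restore` are hypothetical names, and nothing here reflects Avamar's actual chunking, protocol, or API.

```python
import hashlib

CHUNK_SIZE = 4  # tiny fixed-size chunks keep the example readable (assumption)

class BackupServer:
    """Hypothetical server holding one chunk store shared by all clients."""

    def __init__(self):
        self.chunks = {}          # fingerprint -> chunk bytes
        self.bytes_received = 0   # payload bytes actually sent over the network

    def missing(self, fingerprints):
        return [fp for fp in fingerprints if fp not in self.chunks]

    def store(self, fingerprint, chunk):
        self.chunks[fingerprint] = chunk
        self.bytes_received += len(chunk)

def client_backup(server, data):
    """Fingerprint chunks on the client and send only those the server lacks.
    Returns the recipe (ordered fingerprints) needed to restore the data."""
    chunks = [data[i:i + CHUNK_SIZE] for i in range(0, len(data), CHUNK_SIZE)]
    recipe = [hashlib.sha256(c).hexdigest() for c in chunks]
    by_fingerprint = dict(zip(recipe, chunks))
    for fp in server.missing(by_fingerprint):
        server.store(fp, by_fingerprint[fp])
    return recipe

def restore(server, recipe):
    """Every backup is a complete recipe, so restore is a single step."""
    return b"".join(server.chunks[fp] for fp in recipe)

server = BackupServer()
data_a = b"AAAABBBBCCCCAAAA"   # client A: one chunk repeats within the client
data_b = b"BBBBCCCCDDDD"       # client B: two chunks already stored by client A

recipe_a = client_backup(server, data_a)
sent_by_a = server.bytes_received
recipe_b = client_backup(server, data_b)
sent_by_b = server.bytes_received - sent_by_a

print(f"client A sent {sent_by_a} of {len(data_a)} bytes")
print(f"client B sent {sent_by_b} of {len(data_b)} bytes")
```

Client A skips its repeated chunk (deduplication within a client) and client B skips the chunks client A already stored (deduplication across clients), which is what reduces both network load and stored data.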
Avamar Family
[Diagram: Avamar product family]
- Example use cases: VMware, Remote/Branch Offices, Desktop/Laptop, NAS/NDMP
- Unified management: EMC Data Protection Advisor
- Clients: Lotus Notes, IBM DB2
- Core platforms: EMC NetWorker, Avamar VM, EMC Avamar, EMC Avamar Data Store, EMC Avamar Virtual Edition for VMware
Avamar Differentiation
- Shorter backup windows
  - Less data moved reduces daily full backup times
  - Reduces required daily network bandwidth and client stress
  - Scalable VMware backup for greater server consolidation
- Simple management
  - System deployment is easy, pre-configured, with predictable performance
  - Streamlined, centralized administration and management of remote backups
- Single-step restore
  - Single-step restore for full backups; no need for full and incrementals
- Recoverability guaranteed
  - Daily integrity checks, RAIN, and replication ensure recoverability and high availability
EMC DATA DOMAIN
EMC Data Domain
[Diagram: the backup/recovery architecture from the earlier "In Transition from Tape to Disk" slide, repeated to show where Data Domain fits. Backup software shown: NetWorker, Symantec, TSM, and other 3rd party products. Conventional (tape-centric) targets are tape, disk, or a VTL library, with tape or VTL/tape for disaster recovery; in the transformational (disk-centric) path, Data Domain is the storage target. Avamar deduplication backup software and system and Data Protection Advisor software appear as on the earlier slide.]
Data Domain Basics
Easy integration with existing environment
- Control tier: backup and archive applications (EMC, Symantec, CommVault, Tivoli Software, BakBone Software, Vizioncore)
- Target tier: DD890 appliance, reached over Ethernet (CIFS, NFS, NDMP, DD Boost) or as a Virtual Tape Library (VTL) over Fibre Channel
- Disaster recovery tier: replication to a second DD890 appliance
DD890 appliance
- 2U
- 2 to 10 ports: 10 and 1 Gigabit Ethernet; 8 Gb/s Fibre Channel
- RAID 6
- Up to 285 TB usable capacity with shelves
- 2 TB or 1 TB 7.2K rpm SATA hard disk drives in shelf
- File system NVRAM
- N+1 fans and redundant, hot-plug power supplies
Industry's Most Scalable Inline Deduplication Systems
Product lines: DD140 Remote Office Appliance, DD600 Appliance Series, DD800 Appliance Series, Global Deduplication Array, DD Archiver
Software options: DD Boost, DD Virtual Tape Library, DD Replicator, DD Retention Lock, and DD Encryption

Model                        Speed (DD Boost)   Speed (other)   Logical capacity   Raw capacity    Usable capacity
DD140                        490 GB/hr          450 GB/hr       9-43 TB            1.5 TB          0.86 TB
DD610                        1.3 TB/hr          675 GB/hr       40-195 TB          Up to 6 TB      Up to 3.98 TB
DD630                        2.1 TB/hr          1.1 TB/hr       84-420 TB          Up to 12 TB     Up to 8.4 TB
DD670                        5.4 TB/hr          3.6 TB/hr       0.6-2.7 PB         Up to 76 TB     Up to 55.9 TB
DD860                        9.8 TB/hr          5.1 TB/hr       1.4-7.1 PB         Up to 192 TB    Up to 142 TB
DD890                        14.7 TB/hr         8.1 TB/hr       2.9-14.2 PB        Up to 384 TB    Up to 285 TB
Global Deduplication Array   26.3 TB/hr         10.7 TB/hr      5.7-28.5 PB        Up to 768 TB    Up to 570 TB
DD Archiver                  9.8 TB/hr          4.3 TB/hr       5.7-28.5 PB        Up to 768 TB    Up to 570 TB
Methodology: Inline versus Post-Process Deduplication
INLINE: deduplication before storing
- Other activities unimpeded
- Predictable
- Simpler
POST-PROCESS: deduplication after storing
- 3x disk accesses to shared store
- The more processes, the more resource contention
  - Copy to tape: too slow to stream tape
  - Recovery: service level agreement predictability
  - Replication: poor time-to-disaster-recovery
  - Deduplication: if interleaved with backup or restore
- More administration to fight these issues
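The "3x disk accesses" point can be illustrated with a simple I/O count. This is a rough model under stated assumptions, not a measurement: it reads "3x" as three passes over the shared store (write the raw backup, read it back, write the deduplicated result), ignores metadata and index traffic, and uses hypothetical function names and an example 10x reduction.

```python
def inline_disk_io(ingest_gb, reduction):
    """Deduplicate before storing: one pass, only unique data is written."""
    return {"passes": 1, "written_gb": ingest_gb / reduction, "read_gb": 0.0}

def post_process_disk_io(ingest_gb, reduction):
    """Store first, deduplicate later: write raw, read it back, write unique data."""
    return {
        "passes": 3,
        "written_gb": ingest_gb + ingest_gb / reduction,
        "read_gb": float(ingest_gb),
    }

ingest_gb = 1000   # example nightly backup size (assumption)
reduction = 10     # example deduplication ratio (assumption)

inline = inline_disk_io(ingest_gb, reduction)
post = post_process_disk_io(ingest_gb, reduction)

for name, io in (("inline", inline), ("post-process", post)):
    total = io["written_gb"] + io["read_gb"]
    print(f"{name:12s}: {io['passes']} pass(es), {total:.0f} GB of disk traffic")
```

In this model the post-process path also needs a landing area large enough for the raw backup, and its extra reads and writes compete with copy-to-tape, recovery, and replication for the same disks.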
Performance: CPU-Centric versus Spindle-Bound
Data Domain improvement since 2004:
- Throughput: 175x
- Capacity: 450x
[Chart: throughput (MB/s, up to 1,500) versus number of disk spindles (50 to 200). Series: Data Domain; most deduplication vendors (Fibre Channel and SATA).]
Data Integrity: Data Invulnerability Architecture
- End-to-end data verification
  - Generate checksum
  - Deduplication, write to disk
  - Verify: file system metadata integrity, user data integrity, stripe integrity
- Self-healing file system
  - Cleaning, expired data, defrag, verify
- Other
  - RAID 6
  - NVRAM
  - Snapshots
[Diagram: data passes through file system, deduplication, local compression, and RAID layers; a checksum is generated on the way in and the data is verified after it is written.]
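The verification flow (generate a checksum, write, then read back and verify) can be sketched with ordinary file I/O. This is only an illustration of the general technique under stated assumptions: SHA-256 stands in for whatever checksum the product uses, `write_verified` and `verify` are hypothetical helpers, and a real system would verify through the full storage stack rather than a single file.

```python
import hashlib
import os
import tempfile

def write_verified(path, data):
    """Write data, then read the file back and compare checksums."""
    expected = hashlib.sha256(data).hexdigest()   # checksum generated before the write
    with open(path, "wb") as f:
        f.write(data)
        f.flush()
        os.fsync(f.fileno())                      # ask the OS to push the write to disk
    with open(path, "rb") as f:
        actual = hashlib.sha256(f.read()).hexdigest()
    if actual != expected:
        raise IOError(f"verification failed for {path}")
    return expected

def verify(path, expected):
    """Later integrity check, for example during a periodic scan."""
    with open(path, "rb") as f:
        return hashlib.sha256(f.read()).hexdigest() == expected

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "segment.bin")
    checksum = write_verified(path, b"backup segment")
    ok_before = verify(path, checksum)

    with open(path, "r+b") as f:   # simulate silent corruption of the first byte
        f.write(b"X")
    ok_after = verify(path, checksum)

print("intact copy verifies:   ", ok_before)
print("corrupted copy verifies:", ok_after)
```

Keeping the checksum separate from the data is what lets a later scan detect corruption that the write path itself never reported.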
Data Domain Differentiation
- Maturity
  - Simple
  - Consistent
  - Robust (e.g., policy-driven deduplication replication)
- Product concept: purpose-built storage
  - Inline and simple appliance
  - System infrastructure
  - Application independent: backup, archive, and more
- Architecture: fast, small, storage of last resort
  - CPU-centric for price/performance
  - Data protection from the ground up
DD Archiver Overview
Cost-optimized long-term retention
- Data Domain system for backup and archive
  - Active tier: short-term data protection; less than 90 days
  - Archive tier: scalable long-term retention; multiple years
- High-throughput deduplication storage
  - Up to 9.8 TB/hr
- Cost optimized for long-term retention
  - Up to 570 TB usable, 28.5 PB logical capacity
  - Low cost per gigabyte while maintaining high throughput
  - Fault isolation of archive units for long-term recoverability
- Leverage existing Data Domain system advantages
  - Supports DD Replicator and DD Retention Lock software options
  - Data Domain Data Invulnerability Architecture to ensure data integrity
Enterprise Recoverability
Readiness at Disaster Recovery Site
- Data Domain inline deduplicated replication
  - Replicate during backup
  - Disaster recovery (DR)-ready
- Adaptive post-process deduplicated replication
  - Backup to cache: backup time 1.7 times longer than Data Domain
  - Deduplicate and replicate: less than 50% of ingest speed; two times longer if uncompressed at fixed bandwidth
  - DR-ready
- Scheduled post-process deduplicated replication
  - Backup to cache: backup time 1.1 times longer than Data Domain
  - Deduplicate and replicate: less than 50% of ingest speed; two times longer if uncompressed at fixed bandwidth
  - DR-ready
- VTL/tape/truck
  - Backup to VTL
  - Copy to tape
  - Truck to storage
  - Recall tapes
  - Truck from storage?
THANK YOU