Hvordan sikrer du ditt virtuelle datasenter? Espen S. Lillejord Sales Manager Cedric Jucker Advisory Technology Consultant EMC Backup-, Recovery and Archiving Solutions (BuRA) 1
Agenda Hva er deduplisering, og hvorfor trenger vi det? Forskjellig måter å deduplisere på Backup og recovery av VMware Utfordringer og løsninger Optimal backup for et virtuelt miljø Spørsmål og svar Trekning av...noe gratis!!! 2
Dagens IT Løsninger: Virtualisering Etter Virtualisering: Før Virtualisering: 3
Hva er De-Duplisering? EMC s Definisjon av De-Duplisering: Prosessen med å oppdage og identifisere unike data segmenter innen et gitt sett av informasjon, som gjør det mulig å eliminere redundant informasjon når dataen lagres eller flyttes. Data Set 1 Data Set 2 De-Duplication Data Set 3 Før: totale data segmenter = 39 Etter: Unike data segmenter = 6 4
Data De-Duplisering: Slik det virker ❶ First Instance ❷ Duplicate Instance ❸ Modified Instance A B A B E B C D C D C D A Kun unike data segmenter blir kopiert til backup server B C D Data som allerede er tatt backup av blir ikke kopiert igjen men blir referert til av en 20 bytes peker E Det nye data segmentet blir identifisert og kopiert, men de allerede eksisterende data segmentene blir ikke kopiert igjen A B C D E Unik data blir lagret på disk, og er umiddelbart klar for gjenoppbygging 5
Target og Source basert De-Duplisering TARGET basert De-Duplisering SOURCE basert De-Duplisering B B B B B B Backup Server De-duplication Device Avamar Server Up to 50x reduction in backup storage Moves 200% of primary data weekly Backups are typically restored from full and incremental images Data viewed as file systems and/or VTL target for traditional backup environments Up to 50x reduction in backup storage Moves ~2% of primary data weekly Up to 500x reduction in network impact Up to 10x faster daily full backups All backups are full; immediate, single-step recovery Next generation backup and recovery 6
VMware and backup/recovery issue VMware backup/recovery concept Backup Each VM as usual (Full, incremental backup ) Client pricing? Performance impact? Quantity of Data to move Incremental for ever obsolete (after backup process) VMware explosion Parallel backup impossible (Resources sharing) Physical machine stored on a logical Disk!!! Disaster Recovery is critical Must be part of any backup activity 7
New Backup Challenges Potential Barrier to Greater Virtualization Consolidation and Value Old Paradigm Low overall utilization and plenty of bandwidth for backup New Paradigm High overall server utilization, but low bandwidth for backup % CPU Busy (8 LCPVs w/ht) 8
VMware and backup/recovery issue Conclusion? A good VMware backup/recovery strategy requires 2 types of backup Daily files level backup Move as few data as possible No impact on VMware resources sharing DR procedure VMDK backup included in daily backup procedure 9
VMware Backup/Recovery existing Solution Backup/Recovery application for VMware File level? Hhhmmmm YES (Most of them) Installation of an agent/client on each virtual machine See previous slide DR level? Hmmmmmm NOP (almost none of them) Need a new technique to backup.vmdk file (Console backup/vcb) Need a specific application/function to perform DR procedure 10
VMware Backup/Recovery existing Solution Disaster Recovery application for VMware File level? Hhhmmmm May be Back to issue with backup in parallel (direct impact on VMware performance) File recovery based on Snapshot DR level? Hmmmmmm NOP CRASH CONSISTENT ONLY DOES NOT WORK FOR APPLICATION (Exchange/SQL/ and the rest) 11
VMware Backup/Recovery existing Solution CONCLUSION Either, either, either. 12
Factors that Can Impact Data De-duplication Ratios Type of data Data from natural sources (audio, video, scanned images) is unique; User-generated data (docs, ppts, PSTs) often have duplicate content (templates, etc.) Encrypted and compressed data are not ideal candidates for de-duplication More user created content = Higher de-duplication ratio Data change rate Small data change rates result in large amounts of duplicate data in subsequent backups Less change = Higher de-duplication ratio Retention policy The longer a backup is held, the greater the chance that the same data will be backed up more than once Longer retention policy = Higher de-duplication ratio Ratio of full backups to incremental backups More full backups increases the amount of same data being repeatedly backed up More full backups = Higher de-duplication ratio Data de-duplication performance is tied to a number of factors even small variations can have a significant impact These factors apply for all backup de-duplication technologies 13
EMC Avamar: Tangible Customer Results De-duplication Benefits Begin at the Source Data Type Amount of Primary Data Backed Up Amount of Data Moved Daily Daily De-dupe Ratio Windows file systems 3,573 GB 6.1 GB 586:1 Mix of Windows, Linux, and UNIX file systems Engineering files on NAS (NDMP backups) Mix of 20% databases, 80% file systems (Windows and UNIX) 5,097 GB 11.7 GB 436:1 3,265 GB 24.2 GB 135:1 9,583 GB 80.0 GB 120:1 Mix of Linux file systems and databases 7,831 GB 104.2 GB 75:1 Source: EMC 14
EMC Avamar: Optimized Backup for VMware Starts De-duplication at the Optimal Location and Level of Granularity Optimal Location: Reduces data at the source Duplicate data never traverses congested shared resources Data backed-up is reduced from ~ 200% to ~ 2% of primary-weekly Significantly reduces contention for shared resources De-duplicates within and across virtual machines and/or.vmdk files Optimal Granularity: Sub-file, variable-length segments.vmdk is one, large file, any changes cause an incremental/full traditional backup Fixed-length segment de-duplication fails due to frame offset Sub-file, variable-length segment de-duplication finds changes anywhere in.vmdk Dramatically cuts the amount of data backed up daily Significantly reduces daily backup time Ideal for the protection of VMware environments Enables fast, secure, backups over existing virtual infrastructure Permits greater server consolidation and maximum value from VMware Guest, Console and VCB VMware backup supported Leverage Avamar Virtual Edition as deployment option 15
How De-duplication Works Global Source-based Data De-Duplication Instances ❶ First Instance VMDK1 ❷ Duplicate Instance VMDK1 ❸ Modified Instance VMDK2 VM VM VM Para VM Management VM VM VM Para VM Management VM VM VM Para VM Management VMware Hypervisor VMvisor Virtual Services Resource Management Storage File System Networking VMware Hypervisor VMvisor Virtual Services Resource Management Storage File System Networking VMware Hypervisor VMvisor Virtual Services Resource Management Storage File System Networking Hardware Hardware Hardware CPU Memory NIC Disk CPU Memory NIC Disk CPU Memory NIC Disk Only unique data segments are backed up A B C D Data already backed up, so only a unique ID pointer is stored (20 bytes) A B C D E E New data segment identified and backed up Avamar Server (stored backup data) 16
EMC Avamar Solutions for VMware Infrastructure Flexible, Fast, Efficient and Reliable Backup and Recovery Avamar Client Backup Solutions Avamar Server Backup Solutions 17
EMC Avamar Backup for VMware Guests Avamar agent resides inside each Virtual Machine De-duplicates data within the VM as if they were physical servers Moves minimal backup data - reduces resource contention and accelerates backups Provides file-level restore for Windows, Linux and Solaris Avamar Agents 18
EMC Avamar Benefits for Guest-level Backups Avamar Efficiently Protects Virtual Machines Up to 95% reduction in data moved Traditional moves ~200% weekly Up to 90% reduction in backup times Up to 50% reduction in disk impact Up to 95% reduction in NIC usage Up to 80% reduction in CPU usage Up to 50% reduction in memory usage All backups stored as virtual full backups, ready for immediate restore Avamar moves ~2% weekly Maintain effective consolidation ratios without over-taxing CPU utilization 19
EMC Avamar and VMware Consolidated Backup Avamar agent resides on the proxy server De-duplicates within and across.vmdk files Supports both VCB file (Windows only) and imagelevel backup Avamar replication provides DR for backed up.vmdk s Avamar Agents VCB Proxy Server with Avamar Agent Avamar Server 20
EMC Avamar Backup at the Console-level Avamar agent resides on the ESX console OS APP OS APP OS APP OS De-duplicates within and across VMDK files Provides.vmdk image- level (no file-level) backup without VCB CPU Memory VMware Hardware NIC Disk No proxy server or SAN required ESX Server Avamar Agents Avamar Server 21
EMC Avamar Virtual Edition for VMware Avamar Server Software Deployed in VMware Machine (VM) One of several convenient deployment options for Avamar server Recommended for small or remote sites already deploying VMware infrastructure Avamar VM Operating System The first de-duplication virtual appliance No other VMware backup/recovery solution encapsulate both backup and disaster recovery Leverages shared servers and storage Can utilize any storage (SAN, NAS, or DAS) Replication (of applications and storage) eliminates shipping tapes Replicate between Avamar virtual machines, or to other Avamar server solutions Application Avamar VM Operating System Operating System ESX Server Hardware Facilitates rapid, cost-effective deployment and return on investment CPU Memory NIC Disk 22
EMC Avamar Solutions for VMware Infrastructure Avamar Agents Large Remote Site Small Remote Site Primary systems Avamar Data Store (ENCRYPTED) (ENCRYPTED) Avamar agent only on primary systems Data Center WAN Remote Recovery Site Primary systems Primary systems Avamar Data Store (ENCRYPTED) (ENCRYPTED) Tape Vault Avamar Data Store 23
De-duplication via NetWorker and Avamar Integration Centralized control of De-dupe through NetWorker integration with Avamar Single NetWorker client with integrated Avamar capabilities Enables file system data de-duplication and backup in a single solution Simple to deploy in existing NetWorker environment Avamar Data Store comes pre-configured with Avamar software Push client support means no manual host deployment required Avamar New Networker Client De-dupe and Non De-dupe NetWorker Storage Node Disk Tape 24
HomeBase provides unique capabilities for VMware Dramatically reducing the cost of deployment, operation and recovery of VMware environment captured, capabilities application and data for backed up Profiles HomeBase provides unique VMware Physical environment migrated onto virtual session Ability to move a VMware session between server stacks Virtual session recovered/migrated onto physical environment Data backed up Backup Server HomeBase Server Profiles (snapshots) of servers taken and stored on HomeBase VMware ESX Enterprise Server 2 MB VMware ESX Enterprise Server 25