VMware vCenter Site Recovery Manager 5 Technical. Raj Jethnani, VCP 4/5, VCAP4-DCA/DCD, Systems Engineer, VMware, Inc. rjethnani@vmware.com, @rajtech. 2009 VMware Inc. All rights reserved
Agenda: Simplifying Disaster Recovery. What is VMware vCenter Site Recovery Manager? How can I use SRM in my environment?
Simple Setup and Management of Recovery Plans: From Complex Runbooks to Simple Recovery Plans. Complex runbooks: weeks or months to set up; error-prone; quickly fall out of sync with application and infrastructure changes. Simple recovery plans: set up in minutes; fewer steps mean far less room for errors; simple to keep in sync with changes.
VMware vCenter Site Recovery Manager (SRM): Disaster Recovery Plan Workflow Automation. Turns DR run books into one-button recovery plans. Simplifies and automates DR setup, testing, failover, and auditing. Works with VMware vSphere to make disaster recovery rapid, reliable, manageable, and affordable. Copyright 2009 VMware, Inc. All rights reserved. This product is protected by U.S. and international copyright and intellectual property laws. VMware products are covered by one or more patents listed at http://www.vmware.com/go/patents.
SRM Architecture (Array Replication) [diagram]: Data Center 1 and Data Center 2 each run a vSphere Client with the SRM plug-in, a vCenter Server and an SRM Server (each with its own database), a Storage Replication Adapter (SRA), and ESXi hosts on VMFS storage. The storage arrays replicate between the two sites.
Disaster Recovery Scenarios. Traditional: a dedicated recovery site means wasted hardware, power, etc. Primary and Secondary: production at one site, test/dev at the other site; get ROI from the recovery site. Bidirectional: a mix of production and test/dev at both sites; may further decrease recovery time.
Shared Recovery Site [diagram]: Protected Site 1 (vCenter Server, SRM Server 1, SAN array) and Protected Site 2 (vCenter Server, SRM Server 2, SAN array) are both paired with one shared recovery site, which has a single vCenter Server, a SAN array, and one SRM Server per protected site (SRM Server 1 and SRM Server 2).
vSphere Client Integration
Resource, Folder, Network Mappings
Protection Groups
Running a Recovery Plan: export recovery plan.
Define VM Start Order and Dependencies: five priority groups; set VM dependencies within each group.
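The start-order idea can be sketched in a few lines of Python. This is an illustration only, not SRM's implementation: the VM names and the `start_order` helper are hypothetical, and the sketch assumes that priority group 1 starts first and that dependencies only apply between VMs in the same group.

```python
from graphlib import TopologicalSorter  # Python 3.9+


def start_order(vms):
    """Return one valid power-on order.

    vms maps a VM name to (priority, dependencies), where priority is 1..5
    and dependencies is a set of VM names in the same priority group.
    Assumption: group 1 starts first, group 5 last.
    """
    order = []
    for priority in range(1, 6):
        # Keep only the VMs that belong to this priority group.
        group = {name: set(deps) for name, (p, deps) in vms.items() if p == priority}
        for name, deps in group.items():
            outside = deps - set(group)
            if outside:
                raise ValueError(f"{name} depends on VMs outside its group: {sorted(outside)}")
        # Within a group, every VM starts after the VMs it depends on.
        order.extend(TopologicalSorter(group).static_order())
    return order


# Hypothetical inventory for illustration.
inventory = {
    "db01": (1, set()),
    "app01": (1, {"db01"}),
    "web01": (2, set()),
    "web02": (2, {"web01"}),
}
plan = start_order(inventory)
```

A dependency that points outside the VM's own priority group is rejected here, which mirrors the slide's "dependencies in each group" wording.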
Network Settings Customization. Same IP settings at each site (e.g. Cisco OTV): works great with SRM; lowers complexity and recovery time. Different IP settings at each site: SRM automates the IP change using the GUI or a command-line tool.
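When the sites use different IP settings, the per-VM mapping is often a simple subnet translation. The following is a minimal sketch of that idea, assuming both sites use subnets of the same size and that each VM keeps its host offset. The `translate_ip` helper and the example subnets are hypothetical and are not part of SRM or its command-line tool.

```python
import ipaddress


def translate_ip(address, protected_net, recovery_net):
    """Map an address in the protected-site subnet to the same host
    offset in the recovery-site subnet (hypothetical helper)."""
    src = ipaddress.ip_network(protected_net)
    dst = ipaddress.ip_network(recovery_net)
    ip = ipaddress.ip_address(address)
    if ip not in src:
        raise ValueError(f"{address} is not in {protected_net}")
    if src.prefixlen != dst.prefixlen:
        raise ValueError("subnets must be the same size")
    # Keep the host part, swap the network part.
    offset = int(ip) - int(src.network_address)
    return str(dst.network_address + offset)


new_ip = translate_ip("10.1.20.57", "10.1.20.0/24", "192.168.20.0/24")
```

A table of such old/new pairs could then be fed to whatever customization mechanism is in use.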
Automating DNS Updates. Linux/UNIX: the nsupdate command. Active Directory/DNS: the dnscmd.exe command. Dynamic DNS. DNS update scripts included with SRM: C:\Program Files (x86)\vmware \scripts\callouts. Appliance API, e.g. Infoblox.
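As an illustration of the nsupdate route, a recovery step could generate the batch input that nsupdate reads. This is a sketch under stated assumptions: the `nsupdate_script` helper, the server name, zone, host name, and TTL are all hypothetical, and this is not one of the scripts shipped with SRM.

```python
def nsupdate_script(server, zone, fqdn, new_ip, ttl=300):
    """Build nsupdate batch input that replaces the A record for fqdn.

    Hypothetical helper: it only produces text and does not contact any
    DNS server.
    """
    lines = [
        f"server {server}",
        f"zone {zone}",
        f"update delete {fqdn} A",               # drop the protected-site address
        f"update add {fqdn} {ttl} A {new_ip}",   # publish the recovery-site address
        "send",
    ]
    return "\n".join(lines) + "\n"


script = nsupdate_script("ns1.example.com", "example.com",
                         "crm.example.com", "192.168.20.57")
```

The resulting text would typically be piped to `nsupdate`, with whatever authentication the DNS server requires.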
Isolated Test Network: separate vSwitch at the recovery site.
History Reports: export recovery plan. Descriptive error messages are helpful.
Non-Disruptive DR Testing [diagram]: the production site replicates to the recovery site. During a test, test/dev VMs at the recovery site are suspended and a copy of production runs on an isolated test network at the recovery site. SRM provides non-disruptive testing of disaster recovery plans.
Testing a Recovery Plan [diagram]: storage layer at the protected site and the recovery site, linked by array replication.
Testing a Recovery Plan [diagram]: storage layer at the protected site and the recovery site, linked by array replication, with an isolated test network at the recovery site.
Running a Recovery Plan [diagram]: storage layer at the protected site and the recovery site, linked by replication.
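Fail-back (Array Replication): fail-back consists of two main steps: 1. Reprotect. 2. Recovery. During reprotect the sites swap roles: the original recovery site becomes the protected site, the original protected site becomes the recovery site, and replication runs in the reverse direction. Note: fail-back is only supported with array replication.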
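The two fail-back steps can be modelled as a small state machine. This is a conceptual sketch only: the `SitePair` class and the site names are hypothetical, and the model ignores storage, networking, and everything else a real recovery involves.

```python
from dataclasses import dataclass


@dataclass
class SitePair:
    """Toy model of a protected/recovery site pair (hypothetical)."""
    protected: str
    recovery: str
    running_at: str

    def recover(self):
        # Running the recovery plan starts the workloads at the recovery site.
        self.running_at = self.recovery

    def reprotect(self):
        # Reprotect only makes sense once workloads run at the recovery site.
        if self.running_at != self.recovery:
            raise RuntimeError("reprotect requires a completed recovery")
        # Swap roles, which also reverses the replication direction.
        self.protected, self.recovery = self.recovery, self.protected


def fail_back(pair):
    """Fail-back = 1. reprotect, 2. recovery."""
    pair.reprotect()
    pair.recover()


pair = SitePair(protected="Site A", recovery="Site B", running_at="Site A")
pair.recover()    # disaster or planned failover: workloads now run at Site B
fail_back(pair)   # workloads return to Site A
```

After `fail_back`, one more `reprotect()` in this model restores the original protection direction (Site A protected by Site B).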
Planned Migration [diagram]: original site to new site.
VM Storage Organization (Array Replication) [diagram]: at the protected site, LUNs 1 to 5 hold datastores VMFS A to VMFS E, which are organized into Datastore Groups 1 to 3; datastores are grouped together when VMs have .vmdk files in separate datastores. In this example the datastore groups correspond to Protection Groups 1 to 3. At the recovery site, Recovery Plan 1 (CRM) contains Protection Group 1, and Recovery Plan 2 (Entire Site) contains Protection Groups 1, 2, and 3.
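The grouping rule (datastores that share a VM must move together) can be sketched as a connected-components computation. This is an illustration, not SRM's algorithm: the `datastore_groups` helper, the VM names, and the VM-to-datastore layout are hypothetical and are not claimed to match the diagram exactly.

```python
def datastore_groups(vm_datastores):
    """Group datastores that must fail over together.

    vm_datastores maps a VM name to the set of datastores holding its
    .vmdk files. Datastores used by the same VM land in the same group.
    """
    groups = []  # pairwise-disjoint sets of datastore names
    for datastores in vm_datastores.values():
        merged = set(datastores)
        remaining = []
        for group in groups:
            if group & merged:
                merged |= group          # shares a datastore: merge groups
            else:
                remaining.append(group)
        remaining.append(merged)
        groups = remaining
    return sorted(sorted(g) for g in groups)


# Hypothetical layout for illustration.
layout = {
    "crm-db": {"VMFS A"},
    "web01": {"VMFS B", "VMFS C"},   # .vmdk files in separate datastores
    "app01": {"VMFS C"},
    "mail01": {"VMFS D", "VMFS E"},
}
groups = datastore_groups(layout)

# Recovery plans are then just named collections of protection groups.
recovery_plans = {
    "Recovery Plan 1 (CRM)": ["Protection Group 1"],
    "Recovery Plan 2 (Entire Site)": ["Protection Group 1", "Protection Group 2", "Protection Group 3"],
}
```

Note that the same protection group can appear in more than one recovery plan, as Protection Group 1 does here.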
Configuring Array Managers
SRM Architecture (vSphere Replication) [diagram]: Data Center 1 and Data Center 2 each run a vSphere Client with the SRM plug-in, a vCenter Server and an SRM Server (each with its own database), a vSphere Replication Management Server (VRMS) with its own database, and ESXi hosts on VMFS storage. A vSphere Replication Agent (VRA) on each ESXi host at the protected site replicates to a vSphere Replication Server (VRS) at the recovery site.
Protecting a VM with vSphere Replication: right-click on the VM.
Protecting a VM with vSphere Replication: configure the Recovery Point Objective (RPO). Asynchronous replication only, with an RPO between 15 minutes and 24 hours.
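The RPO range lends itself to a simple check. The sketch below is hypothetical (the function names are not part of any VMware API): it validates a requested RPO against the 15-minute to 24-hour range and flags a replica that is older than its RPO.

```python
from datetime import datetime, timedelta

MIN_RPO = timedelta(minutes=15)
MAX_RPO = timedelta(hours=24)


def validate_rpo(rpo):
    """Reject RPO values outside the 15-minute to 24-hour range."""
    if not MIN_RPO <= rpo <= MAX_RPO:
        raise ValueError(f"RPO must be between {MIN_RPO} and {MAX_RPO}, got {rpo}")
    return rpo


def rpo_violated(last_replica_time, now, rpo):
    """True when the newest replica is older than the RPO allows."""
    return now - last_replica_time > rpo


rpo = validate_rpo(timedelta(hours=1))
now = datetime(2012, 1, 1, 12, 0)
stale = rpo_violated(datetime(2012, 1, 1, 10, 30), now, rpo)   # 90 minutes old
fresh = rpo_violated(datetime(2012, 1, 1, 11, 30), now, rpo)   # 30 minutes old
```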
Protecting a VM with vSphere Replication: guest OS quiescing with MS VSS; select the target location for the VM.
Protecting a VM with vSphere Replication: customize replication settings per virtual disk (.vmdk).
Protecting a VM with vSphere Replication: auto-assign or specify the VR Server. Multiple VR Servers can be deployed to scale the environment.
vSphere Replication Details. Does not use VMware snapshots. Requires VM virtual hardware version 7 or higher. Changes on the source disk(s) are tracked by ESXi. Deltas are sent to the recovery site (after the initial sync). A VM with snapshot(s) is recovered with the snapshot(s) collapsed. An initial copy can be provided to the recovery site.
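The "track changes, then send deltas" behaviour can be illustrated with a toy block device. This is a conceptual sketch, not how ESXi implements change tracking: the `TrackedDisk` class and the block contents are hypothetical.

```python
class TrackedDisk:
    """Toy disk that remembers which blocks changed since the last sync."""

    def __init__(self, blocks):
        self.blocks = list(blocks)
        self.dirty = set()

    def write(self, index, data):
        self.blocks[index] = data
        self.dirty.add(index)            # record the changed block

    def take_delta(self):
        # Only the latest content of each changed block is sent.
        delta = {i: self.blocks[i] for i in sorted(self.dirty)}
        self.dirty.clear()
        return delta


def initial_sync(disk):
    """Full copy; afterwards only deltas are needed."""
    disk.dirty.clear()
    return list(disk.blocks)


def apply_delta(replica, delta):
    for index, data in delta.items():
        replica[index] = data


source = TrackedDisk(["a", "b", "c", "d"])
replica = initial_sync(source)
source.write(1, "B")
source.write(3, "D")
source.write(1, "B2")                    # overwrites the earlier change
delta = source.take_delta()
apply_delta(replica, delta)
```

Because block 1 is written twice before the sync, only its final content travels to the replica.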
vSphere Replication Limitations. VMs must be powered on to replicate. Automated failback is not supported. vSphere Fault Tolerance is not supported. VM templates cannot be replicated. Floppy and .iso images cannot be replicated. Physical RDMs cannot be replicated. VM linked clones are not supported.
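These limitations can be turned into a pre-flight check before configuring replication. The sketch below is hypothetical: the `VmInfo` fields and the `replication_issues` function are illustrative names, not a VMware API, and the hardware-version rule is taken from the details slide.

```python
from dataclasses import dataclass


@dataclass
class VmInfo:
    """Hypothetical summary of the VM properties that matter here."""
    name: str
    hardware_version: int
    powered_on: bool = True
    is_template: bool = False
    fault_tolerance: bool = False
    linked_clone: bool = False
    has_physical_rdm: bool = False


def replication_issues(vm):
    """Return the reasons this VM cannot be fully replicated."""
    checks = [
        (vm.hardware_version < 7, "virtual hardware version 7 or higher is required"),
        (not vm.powered_on, "VM must be powered on to replicate"),
        (vm.is_template, "VM templates cannot be replicated"),
        (vm.fault_tolerance, "vSphere Fault Tolerance is not supported"),
        (vm.linked_clone, "linked clones are not supported"),
        (vm.has_physical_rdm, "physical RDMs cannot be replicated"),
    ]
    return [reason for failed, reason in checks if failed]


ok_vm = VmInfo(name="crm-db", hardware_version=8)
old_vm = VmInfo(name="legacy", hardware_version=4, powered_on=False)
ok_issues = replication_issues(ok_vm)
old_issues = replication_issues(old_vm)
```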
vSphere Replication Status
Alarms in SRM
Roles and Permissions in SRM
SRM Scalability
SRM in the Real World. Japan Disaster: Rolling Blackout Avoidance. http://www.vsamurai.com/english/2011/3/23/real-life-dr-bc-with-vmware-srm.html "I hope that the relative success of the running of the DR plan will also open the eyes of management to more prevalent use of VMware, specifically for the disaster recovery benefits. I can say, without going into detail, other critical systems (outside of the virtual infrastructure) did not fare as well! Thanks VMware! Thanks SRM! Mission accomplished..."
SRM Use Cases: Coordinated Datacenter Migrations [diagram: Site Recovery Manager planned datacenter migration from the old site (vCenter Server, vSphere) to the new site (vCenter Server, vSphere), using storage-based replication or vSphere Replication]. Customer Problems: Datacenter migrations are risky and cause longer-than-intended downtime due to poorly tested plans. New datacenter projects are when customers typically deploy storage from new vendors. Deploying new vendor technology prohibits replicating between storage arrays. SRM Benefits: Allows migration testing before the actual migration is performed, minimizing dependency issues or resource constraints. vSphere Replication can be used to migrate between competing vendor technologies.
VMware Virtual Infrastructure Navigator (GA 1st half of 2012). Complete dependency views of related application components. Map and tabular views. Visualize app dependencies with vCenter-related information (SRM, vApp, etc.).
VMware Virtual Infrastructure Navigator (GA 1st half of 2012). Enable business continuity and full DR recovery. Are all dependencies factored in when creating an app protection group? Is the app service / VM part of a protection group and recovery plan? Create SRM protection groups and recovery plans with accurate app visibility. Build HA clusters and set affinity/anti-affinity rules with workload dependency maps across VI hosts and clusters.
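The question "are all dependencies factored in?" amounts to walking a dependency map and checking membership. The following is a sketch of that check only: the `missing_dependencies` helper, the dependency map format, and the VM names are hypothetical and do not represent the Navigator or SRM APIs.

```python
def missing_dependencies(dependencies, protection_group):
    """Return VMs that group members depend on, directly or transitively,
    but that are not in the protection group."""
    group = set(protection_group)
    missing = set()
    seen = set()
    stack = list(group)
    while stack:
        vm = stack.pop()
        if vm in seen:
            continue
        seen.add(vm)
        for dep in dependencies.get(vm, ()):
            if dep not in group:
                missing.add(dep)
            stack.append(dep)            # follow transitive dependencies
    return sorted(missing)


# Hypothetical dependency map for illustration.
app_map = {
    "web": ["app"],
    "app": ["db", "cache"],
    "cache": ["db"],
    "db": [],
}
gaps = missing_dependencies(app_map, {"web", "app"})
complete = missing_dependencies(app_map, {"web", "app", "db", "cache"})
```

An empty result means every VM the group relies on is itself protected by the group.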
Summary: VMware vCenter Site Recovery Manager. Automate DR run books. Non-disruptive, granular testing of recovery plans. Automated failover when it is needed most. Detailed recovery plan history reports.
Thank You. More information about SRM on vmware.com: http://www.vmware.com/products/site-recovery-manager/overview.html