DRAFT. Active site backup. System Disaster Recovery Limitations and Best Practices

Similar documents
EMC ViPR Controller. Backup and Restore Guide. Version Darth 302-XXX-XXX 01

EMC ViPR Controller. User Interface Virtual Data Center Configuration Guide. Version REV 01

EMC ViPR Controller. Service Catalog Reference Guide. Version 2.3 XXX-XXX-XXX 01

VMware Site Recovery Manager with EMC RecoverPoint

Continuous Data Protection for any Point-in-Time Recovery: Product Options for Protecting Virtual Machines or Storage Array LUNs

EMC VPLEX FAMILY. Continuous Availability and data Mobility Within and Across Data Centers

EMC VPLEX FAMILY. Continuous Availability and Data Mobility Within and Across Data Centers

EMC ViPR Controller. ViPR Controller REST API Virtual Data Center Configuration Guide. Version

Testing and Restoring the Nasuni Filer in a Disaster Recovery Scenario

Panorama High Availability

Cisco Prime Collaboration Deployment Troubleshooting

DR-to-the- Cloud Best Practices

Implementing and Managing Windows Server 2008 Clustering

Disaster Recovery (DR) Planning with the Cloud Desktop

Testing and Restoring the Nasuni Filer in a Disaster Recovery Scenario

Deploy App Orchestration 2.6 for High Availability and Disaster Recovery

EMC ViPR Controller. Backup and Disaster Recovery Guide. Version 2.4 Service Pack XXX 01

EMC NETWORKER SNAPSHOT MANAGEMENT

Privileged Access Management Upgrade Guide

Blackboard Managed Hosting SM Disaster Recovery Planning Document

Affordable Remote Data Replication

BUSINESS CONTINUITY AND DISASTER RECOVERY FOR ORACLE 11g

MS 20417B: Upgrading Your Skills to MCSA Windows Server 2012

Cisco Nexus 1000V and Cisco Nexus 1110 Virtual Services Appliance (VSA) across data centers

Jive and High-Availability

Backup Exec Private Cloud Services. Planning and Deployment Guide

Veritas InfoScale Availability

Module: Business Continuity

Cisco Active Network Abstraction Gateway High Availability Solution

Virtual Data Movers on EMC VNX

EMC RECOVERPOINT FAMILY

EonStor DS remote replication feature guide

IMPROVING VMWARE DISASTER RECOVERY WITH EMC RECOVERPOINT Applied Technology

Disaster Recovery Solutions for Oracle Database Standard Edition RAC. A Dbvisit White Paper

WhatsUp Gold v16.3 Installation and Configuration Guide

VMware vcloud Air - Disaster Recovery User's Guide

An Oracle White Paper December, Enterprise Manager 12c Cloud Control: Configuring OMS Disaster Recovery with F5 BIG-IP Global Traffic Manager

Configuring the Redundancy Feature in a KIRK Wireless Server Application Note

EMC VPLEX FAMILY. Transparent information mobility within, across, and between data centers ESSENTIALS A STORAGE PLATFORM FOR THE PRIVATE CLOUD

DISASTER RECOVERY BUSINESS CONTINUITY DISASTER AVOIDANCE STRATEGIES

RecoverPoint for Virtual Machines

EMC VPLEX: LEVERAGING ARRAY BASED AND NATIVE COPY TECHNOLOGIES

Solution Brief Availability and Recovery Options: Microsoft Exchange Solutions on VMware

Configuring the Redundancy Feature in a Polycom KIRK Wireless Server 6000

Pervasive PSQL Meets Critical Business Requirements

Symantec Cluster Server powered by Veritas

Virtual Managment Appliance Setup Guide

Westek Technology Snapshot and HA iscsi Replication Suite

Overview of Luna High Availability and Load Balancing

Backup and Restore of CONFIGURATION Object on Windows 2008

Host Hardening. Presented by. Douglas Couch & Nathan Heck Security Analysts for ITaP 1

EXTENDED ORACLE RAC with EMC VPLEX Metro

CLI Commands and Disaster Recovery System

MICROSOFT CLOUD REFERENCE ARCHITECTURE: FOUNDATION

Configuring High Availability for VMware vcenter in RMS Distributed Setup

Building a Scale-Out SQL Server 2008 Reporting Services Farm

CA ARCserve and CA XOsoft r12.5 Best Practices for protecting Microsoft SQL Server

EMC ViPR Controller. Service Catalog Reference Guide. Version REV 02

Administrator Guide VMware vcenter Server Heartbeat 6.3 Update 1

High Availability and Clustering

Exchange Data Protection: To the DAG and Beyond. Whitepaper by Brien Posey

EMC VIPR SRM: VAPP BACKUP AND RESTORE USING EMC NETWORKER

High Availability and Disaster Recovery for Exchange Servers Through a Mailbox Replication Approach

NEXT GENERATION EMC: LEAD YOUR STORAGE TRANSFORMATION. Copyright 2013 EMC Corporation. All rights reserved.

場次: Track B-2 公司名稱: EMC 主講人: 藍基能

BlackBerry Enterprise Service 10. Version: Configuration Guide

WHITE PAPER. Best Practices to Ensure SAP Availability. Software for Innovative Open Solutions. Abstract. What is high availability?

WANSync SQL Server. Operations Guide

EMC Disk Library with EMC Data Domain Deployment Scenario

vrealize Automation Load Balancing

Virtual Infrastructure Security

AUTOMATED DISASTER RECOVERY SOLUTION USING AZURE SITE RECOVERY FOR FILE SHARES HOSTED ON STORSIMPLE

Veritas Cluster Server from Symantec

Whitepaper Continuous Availability Suite: Neverfail Solution Architecture

Configuring the Redundancy Feature in a Spectralink IP-DECT Server 6500

Ecomm Enterprise High Availability Solution. Ecomm Enterprise High Availability Solution (EEHAS) Page 1 of 7

High Availability & Disaster Recovery Development Project. Concepts, Design and Implementation

EMC MID-RANGE STORAGE AND THE MICROSOFT SQL SERVER I/O RELIABILITY PROGRAM

EMC ViPR Software Defined Storage

ABSTRACT. February, 2014 EMC WHITE PAPER

Synology High Availability (SHA): An Introduction Synology Inc.

13.1 Backup virtual machines running on VMware ESXi / ESX Server

Frequently Asked Questions: EMC ViPR Software- Defined Storage Software-Defined Storage

VMware vsphere Data Protection Evaluation Guide REVISED APRIL 2015

NTP Software VFM Administration Web Site for EMC Atmos

High-Availability User s Guide v2.00

Veritas Storage Foundation High Availability for Windows by Symantec

HPOM 8.1 High Availability Utilizing Microsoft SQL Server Log Shipping

Maximum Availability Architecture. Oracle Best Practices For High Availability. Backup and Recovery Scenarios for Oracle WebLogic Server: 10.

IP Storage On-The-Road Seminar Series

20417-Upgrading Your Skills to MCSA Windows Server 2012

High-Availablility Infrastructure Architecture Web Hosting Transition

W H I T E P A P E R. Disaster Recovery Virtualization Protecting Production Systems Using VMware Virtual Infrastructure and Double-Take

SQL Server 2012/2014 AlwaysOn Availability Group

Disaster Recovery on the Sun Cobalt RaQ 3 Server Appliance with Third-Party Software

SQL Server Mirroring. Introduction. Setting up the databases for Mirroring

BUSINESS CONTINUITY AND DISASTER RECOVERY FOR SAP HANA TAILORED DATACENTER INTEGRATION

Transcription:

Active site backup Configure the ViPR Controller Active site for a daily backup. Configuring your ViPR Controller Active site for a daily backup schedule and uploading backups automatically to an FTPS server is highly recommended, even if ViPR is configured with Recovery. You can configure backups from the System > General n > Backup page. In the event of a Switchover or Failover Recovery operation, the backup schedule will automatically continue using the same settings on the new Active site. Recovery Limitations and Best Practices Recovery Limitations Please note the following Recovery Limitations: At the present time, the system supports up to two Standby sites. IP address change is not supported while a ViPR Controller instance is part of a Recovery environment. If IP addresses must be changed, you can: For the Active site, unconfigure Recovery by deleting all Standby sites, change the Active site IP address, then redeploy and re-add the Standby sites. Delete a specific Standby site, and redeploy and re-add it with a new IP address. For all Standby sites, delete all Standby sites, change IP addresses in the remaining Active site, then redeploy and re-add the Standby Sites. In the Recovery environment, the Restore of a backup into a new ViPR Controller instance with different IP addresses or a different number of nodes is not supported. This situation results from the unlikely scenario of a complete loss of the Active site and failure to execute the Failover action at the Standby site. Please contact EMC Support at https://support.emc.com and refer to KB article 000482840 for a workaround. Recovery and GEO (Multi-Site ViPR Controller) are mutually exclusive and are not supported at the same time. You cannot add Recovery in a GEO environment, and cannot add a new VDC to a Recovery environment. Restore of a backup to the Active site of a system that has Recovery sites configured is not supported. Restoring a backup taken in a Recovery environment can be done only in a freshly deployed, single VDC, ViPR Controller instance, following the normal Restore procedure. After the Restore, the Standby sites shown in the GUI should be removed, new sites deployed, and the sites re-added to Recovery. Best Practices in Configuring Recovery Following are suggested best practices in configuring ViPR Controller Recovery: 22 EMC ViPR Controller 3.0 Recovery, Backup and Restore Guide

Table 7 Best Practices in Configuring Recovery Network Infrastructure Latency between sites NTP The following ports should be enabled for all nodes in remote ViPR Controller nodes: TCP ports: 443, 2888, 2889, 7000, 7100 UDP ports: 500, 4500 Ensure that you have quality speed network infrastructure between datacenters. NAT across data centers is not supported. The maximum supported latency between Recovery (DR) sites is <= 150ms. Regarding <=150ms latency limitation that applies to ViPR with SMIS in DR case: Suggested configuration for resources DR: you should have two sets of SMIS at each site (one has Local array as local and Remote Array as Remote, another one has Local array as Remote and Remote Array as Local. Same on DR site. If latency <=150ms between remote sites with SMIS, you should be able to safely register all 4 SMIS and have them running concurrently. ViPR could pick any SMIS for discovery. You will not have to worry about power on remote SMIS in the case of an Active site disaster. If latency > 150ms between remote sites with SMIS - you should register all 4 SMIS, but should SHUT DOWN the SMIS which is remote to the current ViPR location, until Failover/Switchover occurs, because we don't want to pick remote SMIS for discovery. NTP Server settings are shared among Active and all other DR sites in ViPR. Configure all redundant NTP servers (including the ones for the DR datacenter) in the Active site. Alternatively, NTP server settings can be changed after Failover/Switchover to a DR site. DNS Note All NTP servers that are used in ViPR Controller configuration must be synchronized with a reliable time source. DNS Server settings are shared among Active and all other DR sites in ViPR. Configure all redundant DNS servers (including the ones for the DR datacenter) in the Active Site. Alternatively, DNS Recovery Limitations and Best Practices 23

Table 7 Best Practices in Configuring Recovery (continued) Auth Provider SMI-S providers RecoverPoint VPLEX Arrays with no ViPR managed resource replication server settings can be changed after Failover/Switchover to a DR site. Auth Server settings are shared among Active and all other DR sites in ViPR. Configure all redundant Auth server URLs (including the ones for the DR datacenter) in the Active Site. Alternatively, Auth server settings can be changed after Failover/ Switchover to a DR site. If the Auth server configured in the Active site is not available due to a disaster, alternative Auth server settings can be changed by the root user after a ViPR failover. Follow the usual best practices for the underlying array's DR operations support in ViPR. For example, SRDF : 1 SMI-S for Site1 array, and 1 SMI-S for Site2 array. Both need to be registered in ViPR for SRDF operations to work. If disaster strikes Site1, you can failover the DR site's volumes using SMI-S at the DR site. You can also provision new non-replicated volumes (if necessary) using the Site2 provider. Once Site1 is back up, you can resume normal operations, OR you can Swap and use reverse vpool to provision, OR you can Failover/Swap back and resume normal operation. Follow usual ViPR best practices for discovering RecoverPoint. It should be sufficient to register one management IP of the RecoverPoint appliance. ViPR discovers and internally stores the DR site's management IP address, so in the case where the registered one goes down, failover can still be performed, as ViPR will use another site's available IP address that was discovered internally. Follow usual the ViPR best practices for discovering VPLEX. Resources are only managed if datacenter in which they are located is online. Failover ViPR, then continue to manage the remaining available resources not impacted by the disaster. Physical resources impacted by the disaster can continue to be managed after they are back online, from the DR site. 24 EMC ViPR Controller 3.0 Recovery, Backup and Restore Guide

Table 7 Best Practices in Configuring Recovery (continued) After all datacenters are back online, ViPR can Switchover to the original datacenter. Arrays with ViPR managed replication technology Setting System Properties Order processing during Failover Backup and Restore Depending on the underlying array/storage type, plan ahead for a potential DR situation and if necessary for business continuity, pre-create any necessary Virtual Pools (with expected Source/Target vpool settings) so that any new provisioning can resume after ViPR failover in a timely manner. Failure to do so in certain situations may result in a longer time to resume any new provisioning operations. For example, reverse direction Source/Target vpools or completely new pools may need to be created later during DR. These virtual pools may be needed in situations where source volumes need to be created in the DR site, while the Active datacenter is being recovered from disaster) Work with your EMC support specialist if you need any help with proper planning for such situations. There are some specific system properties that can be set separately for the Active and Standby sites, once they are connected: Custom Node names Login banner Keystore certificate All other system and security settings are shared among all sites. For in-progress order and array discoveries that were in progress during an Active site disaster : In-progress orders will be marked as Failed. They need to be cleaned up and retried after Failover. In-progress discoveries will be marked as Failed. After Failover, they will be reinitiated, based on normal scheduling. After Failover, queued orders (concurrent orders for same arrays) will continue. After Failover, a storage re-discovery and provider rescan job are initiated immediately. A new order should be initiated after that. Regular backup is still highly recommended, even after a DR configuration is in place. Backup helps you keep data from days ago, which may be helpful for other disaster Failover ViPR, then perform any supported DR operations on array replicate resources (such as Failover using ViPR catalog service for SRDF/ RP), using ViPR. After all datacenters are back online, ViPR do a Switchover to the original datacenter. Recovery Limitations and Best Practices 25

Table 7 Best Practices in Configuring Recovery (continued) Cluster State in DR Sites Properties change in a Standby site Standby Mode - services in standby NTP/DNS/AD servers on all sites recovery cases, such as complete data center failures, man-made disasters, etc. In DR environments, backup uploads to FTP/FTPS sites will have an appended site ID. Only the Active site will have backup scheduler running. After Switchover or Failover is done, the backup schedule continues as it was configured on the original Active site. In a DR environment, it is better to Failover to a Standby DR site instead of Restoring. If DR cannot be done, follow with a Restore from backup. The Restore from backup procedure has extra steps in a DR environment: Power off all DR sites before restoring. Failing to do so may bring data from the DR sites into the newly restored site, since their data may be newer than from the restored. Restore from backup. Follow the normal procedure, and redeploy only one site (the Active). From newly restored Active site, delete all DR Standby sites using the ViPR Controller GUI. Verify that ViPR operational as expected. Re-deploy and re-add all DR Standby sites. The Cluster State reporting in DR sites view will show "UNKNOWN" if the site is PAUSED. This is expected behavior. When a Standby site is PAUSED, changing of any properties such as banner, customer names, or keystore is not allowed. Standby site will have three less services less than the Active site. Since NTP/DNS/AD servers are shared among all sites: For the AD server, we recommend using highavailability Auth Provider, so that if production goes down, ViPR can still connect to the redundant one. Alternatively, register all your redundant AD providers in your Active site. 26 EMC ViPR Controller 3.0 Recovery, Backup and Restore Guide

Table 7 Best Practices in Configuring Recovery (continued) Unexpected shutdown or power outage Outage of a majority of nodes General Guidance Note You can also login to ViPR as root user after Failover and add or change the Auth Provider details to point to a recovered Auth Provider, if the URL is different from what you set in the Active site previously. For the NTP and DNS servers: Register redundant NTP and DNS servers in the Active site. Note You can also change or add NTP/DNS settings after Failover on the DR site, if necessary. In the event of an unexpected shutdown or power outage: If a Standby site doesn't come back, shut down the ViPR Controller instance and reboot. Redeploy the ViPR Controller instance and re-add it to the Active site as a Standby. If the Active site doesn't come back, shut down the ViPR Controller instance and reboot. Then do a Failover to a Standby site, redeploy the ViPR Controller instance and add it to the site. Finally do a Switchover. If the Active site or a Standby site temporarily experience an outage of a majority of their nodes, the Standby site can return to a synchronized state with the Active site only after ALL nodes of the Active and Standby site(s) are back and their controller status is "STABLE". After recovering ViPR controller itself on the Standby site, and bringing physical assets online (if applicable), you can proceed to: Use ViPR protection catalog service to enable ViPRmanaged array replicated resources on the Standby site, if applicable (for example, use ViPR catalog to failover SRDF or RP volumes to the Standby site). Continue to use ViPR for provisioning on available assets. Contact your EMC support representative for assistance in planning of the end-to-end DR configuration for use cases based on your datacenter physical assets and environment. Related documents (which is available from the ViPR Controller Product Documentation Index : EMC ViPR Controller 3.0 Integration with RecoverPoint and VPLEX User and Administration Guide Recovery Limitations and Best Practices 27

Table 7 Best Practices in Configuring Recovery (continued) EMC ViPR Controller 2.4 Integration with VMAX and VNX Storage Systems Guide 28 EMC ViPR Controller 3.0 Recovery, Backup and Restore Guide