Creating a Technical Disaster Recovery Implementation Plan (TDRIP)

Similar documents

Dbvisit Standby Disaster Recovery Solution

ROADMAP TO DEFINE A BACKUP STRATEGY FOR SAP APPLICATIONS Helps you to analyze and define a robust backup strategy

Using Physical Replication and Oracle Database Standard Edition for Disaster Recovery. A Dbvisit White Paper

a Disaster Recovery Plan

Disaster Recover Challenges Today

Best Practices Report

Data Center Optimization. Disaster Recovery

Storage and Disaster Recovery

Virtualization & Covance Inc.

INSIDE. Preventing Data Loss. > Disaster Recovery Types and Categories. > Disaster Recovery Site Types. > Disaster Recovery Procedure Lists

Library Recovery Center

Module 5 Introduction to Processes and Controls

Running Successful Disaster Recovery Tests

Welcome to My E-Book

Disaster Recovery Solutions for Oracle Database Standard Edition RAC. A Dbvisit White Paper

Why Not Oracle Standard Edition? A Dbvisit White Paper By Anton Els

Disaster Recovery. Hendry Taylor Tayori Limited

DISASTER RECOVERY BUSINESS CONTINUITY DISASTER AVOIDANCE STRATEGIES

Disaster Recovery Hosting Provider Selection Criteria

DISASTER RECOVERY STRATEGIES FOR ORACLE ON EMC STORAGE CUSTOMERS Oracle Data Guard and EMC RecoverPoint Comparison

Disaster Recovery for Oracle Database

Oracle Database 10g: Backup and Recovery 1-2

How To Back Up A Virtual Machine

Blackboard Collaborate Web Conferencing Hosted Environment Technical Infrastructure and Security

Business Continuity and Disaster Recovery Planning from an Information Technology Perspective

MySQL Enterprise Backup

HP Shadowbase Solutions Overview

The Shift Cloud Computing Brings to Disaster Recovery

Planning for the Worst SAS Grid Manager and Disaster Recovery

Disaster Recovery & Business Continuity Dell IT Executive Learning Series

SOLUTION BRIEF: KEY CONSIDERATIONS FOR DISASTER RECOVERY

Data Protection in a Virtualized Environment

Virtualization, Business Continuation Plan & Disaster Recovery for EMS -By Ramanj Pamidi San Diego Gas & Electric

Real-time Data Replication

courtesy of F5 NETWORKS New Technologies For Disaster Recovery/Business Continuity overview f5 networks P

EMC NETWORKER SNAPSHOT MANAGEMENT

Tiburon Master Support Agreement Exhibit 6 Back Up Schedule & Procedures. General Notes on Backups

ASM and for 3rd Party Snapshot Solutions - for Offhost. Duane Smith Nitin Vengurlekar RACPACK

HP Data Protector software Zero Downtime Backup and Instant Recovery. Data sheet

DISASTER RECOVERY PLANNING GUIDE

DISASTER RECOVERY WITH AWS

Lunch and Learn: Modernize Your Data Protection Architecture with Multiple Tiers of Storage Session 17174, 12:30pm, Cedar

One Solution for Real-Time Data protection, Disaster Recovery & Migration

WFT - SAP Disaster Recovery Solution Brief Planning and Automating an SAP Landscape Remote Recovery Procedure

Disaster Recovery 101. Sudarshan Ranganath & Matthew Phillips Ellucian

Dbvisit Replicate. Dbvisit Replicate Overview

State of Wisconsin Database Hosting Services Roles and Responsibilities

High Availability Databases based on Oracle 10g RAC on Linux

Using Live Sync to Support Disaster Recovery

HP 3PAR Software Installation and Startup Service

HA / DR Jargon Buster High Availability / Disaster Recovery

Informix Dynamic Server May Availability Solutions with Informix Dynamic Server 11

Backup and Recovery Solutions for Exadata. Ľubomír Vaňo Principal Sales Consultant

Disaster Recovery for Ingres. Abstract

Shadowbase Data Replication Solutions. William Holenstein Senior Manager of Product Delivery Shadowbase Products Group

Agenda. Overview Configuring the database for basic Backup and Recovery Backing up your database Restore and Recovery Operations Managing your backups

ORACLE CORE DBA ONLINE TRAINING

PROTECTING MICROSOFT SQL SERVER TM

Quorum DR Report. Top 4 Types of Disasters: 55% Hardware Failure 22% Human Error 18% Software Failure 5% Natural Disasters

State of Wisconsin DET File Transfer Protocol (FTP) Roles and Responsibilities

Created By: 2009 Windows Server Security Best Practices Committee. Revised By: 2014 Windows Server Security Best Practices Committee

How To Get Atos Paas For Free

Backup and Recovery Solutions for Exadata. Cor Beumer Storage Sales Specialist Oracle Nederland

VMware and VSS: Application Backup and Recovery

How do you test to determine which backup and restore technology best suits your business needs?

Protecting SQL Server Databases Software Pursuits, Inc.

Rajesh Gupta Best Practices for SAP BusinessObjects Backup & Recovery Including High Availability and Disaster Recovery Session #2747

Implementing the Right High Availability and Disaster Recovery Plan for Your Business Date: August 2010 Author: Mark Peters, Senior Analyst

Virtual Server and Storage Provisioning Service. Service Description

Oracle Database 11g: Administration Workshop I Release 2

The 5 Most Commonly Used Disaster Recovery Process

Designing and Deploying Messaging Solutions with Microsoft Exchange Server 2010 Service Pack B; 5 days, Instructor-led

Oracle Database 11g: Security. What you will learn:

Architecting DR Solutions with VMware Site Recovery Manager

Avid MediaCentral Platform Disaster Recovery Systems Version 1.0

ROI of IT DISASTER RECOVERY

VMware vsphere Data Protection Evaluation Guide REVISED APRIL 2015

HP 3PAR 7000 Software Installation and Startup Service

IP Storage On-The-Road Seminar Series

ORACLE DATABASE: ADMINISTRATION WORKSHOP I

The Future of PostgreSQL High Availability Robert Hodges - Continuent, Inc. Simon Riggs - 2ndQuadrant

Tk20 Backup Procedure

What is the Cloud, and why should it matter?

SOLUTION BRIEF Citrix Cloud Solutions Citrix Cloud Solution for Disaster Recovery

Protecting Microsoft SQL Server with an Integrated Dell / CommVault Solution. Database Solutions Engineering

Abstract Introduction Overview of Insight Dynamics VSE and Logical Servers... 2

Advanced Oracle DBA Course Details

Case Study: Oracle E-Business Suite with Data Guard Across a Wide Area Network

COURCE TITLE DURATION. Oracle Database 11g: Administration Workshop I

HP Data Protection. Business challenge: Resulting pain points: HP technology solutions:

INTRODUCTION ADVANTAGES OF RUNNING ORACLE 11G ON WINDOWS. Edward Whalen, Performance Tuning Corporation

CLOUD SERVICE SCHEDULE

Connectivity. Alliance Access 7.0. Database Recovery. Information Paper

Disaster Recovery of Tier 1 Applications on VMware vcenter Site Recovery Manager

Re-Invent Your Recovery

Oracle 12c Multitenant and Encryption in Real Life. Christian Pfundtner

Transcription:

Creating a Technical Disaster Recovery Implementation Plan (TDRIP) Arjen Visser Avisit Solutions Limited Creators of Dbvisit Protection Recovery Continuity

Introducing myself Arjen Visser Founder and CEO of Avisit Solutions Limited in New Zealand. The creators of: Dbvisit Standby Database Technology (Data Guard alternative) Leading software solution providing Oracle Disaster Recovery. Dbvisit is used world-wide. Customers and sectors include: - Kellogg s - Alcatel-Lucent - Oklahoma State Bureau of Investigation - New York Blood Centre - Banking/Financials industry - Healthcare - Government and City Councils - Manufacturing See www.dbvisit.com for more information.

How important is Disaster Recovery to your business? 43% of US businesses never reopen again after a disaster and a further 29% close within 2 years. (*) 93% of companies that suffer significant data loss are out of business within 5 years. (**) (* US Small Business administration) (** US Bureau of Labor)

TDRIP What is it? A Technical Disaster Recovery Implementation Plan (TDRIP) is a plan of the actual implementation of the hardware and software at the disaster recovery location. Why? This plan ensures there are no unforeseen surprises when building the disaster recovery solution and that all critical systems and their components have been accounted for. How? This paper will show how to create a TDRIP plan. This paper focuses mainly on Oracle centric applications in Unix/Linux environment.

Disaster Recovery project time line: Decision Planning Implementation Testing Technical Disaster Recovery Implementation Plan comes at the end of the planning phase. Technical Disaster Recovery Implementation Plan is part of the Disaster Recovery plan.

What happens without a plan?

What is a Disaster? Complete or partial loss of a data center or service for an extended period of time. Earthquake. Flood (Global warming). Tsunami. Fire. Power failure. Hurricane. Pandemics (Swine flu threats). Disgruntled employee for getting back at you for firing them. Human induced disaster (Newbie DBA meant backup not delete!).

What is Disaster Recovery (DR)? Is not High Availability Is not Business Continuity Is not operational recovery Is not an offsite backup Disaster Recovery is: Process to restore operations critical to the resumption of business after a natural or human-induced disaster

What is the best location of the DR site? DR site could be your current primary site. This ensures staff are onsite when disaster strikes. What should be the distance between two sites? Should be geographically different locations. DR is not just about systems, also about people, buildings and processes. Will you or your staff be available when disaster strikes? Ensure you do not take things for granted like cell phones.

Prerequisites required for TDRIP: High level business disaster recovery plan Key metrics of Recovery Point Objective (RPO) and Recovery Time Objective (RTO) are defined Identified mission critical systems Standby hardware budget Standby location Top priority with backing from Senior Management RTO Maximum amount of time before systems are up and running again. How long before business is operational again. RPO Maximum amount of data loss (measured in time) acceptable in the event of a disaster. How much data can we afford to loose without serious impact to the business.

Mission critical systems: DR systems and methodology identified by business needs. Tier RPO RTO Cost I No dataloss <30 min $$$$$$$$$$ II < 30 min < 1 hour $$$ III 24+ hours 48+ hours $$ IV 7+ days 3+ days $ An RTO/RPO of more than 24 hours means that backup may be sufficient for DR. An RTO/RPO of less than 24 hours requires some kind of replication. Normal backups are not good enough anymore.

Assumptions for Disaster Recovery Implementation 1. Asynchronous replication (not synchronous) 2. Host based replication (not array or fabric based)

Technical Disaster Recovery Implementation Plan 8 Steps to creating the TDRIP: 1. Technical register of applications and servers 2. Application consistency groups 3. Server mapping 4. Configuration register for each primary server including OS, patches, firewall rules, etc. 5. Software licenses and media register 6. Oracle standby database implementation 7. Host Replication methodology 8. Best practice primary servers and standby servers

Step 1 - Technical register of applications and servers What servers (and components) should be included in the disaster recovery plan? Map the critical systems identified in the business disaster recovery plan to actual servers and components of servers System Software component Server Size Sales Data Warehouse Oracle database - SALESP avisit012 950G Oracle Warehouse Builder avisit013 20G Repository OWBREP Reporting application server avisit022 2G Web server avisit034 3G Source data system avisit067 120G CRM system Oracle database CRM01P avisit320 550G Web server avisit034 15G

Step 2 - Application consistency groups Application is not just one server and one database Feeds in and out Multiple servers May need to replicate the systems at the exact same point in time to avoid inconsistencies and incomplete processes. System Software component Server Sales Data Warehouse Oracle database - SALESP avisit012 Oracle Warehouse Builder avisit013 Repository OWBREP Reporting application server avisit022 Web server avisit034 Source data system avisit067 CRM system Oracle database CRM01P avisit320 Web server avisit034

Step 3 - Server Mapping (One to one mapping ) Each primary server has a corresponding standby server.

Step 3 - Server Mapping (Many to one mapping ) Primary servers are consolidated on the standby site

Step 3 - Server Mapping (One to one virtual mapping) Each primary server has a corresponding virtual standby server.

Step 3 Comparing Server Mappings Mapping Advantages Disadvantages One to one Easiest to implement Hardware cost is highest Easiest to administer More hardware to maintain Many to one Hardware can be consolidated Conflicts may arise: Software conflicts User ID (UID) conflicts Group ID (GID) conflicts Mount point conflicts Configuration conflicts Port conflicts Patch conflicts One to one virtual Hardware can be consolidated Not all platforms can be virtualised together. Introduces a new layer to the standby platform New backup methods needed as VM are backuped differently. Virtualisation not ideal for databases.

Step 4 Configuration register For all primary servers included in the disaster recovery plan. Aim is to identify the components that are needed and to make a checklist to ensure nothing is left out when building the standby servers. The configuration register should be split into the following categories: Servers Application Software Oracle Databases This configuration register is a good starting point to get an idea of the scope involved in building the standby servers.

Step 4 Configuration register (Servers) List servers with all the following information for each server: 1. Operating Systems, version, level and patches 2. Server software installations that will be needed at the standby servers. This could be monitoring tools, utilities, etc. 3. User ID number (UID) of the Unix/Linux accounts that will be needed at the standby servers 4. Ports that are needed by the specific software 5. Printer configurations on the server that are needed at the disaster recovery site 6. Mount points that are used by the specific software 7. Important configuration files (smb.conf, sshd_config etc) 8. Firewall rules

Step 4 Configuration register (Application Software) List the application software that is needed at the standby site and for each software list the following: 1. Name of software, version, level and patches 2. Installation directory 3. Mount points used 4. Important configuration files

Step 4 Configuration register (Oracle Databases) List databases with the following information for each database: 1. Listener port(s) and network configurations 2. Oracle software installation directory 3. Location of Oracle networking files 4. Oracle version and patches 5. Mount points used by databases 6. Oracle admin/trace directories 7. Others (utl_file_dir etc)

Step 5 Software licenses and media register Media: Identify the software media needed to build the standby servers. Ensure all media and patches are available if software cannot be copied from primary server. Licenses: Ensure standby servers software licenses have been purchased or are accounted for. Ensure standby databases are fully licensed.

Step 6 Oracle standby database implementation Oracle replication methods: 1. Physical standby database (using redo or archive logs) 2. Logical standby database (using SQL and LogMiner) 3. Oracle replication (using triggers or streams) Methods to keep physical standby database up to date: 1. Dbvisit (used world-wide www.dbvisit.com ) 2. Data Guard (Enterprise Edition needed) 3. Home grown scripts (robust enough?) Make sure your Oracle replication method is not your weakest link.

Step 7 Host Replication methodology Assume asynchronous and host based replication One Master: Ensure any changes to the primary servers are also replicated to the standby servers. This includes: New users and passwords Changes to configuration files New or updated printers User files Synchronising methods: rsync rdist Commercial software

Step 8 Best practice primary servers (to avoid conflicts when standby servers are consolidated) Primary servers: 1. Assign range of port numbers that only this server may use 2. Assign range of UID (and GID) that only this server may use. Or have global UID and GID. 3. Pre-fix all non standard mountpoints with a unique identifier for the server to ensure no conflicts when consolidated. (Example: /oradata01 should be /s23-oradata01) 4. Ensure you consider DR site on your change control

Step 8 (ii) Best practice standby servers Keep the same layout as on the primary servers You will make your life a lot easier! Standby servers: 1. Keep the user UID the same as on primary server. 2. Keep GID the same as on primary server. 3. Keep mount points the same as on the primary server. 4. Keep port numbers the same as on the primary server. 5. Keep the layout of database the same.

Summary Technical Disaster Recovery Implementation Plan 8 Steps of TDRIP: 1. Technical register of applications and servers 2. Application consistency groups 3. Server mapping 4. Configuration register for each primary server including OS, patches, firewall rules, etc. 5. Software licenses and media register 6. Oracle standby database implementation 7. Host Replication methodology 8. Best practice primary servers and standby servers

Conclusion Create a plan to be prepared. And finally. >>Test your DR plan regularly<<

End of presentation Thank you www.dbvisit.com At the heart of Oracle Disaster Recovery Oracle is a registered trademark of Oracle Corporation. Dbvisit is a registered trademark of Avisit Solutions Limited.