Data Guard Fast-Start Failover

Similar documents

Oracle Data Guard Fast Start Failover understood!

Oracle Data Guard for High Availability and Disaster Recovery

Use RMAN to relocate a 10TB RAC database with minimum downtime. Tao Zuo tao_zuo@npd.com NPD Inc. 9/2011

Database Recovery For Newbies

Oracle Database 10g: Backup and Recovery 1-2

DisasterRecoverywith. DisasterRecoverywith Oracle Data Guard10gR2

Oracle Data Guard. Caleb Small Puget Sound Oracle Users Group Education Is Our Passion

11. Configuring the Database Archiving Mode.

ASM and for 3rd Party Snapshot Solutions - for Offhost. Duane Smith Nitin Vengurlekar RACPACK

DB2 9 for LUW Advanced Database Recovery CL492; 4 days, Instructor-led

Oracle Database Backup & Recovery, Flashback* Whatever, & Data Guard

Oracle Database 11g: Administration Workshop II DBA Release 2

D12CBR Oracle Database 12c: Backup and Recovery Workshop NEW

ORACLE CORE DBA ONLINE TRAINING

DISASTER RECOVERY STRATEGIES FOR ORACLE ON EMC STORAGE CUSTOMERS Oracle Data Guard and EMC RecoverPoint Comparison

Oracle Database 11g: Administration Workshop II DBA Release 2

Oracle Database 11g: Administration Workshop II Release 2

RMAN What is Rman Why use Rman Understanding The Rman Architecture Taking Backup in Non archive Backup Mode Taking Backup in archive Mode

Why Not Oracle Standard Edition? A Dbvisit White Paper By Anton Els

Connectivity. Alliance Access 7.0. Database Recovery. Information Paper

BUSINESS CONTINUITY AND DISASTER RECOVERY FOR ORACLE 11g

Setup Flashback Database on Data Guard Physical Standby Database for SAP Customers

Configuring Backup Settings Configuring and Managing Persistent Settings for RMAN Configuring Autobackup of Control File Backup optimization

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

Connectivity. Alliance Access 7.0. Database Recovery. Information Paper

Oracle GoldenGate on Disaster Recovery

Advanced Oracle DBA Course Details

Rob Zoeteweij Zoeteweij Consulting

Using Recovery Manager with Oracle Data Guard in Oracle Database 10g. An Oracle White Paper April 2009

Daniela Milanova Senior Sales Consultant

An Oracle White Paper November Oracle Real Application Clusters One Node: The Always On Single-Instance Database

12. User-managed and RMAN-based backups.

Oracle 11g: RAC and Grid Infrastructure Administration Accelerated R2

Cisco Active Network Abstraction Gateway High Availability Solution

D78850GC10. Oracle Database 12c Backup and Recovery Workshop. Summary. Introduction. Prerequisites

High Availability with Postgres Plus Advanced Server. An EnterpriseDB White Paper

Restoring To A Different Location With EBU And RMAN An AppsDBA Consulting White Paper

FactoryTalk View Site Edition V5.0 (CPR9) Server Redundancy Guidelines

Deloitte Solutions Network (SNET) Disaster Recovery POC on Amazon EC2

Oracle Database Disaster Recovery Using Dell Storage Replication Solutions

Maximize Availability With Oracle Database 12c

Maximum Availability Architecture

ORACLE DATABASE HIGH AVAILABILITY STRATEGY, ARCHITECTURE AND SOLUTIONS

Module 14: Scalability and High Availability

An Oracle White Paper March Oracle Data Guard Broker. Best Practices for Configuring Redo Transport for Data Guard and Active Data Guard 12c

Zero Downtime Backup solution for Oracle10g

Backing up a Large Oracle Database with EMC NetWorker and EMC Business Continuity Solutions

Techniques for implementing & running robust and reliable DB-centric Grid Applications

Backup and Restore of CONFIGURATION Object on Windows 2008

Maximum Availability Architecture. Oracle Best Practices For High Availability

Module 07. Log Shipping

Oracle 11g Database Administration

RMAN BACKUP & RECOVERY. Recovery Manager. Veeratteshwaran Sridhar

Disaster Recovery Solutions for Oracle Database Standard Edition RAC. A Dbvisit White Paper

Using Physical Replication and Oracle Database Standard Edition for Disaster Recovery. A Dbvisit White Paper

Oracle Data Recovery Advisor

Disaster Recovery for Oracle Database

Oracle 12c Recovering a lost /corrupted table from RMAN Backup after user error or application issue

Oracle Active Data Guard

Oracle Database 11g: RAC Administration Release 2

Disaster Recovery Solution Achieved by EXPRESSCLUSTER

This appendix describes the following procedures: Cisco ANA Registry Backup and Restore Oracle Database Backup and Restore

Agenda. Overview Configuring the database for basic Backup and Recovery Backing up your database Restore and Recovery Operations Managing your backups

Oracle 11g DBA Training Course Content

Sanovi DRM for Oracle Database

If you have not multiplexed your online redo logs, then you are only left with incomplete recovery. Your steps are as follows:

CA ARCserve Replication and High Availability

Oracle Backup & Recovery

Implementing Highly Available OpenView. Ken Herold Senior Integration Consultant Melillo Consulting

FioranoMQ 9. High Availability Guide

Key Factors For a Successful ODA Deployment

Implementing an Enterprise Class Database Backup and Recovery Plan

High Availability Solutions for the MariaDB and MySQL Database

Backup Types. Backup and Recovery. Categories of Failures. Issues. Logical. Cold. Hot. Physical With. Statement failure

DeltaV Virtualization High Availability and Disaster Recovery

Contents. SnapComms Data Protection Recommendations

Restore and Recovery Tasks. Copyright 2009, Oracle. All rights reserved.

An Oracle White Paper March Backup and Recovery Strategies for the Oracle Database Appliance

IBM Sterling Control Center

Oracle Recovery Manager 10g. An Oracle White Paper November 2003

How to protect, restore and recover SQL 2005 and SQL 2008 Databases

Backups and Maintenance

High Availability and Disaster Recovery Solutions for Perforce

MOC 5047B: Intro to Installing & Managing Microsoft Exchange Server 2007 SP1

Zen Internet. Online Data Backup. Zen Vault Professional Plug-ins. Issue:

Ultimate Guide to Oracle Storage

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into

Configuring High Availability for VMware vcenter in RMS Distributed Setup

RBI BCP READINESS REPORT. Auto generated by Sanovi DRM

Transferring AIS to a different computer

Designing a bullet-proof Disaster Recovery Architecture for business-critical Applications

Mastering Disaster Recovery: Business Continuity and Virtualization Best Practices W H I T E P A P E R

Application Continuity with BMC Control-M Workload Automation: Disaster Recovery and High Availability Primer

Case Study: Oracle E-Business Suite with Data Guard Across a Wide Area Network

Tushar Joshi Turtle Networks Ltd

TECHNICAL REPORT. Nimble Storage Oracle Backup and Recovery Guide

Oracle Database 10g: New Features for Administrators

Transcription:

Data Guard Fast-Start Failover DOAG Regio Stuttgart 18.05.2006 Torsten Rosenwald Torsten.Rosenwald@trivadis.com

2 Data Guard and Maximum Availability > Introduction > Concept / Architecture > Flashback & Reinstate > Fast-Start Failover Agenda Know-how We know how > Core Messages

3 The Data Guard Manager > Data Guard is part of Enterprise Edition (no special option), it includes all functionality 'to manage Standby Databases' > The Data Guard Broker Framework (applications and language) facilitates the following tasks of Data Guard» Setup and configuration» Monitoring/control of Redo Log transport- and log apply services» Core operating tasks (Switchover, Failover,Reinstate, Fast-Start Failover, etc.) > Standard Edition doesn t contain Data Guard features like» Automated log-transport or» Managed Recovery Modus > There is a Trivadis package implementing the basic Data Guard functionality

4 Data Guard Fast-Start Failover > Introduction > Concept / Architecture > Flashback & Reinstate > Fast-Start Failover Agenda Know-how We know how > Miscellaneous > Core Messages

5 Dataguard Broker Framework Oracle Data Guard GUI or Command Line Interface Primary Site Primary Database Local Archiving Data Guard monitor Remote Archiving Data Guard monitor Real Time Log Apply Standby Site Standby Database Archived Log Files Log Apply if no standby redo log files configured Online Log Files Standby Log Files Log Transport

6 Data Loss Protection Modes > Maximum Performance: less performance impact to the primary database, asynchronous redo transfer > Maximum Availability: highest possible level of data protection without compromising the availability of the primary database, synchronous redo transfer when the standby database is up > Maximum Protection: this protection mode ensures that the primary database and at least one standby database are always synchronous DGMGRL> EDIT CONFIGURATION SET PROTECTION MODE AS MAXAVAILABILITY;

7 Data Guard and Maximum Availability > Introduction > Concept / Architecture > Flashback & Reinstate > Fast-Start Failover Agenda Know-how We know how > Core Messages

8 Why Flashback? > ASM is nice» but the real value is leveraged together with RAC > Flashback database is nice» but the real value is leveraged in a Data Guard Environment» Almost all real cool features provided with 10g Data Guard need Flashback Database!

9 Concept: Flashback Database 33557 34382 37121 41389 52891 SCN Incremental Level 0 Restore Recovery Flashback Flashback LOGS

10 Concept of Database Reinstate (1) LION LION OLD Primary former OLD Primary Database Database primary database What about the former primary database? Failover standby database > prior to 10g» backup from the new primary database» duplicate with backup» recreate standby database > from 10g» reinstate database

11 Concept of Database Reinstate (2) LION LION LION FAILOVER TO THEDB_BERLIN' OLD Primary former OLD Primary Database OLD primary Primary database Database Database TIGER Standby primary Database database REINSTATE DATABASE THEDB_BONN' standby database

12 Data Guard and Maximum Availability > Introduction > Concept / Architecture > Flashback & Reinstate > Fast-Start Failover Agenda Know-how We know how > Core Messages

13 Physical Standby: Startup Behavior 10g versus 9i startup recover managed standby database disconnect; alter database open; dmon 10g takes over alter database mount; dmon 9i takes over startup nomount

14 Physical Standby Startup Issue (1) > What is the biggest problem in a cluster? Split Brain! > What is the biggest problem in a Data Guard environment?? PRIM DB PRIM DB More than one primary! > How can this happen? Primary re-availability after standby activation

15 Physical Standby Startup Issue (2) > Automatic database startup as part of system startup» 2 primary databases possible after standby has been activated > Manual database startup after system boot» No issue after activation of standby, because manual intervention is necessary anyway» Requires additional attention after every system startup > Is there a better solution?

16 Physical Standby Activation Issue > Main criticism of standby databases: too much manual action > Manual intervention is required for a failover» Need some administrative checks before to validate the status of the standby database, e.g. if all redo are applied» More downtime > Manual intervention to recreate a new standby database» No HA until the setup of the new standby is finished > How can this be addressed? Fast-Start Failover

17 Concept 1. Observed Data Guard environment Primary Standby 2. Fast-Start-Failover (automatic) Primary Primary 3. Reinstate (automatic) Standby Primary

18 When is a Fast-Start Failover triggered? > Primary site failure» Server crash or server shutdown (without database shutdown) > Primary database failure» Instance failure (last running instance if RAC)» Shutdown abort (but not with normal or immediate)» Data file is taken offline > Network failure (special case)» Documentation of when and when not automatic activation will happen is quite large. Read and test carefully. We will show one case.

19 Network Failure (1) observer primary DB log transport standby DB Select fs_failover_status,fs_failover_observer_present from v$database; ---on primary site FS_FAILOVER_STATUS FS_FAILOVER_OBSERVER_PRESENT -------------------- ----------------------------- SYNCHRONIZED NO

20 Network Failure (2) observer FAILOVER start database STALLED log transport standby DB Select fs_failover_status,fs_failover_observer_present from v$database; ---on primary site FS_FAILOVER_STATUS FS_FAILOVER_OBSERVER_PRESENT -------------------- ---------------------------- STALLED NO

21 Network Failure (3) observer database STALLED new primary DB Select fs_failover_status,fs_failover_observer_present from v$database; ---on new primary site FS_FAILOVER_STATUS FS_FAILOVER_OBSERVER_PRESENT -------------------- ---------------------------- REINSTATE REQUIRED YES

22 Network Failure (4) database reinstate observer new standby DB log transport new primary DB Select fs_failover_status,fs_failover_observer_present from v$database; --on primary site and standby site FS_FAILOVER_STATUS FS_FAILOVER_OBSERVER_PRESENT -------------------- ----------------------------- SYNCHRONIZED YES

23 Observer location? Main computing center = Observer Standby computing center 4 3 1 2 log transport primary DB Public LAN 5 standby DB

24 Observer location 1 2 > Not really an option > Reason» no prevention from system crash! > Consequence» additional observer machine is necessary! primary / standby DB

25 Observer location 3 > Close (same fire prevention area) to the Standby Database > Advantages» scenario - complete failing main computing center will be addressed > Disadvantages» primary database will be heavily dependent from the network» therefore unnecessary failover events are possible Primary Standby

26 Observer location 4 > Close (same fire prevention area) to the primary database > Advantages» Fast-start-failover works in most important error situations - Instance crash, media failure, database file offline» primary database is not that much dependent on the network» no unnecessary activations due to networking issues > Disadvantages» loss of the whole main computing center is not addressed Primary Stand

27 Observer location 5 > Third computing center / somewhere within the public LAN > Advantages» basically all error scenarios addressed > Disadvantages» Observer is separated from a network point of view - Therefore the observer itself is more dependent on the network» most companies do not operate 3 computing centers» running the observer on some PC or whatsoever in the public LAN means reduced availability Primary Standb

28 Observer location > Consequences» Fast-start-failover is not an appropriate solution to overcome the loss of a whole computing center. It is not a failover cluster!» After switchover, setup 4 turns into setup 3 and vice versa Exception: the observer is switched somehow as well» In many real life situations (no 3 computation centers) option 4 will be the best choice (tradeoff)

29 Observer location The compromise ain omputing enter Observer standby computing center log transport primary DB Public LAN standby DB

30 Observer - Requirements > Observer machine and configuration > Special entry in Data Guard Broker configuration > Maximum Availability Mode (mandatory)» but: special startup behaviour» but: primary stalls in certain situations > Flashback database must be activated

31 Observer - Data Guard additional Configuration > Not much to configure, but much to describe (see manual) edit database THEDB_BONN' set property FastStartFailoverTarget = THEDB_BERLIN'; edit database THEDB_BERLIN' set property FastStartFailoverTarget = THEDB_BONN'; edit configuration set property FastStartFailoverThreshold = 15; enable fast_start failover; > Fast-Start Failover is a feature of Oracle Data Guard, and can't run without a Data Guard Broker configuration!

Observer Configuration» Start of Observer connect sys@thedb_bonn start observer» Better write a shell script with background execution, start observer does not terminate, - use the logfile option dgmgrl -logfile /u00/app/oracle/local/dba/log/observer.log sys@thedb_bonn "start observer"» Change name of the observer binary file, this file is created in the working directory where you start the observer fsfo.dat. With the parameter 'FILE' you can change the file name descriptor, but not the location Start observer file=fsfo_<dg_configuration_name>.dat Oracle 10g High Availability - Data Guard and Maximum Availability 32

33 Demo: Fast Start Failover 1. Configure Fast_Start Failover 2. Start Observer with connect to primary 3. Shutdown abort on the primary database THEDB_BONN 4. Wait until Fast_Start occurs on THEDB_BERLIN 5. Restart the old primary THEDB_BONN 6. Verify that observer reinstates database THEDB_BONN

34 Conclusion (1) + Prevention of "Split Brain" due to accidental startup of former primary database + Reduced downtime through automatic activation of the standby database + It is a small step for the DBA, but a giant leap from an availability point of view + It is easy to configure + The necessary checks are automatically done before a failover is started

35 Conclusion (2) + A failover solution without a shared disk system + with additional advantages (enhanced data availibity) + and even reduced failover time compared to HA cluster Many technical prerequisites (Flashback database, special Maximum Availability Mode) No automatic failover to a second standby database possible

36 Data Guard and Maximum Availability > Introduction > Concept / Architecture > Flashback & Reinstate > Fast-Start Failover Agenda Know-how We know how > Core Messages

Data Guard and Maximum Availability - Core messages Oracle 10g High Availability - Data Guard and Maximum Availability 37 > Data Guard 10g» Flashback makes the difference > by Trivadis At the core it's about data. > Fast-Start Failover» Protection from 2 primary databases due to inadvertend restart of failed primary database» Rather easy implementation / configuration» Reinstate database even automatically» Very short failover times

Data Guard Fast-Start Failover DOAG Regio Stuttgart 18.05.2006 Torsten Rosenwald Torsten.Rosenwald@trivadis.com