How To Build A Disaster Recovery Testing Program



Similar documents
Assessing Your Disaster. Andrews Hooper Pavlik PLC. Andrews Hooper Pavlik PLC

Business Continuity Plan

Resource Ordering and Status System. User Business Resumption Plan

Business Continuity Planning Principles and Best Practices Tom Hinkel and Zach Duke

Why Should Companies Take a Closer Look at Business Continuity Planning?

Disaster Recovery Plan (Business Continuity) Template - Version 8.2

How to Plan for Disaster Recovery and Business Continuity

Disaster Recovery Plan (Business Continuity) Template

Disaster Recovery Business Continuity Premium Edition

Building a strong business continuity plan

CENTRAL BANK OF KENYA (CBK) PRUDENTIAL GUIDELINE ON BUSINESS CONTINUITY MANAGEMENT (BCM) FOR INSTITUTIONS LICENSED UNDER THE BANKING ACT

Introduction to Vendor Management

What is Business Continuity Planning (BCP) / Disaster Recovery Plan(DRP)?

2015 CEO & Board University Taking Your Business Continuity Plan To The Next Level. Tracy L. Hall, MBCP

NAVIGATING THROUGH A CATASTROPHIC DISASTER:

How to write a DISASTER RECOVERY PLAN. To print to A4, print at 75%.

The Difference Between Disaster Recovery and Business Continuance

Version Copyright Janco Associates, Inc. - Page 1

Overview of how to test a. Business Continuity Plan

AVLOR SERVER CLOUD RECOVERY

Application / Hardware - Business Impact Analysis Template. MARC Configuration Requirements. Business Impact Analysis

Running Successful Disaster Recovery Tests

Building a Disaster Recovery Program By: Stieven Weidner, Senior Manager

Vendor Management. Outsourcing Technology Services

Business Continuity: Choosing the Right Technology Solution

Q uick Guide to Disaster Recovery Planning An ITtoolkit.com White Paper

HA / DR Jargon Buster High Availability / Disaster Recovery

Business Continuity & Recovery Plan Summary

Cisco Disaster Recovery: Best Practices White Paper

Business Continuity Management

Best Practices in Disaster Recovery Planning and Testing

2014 NABRICO Conference

The Weill Cornell Medical College and Graduate School of Medical Sciences. Responsible Department: Information Technologies and Services (ITS)

Client Security Risk Assessment Questionnaire

How to Design and Implement a Successful Disaster Recovery Plan

THORNBURG INVESTMENT MANAGEMENT THORNBURG INVESTMENT TRUST. Business Continuity Plan

What You Need to Know About Cloud Backup: Your Guide to Cost, Security, and Flexibility

outsourcing is not a bad word 6 Reasons to Outsource Your IT

Best Practices in Developing an IT Disaster Recovery Plan. Vijaykumar Kulkarni AGM Product Management

a Disaster Recovery Plan

PAPER-6 PART-1 OF 5 CA A.RAFEQ, FCA

5 Essential Benefits of Hybrid Cloud Backup

Business Continuity Planning and Disaster Recovery Planning

Oracle Database Review Security Controls and Other Issues Toronto Public Library Management Response

DESIGNATED CONTRACT MARKET OPERATIONAL CAPABILITY TECHNOLOGY QUESTIONNAIRE

Evaluating and Improving Your Business Continuity Plan

Business Resiliency Business Continuity Management - January 14, 2014

Protecting Your Business

Building and Maintaining a Business Continuity Program

How To Choose A Business Continuity Solution

DISASTER RECOVERY PLANNING GUIDE

NIST SP , Revision 1 Contingency Planning Guide for Federal Information Systems

Zerto Virtual Manager Administration Guide

DISASTER RECOVERY PLANNING FOR CITY COMPUTER FACILITIES

Business Continuity Planning (800)

SWAP EXECUTION FACILITY OPERATIONAL CAPABILITY TECHNOLOGY QUESTIONNAIRE

Driver Updater Manual

Disaster Recovery Plan Documentation for Agencies Instructions

Business Continuity and Disaster Recovery Planning

An overview of the Disaster Recovery Planning Process - From Start to Finish

Williamson County Technology Services Technology Project Questionnaire for Vendor (To be filled out withprospective solution provider)

Simplify Your Data Protection Strategies: Best Practices for Online Backup & Recovery

Oracle Maps Cloud Service Enterprise Hosting and Delivery Policies Effective Date: October 1, 2015 Version 1.0

Disaster Recovery Solutions for Oracle Database Standard Edition RAC. A Dbvisit White Paper

Technology Recovery Plan Instructions

OPERATIONAL CAPABILITY TECHNOLOGY QUESTIONNAIRE

Mastering Disaster Recovery: Business Continuity and Virtualization Best Practices W H I T E P A P E R

BME CLEARING s Business Continuity Policy

How To Ensure That Non-Peoplesoft Applications Can Withstand Adverse Events

Service Availability Metrics

CISM Certified Information Security Manager

NHS 24 - Business Continuity Strategy

Ohio Supercomputer Center

External Supplier Control Requirements BCM

Federal Financial Institutions Examination Council FFIEC. Business Continuity Planning BCP MARCH 2003 MARCH 2008 IT EXAMINATION

Temple university. Auditing a business continuity management BCM. November, 2015

Business Continuity Overview

MarketAxess Business Continuity Plan Disclosure

State of South Carolina Policy Guidance and Training

Disaster Preparedness & Response

How to measure your business resiliency

ITMF Disaster Recovery and Business Continuity Committee Report for the UGA IT Master Plan

Virginia Commonwealth University School of Medicine Information Security Standard

Transcription:

Building a Disaster Recovery Testing Program Presented by Steve Carroll Email: scarroll@aboundresources.com Phone: 717-256-1865

About Our Speaker Steve Carroll is a Senior Consultant with Abound Resources. With more than 25 years experience as a community financial institution executive, Steve has worked in a variety of capacities in financial institutions, including consultant and CEO. Since 1996, Steve has worked as a lead consultant on more than 100 financial institution consulting engagements across the country. His areas of expertise include business continuity planning, risk management, strategic business planning, and strategic technology planning. Steve has developed software applications to assist financial institutions in improving their risk management positions, including Abound Resources bplan Web-based Business Continuity Planning system. Steve has completed Institute of Financial Education courses at the University of Texas at Austin, the University of Georgia, and the University of Connecticut.

Who We Are Management consulting firm for the Community Financial Institution (CFI) industry We empower CFIs to achieve their goals. Goals achieved. Guaranteed. Based in Austin, TX; clients in 40+ states Founded in 1997 by industry execs and Big 5 consultants 500+ software evaluations Vendor Neutral Advisors average 25+ years in CFI management; lending, cash management, compliance, operations and IT

What We Do Sales & Marketing

Presentation Highlights Regulatory Issues & Terminology Building a Testing Program Conducting Tests Examples High Availability Environments A Simple Pandemic Exercise Summary Top Five Testing Mistakes

Regulatory Background FFIEC Guidance March, 2008 The Board must approve the Testing Program & review test results IT is responsible for DR Testing The Crisis Management Team should be involved in the testing process Those responsible for Facilities should be involved in the process Test results must be subjected to an Independent Review

Common Regulatory and Audit findings There is no Comprehensive Testing Program in place Testing activities show an over-reliance on a single testing methodology (Table-top) Test activities do not involve departments/users in a meaningful way Test documentation is inadequate No step-by-step restoration procedures No order of restoration defined

Terminology Testing Program A schedule of test events spanning a complete testing cycle. What? When? Who? How? Where? DR Test An event that demonstrates that a given resource can be restored to a production state within a target time frame using a documented restoration method. BCP Test An event that demonstrates that a Business Function can be completed using a contingency procedure.

Budgeting Your BCP Effort 32% 36% Business Impact Analysis Risk Assessment Documentation Emergency Response 9% 5% Testing 18%

Testing Methodologies Methodology Tabletop Exercise Due Diligence Vender Service Levels Independent Review Incident Tracking Compatibility Testing Simulation Production Testing Type of Test/Administrator BCP/BCP Coordinator BCP/BCP Coordinator BCP/BCP Coordinator BCP/BCP Coordinator BCP/BCP Coordinator Disaster Recovery/IT Disaster Recovery/IT Disaster Recovery/IT

Testing Team Getting started Technical representation Operations Department Staff Inventory of Resources Software Applications (Core and Network) Critical Services (Data Communications, Internet) Outsourced Applications & Services List of Business Functions By Department Linked to Resources (if possible)

Build a Database Resource Name* Critical Level Test? RTO (hours) RPO (hours) MAD (hours) Control Group Core System 1 Yes 8 2 72 Core Data Communications 1 Yes 1 -- 24 Network Loan Prospector 3 Yes 48 72 96 Loans Fedline 1 Yes 4 24 48 Fed Network Files 3 Yes 24 72 72 Network Branch Capture 1 Yes 4 8 48 Item Proc Internet Access 1 Yes 1 -- 24 Internet Acrobat Reader 5 No 96 -- 120 -- EMail 1 Yes 12 24 48 Email Internet Banking 1 Yes 4 8 24 Core Laser Pro 3 Yes 48 72 96 Loans

Assign Criticality Levels Criticality is assigned to both Resources and Business Functions Sometimes called Mission Critical or Business Critical Better to use multiple criticality Levels for flexibility Three or five levels, matched to a time frame Example: Level 1 = 1 to 24 hours, Level 2 = 24 to 48 hours, etc. Test Flag Will we test this? (yes or no) Level 1 Resources usually tested by default

Assign Target Timeframes Recovery Time Objective (RTO) Target time frame for resource restoration RPO The maximum capacity for data loss of a given information system, measured in time. Can be assigned to any application, but should be applied at a minimum to Transaction Interfaces. RPO s should be supplemented with a description of how lost data could be reconstructed. Maximum Allowable Downtime (MAD) Estimated maximum downtime for a given resource.

Assign Control Groups Control Groups Create a Control Group for resources that should be tested together. Examples: Core System Loan Systems Internet (Web sites) Examples By Server By Criticality Level By Application type

Create Test Events Build a Control Document for each Test Event: Statement of Objective be clear and concise Example Show that the [software application] can be restored onto new hardware from backup media. Users will log in and verify that the system has been returned to a production state Description of Test Environment How will hardware be replaced? Preinstalled software External connections needed Most likely test methodology Test Date Who is responsible? Who will be present? What documentation (evidence) will be retained? Write a Test Script Include a Reviewer section

Example Test Script Step # Start Time Activity 1 Set up server hardware, workstation & test LAN 2 Install O/S & backup/restore utility 3 Install application from original media (d/l from vendor Web site) 4 Locate backup image for application data & restore to server 5 Install client onto workstation 6 Have user login and verify that work can resume 7 User runs samples of typical transactions 8 Print screens and reports retain for documentation 9 Expected Results Actual Results

Build a Testing Timeline Test Cycle 12 months Assign a target test date to each Test Event/Control Group Strategically space test events across the entire Test Cycle Easy tests can happen more quickly Allow more time for complex tests Consider likely unplanned outages (Incident Tracking)

Resource Restoration Methods Applications & Data Restore from backup Reinstall from original media Installed in multiple locations (redundant) High Availability System failover Hardware Backup Equipment Replace from available market sources (add time to RTO!) Services Contact Supplier

Test Day Print the appropriate Test Control Documents and Scripts, or open the documents on a laptop Line up the Test Participants Test Administrator Technical support Observers/documenter Department users when appropriate Execute test script note start time and results for each step. Complete the document as you go. Testing 80% preparation & documentation, 20% execution

Reporting Electronic files are better that physical Create a folder structure on your network folders and test events with the same name Scan completed Test Scripts and Control Documents; attach to Test Event Keep a schedule of all Test Events, past and future Be able to sort by Date and Status (Pending, Complete) When you re ready to distribute -- copy or Zip the folder structure for emailing or copy to media Print a Test Summary by Date showing test status

High Availability Environments Virtual servers Pro - can immediately cut RTO s in half Con Testing challenges Bandwidth Licensing Staffing Core Synchronization Synch how often? Is more always better? Production testing is complex

A Simple Pandemic Exercise Preparation BCP Coordinator Create an Excel Spreadsheet with 2 columns Column A = Employee Name Column B = Department Use a random selection formula to select 40% of the employee records Make sure you can reference which departments become impacted (use a count record formula or a pivot table) Practice this until you can do it quickly

A Simple Pandemic Exercise Pull your team together for the exercise meeting Use the Pandemic Simulator (Excel spreadsheet) to determine which employees are absent with the flu Determine which Department has the highest level of absenteeism are most affected Review the Business Functions for that Department and develop a strategy for dealing with the incoming work Document everything

Top Five Testing Mistakes 5. Procrastination/Cramming 4. Hiding Failed Tests 3. Reliance on a single methodology 2. Failure to leverage real life 1. Documentation missing or confusing

Questions

Contact Steve Carroll Abound Resources, Inc. Senior Consultant Cell: 717-256-1865 E.Mail: scarroll@aboundresources.com Twitter: @bankbcp