Vanderbilt University Medical Center: Who are we? Vanderbilt University Medical Center By the Numbers: Licensed Beds (Sept.2005)



Similar documents
Seeking Your Contingency Plan: Are You. Panel Members: Johns Hopkins University & Health System

Assuring High Availability in Healthcare Interfacing Considerations and Approach

Business Continuity & Recovery Plan Summary

Business Continuity & Recovery Plan Summary

Oracle Maps Cloud Service Enterprise Hosting and Delivery Policies Effective Date: October 1, 2015 Version 1.0

Best Practices in Disaster Recovery Planning and Testing

Simplify Your Data Protection Strategies: Best Practices for Online Backup & Recovery

Disaster Recovery 101. Sudarshan Ranganath & Matthew Phillips Ellucian

Running Successful Disaster Recovery Tests

A Framework to Support Healthcare Continuity of Operations in an Information Technology Failure:

Section 3 -- Framework Design: Hardware and Software

G-Cloud 6 Service Definition DCG Cloud Disaster Recovery Service

UK HOSPITAL. Projects NOT Involving the General Fund, Road Fund, or Agency Bonds* Project Title / Description Total Budget Source(s)

Attachment to Data Center Services Multisourcing Service Integrator Master Services Agreement

How to Plan for Disaster Recovery and Business Continuity

Designtech Cloud-SaaS Hosting and Delivery Policy, Version 1.0, Designtech Cloud-SaaS Hosting and Delivery Policy

Computer Visions Course Outline

IT Disaster Recovery Plan Template ABC PVT LTD

Disaster Recovery Solution Achieved by EXPRESSCLUSTER

The Joint Commission Approach to Evaluation of Emergency Management New Standards

DB2 9 for LUW Advanced Database Recovery CL492; 4 days, Instructor-led

Antelope Enterprise. Electronic Documents Management System and Workflow Engine

Maintaining a Microsoft SQL Server 2008 Database

Domain Name Service Service Level Agreement (SLA) Vanderbilt Information Technology Services

ON CALL, ALL THE TIME DISASTER RECOVERY AS A SERVICE FROM WINDSTREAM

SLA. Co-Managed. Do-It-Yourself

Data Center Hosting: Build, Upgrade or Partner Tuesday, February 21, a.m. 12 p.m.

The University of Iowa. Enterprise Information Technology Disaster Plan. Version 3.1

IT Disaster Recovery Plan Template

<Client Name> IT Disaster Recovery Plan Template. By Paul Kirvan, CISA, CISSP, FBCI, CBCP

Data Center Infrastructure & Managed Services Outline

Technical Considerations in a Windows Server Environment

Blackboard Managed Hosting SM Disaster Recovery Planning Document

MiServer and MiDatabase. Service Level Expectations. Service Definition

Disaster Recovery & Business Continuity Dell IT Executive Learning Series

Executive Brief Infor Cloverleaf High Availability. Downtime is not an option

Business Continuity Plan CLINICAL DEPARTMENT [Template]

This article Includes:

The Shift Cloud Computing Brings to Disaster Recovery

Stellar Phoenix Exchange Server Backup

Web- Hosting: Service Level Agreement

Explain how to prepare the hardware and other resources necessary to install SQL Server. Install SQL Server. Manage and configure SQL Server.

Hospital Authority. Enhanced Contingency Planning for IT System Failures

Ongoing Help Desk Management Plan

A Guide to Healthcare IT Disaster Recovery

Protecting Microsoft Hyper-V 3.0 Environments with CA ARCserve

Is It Time for a Clinical Help Desk?

Web-Hosting: Service Level Agreement

Electronic Medical Record Adoption Model (EMRAM) John Rayner Director of Professional Development HIMSS-UK

How Routine Data Center Operations Put Your HA/DR Plans at Risk

Boston College. Departmental Business Continuity Planning

Managed Hosting Evaluating Blackboard Managed Hosting Vs. Self Hosting

Blackboard Collaborate Web Conferencing Hosted Environment Technical Infrastructure and Security

Veritas Storage Foundation High Availability for Windows by Symantec

EMORY HEALTHCARE INFORMATION SERVICES DEPARTMENT - EMR Implementations at a Glance

Enterprise UNIX Services - Systems Support - Extended

Communication systems must be readily available, reliable and properly maintained to support an emergency or disaster.

Commercial EHR - A Review of the Program

Integrated Application and Data Protection. NEC ExpressCluster White Paper

Building a Disaster Recovery Program By: Stieven Weidner, Senior Manager

Central Server Hosting Service Detail

Hospital IT Expenses and Budgets Related to Clinical Sophistication. Market Findings from HIMSS Analytics

Five Secrets to SQL Server Availability

Security+ Guide to Network Security Fundamentals, Fourth Edition. Chapter 13 Business Continuity

LOSS OF HEATING/VENTILATION/AIR CONDITIONING (HVAC)

SQUIZ SOLUTIONS. Disaster Recovery and Security October 13. Zetland House Clifton Street London EC2A 4LD

Start Your Disaster Recovery Plan TODAY! Bob Boyd Agility Recovery. To download a copy of the slides, please visit:

Why Your Backup/Recovery Solution Probably Isn t Good Enough

Course Syllabus. At Course Completion

Case Study: Using Predictive Analytics to Reduce Sepsis Mortality

Leveraging Virtualization for Disaster Recovery in Your Growing Business

Module 7: System Component Failure Contingencies

High Availability and Disaster Recovery for Exchange Servers Through a Mailbox Replication Approach

Systems Support - Standard

Abhi Rathinavelu Foster School of Business

Reduce the cost and delays associated with troubleshooting with the help of round-the-clock Service Desk Support.

How To Get A Server At The Alaska University Of Alaska

SCADA Business Continuity and Disaster Recovery. Presented By: William Biehl, P.E (mobile)

How To Prepare For A Disaster

Disaster Recovery Solutions for Oracle Database Standard Edition RAC. A Dbvisit White Paper

SolveXia Business Continuity Planning

Module: Business Continuity

INFORMATION ASSURANCE

Disaster Recovery Plan Checklist

CLOUD SERVICES FOR EMS

Justifying an Investment in Disaster Recovery

DISASTER RECOVERY. Omniture Disaster Plan. June 2, 2008 Version 2.0

Server Monitoring & Management Services

EMERGENCY PREPAREDNESS PLAN Business Continuity Plan

Enterprise Remote Monitoring

How to Design and Implement a Successful Disaster Recovery Plan

Disability Sport Events DSE EVENTS CONTINGENCY PLAN

Backup and Recovery for SAP Environments using EMC Avamar 7

a Disaster Recovery Plan

Offsite Disaster Recovery Plan

Emergency Management & Business Continuity Program Self-Assessment Checklist

Automated Disaster Recovery With BMC Atrium Orchestrator

Rapid recovery from bare metal, to dissimilar hardware or to and from virtual environments.

IT Service Continuity Plan <Video Conference Service> - CS DocDB #4313

Transcription:

Managing Clinical Information Systems Recovery and Patient Care Impact Following A Hospital Data Center Emergency Power Off Event (When Tabletop Exercises Become Reality) Kevin Chenoweth Vanderbilt University Medical Center Nashville, Tennessee Email: Kevin.Chenoweth@Vanderbilt.edu Vanderbilt University Medical Center: Who are we? Located in Nashville, Tennessee Academic Medical Center: Clinical, Research, Teaching Serving Middle Tennessee, Southern Kentucky, Northern Alabama and The World! Only Level 1 Trauma Center in Middle Tennessee Vanderbilt University Medical Center By the Numbers: Licensed Beds (Sept.2005) Adult and Children s Hospitals, Clinics (832 and 215 licensed beds, respectively) Psychiatric Hospital (88) Rehab Hospital (80) Campus and Satellite Clinics 1

Vanderbilt University Medical Center By the Numbers: Discharges and Clinic Visits (FY06) Discharges Ambulatory Visits 1,020,000 Outpatient Surgeries 19,100 Emergency Room Visits 91,000 43,000 (Excludes Psychiatric Hospital) Vanderbilt University Medical Center: Clinical Information Systems Healthcare s 100 Most Wired (2005, 2006) Electronic Medical Record (EMR) Computerized Physician Order Entry (CPOE) Closed-Loop Dispensing (pilot) Film-less Imaging Paperless Clinics Electronic Whiteboards (ED, ICU, Clinics), Inpatient bedboard Bedside Monitoring and Documentation Vanderbilt University Medical Center: Disaster Recovery Strategy Internal recovery strategy for select systems, using a separate backup facility (i.e. 2nd data center) featuring SRDF, HA clustering and Standby O/S recovery solutions over a diverse path DWDM network Backup facility (Co-lo arrangement): Class-A data center space, recovery floor space and conference space, with redundant HVAC and power (including on-site backup power generation) VUMC provides servers, disk, network (everything but the space, power, and cooling) 2

Vanderbilt University Medical Center: Recovery Challenges RTO <4hrs for critical clinical systems RPO = Time of Event Multiple platforms (mainframe, mid-range, Linux, Windows, DB2, Oracle, SQL, commercial and inhouse developed apps) Multiple Dependencies Multiple Interfaces Semi-centralized support model The Tabletop Exercise: June 28 th, 2006 Objective: Evaluate Computer Operations, Facilities and Helpdesk Emergency Response Procedures Scenario based on event that recently occurred at another hospital Primary Data Center fire suppression in by-pass mode while maintenance is performed, i.e. detection system is active but FM200 tanks are temporarily deactivated Minor server fire leads to Emergency Power Off (EPO) Event The EPO Event: Sunday July 16 th, 2006. 03:20 Scheduled maintenance to integrate third UPS into the primary data center UPS system requiring upgrade of input power service feeds and bypass feeds (14 hours) Planned load transfer from UPS1 and UPS2 to UPS3 failed. UPS3 shutdown (automatics safety feature) (00:06) Vendor proposed removing UPS s from the equation and running in maintenance bypass mode (via the newly installed distribution panel) to avoid taking a load loss. IT Administrator On Call notified and approval given to proceed (00:25) Transferred load to bypass data center operating solely from generators via new distribution panel (03:15) New Distribution Panel Maintenance By-Pass Breaker short-circuited; Breaker was opened to prevent further damage, personal injury and risk to data center facility, resulting in EPO (03:20) 3

Data Center EPO: Event Management Immediate Impact Emergency Response and Notification IT Impact Assessment and Response Clinical Impact Assessment and Response IT/Hospital Administration Realtionship, Communication and Response IT Disaster Recovery Business Unit Recovery Post Event Critique Lessons Learned Data Center EPO: Immediate Impact Loss of power to primary data center and network Loss of clinical applications (critical and noncritical) House-wide Downtime (HWDT) for all hospital units, Ancillaries, and Emergency Departments Loss of automated notification system Loss of email application Loss of Service Desk application Data Center EPO: Emergency Response and Notification Overhead announcement to all hospital areas Use Down-Time Procedures Computer Operations notify: IT Administrator On Call Service Desk DR On Call IT support groups Service Desk notify: Hospital Administration Downtime Notification List Report (end users) 4

Data Center EPO: IT Impact Assessment and Response 03:35 Generator power restored to data center (no UPS battery backup) 04:15 Emergency Response Team begin to arrive on site - Facilities provide DR Team and IT Administrator with status update Recovery time (i.e. start of HWDT to end of HWDT) estimated at 4 hours (based on previous experience) 04:30 IT Command Center established (DR Team, Service Desk Manager) Conference Call Bridge initiated Data Center EPO: Clinical Impact Assessment and Response Activate Downtime Procedures for: Registration, Census, Bed Management Order entry Pharmacy Ancillary results reporting (e.g. Lab) Radiological studies (film or studies read at device) Staffing level assessment Fetal monitoring (1 nurse per delivery room) CCU (Vitals Monitoring) Patient Transport, etc Hospital placed on diversion IT/Hospital Administration Relationship, Communication and Response Partnership Information shared via Conference Call Bridge Outage status Impact Estimated recovery time (emphasizing estimated) Orange Alerts Escalation assessment Hospital on diversion Hospital Emergency Operations Center activation Maintain communication (regular status updates) 5

Data Center EPO: IT Disaster Recovery DR Team and IT Admin On Call review recovery plan options, agree strategy Decision made to failover mainframe only to 2 nd data center (Census, clinical systems interface hub ) Recovery of applications by priority and critical dependencies DR Team establishes presence in data center as extension of IT Command Center to liaise with IT Admin On Call Computer Operations IT Recovery staff Data Center EPO: Business Unit Recovery (It s not over til it s over) 08:30 Service Desk application functional 08:34 End Of House-wide Downtime (Start to End = 5.25 hours). Census system available. Multiple systems still not recovered - State of Graceful Degradation 10:00 Normal power restored 11:00 Mainframe failback (required 15 minutes HWDT) 16:00 Lab system caught back up 6

Data Center EPO: Post-Event Critique IT Conducts Post Event Critique Root Cause Analysis IT Recovery and Response Solicit input from Hospital Administration and clinical areas Report created and circulated to senior management, participants Dept. Emergency Preparedness conducts critique IT provides background and root cause Focused on care unit issues Downtime Procedures and communication Staffing Data Center EPO: IT Disaster Recovery Issues Synopsis Lessons Learned Communication Incident Notification - On Call report not intuitive On Call information incomplete Service Desk recently moved out data center Between recovery teams (IT/IS) and recovery coordinators End User Communication Data Center servers managed by other groups What do support personnel want to know? Recovery Priority and Graceful Degradation Understanding new systems and support structure Bullet-proofing the mission critical and critical path systems Data Center EPO: Did the Tabletop Exercise Help? Yes Issues experienced were accurately predicted by tabletop exercise Reinforced the value of tabletop exercises No If issues identified aren t expeditiously addressed, documented and published Different crew on duty morning of July 16th 7

Data Center EPO: In perspective 12/30/02 (Monday) VUH Data Center Power Outage NES nuisance trip HWDT 19:24-06:42 (Approx 11.5 hours) All systems impacted 3/9/03 (Sunday) VUH Data Center Power Outage (Inverter and UPS failure) HWDT 13:00-21:35 (8.5 hours approx) All systems impacted. Residual impact past end of HWDT 3/13/05 (Sunday) VUH Data Center EPO HWDT 13:07-17:03 (approx 4 hours) All systems on UPS1 impacted. Residual impact past end of HWDT 8/7/05 (Sunday) 2 nd Data Center Transformer Fire HWDT 13:37-17:28 (approx 4 hours) Significant delay gaining access to the 2 nd Data center facility 7/16/06 (Sunday) VUH Data Center EPO HWDT 03:20 08:34 (approx 5.25 hours) All systems. Residual impact past end of HWDT 8