Alabama Supercomputer Center Alabama Research and Education Network. Tornadoes Disaster Continuity of Operations Experience



Similar documents
Higher Education Provider Charts (Undergraduate Students from Alabama by County)

21 st Century Campus Network Responsibility Matrix 5/26/10

Redundancy for Corporate Broadband

Cisco Network Switches Juniper Firewall Clusters

Business Continuity & Recovery Plan Summary

Business Continuity & Recovery Plan Summary

AboveNet Virtual Data Center

Using High Availability Technologies Lesson 12

Virtual Privacy vs. Real Security

FatPipe Networks

Adams Service Bureau Colocation Assessment. Overview

Disaster Recovery & Business Continuity Dell IT Executive Learning Series

State of Texas. TEX-AN Next Generation. NNI Plan

Alabama State Port Authority

ITMF Disaster Recovery and Business Continuity Committee Report for the UGA IT Master Plan

better broadband Redundancy White Paper

City of Coral Gables

Hosting Features, Terms & Policies

Security and Managed Services

Ohio Supercomputer Center

Layer 3 Network + Dedicated Internet Connectivity

Deploying 10/40G InfiniBand Applications over the WAN

FatPipe Networks


How To Improve Nts Information Technology

INSIDE. Preventing Data Loss. > Disaster Recovery Types and Categories. > Disaster Recovery Site Types. > Disaster Recovery Procedure Lists

Comparing Three Solutions

Availability Digest. Banks Use Synchronous Replication for Zero RPO February 2010

moscow//russia data center specifications tel: fax: internet + intellectual property + intelligence

About HGC RFI Review HGC Cloud Services Backup and Restore Services Add-on Services HGC Metro Networking Services Q & A

Disaster Recovery Design Ehab Ashary University of Colorado at Colorado Springs

A Glossary of Web Hosting Terms

with Cloud Infrastructure and Data Center Services

Managing Availability and Failure Avoidance

Subject: ITN for Redundant High Speed Communications (Part II) Off Campus Equipment Location. Proposed Board Action

How To Rank A Data Center

Contents. Foreword. Acknowledgments

Avoid Network Outages Within SaaS and Cloud Computing Environments

Web Site and Hosting Business Resumption Contingency Plan

Nirvanix, Inc. Finds a Secure and Reliable Data Center and a Strategic Partner for Cloud Services at CyrusOne

Broadband Marshall University 2013 Update

Clovis Municipal School District Information Technology (IT) Disaster Recovery Plan

Private Clouds & Hosted IT Solutions

How To Run A Hosted Physical Server On A Server At Redcentric

ColoTN-HCF-01-RFP Questions

DATA CENTERS. VAZATA Headquarters: 6900 Dallas Parkway, Suite 800 Plano, TX VAZATA.COM

BLACK BOX. EncrypTight

HA / DR Jargon Buster High Availability / Disaster Recovery

in other campus buildings and at remote campus locations. This outage would include Internet access at the main campus.

Service Descriptions

Fiscal Year Information Technology Request

Welcome to Texas A&M University

Enterprise VoIP and Lessons Learned. A Case Study and Impact on Curriculum

Request for Proposals Voice over Internet Protocol Unified Communications System /2016

CHICAGO S PREMIERE DATA CENTER

This chapter covers four comprehensive scenarios that draw on several design topics covered in this book:

STATE BOARD FOR COMMUNITY COLLEGES AND OCCUPATIONAL EDUCATION. December 9, 2015

SolveXia Business Continuity Planning

INFORMATION TECHNOLOGY ENGINEER V

North Street Global, LLC. Business Continuity Plan

How To Back Up A Virtual Machine

How To Connect To Telx Dia (Dia) For Free

Module 7: System Component Failure Contingencies

Local Area Networks (LANs) Blueprint (May 2012 Release)

The primary goals of the technology plan are to support the goals of the district strategic plan:

NETWORK ADMINISTRATOR

Disaster Recovery & Business Continuity. James Adamson Library Systems Office

Ensuring your DR plan does not Lead to a Disaster

North Florida Community College

Campus Network Best Practices: Core and Edge Networks

The Economic Benefit of International Students $26.8 billion Contributed; 340,000 U.S. Jobs Supported

White paper. SAS Solutions OnDemand Hosting Overview

IT Contingency Planning: IT Disaster Recovery Planning

Lumos Parallel Network Operations Centers: Protected Network Monitoring

Data Center Infrastructure & Managed Services Outline

McAfee Endpoint Encryption Hot Backup Implementation

SPECIALIST IN DATA CENTRE & BUSINESS CONTINUITY SOLUTIONS

SENIOR SYSTEMS ANALYST

Table of Contents...2 Introduction...3 Mission of IT...3 Primary Service Delivery Objectives...3 Availability of Systems Improve Processes...

Transformation of the Enterprise Network Using Passive Optical LAN

Questions and Answers

CCHC Emergency Preparedness Gap Analysis

Disaster Recovery Hosting Provider Selection Criteria

CASE STUDY. Nirvanix, Inc. Finds a Secure and Reliable Data Center and a Strategic Partner for Cloud Services at CyrusOne

Web Drive Limited TERMS AND CONDITIONS FOR THE SUPPLY OF SERVER HOSTING

Disaster Recovery and Business Continuity What Every Executive Needs to Know

Perry T. Eidson Technical Operations Communications Engineer Emory University Technical Services

Disaster Recovery Planning. By Janet Coggins

NET ACCESS VOICE PRIVATE CLOUD

DISASTER RECOVERY. Omniture Disaster Plan. June 2, 2008 Version 2.0

OCIO Network Systems Team Responsibilities, Services, Metrics and Statistics

The Impact Of The WAN On Disaster Recovery Capabilities A commissioned study conducted by Forrester Consulting on behalf of F5 Networks

IP Telephony Management

Appendix 3. Specifications for e-portal

KEEN - Reliable Infrastructure, Built to Last

Town of Wilton Telephone System Addendum 1 to RFP. NOTE: Deadline for submitting proposals has been postponed to March 13, 2015

Central Server Hosting Service Detail

Business Continuity protection for SIP trunking service

AT&T Switched Ethernet Service SM

ATMAN Telecommunication Services in Poland. Dariusz Wichniewicz, Director of Telecommunications Services Development Department

Transcription:

Alabama Supercomputer Center Alabama Research and Education Network Tornadoes Disaster Continuity of Operations Experience 1

Success Story Alabama Supercomputer Center operated continuously through the April 27 th tornadoes disaster and week following Maintained full generator power for seven days except for 40 minutes on UPS All data center services were maintained during this time Critical network backbone and Internet services were available continuously Additional IT services for recovery support were supplied to State agencies (e.g. AEMA), City of Huntsville, school systems, universities, community colleges, and FEMA Lessons learned are being incorporated into operations procedures 2

Statewide Education Network NLR Internet2 Internet Universities K-12 Alabama Supercomputer Center State Agencies Support Community Colleges Public Libraries ASA s high speed network is connected to the Supercomputer Center, Internet, Internet2, & NLR 3

Alabama Supercomputer Center Dual, redundant UPS systems Dual fuel, parallel, redundant generators Loop 12,400 V utility feeds Full building electrical plant is handled by generator systems Three fiber entry paths Facility Improvements and Modernization of Previous Five Years Enabled Continuous Operation During Week Long Power Outage 4

Network Backbone Infrastructure ASA s network backbone design provides redundant 10G transport among all major network hubs Internet gateway access in Atlanta and Dallas with failover capacity 5

Network Backbone Has Vendor and Path Diversity ASA s new network backbone design mitigates the risk of having all Internet and Internet2 connections through one point (previously Atlanta) 6

Client Endpoints Service More than 100 client locations lost network service Some telecommunications outages (bulk in Marshall and Dekalb counties) Most due to loss of power Six schools destroyed More risk because redundant connections to endpoints are expensive even if a diverse option is available 7

Staff Was Key to Maintaining Operations Exemplary staff performance was central to maintaining 24x7 data center operations Some employees worked around the clock in first 48 hours Special plan for 12 hour shifts for curfew compliance Staff conducted some Tier 2 and Tier 3 support off-site Disaster planning was modified and implemented during the disaster recovery Key servers were moved from Huntsville to Montgomery on Sunday, May 1 st to ensure full Internet and client email/web services would be maintained in case of second generator failure Strong working relationship and excellent communications between ASA and professional services contractor, CSC Enabled quick responses to clients immediate needs without formal approvals Worked jointly to keep facility, data center, statewide network, applications operational 8

Special Support to Clients City of Huntsville Data Center Worked with API Digital to provide emergency Internet access to the City of Huntsville Provisioned a 1Gbps Ethernet connection with a block of usable address space and API used this to configure the City's Internet access during the recovery Federal Emergency Management Agency (FEMA) Established 50 Mbps VLAN and provided Singlemode Fiber SFP optics to the Fire College to support EMA for FEMA to setup a command center in Tuscaloosa at the Alabama Fire College node on AREN. New muliple T1s connectivity expedited for Joint Field Office (AEMA/FEMA) in Birmingham (upgrade to Metro Ethernet service ordered) Alabama Emergency Management Agency (AEMA) Increased EMA HQ in Clanton to 50 Mbps from 10 Mbps normal service Established service to new Birmingham field site for AEMA/FEMA Auburn University Backup Internet 9

Special Support to Clients George C. Wallace Hanceville Community College Hosted the GCW Hanceville website, Student Blackboard server, and online registration servers due to storm damage at campus Their planned DR site, Calhoun Community College, was also down due to the storm and unavailable Tuscaloosa City Schools Assisted The University of Alabama in setting up a wireless link to Tuscaloosa City elementary school that used online textbooks exclusively Madison County Schools Hosting their website because their server had no power or Internet Alabama A&M University Provided loaner firewall equipment to AAMU Alabama Department of Education Provided daily updates on status of school systems Internet connectivity 10

Lessons Learned Summary AREN network design performed flawlessly 10 Gbps backbone (installation completed in late 2010) kept all network functions intact However, loss of local carriers needs to be a contingency in DR plan ASA provided fuel source to API to keep running local network hub operational API Digital is a major offsite location that connects AREN WAN connections Loss of API site would have removed redundancy for the Huntsville ASC data center New sources/options established for fuel for generator More attention to complete loss of facility needed Expect that full preparations for disasters will never be enough All disaster scenarios will not be forecast in a plan No two disasters are alike Disaster planning generally did not account for a scenario for losing all power in North Alabama Lessons learned are being incorporated into operations procedures 11

Alabama Supercomputer Authority State of Alabama Leader and Trusted Partner for Technology 12