Root Cause Analysis Report

Similar documents
Technical Support. Technical Support. Customer Manual v1.1

Informix Performance Tuning using: SQLTrace, Remote DBA Monitoring and Yellowfin BI by Lester Knutsen and Mike Walker! Webcast on July 2, 2013!

STIDistrict SQL 2000 Database Management Plans

Use of Commercial Backup Software with Juris (Juris 2.x w/msde)

Table of Contents INTRODUCTION Prerequisites... 3 Audience... 3 Report Metrics... 3

Providing Self-Service, Life-cycle Management for Databases with VMware vfabric Data Director

SQL SERVER Anti-Forensics. Cesar Cerrudo

VeriCentre 3.0 Upgrade Pre-Installation and Post Installation Guidelines

ITPS AG. Aplication overview. DIGITAL RESEARCH & DEVELOPMENT SQL Informational Management System. SQL Informational Management System 1

Protecting SQL Server Databases Software Pursuits, Inc.

2.1 To define the backup strategy for systems and data within the Cape Winelands District Municipality (CWDM).

Passing PCI Compliance How to Address the Application Security Mandates

How to effectively respond to an information security incident

General DBA Best Practices

Defining, building, and making use cases work

CA Workload Automation Agent for Databases

MAILGUARD, WEBGUARD AND ARCHIVING SERVICE SCHEDULE

Database Backup and Restore Mechanism. Presented by : Mary Meladath

Microsoft SQL Server Scheduling

Power Update - Documentation Power Update Manager

Teldat Router. DNS Client

Dell AppAssure Recovery Assure

State of Michigan Data Exchange Gateway. Web-Interface Users Guide

Version 6.5 Users Guide

AuditWizard v8. SQL Maintenance

Cisco AnyConnect Secure Mobility Client integration with ISE & SCCM client for patch remediation on windows

Service Level Agreement for Database Hosting Services

Windows Server 2012 Server Manager

The Perks of Having a Dedicated Database.

You need to recommend a monitoring solution to ensure that an administrator can review the availability information of Service1. What should you do?

30 Steps to Successfully Installing DotNetNuke on a Network Solutions Shared Hosting Package

Missed Recovery Techniques for SQL Server By Rudy Panigas

Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software

White Paper April 2006

Ultimate Guide to Oracle Storage

Real World Enterprise SQL Server Replication Implementations. Presented by Kun Lee

How to Back Up and Restore an ACT! Database Answer ID 19211

OpenAdmin Tool for Informix (OAT) October 2012

Product Name: Size Chart Popup Version: Document Type: Help doc Author: Milople Inc.

Securing ArcGIS Server Services: Advanced Options

One of the database administrators

Workflow Templates Library

ESL Business Internet Banking ONLINE GUIDE. Revised 05/15

Database Management. Technology Briefing. Modern organizations are said to be drowning in data but starving for information p.

You may access your Patron Edge support resources in any of the following ways:

Training Events Database (TED) Setup Guide

OnDemand Version 1.7. Installation Manual and User s Guide. AST Technology. Sophienstrasse Herford Germany

Accelerating DevOps With Copy Data Virtualization. June, 2015

Job Description. Direct Reports

Monitoring Microsoft Exchange to Improve Performance and Availability

SAP HANA Backup and Recovery (Overview, SPS08)

McAfee Content Security Reporter 2.0.0

Symantec Endpoint Protection 11.0 Architecture, Sizing, and Performance Recommendations

#9011 GeoMedia WebMap Performance Analysis and Tuning (a quick guide to improving system performance)

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

Meeting Management Solution. Technology and Security Overview N. Dale Mabry Hwy Suite 115 Tampa, FL Ext 702

Product Review: James F. Koopmann Pine Horse, Inc. Quest Software s Foglight Performance Analysis for Oracle

Backup and Recovery. What Backup, Recovery, and Disaster Recovery Mean to Your SQL Anywhere Databases

Oracle USF

Web Content Management System

Monitoring Best Practices for COMMERCE

Table of Contents. CHAPTER 1 About This Guide CHAPTER 2 Introduction CHAPTER 3 Database Backup and Restoration... 15

Course 20462C: Administering Microsoft SQL Server Databases

Administering Microsoft SQL Server 2014 Databases

DATABASE ADMINISTRATION (DBA) SERVICES

Problem Statement.

HTSQL is a comprehensive navigational query language for relational databases.

How To Understand The Error Codes On A Crystal Reports Print Engine

SSMS Built in Reports for Server and Database Monitoring

PCSchool SQL Backup Tech Tip. SQL Backup Tech Tip. Created in version /9

How to protect, restore and recover SQL 2005 and SQL 2008 Databases

Norman Secure Backup. SQL Backup Training Guide. Author: Arne Stieghorst Layout: Eva Langballe

MARKETING COMMUNICATIONS TEAM DECISION MAKING EVENT PARTICIPANT INSTRUCTIONS

Administering Microsoft SQL Server Databases

MySQL Enterprise Monitor

EMC ApplicationXtender Server

EMC ApplicationXtender Server

TIGERPAW EXCHANGE INTEGRATOR SETUP GUIDE V3.6.0 August 26, 2015

Informatics for Integrating Biology and the Bedside. User Guide. Project Request. Document Version: I2b2 Software Release: 1.

Data Replication in Privileged Credential Vaults

Transferring Your Internet Services

The Redwood City Public Works Services (PWS) Department is comprised of four operating divisions:

ManageEngine EventLog Analyzer. Best Practices Document

StruxureWare Power Monitoring In-Place Upgrade Guide SQL Server Standard Edition Only

HP LeftHand SAN Solutions

Appendix A. Customer Support Service Level Agreement

Monitoring Agent for Microsoft Exchange Server Fix Pack 9. Reference IBM

Portions of this product were created using LEADTOOLS LEAD Technologies, Inc. ALL RIGHTS RESERVED.

WUG How Target Supports a Large Winshuttle Community. Shon Kane Target Corporation

PROJECTIONS SUITE. Database Setup Utility (and Prerequisites) Installation and General Instructions. v0.9 draft prepared by David Weinstein

Oracle Database Security

Team Foundation Server 2012 Installation Guide

ORACLE DATABASE: ADMINISTRATION WORKSHOP I

Transcription:

Page 1 \ 8 RCA Name IT Customer Complaints RCA Report Number 2012.67 Report Date 4/2/2012 RCA Owner Problem Manager Root Analysis Report Problem Statement Focal Point Customer Complaints When Start Date 3/5/2012 End Date 3/5/2012 Start Time 8:44am End Time 2:31pm Unique Timing While database admin was on vacation Where System Company website Company IT infrastructure Actual I mpact Revenue Customer Service Other... Cost Estimated 1,500,000.00 Negative impact Negative publicity Actual I mpact Total: $1,500,000.00 Frequency 2 times overall Potential I mpact Revenue Customer Service Much more Higher negative impact Report and chart generated by Sologic s link software. www.sologic.com

Report Summaries Page 2 \ 8 and Effect Summary On March 5, 2012 we received numerous complaints from customers about our website being down while they were attempting to use it. The website was down from approximately 8:44am to 2:31pm EST. The customers were unable to access the website because they were receiving "500" errors from our web server. "500" errors prevent users from accessing the website. The server was returning "500" errors because the application server which processes requests was timing out, and we had an unusually high amount of web traffic. The application server was timing out it was receiving requests, and the associated database was not working. The database was not working because the SQL database cluster was not processing queries. The SQL cluster could not process new queries due to the fact that the transaction log stopped growing. The log couldn't grow because the T:Drive was full and we were using only one database cluster. There was only one database cluster in use because the other cluster was being used for UAT testing. The drive was full because there is fixed capacity, the log file storage grew, the logs were not truncated, and truncating the logs reduces memory needs. The logs weren't truncated because the database administrator (DBA) manually truncates them, and was on vacation. The backup DBA was not aware the logs needed truncating because there was no process in place to inform the backup DBA of critical tasks.

Page 3 \ 8 ID 1 Label Solution Description Implement process to notify backup DBA of critical tasks when taking over duties. No process in place to inform backup DBA Barrier = 3 months until Assigned Criteria Fail 2 Solution Create document highlighting DBA duties in case of turnover or emergency backup DBA appointed Assigned Criteria Pass 3 Solution Explore automating log truncation Logs are manually truncated by DBA Assigned Dave Flynn Criteria Fail 5 Solution Use seperate RD SQL clusters for UAT testing Other SQL cluster being used for UAT testing Assigned Ted Dezember Criteria Pass Status Identified 6 Solution Use multiple databases for application servers Only one database cluster in use

Assigned Term Criteria Pass Choose Cost $0.00 Page 4 \ 8 7 Solution Increase space on T:Drives T:Drive at zero bytes free Assigned Ted Dezember Criteria Pass

Team Page 5 \ 8 ID Label Description Label Description 1 First Name Cory Last Name Boisoneau Phone (1) 425-225-5885 Phone (2) Role Facilitator Group Email cory.boisoneau@sologic.com 2 First Name Jennifer Last Name Elderberry Phone (1) 206-985-4845 Phone (2) Role Database Admin Group Email jelder@xyz.com 3 First Name Dave Last Name Flynn Phone (1) 206-254-8890 Phone (2) Role Backup DBA Group Email dflynn@xyz.com 4 First Name Ted Last Name Dezember Phone (1) 206-795-4353 Phone (2) Role IT Infrastructure Manager Group Email tdezem@xyz.com 5 First Name Hannah Last Name Zweikwinden Phone (1) 206-658-0098 Phone (2) Role IT Analyst Group Email hzweik@xyz.com

Page 6 \ 8 ID Label Description 1 Requests made of application server The application server was timing out Transaction log was unable to grow T:Drive at zero bytes free Site was live People visiting website Logs were not truncated Customers attempted to access site Database not working Document 2 Application server processes requests Only one database cluster in use Only one application server exists We only have two SQL clusters SQL trans. log needs to grow to process queries Storage required for log to grow Storage file size fixed Database Admin (DBA) was on vacation Logs are manually truncated by DBA No process in place to inform backup DBA Transaction log located on T:Drive Company only has one DBA

3 Statement from backup DBA Page 7 \ 8 SQL server was not processing queries Backup DBA not aware logs needed truncating Application server relies on working database Dave Flynn 4 Statement from IT Infrastructure Manager Other SQL cluster being used for UAT testing Ted Dezember 5 Statement from IT Analyst "500" errors prevent access to website Time outs result in "500" errors Functional database relies on working SQL server Hannah Zweikwinden 6 Client compaint log Customers not able to access our web site Web server returned error ("500"-type) Chose to contact web support with complaint

Document Page 8 \ 8 7 Customer statement Customers need/want to access site

Chart Legend Transitory People visiting website Site was live Non-transitory Omission - Transitory Omission - Non-transitory Requests made of application server Focal Point Solution Implemented Application server processes requests SQL trans. log needs to grow to process queries The application server was timing out Application server relies on working database Statement from backup DBA Transaction log located on T:Drive We only have two SQL clusters SQL server was not processing queries Only one database cluster in use Statement from backup DBA Other SQL cluster being used for UAT testing Use multiple databases for application servers Criteria Pass Web server returned error ("500"-type) Database not working Statement from IT Infrastructure Manager Transaction log was unable to grow Use seperate RD SQL clusters for UAT testing Criteria Pass Status Identified Client compaint log Time outs result in "500" errors Storage required for log to grow Functional database relies on working SQL server Statement from IT Analyst Storage file size fixed Statement from IT Analyst Customers not able to access our web site Only one application server exists? T:Drive damaged Database Admin (DBA) was on vacation Client compaint log OR T:Drive at zero bytes free "500" errors prevent access to website Increase space on T:Drives Logs are manually truncated by DBA? Statement from IT Analyst Criteria Pass Customer Complaints Customers attempted to access site Logs were not truncated Explore automating log truncation Criteria Fail Company only has one DBA Customers need/want to access site Customer statement Backup DBA not aware logs needed truncating No process in place to inform backup DBA Chose to contact web support with complaint Statement from backup DBA Implement process to notify backup DBA of critical tasks when taking over duties. Criteria Fail Client compaint log Barrier = 3 months until