Yale University Incident Management Process Guide



Similar documents
Yale University Request Management Process Guide

ITSM Process Description

Yale University Knowledge Management Process Guide

Process Description Incident/Request. HUIT Process Description v6.docx February 12, 2013 Version 6

INCIDENT MANAGEMENT & REQUEST FULFILLMENT PROCESSES. Process Owner: Service Desk Manager. Version: v2.0. November 2014 Page 0

Infasme Support. Incident Management Process. [Version 1.0]

ITIL by Test-king. Exam code: ITIL-F. Exam name: ITIL Foundation. Version 15.0

Problem Management Fermilab Process and Procedure

1. INCIDENT MANAGEMENT

HP Service Manager. Software Version: 9.40 For the supported Windows and Linux operating systems. Processes and Best Practices Guide (Codeless Mode)

ITIL v3 Incident Management Process

The ITIL Foundation Examination

The ITIL Foundation Examination

Central Agency for Information Technology

Problem Management Overview HDI Capital Area Chapter September 16, 2009 Hugo Mendoza, Column Technologies

The ITIL Foundation Examination Sample Paper A, version 5.1

Introduction to ITIL: A Framework for IT Service Management

The ITIL Foundation Examination

The ITIL Foundation Examination

The ITIL Foundation Examination

EXIN.Passguide.EX0-001.v by.SAM.424q. Exam Code: EX Exam Name: ITIL Foundation (syllabus 2011) Exam

The ITIL Foundation Examination

HP Service Manager. Software Version: 9.34 For the supported Windows and UNIX operating systems. Processes and Best Practices Guide

The ITIL Foundation Examination

Which statement about Emergency Change Advisory Board (ECAB) is CORRECT?

Incident Management Get Your Basics Right

Creating Service Desk Metrics

HP Service Manager. Software Version: 9.34 For the supported Windows and UNIX operating systems. Incident Management help topics for printing

SERV SER ICE OPERA OPERA ION

The ITIL v.3 Foundation Examination

APPENDIX 4 TO SCHEDULE 3.3

Title: DESKTOP TICKET MANAGEMENT PROCEDURE

The ITIL v.3 Foundation Examination

Yale University Change Management Process Guide

Problem Management: A CA Service Management Process Map

TechExcel. ITIL Process Guide. Sample Project for Incident Management, Change Management, and Problem Management. Certified

ENTERPRISE SERVICE DESK (ESD) SERVICE DELIVERY GUIDE

Commonwealth of Massachusetts IT Consolidation Phase 2. ITIL Process Flows

HP Service Manager. Software Version: 9.34 For the supported Windows and UNIX operating systems. Service Desk help topics for printing

ITSM Maturity Model. 1- Ad Hoc 2 - Repeatable 3 - Defined 4 - Managed 5 - Optimizing No standardized incident management process exists

ITIL Roles Descriptions

HP Service Manager. Service Desk help topics for printing. For the supported Windows and UNIX operating systems. Software Version: 9.

Contents. Service Request Departmental Procedures

Introduction Purpose... 4 Scope... 4 Manitoba ehealth Change Management... 4 Icons RFC Procedures... 5

Free ITIL v.3. Foundation. Exam Sample Paper 1. You have 1 hour to complete all 40 Questions. You must get 26 or more correct to pass

Free ITIL v.3. Foundation. Exam Sample Paper 4. You have 1 hour to complete all 40 Questions. You must get 26 or more correct to pass

Software Quality Assurance (SQA) Testing

HP Service Manager. Software Version: 9.40 For the supported Windows and Linux operating systems. Incident Management help topics for printing

Terms of Use - The Official ITIL Accreditor Sample Examination Papers

Incident Management: A CA IT Service Management Process Map

IT Service Management

An ITIL Perspective for Storage Resource Management

Service Desk Management Process

Incident Management Best Practices Chris Pope. Global Service Delivery Manager Global Managed Services Column Technologies.

Service Request Logging Procedure

Free ITIL v.3. Foundation. Exam Sample Paper 3. You have 1 hour to complete all 40 Questions. You must get 26 or more correct to pass

sample exam IT Service Management Practitioner Support & Restore (based on ITIL ) edition July 2009

ITIL v3. Service Management

Implementation Date: November 9, Table of Contents. Section Description Page

HP Service Manager. Software Version: 9.40 For the supported Windows and Linux operating systems. Problem Management help topics for printing

COUNTY OF ORANGE, CA Schedule 2D Service Desk Services SOW SCHEDULE 2D SERVICE DESK SERVICES SOW. for. Date TBD

MOF Service Management Function Incident Management

What is the Purpose of OA s Enterprise IT Helpdesk Procedure?... 2 How will Problems, Questions or Changes be entered?... 2

Information Technology Engineers Examination. Information Technology Service Manager Examination. (Level 4) Syllabus

Guide. BMC Analytics for Business Service Management User. Supporting. BMC Analytics version for Business Service Management.

Foundation. Summary. ITIL and Services. Services - Delivering value to customers in the form of goods and services - End-to-end Service

Functional/Technical Specification

Closed Loop Incident Process

HP Service Manager. Software Version: 9.40 For the supported Windows and Linux operating systems. Service Desk help topics for printing

CA Nimsoft Service Desk

Problem Management. Process Guide. Document Code: Version 2.6. January 7, Robert Jackson (updates) Ashish Naphray (updates)

Incident Management Process Guide Example

Analyst Guide for Request Support -- Incident/Service Request

ITIL 2011 Lifecycle Roles and Responsibilities UXC Consulting

Auxilion Service Desk as a Service. Service Desk as a Service. Date January Commercial in Confidence Auxilion 2015 Page 1

Supporting GIS Best practices for Incident Management and Daily Operations

1 What does the 'Service V model' represent? a) A strategy for the successful completion of all service management projects

Service Level Agreement for Database Hosting Services

IM Misrouted Ticket Process

ITIL V3 Foundation Certification - Sample Exam 1

Communicate: Data Service Level Agreement. Author: Service Date: October 13. Communicate: Data Service Level Agreementv1.

ICS Operations & Policies Manual. For State Agencies Providing and Using Consolidated Services

Helpdesk Incident & Request Management Procedure For

Incident Manager. Notified. Major Incident? YES. Major Incident Declared. Initial Communication Drafted. MIH At A Glance. Major Incident Ended

TECHNICAL SUPPORT GUIDE

ITIL A guide to problem management

ITSM Roles. 1.0 Overview

UW Connect Update & Incident Management Overview

Service Desk Level 1 Service Description

IBM Software as a Service (SaaS) Support Handbook for IBM Cognos Sales Performance Management

Assigning Severity Codes

myit Tool Incident & Request Management

TM Forum Frameworx 13.5 Implementation Conformance Certification Report

Exhibit E - Support & Service Definitions. v1.11 /

Customer Service Charter TEMPLATE. Customer Service Charter Version: 0.1 Issue date :

OPERATIONAL SERVICE LEVEL AGREEMENT BETWEEN THE CLIENT AND FOR THE PROVISION OF PRO-ACTIVE MONITORING & SUPPORT SERVICES

INTERVIEW QUESTIONS. Que: Which process is responsible for ensuring that the CMDB has been updated correctly?

Transcription:

Yale University Management Process Guide Yale University Management Process 1 of 17

Introduction Purpose This document will serve as the official process of Management for Yale University. This document will introduce a Process Framework and will document the workflow, roles, procedures, and policies needed to implement a high quality process and ensure that the processes are effective in supporting the business. This document is a living document and should be analyzed and assessed on a regular basis. Scope The scope of this document is to define the Management Process, and process inputs from, and outputs to, other process areas. Other service management areas are detailed in separate documentation. The following is a specific list of items that are in scope for this document. Other items not listed here are considered out of scope for this document. In scope: Management Overview o Definition o Management Objectives o Management Policies Management Process Flow Management Roles Management RACI Management Procedure Flows and Descriptions Management Prioritization scheme Management Service Categorization Model Management Process Metrics Yale University Management Process 2 of 17

Management Overview Definition An is an unplanned interruption to a technology service or reduction in quality of a technology service. Failure of a Configuration Item or product that has not yet impacted service is also an incident (i.e. failure of one disk from a mirror set). Management Objectives The goal of Management is to restore service operations as quickly as possible. Timely and efficient resolution will minimize business impact and increase productivity. Management Policies reporting must go through the Service Desk, providing Users with a single point of contact All incidents must be logged, prioritized and solutions recorded in the Management System One standard Management Process is defined and used to support all IT Service users The Service Desk manages, tracks, escalates, closes and communicates status of all incident records and is responsible for all incident assignments The Management Process is the conduit of communication of any degradation of service, to the affected users and IT personnel Closure of incidents is dependent on validating with the user that the incident has been resolved and service is restored The Service Desk will own all incidents that they themselves log or that are assigned to them from a Tier 2 provider. Ownership will transfer to the / Situation Manager for major incidents Once a major incident has been validated by the Service Desk, escalation and communication protocols for high-priority incidents are initiated and managed by the Service Desk Yale University Management Process 3 of 17

Management Process Flow V3 Management Process Caller / Customer / Situation Manager Identification Sources: Event Mgt, Email, Self-Service, Phone Call, Tech Staff (Tier 2 / Walkup) Hierarchic Escalation Needed? Management Escalation Process Functional Group - Queue Manager Functional Group Tier 2+ Analyst Service Desk Analyst 1.0 Logging Service Request? To Request Fulfillment 2.0 Categorization 3.0 Prioritization Major? 4.0 Initial Diagnosis 9.0 Major Process Functional Escalation Required? 5.0 Functional Escalation 6.0 Investigation & Diagnosis 7.0 Resolution and Recovery 8.0 Closure Process Owner Accountable for the Process and its Evolution/Maturation Roles The following roles have been identified within the Management Process. Role Description Manager Oversee day to day process execution Often the Service Desk Manager Manages major incidents until the appropriate situation manager is identified Situation Manager Manages and owns major incidents Service Desk Manager Manages the service desk function, including staffing management activities Provides guidance to Service Desk Analysts Process Owner Owns the process end-to-end, including the RACI, process & procedural steps, role & definitions Accountable for maturing and evolving the process, based on monthly/quarterly/yearly review of process KPIs Adjusts the process to address performance or changing business needs Service Desk Site Lead Responsible for the operations of Service Desk Analysts that are geographically disperse, reporting to the Service Desk Manager Service Desk Analyst Logs incidents Provides initial diagnosis Resolve incidents at first point of contact if possible Escalates incidents Yale University Management Process 4 of 17

Role Description Owns non-major incidents Caller / Customer The end user having or reporting the service interruption Functional Group Queue Manager Functional Group Tier 2+ Analyst Assigns incidents to individual Tier 2+ Analysts in the functional group Monitors and manages support resolution performance May directly manage (reporting manager) the day to day activities of Tier 2+ analysts outside of process activities Group of technical support experts that will handle issues escalated by the Service Desk For example, a Network Engineer Receive process direction for a functional group queue manager, staff management from a reporting manager RACI 1.0 Logging Caller / Customer Service Desk Analyst Service Desk Site Lead Manager Situation Manager Functional Group Queue Manager Functional Group Tier 2+ Analyst C R A R 2.0 Categorization C R A R Process Owner 3.0 Prioritization C R A, C, I 4.0 Initial Diagnosis R A, C, I R 5.0 Functional Escalation R A, C 6.0 Investigation & A R Diagnosis 7.0 Resolution & Recovery I R A R 8.0 Closure C R R A R 9.0 Major Process Process Maturity and Evolution A R R C, I C R R C R C A Yale University Management Process 5 of 17

Service Desk Analyst / Service Desk Analyst / Functional Group Functional Group Tier 2+ Analyst Tier 2+ Analyst Process Procedures 1.0 Logging Service Service Desk Desk Analyst Analyst / / Functional Functional Group Group Tier Tier 2+ 2+ Analyst Analyst 4.0 1.1 Verify Issue Exists Existing? 1.3 Validate Caller / Customer Contact details and update if required 1.4 Capture and Document Details 1.2 Update Activity Log & Communicate Status 2.0 Status Provided Step 1.1 Verify Issue Exists Take steps to validate or replicate the interruption. Gather any data about the issue (screenshots, descriptions). Associate to any concurrent incident (e.g. major outage). 1.2 Update Activity Log & If the Caller is inquiring about status of an existing incident, provide the caller with Communicate Status status as available in the incident record and update the record indicating that the caller was inquiring and update with additional details if available. 1.3 Validate Caller / Customer Complete caller data and ensure contact details are accurate and update if necessary. Contact details and Update if Required 1.4 Capture and Document Details Complete the short and long description, ensuring they are clear and can be understood by others. Collect incident symptoms. 2.0 Categorization 2.0 2.1 Identify Type 2.2 Associate Configuration Item(s) 2.3 Complete Categorization 3.0 Yale University Management Process 6 of 17

Step 2.1 Identify Type Capture the incident type based on the customer-reported symptoms. 2.2 Associate Configuration Items(s) If a Configuration Management System (CMS) is present, associate the incident to the Configuration Item(s) (CI) diagnosed to have failed and are causing the incident. te, IT Business and Provider Services may be captured as CI s, if implemented. 2.3 Complete Categorization If there is no CMS present, capture the device name or ID, and based on the primary failed device, capture the component categorization. Capture IT Business Service categorization, as defined by the customer. Based on the symptoms and incident diagnosis, capture the IT Provider Service categorization. 3.0 Prioritization Manager Manager Service Service Desk Desk Analyst Analyst 2.0 3.1 Prioritize Major? 3.2 Escalate to Manager 9.0 4.0 Step 3.1 Prioritize Select the impact and urgency of the according to guidelines if it is not present. This will determine the priority. If priority-based service level monitoring is enabled, the selected priority to define the response and resolution time service level targets for the incident. If service-based monitoring is enabled, the selected priority will only define the response time service level targets for the incident. If the reported service does not have any restoration service level targets defined, a generic priority-based restoration service level target may be used. 3.2 Escalate to Manager / Situation Manager Determine if this is a major incident. If so, the service desk agent will escalate to the incident manager accordingly. Yale University Management Process 7 of 17

4.0 Initial Diagnosis Caller / Customer 4.3 Acquire Additional Information Service Service Desk Desk Analyst Analyst / / Functional Functional Group Group Tier Tier 2+ 2+ Analyst Analyst 3.0 4.1 Perform Initial Diagnosis 4.2 Search Knowledge Base, Known Error Database and Change Schedule More Information Required? 1.0 4.4 Resolution Possible? 7.0 4.5 Recurring? Problem Management Process 4.6 Update Details linking to Known Error, Knowledge Article, Change as required 5.0 Step 4.1 Perform Initial Diagnosis Document all trouble-shooting steps within the incident record. 4.2 Search Knowledge Base, Known Error, Database and Change Schedule Use initial diagnosis details to search the knowledge base for relevant knowledge. Also check the known error database to see if a workaround exists and the change schedule to see if this is issue could be related to a recently implemented change. Ensure the incident record is coded appropriately. 4.3 Acquire Additional Information If additional information is required, contact the customer. If the customer cannot be reached, place the incident on hold. 4.4 Resolution Possible? If a resolution is possible, proceed to step 7.0 Resolution and Recovery. If resolution is not possible, the incident may need to be assigned to a functional group for resolution. 4.5 Recurring? Determine if other incidents of the same nature have been experienced. If others exist and no root cause has been determined, this may be a good candidate for problem management. 4.6 Update Details linking to Known Error, Knowledge Article, Change as required Confirm that the incident record is updated and coded according to the diagnosis steps. Selection of a knowledge record may update (e.g. provider service, component category, urgency) incident categorization and details. Yale University Management Process 8 of 17

Vendor / 3 rd Party Provider 5.0 Functional Escalation Vendor / 3 rd Party Provider Service Desk Analyst Service Desk Analyst 4.0 6.0 5.1 Internal Provider? 5.3 Assign to Functional Group Monitor ( Ownership Activity) 8.0 5.2 Dispatch to Vendor and Monitor 6.0 9.0 Vendor Diagnosis and Resolution If Possible 7.0 Step 5.1 Internal Provider? If assignment is necessary, determine if the functional group that is equipped to resolve the incident is an internal support group or an established external partner that has a support agreement and process established for incident resolution. 5.2 Dispatch to Vendor and Monitor If the functional group is an external group, ensure that the established incident process is followed. 5.3 Assign to Functional Group If the functional group is an internal group, determine the proper group for assignment and assign it to the Group. 5.4 Monitor Optimally, the Service Desk owns the monitoring of incident to resolution and closure. Guidelines for ownership/monitoring include: - Providing customers with desk contact info for updates - Progress notifications originate from a desk monitored email account - s that have not been accepted within response time targets should be initially escalated to the assignment group manager, and ultimately to the Manager if required - Ownership of major incidents should be transferred to the incident manager/ situation manager Yale University Management Process 9 of 17

6.0 Investigation and Diagnosis Functional Functional Group Group Tier Tier 2+ 2+ Analyst Analyst Caller Caller / / Customer Customer 5.0 6.1 Perform Initial Review and Diagnosis 5.0 6.2 Update and Assign to Service Desk for Re-Diagnosis and Re-assignment Accept? 6.3 Contact Requestor for Additional Information if required and update Activity Log Functional Escalation? 6.4 Functional Escalation to Tier 3 Tier 3 Diagnosis and Resolution 7.0 Step 6.1 Perform Initial Review & Diagnosis 6.2 Update and Assign to Service Desk for Re-diagnosis and Re-assignment 6.3 Contact Requestor for Additional Information if Required 6.4 Functional Escalation to Tier 3 if required Perform initial review to determine if the incident has been properly assigned. If the incident was improperly assigned, the Functional Group assigns it back to the Service Desk for further diagnosis and assignment. If assignment is proper, accept the incident (work in progress) and determine if further information is required contact the customer to obtain then proceed to step 7.0 Resolution and Recovery. If the incident requires further assignment to Tier 3 or an external Vendor, the Functional Group is responsible to work with the external partners and maintain oversight of the incident record. Yale University Management Process 10 of 17

7.0 Resolution and Recovery Service Service Desk Desk Analyst Analyst 4.0 5.0 7.1 Communicate Workaround if Appropriate 7.2 Work to Resolve Updating with Necessary Details Problem Management Process - Document Known Error or Workaround (if applicable) 7.4 Service Desk Analyst Contact Functional Group or Vendor for Additional Resolution Details if Required 8.0 Functional Functional Group Group Tier Tier 2+ 2+ Analyst Analyst 6.0 Resolution requires Change Request? Change Management Process Resolved? 7.3 Update Resolution Details and Assign to Service Desk Step 7.1 Communicate Workaround if Appropriate 7.2 Work to Resolve Updating with Necessary Details 7.3 Update Resolution Details and Assign to Service Desk 7.4 Service Desk Analyst Contact Functional Group or Vendor for Additional Resolution Details if Required Investigate sources of information to see if a workaround exists. Check relevant knowledge, known error database, problem records, etc. and provide the work around to the customer. If no workaround exists, begin resolution activities making sure to update the incident record with all details related to resolution activities. If resolution requires that a change be introduced, a Request for Change must be submitted and flow through the Change Management Process. Once the incident has been resolved it is good practice to review the solution and determine if knowledge could be authored for future occurrences, or if there is a systemic issue that needs to be addressed through the Problem Management Process. Upon resolution, the incident is updated with the proper resolution information and coding, and is assigned to the Service Desk for final closure activities. In preparation for closure activities, review the incident details to ensure it is completed properly and has the appropriate resolution details. Yale University Management Process 11 of 17

8.0 Closure Service Service Desk Desk Analyst Analyst Caller Caller / / Customer Customer 9.0 Resolved? 8.1 Validate Resolution with Caller / Customer 7.0 8.2 Update for Re- Diagnosis 5.0 8.3 Trigger Customer Satisfaction Survey Closed Step 8.1 Validate Resolution with Caller / Customer Follow proper procedures to validate with the Customer that the incident has in fact been resolved. If it has been resolved, the incident will be closed according to procedures. 8.2 Update for Re-Diagnosis If the Customer indicates that the incident has not yet been resolved, it must be sent back for further diagnosis before the incident is closed. NOTE: If the incident is in a closed state when the customer indicates it was not resolved, a new incident should be opened and associated to the original incident. 8.3 Trigger Customer Satisfaction Survey Once the customer has confirmed resolution and the incident is in process of being officially closed, a customer satisfaction survey is to be provided to inform future improvement opportunities. Yale University Management Process 12 of 17

9.0 Major Process Situation Manager Manager Manager 3.0 9.1 Validates Major 9.3 Trigger and execute communications as required until Resolved 9.2 Ensure Assignment of to Functional Team Consult with Business & IT to invoke Business/IT Service Continuity Plans as defined 9.4 Prepare Major documentation as defined & Perform Post Resolution Review 5.0 8.0 Step 9.1 Validate Major If an incident is escalated to a Major status, the Manager must first ensure that it should be treated as a Major and be given the enhanced communication and management attention that a Major requires. 9.2 Ensure Assignment of to Functional Team 9.3 Trigger and execute communications as required until Resolved 9.4 Prepare Major documentation as defined & Perform Post Resolution Review Ensure that the incident has been assigned to the appropriate team for resolution and works with the management structure to coordinate a crossfunctional team to address the situation if needed and where the underlying issue is unclear. Ensure that the communication is planned and executed according to internal procedures and triggers. At a minimum communication is to be shared at the beginning and end of a Major and perhaps at specific intervals throughout the resolution process. This communication can be to either internal IT stakeholders or Customers or a combination of both. Upon resolution of a Major, documentation must be prepared that summarizes the issue, actions taken and resolution details. It should also trigger root case analysis if required and allow for improvements that can be made to avoid the situation in the future. Yale University Management Process 13 of 17

Prioritization scheme The prioritization scheme is based on a combination of Impact and Urgency and used to: Generate a simplified value in order to drive escalation and notification Establish appropriate order and extent of work effort to achieve resolution Impact: Measure of scope and criticality to business Often equal to the extent to which an incident leads to distortion of agreed or expected service levels Impact is often measured by the number of people affected, critical systems, or financial loss as a result of the service interruption Urgency: Measure of how quickly an incident needs to be responded to based on the business needs of the customer Prioritizes workload; incidents with higher urgency will be addressed before others, unless SLA breaches are imminent The follow table shows the resulting priority based on impact and urgency. Service Categorization Model A Service Categorization model is an important aspect that: Recognizes need to capture service vs. technology details Future-proofed for introduction of Service Asset and Configuration Management Enhances value of reporting by defining IT service view in terms the business should understand Yale University Management Process 14 of 17

The following shows the 2 Service Categorization levels that enable views from both Customer/Business and Provider perspectives: Process Metrics The following table describes the Management KPIs identified. Yale KPI KPI Name OM DESCRIPTION # 2.1 % s logged How many tickets are incidents? s are tickets excluding service requests and complaints DERIVED s by Source (e.g. Call, Email etc.) How many incidents are logged via calls 11 First incident resolution How many incidents were resolved correctly in first attempt? 12 s resolved at Help Desk How many incidents were resolved at the help desk? 13 s that bypassed help desk support How many incidents were not reported to help desk 17 Reassignment Count How many incidents are reassigned to another workgroup? 14 % s assigned incorrectly How many incidents are assigned to the wrong workgroup? % s escalated Inaccurately How many incidents are assigned from Tier 2 to Tier 2, vs. being reassigned to Tier 1 15 % s escalated s that are escalated to another group 17 Average time for second level support to respond Time elapsed between a call being assigned to second level support and the call being accepted Yale University Management Process 15 of 17

Yale KPI KPI Name OM DESCRIPTION # 18 s reopened How many incidents were reopened? 19 Ageing incidents How many incidents are unresolved past threshold 20 s responded on time (Tier 2) How many incidents were responded on time by Tier 2 NEW s responded on time (customerperspective) How many incidents were responded to on time from logging to last in-progress 21 s resolved on time How many incidents were resolved on time DERIVED s by Priority 24.1 Cost per Ratio of operational budget to users of IT services Yale University Management Process 16 of 17

Document History Version Date Changes Author 01 12/31/2011 Initial Document Angie Massicotte / Michael Oas 02 01/02/2012 Minor Updates Michael Oas Yale University Management Process 17 of 17