Interconnecting network monitoring and ticketing systems at CERN




Gyorgy Balazs, Veronique Lefebure
SIG-NOC meeting, Stuttgart, 08.04.2015

Network/Telecom environment

Multi-purpose, multi-vendor network infrastructure for general connectivity, technical instruments, WLCG, Internet Exchange Point (CIXP):
- 3 distinct 10-100 gigabit backbones
- 2 datacenters (Geneva + Budapest)
- 150+ high performance routers
- 3 700+ subnets
- 3 000+ switches
- 50 000 active user devices
- 80 000 sockets
- 5 000 km of UTP cable
- 400+ starpoints (from 20 to 1 000 outlets)
- 5 000 km of CERN owned fibers
- 500 Gbps of WAN connectivity

Telecom services:
- 12 000 fixed telephone lines (analogue, ISDN, IP, Lync)
- 6 000 mobile phones on partly CERN operated infrastructure
- 300 TETRA digital radio handsets on CERN managed infrastructure

Extremely dynamic environment:
- 2x more visitors than staff
- 1 500 connection and change requests / month

NOC Structure (Network and Telecom incident management)

CERN Users

Standard incidents and user support:
- On-site Outsourcing
- IT Helpdesk (Network)
- Service Desk
- Field Technicians
- Switchboard & telecom Lab (Telephony)

Service monitoring and advanced support:
- CERN accelerator and experiment control rooms
- Network and Telecom Operations
- Computer Centre Operator (24/7)

Advanced incident and problem management:
- Network Engineer teams
- Telecom Engineer teams

External support and collaboration:
- External Entities (Vendors, CIXP, LHCOPN...)
- External Entities (Vendors, Telecom operators)

Infrastructure Monitoring

Monitoring tool: CA Spectrum fed by the in-house developed NMS

Used for:
- Monitoring network & telecom devices and related servers, PDUs, temperature sensors, selected hosts for experiment instruments
- Tracking alarms and sending notifications
- Collecting data for statistics

Service Management / Ticketing

Integrated ITSM tool: ServiceNow

Used for:
- User incidents and requests
- Intra-NOC ticketing
- Knowledge base
- Change management
- Intervention planning
- Service catalogue and portal
- Service status board
- OWH support calendars
- Reporting (e.g. SLA tracking)

Why interconnect Monitoring + Ticketing?

Automatic mapping of ticket ID to alarm:
- Live tracking of the troubleshooting process from the alarm screen
- Ease updates of service status board to keep users informed
- Documented solution accessible from the alarm history

Automatic ticket creation:
- Automate incident assignment to support groups
- Ensure analysis of even short outages for long-term improvement
- Procedures directly in the ticket based on alarm and equipment type
- No more copy-pasting when creating tickets
- Automatic clock-stack for SLA tracking and post mortem analysis

Interconnection via centrally managed message broker interface

- Ticket automatically created when alarm shows up
- Alarm contains link to corresponding ticket
- Ticket contains:
  - Incident type
  - Device(s) concerned
  - Service concerned
  - Link to corresponding procedure
  - Device alarm history with ticket references
- Alarms are grouped in single ticket by service and geolocation
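The grouping rule on this slide (one ticket per service and geolocation) can be illustrated with a small sketch. This is not the CERN implementation: the `Alarm` and `Ticket` classes, their field names, and the sample device, service, and location labels are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Alarm:
    # All field names here are hypothetical, chosen to mirror the slide.
    alarm_id: str
    device: str
    service: str
    location: str
    incident_type: str


@dataclass
class Ticket:
    service: str
    location: str
    incident_types: set = field(default_factory=set)
    devices: list = field(default_factory=list)
    alarm_ids: list = field(default_factory=list)


def group_alarms(alarms):
    """Return one Ticket per (service, location) pair, keyed by that pair."""
    tickets = {}
    for alarm in alarms:
        key = (alarm.service, alarm.location)
        ticket = tickets.setdefault(key, Ticket(alarm.service, alarm.location))
        ticket.incident_types.add(alarm.incident_type)
        if alarm.device not in ticket.devices:
            ticket.devices.append(alarm.device)
        ticket.alarm_ids.append(alarm.alarm_id)
    return tickets


if __name__ == "__main__":
    sample = [
        Alarm("a1", "sw-1", "campus-lan", "bldg-31", "link_down"),
        Alarm("a2", "sw-2", "campus-lan", "bldg-31", "link_down"),
        Alarm("a3", "rt-1", "wan", "bldg-513", "device_unreachable"),
    ]
    for key, ticket in group_alarms(sample).items():
        print(key, ticket.devices, ticket.alarm_ids)
```

Keying on the pair keeps two alarms from neighbouring switches in one ticket while an unrelated outage elsewhere still opens its own; a coarser or finer key changes how many tickets are raised.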

How does it work?

[Diagram. Labels, in the order they appear: REST API; SpectroServer; Alarm; Alarm Notifier (filters); Ticket ID; Listener script; Spectrum; SMS; E-mail; Script; Return msgs; STOMP call; Incident grouping; ServiceNow; INC; Message Broker]
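The slide names a STOMP call and a message broker but does not show the message format. As a rough sketch of what such a call carries, the code below builds and parses a STOMP 1.2 SEND frame with a JSON body. The destination `/queue/noc.alarms` and the alarm field names are assumptions for illustration, and header values are kept free of characters that would need STOMP escaping.

```python
import json


def build_send_frame(destination: str, alarm: dict) -> bytes:
    """Build a STOMP SEND frame: command, headers, blank line, body, NUL."""
    body = json.dumps(alarm, sort_keys=True).encode("utf-8")
    headers = [
        f"destination:{destination}",
        "content-type:application/json",
        f"content-length:{len(body)}",
    ]
    head = "SEND\n" + "\n".join(headers) + "\n\n"
    return head.encode("utf-8") + body + b"\x00"


def parse_frame(frame: bytes):
    """Split a frame built above back into command, headers and JSON body."""
    head, _, rest = frame.partition(b"\n\n")
    lines = head.decode("utf-8").split("\n")
    command = lines[0]
    headers = dict(line.split(":", 1) for line in lines[1:])
    body = rest[: int(headers["content-length"])]
    return command, headers, json.loads(body)


if __name__ == "__main__":
    # Hypothetical queue name and alarm payload.
    frame = build_send_frame(
        "/queue/noc.alarms",
        {"alarm_id": "a1", "device": "sw-1", "severity": "major"},
    )
    print(parse_frame(frame))
```

In practice a STOMP client library would handle connection setup and framing; the sketch only shows the shape of a single message.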

Challenges

- Granularity of alarm grouping: not to flood, not to miss
- Mapping of alarms to:
  - Support groups
  - Procedures
  - Corresponding alarm history
- Granularity of procedures: per device/service/support group?
- Automatic update to the ticket when device status changes
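One way to approach the mapping of alarms to support groups and procedures is an ordered rule table, with the most specific rule first and wildcards as fallbacks. The sketch below is only an illustration: the rule contents, the device and alarm type names, and the procedure identifiers are hypothetical, and the group names are borrowed from the NOC structure slide without implying that this is how CERN assigns them.

```python
# Each rule: ((device_type, alarm_type), (support_group, procedure)).
# None acts as a wildcard. Rules are checked in order, most specific first.
# All entries are hypothetical examples.
RULES = [
    (("switch", "link_down"), ("Field Technicians", "proc/switch-link-down")),
    (("switch", None), ("Network and Telecom Operations", "proc/switch-generic")),
    ((None, None), ("Computer Centre Operator", "proc/default")),
]


def route(device_type: str, alarm_type: str):
    """Return (support_group, procedure) from the first matching rule."""
    for (rule_device, rule_alarm), target in RULES:
        device_ok = rule_device is None or rule_device == device_type
        alarm_ok = rule_alarm is None or rule_alarm == alarm_type
        if device_ok and alarm_ok:
            return target
    raise LookupError(f"no rule for {device_type!r}/{alarm_type!r}")


if __name__ == "__main__":
    print(route("switch", "link_down"))
    print(route("switch", "fan_failure"))
    print(route("pdu", "power_loss"))
```

The granularity question raised on the slide shows up here as the number of rules: more specific rules give more precise procedures but are more work to maintain.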

Results so far

- Ticket dispatching could be outsourced to the Computer Centre operator 24/7 (no specialization needed, procedure in ticket)
- Improved response time consistency: the time between the incident and ticket assignment to the field technician is precisely controlled
- Network operations team is refocusing on proactive instead of reactive tasks (post-mortem analysis, indicative monitoring events), with visible improvement in detecting anomalies
- Engineers can check resolution status directly from the alarm screen or alarm history