Monitoring Systems and Services. Alwin Brokmann DESY-IT March 24 28,2003 CHEP 2003 San Diego

Similar documents

Setting Up A Nagios Monitoring System Warren Block, May 2005

How To Monitor A Network With Nagios And Other Tools

Présentation de Nagios

DEPLOYMENT GUIDE Version 1.0. Deploying the BIG-IP LTM with the Nagios Open Source Network Monitoring System

Managing Monitoring in Distributed Environments

Document d'installation FAN 2.1

NETWORK MONITOR. Some high-end network monitoring. Watching your systems with Nagios COVER STORY. What Is Nagios? Installing the Server and Plugins

Network Monitoring Systems / Nagios. 2/19/08 Michael Miller e mail: mike.mikemiller@gmail.com

Monitoring MySQL. Geert Vanderkelen MySQL Senior Support Engineer Sun Microsystems

Availability Management Nagios overview. TEIN2 training Bangkok September 2005

Network Monitoring with Nagios. Matt Gracie, Information Security Administrator Canisius College, Buffalo, NY

Nagios. cooler than it looks. Wednesday, 31 October 2007

NRPE Documentation CONTENTS. 1. Introduction... a) Purpose... b) Design Overview Example Uses... a) Direct Checks... b) Indirect Checks...

AusCERT Remote Monitoring Service (ARMS) User Guide for AusCERT Members

Monitoring VoIP Systems. Sebastian Damm

September 21 st, Ethan Galstad.

How To Install Nagios Network Monitoring Program On A Pc Or Macbook Or Ipad (For Mac) With A Web Browser (For A Mac)

Open Source Management Options

Research Report Security Management with Open Source Tools

Analysis One Code Desc. Transaction Amount. Fiscal Period

Nagios introduction. Dhruba Raj Bhandari (CCNA) Additions by Phil Regnauld.

Robust & Reliable DNS Operations Logging & Monitoring

Monitoring Your Enterprise PACS With Nagios, Cacti And Smokeping

How To Monitor A Network With Nagios And Rt Software On Linux On A Microsoft Ipad (A2) On A Pc Or Macbook Or Ipad Or Ipa (A3) On An Ipa Or Ipo (

While are you still in Nagios working directory, create a new file for DNS servers monitoring

Monitoring the World with NetBSD

Nagios. Nagios. Jacquelin Charbonnel - Albert Shih. CNRS - Ecole Mathrice Marseille, Novembre / 72

OMNITURE MONITORING. Ensuring the Security and Availability of Customer Data. June 16, 2008 Version 2.0

Setup Guide. network support pc repairs web design graphic design Internet services spam filtering hosting sales programming

LAUREA MAGISTRALE - CURRICULUM IN INTERNATIONAL MANAGEMENT, LEGISLATION AND SOCIETY. 1st TERM (14 SEPT - 27 NOV)

FMAudit Agent Install Guide

There are numerous ways to access monitors:

Orientation Course - Lab Manual

Service Level Agreement

Network Monitoring. Dhruba Raj Bhandari (CCNA) Manager Systems Soaltee Crowne Plaza Kathmandu NEPAL

SupportDesk Extensions Installation Guide Extension Service - Versions

Manual. By GFI Software Ltd. GFI Network Server Monitor

An Introduction to Monitoring with Nagios

NetIQ AppManager Suite

WhatsUp Gold v11 Features Overview

Microsoft Outlook 2013 & Microsoft Outlook Microsoft Outlook Windows Live Mail 2012 & MAC Mail. Mozilla Thunderbird

SIG-NOC Meeting - Stuttgart 04/08/2015 Icinga - Open Source Monitoring

Monitoring Software Services registered with science.canarie.ca

6.0. Getting Started Guide

Supermicro Server Monitoring with SuperDoctor 5 and Nagios Using SNMP Protocol. Version 1.1b

Managed Services OVERVIEW

The Nagios check_logfiles plugin helps you monitor your logfiles even if the logs rotate and change names.

User s Manual. Management Software for ATS

Printer Management Software

HP LeftHand SAN Solutions

heck What the is wrong with my server?!? Josh Malone Systems Administrator National Radio Astronomy Observatory Charlottesville, VA

Configure Cisco Unified Customer Voice Portal

GFI Product Manual. Manual

GFI Network Server Monitor 7.0. Manual. By GFI Software Ltd.

Network Monitoring With Nagios. Abstract

The road to lazy monitoring with Icinga2 & Puppet. Tom De

Case 2:08-cv ABC-E Document 1-4 Filed 04/15/2008 Page 1 of 138. Exhibit 8

NETWORK MANAGEMENT AND REMOTE MONITORING VIA SMS APPLICATION

Gmail Or other POP3

Monitoring of computer networks and applications using Nagios

Business Internet service from Bell User Guide

Altaro Hyper-V Backup - Getting Started

AT&T Global Network Client for Windows Product Support Matrix January 29, 2015

AdventNet MSP Solutions

Compuprint 4247 Serial Matrix Printers

Avira Management Console User Manual

MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME. COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix)

Cisco WebEx Meetings Server Administration Guide

TOP LEVEL COMPONENTS

Webmail. Setting up your account

FAMIILIA IT SERVCES Aranda CMDB- SELF SERVICE - SERVICE DESK

HowTo Check. Microsoft Cluster. Functionality via SNMP

High Availability with Elixir

NetCrunch 6. AdRem. Network Monitoring Server. Document. Monitor. Manage

Network User's Guide for HL-2070N

ITSM Service Monitoring Using Open Source Tools

Managing Ports and System Services using BT NetProtect Plus firewall

Planning Maintenance for Complex Networks

VMware vsphere 5.0 Boot Camp

VANITHASHRREE RAJA 21 ST OCTOBER 1988 SERVER, CPU & HARDISK NETWORK MONITORING SYSTEM 2011/2012 EN. ALIAS MOHD

Microsoft Outlook 2010

Integrated processes aligned to your business The examples of the new NetEye and EriZone releases

Quick Reference Guide: Business Mail

Cannot send Autosupport , error message: Unknown User

Resonate Central Dispatch

Using Actions and Alerts

shortcut Tap into learning NOW! Visit for a complete list of Short Cuts. Your Short Cut to Knowledge

EUCIP - IT Administrator. Module 2 Operating Systems. Version 2.0

VMware vsphere 5.1 Advanced Administration

How to configure your Windows PC post migrating to Microsoft Office 365

TF-NOC Dublin. Alexandros Kosiaris GRNET NOC Use puppet and network inventory to populate nagios/icinga configuration

Getting Started Guide

Applications Manager Best Practices document

SNMP Web card. User s Manual. Management Software for Uninterruptible Power Supply Systems

Transcription:

Monitoring Systems and Services Alwin Brokmann DESY-IT March 24 28,2003 CHEP 2003 San Diego

Requirements Host Monitoring Service Monitoring Navigation User specific Parameter s WEB Interface Alarming Escalation Simple Configuration Reporting Interface to Trouble Ticket System Fault Isolation

Why NAGIOS There are several reasons for us to select NAGIOS. a. Fulfills most of our requirements b. Possibility to submit our own test`s (Plug In Concept) c. Scalability d. Design (WEB Interface) e. Out of the Box functionality e. Price

History Production Test Starting Oct. 2001 One PC running NetSaint -- Monitoring 50 Hosts March 2002 3 PC`s running NetSaint -- Monitoring ~300 Hosts, several Proc`s Jun 2002 Testing logsurfer Nov 2002 NAGIOS 1.0 available 4 PC`s running NAGIOS ~ growing Number of Hosts and Services LogHost is running /connecting to NAGIOS Feb 2003 New Service Checks for AFS

Monitoring Policy Every Host in the Computer Center will be monitored and also Centrally Supported Printer`s Host Network Device Farm PC Printer Workgroup Server Mail WEB Server AFS Server Check by PING PING SNMP Load Disk Process POP IMAP HTTP Service Monitoring

Monitoring Service for Clusters Hardware Cluster: Mail cluster, consisting of 2 computers. Service Cluster: YP cluster which consists of several computers for the YP Service To make the check of a cluster possible we need a Check Cluster Plug In. We can define for each cluster how many components may fail before an alarm is triggered

Monitoring AFS With the introduction of OpenAFS at DESY we experienced, that a simple process monitoring gives no reliable answers. Therefore we added some new tests in NAGIOS to ensure the operation of the AFS Servers. For these tests we use afs tools like rxdebug, vos etc.. The result is then transferred to NAGIOS.

Host Statistics (Feb. 2003) Host Services ~630 ~1300 Type Quantity Printer 17 Network 37 UNIX 550 Windows 26

Configuration At present we use simple flat files to describe host and services. This means a very high effort, but makes it simply possible to distribute the checks For the future we plan to produce the tests automatically over our AMS. For an automatic configuration of NAGIOS we will extract the informatios such as computer name, interface and naturally which service runs on the computer from the AMS database. Perhaps we will hold the data in a data base

Configuration Example define hostgroup{ name hostgroup_name alias contact_groups members } define host{ host_name night night night sgi-admins,night-admins netra8,test1,test2 netra8 alias netra AFS Server address 131.169.40.109 parents route-194,route-40 use } define service{ hostcheck use fileserver host_name netra8,test1,test2 contact_groups afs-admins,night-admins }

Configuration Example define service{ name fileserver service_description fileserver is_volatile 0 active_checks_enabled 0 passive_checks_enabled 1 check_period 24x7 max_check_attempts 10 normal_check_interval 1 retry_check_interval 5 notification_interval 2200 notification_period 24x7 notification_options w,u,c,r check_command check_named_proc!' '!' '!fileserver register 0 }

Monitoring Setup @ DESY IT

Monitoring Server Setup

Operator Console

Problem Notification **** Nagios 1.0 ***** Notification Type: PROBLEM Service: IT Web Server Host: WWW Server WEB Address: 131.169.40.38 State: CRITICAL Date/Time: Tue Mar 19 08:35:59 MET 2003 Additional Info: Connection refused by host

Recovery Notification ***** Nagios 1.0 ***** Notification Type: RECOVERY Service: IT Web Server Host: WWW Server WEB Address: 131.169.40.38 State: OK Date/Time: Wed Mar 19 08:37:46 MET 2003 Additional Info: HTTP ok: HTTP/1.1 200 OK - 0 second response time

LOGSURFER * Works on any textfile (or text from standard input) * Matching of lines is done by two regular expression (logline must match the first expression but must not match the optional second regular expression). So you are able to specify exceptions. * Uses contexts (collection of messages) instead single lines * Flexible but easy configuration * Timeouts and resource limits included * Handles "shifting" of logfiles * Dynamic rules can change the actions associated with logmessages (something might happen that makes you interested in messages you would usually drop) * Multiple reactions on one logline possible

References NAGIOS www.nagios.org SNMP www.net-snmp.org logsurfer www.dfn-cert.de/eng/logsurf/index.html