Availability Management Nagios overview. TEIN2 training Bangkok September 2005



Similar documents
How To Monitor A Network With Nagios And Other Tools

DEPLOYMENT GUIDE Version 1.0. Deploying the BIG-IP LTM with the Nagios Open Source Network Monitoring System

Network Monitoring with Nagios. Matt Gracie, Information Security Administrator Canisius College, Buffalo, NY

Nagios introduction. Dhruba Raj Bhandari (CCNA) Additions by Phil Regnauld.

Document d'installation FAN 2.1

Open Source Management Options

MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME. COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix)

An Introduction to Monitoring with Nagios

Nagios. cooler than it looks. Wednesday, 31 October 2007

A SURVEY ON AUTOMATED SERVER MONITORING

Network Monitoring Tools

Setting Up A Nagios Monitoring System Warren Block, May 2005

Network Monitoring. Dhruba Raj Bhandari (CCNA) Manager Systems Soaltee Crowne Plaza Kathmandu NEPAL

Tk20 Network Infrastructure

Network Monitoring with SNMP

Network Monitoring With Nagios. Abstract

Network Monitoring Systems / Nagios. 2/19/08 Michael Miller e mail: mike.mikemiller@gmail.com

mbits Network Operations Centrec

PANDORA FMS NETWORK DEVICE MONITORING

PANDORA FMS NETWORK DEVICES MONITORING

OMNITURE MONITORING. Ensuring the Security and Availability of Customer Data. June 16, 2008 Version 2.0

NMS300 Network Management System

SapphireIMS 4.0 BSM Feature Specification

AGENDA: INTRODUCTION: 1. How is our cloud monitoring setup? 2. Which are the tools used? 3. How do we access monitoring dashboard?

AfNOG Monitoring of IP Services. Ayitey Bulley Material generously borrowed from the NSRC NME course

NRPE Documentation CONTENTS. 1. Introduction... a) Purpose... b) Design Overview Example Uses... a) Direct Checks... b) Indirect Checks...

DOSarrest External MULTI-SENSOR ARRAY FOR ANALYSIS OF YOUR CDN'S PERFORMANCE IMMEDIATE DETECTION AND REPORTING OF OUTAGES AND / OR ISSUES

SapphireIMS Business Service Monitoring Feature Specification

Monitoring Network Availablity Using Nagios

Présentation de Nagios

Centerity Monitor Standard Edition 3.8 ADMINISTRATION GUIDE VERSION 07.14

Monitoring of computer networks and applications using Nagios

MONITORING RED HAT GLUSTER SERVER DEPLOYMENTS With the Nagios IT infrastructure monitoring tool

Introduction to system monitoring with Nagios, Check_MK and Open Monitoring Distribution (OMD)

Network Monitoring User Guide

CAREN NOC MONITORING AND SECURITY

Free Network Monitoring Software for Small Networks

NETWORK MANAGEMENT AND REMOTE MONITORING VIA SMS APPLICATION

ITSM Service Monitoring Using Open Source Tools

Monitoring Windows Servers and Applications with GroundWork Monitor Enterprise 6.7. Product Application Guide October 8, 2012

Centerity Monitor Standard V3.8 USER GUIDE VERSION 7.14

MSP End User. Version 3.0. Technical Solution Guide

Network Monitoring with SNMP

1. INTERFACE ENHANCEMENTS 2. REPORTING ENHANCEMENTS

Red Hat Network: Monitoring Module Overview

Heroix Longitude Quick Start Guide V7.1

AusCERT Remote Monitoring Service (ARMS) User Guide for AusCERT Members

Applications Manager Best Practices document

The new services in nagios: network bandwidth utility, notification and sms alert in improving the network performance

Best of Breed of an ITIL based IT Monitoring. The System Management strategy of NetEye

NetCrunch 6. AdRem. Network Monitoring Server. Document. Monitor. Manage

Features Overview Guide About new features in WhatsUp Gold v14

How To Monitor Your Computer With Nagiostee.Org (Nagios)

Installing and Configuring Websense Content Gateway

Centerity Monitor Standard V3.8.4 USER GUIDE VERSION 9.15

Configuring SNMP Cisco and/or its affiliates. All rights reserved. 1

Network Management and Monitoring Software

WhatsUp Gold v11 Features Overview

Deploying the BIG-IP LTM with the Cacti Open Source Network Monitoring System

There are numerous ways to access monitors:

1. INTERFACE ENHANCEMENTS 2. REPORTING ENHANCEMENTS

September 21 st, Ethan Galstad.

WÜRTHPHOENIX NetEye Version 3

A FAULT MANAGEMENT WHITEPAPER

Plan THE BEGINNER S GUIDE TO. Customize. Monitor NAGIOS XI. Alert PRESENTED BY

Security Correlation Server Quick Installation Guide

A Brief. Introduction. of MG-SOFT s SNMP Network Management Products. Document Version 1.3, published in June, 2008

Assignment One. ITN534 Network Management. Title: Report on an Integrated Network Management Product (Solar winds 2001 Engineer s Edition)

Network Management & Monitoring Overview

Maintaining Non-Stop Services with Multi Layer Monitoring

WhatsUp Gold v11 Features Overview

EventSentry Overview. Part I About This Guide 1. Part II Overview 2. Part III Installation & Deployment 4. Part IV Monitoring Architecture 13

NETWORK MONITORING & ALERTING SERVICES SERVICE DEFINITION

A recipe using an Open Source monitoring tool for performance monitoring of a SaaS application.

TPAf KTl Pen source. System Monitoring. Zenoss Core 3.x Network and

Firewall VPN Router. Quick Installation Guide M73-APO09-380

7750 SR OS System Management Guide

Nagios at Funet. Teemu Kiviniemi, CSC/Funet 6th June th TF-NOC meeting Dublin, Ireland

Monitoring Systems and Services. Alwin Brokmann DESY-IT March 24 28,2003 CHEP 2003 San Diego

1 Data Center Infrastructure Remote Monitoring

Getting Started with PRTG Network Monitor 2012 Paessler AG

Zenoss Discovery and Classification

Deployment and Monitoring. Pascal Robert MacTI

Network Monitoring. Easy, failsafe, and complete visibility of your network. Our customers have the same view as our NOC technicians.

IP SLAs Overview. Finding Feature Information. Information About IP SLAs. IP SLAs Technology Overview

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds.

Intelligent Power Protector User manual extension for Microsoft Virtual architectures: Hyper-V 6.0 Manager Hyper-V Server (R1&R2)

HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team

Network Management Deployment Guide

Introduction to Network Monitoring and Management

NETWORK MONITOR. Some high-end network monitoring. Watching your systems with Nagios COVER STORY. What Is Nagios? Installing the Server and Plugins

Acronis Monitoring Service

WhatsUp Gold 2016 Getting Started Guide

Red Hat Network Satellite Management and automation of your Red Hat Enterprise Linux environment

Network Management & Monitoring Overview

Whitepaper. Business Service monitoring approach

Dell UPS Local Node Manager USER'S GUIDE EXTENSION FOR MICROSOFT VIRTUAL ARCHITECTURES Dellups.com

Red Hat Satellite Management and automation of your Red Hat Enterprise Linux environment

Vistara Lifecycle Management

Network Management System (NMS) FAQ

Transcription:

1 Availability Management Nagios overview

Agenda 2 Introduction Objectives Functionalities Requirement. Architecture & Operation Operation Description WEB portal Plugins and extensions Plugins description Useful extensions Extension example

3 INTRODUCTION

Objectives: Availability Management 4 Availability Management is a process of the Service Delivery (ITIL specifications). Its goal is to ensure that the required level of Availability is provided. The measurement and monitoring of IT Availability is a key activity to ensure Availability levels are being met consistently. Availability Management should look continuously to optimise the Availability of the IT Infrastructure, services and supporting organisation. In a Network Management environment, monitoring services needed are for example: Routers status Routers environment (CPU, Memory, Temperature ) Links status (interconnexion, access) Links health (traffic evolution, errors level, instability ) Routing protocol status (BGP, MPLS ) The functionalities needed are: Extended monitoring capabilities, including custom programs for non-standard elements Powerful analysis and notification when an incident is detected Easy to use configuration interface Friendly GUI for service status display

Nagios functionalities 5 Nagios is an open source tool specially developed to monitor host and service and designed to inform you of network incidents before your clients, end-users or managers do. It has been designed to run under the Linux operating system, but works fine under most *NIX variants as well Initially developed for servers and application monitoring, it is now widely used to monitor networks availability. It is possible with the development of specific plugins around Nagios process. Nagios works with a set of plugins to provide local and remote service status. The monitoring daemon runs intermittent checks on hosts and services you specify using external "plugins" which return status information to Nagios. When incidents are detected, the daemon send notifications out to administrative contacts in a variety of different ways (email, instant message, SMS, etc.). Current status information, historical logs, and reports can all be accessed via a Web browser. Custom plugins are relatively easy to develop Different methods are provided for remote resource discovery Nagios is freely available from http://www.nagios.org

6 Nagios functionalities Nagios tool provide monitoring facilities (polling elements at regular intervals and generating events depending on polling return status. Nagios can be considered as a scheduler, which execute plugins, analyse results and obtain a status about each element concerned. Nagios can be configured to monitor anything that you can write a script for (Perl scripts, Shell, C programs, ). Nagios comes with loads of plugins for just about every normal known service such as SNMP, DNS, SSH, HTTP, NFS, FTP, etc, etc. The advantages of Nagios against other manufacturer products are: It is a free product, running on Linux OS It is an easy to use product, including all functions needed to perform availability monitoring Extensions (plugins) are simple to develop, using well known languages It is scalable, and it is possible to deploy a hierarchy of Nagios servers to provide consolidated views of network and / or system availability (Hypervisor) The platform shall nevertheless be integrated in an operational environment with all its constraints: Automatic or semi-automatic configuration of multiple services and instances Attractive GUI for incidents identification, reporting, and relation with Incident Management process Operators alarm (visual, audio or through pager for on-call FTE)

7 Requirements Other things you will need to get Nagios working are: Nagios Plugins (from Nagios download URL) GD Graphics Libraries JPEG Lib Sources PNG Lib Sources FPing (Fast Ping), this is optional but useful. For SNMP monitoring you will need: net-snmp-tools, and net-snmp-utils MySQL database for storing: Elements status logs

8 ARCHITECTURE & OPERATION

Architecture 9 Nagios is built on a server/agents architecture. Usually, on a network, a Nagios server is running on a host, and plugins are running on all the remote hosts that need to be monitored. These plugins send information to the server, which displays them in a GUI. Nagios is composed of three parts: A scheduler: this is the server part of Nagios. At regular interval, the scheduler checks the plugins, and according to their results do some actions. A GUI: the interface of Nagios (with the configuration, the alerts,...). It is displayed in web pages generated by CGI. It can be state buttons (green,ok/red,error), sounds, MRTG graphs,... The plugins. They are configurable by the user. They check a service and return a result to the Nagios server. A soft alert is raised when a plugin returns a warning or an error. Then on the GUI, a green button turns to red, and a sound is emitted. When this soft alert is raised many times (the number is configurable), a hard alert is raised, and the Nagios server sends notifications: email, SMS,...

Nagios Architecture (internal) 10 Monitored Elements

Nagios architecture (external) 11 Local host Nagios process Plugins NCSA daemon Remote host NCSA client NRPE/SSH daemon Plugins

Web Interface: tactical overview 12

Web Interface: services overview 13

Web Interface: services status detail 14

Web Interface: services status detail 15

Web Interface: services status detail 16

Web Interface: services status detail 17

Web Interface: 3D status map 18

Configuration Files 19 nagios.cfg Main configuration file Monitored elements configuration resource.cfg cgi.cfg User defined elements configuration file Display options for CGI host, hostgroup and hostextinfo Information concerning monitored equipments service, servicextinfo Information concerning monitored services host and service dependency Dependencies between hosts and between services host escalation, host group escalation, service escalation Escalation notification

Configuration Files 20 contact Nagios users. To access Nagios interface, it is necessary to be registered in this contact file contactgroup User Groups. Notifications and display rights are settled in this file timeperiod Use to configure period of time for elements monitoring and users notifications. command Set of commands used to control services, equipments, events management and notifications

Sample configuration files 21 Hosts.cfg : ========= define host{ notifications_enabled 1 ; Host notifications are enabled event_handler_enabled 1 ; Host event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts # 'novell1' host definition define host{ use generic-host ; Name of host template to use host_name novell1 alias Novell Server #1 address 192.168.1.2 check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r }

22 PLUGINS and EXTENSIONS

Plugins and Extensions 23 Developments on Nagios can be found at http://www.nagiosexchange.org/ AddOn projects are freely available. They cover subjects on: Charts, Communications, Configuration, Development, DownTimes, FrontEnds, Notificatons, Misc Plugins have been developed on: Networking, SNMP, Hardware, Linux, Solaris, Windows,

PLUGINS 24 A plugin is a small program (in Perl, C, java, python,...) that checks a service (a daemon, some free space on a disk,...). It must return a value and a small line of text (Nagios will only grab the first line of text). Output should be in the format: METRIC STATUS: information text performance data The allowed METRIC STATUS are 0 (OK), 1 (WARNING), 2 (CRITICAL) or 3 (UNKNOWN) The warning and critical thresholds are parameters, set by the user, passed as arguments to the plugin. A plugin can also return performance data in the format: "label1=value1 label2=value2..." These data are stored by Nagios and may be later displayed with MRTG (http://people.ee.ethz.ch/~oetiker/webtools/mrtg/) The plugins can be run: Locally, on the Nagios server. But such a plugin can check remote hosts, for example check_ping which pings remote hosts to check if they are running. Remotely, through a remote Nagios server, with ssh, with snmp, with NRPE (Nagios Remote Plugin Executor), or with NSCA (Nagios Service Check Acceptor). It means that the plugin either waits for a verification request from the Nagios server before sending its result, or executes itself and sends the result to the Nagios server.

Sample Plugin: Check_Cisco-Ping (beginning) 25 #!/usr/bin/perl # check_cisco_command - telnet's to a Cisco router to run a command # License Information: # This program is free software; you can redistribute it and/or modify # it under the terms of the GNU General Public License as published by # the Free Software Foundation; either version 2 of the License, or # (at your option) any later version. # use strict; use Getopt::Long; use vars qw($opt_d $opt_u $opt_p $opt_p $opt_h $opt_h $opt_v $opt_v $debug $username $password $hostname $state $error $PROGNAME @command_output $session $command $opt_r $tmp_var_loss $tmp_var_latency $opt_c $opt_w $info $answer $opt_m $metric $warning_rta $warning_pl $timeout $router $critical_rta $critical_pl $opt_t $vrf $stats); use lib "utils.pm"; use utils qw(%errors &print_revision &support &usage ); use Net::Telnet::Cisco; sub print_help (); sub print_usage (); $ENV{'PATH'}=''; $ENV{'BASH_ENV'}=''; $ENV{'ENV'}='';

Other useful developments 26 Alarm resiliency Nagios gives an immediate status of the monitored elements, it has no memory (except in log). It is useful to keep trace of an incident until it has been checked and acknowledged by an operator. Network Interfaces discovery Within big networks, it is useful to «compare» real configuration with database configuration. An external program can check every day (auto-discovery) the real network configuration versus Nagios database. If differences appears, notify network administrator of the change. Semi-automatic configuration For each new element, multiple identified checking have to be configured and started Semi-automatic configuration tool will write Nagios configuration files based on higher level network description files

Extension Example SNMP Traps (UDP Port 162)./bin/traps-dispatch.pl./bin/net-getif.pl SNMP Poll (UDP Port 161) Networks and Equipments./var/history Text file interfaces history interface_date SNMP Poll (UDP Port 161) SSH Access (TCP Port 22)./libexec/check_****./libexec_noc/check_****.pl 27 Email notification (not used)./var/spool/./var/ On-call FTE./bin/traps-rotate.pl Text file logfile.odd logfile.even Text file interfaces.dat interfaces.monitor MySQL events base./bin/nagios Interfaces-form.cgi./bin/nagios-oncall.pl nagios process alarms-form.cgi./bin/traps-compute.pl./var/./etc/ compute-form.cgi Monthly report: Circuits availability Text file avail_cache.htm current_alarm.gen log-form.cgi./bin/nagios-configmaker.pl./nagios/ Text file Nagios format nagios.gen Text file Nagios configuration process nagios.cfg Consult./config/./logs/./config/ Configure Text file Extensions configuration netadmin.cfg Log files netadmin.log netadmin.log.1.gz Services file configuration Full_network Workstations Program Daemon Crontab Job Use Produce Execute Extensions Nagios

BBMAP interfacing Nagios 28 BBMAP (Big Brother MAP) BBMAP gives a graphical view of Nagios elements. Elements on the map are clickable and give access to underlying status details It is easier to modify than Nagios maps (2D and 3) It is logically separated from Nagios (security enhancement): -Nagios is for administrator -BBMAP is for users

BBMAP interfacing Nagios and WheatherMap 29

Cacti and BBMAP consolidation 30 Cacti / BBMAP NOC User in its office? Cacti / Nagios / MySQL server ISP ADSL User at home

References 31 Nagios source program http://www.nagios.org/download/ Nagios Extra developments http://www.nagiosexchange.org/ Official plugins http://nagiosplug.sourceforge.net/ Conferences http://www.nagios.org/propaganda/conferences/

32 THE END Thank you for your attention