How To Monitor A Network With Nagios And Other Tools



Similar documents
Présentation de Nagios

Availability Management Nagios overview. TEIN2 training Bangkok September 2005

Document d'installation FAN 2.1

Managing Monitoring in Distributed Environments

Monitoring Systems and Services. Alwin Brokmann DESY-IT March 24 28,2003 CHEP 2003 San Diego

NETWORK MONITOR. Some high-end network monitoring. Watching your systems with Nagios COVER STORY. What Is Nagios? Installing the Server and Plugins

DEPLOYMENT GUIDE Version 1.0. Deploying the BIG-IP LTM with the Nagios Open Source Network Monitoring System

Library IT departments have an obligation to provide

NRPE Documentation CONTENTS. 1. Introduction... a) Purpose... b) Design Overview Example Uses... a) Direct Checks... b) Indirect Checks...

Nagios. cooler than it looks. Wednesday, 31 October 2007

Setting Up A Nagios Monitoring System Warren Block, May 2005

Monitoring VoIP Systems. Sebastian Damm

Open Source Management Options

wget

Network Monitoring Systems / Nagios. 2/19/08 Michael Miller e mail: mike.mikemiller@gmail.com

Network Monitoring with Nagios. Matt Gracie, Information Security Administrator Canisius College, Buffalo, NY

Installation von des Netzwerküberwachungssystems Nagios 1.1 auf einem Debian-Rechner

September 21 st, Ethan Galstad.

A SURVEY ON AUTOMATED SERVER MONITORING

While are you still in Nagios working directory, create a new file for DNS servers monitoring

How To Monitor A Network With Nagios And Rt Software On Linux On A Microsoft Ipad (A2) On A Pc Or Macbook Or Ipad Or Ipa (A3) On An Ipa Or Ipo (

Monitoring Your Enterprise PACS With Nagios, Cacti And Smokeping

O p e n N M S. Marcin Rybacki OpenNMS

Network Monitoring. Lance Rea. Davis & Gilbert LLP lrea@dglaw.com

Monitoring MySQL. Geert Vanderkelen MySQL Senior Support Engineer Sun Microsystems

heck What the is wrong with my server?!? Josh Malone Systems Administrator National Radio Astronomy Observatory Charlottesville, VA

An Introduction to Monitoring with Nagios

Best of Breed of an ITIL based IT Monitoring. The System Management strategy of NetEye

Network Monitoring. By: Delbert Thompson Network & Network Security Supervisor Basin Electric Power Cooperative

TF-NOC Dublin. Alexandros Kosiaris GRNET NOC Use puppet and network inventory to populate nagios/icinga configuration

Robust & Reliable DNS Operations Logging & Monitoring

Network Monitoring With Nagios. Abstract

Whitepaper. Business Service monitoring approach

SapphireIMS Business Service Monitoring Feature Specification

MONITORING EMC GREENPLUM DCA WITH NAGIOS

SIG-NOC Meeting - Stuttgart 04/08/2015 Icinga - Open Source Monitoring

NetCrunch 6. AdRem. Network Monitoring Server. Document. Monitor. Manage

Monitoring a Linux Mail Server

Network Management & Monitoring Overview

Details. Some details on the core concepts:

Nagios. Nagios. Jacquelin Charbonnel - Albert Shih. CNRS - Ecole Mathrice Marseille, Novembre / 72

Configuring SNMP Cisco and/or its affiliates. All rights reserved. 1

Monitoring Software Services registered with science.canarie.ca

Heroix Longitude Quick Start Guide V7.1

Nagios Core Version 3.x Documentation

How To Monitor Your Computer With Nagiostee.Org (Nagios)

Monitoring the World with NetBSD

There are numerous ways to access monitors:

How To Install Nagios Network Monitoring Program On A Pc Or Macbook Or Ipad (For Mac) With A Web Browser (For A Mac)

Newton Linux User Group Graphing SNMP with Cacti and RRDtool

Free Network Monitoring Software for Small Networks

Monitoring of computer networks and applications using Nagios

WhatsUp Gold 2016 Getting Started Guide

How To Get Started With Whatsup Gold

Supermicro Server Monitoring with SuperDoctor 5 and Nagios Using SNMP Protocol. Version 1.1b

TPAf KTl Pen source. System Monitoring. Zenoss Core 3.x Network and

Centerity Monitor Standard V3.8 USER GUIDE VERSION 7.14

Monitoring Windows Servers and Applications with GroundWork Monitor Enterprise 6.7. Product Application Guide October 8, 2012

Nagios Core Version 3.x Documentation

ISPadmin. by Robert Haskins SYSADMIN. Robert D. Haskins is currently employed by Renesys Corporation in Hanover, NH.

Operations Management and Open Source Tools

Research Report Security Management with Open Source Tools

Open Source in the Data Centre. John Ferlito Bulletproof Networks

IPv6 network management. Where and when?

Centerity Monitor Standard V3.8.4 USER GUIDE VERSION 9.15

SapphireIMS 4.0 BSM Feature Specification

Veriton. Getting the Best out of Hardware Load Balancers in an Oracle Environment. What is a Load Balancer? Agenda. What s out there?

VANITHASHRREE RAJA 21 ST OCTOBER 1988 SERVER, CPU & HARDISK NETWORK MONITORING SYSTEM 2011/2012 EN. ALIAS MOHD

Minimal network traffic is the result of SiteAudit s design. The information below explains why network traffic is minimized.

INTELLIGENT REQUEST TRACKING AND NETWORK MONITORING SYSTEM

Network Monitoring. Dhruba Raj Bhandari (CCNA) Manager Systems Soaltee Crowne Plaza Kathmandu NEPAL

NETWORK MANAGEMENT AND REMOTE MONITORING VIA SMS APPLICATION

Advanced Linux System Administration Knowledge GNU/LINUX Requirements

Can You Afford Network Downtime?

GroundWork Monitor Open Source Readme

Ein Unternehmen stellt sich vor. Nagios in large environments

Network Monitoring Tools

Management, Logging and Troubleshooting

IPv6 network management. 6DEPLOY. IPv6 Deployment and Support

Linux MDS Firewall Supplement

Nagios and Cloud Computing

HP Business Availability Center

PANDORA FMS NETWORK DEVICES MONITORING

This watermark does not appear in the registered version - SNMP and OpenNMS. Part 1 SNMP.

Notes on network monitoring, by Oliver Gorwits

Are my systems alive secure?

IPv6 network management

Application Performance Monitoring for WhatsUp Gold v16.1 User Guide

AfNOG 2010 Network Monitoring and Management Tutorial. Introduction to Networking Monitoring and Management

Introduction to system monitoring with Nagios, Check_MK and Open Monitoring Distribution (OMD)

IPv6 network management. Malta, April 2006

NETWORK MONITORING SYSTEM TOOLS: AN EXPLORATORY APPROACH

PANDORA FMS NETWORK DEVICE MONITORING

Network Management & Monitoring Overview

Application Performance Monitor


Introduction to Network Monitoring and Management

EventSentry Overview. Part I About This Guide 1. Part II Overview 2. Part III Installation & Deployment 4. Part IV Monitoring Architecture 13

IBM. Vulnerability scanning and best practices

Transcription:

Network Monitoring with Nagios and other tools Wednesday, 19 July 2006 Martin B. Smith martins@bebr.ufl.edu

What is a network monitoring system? A combination of hardware and software used to administer a network Common NMSes include: Nagios, OpenNMS, HP OpenView, IBM Tivoli NetView, Microsoft Operations Manager, NAV Protocols: SNMP, HTTP, SMTP/IMAP, SSH, or perhaps even WMI

Simple Network Management Protocol For monitoring network-based devices Extensible through Management Information Bases (MIBs) organized as hierarchical namespaces that define object identifiers and data types Permits active (polling) or passive (interrupting) monitoring anywhere in the OSI 7 Layer model, though it operates at Layer 7 Command line tools: snmpwalk, snmpset, snmptrap,snmpget,snmpinform, snmptranslate Daemons: Snmpd, snmptrapd, syslog-ng, etc

Simple Network Management Protocol 1988: Version 1; RFCs 1065, 1066, 1067 Clear-text community string! 1993: Versions 2, 2c, 2u; RFCs 1441-1452, 1901-1910 Bulk requests, new complex security model 2002: Version 3; IETF Recognized, RFCs 3411-3418 Security becomes user/view-based

Benefits and Problems with SNMP Benefits: Standardized, simple & quick, relatively secure, high-end devices usually have built in support for it, autodiscovery Problems: Index shifting, Not everything speaks SNMP or fits the model, requires a centralized or tiered architecture, MIBs are filed based, often a feature add, not very fault tolerant

Nagios "NAH-gee-ohs" with a hard 'G' like geese GPL v2, runs on Linux and Unix variants Stable version 2.5, though many run 1.4.x Originally called NetSaint, written in C Configuration is file-based/template ready Supports active and passive checks as well as distributed monitoring and failover

What advantages does Nagios have? Plug-in system (!!) for writing a custom check in any language you prefer Intelligent scheduling and parallelization Can tell apart down/unreachable checks Automatic log file rotation, performance data processing, and a web interface! Community and professional support Integrates into SNMP and other solutions

Configuring Nagios Configuration objects: Contacts, Hosts, Services, Commands, Events, Time Periods, Dependency, Escalation, and External Extended Information Templates and Groups allow small configuration changes to drastically alter Include external files and whole directories Downtime, host/service notes, freshness

Contacts and contact groups define contact { contact_name alias service_notification_period host_notification_period service_notification_options host_notification_options service_notification_commands host_notification_commands email 24x7 24x7 d,r root Root w,u,c,r notify-by-email host-notify-by-email root-notifications@bebr.ufl.edu define contactgroup { contactgroup_name alias members gatorlug-people GatorLUG People clint,clint-phone,martin,barrys

Time periods define timeperiod { timeperiod_name 24x7 alias 24 Hours A Day, 7 Days A Week sunday 00:00-24:00 monday 00:00-24:00 tuesday 00:00-24:00 wednesday 00:00-24:00 thursday 00:00-24:00 friday 00:00-24:00 saturday 00:00-24:00 define timeperiod { timeperiod_name workhours alias Standard WorkHours monday 08:00-18:00 tuesday 08:00-18:00 wednesday 08:00-18:00 thursday 08:00-18:00 friday 08:00-18:00

Template, Host, and a Hostgroup define host { name generic-host notifications_enabled 1 event_handler_enabled 1 flap_detection_enabled 1 failure_prediction_enabled 1 process_perf_data 0 retain_status_information 1 retain_nonstatus_information 1 check_command check-host-alive max_check_attempts 10 notification_interval 0 notification_period 24x7 notification_options d,u,r contact_groups admins register 0 define host { use generic-linux-host host_name gatorlug alias GatorLUG.org Server address 128.227.123.8 parents mat-router1 contact_groups gatorlug-people define hostgroup { hostgroup_name networking alias Networking Devices members sur-switch1, surswitch2, sur-switch3

Service template and Services define service{ name generic-service active_checks_enabled 1 passive_checks_enabled 1 parallelize_check 1 obsess_over_service 1 check_freshness 0 notifications_enabled 1 event_handler_enabled 1 flap_detection_enabled 1 failure_prediction_enabled 1 process_perf_data 0 retain_status_information 1 retain_nonstatus_information 1 notification_interval 0 is_volatile 0 check_period 24x7 normal_check_interval 5 retry_check_interval 1 max_check_attempts 4 notification_period 24x7 notification_options w,u,c,r contact_groups admins register 0 define service{ hostgroup_name linux service_description Load process_perf_data 1 check_command check_nrpe!check_load!1.5,1.25,1 1.75,1.5,1.25 use generic-service define service{ host_name gatorlug service_description Load process_perf_data 1 check_command 1.75,1.5,1.25 check_nrpe!check_load!1.5,1.25,1 use generic-service contact_groups gatorlug-people

Check commands # Service checks know these plugins as: check_mysql (no arguments) check_mysql_cmdlinecred!user!password # 'check_mysql' command definition define command{ command_name check_mysql command_line $USER1$/check_mysql -H $HOSTADDRESS$ # 'check_mysql_cmdlinecred' command definition define command{ command_name check_mysql_cmdlinecred command_line $USER1$/check_mysql -H $HOSTADDRESS$ -u $ARG1$ -p $ARG2$

Creating check scripts Simple scripts or C programs that return ('OK'=>0,'WARNING'=>1,'CRITICAL'=>2,'UNKNOWN'=> 3,'DEPENDENT'=>4) or timeout Nagios includes an official suite of plugins that are entirely a separate project, on Sourceforge Nagios plugin suite includes already-written checks for dhcp, dns, disks, smb, file_age, ftp, http, icmp, ifstatus, imap, jabber, ldap, load, log, mysql, ntp, windows, oracle, pgsql, rpc, radius, lmsensors, smtp, snmp, spop, sshd, ssmtp, tcp, time, udp, ups, users, waveform, negate

Events and notifications Event handlers put a command into the command file (like the Web interface does), execute an external script, etc Event handlers may try to solve some problems head on, before they get worse or you respond Notifications are really just check commands that send e-mail or notify you Notifications will continue until you respond, and they will escalate until someone responds or status changes

The lifetime of a check For active checks, Nagios runs your command and waits for a response or the timeout; For passive checks, Nagios does not act until the staleness limit is reached, and then it attempts an active check If the check command returns OK or downtime is scheduled, mark that in the logs and continue, otherwise notify any listed contacts, execute any event handlers, eventually escalating If the service changes status at all, notify contacts of new state, and treat if the new state is not OK, treat this as a new failed check and do it again

Downtime, Flapping, and Extended Information Downtime can be scheduled and fixed or flexible; repetitive downtime is scheduled with cron and a plugin that inserts downtime commands into the cmd file Services that change states with frequency above a certain threshold during a certain period are considered flapping, and notification is supressed temporarily Extended information about hosts can be provided with config files or scripts and may provide links to the host itself, more information about the host, or anything else

Web interface and Data Visualization Nagios also sports an elaborate web interface with CGI files that show status and can also issue commands Nagios can be told to record and process performance data, and this data can be made available through graphing tools and extended information on the web interface

Perfdata and plugins Plugins may return performance data after their normal output using a delimiter, and Nagios will periodically run a command to process this data Popular perfdata plugins send performance data to RRDtool (Round-robin Database), the industry standard logging and graphing tool Other perfdata scripts insert into databases or otherwise consume the information

Other utilities and NMSes Web-based RRDtool frontend Cacti OpenNMS, Java-based Enterprise SNMP NAV, MRTG, and Netflow SNMP Trap senders, translators, and MIB viewers/explorers Could integrate all of these into Nagios!

Our Configuration Secure HTTP with Apache2, LDAP authentication tied to network credentials 35 hosts, 97 services, 18 host groups, 9 service groups Devices types: Routers, Switches, Printers, UPSes, Servers Service types: Software, temperature, load, disk space, HTTP response times, Voltage and power load, raid failures

The End Slides and notes will be posted to the GatorLUG website, including URIs for software projects and pointers to reference material Please don t harass our Nagios-monitored boxes now that you ve seen a list of them Thank you!