Topics. CIT 470: Advanced Network and System Administration. Why Monitoring? Why Monitoring? Historical Monitoring Processes. Historical Monitoring



Similar documents
CIT 470: Advanced Network and System Administration. Topics. Why Monitoring? System Monitoring

CIT 668: System Architecture. Performance Testing

MONIT. UNIX Systems Management

CIT 470: Advanced Network and System Administration. Topics. Performance Monitoring. Performance Monitoring

Network Management & Monitoring Overview

OMNITURE MONITORING. Ensuring the Security and Availability of Customer Data. June 16, 2008 Version 2.0

CIT 470: Advanced Network and System Administration. Topics. Help Desk Life. Help Desks

Topics. CIT 470: Advanced Network and System Administration. Help Desk Life. Types of Help Desks. Scope of Coverage. Help Desk Size.

Introduction to Network Monitoring and Management

CIT 470: Advanced Network and System Administration. Topics. Help Desk Life. Help Desks

Network Management & Monitoring Overview

Why Nagios and Server Monitoring Are Failing Modern Apps

O p e n N M S. Marcin Rybacki OpenNMS

Network Monitoring and Management Introduction to Networking Monitoring and Management

Network Management & Monitoring Overview

orrelog Ping Monitor Adapter Software Users Manual

EventSentry Overview. Part I About This Guide 1. Part II Overview 2. Part III Installation & Deployment 4. Part IV Monitoring Architecture 13

Monitoring MySQL. Presented by, MySQL & O Reilly Media, Inc. A quick overview of available tools

Chapter 8 Monitoring and Logging

AfNOG 2010 Network Monitoring and Management Tutorial. Introduction to Networking Monitoring and Management

OpenMind: Know Your Customer

The new services in nagios: network bandwidth utility, notification and sms alert in improving the network performance

CIT 470: Advanced Network and System Administration. Topics. Change Management. Change and Configuration Management

Network Monitoring and Management Introduction to Networking Monitoring and Management

Network monitoring systems & tools

USING OPEN SOURCE SOFTWARE IN DAILY ISP OPERATIONS

A SURVEY ON AUTOMATED SERVER MONITORING

Introduction to system monitoring with Nagios, Check_MK and Open Monitoring Distribution (OMD)

CAREN NOC MONITORING AND SECURITY

Network Monitoring. Review of Software

Tk20 Network Infrastructure

Chapter 4 Managing Your Network

MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME. COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix)

Topics. CIT 470: Advanced Network and System Administration. Logging Policies. System Logs. Throwing Away. How to choose a logging policy?

WEB SERVER MONITORING SORIN POPA

pt360 FREE Tool Suite Networks are complicated. Network management doesn t have to be.

White Paper. The Ten Features Your Web Application Monitoring Software Must Have. Executive Summary

iphouse has chosen LogicMonitor to offer a Software as a Service (SaaS) monitoring solution.

Upgrade Guide. Upgrading to EventTracker v6.0. Upgrade Guide Columbia Gateway Drive, Suite 250 Publication Date: Sep 20, 2007.

CSS ONEVIEW G-Cloud CA Nimsoft Monitoring

ITSM Service Monitoring Using Open Source Tools

System Administration

Network Monitoring Tools for Monitoring MPLS Links using PRTG Network Monitor Tool

HAOSCAR 2.0: an open source HA-enabling framework for mission critical systems

NMS300 Network Management System

CYAN SECURE WEB APPLIANCE. User interface manual

Network Monitoring. Lance Rea. Davis & Gilbert LLP lrea@dglaw.com

Web Application s Performance Testing

Network Monitoring & Management Log Management

MANAGING NETWORK COMPONENTS USING SNMP

Monitoring Windows Servers and Applications with GroundWork Monitor Enterprise 6.7. Product Application Guide October 8, 2012

AusCERT Remote Monitoring Service (ARMS) User Guide for AusCERT Members

Server Management. Statement of Work.

ntopng: Realtime Network Traffic View

NetCrunch 6. AdRem. Network Monitoring Server. Document. Monitor. Manage

Monitoring and Alerting

Managed Appliance Installation Guide

TPAf KTl Pen source. System Monitoring. Zenoss Core 3.x Network and

There are numerous ways to access monitors:

ENC Enterprise Network Center. Intuitive, Real-time Monitoring and Management of Distributed Devices. Benefits. Access anytime, anywhere

HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team

Kaseya 2. User Guide. for Network Monitor 4.1

E n d To E n d M o n i t o r i n g H y p e r i c H Q I n t e g r a t i o n W i t h O p e n N M S

Network Monitoring. By: Delbert Thompson Network & Network Security Supervisor Basin Electric Power Cooperative

Network Monitoring. Sebastian Büttrich, NSRC / IT University of Copenhagen Last edit: February 2012, ICTP Trieste

Network Monitoring with Nagios. Matt Gracie, Information Security Administrator Canisius College, Buffalo, NY

System Admin Module User Guide. Schmooze Com Inc.

Hanyang University Grid Network Monitoring

WhatsUp Gold v11 Features Overview

Nagios introduction. Dhruba Raj Bhandari (CCNA) Additions by Phil Regnauld.

Distributed Hosting. Service Description. Administrative Information Technology Services

Network Monitoring & Management Log Management

Open Source Network Monitoring Tools

Fifty Critical Alerts for Monitoring Windows Servers Best practices

Heroix Longitude Quick Start Guide V7.1

Parallels Plesk Panel

Free Network Monitoring Software for Small Networks

Part I: Overview. Core concepts presented:

WÜRTHPHOENIX NetEye Version 3

DOSarrest External MULTI-SENSOR ARRAY FOR ANALYSIS OF YOUR CDN'S PERFORMANCE IMMEDIATE DETECTION AND REPORTING OF OUTAGES AND / OR ISSUES

Monitoring of computer networks and applications using Nagios

UNIVERSITY OF KHARTOUM NETWORK MONITORING AND MANAGEMENT SYSTEM BY ABDULLA BASHIR ABDULLA KUBUR INDEX NO Supervisor Prof.

Availability Management Nagios overview. TEIN2 training Bangkok September 2005

Proactively Monitoring Departmental Clinical IT Systems with an Open Source Availability System

Features Overview Guide About new features in WhatsUp Gold v12

Rails Application Deployment. July Philly on Rails

WHITE PAPER September CA Nimsoft Monitor for Servers

CSE 265: System and Network Administration

20 Command Line Tools to Monitor Linux Performance

Applications Manager Best Practices document

K1000: Advanced Topics

SyAM Software Management Utilities. Creating Templates

Features Overview Guide About new features in WhatsUp Gold v14

SysPatrol - Server Security Monitor

Best of Breed of an ITIL based IT Monitoring. The System Management strategy of NetEye

CARENET-SE. NOC Tools Review. Communication System Design Summer Project team. Champion Björn Pehrson Coach Hans Eriksson

Effective MySQL Monitoring. Baron Schwartz March 2012

Do it Yourself System Administration

One software solution to monitor your entire network, including devices, applications traffic and availability.

PA Storage Monitor. Version 5.7 Ultra. Last Update: March 20, Power Admin LLC. Prepared in the USA

Transcription:

Topics CIT 470: Advanced Network and System Administration System Monitoring 1. Why monitoring? 2. Historical monitoring 3. Real-time monitoring 4. Monitoring techniques 5. Monit 6. Web-based monitoring tools CIT 470: Advanced Network and System Administration Slide #1 CIT 470: Advanced Network and System Administration Slide #2 Why Monitoring? If you aren t monitoring a service, you can t manage it. Why Monitoring? 1. Rapidly detect and fix problems. 2. Identify the source of problems. 3. Predict and avoid future problems. 4. Document an SA s achievements. CIT 470: Advanced Network and System Administration Slide #3 CIT 470: Advanced Network and System Administration Slide #4 Historical Monitoring Record long-term system statistics. Uptime. Performance. Security. Utilizations. Examples Web server uptime was 99.99% last year, compared to 99.9% the previous year. Peak network usage is 8 MBps, up from 5 MBps. Uses Capacity planning. Planning for reliability or security improvements. CIT 470: Advanced Network and System Administration Slide #5 Historical Monitoring Processes Polling Take measurements at regular intervals. Store database of measurements. Graph summaries of collected data. Measurement Tools iostat vmstat ps sar CIT 470: Advanced Network and System Administration Slide #6 1

Real-time Monitoring Alert SA to failures as they happen. Discover problems before customer does. Shorter outages. Better reputation. Real-time Monitor components Monitoring system (poll or alert). Notification system. Real-time Monitoring Techniques Polling Poll systems and applications for status. Ex: ping critical servers every 5 minutes. Alerting Many systems can send alerts to monitoring system when they detect a problem. Ex: RAID array logs a disk failure. CIT 470: Advanced Network and System Administration Slide #7 CIT 470: Advanced Network and System Administration Slide #8 Notification Types of notification 1. Email 2. Paging 3. Phone call Reliability 1. Notification system should not depend on system being monitored. 2. Email can fail or have long delays. 3. Pages are susceptible to third party failures and monitoring. CIT 470: Advanced Network and System Administration Slide #9 Escalation What if the SA is on vacation? Notifications need to be transferrable. Static: reconfigure notifier before vacation. Dynamic: configurable set of receipients. Ex: If SA doesn t respond in 1 hour, notify manager. CIT 470: Advanced Network and System Administration Slide #10 Types of monitoring Availability Watch for outages in network, host, apps. Ex: cannot reach mail server. Capacity Check thresholds for CPU, mem, disk, network. Ex: mail spool disk is 95% full Active Monitoring Active monitoring systems can fix problems. 1. Respond faster than a human can. 2. Can typically only implement temporary fix. 3. Can t fix all problems: bad disk, out of paper. Risks Reliability: Test active responses thoroughly before deployment. Security: Active monitor typically needs admin access on all monitored systems. CIT 470: Advanced Network and System Administration Slide #11 CIT 470: Advanced Network and System Administration Slide #12 2

Levels of Testing 1. Check server is pingable. Verifies connectivity from monitor only. 2. Check that application is up. Make a TCP connection to service port. Check process or service list. 3. End-to-end testing. Entire transaction as customer would do. Ex: send and receive an e-mail message. Starting monit [-v] Status monit status monit summary Running monit (also provides web interface on port 2812) Stopping monit quit CIT 470: Advanced Network and System Administration Slide #13 CIT 470: Advanced Network and System Administration Slide #14 Global configuration set daemon 60 set logfile syslog facility log_daemon set alert root@domain set mailserver my-server set httpd port 2812 address localhost allow localhost allow admin:monit Monitoring a Process check process apache with pidfile "/usr/local/apache/logs/httpd.pid" start = /etc/init.d/httpd start" stop = "/etc/init.d/httpd stop" if failed port 80 and protocol http and request "/cgi-bin/printenv" then restart if cpu usage is greater than 60 percent for 2 cycles then alert if cpu usage > 98% for 5 cycles then restart if 2 restarts within 3 cycles then timeout CIT 470: Advanced Network and System Administration Slide #15 CIT 470: Advanced Network and System Administration Slide #16 Monitoring a File # Rotate log if it gets too big check file access_log with path /var/log/access_log if size > 100 Mb then exec "/usr/sbin/logrotate -f rotate_apache_now # Restart Apache if config changes check file httpd.conf with path /usr/local/apache/conf/httpd.conf if changed checksum then exec "/usr/local/apache/bin/apachectl graceful" Monitoring CPU check system localhost if loadavg (1min) > 5 then alert if loadavg (5min) > 3 then alert if memory usage > 80% then alert if cpu usage (user) > 80% then alert CIT 470: Advanced Network and System Administration Slide #17 CIT 470: Advanced Network and System Administration Slide #18 3

Monitoring a Disk check device rootfs with path / if space usage > 90% then alert check device varfs with path /var if space usage > 90% then alert Monitoring Remote Hosts # Ping the host to see if it s up check host foo with address foo.com if failed icmp type echo with timeout 15 seconds then alert # Detailed test, accessing web services check host foo with address foo if failed port 80 protocol http and request /status then alert if failed port 443 type TCPSSL and protocol http with timeout 15 seconds then alert CIT 470: Advanced Network and System Administration Slide #19 CIT 470: Advanced Network and System Administration Slide #20 Monitoring Tools Nagios Ganglia Cacti Nagios Zabbix Hyperic HQ Munin ZenOSS OpenNMS GroundWork God Monit CIT 470: Advanced Network and System Administration Slide #22 Nagios Network Maps Nagios Graphs CIT 470: Advanced Network and System Administration Slide #23 CIT 470: Advanced Network and System Administration Slide #24 4

Zabbix Graphs References 1. Mark Burgess, Principles of System and Network Administration, Wiley, 2000. 2. Aeleen Frisch, Essential System Administration, 3 rd edition, O Reilly, 2002. 3. Mike Loukides and Gian-Paolo D. Musumeci, System Performance Tuning, 2 nd edition, O Reilly, 2003. 4. Monit doc, http://mmonit.com/monit/documentation/monit.pdf 5. Evi Nemeth et al, UNIX System Administration Handbook, 3 rd edition, Prentice Hall, 2001. 6. Wikipedia, http://en.wikipedia.org/wiki/comparison_of_network_monitoring_sy stems CIT 470: Advanced Network and System Administration Slide #25 CIT 470: Advanced Network and System Administration Slide #26 5