How To Run A Powergen On A Network With Nagios (Networking) On A Microsoft Powergen 2.5.2 (Netware) On Your Computer Or Network With A Network (Network) On An Ipnet (



Similar documents
Extensible Monitoring with Nagios and Messaging Middleware. Abstract

Nagios. cooler than it looks. Wednesday, 31 October 2007

ICINGA2 OPEN SOURCE MONITORING

Introduction to system monitoring with Nagios, Check_MK and Open Monitoring Distribution (OMD)

Nagios and Cloud Computing

Linux Syslog Messages in IBM Director

CURRENT STATE OF ICINGA

Monitoring a Linux Mail Server

How we monitor 1 billion km of monthly ride sharing

Advanced System Monitoring

GroundWork Monitor Open Source Installation Guide

Evaluation of standard monitoring tools(including log analysis) for control systems at Cern

Nagios at Funet. Teemu Kiviniemi, CSC/Funet 6th June th TF-NOC meeting Dublin, Ireland

Oracle Managed File Getting Started - Transfer FTP Server to File Table of Contents

Nagios. Contents. From SME Server. Maintainer

Maintaining Non-Stop Services with Multi Layer Monitoring

PANDORA FMS NETWORK DEVICES MONITORING

PANDORA FMS NETWORK DEVICE MONITORING

Quick Start Guide. User Manual. 1 March 2012

The Check_MK monitoring system. Open Source Days 2016, Copenhagen Speaker: Troels Arvin Slides:

SIG-NOC Meeting - Stuttgart 04/08/2015 Icinga - Open Source Monitoring


Publish Acrolinx Terminology Changes via RSS

Zenoss Discovery and Classification

Bernd Ahlers Michael Friedrich. Log Monitoring Simplified Get the best out of Graylog2 & Icinga 2

Lab Management, Device Provisioning and Test Automation Software

PHD Virtual Backup for Hyper-V

ORACLE NOSQL DATABASE HANDS-ON WORKSHOP Cluster Deployment and Management

Tutorial: Load Testing with CLIF

A Tool for Evaluation and Optimization of Web Application Performance

Using New Relic to Monitor Your Servers

Configuring Nex-Gen Web Load Balancer

PANDORA FMS OFFICIAL TRAINING

Altaro Hyper-V Backup - Offsite Backups & Seeding Guide

NGS BioAssist- meeting Wai Yi Leung (LUMC)

Strengths and Limitations of Nagios as a Network Monitoring Solution

There are numerous ways to access monitors:

Certified Selenium Professional VS-1083

User Manual op5 Monitor 3.0

Availability Management Nagios overview. TEIN2 training Bangkok September 2005

Install guide for Websphere 7.0

Ansible in Depth WHITEPAPER. ansible.com

Log Analyzer Reference

/ Administrator Training PAT-2012

HP Device Manager 4.6

Cloud Computing. Lectures 3 and 4 Grid Schedulers: Condor

Server Configuration and Customization Guide. Operations Center 5.0

Scaling Graphite Installations

Chapter 1. Backup service

Pro Puppet. Jeffrey McCune. James TurnbuII. Apress* m in

Running Windows on a Mac. Why?

How To Monitor Your Computer With Nagiostee.Org (Nagios)

Arena Tutorial 1. Installation STUDENT 2. Overall Features of Arena

TF-NOC Dublin. Alexandros Kosiaris GRNET NOC Use puppet and network inventory to populate nagios/icinga configuration

University of Southern California Shibboleth High Availability with Terracotta

Log managing at PIC. A. Bruno Rodríguez Rodríguez. Port d informació científica Campus UAB, Bellaterra Barcelona. December 3, 2013

Rsync: The Best Backup System Ever

DMZ Secure Proxy Environment setup for IP Forwarding

Enabling NetFlow on Virtual Switches ESX Server 3.5

User and Programmer Guide for the FI- STAR Monitoring Service SE

Scheduling in SAS 9.4 Second Edition

How To Monitor A Server With Zabbix

Icinga Open Source Monitoring

Scheduling in SAS 9.3

Cloud Computing. Up until now

Veritas Cluster Server

Product Summary RADIUS Servers

GroundWork Monitor Open Source Readme

Automate the monitoring of your Network through PMp

An idea on how to monitor growth of database tables in an system using Bischeck

XpoLog Center Suite Log Management & Analysis platform

1. Product Information

Monitoring MySQL. Presented by, MySQL & O Reilly Media, Inc. A quick overview of available tools

DEPLOYMENT GUIDE Version 1.0. Deploying the BIG-IP LTM with the Nagios Open Source Network Monitoring System

The objective of this lab is to learn how to set up an environment for running distributed Hadoop applications.

Continuous Integration (CI) and Testing - Configuring Bamboo, Hudson, and TestMaker

Open Source Monitoring

Mid term report, working on title... Freddy Suther, Frank Suther March 14, 2013

Online Backup Client User Manual Linux

Pcounter Web Report 3.x Installation Guide - v Pcounter Web Report Installation Guide Version 3.4

TSM Studio Server User Guide

PANDORA FMS OFFICIAL TRAINING

Nagios Web Service Checker

JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers

An Introduction to High Performance Computing in the Department

Enterprise Remote Control 5.6 Manual

TEST AUTOMATION FRAMEWORK

Version Control Your Jenkins Jobs with Jenkins Job Builder

Unitrends Virtual Backup Installation Guide Version 8.0

Software Automated Testing

Automated System Monitoring

Using sftp in Informatica PowerCenter

Notes on network monitoring, by Oliver Gorwits

IceWarp Server. Log Analyzer. Version 10

Practical Continuous Integration

Moving beyond hardware

EVALUATION ONLY. WA2088 WebSphere Application Server 8.5 Administration on Windows. Student Labs. Web Age Solutions Inc.

The Monitis Monitoring Agent ver. 1.2

RecoveryVault Express Client User Manual

Tools and strategies to monitor the ATLAS online computing farm

Transcription:

Mod-Gearman Distributed Monitoring based on the Gearman Framework Sven Nierlein 18.10.2012

Consol http://www.consol.de/open-source-monitoring/ 2

Introduction Common Scenarios Installation Configuration Performance Data Improved Plugin Output Exports Tools Performance 3

Introduction 4

Introduction Gearman Distributes tasks across the network from multiple clients to multiple worker Load balancing Client/Worker supports C, Java, Perl, PHP, Python and Shell Asynchronous 5

Introduction Nagios Mod-Gearman NEB Checkresults Checks / Events Perfdata / Exports Gearman Daemon Checkresults Checks / Events Perfdata Mod-Gearman Worker Perfdata Checkresults PNP4Nagios Worker Tools: send_gearman send_multi 6

Common Scenarios 7

Load Reduction & Non Blocking Nagios Worker hosts=yes services=yes eventhandler=yes hosts=yes services=yes eventhandler=yes Pros Move blocking events away from Nagios core (Eventhandler, on-demand hostchecks) Reduce forking overhead from huge nagios core Even reduces load when both are on the same host 8

Load Balancing Worker Nagios Worker hosts=yes services=yes eventhandler=yes hosts=yes services=yes eventhandler=yes hosts=yes services=yes eventhandler=yes Pros Spread load across multiple hosts 9

Distributed Setup Worker Nagios Worker hosts=no services=no eventhandler=no hostgroups=remote hosts=yes services=yes eventhandler=yes hostgroups=remote hosts=yes services=yes eventhandler=yes Pros Easy replacement for remote nagios installations Central configuration 10

Distributed & Load Balancing Worker hosts=no services=no eventhandler=no hostgroups=remote Nagios hosts=yes services=yes eventhandler=yes hostgroups=remote Worker hosts=yes services=yes eventhandler=yes Worker hosts=no services=no eventhandler=no hostgroups=remote Pros Active/active remote sites Worker hosts=yes services=yes eventhandler=yes 11

Distributed & Load Balancing + Graphing Worker hosts=no services=no eventhandler=no hostgroups=remote Nagios hosts=yes services=yes eventhandler=yes hostgroups=remote perfdata=yes Worker hosts=yes services=yes eventhandler=yes Worker Worker hosts=no services=no eventhandler=no hostgroups=remote PNPWorker hosts=yes services=yes eventhandler=yes 12

Check Serialization Nagios hosts=no services=no eventhandler=no servicegroups=serial Worker hosts=no services=no eventhandler=no servicegroups=serial max-worker=1 Pros Useful for non-serializable checks (ex. check_selenium, java checks. etc...) parallelize_check has been removed in Nagios 3.x Works better than max_concurrent_checks 13

Installation 14

Installation Standalone Packages are available for Centos/Redhat/SLES http://mod-gearman.org/pkg/ including Gearmand Mod-Gearman is part of the Debian 7, Wheezy Consol Labs Repository https://labs.consol.de/repo/ Packages for Mod-Gearman, Gearmand, Thruk, OMD OMD Mod-Gearman is included in OMD 15

Configuration 16

Configuration - NEB Module Load Broker Module nagios.cfg: broker_module=.../lib/mod_gearman/mod_gearman.o config=/etc/mod-gearman/server.cfg 17

Configuration NEB configuration should be the sum of all workers Worker hosts=yes services=yes eventhandler=yes = Nagios hosts=yes services=yes eventhandler=yes Worker hosts=yes services=yes eventhandler=no hostgroups=remote Worker hosts=no services=no eventhandler=yes + = Nagios hosts=yes services=yes eventhandler=yes hostgroups=remote 18

Configuration - Common config can be used to specify/include config files server list of gearmand servers to connect to encryption enable/disable encryption key plaintext key used for encryption keyfile read key from this file 19

Configuration - Queues services all servicechecks hosts all hostchecks hostgroups list of hostgroups going into a separate queue servicegroups list of servicegroups going into a separate queue eventhandler execute eventhandler with Mod-Gearman localhostgroups list of hostgroups not managed by Mod-Gearman localservicegroups list of servicegroups not managed by Mod-Gearman do_hostchecks can be used to manage hostchecks by Nagios 20

Configuration - Queues localservicegroups? Let Nagios take care about this check localhostgroups? Let Nagios take care about this check servicegroups? Put check in servicegroup queue: servicegroup_<groupname> hostgroups? Put check in hostgroup queue: hostgroup_<groupname> hosts=yes? services=yes? Put check in generic hosts queue Put check in generic services queue 21

Configuration - Queues by Custom Variable set queue by custom variable NEB: queue_custom_variable=worker Nagios: define host { }... _WORKER hostgroup_test Worker: hostgroups=test http://labs.consol.de/nagios/mod-gearman/#_how_to_set_queue_by_custom_variable 22

Configuration - Embedded Perl Embedded Perl has serious memory leaks bad for nagios process grows and gets slower and slower ok with Mod-Gearman worker processes will be renewed from time to time worker: enable_embedded_perl=on enable embedded perl use_embedded_perl_implicitly=off only when explicitly enabled by the script itself #!/usr/bin/perl # nagios: +epn 23

Configuration - Worker identifier unique name of this worker, defaults to hostname min-worker minimum number of total worker max-worker maximum number of total worker spawn-rate rate at which new worker will be spawned idle-timeout timeout in seconds before a idling worker exists max-jobs maximum number of jobs before a worker exists dupserver useful to send copy of result to other Gearmand server 24

Performance Data 25

Performance Data Nagios Mod-Gearman NEB Perfdata Gearman Daemon Perfdata PNP4Nagios Worker Config Set perfdata=yes in your Mod-Gearman neb configuration. Set process_performance_data=1 in your nagios.cfg. Adjust gearman options in process_perfdata.cfg and start pnp_gearman_worker. 26

Improved Plugin Output 27

Improved Plugin Output STDERR output included: display worker identifier on errors display stderr output for easy plugin debugging translated signal names 28

Exports 29

Exports Export core events and data into gearman queues Format is JSON Write worker in any language gearman supports (C, Java, Perl, PHP, Python and Shell) No need to poll for data all the time Example Syntax: export=<queue>:<returncode>:<callback>[,<callback>,...] mod_gearman_neb.cfg: export=log_queue:1:nebcallback_log_data Limited to a few callbacks currently: NEBCALLBACK_PROCESS_DATA NEBCALLBACK_TIMED_EVENT_DATA NEBCALLBACK_LOG_DATA 30

Tools 31

gearman_top Shows current state of all queues $ gearman_top -H localhost:4730 32

check_gearman Use as nagios plugin to check Gearmand and worker $./check_gearman -H localhost check_gearman CRITICAL - failed to connect to localhost:4730 - Connection refused $./check_gearman -H localhost check_gearman OK - 0 jobs running and 0 jobs waiting. Version: 0.25... 33

send_gearman Similar but extended functionality like send_nsca Can be used to send passive check result via Mod-Gearman Can send active results with --active Use --latency, --starttime, --finishtime to preserve those attributes too $./bin/send_gearman --server=mo --keyfile=etc/mod-gearman/secret.key \ --host='localhost' --service='ping' --message='ping OK' --returncode=0 34

send_multi P P Return multiple results from check_multi Basically: P P $ check_multi -r 256 -f check.cfg./bin/send_multi --config=mod_gearman.cfg --host=<host> Better: #!/bin/bash host=$1; shift; other=$* report="256" [ "$other"!= "" ] && report="13" out=`.../libexec/check_by_ssh -H $host -q -C ".../check_multi -f.../multi.cfg -r $report $other" 2>&1` rc=$? if [ `echo "$out" grep -c "CHILD"` -eq 0 -o "$other"!= "" ]; then echo "$out" exit $rc fi echo "$out".../send_multi config=.../mod_gearman.conf host=$host check_multi -i <subcheck> allows you to reschedule single checks from a multi.cfg $./better.sh # for all $./better.sh -i check17 # for a single check 35

gearman_proxy.pl Gearman Daemon (Main) Checkresults Checks / Events gearmanproxy.pl Checkresults Checks / Events Gearman Daemon (DMZ) Nagios All connections are initiated from the worker/client Use gearman_proxy.pl in case where it s not possible to directly access the gearmand from remote locations Mod-Gearman Worker 36

Thruk Thruks Dashboard has some Mod-Gearman related Panels 37

Performance 38

Performance Main reason for Mod-Gearman was making distributed monitoring easy but it s quite fast too all tests done with Livestatus and Mod-Gearman Module loaded tests were made on a single virtual machine 39

Performance Debian6 VM 2x2.5GHz with 2GB Ram + 2 external Worker nearly 2.000 active service checks per second! 40

Questions? 41

Resources http://labs.consol.de/nagios/mod-gearman/ http://gearman.org/ http://docs.pnp4nagios.org/de/pnp-0.6/modes#gearman_mode http://my-plugin.de/wiki/projects/check_multi/feed_passive http://packages.debian.org/de/source/sid/mod-gearman http://mod-gearman.org/pkg/ 42