Ganglia & Nagios. Maciej Lasyk, 11. Sesja Linuksowa, Wrocław, 2014-04-06

1 Ganglia & Nagios Maciej Lasyk 11. Sesja Linuksowa Wrocław, 2014-04-06 1/25

2 Ganglia... what? Ganglia: clusters / groups of neurons found outside the central nervous system 2/25

6 Just a little about monitoring 3/25
- the need for monitoring
- measuring availability
- measuring performance
- gathering additional metrics

14 Monitoring is critical for HA 4/25
How to measure availability?
A = Uptime / (Uptime + Downtime)
- MTTD (Mean Time to Diagnose): the average time it takes to diagnose the problem
- MTTR (Mean Time to Repair): the average time it takes to fix a problem
- MTTF (Mean Time to Failure): the average time the service behaves correctly
- MTBF (Mean Time Between Failures): the average time between successive failures of the service
A = MTTF / MTBF = MTTF / (MTTF + MTTD + MTTR)
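The formula A = MTTF / (MTTF + MTTD + MTTR) can be tried out with a few lines of Python. This is only a sketch: the function names and the sample figures (999 hours of correct behavior, 15 minutes to diagnose, 45 minutes to repair) are made up for illustration and do not come from the talk.

```python
def availability(mttf_hours: float, mttd_hours: float, mttr_hours: float) -> float:
    """Return A = MTTF / MTBF, where MTBF = MTTF + MTTD + MTTR."""
    mtbf_hours = mttf_hours + mttd_hours + mttr_hours
    if mtbf_hours <= 0:
        raise ValueError("MTBF must be positive")
    return mttf_hours / mtbf_hours


def yearly_downtime_hours(a: float) -> float:
    """Expected downtime per (non-leap) year for a given availability."""
    return (1.0 - a) * 365 * 24


if __name__ == "__main__":
    # Hypothetical figures: 999 h working, 0.25 h to diagnose, 0.75 h to repair.
    a = availability(mttf_hours=999, mttd_hours=0.25, mttr_hours=0.75)
    print(f"availability: {a:.4%}")
    print(f"downtime per year: {yearly_downtime_hours(a):.2f} h")
```

With these numbers MTBF is 1000 hours, so availability is 99.9% ("three nines"), which works out to roughly 8.76 hours of downtime per year. Cutting MTTD (better monitoring) raises A just as cutting MTTR does.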

16 What should we monitor? 5/25
- hardware housing
- devices
- storage
- network
- hosts
- software (very deep hole)
Think dependencies!

20 When an outage hits us, don't panic! 6/25
- Notifications
- Escalations: L1 <-> L2 <-> L3 <-> L4 lol ;) desktop support / devs / ops / networking / / storage / middleware / dc / security
- Clock is ticking, so it should be simple
- What if a cell phone is offline or someone is out?

24 Monitoring: notification issues 7/25
- false positives
- major events
- failover notifications?
- tolerance & critical thresholds

28 Monitoring: reporting 8/25
- baseline
- correlation between incidents and change management
- trending info
- reporting

35 Monitoring: good practices 9/25
- don't NIH!
- DVCS
- testing envs
- think usability!
- passive checks
- automate, don't hardcode
- security

36 Monitoring: good practices Last but not least... Quis custodiet ipsos custodes? (Who will guard the guards?) 9/25

42 Nagios recap 10/25
Host / Services / Contacts
- hosts, host groups
- services, service groups
- templates
- time periods
- host and service dependencies
- regular expressions

43 Nagios recap 10/25


47 Nagios recap 10/25
Checks and states
- frequencies & thresholds
- scheduling downtimes
- outages and flapping
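To make "flapping" concrete: a service flaps when its state changes too often within its recent check history. The sketch below is a simplified illustration, not Nagios's actual algorithm. Nagios looks at a fixed window of recent check results and weights newer state changes more heavily, while this version is unweighted. The function names and the 5% / 20% thresholds are assumptions chosen for the example.

```python
from typing import Sequence


def percent_state_change(history: Sequence[str]) -> float:
    """Percentage of consecutive check pairs whose state differs (unweighted)."""
    if len(history) < 2:
        return 0.0
    changes = sum(1 for prev, cur in zip(history, history[1:]) if prev != cur)
    return 100.0 * changes / (len(history) - 1)


def update_flapping(is_flapping: bool, history: Sequence[str],
                    low: float = 5.0, high: float = 20.0) -> bool:
    """Apply hysteresis: start flapping at or above `high`, stop below `low`."""
    pct = percent_state_change(history)
    if not is_flapping and pct >= high:
        return True
    if is_flapping and pct < low:
        return False
    return is_flapping


if __name__ == "__main__":
    noisy = ["OK", "CRITICAL", "OK", "OK", "CRITICAL"]
    print(percent_state_change(noisy), update_flapping(False, noisy))
```

The two thresholds give hysteresis: a service that hovers around one threshold does not itself flap in and out of the flapping state, which would otherwise produce exactly the notification noise this mechanism is meant to suppress.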

52 Nagios recap 10/25
Notifications
- periods
- groups
- which states to be notified about?
- escalations / rotations
- custom notification methods

53 Nagios recap 10/25
Monitoring remote hosts
- NRPE daemons
- checks via SSH

54 Nagios recap Web interface tactical overview 10/25

55 Nagios recap Web interface availability reports 10/25

56 Nagios recap Web interface trends 10/25

57 Nagios recap Web interface network maps 10/25

58 Networking recap Unicast 11/25

59 Networking recap Multicast 11/25

60 Networking recap Broadcast 11/25

61 Ganglia: what is it? 12/25
- Problems of big scale: 20k hosts with a zillion metrics probed every 10 seconds
- It is fully redundant (until you break it)
- It is very scalable
- Regexp searches and ad hoc creation of views :)

62 Ganglia architecture 13/25


64 Ganglia topologies Default multicast topology 14/25

65 Ganglia topologies Deaf / mute multicast topology 14/25

66 Ganglia topologies Unicast topology 14/25

67 Ganglia topologies Gmetad topology 14/25

68 Ganglia topologies Gmetad HA topology (active - active) 14/25

69 Ganglia topologies Gmetad hierarchical topology 14/25

70 Ganglia RRDcached 15/25

71 Ganglia sflow 16/25

72 Ganglia web (grid view) 17/25

73 Ganglia web (cluster view) 17/25

74 Ganglia web (physical view) 17/25

75 Ganglia web (host view) 17/25

76 Ganglia web (compare hosts) 17/25

77 Ganglia web (events) 17/25
Events have a JSON-based API
Think integration with whatever app :)

78 Ganglia web (dashboards) 17/25
- Create view -> apply as dashboard
- Create dashboard from XML
- Generate graphs and add to views

79 Ganglia web (graphs) 17/25

87 Ganglia metrics 18/25
- base / extended metrics
- own modules
- c / c++
- mod_python
- spoofing
- gmetric
- gmetric4j / java
- Which to choose? gmetric / python / c/c++?
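Of the options listed, gmetric is the quickest to try, because it is a command-line tool that any script can call. The sketch below only builds the argument list. The helper name `gmetric_command` and the metric `queue_depth` are made up for illustration. The long options used (`--name`, `--value`, `--type`, `--units`, `--spoof`) are gmetric's, but check `gmetric --help` on your version before relying on them.

```python
from typing import List, Optional


def gmetric_command(name: str, value: object, metric_type: str = "float",
                    units: str = "", spoof: Optional[str] = None) -> List[str]:
    """Build (but do not run) a gmetric invocation for a single metric.

    `spoof` is an "ip:hostname" pair, for reporting on behalf of another host.
    """
    cmd = ["gmetric", "--name", name, "--value", str(value), "--type", metric_type]
    if units:
        cmd += ["--units", units]
    if spoof:
        cmd += ["--spoof", spoof]
    return cmd


if __name__ == "__main__":
    # Hypothetical metric; pass the list to subprocess.run(cmd, check=True)
    # on a host where gmetric and a gmond configuration are available.
    print(gmetric_command("queue_depth", 42, "uint32", "jobs"))
```

As a rough guide to "which to choose": gmetric forks a process per sample, so it suits infrequent or ad hoc metrics, while a gmond module (Python or C/C++) runs inside the daemon and is the better fit for many metrics collected on a schedule.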

88 Ganglia and logfiles? 19/25
ganglia-logtailer
- https://bitbucket.org/maplebed/ganglia-logtailer
- parses logfiles (in real time)
- pushes data to Ganglia (via gmetric)
- yup, based on specific log formats
- yet still open source, so poke around ;)
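The idea behind log tailing (parse new lines, aggregate, push the numbers as metrics) can be sketched as follows. This is not ganglia-logtailer's code or plugin API. The regular expression, the function names and the metric names are assumptions for illustration, and the log format is assumed to be a common access-log style line.

```python
import re
from collections import Counter
from typing import Dict, Iterable

# Status code that follows the quoted request, e.g. ... "GET / HTTP/1.1" 200 512
STATUS_RE = re.compile(r'"\s(\d{3})\s')


def count_status_classes(lines: Iterable[str]) -> Dict[str, int]:
    """Count responses per status class (2xx, 4xx, ...), skipping unparsable lines."""
    counts: Counter = Counter()
    for line in lines:
        match = STATUS_RE.search(line)
        if match:
            counts[match.group(1)[0] + "xx"] += 1
    return dict(counts)


def to_rates(counts: Dict[str, int], interval_seconds: float) -> Dict[str, float]:
    """Turn raw counts into per-second rates, keyed by a metric name."""
    return {f"http_{status_class}_per_sec": n / interval_seconds
            for status_class, n in counts.items()}


if __name__ == "__main__":
    sample = [
        '127.0.0.1 - - [06/Apr/2014:10:00:00 +0200] "GET / HTTP/1.1" 200 512',
        '127.0.0.1 - - [06/Apr/2014:10:00:01 +0200] "GET /x HTTP/1.1" 404 128',
    ]
    print(to_rates(count_status_classes(sample), interval_seconds=2))
```

Each resulting name/value pair would then be submitted through gmetric. Because the parsing depends on one specific log format, a different format needs its own pattern.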

89 So... Nagios + Ganglia! 20/25
3 ways of integration:
- ganglia-web/nagios (PHP & bash based) https://github.com/ganglia/ganglia-web
- ganglia-nagios-bridge (Python & cron based) https://github.com/ganglia/ganglia-nagios-bridge
- check-ganglia-metric (Python) https://github.com/ganglia/ganglia_contrib

90 Nagios + Ganglia: ganglia-web/nagios 21/25
https://github.com/ganglia/ganglia-web
Sending Nagios data to Ganglia: service_perfdata_command
Or replace Nagios checks with Ganglia!
- Check heartbeat.
- Check a single metric on a specific host.
- Check multiple metrics on a specific host.
- Check multiple metrics across a regex-defined range of hosts.

91 Nagios + Ganglia: ganglia-web/nagios Nagios pulls info from Ganglia via HTTP 21/25

92 Nagios + Ganglia: ganglia-nagios-bridge 22/25
- https://github.com/ganglia/ganglia-nagios-bridge
- Python script run e.g. from crontab
- pulls data from Ganglia XML via sockets
- parses the XML
- sends the data to Nagios
- Nagios processes only passive check results
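The pull, parse and submit flow can be sketched like this. It is not the bridge's actual code. The sample XML is a trimmed, hand-written stand-in for what gmond/gmetad emit (real output carries many more attributes), and the function names are made up. The `PROCESS_SERVICE_CHECK_RESULT` line follows the Nagios external command format for passive service checks.

```python
import xml.etree.ElementTree as ET
from typing import Iterator, Tuple

# Hand-written, heavily trimmed example of Ganglia's XML dump (an assumption).
SAMPLE_XML = """
<GANGLIA_XML VERSION="3.6.0" SOURCE="gmond">
  <CLUSTER NAME="web">
    <HOST NAME="host.example.com" IP="192.0.2.10">
      <METRIC NAME="cpu_idle" VAL="12.5" TYPE="float" UNITS="%"/>
    </HOST>
  </CLUSTER>
</GANGLIA_XML>
"""


def iter_metrics(ganglia_xml: str) -> Iterator[Tuple[str, str, str]]:
    """Yield (host, metric name, raw value) for every METRIC under every HOST."""
    root = ET.fromstring(ganglia_xml)
    for host in root.iter("HOST"):
        for metric in host.iter("METRIC"):
            yield host.get("NAME"), metric.get("NAME"), metric.get("VAL")


def passive_check_line(timestamp: int, host: str, service: str,
                       return_code: int, output: str) -> str:
    """Format one passive service check result as a Nagios external command."""
    return (f"[{timestamp}] PROCESS_SERVICE_CHECK_RESULT;"
            f"{host};{service};{return_code};{output}")


if __name__ == "__main__":
    for host, name, value in iter_metrics(SAMPLE_XML):
        # Return code 0 (OK) is hard-coded here; a real bridge applies thresholds.
        print(passive_check_line(1396778400, host, name, 0, f"OK - {name}={value}"))
```

A real run would read the XML from the gmond or gmetad TCP port and append the formatted lines to the Nagios external command file, whose path depends on the installation. The service name must match a service defined in Nagios with passive checks enabled.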

93 Nagios + Ganglia: check_ganglia_metric 23/25
- https://pypi.python.org/pypi/check_ganglia_metric/
- basically a Nagios plugin
- pulls data from Ganglia XML via sockets
  check_ganglia_metric.py \
    --gmetad_host=gmetad-server.example.com \
    --metric_host=host.example.com --metric_name=cpu_idle
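What "basically a Nagios plugin" means in practice: the program prints one status line and exits with 0 (OK), 1 (WARNING), 2 (CRITICAL) or 3 (UNKNOWN). The sketch below shows only that decision step for a metric where low values are bad, such as cpu_idle. It is not check_ganglia_metric's implementation, and the function name and the 20 / 10 thresholds are assumptions for illustration.

```python
from typing import Optional, Tuple

# Standard Nagios plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def evaluate_low_is_bad(name: str, value: Optional[float],
                        warn_below: float, crit_below: float) -> Tuple[int, str]:
    """Map a metric value to a (return code, status line) pair.

    A missing value yields UNKNOWN rather than OK, so a vanished metric
    is surfaced instead of being silently treated as healthy.
    """
    if value is None:
        return UNKNOWN, f"UNKNOWN - {name} not found"
    if value < crit_below:
        return CRITICAL, f"CRITICAL - {name}={value}"
    if value < warn_below:
        return WARNING, f"WARNING - {name}={value}"
    return OK, f"OK - {name}={value}"


if __name__ == "__main__":
    code, line = evaluate_low_is_bad("cpu_idle", 15.0, warn_below=20, crit_below=10)
    print(line)
    raise SystemExit(code)  # Nagios reads the state from the exit status
```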

95 Nagios + Ganglia 24/25
Which integration should I use?
Seriously: try them yourself and test

96 Freenode #ganglia https://lists.sourceforge.net/lists/listinfo/ganglia-general 24.5/25

97 Thank you :) Ganglia & Nagios Maciej Lasyk 11. Sesja Linuksowa, Wrocław 25/25


More information

Managing and Maintaining Windows Server 2008 Servers

Managing and Maintaining Windows Server 2008 Servers Managing and Maintaining Windows Server 2008 Servers Course Number: 6430A Length: 5 Day(s) Certification Exam There are no exams associated with this course. Course Overview This five day instructor led

More information

Ansible in Depth WHITEPAPER. ansible.com +1 800-825-0212

Ansible in Depth WHITEPAPER. ansible.com +1 800-825-0212 +1 800-825-0212 WHITEPAPER Ansible in Depth Get started with ANSIBLE now: /get-started-with-ansible or contact us for more information: info@ INTRODUCTION Ansible is an open source IT configuration management,

More information

SCF/FEF Evaluation of Nagios and Zabbix Monitoring Systems. Ed Simmonds and Jason Harrington 7/20/2009

SCF/FEF Evaluation of Nagios and Zabbix Monitoring Systems. Ed Simmonds and Jason Harrington 7/20/2009 SCF/FEF Evaluation of Nagios and Zabbix Monitoring Systems Ed Simmonds and Jason Harrington 7/20/2009 Introduction For FEF, a monitoring system must be capable of monitoring thousands of servers and tens

More information

The Remote Infrastructure Management Platform

The Remote Infrastructure Management Platform services capabilities The Remote Infrastructure Management Platform What is the Remote Infrastructure Management (RIM) Platform? As part of our Global Services Operating Architecture, the RIM platform

More information

Total Cloud Control with Oracle Enterprise Manager 12c. Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle

Total Cloud Control with Oracle Enterprise Manager 12c. Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle Total Cloud Control with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle 2 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert

More information

securityprobe 5E Standard

securityprobe 5E Standard securityprobe 5E Standard securityprobe 5E Standard Monitor the physical environment and receive alerts of any disturbances, such as unauthorized intruders, security breaches, high temperatures, smoke,

More information

Williamson County Technology Services Technology Project Questionnaire for Vendor (To be filled out withprospective solution provider)

Williamson County Technology Services Technology Project Questionnaire for Vendor (To be filled out withprospective solution provider) Williamson County Technology Services Technology Project Questionnaire for Vendor (To be filled out withprospective solution provider) General Project Questions Please provide the proposed timeline estimate:

More information

See all, manage all is the new mantra at the corporate workplace today.

See all, manage all is the new mantra at the corporate workplace today. See all, manage all is the new mantra at the corporate workplace today. 1) Do you want to schedule backups, software updates using just one consistent automation platform? 2) Do you want a product that

More information

Solution White Paper Boosting Digital Transformation BMC vs. HP

Solution White Paper Boosting Digital Transformation BMC vs. HP Solution White Paper Boosting Digital Transformation BMC vs. HP A head-to-head comparison between BMC and HP on capabilities designed to speed up digital business Table of Contents 1 EXECUTIVE SUMMARY

More information

LMT Lustre Monitoring Tools

LMT Lustre Monitoring Tools LMT Lustre Monitoring Tools April 13, 2011 Christopher Morrone, P. O. Box 808, Livermore, CA 94551 This work performed under the auspices of the U.S. Department of Energy by under Contract DE-AC52-07NA27344

More information

Five Secrets to SQL Server Availability

Five Secrets to SQL Server Availability Five Secrets to SQL Server Availability EXECUTIVE SUMMARY Microsoft SQL Server has become the data management tool of choice for a wide range of business critical systems, from electronic commerce to online

More information

AGENDA: INTRODUCTION: 1. How is our cloud monitoring setup? 2. Which are the tools used? 3. How do we access monitoring dashboard?

AGENDA: INTRODUCTION: 1. How is our cloud monitoring setup? 2. Which are the tools used? 3. How do we access monitoring dashboard? Nagios Introduction AGENDA: INTRODUCTION: 1. How is our cloud monitoring setup? 2. Which are the tools used? 3. How do we access monitoring dashboard? 4. What are the user id / password? 5. How to check

More information

Performance Analysis and Capacity Planing

Performance Analysis and Capacity Planing Performance Analysis and Capacity Planing collectd the system statistics collection daemon Sebastian tokkee Harl teamix GmbH / collectd core team Libre Software Meeting 2012 July 10, 2012

More information

NCTA Cloud Operations

NCTA Cloud Operations NCTA Cloud Operations 093018 Lesson 1: Cloud Operations Topic A: Overview of Cloud Computing Solutions Identify the core concepts of cloud computing. Operations Terminology Identify the terminology used

More information

Forcepoint Stonesoft Management Center

Forcepoint Stonesoft Management Center Datasheet Forcepoint Stonesoft Management Center EFFICIENT, CENTRALIZED MANAGEMENT OF FORCEPOINT STONESOFT NEXT GENERATION FIREWALLS IN DISTRIBUTED ENTERPRISE ENVIRONMENTS FORCEPOINT STONESOFT MANAGEMENT

More information

6367(Print), ISSN 0976-6375(Online), Volume 5, Issue 1, January (2014), IAEME TECHNOLOGY (IJCET) AN ENHANCED MONITORING MECHANISM FOR IAAS PLATFORMS

6367(Print), ISSN 0976-6375(Online), Volume 5, Issue 1, January (2014), IAEME TECHNOLOGY (IJCET) AN ENHANCED MONITORING MECHANISM FOR IAAS PLATFORMS International INTERNATIONAL Journal of Computer JOURNAL Engineering OF COMPUTER and Technology ENGINEERING (IJCET), ISSN 0976- & TECHNOLOGY (IJCET) ISSN 0976 6367(Print) ISSN 0976 6375(Online) Volume 5,

More information

Vistara Lifecycle Management

Vistara Lifecycle Management Vistara Lifecycle Management Solution Brief Unify IT Operations Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid

More information

Customer Evaluation Report On Incident.MOOG

Customer Evaluation Report On Incident.MOOG WHITE PAPER Customer Evaluation Report On Incident.MOOG (Real Data Provided by a Fortune 100 Company) For information about Moogsoft and Incident.MOOG, visit www.moogsoft.com. http://moogsoft.com 2011-2015

More information

SolarWinds Network Performance Monitor

SolarWinds Network Performance Monitor SolarWinds Network Performance Monitor powerful network fault & availabilty management Fully Functional for 30 Days SolarWinds Network Performance Monitor (NPM) makes it easy to quickly detect, diagnose,

More information

Host and Service Monitoring at SLAC

Host and Service Monitoring at SLAC Host and Service Monitoring at SLAC Alf Wachsmann Stanford Linear Accelerator Center alfw@slac.stanford.edu DESY Zeuthen, May 17, 2005 Monitoring at SLAC Alf Wachsmann 1 Monitoring at SLAC: Does not really

More information

Monitoring Infrastructure for Superclusters: Experiences at MareNostrum

Monitoring Infrastructure for Superclusters: Experiences at MareNostrum ScicomP13 2007 SP-XXL Monitoring Infrastructure for Superclusters: Experiences at MareNostrum Garching, Munich Ernest Artiaga Performance Group BSC-CNS, Operations Outline BSC-CNS and MareNostrum Overview

More information

Keynote Mobile Device Perspective

Keynote Mobile Device Perspective PRODUCT BROCHURE Keynote Mobile Device Perspective Keynote Mobile Device Perspective is a single platform for monitoring and troubleshooting mobile apps on real smartphones connected to live networks in

More information

Shield Your Applications from Catastrophe

Shield Your Applications from Catastrophe white paper Healthy or better-performing apps can save uptime, customers. Shield Your Applications from Catastrophe Whether they reside on the company s internal servers or are accessible via the Web from

More information

Using WebLOAD to Monitor Your Production Environment

Using WebLOAD to Monitor Your Production Environment Using WebLOAD to Monitor Your Production Environment Your pre launch performance test scripts can be reused for post launch monitoring to verify application performance. This reuse can save time, money

More information

Unlimited Server 24/7/365 Support

Unlimited Server 24/7/365 Support Unlimited Server 24/7/365 Support Unlimited Server Support from System Architects covers your Windows servers, Linux servers and virtual machines. Your servers are monitored and managed 24 hours a day,

More information

SOLARWINDS NETWORK PERFORMANCE MONITOR

SOLARWINDS NETWORK PERFORMANCE MONITOR DATASHEET SOLARWINDS NETWORK PERFORMANCE MONITOR Fault, Availability, Performance, and Deep Packet Inspection SolarWinds Network Performance Monitor (NPM) is powerful and affordable network monitoring

More information

Chapter 1 - Web Server Management and Cluster Topology

Chapter 1 - Web Server Management and Cluster Topology Objectives At the end of this chapter, participants will be able to understand: Web server management options provided by Network Deployment Clustered Application Servers Cluster creation and management

More information

MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME. COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix)

MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME. COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix) MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix) JANUARY 2010 Phase II -Network Monitoring System- Copyright The government

More information

Open Source Monitoring

Open Source Monitoring Open Source Monitoring Icinga Team Munich Monitoring Workshop 06/13/2012 WWW.ICINGA.ORG Agenda! Introduction! Tools and Platform! Icinga vs. Nagios! Architecture! New in Icinga! Current Development! Live

More information

Bringing Value to the Organization with Performance Testing

Bringing Value to the Organization with Performance Testing Bringing Value to the Organization with Performance Testing Michael Lawler NueVista Group 1 Today s Agenda Explore the benefits of a properly performed performance test Understand the basic elements of

More information

Novell Remote Manager Administration Guide

Novell Remote Manager Administration Guide www.novell.com/documentation Novell Remote Manager Administration Guide Open Enterprise Server 11 SP2 January 2014 Legal Notices Novell, Inc., makes no representations or warranties with respect to the

More information

NETWORK MONITORING SYSTEM TOOLS: AN EXPLORATORY APPROACH

NETWORK MONITORING SYSTEM TOOLS: AN EXPLORATORY APPROACH NETWORK MONITORING SYSTEM TOOLS: AN EXPLORATORY APPROACH Ochin 1, Jugnu Gaur 2 Faculty of Engineering & Technology, Manav Rachna International University,Faridabad, India ochin.fet@gmail.com 1, jugnugaur@gmail.com

More information

Open PostgreSQL Monitoring

Open PostgreSQL Monitoring Open PostgreSQL Monitoring A brand new & free monitoring suite Jean-Paul Argudo / Julien Rouhaud Dalibo - www.dalibo.org PostgreSQL Conference Europe 2014 - October, 22 1 / 26 Origins Comparison with Oracle

More information

Network Management Deployment Guide

Network Management Deployment Guide Smart Business Architecture Borderless Networks for Midsized organizations Network Management Deployment Guide Revision: H1CY10 Cisco Smart Business Architecture Borderless Networks for Midsized organizations

More information