Maintaining Non-Stop Services with Multi Layer Monitoring



Similar documents
PANDORA FMS NETWORK DEVICES MONITORING

WhatsUp Gold vs. Orion

How To Use Mindarray For Business

TPAf KTl Pen source. System Monitoring. Zenoss Core 3.x Network and

PANDORA FMS NETWORK DEVICE MONITORING

Monitoring and analyzing audio, video, and multimedia traffic on the network

Newton Linux User Group Graphing SNMP with Cacti and RRDtool

Juniper Networks Management Pack Documentation

SolarWinds Network Performance Monitor powerful network fault & availabilty management

A SURVEY ON AUTOMATED SERVER MONITORING

Network Monitoring Comparison

A recipe using an Open Source monitoring tool for performance monitoring of a SaaS application.

FUNCTIONAL OVERVIEW

SolarWinds Network Performance Monitor

mbits Network Operations Centrec

SolarWinds Network Performance Monitor

SOLARWINDS NETWORK PERFORMANCE MONITOR

Network Management Deployment Guide

SapphireIMS 4.0 BSM Feature Specification

Ein Unternehmen stellt sich vor. Nagios in large environments

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds.

Deploying the BIG-IP LTM with the Cacti Open Source Network Monitoring System

A FAULT MANAGEMENT WHITEPAPER

MySQL Enterprise Monitor

Vistara Lifecycle Management

Assignment # 1 (Cloud Computing Security)

Monitoring MySQL. Geert Vanderkelen MySQL Senior Support Engineer Sun Microsystems

How To Set Up Foglight Nms For A Proof Of Concept

Server & Application Monitor

WINDOWS SERVER MONITORING

Datasheet iscsi Protocol

MSP Center Plus Features Checklist

How To Monitor Mysql With Zabbix

Network Manager 6.1. Network operations management software. NEC Corporation

Cisco Application Networking Manager Version 2.0

Install and configure the Net- SNMP agent for Windows

Best of Breed of an ITIL based IT Monitoring. The System Management strategy of NetEye

WhatsUpGold. v3.0. WhatsConnected User Guide

Details. Some details on the core concepts:

White Paper. The Ten Features Your Web Application Monitoring Software Must Have. Executive Summary

Mark Bennett. Search and the Virtual Machine

IP SLAs Overview. Finding Feature Information. Information About IP SLAs. IP SLAs Technology Overview

Application Discovery Manager User s Guide vcenter Application Discovery Manager 6.2.1

ICND2 NetFlow. Question 1. What are the benefit of using Netflow? (Choose three) A. Network, Application & User Monitoring. B.

NMS300 Network Management System

MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?

CA Virtual Assurance/ Systems Performance for IM r12 DACHSUG 2011

End-to-End Network Centric Performance Management

NetBrain Workstation Professional Edition 2.3 Release notes

Using The Paessler PRTG Traffic Grapher In a Cisco Wide Area Application Services Proof of Concept

PERFORMANCE TESTING. New Batches Info. We are ready to serve Latest Testing Trends, Are you ready to learn.?? START DATE : TIMINGS : DURATION :

TITANXR Multi-Switch Management Software

SystemManager. Server Management Software. November, NEC Corporation, Cloud Platform Division, MasterScope Group

Aqua Connect Load Balancer User Manual (Mac)

MSP End User. Version 3.0. Technical Solution Guide

1 Data Center Infrastructure Remote Monitoring

One software solution to monitor your entire network, including devices, applications traffic and availability.

Network Monitoring. Easy, failsafe, and complete visibility of your network. Our customers have the same view as our NOC technicians.

AKIPS Network Monitor Installation, Configuration & Upgrade Guide Version 15. AKIPS Pty Ltd

Best Practices on Campus Network Monitoring. Ljubljana, October Vidar Faltinsen, UNINETT

XpoLog Competitive Comparison Sheet

Running custom scripts which allow you to remotely and securely run a script you wrote on Windows, Mac, Linux, and Unix devices.

Monitoring backbone networks

Network Management & Monitoring Overview

MONITORING EMC GREENPLUM DCA WITH NAGIOS

ENC Enterprise Network Center. Intuitive, Real-time Monitoring and Management of Distributed Devices. Benefits. Access anytime, anywhere

Heroix Longitude Quick Start Guide V7.1

Wait, How Many Metrics? Monitoring at Quantcast

IBM PureData System for Transactions. Technical Deep Dive. Jonathan Rossi, PureSystems Specialist

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

WhatsUp Gold 2016 Getting Started Guide

Managing your Red Hat Enterprise Linux guests with RHN Satellite

Huawei esight Brief Product Brochure

Whitepaper. Business Service monitoring approach

VCS Monitoring and Troubleshooting Using Brocade Network Advisor

SERVICE SCHEDULE DEDICATED SERVER SERVICES

ManageEngine (division of ZOHO Corporation) Infrastructure Management Solution (IMS)

End Your Data Center Logging Chaos with VMware vcenter Log Insight

Yahoo! Communities Architectures Ian Flint

Network Monitoring. By: Delbert Thompson Network & Network Security Supervisor Basin Electric Power Cooperative

8/26/2007. Network Monitor Analysis Preformed for Home National Bank. Paul F Bergetz

Introduction to system monitoring with Nagios, Check_MK and Open Monitoring Distribution (OMD)

Deployment Topologies

System Monitoring Using NAGIOS, Cacti, and Prism.

pt360 FREE Tool Suite Networks are complicated. Network management doesn t have to be.

Compare E SPIN NMS SaaS Plan & Addon

Orion Network Performance Monitor

How to Interform With Cisco Routers Through Software

Introduction to Junos Space Network Director

Basic & Advanced Administration for Citrix NetScaler 9.2

Cisco NetFlow Generation Appliance (NGA) 3140

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

Datasheet FUJITSU Cloud Monitoring Service

Implementing a secure high visited web site by using of Open Source softwares. S.Dawood Sajjadi Maryam Tanha. University Putra Malaysia (UPM)

TELCO challenge: Learning and managing the network behavior

Zabbix 1.8 Network Monitoring

Transcription:

Maintaining Non-Stop Services with Multi Layer Monitoring Lahav Savir System Architect and CEO of Emind Systems lahavs@emindsys.com www.emindsys.com

The approach Non-stop applications can t leave on their own More complex systems require more monitoring Proactive system monitoring Customized monitors Monitor each process, component and application separately and all as a whole Proactive correction of problems before they become noticeable by your customers. Allow application to function at maximum availability SNMP monitoring of application infrastructure Alerting of potential problem or situation prior to accuracy Visual layered display of the entire data center 2

Monitoring is not just for System Administrators but for Developers as well 4

The goal for monitoring is to keep track of the running services 24/7, find troubles as early as possible and keep you alerted (only when needed ) Good monitoring infrastructure provides you a quick and direct troubleshooting abilities via visual representation of the system status 5

Multi-Layered Monitoring Services Keeping SLA, End-to-end service, User experience monitors Applications Application proprietary monitors, Custom counters Operating Systems CPU, Memory, Disk, Network, Processes Unified Dashboard Infrastructure Network connections, network devices, Chassis, Routers, Load Balancers, Firewalls 6

Visualize the information Use Maps & Views Visualize the topology Visualize the application flow Focus on different layers Layered views (service & host groups) Network, Hardware, OS, Application Different roles looking for different info Graph Service performance Transactions, Success rates, Cache Aggregated view Cluster s average 7

Why multi layer? Correlated information on one uniform view Network throughput, CPU usage, Application usage Generate aggregated reports for different machines & layers Collect information from all nodes Switches, routers, firewalls, load balancers, storage, servers, applications Collect different types of data Utilization, throughput, concurrency, cache status Application performance, error rates Objective Find the root cause via visuals on the dashboard! Be aware of what s going on 8

Unified Dashboard Infrastructure Network connections, network devices, Chassis, Routers, Load Balancers, Firewalls 9

Infrastructure layer Hardware redundancy my be dangerous if you don t keep your eyes on it Administrators not always seeing the HW Redundant hardware can fool you (until it dies) Vendor specific MIBS / Syslog Today s hardware provides detailed status interfaces Power supplies, power usage Fans & temperature Disk controllers & drives Switch ports, interfaces Links, connections 10

HP HW monitoring 11

Network infrastructure health Link Quality Devices utilization Network throughput 12

Unified Dashboard 13 Operating Systems CPU, Memory, Disk, Network, Processes

Operating System health Monitoring of OS essentials CPU, Memory, Disk I/O, Network traffic, processes, services Use cases Failure on log cleanup > disks full 14

Unified Dashboard 15 Applications Application proprietary monitors, Custom counters

Applications health Transaction counters Measure transaction rates Measure counters on input & output % Success rates Success counters for primary operations 16

System input/output monitoring Outlined topology 17

Applications health Queues Processing backlog Semaphores / throttle usage Latency Measure the time it takes to process request / data chunk Optimizations Measure compression rates 18

Applications health DB synchronizations Replication status Replication backlog & delays 19

Applications health Cluster & topology monitoring Track application topology changes Indicate dependencies status 20

Application topology Outlined topology Routers Front-ends Transports Load Balancers Firewalls Firewalls Load Balancers 21

Services Keeping SLA, End-to-end service, User experience monitors Unified Dashboard 22

Cluster overall QoS Now it s a cluster, aggregated counters Want to know what users are experiencing 23

User Experience & Service QoS Simulate user behavior Latency 24 How long it takes to login How long it takes to send and receive a message How long it takes to check out Success rates What s the success rate of the user s operation Download speed What s the download speed from different locations What s your Content Delivery Network (CDN) performance

Recommendations Build a generic monitoring infrastructure with generic tools and interfaces Use embedded SNMP Net-snmp is extendable (also for Windows) PROXY proxy request to other SNMP agent (embedded) proxy -v 2c -c public 127.0.0.1:50910 1.3.6.1.4.1.15867.2000.3.6 1.3.6.1.4.1.15867.2000.3.6 PASS STDOUT based subagent pass.1.3.6.1.4.1.15867.2001 /bin/sh /usr/local/ixi/genericsubagent.sh EXEC run a script exec.1.3.6.1.4.1.15867.1100.20.10 axs-imap-test-stat /bin/bash /opt/mas/scripts/imap_tester.sh last_state SSH / Telnet 25

Using Status files Perfect for batch operations perl, python, php Status file TIMESTAMP:1276203703 STATUS:0 HOSTNAME:myserver Observer if [ $(get_time_delta ${file}) -gt ${max_d_s} ]; then 27 Fi err "Delta is greater than ${max_delta} hours" return if [ "$(parse_status_file ${file} STATUS)"!= "0" ]; then Fi err "Last backup status is not 0 return echo ${ok}

Command line based info. Command line applications DB, Softswitch, Etc. Example Snmpd.conf pass.1.3.6.1.4.1.15867.1.100 /bin/bash /usr/local/emind/replication_status.sh Run the script replication_status.sh Execute SQL Query - show slave status\g; Parse the output Return data to snmpd 28

Important to remember Define your goals, What s right for you? Don t over monitor Use methodologies and technologies that fit your network and needs, not the other way around. Build generic interfaces SNMP Simple command line No proprietary protocols Agnostic to the monitoring tool 29

Mobile Access 30

Leading tools Open source first Nagios Very generic, lot s of public plug-ins Easy to tweak and build it at your own style Zabix A complete monitoring solution Less customizable Cacti Graphing (RRD tool) Easy to configure Lot s of public templates 31

Monitoring on the Cloud Nagios + Dynamic Configuration = Dynagios! Key features Auto provisioning Add, Remove, Suspend, Unsuspended Machines are monitored base on their predefined profiles Machines can join / leave the monitor (purposely) Join on boot Leave on shutdown If crash happens alert will raise 32

Commercial tools Feature ManageEngine Orion OS Requirements Windows & Linux Windows Modular (applications, IP SLA, Yes Yes Netflow, Conf.) Very easy to deploy Yes Yes Multi vendor with lot s of templates Maps Yes Yes Less customizable but still flexible Est. price for 100 devices $10k $15k Yes Yes Yes Yes 33

34 Questions?