Monitoring an Openstack cluster



Similar documents
Introduction to Openstack, an Open Cloud Computing Platform. Libre Software Meeting

Postgres on OpenStack

Billing for OpenStack Cloud Services

CERN Cloud Infrastructure. Cloud Networking

An Introduction to OpenStack and its use of KVM. Daniel P. Berrangé

Utilization-based Scheduling in OpenStack* Compute (Nova)

SDN v praxi overlay sítí pro OpenStack Daniel Prchal daniel.prchal@hpe.com

Infrastructure as a Service (IaaS)

One-click Hadoop Cluster Deployment on OpenPOWER Systems Pradeep K Surisetty IBM. #OpenPOWERSummit

Are We Done Yet? Testing Your OpenStack Deployment

NephOS A Licensed End-to-end IaaS Cloud Software Stack for Enterprise or OEM On-premise Use.

Multi Provider Cloud. Srinivasa Acharya, Engineering Manager, Hewlett-Packard

cloud functionality: advantages and Disadvantages

Oracle OpenStack for Oracle Linux Release 1.0 Installation and User s Guide ORACLE WHITE PAPER DECEMBER 2014

An Intro to OpenStack. Ian Lawson Senior Solution Architect, Red Hat

vrealize Operations Management Pack for OpenStack

Design and Implementation of IaaS platform based on tool migration Wei Ding

Running an OpenStack Cloud for several years and living to tell the tale. Alexandre Maumené Gaëtan Trellu Tokyo Summit, November 2015

OpenStack IaaS. Rhys Oxenham OSEC.pl BarCamp, Warsaw, Poland November 2013

SUSE Cloud. OpenStack End User Guide. February 20, 2015

Deploying SwiftStack Object Storage for OpenStack Glance and Cinder Backups

Déployer son propre cloud avec OpenStack. GULL François Deppierraz

rackspace.com/cloud/private

Develop a process for applying updates to systems, including verifying properties of the update. Create File Systems

Ubuntu OpenStack Fundamentals Training

Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure

Building Multi-Site & Ultra-Large Scale Cloud with Openstack Cascading

With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments

Release Notes for Fuel and Fuel Web Version 3.0.1

rackspace.com/cloud/private

Installation Runbook for Avni Software Defined Cloud

OpenStack Introduction. November 4, 2015

Guide to the LBaaS plugin ver for Fuel

Sales Slide Midokura Enterprise MidoNet V1. July 2015 Fujitsu Limited

Intel Service Assurance Administrator. Product Overview

CloudCIX Bootcamp. The essential IaaS getting started guide.

Deploying workloads with Juju and MAAS in Ubuntu 13.04

Red Hat Enterprise Linux OpenStack Platform Update February 17, 2016

Mirantis

SUSE Cloud Installation: Best Practices Using a SMT, Xen and Ceph Storage Environment

Introduction to OpenStack

Red Hat Enterprise Linux OpenStack Platform 7 OpenStack Data Processing

SEACW DELIVERABLE D.1.6

Openstack. Cloud computing with Openstack. Saverio Proto

HP OpenStack & Automation

Scyld Cloud Manager User Guide

Change the Game with HP Helion

How To Use Anibom Smart Cloud For Business

OpenStack Assessment : Profiling & Tracing

An Efficient Failover Enabling Mechanism in OpenStack

OpenStack/Quantum SDNbased network virtulization with Ryu

Getting Started with OpenStack and VMware vsphere TECHNICAL MARKETING DOCUMENTATION V 0.1/DECEMBER 2013

TUT5605: Deploying an elastic Hadoop cluster Alejandro Bonilla

rackspace.com/cloud/private

การใช งานและต ดต งระบบ OpenStack ซอฟต แวร สาหร บบร หารจ ดการ Cloud Computing เบ องต น

Automated Configuration of Open Stack Instances at Boot Time

Today. 1. Private Clouds. Private Cloud toolkits. Private Clouds and OpenStack Introduction

Performance of Network Virtualization in Cloud Computing Infrastructures: The OpenStack Case.

Cloud on TEIN Part I: OpenStack Cloud Deployment. Vasinee Siripoonya Electronic Government Agency of Thailand Kasidit Chanchio Thammasat University

How To Make A Cloud Work For You

Mirantis OpenStack 6. with VMware vcenter and NSX. Mirantis Reference Architecture. US HEADQUARTERS Mountain View, CA

Cloud Cruiser and Azure Public Rate Card API Integration

SUSE Cloud Installation: Best Practices Using an Existing SMT and KVM Environment

SMB in the Cloud David Disseldorp

HCIbench: Virtual SAN Automated Performance Testing Tool User Guide

Open Source Networking for Cloud Data Centers

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

How To Install Openstack On Ubuntu (Amd64)

Project Documentation

OpenStack Awareness Session

4 SCS Deployment Infrastructure on Cloud Infrastructures

HA for Enterprise Clouds: Oracle Solaris Cluster & OpenStack

Isabell Sippli Cloud Architect, Lab Based Services IBM Software Group 2013 IBM Corporation

How OpenStack is implemented at GMO Public Cloud service

Overlay networking with OpenStack Neutron in Public Cloud environment. Trex Workshop 2015

HP Helion OpenStack Carrier Grade REST API Copyright Hewlett-Packard Development Company, L.P. All rights reserved.

7 Ways OpenStack Enables Automation & Agility for KVM Environments

Improved metrics collection and correlation for the CERN cloud storage test framework

Rally Installation Guide

HPSA Agent Characterization

OpenStack & Hyper-V. Alessandro Pilo- CEO Cloudbase

AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013

INTRODUCTION TO CLOUD MANAGEMENT

Apache Stratos (incubating) M5 Installation Guide

Deploying complex applications to Google Cloud. Olia Kerzhner

Mirantis OpenStack Express: Security White Paper

Microsoft Azure. IaaS Networking Storage. Stefan Geiger Gerry

How SUSE Is Helping You Rock The Public Cloud

version 6.1 Monitoring Guide

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2

CISCO CLOUD SERVICES PRICING GUIDE AUSTRALIA

Transcription:

Monitoring an Openstack cluster with icinga/nagios Benedikt Trefzer Cirrax GmbH September 2015

Cirrax GmbH Cirrax GmbH since 2011 based in Bern (Switzerland) Linux and network consulting and engineering Project Management Private and Public OpenStack Cloud Active contributors to OpenStack and other OpenSource projects

Our objectives one tool for system status: icinga/nagios monitor generic resources like memory, disk, CPU etc. monitor external availability of services like https, ping etc. openstack-nagios-plugins to monitor openstack health similar to OpenStack client tools written in python use OpenStack libraries use nagiosplugin library hosted on github contributions welcome!

Check services./check_nova-services $ nova hypervisor-stats +----------------------+-------+ Property Value +----------------------+-------+ count 2 # 2 compute nodes current_workload 0 disk_available_least 98 free_disk_gb 97 free_ram_mb 6394 local_gb 98 local_gb_used 1 memory_mb 7930 # total memory memory_mb_used 1536 # memory uses by vm s running_vms 1 vcpus 4 # total vcpus vcpus_used 1 # vcpus used by vm s +----------------------+-------+ $./check_nova-hypervisors NOVAHYPERVISORS OK - [memory_used:1536 memory_percent:19 vcpus_used:1 vcpus_percent:25 running_vms:1] memory_percent=19;90;95;0;100 memory_used=1536;;;0;7930 running_vms=1;;;0 vcpus_percent=25;90;95;0;100 vcpus_used=1;;;0;4

Check services./check_nova-services $ nova service-list +----+------------------+--------+------+----------+-------+--------------+-------------+ Id Binary Host Zone Status State Updated at Disabled R. +----+------------------+--------+------+----------+-------+--------------+-------------+ 1 nova-conductor 0.t.ch nova enabled up...t14:09:02-2 nova-consoleauth 0.t.ch nova enabled up...t14:09:05-3 nova-scheduler 0.t.ch nova enabled up...t14:09:04-4 nova-cert 0.t.ch nova enabled up...t14:09:04-5 nova-compute 1.t.ch nova enabled up...t14:09:03-6 nova-compute 2.t.ch nova enabled down...t14:09:01-7 nova-compute 3.t.ch nova disabled down...t14:09:09 Maintenance +----+------------------+--------+------+----------+-------+--------------+-------------+ $./check_nova-services NOVASERVICES CRITICAL - [up:5 disabled:1 down:2 total:7] disabled=1;@1:;;0 down=... $./check_nova-services --host 1.t.ch NOVASERVICES OK - [up:1 disabled:0 down:0 total:1] disabled=0;@1:;;0 down=... $./check_nova-services --binary nova-compute NOVASERVICES CRITICAL - [up:1 disabled:1 down:2 total:3] disabled=1;@1:;;0 down=...

Check services/agents cinder and neutron $ cinder service-list +------------------+--------+------+---------+-------+--------------+ Binary Host Zone Status State Updated_at +------------------+--------+------+---------+-------+--------------+ cinder-scheduler 0.t.ch nova enabled up...t09:26:42 cinder-volume 0.t.ch nova enabled up...t09:26:46 +------------------+--------+------+---------+-------+--------------+ $./check_cinder-services CINDERSERVICES OK - [up:2 disabled:0 down:0 total:2] disabled=0;@1:;;0 down=... $ neutron agent-list +------+--------------------+--------+-------+----------------+---------------------------+ id agent_type host alive admin_state_up binary +------+--------------------+--------+-------+----------------+---------------------------+ 6... Loadbalancer agent 0.t.ch :-] True neutron-lbaas-agent e... L3 agent 0.t.ch :-] True neutron-l3-agent b... Open vswitch agent 2.t.ch :-] True neutron-openvswitch-agent b... Open vswitch agent 1.t.ch :-] True neutron-openvswitch-agent 7... Open vswitch agent 0.t.ch :-] True neutron-openvswitch-agent e... Metadata agent 0.t.ch :-] True neutron-metadata-agent 1... Metering agent 0.t.ch :-] True neutron-metering-agent 7... DHCP agent 0.t.ch :-] True neutron-dhcp-agent +------+--------------------+--------+-------+----------------+---------------------------+ $./check_neutron-agents NEUTRONAGENTS OK - [up:8 disabled:0 down:0] disabled=0;@1:;;0 down=...

Floating IP s $ neutron floatingip-list +------+------------------+---------------------+---------+ id fixed_ip_address floating_ip_address port_id +------+------------------+---------------------+---------+ f... xxx.xxx.xxx.9 5... 192.168.0.13 xxx.xxx.xxx.20 4... 2... 192.168.0.12 xxx.xxx.xxx.3 2... +------+------------------+---------------------+---------+ $./check_neutron-floatingips -c 0:230 -w 0:200 NEUTRONFLOATINGIPS OK - [assigned:3 used:2] assigned=3;200;230;0 used=2;;;0

Ceilometer statistics ceilometer stores samples for events in the cloud regularly triggered audit events for usage data is used to measure past usage of OpenStack (eg for billing) $./check_ceilometer-statistics -m volume CEILOMETERSTATISTICS OK - [age:26.21m count:88samples value:1volume] age=... we use this test to verify freshness of meters.

Rally Rally is a benchmarking tool run automated scenarios on a deployed cloud example scenario: boot and delete server one rally run: several iterations of different scenarios possibility to specify SLA

Rally Screenshots

Rally Nagios/Icinga and Rally Rally stores all result in database Output as HTML or json summarize json result for nagios/icinga $ rally task results./check_rally-results RALLYRESULTS OK - [errors:0 slafail:0] errors=0;0;0 fulldur=542.087836981s loaddur=222.671538115s slafail=0;0;0 total=46

links and questions this presentation: https://cirrax.com/downloads/2015_openstackmonitoring.pdf openstack-nagios-plugins: https://github.com/cirrax/openstack-nagios-plugins Rally: https://wiki.openstack.org/wiki/rally nagiosplugin library: https://pypi.python.org/pypi/nagiosplugin/ contact: benedikt.trefzer@cirrax.com This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.