Efficient and Large-Scale Infrastructure Monitoring with Tracing

CloudOpen Europe 2013
julien.desfossez@efficios.com

Content
- Overview of tracing and LTTng
- LTTng features for Cloud Providers
- LTTng as a monitoring tool
  - Crash dumps
  - Real-time monitoring
  - Large-scale low-level tracing
- Infrastructure integration
- Performance results
- Virtualisation-specific analysis
- LTTngTop
- Future work

Tracing
- Recording run-time information without stopping the process
- Usually used during development to solve performance problems
- Lots of alternatives on Linux: LTTng, Perf, ftrace, SystemTap, strace, etc.

LTTng 2.x
- Unified user interface, API, kernel and user-space tracers
- Trace output in CTF (Common Trace Format)
- Low overhead
- Modules only (no kernel compilation needed)
- Shipped in distros: Ubuntu, Debian, SuSE, Fedora, Linaro, Wind River, etc.

Tracing session example
    $ lttng create
    $ lttng enable-event -k sched_switch
    $ lttng enable-event -k --syscall -a
    $ lttng start
    $ sleep 2
    $ lttng stop
    $ lttng view | wc -l
    8669
    $ lttng destroy

Tracing session example
    [11:30:42.204505464] (+0.000026604) sinkpad sys_read: { cpu_id = 3 }, { fd = 3, buf = 0x7FD06528E000, count = 4096 }
    ...
    [11:30:42.204601549] (+0.000021061) sinkpad sys_open: { cpu_id = 3 }, { filename = "/lib/x86_64-linux-gnu/libnss_compat.so.2", flags = 524288, mode = 54496 }
    ...
    [11:30:42.205484608] (+0.000006973) sinkpad sched_switch: { cpu_id = 1 }, { prev_comm = "swapper/1", prev_tid = 0, prev_prio = 20, prev_state = 0, next_comm = "rcuos/0", next_tid = 18, next_prio = 20 }
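The text output shown above is regular enough to post-process with a small script. The following is a minimal sketch, assuming each line follows the shape `[time] (+delta) hostname event: { context }, { payload }` as in the excerpt. The names `parse_line` and `parse_fields` are hypothetical helpers, not part of LTTng or babeltrace, and quoted values that contain a double quote are not handled.

```python
import re

# One event per line: [time] (+delta) hostname event: { context }, { payload }
LINE_RE = re.compile(
    r"\[(?P<time>\d{2}:\d{2}:\d{2}\.\d{9})\]\s+"
    r"\(\+(?P<delta>[\d.?]+)\)\s+"
    r"(?P<host>\S+)\s+"
    r"(?P<event>\w+):\s+"
    r"\{(?P<context>[^}]*)\},\s+"
    r"\{(?P<payload>[^}]*)\}"
)

# "key = value" pairs; a value is either a quoted string or runs up to the next comma.
FIELD_RE = re.compile(r'(\w+) = ("[^"]*"|[^,]+)')


def parse_fields(text):
    """Turn 'a = 1, b = "x"' into {'a': 1, 'b': 'x'}."""
    fields = {}
    for key, raw in FIELD_RE.findall(text):
        raw = raw.strip()
        if raw.startswith('"'):
            fields[key] = raw.strip('"')
            continue
        try:
            fields[key] = int(raw, 0)  # base 0 accepts decimal and 0x... hex
        except ValueError:
            fields[key] = raw  # keep anything unexpected as text
    return fields


def parse_line(line):
    """Return a dict describing one event, or None if the line does not match."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    delta = match.group("delta")
    return {
        "time": match.group("time"),
        # The delta is unknown ('?') when there is no previous event.
        "delta": None if "?" in delta else float(delta),
        "host": match.group("host"),
        "event": match.group("event"),
        "context": parse_fields(match.group("context")),
        "payload": parse_fields(match.group("payload")),
    }
```

For example, feeding it the `sys_read` line from the excerpt yields the event name `sys_read`, `cpu_id` 3 in the context, and the integer values of `fd`, `buf` and `count` in the payload.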

LTTng features for Cloud Providers
- LTTng 2.1 (12/2012): trace streaming
- LTTng 2.2 (06/2013): trace-file rotation
- LTTng 2.3 (09/2013): snapshots
- LTTng 2.4 (RC1 expected in November 2013): live trace reading

LTTng as a monitoring tool: Crash dumps
- Flight recorder
- Snapshot on demand
- Coredump handler (in extras/)

Flight recorder session + snapshot
    $ lttng create --snapshot
    $ lttng enable-event -k sched_switch
    $ lttng enable-event -k --syscall -a
    $ lttng start
    $ ...
    $ lttng snapshot record
    Snapshot recorded successfully for session auto-20131019-113803
    $ babeltrace /home/julien/lttng-traces/auto-20131019-113803/snapshot-1-20131019-113813-0/kernel/
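The flight-recorder idea can be illustrated with a bounded in-memory buffer: new events overwrite the oldest ones, and a snapshot copies whatever is in the buffer at that moment. This is a conceptual sketch only, not LTTng's actual buffer implementation; the `FlightRecorder` class and its methods are hypothetical names introduced for illustration.

```python
from collections import deque


class FlightRecorder:
    """Keep only the most recent `capacity` events in memory."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest item when full.
        self._events = deque(maxlen=capacity)

    def record(self, event):
        """Append one event, overwriting the oldest if the buffer is full."""
        self._events.append(event)

    def snapshot(self):
        """Return a copy of the buffer contents; recording can continue."""
        return list(self._events)
```

With a capacity of 3, recording the events 0 to 4 and then taking a snapshot returns the last three events, which mirrors how a snapshot taken after a problem contains the activity that led up to it.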

Coredump handler
    # cat /proc/sys/kernel/core_pattern
    |/path/to/lttng/handler.sh %p %u %g %s %t %h %e %E %c
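The nine specifiers in the core_pattern above are expanded by the kernel and passed to the handler as arguments: PID, UID, GID, signal number, dump time, hostname, executable name, executable path (with `/` replaced by `!`), and the core size soft limit. The sketch below shows how a handler could decode them before recording a snapshot. It is a hypothetical illustration, not the contents of `handler.sh`; the `CrashInfo` type and both functions are assumed names.

```python
from dataclasses import dataclass


@dataclass
class CrashInfo:
    pid: int          # %p
    uid: int          # %u
    gid: int          # %g
    signal: int       # %s
    time: int         # %t, seconds since the Epoch
    hostname: str     # %h
    exe_name: str     # %e
    exe_path: str     # %E, decoded back to a normal path
    core_limit: int   # %c, core file size soft limit


def parse_core_args(argv):
    """Decode the arguments in the order '%p %u %g %s %t %h %e %E %c'."""
    if len(argv) != 9:
        raise ValueError(f"expected 9 arguments, got {len(argv)}")
    pid, uid, gid, sig, when, host, exe, exe_path, limit = argv
    return CrashInfo(
        pid=int(pid),
        uid=int(uid),
        gid=int(gid),
        signal=int(sig),
        time=int(when),
        hostname=host,
        exe_name=exe,
        # The kernel replaces '/' with '!' in %E; undo that here.
        exe_path=exe_path.replace("!", "/"),
        core_limit=int(limit),
    )


def snapshot_command():
    """Command a handler could run to save the flight-recorder buffers."""
    return ["lttng", "snapshot", "record"]
```

A real handler would call `parse_core_args(sys.argv[1:])`, run the snapshot command, and read the core dump itself from standard input.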

Real-time monitoring
- Read the trace while it is being recorded
- Local or remote session
- Configurable flush period

Infrastructure integration
[Diagram: three servers, each running lttng-sessiond, send their traces over TCP to lttng-relayd; a viewer connects to lttng-relayd over TCP.]

Live streaming session
On the server to trace:
    $ lttng create --live 2000000 -U net://10.0.0.1
    $ lttng enable-event -k sched_switch
    $ lttng enable-event -k --syscall -a
    $ lttng start
On the receiving server (10.0.0.1):
    $ lttng-relayd -d
On the viewer machine:
    $ lttngtop -r 10.0.0.1
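The value passed to `--live` is the flush period in microseconds, so `2000000` in the session above means the buffers are flushed to the relay every 2 seconds. When the same session has to be created on many servers, the command can be generated rather than typed. The sketch below only builds the argument list and does not run anything; `live_create_command` is a hypothetical helper name.

```python
def live_create_command(relay_host, flush_seconds):
    """Build the 'lttng create' command for a live session.

    relay_host:    address of the machine running lttng-relayd
    flush_seconds: how often buffers are flushed to the relay, in seconds
    """
    flush_usec = int(round(flush_seconds * 1_000_000))
    if flush_usec <= 0:
        raise ValueError("flush period must be positive")
    return [
        "lttng", "create",
        "--live", str(flush_usec),     # lttng expects microseconds
        "-U", f"net://{relay_host}",   # stream to the relay daemon
    ]
```

The resulting list can be handed to `subprocess.run` on each traced server, for instance through whatever remote-execution tool the infrastructure already uses.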

Performance results
- sysbench MySQL benchmark with an increasing number of threads on a quad-core i7, 6GB RAM, 7200 RPM disk
- Tracing all system calls and sched_switch with LTTng in different modes:
  - Flight recorder with a snapshot recorded every 30 seconds
  - Streaming the trace to a remote server
  - Writing the trace on a dedicated disk
- Tracing all the threads of MySQL with strace to a dedicated disk

Performance results
- The test runs for 50 minutes
- Each snapshot is around 7MB, 100 snapshots recorded
- The whole strace trace (text) is 5.4GB with 61 million events recorded
- The whole LTTng trace (binary CTF) is 6.8GB with 257 million events recorded, with 1% of lost events
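A few derived figures make these numbers easier to compare. The sketch below works them out from the values on the slide, assuming 1 GB = 10^9 bytes and 1 MB = 10^6 bytes (the slide does not say which unit convention was used). Note that the two tracers do not record the same set of events: strace follows only the MySQL threads, while LTTng records all system calls and sched_switch system-wide.

```python
DURATION_S = 50 * 60          # the test runs for 50 minutes
SNAPSHOT_PERIOD_S = 30        # one snapshot every 30 seconds
SNAPSHOT_SIZE_MB = 7          # "around 7MB" each

STRACE_BYTES = 5.4e9          # assumption: 1 GB = 10**9 bytes
STRACE_EVENTS = 61e6
LTTNG_BYTES = 6.8e9
LTTNG_EVENTS = 257e6


def summarize():
    """Derive per-event sizes and event rates from the slide's figures."""
    snapshots = DURATION_S // SNAPSHOT_PERIOD_S
    return {
        "snapshots": snapshots,
        "snapshot_total_mb": snapshots * SNAPSHOT_SIZE_MB,
        "strace_bytes_per_event": STRACE_BYTES / STRACE_EVENTS,
        "lttng_bytes_per_event": LTTNG_BYTES / LTTNG_EVENTS,
        "strace_events_per_s": STRACE_EVENTS / DURATION_S,
        "lttng_events_per_s": LTTNG_EVENTS / DURATION_S,
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value:.1f}")
```

Under these assumptions the snapshot count matches the 100 snapshots reported, the binary CTF trace averages roughly 26 bytes per event against roughly 89 bytes per event for the strace text output, and LTTng sustains roughly 86,000 events per second over the run.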

Performance results

Sharing the disk with DB and trace

Performance results with virtualization
- 2 KVM VMs on the same host
- One is an Apache web server
- The other one downloads a 5GB ISO file from the first with wget
- Same LTTng instrumentation and setup (syscalls and sched_switch)
- No noticeable overhead when recording the trace to an external disk, over the network, or as snapshots

Advanced KVM analysis
- TMF Virtual Machine Analysis view by Mohamad Gebai

LTTngTop
- top-like interface to read LTTng kernel traces
- CPU usage, per-process file activity, kprobe hits, per-process perf counter display
- Navigate in the trace second-by-second
- Read offline traces or connect to a relay for live-streaming
- Experimental in-memory live-reading
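Per-process CPU usage of the kind listed above can be derived from sched_switch events alone: each event says which task starts running on a CPU, so the time until the next switch on that CPU is charged to that task. The following is a minimal sketch of that accounting, not LTTngTop's actual code. `cpu_time_per_task` is a hypothetical name, and the input is assumed to be tuples of (timestamp in nanoseconds, cpu_id, next_comm, next_tid) sorted by timestamp.

```python
from collections import defaultdict


def cpu_time_per_task(events, end_ts):
    """Sum the time each task spent running, in the unit of the timestamps.

    events: iterable of (timestamp, cpu_id, next_comm, next_tid),
            sorted by timestamp, one per sched_switch event
    end_ts: timestamp at which the analysed window ends
    """
    running = {}               # cpu_id -> (start_ts, comm, tid)
    usage = defaultdict(int)   # (comm, tid) -> accumulated run time

    for ts, cpu, comm, tid in events:
        if cpu in running:
            start, prev_comm, prev_tid = running[cpu]
            usage[(prev_comm, prev_tid)] += ts - start
        # Time before the first switch seen on a CPU is unknown and ignored.
        running[cpu] = (ts, comm, tid)

    # Charge the tasks still running when the window closes.
    for start, comm, tid in running.values():
        usage[(comm, tid)] += end_ts - start

    return dict(usage)
```

Dividing each total by the window length gives a percentage per task, which is the figure a top-like display would show for each refresh interval.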

Future Work
- Integrate with existing monitoring tools (Graphite, Nagios, etc.); a beta is already working
- Filter and pre-process the trace before sending
- Distribute the analysis
- Remote control of the tracer
- More advanced triggers to collect snapshots, start/stop tracing, etc.

Install it
- Packages for your distro (lttng-modules, lttng-ust, lttng-tools, userspace-rcu, babeltrace)
- For Ubuntu: PPA for daily build (lttngtop)
- Or from the source, see http://git.lttng.org

Questions?
- www.efficios.com
- lttng.org
- lttng-dev@lists.lttng.org
- @lttng_project