LinuxCon Europe 2013. Cloud Monitoring and Distribution Bug Reporting with Live Streaming and Snapshots. mathieu.desnoyers@efficios.



Similar documents
Efficient and Large-Scale Infrastructure Monitoring with Tracing

Large-scale performance monitoring framework for cloud monitoring. Live Trace Reading and Processing

Software Tracing of Embedded Linux Systems using LTTng and Tracealyzer. Dr. Johan Kraft, Percepio AB

Example of Standard API

An Oracle White Paper September Advanced Java Diagnostics and Monitoring Without Performance Overhead

OpenSAF A Standardized HA Solution

<Insert Picture Here> Tracing on Linux

Virtual Hosting & Virtual Machines

Virtual machine CPU monitoring with Kernel Tracing

This presentation covers virtual application shared services supplied with IBM Workload Deployer version 3.1.

Server Virtualization with Windows Server Hyper-V and System Center

Resource Utilization of Middleware Components in Embedded Systems

EFFICIENT ANALYSIS OF APPLICATION SERVERS IN THE CLOUD

This module provides an overview of service and cloud technologies using the Microsoft.NET Framework and the Windows Azure cloud.

11.1 inspectit inspectit

IBM Tivoli Composite Application Manager for Microsoft Applications: Microsoft Hyper-V Server Agent Version Fix Pack 2.

Network Virtualization Platform (NVP) Incident Reports

Emerald. Network Collector Version 4.0. Emerald Management Suite IEA Software, Inc.

Server Virtualization with Windows Server Hyper-V and System Center

Performance Tuning Guidelines for PowerExchange for Microsoft Dynamics CRM

Observer Analysis Advantages

Citrix Training. Course: Citrix Training. Duration: 40 hours. Mode of Training: Classroom (Instructor-Led)

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

TECHNOLOGY WHITE PAPER Jun 2012

User-ID Best Practices

Integrating LANGuardian with Active Directory

Server Virtualization with Windows Server Hyper-V and System Center

Sage 200 Online. System Requirements and Prerequisites

W H I T E P A P E R. Understanding VMware Consolidated Backup

Performance Counter. Non-Uniform Memory Access Seminar Karsten Tausche

11.1. Performance Monitoring

RCL: Software Prototype

Gigabit Ethernet Packet Capture. User s Guide

Perfmon2: A leap forward in Performance Monitoring

Chapter 14 Analyzing Network Traffic. Ed Crowley

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Wireshark and tcpdump: Packet Capture for Network Analysis

Module 12. Configuring and Managing Storage Technologies. Contents:

Very Large Enterprise Network, Deployment, Users

Managing and Monitoring Windows 7 Performance Lesson 8

<Insert Picture Here> Tracing on Linux: the Old, the New, and the Ugly

Real-time Debugging using GDB Tracepoints and other Eclipse features

Network performance in virtual infrastructures

Course Outline. Create and configure virtual hard disks. Create and configure virtual machines. Install and import virtual machines.

Chapter 14 Virtual Machines

Dollar Universe SNMP Monitoring User Guide

DB2 Connect for NT and the Microsoft Windows NT Load Balancing Service

Server Virtualization with Windows Server Hyper-V and System Center

INUVIKA OPEN VIRTUAL DESKTOP FOUNDATION SERVER

Discovering passwords in the memory

vmprof Documentation Release 0.1 Maciej Fijalkowski, Antonio Cuni, Sebastian Pawlus

An Introduction to Service Containers

MuL SDN Controller HOWTO for pre-packaged VM

A Scalable Network Monitoring and Bandwidth Throttling System for Cloud Computing

EUROPEAN ORGANIZATION FOR NUCLEAR RESEARCH CERN ACCELERATORS AND TECHNOLOGY SECTOR A REMOTE TRACING FACILITY FOR DISTRIBUTED SYSTEMS

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

IMPLEMENTING GREEN IT

HoneyBOT User Guide A Windows based honeypot solution

ReadyNAS Replicate. Software Reference Manual. 350 East Plumeria Drive San Jose, CA USA. November v1.0

Windows Azure Storage Scaling Cloud Storage Andrew Edwards Microsoft

Altor Virtual Network Security Analyzer v1.0 Installation Guide

Do Containers fully 'contain' security issues? A closer look at Docker and Warden. By Farshad Abasi,

Determine the process of extracting monitoring information in Sun ONE Application Server

Design Document. Offline Charging Server (Offline CS ) Version i -

Trends in Application Recovery. Andreas Schwegmann, HP

TECHNOLOGY WHITE PAPER Jan 2016

Oak Ridge National Laboratory Computing and Computational Sciences Directorate. Lustre Crash Dumps And Log Files

ANDROID DEVELOPER TOOLS TRAINING GTC Sébastien Dominé, NVIDIA

State of the GStreamer Project. Jan Schmidt. Centricular Ltd

Server-Virtualisierung mit Windows Server Hyper-V und System Center MOC 20409

20409B: Server Virtualization with Windows Server Hyper-V and System Center

Deploying Hadoop with Manager

How To Monitor And Test An Ethernet Network On A Computer Or Network Card

AppScope: Application Energy Metering Framework for Android Smartphones using Kernel Activity Monitoring

Rally Installation Guide

RingStor User Manual. Version 2.1 Last Update on September 17th, RingStor, Inc. 197 Route 18 South, Ste 3000 East Brunswick, NJ

Advanced Computer Networks. Network I/O Virtualization

Very Large Enterprise Network Deployment, 25,000+ Users

HP Insight Diagnostics Online Edition. Featuring Survey Utility and IML Viewer

ZooKeeper. Table of contents

Implementing and Managing Windows Server 2008 Clustering

Citrix Desktop Virtualization Fast Track

Course Project Documentation

1. Introduction. 2. Installation and Configuration. 2.1 Linux

6422: Implementing and Managing Windows Server 2008 Hyper-V (3 Days)

Monitoring, Tracing, Debugging (Under Construction)

Hands-On Microsoft Windows Server 2008

Cloud Operating Systems for Servers

Enhancing Hypervisor and Cloud Solutions Using Embedded Linux Iisko Lappalainen MontaVista

Optimize Your Deployment Using Best Practices

dnstap: high speed DNS logging without packet capture Robert Edmonds Farsight Security, Inc.

Rackspace Cloud Databases and Container-based Virtualization

Tutorial. Reference for more thorough Mininet walkthrough if desired

DNA. White Paper. DNA White paper Version: 1.08 Release Date: 1 st July, 2015 Expiry Date: 31 st December, Ian Silvester DNA Manager.

Question: 3 When using Application Intelligence, Server Time may be defined as.

VOC Documentation. Release 0.1. Russell Keith-Magee

CMB 207 1I Citrix XenApp and XenDesktop Fast Track

Transcription:

LinuxCon Europe 2013 Cloud Monitoring and Distribution Bug Reporting with Live Streaming and Snapshots mathieu.desnoyers@efficios.com 1

Presenter Mathieu Desnoyers http://www.efficios.com Author/Maintainer of Userspace RCU, LTTng kernel and user-space tracers, Babeltrace. 2

Content New and upcoming features since Tracing Summit 2012 LTTng Babeltrace Cloud monitoring Enhanced bug reports LTTng project roadmap 3

LTTng Reminder LTTng 2.x does NOT require any kernel modification. 4

LTTng Reminder LTTng 2.x does NOT require any kernel modification. 5

LTTng Reminder LTTng 2.x does NOT require any kernel modification. 6

LTTng Reminder LTTng 2.x does NOT require any kernel modification. 7

LTTng Features Since Tracing Summit 2012 LTTng 2.1 Basse Messe (December 2012) Network Streaming (TCP) Session daemon health check Event field filtering (LTTng-UST) ARM, MIPS sysem call tracing (LTTng modules) 8

LTTng Features Since Tracing Summit 2012 LTTng 2.2 Cuda (June 2013) Per user ID buffers (LTTng-UST) On disk file rotation (maximum stream file size) 9

LTTng Features Since Tracing Summit 2012 LTTng 2.3 Dominus Vobiscum (September 2013) Flight recorder tracing Stop and non-stop snapshots Core dump handler integration LTTng Tools extras/ 10

Flight recorder session + snapshot $ lttng create --snapshot $ lttng enable-event -k sched_switch $ lttng enable-event -k -syscall -a $ lttng start $... $ lttng snapshot record Snapshot recorded successfully for session auto- 20131019-113803 $ babeltrace /home/julien/lttng-traces/auto- 20131019-113803/snapshot-1-20131019-113813-0/kernel/ 11

Snapshot At any point in time, a snapshot can be taken of a the current trace buffers. Overwrite mode meaning flight recorder ring buffer trace data lttng_snapshot_record(..) $ lttng snapshot record snapshot trace data 12

LTTng Features Since Tracing Summit 2012 LTTng 2.4 Époque Opaque (upcoming) Java Util Logging (JUL) tracing Live streaming Analysis of live traces Consumer and relay daemon health check Packet index generated by consumer daemon Faster load of large traces in viewers afterward 13

Live Network Streaming Deployment Server A (lttng-sessiond) Server B (lttng-sessiond) Server C (lttng-sessiond) TCP lttng-relayd TCP Viewer 14

Live Network Streaming Session On the server to trace : $ lttng create - live 2000000 -U net://10.0.0.1 $ lttng enable-event -k sched_switch $ lttng enable-event -k -syscall -a $ lttng start On the receiving server (10.0.0.1) : $ lttng-relayd -d On the viewer machine : $ lttngtop -r 10.0.0.1 15

Live Trace Streaming Usage As the trace is being created, you extract and can analyze the data. Continous Analysis Extract data with live streaming for analysis on an other machine Cluster-level analysis Gather traces from multiple machines Load balancing analysis Latency detection System Administration Get data of faulty machine on-demand 16

Performance Results 17

Babeltrace Features Since Tracing Summit 2012 Babeltrace 1.0 (initial release, October 2012) Babeltrace 1.1 (API namespacing fix, March 2013) Babeltrace 1.2 (upcoming) Common Trace Format (CTF) Writer API Python bindings Nexus to CTF converter Live trace stream read support Connect to LTTng relay daemon 18

Cloud Monitoring Live network streaming Flight recorder tracing and snapshots Bytecode interpreter On traced target or separate dedicated machine, Triggers: Start tracing Stop tracing Gather snapshot Aggregation 19

Enhanced Bug Reports Flight recorder tracing In production Extremely low overhead When error is encountered Gather snapshot Do you want to send a detailed bug report? Very detailed trace of trace leading to the problem sent along with bug report 20

LTTng Project Roadmap Save/restore trace session configuration to/from files Support Perf PMU counters from LTTng-UST Dynamic instrumentation of user-space (dyninst) Listing libraries shared objects (LD_PRELOAD) Hardware tracing (ARM, Freescale, Intel,...) Triggers and aggregation in LTTng-UST bytecode interpreter LTTng modules bytecode interpreter Android port for LTTng modules and UST 21

Questions?? www.efficios.com lttng.org lttng-dev@lists.lttng.org @lttng_project 22