Self Adaptive Software System Monitoring for Performance Anomaly Localization



Similar documents
Application Performance Monitoring: Trade-Off between Overhead Reduction and Maintainability

INSTITUT FÜR INFORMATIK

A Comparison of the Influence of Different Multi-Core Processors on the Runtime Overhead for Application-Level Monitoring

11.1 inspectit inspectit

Online Performance Anomaly Detection with

A Benchmark Engineering Methodology to Measure the Overhead of Application-Level Monitoring

ExplorViz: Visual Runtime Behavior Analysis of Enterprise Application Landscapes

Tool-Supported Application Performance Problem Detection and Diagnosis. André van Hoorn.

Scalable and Live Trace Processing with Kieker Utilizing Cloud Computing

Capturing provenance information with a workflow monitoring extension for the Kieker framework

Architecture-Based Multivariate Anomaly Detection for Software Systems

Run-time Resource Management in SOA Virtualized Environments. Danilo Ardagna, Raffaela Mirandola, Marco Trubian, Li Zhang

Informatica Ultra Messaging SMX Shared-Memory Transport

White Paper. Recording Server Virtualization

Instrumentation Software Profiling

Self-Adaptive Performance Monitoring for Component-Based Software Systems. Jens Ehlers

HPSA Agent Characterization

Muse Server Sizing. 18 June Document Version Muse

Continuous Integration in Kieker

PERFORMANCE ENHANCEMENTS IN TreeAge Pro 2014 R1.0

Microsoft Exchange Solutions on VMware

Performance Testing of a Cloud Service

Continuous Monitoring of Software Services: Design and Application of the Kieker Framework

Towards Adaptive Monitoring of Java EE Applications

Scaling up to Production

XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines. A.Zydroń 18 April Page 1 of 12

Development of a Concurrent and Distributed Analysis Framework for Kieker

How To Test For Performance And Scalability On A Server With A Multi-Core Computer (For A Large Server)

Very Large Enterprise Network, Deployment, Users

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

A Tool Suite for the Generation and Validation of Configurations for Software Availability

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

Parallels Virtuozzo Containers

P E R F O R M A N C E S T U D Y. Scaling IBM DB2 9 in a VMware Infrastructure 3 Environment

JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra

Using VMware VMotion with Oracle Database and EMC CLARiiON Storage Systems

Holistic Performance Analysis of J2EE Applications

MAGENTO HOSTING Progressive Server Performance Improvements

Citrix XenDesktop Modular Reference Architecture Version 2.0. Prepared by: Worldwide Consulting Solutions

Data Center Op+miza+on

Introducing Performance Engineering by means of Tools and Practical Exercises

An Oracle White Paper March Load Testing Best Practices for Oracle E- Business Suite using Oracle Application Testing Suite

PERFORMANCE MONITORING OF JAVA COMPONENT-ORIENTED DISTRIBUTED APPLICATIONS

ORACLE OPS CENTER: VIRTUALIZATION MANAGEMENT PACK

Manjrasoft Market Oriented Cloud Computing Platform

Study of Load Balancing of Resource Namespace Service

Graph Database Performance: An Oracle Perspective

Transparent Optimization of Grid Server Selection with Real-Time Passive Network Measurements. Marcia Zangrilli and Bruce Lowekamp

SERVICE LEVEL AGREEMENT XML SCHEMA FOR SOFTWARE QUALITY ASSURANCE

Very Large Enterprise Network Deployment, 25,000+ Users

FusionHub Virtual Appliance

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

How To Use An Orgs.Org Database With An Orgorora Cloud Management Pack For Database (For Cloud)

PARALLELS CLOUD SERVER

Online Failure Prediction in Cloud Datacenters

Step by Step Guide To vstorage Backup Server (Proxy) Sizing

Comparing Multi-Core Processors for Server Virtualization

Garbage Collection in the Java HotSpot Virtual Machine

A Middleware Strategy to Survive Compute Peak Loads in Cloud

Nexenta Performance Scaling for Speed and Cost

ELECTRONIC QUALITY MANAGEMENT SOFTWARE

Performance Benchmarking of Application Monitoring Frameworks

Affinity Aware VM Colocation Mechanism for Cloud

EMF Compare. EMF Compare. Summary : Table des mises à jour Version Date Auteur(s) Mises à jour v1.0 06/10/11 Laurent Goubet Initial Version

Automatic Extraction of Probabilistic Workload Specifications for Load Testing Session-Based Application Systems

Extreme Networks Security Analytics G2 Vulnerability Manager

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

Load and Performance Testing

JBoss Data Grid Performance Study Comparing Java HotSpot to Azul Zing

Express5800 Scalable Enterprise Server Reference Architecture. For NEC PCIe SSD Appliance for Microsoft SQL Server

AppDynamics Lite Performance Benchmark. For KonaKart E-commerce Server (Tomcat/JSP/Struts)

DSS. Diskpool and cloud storage benchmarks used in IT-DSS. Data & Storage Services. Geoffray ADDE

Hora: Online Failure Prediction Framework for Component-based Software Systems Based on Kieker and Palladio

Virtualization with the Intel Xeon Processor 5500 Series: A Proof of Concept

Oracle Database Scalability in VMware ESX VMware ESX 3.5

Java Monitoring. Stuff You Can Get For Free (And Stuff You Can t) Paul Jasek Sales Engineer

INSTITUT FÜR INFORMATIK

KPDAYS 13 Symposium on Software Performance: Joint Kieker/Palladio Days Karlsruhe, Germany, November 27 29, 2013 Proceedings

System Requirements and Configuration Options

Open-Source-Software als Katalysator im Technologietransfer am Beispiel des Monitoring-Frameworks

Development of Monitoring and Analysis Tools for the Huawei Cloud Storage

Advanced Network Monitoring ANM

Detecting false users in Online Rating system & Securing Reputation

Philips IntelliSpace Critical Care and Anesthesia on VMware vsphere 5.1

Database Application Developer Tools Using Static Analysis and Dynamic Profiling

Ahsay Online Backup Suite v5.0. Whitepaper Backup speed

Analysis of VDI Storage Performance During Bootstorm

Virtual Compute Appliance Frequently Asked Questions

CloudSim: A Toolkit for Modeling and Simulation of Cloud Computing Environments and Evaluation of Resource Provisioning Algorithms

Red Hat Network Satellite Management and automation of your Red Hat Enterprise Linux environment

CHAPTER 2. Existing Server Environment Analysis with Capacity Planner

Technical Paper. Moving SAS Applications from a Physical to a Virtual VMware Environment

Towards Online Performance Model Extraction in Virtualized Environments

A Performance Analysis of Distributed Indexing using Terrier

Protect SQL Server 2012 AlwaysOn Availability Group with Hitachi Application Protector

Maximum performance, minimal risk for data warehousing

Oracle Software. Hardware. Training. Consulting. Mythics Complete.

How To Write A Test Program For Eclipse (Eclipse)

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone:

Transcription:

2011/06/17 Jens Ehlers, André van Hoorn, Jan Waller, Wilhelm Hasselbring Software Engineering Group Christian Albrechts University Kiel

Application level Monitoring Extensive infrastructure monitoring, application monitoring not widespread Reactive probe injection after performance drops Business/service monitoring Application monitoring Business Processes Services Application Middleware Container Key performance indicators, e.g. process throughput, SLO appliance, workload, Response times, operational profile, Thread/connection pool sizes, Infrastructure monitoring Virtual Machine Operating System Hardware Heap size, CPU/memory utilization, Availability, reliability, 2/ 17

Kieker Monitoring Framework Flexible, integrated monitoring framework Extension points for custom probes, records, logs, analysis plugins Provides a broad set of analysis plugins Evaluated in industry case studies, low (linear) overhead Open source development: http://kieker.sourceforge.net/ 3/ 17

Kieker Analysis Tool Monitoring control panel based on Eclipse RCP/EMF Analysis/visualization plugins UML sequence diagrams Dependency graphs Use case transition graphs 4/ 17

Kieker Overhead Evaluation Inactive probes do not cause significant overhead Test system: Sun Blade X6270 with 2x Intel Xeon E5540, 8 cores at 2.53 GHz, 24 GB RAM, ZFS RAID, SunOS 5.1, Java HotSpot x86 Server VM 1.6 5/ 17

Self Adaptive Monitoring Monitoring causes measurable overhead Not all probes/measuring points are activated permanently Monitoring rules are continuously evaluated during runtime Rule condition formulated via OCL [OMG10] Runtime OCL expression evaluation on EMF models 6/ 17

OCL based Monitoring Condition Example Select all interface and anomalous operations OCL context element: instance of calling context tree (CCT) [ABL97] OCL expression 7/ 17

Anomaly Rating Procedure 1. Response time forecast Forecast expected response times based on historical observations. 2. Anomalous behavior hypothesis test 3. Anomaly score calculation 4. Anomaly score aggregation and correlation 8/ 17

Context Influences Response Times Contextual impact factors influence timing behavior of software services Example: Response times depend on CPU utilization and system workload 9/ 17

Response Time Forecast Response time observations form a univariate time series Assumption: Patterns observed in the past will remain in future Allows short term forecasts: x, x,,x x,x, Stochastic process model has to fit to observed time series Implementation of different forecast models [Wei05] Single exponential smoothing: x βx 1 β x Holter Winters Smoothing ARIMA models, e.g. ARIMA(1,1,1): x x φ x x θ x x 10 / 17

Anomaly Rating Procedure 1. Response time forecast Forecast expected response times based on historical observations. 2. Anomalous behavior hypothesis test Test if a sample of newly observed response times is to be rated as normal or anomalous related to the expected value from 1. 3. Anomaly score calculation 4. Anomaly score aggregation and correlation 11 / 17

Anomalous Behavior Hypothesis Test Distribution of response times is unknown Not possible to extract contextual impact factors completely Leverage central limit theorem Not every operation execution has to be classified Instead: Testing a sample of operation executions 12 / 17

Anomaly Rating Procedure 1. Response time forecast Forecast expected response times based on historical observations. 2. Anomalous behavior hypothesis test Test if a sample of newly observed response times is to be rated as normal or anomalous related to the expected value from 1. 3. Anomaly score calculation Update anomaly scores for each operation representing its current anomaly degree based on the sequent rating of response times samples in 2. 4. Anomaly score aggregation and correlation Aggregate and correlate anomaly scores from 3. to higher levels of abstraction, e.g. component level anomaly scores. [MRvH+09] 13 / 17

Anomaly Localization Evaluation 14 / 17

Anomaly Localization Evaluation 15 / 17

Conclusion Motivation for application level software system monitoring Architecture and application of the Kieker monitoring framework Self adaptive monitoring approach for performance anomaly localization 16 / 17

References [ABL97] Ammons, G., Ball, T., and Larus, J. R. Exploiting hardware performance counters with flow and context sensitive profiling. In Proceedings of the ACM SIGPLAN 1997 Conference on Programming Language Design and Implementation, pages 85 96, Las Vegas, Nevada. ACM Press, 1997. [OMG10] OMG. Object Constraint Language, Version 2.2. http://www.omg.org/spec/ocl/2.2/, 2010. [Wei05] Wei, W. W. S. Time Series Analysis: Univariate and Multivariate Methods. Addison Wesley, 2nd edition, 2005. 17 / 17