Monitoring Elastic Cloud Services

Similar documents

Fine-Grained Elasticity Support for Cloud Applications: The CELAR Approach

JCatascopia: Monitoring Elastically Adaptive Applications in the Cloud

Automatic, multi-grained elasticity-provisioning for the Cloud

Η υπηρεσία Public IaaS ΕΔΕΤ ανάπτυξη και λειτουργία για χιλιάδες χρήστες

Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure

IaaS Federation. Contrail project. IaaS Federation! Objectives and Challenges! & SLA management in Federations 5/23/11

Infrastructure as a Service (IaaS)

How To Manage Cloud Service Provisioning And Maintenance

How To Run A Cloud Server On A Server Farm (Cloud)

Solution for private cloud computing

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

1 What is Cloud Computing? Cloud Infrastructures OpenStack Amazon EC CAMF Cloud Application Management

FleSSR Project: Installing Eucalyptus Open Source Cloud Solution at Oxford e- Research Centre

Scalable Architecture on Amazon AWS Cloud

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform

Towards Smart and Intelligent SDN Controller

Automatic, multi- grained elasticity- provisioning for the Cloud

CS 6343: CLOUD COMPUTING Term Project

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

Li Sheng. Nowadays, with the booming development of network-based computing, more and more

THE EUCALYPTUS OPEN-SOURCE PRIVATE CLOUD

Deploying Business Virtual Appliances on Open Source Cloud Computing

Network performance in virtual infrastructures

Cloud n Service Presentation. NTT Communications Corporation Cloud Services

Figure 1. The cloud scales: Amazon EC2 growth [2].

White Paper. Cloud Native Advantage: Multi-Tenant, Shared Container PaaS. Version 1.1 (June 19, 2012)

GMonE: a Complete Approach to Cloud Monitoring

KT ucloud storage. Two Years of Life with OpenStack Swift / Jaesuk Ahn, Cloud OS Dev. Team, Korea Telecom

Performance Management for Cloudbased STC 2012

Architecting ColdFusion For Scalability And High Availability. Ryan Stewart Platform Evangelist

Topology Aware Analytics for Elastic Cloud Services

SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform Reference Architecture

Optimization of QoS for Cloud-Based Services through Elasticity and Network Awareness

This is an author-deposited version published in : Eprints ID : 12902

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Stratusphere Solutions

MySQL and Virtualization Guide

SERVER 101 COMPUTE MEMORY DISK NETWORK

CLOUD SERVICE SCHEDULE Newcastle

Virtualization and Cloud Computing

Performance Analysis: Benchmarking Public Clouds

Service allocation in Cloud Environment: A Migration Approach

How to Deploy OpenStack on TH-2 Supercomputer Yusong Tan, Bao Li National Supercomputing Center in Guangzhou April 10, 2014

Cloud Federations in Contrail

Achieving a High-Performance Virtual Network Infrastructure with PLUMgrid IO Visor & Mellanox ConnectX -3 Pro

Virtual Machine Instance Scheduling in IaaS Clouds

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Amazon Elastic Beanstalk

A Survey Study on Monitoring Service for Grid

Online Transaction Processing in SQL Server 2008

Monitoring Clusters and Grids

Cloud Computing through Virtualization and HPC technologies

Managed Hosting is a managed service provided by MN.IT. It is structured to help customers meet:

EWeb: Highly Scalable Client Transparent Fault Tolerant System for Cloud based Web Applications

Intro to Virtualization

Dynamic Resource allocation in Cloud

Two-Level Cooperation in Autonomic Cloud Resource Management

SURFsara HPC Cloud Workshop

VMware vrealize Automation

What s new in Hyper-V 2012 R2

VMware vrealize Automation

2) Xen Hypervisor 3) UEC

Cloud Computing for Control Systems CERN Openlab Summer Student Program 9/9/2011 ARSALAAN AHMED SHAIKH

VirtualclientTechnology 2011 July

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)

Manjrasoft Market Oriented Cloud Computing Platform

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Shareable Private Space on a Public Cloud

Proactively Secure Your Cloud Computing Platform

Amazon EC2 Product Details Page 1 of 5

VMware vcloud Automation Center 6.1

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

Introducing ScienceCloud

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc.

A Middleware Strategy to Survive Compute Peak Loads in Cloud

A Cost-Evaluation of MapReduce Applications in the Cloud

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

Evaluation Methodology of Converged Cloud Environments

Cloud Models and Platforms

7 Ways OpenStack Enables Automation & Agility for KVM Environments

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

VMware vcloud Automation Center 6.0

Cloud Design and Implementation. Cheng Li MPI-SWS Nov 9 th, 2010

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Microsoft SQL Server 2012 on Cisco UCS with iscsi-based Storage Access in VMware ESX Virtualization Environment: Performance Study

Transcription:

Monitoring Elastic Cloud Services trihinas@cs.ucy.ac.cy Advanced School on Service Oriented Computing (SummerSoc 2014) 30 June 5 July, Hersonissos, Crete, Greece

Presentation Outline Elasticity in Cloud Computing Cloud Service Monitoring Challenges Existing Monitoring Tools and their Limitations JCatascopia Monitoring System Architecture Features Evaluation Conclusions and Future Work

Workload (req/s) Elasticity in Cloud Computing Ability of a system to expand or contract its dedicated resources to meet the current demand Time

Cloud Monitoring Challenges Monitor heterogeneous types of information and resources Extract metrics from multiple levels of the Cloud Low-level metrics (i.e. CPU usage, network traffic) High-level metrics (i.e. application throughput, latency, availability) Metrics collected at different time granularities

Cloud Monitoring Challenges Operate on any Cloud platform Monitor Cloud services deployed across multiple Cloud platforms Detect configuration changes in a cloud service Application topology changes (e.g. new VM added) Allocated resource changes (e.g. new disk attached to VM) Elasticity Support "Managing and Monitoring Elastic Cloud Applications", D. Trihinas and C. Sofokleous and N. Loulloudes and A. Foudoulis and G. Pallis and M. D. Dikaiakos, 14th International Conference on Web Engineering (ICWE 2014), Toulouse, France 2014

Existing Monitoring Tools

Cloud Specific Monitoring Tools Benefits Provide MaaS capabilities Fully documented Easy to use Well integrated with underlying platform Limitations Commercial and proprietary which limits them to operating on specific Cloud IaaS providers

General Purpose Monitoring Tools Benefits Open-source Robust and light-weight System level monitoring Suitable for monitoring Grids and Computing Clusters Limitations Not suitable for dynamic (elastic) application topologies

Monitoring Tools with Elasticity Support [de Carvalho et al., INM 2011] Nagios + Controller on each physical host to notify Nagios Server with a list of instances currently running on the system Lattice Monitoring Framework [Clayman et al., NOMS 2011] Controller periodically requests from hypervisor list of current running VMs Limitations Special entities required at physical level Depend on current hypervisor

JCatascopia Monitoring System

JCatascopia Monitoring System Open-source Multi-Layer Cloud Monitoring Platform Independent Capable of Supporting Elastic Applications Interoperable Scalable "JCatascopia: Monitoring Elastically Adaptive Applications in the Cloud", D. Trihinas and G. Pallis and M. D. Dikaiakos, 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2014), 2014

JCatascopia Architecture

Monitoring Agents Light-weight monitoring instances Deployable on physical nodes or virtual instances Responsible for the metric collection process Aggregate and distribute collected metrics (pub/sub)

Monitoring Probes The actual metric collectors managed by Monitoring Agents JCatascopia Probe API Dynamically deployable to Monitoring Agents Filtering mechanism at Probe level

Monitoring Servers Receive metrics from Monitoring Agents process and store metrics in Monitoring Database Handle user metric and configuration requests Hierarchy of Monitoring Servers for greater scalability

JCatascopia Architecture JCatascopia REST API JCatascopia-Web User Interface JCatascopia Database Interface Allows users to utilize their own Database solution with JCatascopia Currently available: MySQL, Cassandra

Dynamic Agent Discovery and Removal Subscriber Publisher Server Agent subscribe status: connected event stream Bind to IP and Port Bind to IP and Port subscribe status: connected send metadata status: received metric stream (a) Classic pub/sub (b) JCatascopia Benefits Monitoring Servers are agnostic of Agent network location Agents appear dynamically Eliminated the need to Restart or reconfigure Monitoring System Depend on underlying hypervisor Require directory service with Agent locations

Metric Subscription Rule Language Aggregate single instance metrics SUM(errorCount) Generate high-level metrics at runtime DBthroughput = AVG(readps+writeps) Subscription Rule Example Average DBthroughput from the low-level metrics readps and writeps of a database cluster comprised of N nodes: DBthroughput = AVG(readps + writeps) MEMBERS = [id1,...,idn] ACTION = NOTIFY(<25,>75%)

Adaptive Filtering Simple fixed uniform range filter windows are not effective: i.e. filter currentvalue if in window previousvalue±r No guarantee that any values will be filtered at all Adaptive filter window range window range (R) is not static but depends on percentage of values previously filtered Collect Samples Check percentage of filtered values Adjust Window Range (R)

JCatascopia Evaluation

Evaluation Validate JCatascopia functionality and performance Compare JCatascopia to other Monitoring Tools Ganglia Lattice Monitoring Framework Testbed Different domains of Cloud applications Various VM flavors 3 public Cloud providers and 1 private Cloud

Testbed Cloud Provider VM no. VM Flavor Applications GRNET Okeanos public 15 1GB RAM, 10GB Disk, Ubuntu Cloud Server 12.04 LTS Flexiant FlexiScale 10 2 VCPU, 2GB RAM, 10GB Disk, platform Debian 6.07 (Squeeze) Amazon EC2 10 m1.small with CentOS 6.4 (1VCPU, 1.7GB RAM, 160GB Disk) OpenStack Private Cloud 60 2 VCPU, 2GB RAM, 10GB Disk, Ubuntu Server 12.04 LTS 12 VMs Cassandra 3 VMs YCSB Clients HASCOP an attributed, multigraph clustering algorithm We have deployed on all VMs JCastascopia Monitoring Agents, Ganglia gmonds and Lattice DataSources

Testbed - Available Probes Probe Metrics Period (sec) CPU cpuuserusage, cpuniceusage, cpusystemusage, cpuidle, cpuiowait 10 Memory memtotal, memused, memfree, memcache, memswaptotal, memswapfree Network netpacketsin, netpacketsout, netbytesin, netbytesout 20 Disk Usage disktotal, diskfree, diskused 60 Disk IO readkbps, writekbps, iotime 40 Cassandra readlatency, writelatency 20 YCSB clientthroughput, clientlatency 10 HASCOP clustersperiter, iterelaptime, centroidupdtime, ptableupdtime, graphupdtime 15 20

Experiment 1. Elastically Adapting Cassandra Cluster Scale out Cassandra cluster to cope with increasing workload Experiment uses 15 VMs in Okeanos cluster Subscription Rule to notify Provisioner to add VM when scaling condition violated: cputotalusage = AVG(1 - cpuidle) MEMBERS = [id1,...,idn] ACTION = NOTIFY(>=75%) VMs YCSB Clients Cassandra Probes YCSB CPU, Memory, Network, DiskIO, Cassandra

Experiment 1. Elastically Adapting Cassandra Cluster Monitoring Agent Runtime Impact YCSB Agent Utilization Cassandra Agent Utilization

Experiment 2. Monitoring a Cloud Federation Environment Monitor an application topology spread across multiple Clouds: OpenStack (10 VMs) Amazon EC2 (10 VMs) Flexiant (10 VMs) Compare JCatascopia, Ganglia and Lattice runtime footprint Compare JCatascopia and Ganglia network utilization VMs HASCOP Probes CPU, Memory, DiskUsage, HASCOP

Experiment 2. Monitoring a Cloud Federation Environment Monitoring Agent Runtime Impact Monitoring Agent Network Utilization difference less than 0.03% HASCOP Agent Utilization Agent Network Utilization When in need of application-level monitoring, for a small runtime overhead, JCatascopia can reduce monitoring network traffic and consequently monitoring cost

Experiment 3. JCatascopia Scalability Evaluation Experiment uses the 60 VMs on OpenStack private Cloud to scale a HASCOP cluster 1 Monitoring Server for 60 Agents Subscription Rule: hascopiterelapsedtime = AVG(iterElapTime) MEMBERS = [id1,...,idn] ACTION = NOTIFY(ALL) VMs HASCOP Probes CPU, Memory, DiskUsage, HASCOP

Scalability Evaluation Archiving time grows linearly

Experiment 3. JCatascopia Scalability Evaluation New Setup 2 Intermediate Monitoring Servers which aggregate metrics from underlying Agents 1 root Monitoring Server VMs HASCOP Probes CPU, Memory, DiskUsage, HASCOP

Scalability Evaluation When archiving time is high, we can redirect monitoring metric traffic through Intermediate Monitoring Servers, allowing the monitoring system to scale

Conclusions Experiments on public and private Cloud platforms show that JCatascopia is: capable of supporting automated elasticity controllers able to adapt in a fully automatic manner when elasticity actions are enforced open-source, interoperable, scalable and has a low runtime footprint

Future Work Further pursue adaptive filtering Enhance Probes with adaptive sampling Adjust sampling rate when stable phases are detected Create Monitoring Toolkit for PaaS Cloud applications Provide Monitoring as a Service to Cloud consumers

Acknowledgements www.celarcloud.eu co-funded by the European Commission https://github.com/celar/cloud-ms

Laboratory for Internet Computing Department of Computer Science University of Cyprus http://linc.ucy.ac.cy

BACKUP SLIDES

JCatascopia Architecture

Dynamic Agent Removal Heartbeat monitoring to detect when Agents: Removed due to scaling down elasticity actions Temporary unavailable (network connectivity issues)

Monitoring Agents

Monitoring Servers