Inca User-level Grid Monitoring

Similar documents
Inca User-level Grid Monitoring

A Survey Study on Monitoring Service for Grid

XSEDE Service Provider Software and Services Baseline. September 24, 2015 Version 1.2

Monitoring Clusters and Grids

Installing and Administering VMware vsphere Update Manager

S3 Monitor Design and Implementation Plans

DBA xpress Product Overview

Cloud Computing. Lecture 5 Grid Case Studies

MALAYSIAN PUBLIC SECTOR OPEN SOURCE SOFTWARE (OSS) PROGRAMME. COMPARISON REPORT ON NETWORK MONITORING SYSTEMS (Nagios and Zabbix)

DEPLOYMENT GUIDE Version 1.0. Deploying the BIG-IP LTM with the Nagios Open Source Network Monitoring System

NetWrix SQL Server Change Reporter. Quick Start Guide

HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team

Oracle Communications Cartridge Feature Specification for Broadsoft Broadworks Enterprise Services

BI xpress Product Overview

SCF/FEF Evaluation of Nagios and Zabbix Monitoring Systems. Ed Simmonds and Jason Harrington 7/20/2009

IBM Endpoint Manager Version 9.1. Patch Management for Red Hat Enterprise Linux User's Guide

Monitoring of the UNICORE middleware

Monitoring Oracle Enterprise Performance Management System Release Deployments from Oracle Enterprise Manager 12c

SOFTWARE TESTING TRAINING COURSES CONTENTS

Copyrighted , Address :- EH1-Infotech, SCF 69, Top Floor, Phase 3B-2, Sector 60, Mohali (Chandigarh),

ARCHER SP Service Quarterly Report. Quarter

Installing GFI Network Server Monitor

About Network Data Collector

Change Manager 5.0 Installation Guide

NetWrix File Server Change Reporter. Quick Start Guide

[Jet-Magento Integration]

eeye Digital Security Product Training

Installing GFI Network Server Monitor

CHAPTER 5 IMPLEMENTATION OF THE PROPOSED GRID NETWORK MONITORING SYSTEM IN CRB

GRMS Features and Benefits

CME ClearPort (CP) API v.2.0. Certification Process Checklist

Mitglied der Helmholtz-Gemeinschaft. System monitoring with LLview and the Parallel Tools Platform

Umbraco Courier 2.0. Installation guide. Per Ploug Hansen 5/24/2011

Virtual Appliance Setup Guide

See-GRID Project and Business Model

Tutorial: Packaging your server build

SyncThru Database Migration

Cloud Computing. Up until now

Understanding Task Scheduler FIGURE Task Scheduler. The error reporting screen.

Phoronix Test Suite v5.8.0 (Belev)

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

PRACE WP4 Distributed Systems Management. Riccardo Murri, CSCS Swiss National Supercomputing Centre

Anar Manafov, GSI Darmstadt. GSI Palaver,

GT 6.0 GRAM5 Key Concepts

Connecting to the University Wireless Network

Development Best Practices

Proposal Submission System - A Content Management System Approach for Proposal Submission

05.0 Application Development

EVALUATION ONLY. WA2088 WebSphere Application Server 8.5 Administration on Windows. Student Labs. Web Age Solutions Inc.

Certified Selenium Professional VS-1083

Department of Defense PKI Use Case/Experiences

Using Network Application Development (NAD) with InTouch

Use of Continuous Integration Tools for Application Performance Monitoring

out of this world guide to: POWERFUL DEDICATED SERVERS

BackupAgent LabTech Integration Installation and Usage

Execution Management: Key Concepts

BMC BladeLogic Client Automation Installation Guide

: Test 217, WebSphere Commerce V6.0. Application Development

Maintaining Non-Stop Services with Multi Layer Monitoring

CHAPTER 4 PROPOSED GRID NETWORK MONITORING ARCHITECTURE AND SYSTEM DESIGN

TEST AUTOMATION FRAMEWORK

CMS Software Deployment on OSG

[D-View 7 Advanced Hands-On Practice] Version 1.0

Features Overview Guide About new features in WhatsUp Gold v14

Agile Business Suite: a 4GL environment for.net developers DEVELOPMENT, MAINTENANCE AND DEPLOYMENT OF LARGE, COMPLEX BACK-OFFICE APPLICATIONS

Acronis Recovery TM for Microsoft Exchange TM

NEFSIS DEDICATED SERVER

NETWRIX CHANGE NOTIFIER

XSEDE Science Gateway Use Cases

DISASTER RECOVERY PLANNING

Database Services for CERN

Getting Started with the Ed-Fi ODS and Ed-Fi ODS API

Technical Guide to ULGrid

Enhanced System Integration Test Automation Tool (E-SITAT) Author: Akshat Sharma

Enterprise Service Bus

Implementing a SAS Metadata Server Configuration for Use with SAS Enterprise Guide

Software Configuration Management Best Practices for Continuous Integration

Installation Manual v2.0.0

Grid Computing With FreeBSD

1. Data Domain Pre-requisites. 2. Enabling OST

GFI LANguard 9.0 ReportPack. Manual. By GFI Software Ltd.

"Charting the Course to Your Success!" MOC A Understanding and Administering Windows HPC Server Course Summary

ManageEngine Desktop Central Training

Software Delivery Integration and Source Code Management. for Suppliers

Quick Start Guide. for Installing vnios Software on. VMware Platforms

Managing Credentials with

To increase scalability, the following features can be integrated:

N E T B A C K U P 7 F E A T UR E B R I E F I N G. NetBackup 7 Feature Briefing OpsCenter and OpsCenter Analytics

Tracking Network Changes Using Change Audit

Log files management. Katarzyna KAPUSTA

Oracle Financial Services Data Integration Hub Foundation Pack Extension for Data Relationship Management Interface

On Enabling Hydrodynamics Data Analysis of Analytical Ultracentrifugation Experiments

WPU-7700 APS MANAGEMENT

Managing your Red Hat Enterprise Linux guests with RHN Satellite

BusinessObjects Enterprise XI Release 2

ICE Trade Vault. Public User & Technology Guide June 6, 2014

Test Case 3 Active Directory Integration

How to configure Client side certificate authentication for authorization-only access / Active Sync URL s

To install Multifront you need to have familiarity with Internet Information Services (IIS), Microsoft.NET Framework and SQL Server 2008.

Foreign Account Tax Compliance Act (FATCA) IDES Implementation Update. April 2015

Transcription:

Inca User-level Grid Monitoring Shava Smallen ssmallen@sdsc.edu SC 08 November 19, 2008

Goal: reliable grid software and services for users Over 750 TF Over 30 PB of online and archival data storage Connected via dedicated multi- Gbps links 30-63 software packages and 6-23 services per resource 11 TeraGrid sites, 21 resources

Related Grid monitoring tools Inca s primary objective: user-level Grid monitoring

User-level grid monitoring Runs from a standard user account Executes using a standard GSI credential Uses tests that are developed and configured based on user documentation Automates periodic execution of tests Verifies user-accessible Grid access points Centrally manages monitoring configuration Easily updates and maintains monitoring deployment

Who benefits from user-level grid monitoring? Grid managers Verify requirements are fulfilled by resource providers Identify failure trends System administrators Email notification Debugging support System administrators End users Debug user account/environment issues Advanced users: feedback to Grid/VO

Inca provides user-level grid monitoring Stores and archives a wide variety of monitoring results Captures context of monitoring result as it is collected Eases the writing, deploying, and sharing of new tests or benchmarks Flexible and comprehensive web status pages Secure

Reporters collect monitoring data Executable programs that measure some aspect of the system or installed software Supports a set of command-line options and writes XML to stdout Schema supports multiple types of data Extensive library support for perl and python scripts (most reporters < 30 lines of code) Independent of other Inca components

Repositories support sharing Collection of reporters available via a URL Supports package dependencies Packages versioned to allow for automatic updates Inca project repository contains 150+ reporters Version, unit test, performance benchmark reporters Grid middleware and tools, compilers, math libraries, data tools, and viz tool

Agent provides centralized configuration and management Implements the configuration specified by Inca administrator Stages and launches a reporter manager on each resource Sends package and configuration updates Manages proxy information Administration via GUI interface (incat) Screenshot of Inca GUI tool, incat, showing the reporters that are available from a local repository

Depot stores and publishes data Stores configuration information and monitoring results Provides full archiving of reports Uses relational database backend via Hibernate Supports HQL and predefined queries Supports plug-in customization (e.g., email notifications, downtimes) Web services - Query data from depot and return as XML

Consumer displays data Current and historical views Web application packaged with Jetty JSP 2.0 pages/tags to query data and format using XSLT CeWolf/JFreeChart to graph data

Tests Summary Weekly status report Cumulative test status by resource Resource status history Error history summary Related test histories Inca s status pages provide multiple levels of details Test status by package and resource Test Details Individual test history Individual test result details Historical Current status

Software status and deployments Current software version: 2.4 (available from Inca website) http://inca.sdsc.edu

Running since 2003 Inca TeraGrid deployment Total of 2660 tests running on 20 login nodes, 3 grid nodes, and 3 servers Coordinated software and services Cross-site tests GRAM usage CA certificate and CRL checking Resource registration in information services Screenshot of Inca status pages for TeraGrid http://inca.teragrid.org/

Measuring Performance Variation on the TeraGrid Stage 1: MPI ping pong Collecting results since Oct 1 Runs every 12 hours on 16 processors, <10 minutes Running on NCSA s Abe, NICS kraken Stage 2: PARATEC Collecting results since November 1 Runs every 12 hours on 256 processors, <30 min Running on NCSA s Abe, NICS kraken Latest PingPong and PARATEC results Mean MPI ping pong bandwidth history

Inca GLEON deployment Sensors in lake: dissolved oxygen level, temperature, velocity (some), etc. Monitoring Data Turbine deployments since Oct 2007 Total of 26 tests running on data server at SDSC and windows box in Northern Temperate Lakes in Wisconsin http://inca-gleon.sdsc.edu

Inca monitoring benefits end users Tests resources and services used by LEAD. E.g. Pings service every 3 mins Verifies batch job submission every hour Automatically notifies admins of failures Show week of history in custom status pages Inca reported errors mirror failures we ve observed and as they are addressed we ve noticed an improvement in TeraGrid s stability. -- Suresh Marru (LEAD developer)

Benefits of using Inca Detect problems before the users notice them Easy to write and share tests and benchmarks Easy to deploy and maintain Flexible and comprehensive displays

Inca Information Announcements: inca-users@sdsc.edu Supported by: Email: inca@sdsc.edu Website: http://inca.sdsc.edu