3 Examples of Reliability Testing. Dan Downing, VP Testing Services MENTORA GROUP www.mentora.com




WOPR 11 Experience Report, Intuit, Oct. 24 08

# 1 Component Failover Testing

Application: Donor management system for a national non-profit (W2K, SQL Server, VBScript)

Reliability issue: Major load-related outage in the prior peak season, during which donation transactions were lost; a component failure was suspected (web? DB?)

Solution: Add a redundant firewall, a redundant web/app server, and a hot-standby DB server

Testing focus: Test failover of components; measure recovery time and quantify failed transactions, especially lost donations

Result: Showed that the redundant components took up the current workload quickly, minimizing failed transactions and lost donations

Chart: Bandwidth throughput and error rates under load as the primary firewall and load balancer are failed (load balancer turned off to simulate production down)

Chart annotations:
Load balancer failover: throughput not affected
Firewall failover: bandwidth drops and recovers

| Graph | Graph's Min. | Graph's Ave. | Graph's Max. |
| Throughput | 0.0 | 2343400 | 4879640 |
| Running Vusers | 0.0 | 146 | 375 |
| Errors per Second | 0.0 | 0.233 | 2.078 |

Chart: Bandwidth, error rate, and web/DB CPU under load as the primary DB server is failed

Chart annotations (database failed over):
Bandwidth drops for 20 seconds
Errors increase for 2 minutes
Web server CPU increases for 2 minutes

| Measurement | Graph's Min. | Graph's Ave. | Graph's Max. |
| Throughput | 1580 | 822128 | 1167149 |
| Run | 0 | 63 | 176 |
| Error | 0 | 0.048 | 1.313 |
| % Processor Time (Processor _Total): 69.48.142.164 | 0.066 | 15 | 74 |
| % Processor Time (Processor _Total): 69.48.142.165 | 0.0 | 15 | 68 |
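The two measurements named in the testing focus for example # 1 (recovery time and failed transactions) can be derived from a per-transaction result log once the failover moment is known. The following is a minimal sketch, not the tooling used in the test: the `Sample` record, the `failover_impact` helper, and the sample data are all hypothetical, and recovery time is assumed here to mean the span from the induced failure to the last failed transaction.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    t: float   # seconds since test start (hypothetical log field)
    ok: bool   # True if the transaction succeeded


def failover_impact(samples, fail_time):
    """Return (failed_count, recovery_seconds) for failures at or after fail_time.

    recovery_seconds is the time from fail_time to the last failed
    transaction; it is 0.0 when nothing failed.
    """
    failures = [s.t for s in samples if not s.ok and s.t >= fail_time]
    if not failures:
        return 0, 0.0
    return len(failures), max(failures) - fail_time


# Made-up data: a component is failed at t=15 s and two transactions fail.
samples = [
    Sample(0.0, True), Sample(10.0, True),
    Sample(20.0, False), Sample(25.0, False),
    Sample(40.0, True), Sample(50.0, True),
]
failed, recovery = failover_impact(samples, fail_time=15.0)
print(f"failed transactions: {failed}, recovery time: {recovery} s")
```

Filtering the samples to a single transaction type (for example, donations) before calling the helper would give the lost-donation count separately.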

# 2 Endurance test to replicate dropped user sessions under load

Application: Malpractice insurance case management system (COM & ASP.NET, W2K3, SQL Server 2005)

Reliability issue: User session state was dropped and users were sent back to the login page during the peak usage period, losing transactional updates in the middle of complex workflows

Testing focus: Replicate peak load based on statistical analysis of peak-period web logs; run log over a 24-hour period while monitoring throughput and system resources

Resolution: Used the event log to identify the .NET Framework critical update that triggered the outage; removed it, retested, and pushed into production
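One simple form of the web-log analysis mentioned in the testing focus is to bucket request timestamps by clock hour and pick the busiest bucket as the load target. This is a sketch under assumptions: the timestamps are taken as already parsed from the logs, the helper names are hypothetical, and the dates are invented for illustration.

```python
from collections import Counter
from datetime import datetime


def requests_per_hour(timestamps):
    """Count requests falling in each clock hour."""
    return Counter(
        ts.replace(minute=0, second=0, microsecond=0) for ts in timestamps
    )


def peak_hour(timestamps):
    """Return (hour_start, request_count) for the busiest hour."""
    counts = requests_per_hour(timestamps)
    return max(counts.items(), key=lambda kv: kv[1])


# Made-up request times standing in for parsed web-log entries.
log_times = [
    datetime(2008, 1, 7, 9, 5),
    datetime(2008, 1, 7, 10, 1),
    datetime(2008, 1, 7, 10, 30),
    datetime(2008, 1, 7, 10, 59),
    datetime(2008, 1, 7, 11, 15),
]
hour, count = peak_hour(log_times)
print(hour, count)
```

The same bucketing applied per URL or per transaction type would give the transaction mix for the peak hour, which is what a load script needs to reproduce.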

Chart: Endurance test shows a 1-hour processing gap 7 hours into the test

| Measurement | Min | Avg | Max | Std Dev |
| casecreation_end_to_end | 2,294.2 | 2,314.1 | 2,338.1 | 18.2 |
| caseupd_end_to_end | 1,399.7 | 1,427.1 | 1,452.7 | 21.6 |
| addphysician_end_to_end | 1,273.6 | 1,311.0 | 1,384.1 | 32.5 |
| caseinq_end_to_end | 966.2 | 993.0 | 1,018.6 | 13.9 |
| advsrch_end_to_end | 912.9 | 968.6 | 1,038.7 | 39.1 |
| physinquiry_end_to_end | 903.9 | 946.8 | 1,016.8 | 24.8 |
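A processing gap like the one in this chart can also be found programmatically by scanning transaction completion times for long silences. The sketch below is illustrative only: `find_gaps` and its threshold are hypothetical, and the timestamps are synthetic (one completion every 10 minutes, with a silence from hour 7 to hour 8).

```python
def find_gaps(completion_times, min_gap_seconds):
    """Return (start, end) pairs where consecutive completions are
    at least min_gap_seconds apart."""
    ordered = sorted(completion_times)
    return [
        (a, b)
        for a, b in zip(ordered, ordered[1:])
        if b - a >= min_gap_seconds
    ]


# Synthetic data: seconds since test start, one completion per 600 s,
# nothing between 25200 s (7 h) and 28800 s (8 h).
times = list(range(0, 25201, 600)) + list(range(28800, 32401, 600))
gaps = find_gaps(times, min_gap_seconds=1800)
for start, end in gaps:
    print(f"gap from {start / 3600:.1f} h to {end / 3600:.1f} h")
```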

Chart: Bandwidth drops to zero at the outage point, recovering only slightly thereafter

| Measurement | Min | Avg | Max |
| Throughput (Bytes/Sec) | 3,476 | 10,743 | 29,450 |
| Throughput (Mbps) | 0.03 | 0.09 | 0.24 |

Load (vusers): 0, 46
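The two throughput rows in this legend are the same measurement in different units. A short sketch of the conversion, assuming decimal megabits (1 Mbps = 1,000,000 bits per second), which is the convention that reproduces the rounded figures shown; the function name is hypothetical.

```python
def bytes_per_sec_to_mbps(bytes_per_sec):
    """Convert bytes/second to megabits/second (1 Mbps = 1,000,000 bits/s)."""
    return bytes_per_sec * 8 / 1_000_000


# Values taken from the Throughput (Bytes/Sec) row of the legend.
for label, value in [("Min", 3476), ("Avg", 10743), ("Max", 29450)]:
    print(f"{label}: {bytes_per_sec_to_mbps(value):.2f} Mbps")
```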

Chart: System resource monitors show web server CPU falling to zero at the outage point (orange line)

| Measurement | Min. | Ave. | Max. | Server IP |
| % Disk Time (PhysicalDisk _Total) | 0.7 | 8.5 | 117.1 | 10.130.39.10 |
| % Processor Time (Processor _Total) | 1.013 | 2.4 | 15.4 | 10.130.39.10 |
| Available MBytes (Memory) | 669 | 804 | 959 | 10.130.39.10 |
| Avg. Disk Queue Length (PhysicalDisk _Total) | 0.02 | 0.17 | 2.3 | 10.130.39.10 |
| Lock Wait Time (ms) (MSSQL$RMFPROD Locks _Total) | 0.0 | 0.5 | 18.6 | 10.130.39.10 |
| Number of Deadlocks/sec (MSSQL$RMFPROD Locks _Total) | 0.0 | 0.0 | 0.002 | 10.130.39.10 |
| SQL Compilations/sec (MSSQL$RMFPROD SQL Statistics) | 0.0 | 0.7 | 3.1 | 10.130.39.10 |
| SQL Re-Compilations/sec (MSSQL$RMFPROD SQL Statistics) | 0.0 | 0.09 | 0.4 | 10.130.39.10 |

Load (vusers): 0, 46

# 3 JVM Heap Utilization Analysis

Application: National B2B retailer-supplier messaging system (complex Java, Oracle, WebSphere MQ; Windows, AIX, mainframe); re-architecture and redevelopment of the middleware tier using JSF IDE

Reliability issue: Severe user performance degradation under increasing load

Testing focus: Monitor JVM heap utilization as load is increased (using JConsole)

Resolution: Uncovered and reduced the JSF (JavaServer Faces) default view state queue depth to eliminate heap space depletion, which was triggering CPU-grabbing garbage collection sweeps

Chart: JVM heap space profile shows sharp GC sweeps of decreasing depth as load ramps, until the 1 GB heap is depleted

JVM heap space profile showing steady-state heap usage between 200 and 300 MB on all app servers after fix applied
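The contrast between the two heap profiles in example # 3 can be expressed as a check on the post-collection floor: if the heap level just after each GC sweep keeps rising, the heap is heading toward depletion; if it stays flat, usage is steady. The sketch below is an illustration, not the analysis used in the test. The function names, the 50 MB growth threshold, and both sample series (in MB) are hypothetical.

```python
def gc_troughs(heap_mb):
    """Return local minima of a heap-usage series, i.e. the usage
    observed just after a collection."""
    return [
        cur
        for prev, cur, nxt in zip(heap_mb, heap_mb[1:], heap_mb[2:])
        if cur < prev and cur <= nxt
    ]


def floor_is_rising(heap_mb, min_growth_mb=50.0):
    """True if the post-GC floor grew by more than min_growth_mb
    between the first and last trough."""
    troughs = gc_troughs(heap_mb)
    if len(troughs) < 2:
        return False
    return troughs[-1] - troughs[0] > min_growth_mb


# Made-up samples: each collection reclaims less, so the floor climbs.
depleting = [100, 300, 150, 400, 250, 500, 380, 600, 520, 700]
# Made-up samples: heap oscillates between roughly 200 and 300 MB.
steady = [200, 300, 210, 300, 205, 300, 208, 300]

print(floor_is_rising(depleting), floor_is_rising(steady))
```

Comparing only the first and last trough keeps the sketch short; a fitted slope over all troughs would be less sensitive to a single noisy sample.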