Redpaper. Performance Metrics in TotalStorage Productivity Center Performance Reports. Introduction. Mary Lovelace



Similar documents
The IntelliMagic White Paper on: Storage Performance Analysis for an IBM San Volume Controller (SVC) (IBM V7000)

The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000

SAN Conceptual and Design Basics

Hitachi Path Management & Load Balancing with Hitachi Dynamic Link Manager and Global Link Availability Manager

Xserve G5 Using the Hardware RAID PCI Card Instructions for using the software provided with the Hardware RAID PCI Card

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

The IntelliMagic White Paper: SMI-S for Data Collection of Storage Performance Metrics. December 2010

Q & A From Hitachi Data Systems WebTech Presentation:

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

WebBIOS Configuration Utility Guide

Chapter 11 I/O Management and Disk Scheduling

IBM TotalStorage SAN Volume Controller. Configuration. Version SC

Best practices for operational excellence (SharePoint Server 2010)

All-Flash Arrays Weren t Built for Dynamic Environments. Here s Why... This whitepaper is based on content originally posted at

Array Performance 101 Part 4

my forecasted needs. The constraint of asymmetrical processing was offset two ways. The first was by configuring the SAN and all hosts to utilize

HP Smart Array Controllers and basic RAID performance factors

Physical Data Organization

CA Nimsoft Monitor Snap

IBM Tivoli Storage Productivity Center (TPC)

The IntelliMagic White Paper: Green Storage: Reduce Power not Performance. December 2010

Definition of RAID Levels

Microsoft Exchange Server 2003 Deployment Considerations

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

How To Virtualize A Storage Area Network (San) With Virtualization

HP Storage Essentials Storage Resource Management Software end-to-end SAN Performance monitoring and analysis

Violin Memory Arrays With IBM System Storage SAN Volume Control

SPC BENCHMARK 1 FULL DISCLOSURE REPORT IBM CORPORATION IBM SYSTEM STORAGE SAN VOLUME CONTROLLER V6.2 SPC-1 V1.12

Lecture 36: Chapter 6

CA Nimsoft Monitor. Probe Guide for Apache HTTP Server Monitoring. apache v1.5 series

features at a glance

Timing of a Disk I/O Transfer

Storage Networking Management & Administration Workshop

Chapter 11 I/O Management and Disk Scheduling

ENTERPRISE INFRASTRUCTURE CONFIGURATION GUIDE

Chapter 13. Chapter Outline. Disk Storage, Basic File Structures, and Hashing

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

Secondary Storage. Any modern computer system will incorporate (at least) two levels of storage: magnetic disk/optical devices/tape systems

Server Management Tools (ASMT)

One of the database administrators

Chapter 13 Disk Storage, Basic File Structures, and Hashing.

89 Fifth Avenue, 7th Floor. New York, NY White Paper. HP 3PAR Adaptive Flash Cache: A Competitive Comparison

Monitoring Microsoft Exchange to Improve Performance and Availability

Using VMware ESX Server with IBM System Storage SAN Volume Controller ESX Server 3.0.2

High Availability and Disaster Recovery for Exchange Servers Through a Mailbox Replication Approach

VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5

HP StorageWorks Command View EVA user guide

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

Lesson Objectives. To provide a grand tour of the major operating systems components To provide coverage of basic computer system organization

CA Nimsoft Monitor. Probe Guide for IBM SVC Monitoring. ibm_svc v1.0 series

Performance Monitoring User s Manual

OVERVIEW. CEP Cluster Server is Ideal For: First-time users who want to make applications highly available

Long-Distance Configurations for MSCS with IBM Enterprise Storage Server

Intel RAID Controllers

RAID technology and IBM TotalStorage NAS products

The team that wrote this redbook Comments welcome Introduction p. 1 Three phases p. 1 Netfinity Performance Lab p. 2 IBM Center for Microsoft

Using Multipathing Technology to Achieve a High Availability Solution

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

StreamServe Persuasion SP5 Microsoft SQL Server

SOFTWAREDEFINED-STORAGE

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 13-1

HP LeftHand SAN Solutions

CA Unified Infrastructure Management

Designing HP SAN Networking Solutions

IPRO ecapture Performance Report using BlueArc Titan Network Storage System

CA Unified Infrastructure Management

FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency

VMware Virtual SAN Hardware Guidance. TECHNICAL MARKETING DOCUMENTATION v 1.0

Running a Workflow on a PowerCenter Grid

Evaluation Report: Database Acceleration with HP 3PAR StoreServ 7450 All-flash Storage

Symantec FileStore N8300 Clustered NAS Storage System V100R002. Glossary. Issue 01 Date Symantec Corporation

IBM Tivoli Composite Application Manager for Microsoft Applications: Microsoft Internet Information Services Agent Version Fix Pack 2.

IBM Tivoli Composite Application Manager for Microsoft Applications: Microsoft Exchange Server Agent Version Fix Pack 2.

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

Monitoring PostgreSQL database with Verax NMS

IBM PowerHA SystemMirror for i. Performance Information

CA Nimsoft Monitor. Probe Guide for IIS Server Monitoring. iis v1.5 series

Application Note 195. ARM11 performance monitor unit. Document number: ARM DAI 195B Issued: 15th February, 2008 Copyright ARM Limited 2007

Managing your Domino Clusters

Monitoring Agent for Microsoft Exchange Server Fix Pack 9. Reference IBM

CHAPTER 2: HARDWARE BASICS: INSIDE THE BOX

Oracle Database Scalability in VMware ESX VMware ESX 3.5

Chapter 13. Disk Storage, Basic File Structures, and Hashing

Microsoft SQL Server Always On Technologies

WHITE PAPER FUJITSU PRIMERGY SERVER BASICS OF DISK I/O PERFORMANCE

High Performance Tier Implementation Guideline

IBM Tivoli Monitoring Version 6.3 Fix Pack 2. Infrastructure Management Dashboards for Servers Reference

Cisco Active Network Abstraction Gateway High Availability Solution

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

GoGrid Implement.com Configuring a SQL Server 2012 AlwaysOn Cluster

Deploying Global Clusters for Site Disaster Recovery via Symantec Storage Foundation on Infortrend Systems

Post Production Video Editing Solution Guide with Apple Xsan File System AssuredSAN 4000

Transcription:

Redpaper Mary Lovelace Performance Metrics in TotalStorage Productivity Center Performance Reports Introduction This Redpaper contains the TotalStorage Productivity Center performance metrics that are available for the Storage Subsystem performance reports and Switch performance reports. The metrics are listed by report in the order in which they appear in the TotalStorage Productivity Center main Navigation Tree. Copyright IBM Corp. 2008. All rights reserved. ibm.com/redbooks 1

Performance metric collection We begin with a high level discussion of the way that TPC collects performance metrics from storage devices and switches. The performance counters are usually kept in device firmware, then pulled out for processing by CIM agents, and forwarded to TPC for final calculations and insertion into the TPC database. For most devices, the counters kept in firmware are monotonically increasing values. Over time, these values go up and only up. Consequently, it is necessary to pull two samples of the counters, separated by a number of seconds, in order to take the difference in the counters and calculate metrics, such as I/O rates, using the known time between samples. For example, each time that an I/O (a read or write) is issued to a volume, several counters (I/O count and Bytes transferred) increment. If the counters are pulled at times T1 and T2, the number of I/Os in the sample interval is obtained by subtracting the counters at time T1 from the counters at time T2 (T2-T1). When this count is divided by the number of seconds between T1 and T2, we obtain the I/O rate in I/Os/second for the sample interval (T1 to T2). This is the technique, and it is simple for metrics, such as I/O rate, data rate, average transfer size, and so forth. Other metrics, such as Read hit ratios or Disk Utilization, involve other calculations involving sampled counters and times T1 and T2. The counters in the firmware are usually unsigned 32-bit or 64-bit counters. Eventually, these counters wrap, meaning that the difference between the counters at T2 and T1 might be difficult to interpret. The TPC Performance Manager attempts to adjust for these wraps during its delta computations, but there might be unexpected wraps, which can confuse the Common Information Module (CIM) agent or the TPC Performance Manager. The TPC Performance Manager stores the deltas in the database. Certain counters are also stored in the TPC database, but the performance data is mostly comprised of rates and other calculated metrics that depend on the counter deltas and the sample interval, that is, the time between T1 and T2. The primary and essential performance metrics are few and simple, for example, Read I/O Rate, Write I/O Rate, Read Response, and Write Response. Also important are data rates and transfer sizes. Cache behaviors in the form of Read Hit Ratio and Write Cache delays (percentages and rates) are important. There are a myriad of additional metrics in the TPC performance reports, but they need to be used as adjuncts to the primary metrics, sometimes helping you to understand why the primary metrics have the values they have. There are very few metrics that measure other kinds of values. For example, the SVC storage subsystem also reports the maximum read and write response times that occur between times T1 and T2. Each time that a sample of the counters is pulled, this type of counter is set back to zero. But the vast majority of counters are monotonically increasing, reset to zero only by very particular circumstances, such as hardware, software, or firmware resets. The design of the TPC Performance Manager allows several storage subsystems to be included in a report (or individual subsystems by selection or filtering). But not all the metrics apply to every subsystem or component. In these cases, a -1 appears, indicating that no data is expected for the metric in this particular case. In the remainder of this section, we look at the metrics that can be selected for each report. We examine the reports in the order in which they appear in the TPC Navigation Tree. Reports under Disk Manager Storage Subsystem Performance: By Storage Subsystem By Controller 2 Performance Metrics in TotalStorage Productivity Center Performance Reports

By I/O Group By Node By Array By Managed Disk Group By Volume By Managed Disk By Port Reports under the Fabric Manager Switch Performance: By Port By Storage Subsystem report Table 1 shows the By Storage Subsystem report metrics. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 1 By Storage Subsystem report metrics Subsystem Interval Read I/O Rate (normal) Read I/O Rate (sequential) Read I/O Rate (overall) Write I/O Rate (normal) Write I/O Rate (sequential) Write I/O Rate (overall) Total I/O Rate (normal) Component id Start time of the Length of sample interval in seconds. Average number of normal read operations per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential read operations per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of read operations per second for the Applies to most subsystems. Average number of normal write operations per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential write operations per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of write operations per second for the Applies to most subsystems. Average number of normal reads and writes per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Performance Metrics in TotalStorage Productivity Center Performance Reports 3

Total I/O Rate (sequential) Total I/O Rate (overall) Read Cache Hit Percentage (normal) Read Cache Hits Percentage (sequential) Read Cache Hits Percentage (overall) Write Cache Hits Percentage (normal) Write Cache Hits Percentage (sequential) Write Cache Hits Percentage (overall) Total Cache Hits Percentage (normal) Total Cache Hits Percentage (sequential) Total Cache Hits Percentage (overall) Read Data Rate Write Data Rate Total Data Rate Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Average number of sequential reads and writes per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of reads and writes per second for the Applies to most subsystems. Percentage of normal (random) reads that are cache hits during the Only for ESS, DS8000, and DS6000. Percentage of sequential reads that are cache hits in the Only for ESS, DS8000, and DS6000. Percentage of reads during the sample interval that are found in cache. This is an important metric. Percentage of normal (random) writes that are handled in cache. This number needs to be 100%. Only for ESS, DS8000, and DS6000. Percentage of sequential writes that are handled in cache. This number needs to be 100%. Only for ESS, DS8000, and DS6000. Percentage of writes that are handled in cache. This number needs to be 100% for most enterprise storage. Percentage of normal reads and writes that are cache hits during the Percentage of sequential reads and writes that are cache hits during the Weighted average of read cache hits and write cache hits. Average read data rate in megabytes per second during the Average write data rate in megabytes per second during the Average total (read + write) data rate in megabytes per second during the Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the Average transfer size in kilobytes for writes during the Average transfer size in kilobytes for all I/O during the 4 Performance Metrics in TotalStorage Productivity Center Performance Reports

Record Mode Read I/O Rate Record Mode Read Cache Hit Percentage Disk to Cache Transfer Rate Cache to Disk Transfer Rate Write-cache Delay Percentage Write-cache Delay I/O Rate Cache Holding Backend Read I/O Rate Backend Write I/O Rate Total Backend I/O Rate Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Backend Read Response Backend Write Response Overall Backend Response Backend Read Transfer Size This is the rate in I/O/sec for a special kind of read activity detected by ESS, DS8000, and DS6000. Only the requested data is managed in cache rather than a full track or most of a track of data. Read Hit percentage for the special class of reads mentioned above. ESS, DS8000, and DS6000 only. Average number of track transfers per second from disk to cache during the Average number of track transfers per second from cache to disk during the Percentage of all I/O operations that were delayed due to write-cache space constraints or other conditions during the Only writes can be delayed, but the percentage is of all I/O. The rate of I/O (actually writes) that are delayed during the sample interval because of write cache. The average number of seconds that a piece of data stays in cache. This value is calculated using Little s Law, only for DS8000, DS6000, and ESS. The average read rate in reads per second caused by read misses. This is the read rate to the backend storage for the The average write rate in writes per second caused by front-end write activity. This is the write rate to the backend storage for the These are logical writes and the actual number of physical I/O operations depends on whether the storage is RAID 5, RAID 10, or some other architecture. The sum of Backend Read I/O Rate and Backend Write I/O Rate over the Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the Average response time in milliseconds for read operations to the backend storage. Average response time in milliseconds for write operations to the backend storage. This time might include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend Read and Write Response s during the The average transfer size in kilobytes for reads to the backend storage during the Performance Metrics in TotalStorage Productivity Center Performance Reports 5

Backend Write Transfer Size Overall Backend Transfer Size Port Send I/O Rate Port Receive I/O Rate Total Port I/O Rate Port Send Data Rate Port Receive Data Rate Total Port Data Rate Port Send Response Port Receive Response Total Port Response Port Send Transfer Size Port Receive Transfer Size Total Port Transfer Size Read Queue Write Queue Overall Queue Readahead Percentage of Cache Hits The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for backend reads and writes during the The average rate per second for operations that send data from an I/O port, typically to a server. This is typically a read from the server s perspective. The average rate per second for operations where the storage port receives data, typically from a server. This is typically a write from the server s perspective. Average read plus write I/O rate per second at the storage port during the The average data rate in megabytes per second for operations that send data from an I/O port, typically to a server. The average data rate in megabytes per second for operations where the storage port receives data, typically from a server. Average read plus write data rate in megabytes per second at the storage port during the Average number of milliseconds that it took to service each port send (server read) operation, for a particular port over the Average number of milliseconds that it took to service each port receive (server write) operation, for a particular port over the Weighted average port send and port receive time over the Average size in kilobytes per Port Send operation during the Average size in kilobytes per Port Receive operation during the Average size in kilobytes per port transfer during the sample For SVC, the average number of milliseconds that each read operation during the sample interval spent on the queue before being issued to the backend storage device. For SVC, the average number of milliseconds that each write operation during the sample interval spent on the queue before being issued to the backend storage device. For SVC, the weighted average of Read Queue and Write Queue during the For SVC, an obscure measurement of cache hits involving data that has been prestaged for one reason or another. 6 Performance Metrics in TotalStorage Productivity Center Performance Reports

Dirty Write Percentage of Cache Hits Write Cache Overflow Percentage Write Cache Overflow I/O Rate Write Cache Flush-through Percentage Write Cache Flush-through I/O Rate Write Cache Write-through Percentage Write Cache Write-through I/O Rate CPU Utilization Percentage Port to Host Send I/O Rate Port to Host Receive I/O Rate Total Port to Host I/O Rate Port to Disk Send I/O Rate Port to Disk Receive I/O Rate Total Port to Disk I/O Rate Port to Local Node Send I/O Rate Port to Local Node Receive I/O Rate Total Port to Local Node I/O Rate Port to Remote Node Send I/O Rate For SVC, the percentage of write cache hits, which modified only data that was already marked dirty in the cache; rewritten data. This is an obscure measurement of how effectively writes are coalesced before destaging. For SVC, the percentage of write operations that were delayed due to lack of write-cache space during the sample For SVC, the average rate per second of write operations that were delayed due to lack of write-cache space during the For SVC, the percentage of write operations that were processed in Flush-through write mode during the sample For SVC, the average rate per second of tracks processed in Flush-through write mode during the For SVC, the percentage of write operations that were processed in Write-through write mode during the sample For SVC, the average number of tracks per second that were processed in Write-through write mode during the sample For SVC the average utilization of the cluster node controllers during the For SVC, the rate per second of port send to host (server) during the For SVC, the rate per second of port receive operations from host (server) during the For SVC, total of port send and receive I/O rate during the For SVC, the rate per second of port send to backend storage during the For SVC, the rate per second of port receive operations from backend storage during the For SVC, the sum of port to disk send and port to disk receive rates during the For SVC, the rate per second at which a port sends I/O to other nodes in the local cluster during the For SVC, the rate at which a port receives I/O from other nodes in the local cluster during the For SVC, the sum of port to local node send and receive rates during the For SVC, the average number of exchanges (I/Os) per second sent to nodes in the remote SVC cluster during the Typically, some form of remote mirroring. Performance Metrics in TotalStorage Productivity Center Performance Reports 7

Port to Remote Node Receive I/O Rate Total Port to Remote Node I/O Rate Port to Host Send Data Rate Port to Host Receive Data Rate Total Port to Host Data Rate Port to Disk Send Data Rate Port to Disk Receive Data Rate Total Port to Disk Data Rate Port to Local Node Send Data Rate Port to Local Node Receive Data Rate Total Port to Local Node Data Rate Port to Remote Node Send Data Rate Port to Remote Node Receive Data Rate Total Port to Remote Node Data Rate Port to Local Node Send Response Port to Local Node Receive Response Overall Port to Local Node Response Port to Local Node Send Queue Port to Local Node Receive Queue Overall Port to Local Node Queue For SVC, the average number of exchanges (I/Os) per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the sum of port to remote node send and receive I/O per second during the For SVC, the megabytes per second of port send to host (server) during the For SVC, the megabytes per second of port receive operations from host (server) during the For SVC, total of port send and receive megabytes per second during the For SVC, the megabytes per second of port send to backend storage during the For SVC, the megabytes per second of port receive operations from backend storage during the For SVC, the sum of port to disk send and port to disk receive megabytes per second during the For SVC, the megabytes per second at which a port sends I/O to other nodes in the local cluster during the sample For SVC, the megabytes per second at which a port receives I/O from other nodes in the local cluster during the For SVC, the sum of port to local node send and receive megabytes per second during the For SVC, the average number of megabytes per second sent to nodes in the remote SVC cluster during the sample Typically, some form of remote mirroring. For SVC, the average number of megabytes per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the sum of port to remote node send and receive megabytes per second during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for this operation during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these receive operations are executed. For SVC, the average time in milliseconds waiting before these port send or port receive operations are executed. 8 Performance Metrics in TotalStorage Productivity Center Performance Reports

Port to Remote Node Send Response Port to Remote Node Receive Response Overall Port to Remote Node Response Port to Remote Node Send Queue Port to Remote Node Receive Queue Overall Port to Remote Node Queue Global Mirror Write I/O Rate Global Mirror Overlapping Write Percentage Global Mirror Overlapping Write I/O Rate Peak Read Response Peak Write Response Global Mirror Secondary Write Lag For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these operations are executed. For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the rate in writes per second issued to the secondary site for Global Mirror during the For SVC, the percentage of writes during the sample interval for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the average rate in writes per second during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the peak read response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the peak write response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the number of additional milliseconds it took to service each secondary write operation for Global Mirror, over and above the time needed to service the primary writes during the By Controller report The metrics for the By Controller report are shown in Table 2. This report is only for DS8000, DS6000, and ESS. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 2 By Controller report metrics Subsystem Controller Interval Component ID. Component ID. Start time of the Length of sample interval in seconds. Performance Metrics in TotalStorage Productivity Center Performance Reports 9

Read I/O Rate (normal) Read I/O Rate (sequential) Read I/O Rate (overall) Write I/O Rate (normal) Write I/O Rate (sequential) Write I/O Rate (overall) Total I/O Rate (normal) Total I/O Rate (sequential) Total I/O Rate (overall) Read Cache Hits Percentage (normal) Read Cache Hits Percentage (sequential) Read Cache Hits Percentage (overall) Write Cache Hits Percentage (normal) Write Cache Hits Percentage (sequential) Write Cache Hits Percentage (overall) Total Cache Hits Percentage (normal) Total Cache Hits Percentage (sequential) Total Cache Hits Percentage (overall) Read Data Rate Write Data Rate Average number of normal read operations per second for the Normal operations are not sequential, hence random. Average number of sequential read operations per second for the Sequential IO is detected by the subsystem. Average number of read operations per second for the Average number of normal write operations per second for the Normal operations are not sequential, hence random. Average number of sequential write operations per second for the Sequential I/O is detected by the subsystem. Average number of write operations per second for the Average number of normal reads and writes per second for the Normal operations are not sequential, hence random. Average number of sequential reads and writes per second for the Sequential I/O is detected by the subsystem. Average number of reads and writes per second for the Percentage of normal (random) reads that are cache hits during the Percentage of sequential reads that are cache hits in the Percentage of reads during the sample interval that are found in cache. This is an important metric. Percentage of normal (random) writes that are handled in cache. This number needs to be 100%. Percentage of sequential writes that are handled in cache. This number needs to be 100%. Percentage of writes that are handled in cache. This number needs to be 100% for most enterprise storage. Weighted average of read cache hits and write cache hits. Percentage of sequential reads and writes that are cache hits during the Weighted average of read cache hits and write cache hits. Average read data rate in megabytes per second during the Average write data rate in megabytes per second during the 10 Performance Metrics in TotalStorage Productivity Center Performance Reports

Total Data Rate Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Record Mode Read I/O Rate Record Mode Read Cache Hit Percentage Disk to Cache Transfer Rate Cache to Disk Transfer Rate Write-cache Delay Percentage Write-cache Delay I/O Rate Cache Holding Backend Read I/O Rate Backend Write I/O Rate Total Backend I/O Rate Average total (read + write) data rate in megabytes per second during the Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the Average transfer size in kilobytes for writes during the Average transfer size in kilobytes for all I/O during the This is the rate in I/O/sec for a special kind of read activity detected by ESS, DS8000, and DS6000. Only the requested data is managed in cache rather than a full track or most of a track of data. Read Hit percentage for the special class of reads mentioned above. Average number of track transfers per second from disk to cache during the Average number of track transfers per second from cache to disk during the Percentage of all I/O operations that were delayed due to write-cache space constraints or other conditions during the Only writes can be delayed, but the percentage is of all I/O. This is sometimes called non-volatile storage (NVS) Full. The rate of I/O (actually writes) that are delayed during the sample interval because of write cache, sometimes called NVS Full. The average number of seconds a piece of data stays in cache. This value is calculated using Little s Law. The average read rate in reads per second caused by read misses. This is the read rate to the backend RAID arrays for the The average write rate in writes per second caused by front-end write activity. This is the write rate to the backend storage for the These are logical writes and the actual number of physical I/O operations depends on whether the storage is RAID 5, RAID 10, or some other architecture. The sum of Backend Read I/O Rate and Backend Write I/O Rate over the Performance Metrics in TotalStorage Productivity Center Performance Reports 11

Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Backend Read Response Backend Write Response Overall Backend Response Backend Read Transfer Size Backend Write Transfer Size Overall Backend Transfer Size Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the Average response time in milliseconds for read operations to the backend storage. Average response time in milliseconds for write operations to the backend storage. This time can include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend Read and Write Response s during the The average transfer size in kilobytes for reads to the backend storage during the The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for backend reads and writes during the By I/O Group report The By I/O Group report (report metrics are shown in Table 3) is an SAN Volume Controller (SVC) specific report. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 3 By I/O Group report metrics Subsystem I/O group Interval Read I/O Rate (overall) Write I/O Rate (overall) Total I/O Rate (overall) Read Cache Hits Percentage (overall) Write Cache Hits Percentage (overall) Name of the SVC cluster. The I/O group ID. Start time of the Length of sample interval in seconds. Average number of read operations per second for the sample Applies to most subsystems. Average number of write operations per second for the sample Applies to most subsystems. Average number of reads and writes per second for the sample Applies to most subsystems. Percentage of reads during the sample interval that are found in cache. This is an important metric. Percentage of writes that are handled in cache. This number must be 100% for most enterprise storage. 12 Performance Metrics in TotalStorage Productivity Center Performance Reports

Total Cache Hits Percentage (overall) Read Data Rate Write Data Rate Total Data Rate Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Disk to Cache Transfer Rate Cache to Disk Transfer Rate Write-cache Delay Percentage Write-cache Delay I/O Rate Backend Read I/O Rate Backend Write I/O Rate Total Backend I/O Rate Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Percentage of normal reads and writes that are cache hits during the Average read data rate in megabytes per second during the Average write data rate in megabytes per second during the Average total (read + write) data rate in megabytes per second during the Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the sample Average transfer size in kilobytes for writes during the sample Average transfer size in kilobytes for all I/O during the sample Average number of track transfers per second from disk to cache during the Average number of track transfers per second from cache to disk during the Percentage of all I/O operations that were delayed due to write-cache space constraints or other conditions during the Only writes can be delayed, but the percentage is of all I/O. The rate of I/O (actually writes) that are delayed during the sample interval because of write cache. The average read rate in reads per second caused by read misses. This is the read rate to the backend storage for the The average write rate in writes per second caused by front-end write activity. This is the write rate to the backend storage for the These are logical writes. The sum of Backend Read I/O Rate and Backend Write I/O Rate over the Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the sample Performance Metrics in TotalStorage Productivity Center Performance Reports 13

Backend Read Response Backend Write Response Overall Backend Response Read Queue Write Queue Overall Queue Backend Read Transfer Size Backend Write Transfer Size Overall Backend Transfer Size Port Send I/O Rate Port Receive I/O Rate Total Port I/O Rate Port Send Data Rate Port Receive Data Rate Total Port Data Rate Readahead Percentage of Cache Hits Dirty Write Percentage of Cache Hits Write Cache Overflow Percentage Write Cache Overflow I/O Rate Average response time in milliseconds for read operations to the backend storage. Average response time in milliseconds for write operations to the backend storage. This time can include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend Read and Write Response s during the The average number of milliseconds that each read operation during the sample interval spent on the queue before being issued to the backend storage device. The average number of milliseconds that each write operation during the sample interval spent on the queue before being issued to the backend storage device. The weighted average of Read Queue and Write Queue during the The average transfer size in kilobytes for reads to the backend storage during the The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for Backend Reads and Writes during the The average rate per second for operations that send data from an I/O port, typically to a server. This is typically a read from the server s perspective. The average rate per second for operations where the storage port receives data, typically from a server. This is typically a write from the server s perspective. Average read plus write I/O rate per second at the storage port during the The average data rate in megabytes per second for operations that send data from an I/O port, typically to a server. The average data rate in megabytes per second for operations where the storage port receives data, typically from a server. Average read plus write data rate in megabytes per second at the storage port during the An obscure measurement of cache hits involving data that has been prestaged for one reason or another. The percentage of write cache hits which modified only data that was already marked dirty in the cache; rewritten data. This is an obscure measurement of how effectively writes are coalesced before destaging. For SVC, the percentage of write operations that were delayed due to lack of write-cache space during the For SVC, the average rate per second of write operations that were delayed due to lack of write-cache space during the 14 Performance Metrics in TotalStorage Productivity Center Performance Reports

Write Cache Flush-through Percentage Write Cache Flush-through I/O Rate Write Cache Write-through Percentage Write Cache Write-through I/O Rate CPU Utilization Percentage Port to Host Send I/O Rate Port to Host Receive I/O Rate Total Port to Host I/O Rate Port to Disk Send I/O Rate Port to Disk Receive I/O Rate Total Port to Disk I/O Rate Port to Local Node Send I/O Rate Port to Local Node Receive I/O Rate Total Port to Local Node I/O Rate Port to Remote Node Send I/O Rate Port to Remote Node Receive I/O Rate Total Port to Remote Node I/O Rate Port to Host Send Data Rate Port to Host Receive Data Rate Total Port to Host Data Rate For SVC, the percentage of write operations that were processed in Flush-through write mode during the sample For SVC, the average rate per second of tracks processed in Flush-through write mode during the For SVC, the percentage of write operations that were processed in Write-through write mode during the sample For SVC, the average number of tracks per second that were processed in Write-through write mode during the sample The average utilization of the node controllers in this I/O group during the For SVC, the rate per second of port send to host (server) during the For SVC, the rate per second of port receive operations from host (server) during the For SVC, total of port send and receive I/O rate during the For SVC, the rate per second of port send to backend storage during the For SVC, the rate per second of port receive operations from backend storage during the For SVC, the sum of port to disk send and port to disk receive rates during the For SVC, the rate per second at which a port sends I/O to other nodes in the local cluster during the For SVC, the rate at which a port receives I/O from other nodes in the local cluster during the For SVC, the sum of port to local node send and receive rates during the For SVC, the average number of exchanges (I/Os) per second sent to nodes in the remote SVC cluster during the sample Typically, some form of remote mirroring. For SVC, the average number of exchanges (I/Os) per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the sum of port to remote node send and receive I/O per second during the For SVC, the megabytes per second of port send to host (server) during the For SVC, the megabytes per second of port receive operations from host (server) during the For SVC, the total of port send and receive megabytes per second during the Performance Metrics in TotalStorage Productivity Center Performance Reports 15

Port to Disk Send Data Rate Port to Disk Receive Data Rate Total Port to Disk Data Rate Port to Local Node Send Data Rate Port to Local Node Receive Data Rate Total Port to Local Node Data Rate Port to Remote Node Send Data Rate Port to Remote Node Receive Data Rate Total Port to Remote Node Data Rate Port to Local Node Send Response Port to Local Node Receive Response Overall Port to Local Node Response Port to Local Node Send Queue Port to Local Node Receive Queue Overall Port to Local Node Queue Port to Remote Node Send Response Port to Remote Node Receive Response Overall Port to Remote Node Response Port to Remote Node Send Queue Port to Remote Node Receive Queue Overall Port to Remote Node Queue For SVC, the megabytes per second of port send to backend storage during the For SVC, the megabytes per second of port receive operations from backend storage during the For SVC, the sum of port to disk send and port to disk receive megabytes per second during the For SVC, the megabytes per second at which a port sends I/O to other nodes in the local cluster during the For SVC, the megabytes per second at which a port receives I/O from other nodes in the local cluster during the sample For SVC, the sum of port to local node send and receive megabytes per second during the For SVC, the average number of megabytes per second sent to nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the average number of megabytes per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the sum of port to remote node send and receive megabytes per second during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for this operation during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these receive operations are executed. For SVC, the average time in milliseconds waiting before these port send or receive operations are executed. For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these send operations are executed. 16 Performance Metrics in TotalStorage Productivity Center Performance Reports

Global Mirror Write I/O Rate Global Mirror Overlapping Write Percentage Global Mirror Overlapping Write I/O Rate Peak Read Response Peak Write Response Global Mirror Secondary Write Lag For SVC, the rate in writes per second issued to the secondary site for Global Mirror during the For SVC, the percentage of writes during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the average rate in writes per second during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the peak read response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the peak write response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the number of additional milliseconds it took to service each secondary write operation for Global Mirror, over and above the time needed to service the primary writes during the By Node report The By Node report (metrics shown in Table 4) is an SVC specific report. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 4 By Node report metrics Subsystem I/O group Interval Read I/O Rate (overall) Write I/O Rate (overall) Total I/O Rate (overall) Read Cache Hits Percentage (overall) Write Cache Hits Percentage (overall) Total Cache Hits Percentage (overall) Read Data Rate Name of the SVC cluster. The I/O group ID. Start time of the Length of sample interval in seconds. Average number of read operations per second for the Applies to most subsystems. Average number of write operations per second for the Applies to most subsystems. Average number of reads and writes per second for the Applies to most subsystems. Percentage of reads during the sample interval that are found in cache. This is an important metric. Percentage of writes that are handled in cache. This number needs to be almost 100% Weighted average of read cache hits and write cache hits. Average read data rate in megabytes per second during the Performance Metrics in TotalStorage Productivity Center Performance Reports 17

Write Data Rate Total Data Rate Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Disk to Cache Transfer Rate Cache to Disk Transfer Rate Write-cache Delay Percentage Write-cache Delay I/O Rate Backend Read I/O Rate Backend Write I/O Rate Total Backend I/O Rate Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Backend Read Response Average write data rate in megabytes per second during the Average total (read + write) data rate in megabytes per second during the Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the Average transfer size in kilobytes for writes during the Average transfer size in kilobytes for all I/O during the Average number of track transfers per second from disk to cache during the Average number of track transfers per second from cache to disk during the Percentage of all I/O operations that were delayed due to write-cache space constraints or other conditions during the Only writes can be delayed, but the percentage is of all I/O. The rate of I/O (actually writes) that are delayed during the sample interval because of write cache. The average read rate in reads per second caused by read misses. This is the read rate to the backend storage for the The average write rate in writes per second caused by front-end write activity. This is the write rate to the backend storage for the These are logical writes. The sum of Backend Read I/O Rate and Backend Write I/O Rate over the Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the Average response time in milliseconds for read operations to the backend storage. 18 Performance Metrics in TotalStorage Productivity Center Performance Reports

Backend Write Response Overall Backend Response Read Queue Write Queue Overall Queue Backend Read Transfer Size Backend Write Transfer Size Overall Backend Transfer Size Port Send I/O Rate Port Receive I/O Rate Total Port I/O Rate Port Send Data Rate Port Receive Data Rate Total Port Data Rate Readahead Percentage of Cache Hits Dirty Write Percentage of Cache Hits Write Cache Overflow Percentage Average response time in milliseconds for write operations to the backend storage. This time might include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend read and write response times during the The average number of milliseconds that each read operation during the sample interval spent on the queue before being issued to the backend storage device. The average number of milliseconds that each write operation during the sample interval spent on the queue before being issued to the backend storage device. The weighted average of Read Queue and Write Queue during the The average transfer size in kilobytes for reads to the backend storage during the The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for Backend Reads and Writes during the The average rate per second for operations that send data from an I/O port, typically to a server. This is typically a read from the server s perspective. The average rate per second for operations where the storage port receives data, typically from a server. This is typically a write from the server s perspective. Average read plus write I/O rate per second at the storage port during the The average data rate in megabytes per second for operations that send data from an I/O port, typically to a server. The average data rate in megabytes per second for operations where the storage port receives data, typically from a server. Average read plus write data rate in megabytes per second at the storage port during the An obscure measurement of cache hits involving data that has been prestaged for one reason or another. The percentage of write cache hits, which modified only data that was already marked dirty in the cache; rewritten data. This is an obscure measurement of how effectively writes are coalesced before destaging. For SVC, the percentage of write operations that were delayed due to lack of write-cache space during the sample Performance Metrics in TotalStorage Productivity Center Performance Reports 19

Write Cache Overflow I/O Rate Write Cache Flush-through Percentage Write Cache Flush-through I/O Rate Write Cache Write-through Percentage Write Cache Write-through I/O Rate CPU Utilization Percentage Port to Host Send I/O Rate Port to Host Receive I/O Rate Total Port to Host I/O Rate Port to Disk Send I/O Rate Port to Disk Receive I/O Rate Total Port to Disk I/O Rate Port to Local Node Send I/O Rate Port to Local Node Receive I/O Rate Total Port to Local Node I/O Rate Port to Remote Node Send I/O Rate Port to Remote Node Receive I/O Rate Total Port to Remote Node I/O Rate Port to Host Send Data Rate For SVC, the average rate per second of write operations that were delayed due to lack of write-cache space during the For SVC, the percentage of write operations that were processed in Flush-through write mode during the sample For SVC, the average rate per second of tracks processed in Flush-through write mode during the For SVC, the percentage of write operations that were processed in Write-through write mode during the sample For SVC, the average number of tracks per second that were processed in Write-through write mode during the The average utilization of the node controllers in this I/O group during the For SVC, the rate per second of port send to host (server) during the For SVC, the rate per second of port receive operations from host (server) during the For SVC, total of port send and receive I/O rate during the For SVC, the rate per second of port send to backend storage during the For SVC, the rate per second of port receive operations from backend storage during the For SVC, the sum of port to disk send and port to disk receive rates during the For SVC, the rate per second at which a port sends I/O to other nodes in the local cluster during the For SVC, the rate at which a port receives I/O from other nodes in the local cluster during the For SVC, the sum of port to local node send and receive rates during the For SVC, the average number of exchanges (I/Os) per second sent to nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the average number of exchanges (I/Os) per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the sum of port to remote node send and receive I/O per second during the For SVC, the megabytes per second of port send to host (server) during the 20 Performance Metrics in TotalStorage Productivity Center Performance Reports

Port to Host Receive Data Rate Total Port to Host Data Rate Port to Disk Send Data Rate Port to Disk Receive Data Rate Total Port to Disk Data Rate Port to Local Node Send Data Rate Port to Local Node Receive Data Rate Total Port to Local Node Data Rate Port to Remote Node Send Data Rate Port to Remote Node Receive Data Rate Total Port to Remote Node Data Rate Port to Local Node Send Response Port to Local Node Receive Response Overall Port to Local Node Response Port to Local Node Send Queue Port to Local Node Receive Queue Overall Port to Local Node Queue Port to Remote Node Send Response Port to Remote Node Receive Response Overall Port to Remote Node Response For SVC, the megabytes per second of port receive operations from host (server) during the For SVC, total of port send and receive megabytes per second during the For SVC, the megabytes per second of port send to backend storage during the For SVC, the megabytes per second of port receive operations from backend storage during the sample For SVC, the sum of port to disk send and port to disk receive megabytes per second during the For SVC, the megabytes per second at which a port sends I/O to other nodes in the local cluster during the sample For SVC, the megabytes per second at which a port receives I/O from other nodes in the local cluster during the For SVC, the sum of port to local node send and receive megabytes per second during the For SVC, the average number of megabytes per second sent to nodes in the remote SVC cluster during the sample Typically, some form of remote mirroring. For SVC, the average number of megabytes per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the sum of port to remote node send and receive megabytes per second during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for this operation during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these receive operations are executed. For SVC, the average time in milliseconds waiting before these port send or receive operations are executed. For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for these operations during the For SVC, the average port service time in milliseconds for these operations during the Performance Metrics in TotalStorage Productivity Center Performance Reports 21

Port to Remote Node Send Queue Port to Remote Node Receive Queue Overall Port to Remote Node Queue Global Mirror Write I/O Rate Global Mirror Overlapping Write Percentage Global Mirror Overlapping Write I/O Rate Peak Read Response Peak Write Response Global Mirror Secondary Write Lag For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the average time in milliseconds waiting in queue before these send operations are executed. For SVC, the rate in writes per second issued to the secondary site for Global Mirror during the For SVC, the percentage of writes during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the average rate in writes per second during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the peak read response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the peak write response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the number of additional milliseconds it took to service each secondary write operation for Global Mirror, over and above the time needed to service the primary writes during the By Array report The By Array report (report metrics are shown in Table 5) is for DS8000, DS6000, and ESS only. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 5 By Array report metrics Subsystem Interval Read I/O Rate (normal) Read I/O Rate (sequential) Read I/O Rate (overall) Component ID. Start time of the Length of sample interval in seconds. Average number of normal read operations per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential read operations per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of read operations per second for the Applies to most subsystems. 22 Performance Metrics in TotalStorage Productivity Center Performance Reports

Write I/O Rate (normal) Write I/O Rate (sequential) Write I/O Rate (overall) Total I/O Rate (normal) Total I/O Rate (sequential) Total I/O Rate (overall) Read Cache Hit Percentage (normal) Read Cache Hits Percentage (sequential) Read Cache Hits Percentage (overall) Write Cache Hits Percentage (normal) Write Cache Hits Percentage (sequential) Write Cache Hits Percentage (overall) Total Cache Hits Percentage (normal) Total Cache Hits Percentage (sequential) Total Cache Hits Percentage (overall) Read Data Rate Write Data Rate Total Data Rate Average number of normal write operations per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential write operations per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of write operations per second for the Applies to most subsystems. Average number of normal reads and writes per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential reads and writes per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of reads and writes per second for the Applies to most subsystems. Percentage of normal (random) reads that are cache hits during the Only for ESS, DS8000, and DS6000. Percentage of sequential reads that are cache hits in the Only for ESS, DS8000, and DS6000. Percentage of reads during the sample interval that are found in cache. This is an important metric. Percentage of normal (random) writes that are handled in cache. This number needs to be 100%. Only for ESS, DS8000, and DS6000. Percentage of sequential writes that are handled in cache. This number needs to be 100%. Only for ESS, DS8000, and DS6000. Percentage of writes that are handled in cache. This number needs to be 100% for most enterprise storage. Percentage of normal reads and writes that are cache hits during the Percentage of sequential reads and writes that are cache hits during the Weighted average of read cache hits and write cache hits. Average read data rate in megabytes per second during the Average write data rate in megabytes per second during the Average total (read + write) data rate in megabytes per second during the Performance Metrics in TotalStorage Productivity Center Performance Reports 23

Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Record Mode Read I/O Rate Record Mode Read Cache Hit Percentage Disk to Cache Transfer Rate Cache to Disk Transfer Rate Write-cache Delay Percentage Write-cache Delay I/O Rate Backend Read I/O Rate Backend Write I/O Rate Total Backend I/O Rate Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the Average transfer size in kilobytes for writes during the Average transfer size in kilobytes for all I/O during the This is the rate in I/O/sec for a special kind of read activity detected by ESS, DS8000, and DS6000. Only the requested data is managed in cache rather than a full track or most of a track of data. Read Hit percentage for the special class of reads mentioned above. ESS, DS8000, and DS6000 only. Average number of track transfers per second from disk to cache during the Average number of track transfers per second from cache to disk during the Percentage of all I/O operations that were delayed due to write-cache space constraints or other conditions during the Only writes can be delayed, but the percentage is of all I/O. The rate of I/O (actually writes) that are delayed during the sample interval because of write cache. The average read rate in reads per second caused by read misses. This is the read rate to the backend storage for the The average write rate in writes per second caused by front-end write activity. This is the write rate to the backend storage for the These are logical writes and the actual number of physical I/O operations depends on whether the storage is RAID 5, RAID 10, or some other architecture. The sum of Backend Read I/O Rate and Backend Write I/O Rate over the Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the 24 Performance Metrics in TotalStorage Productivity Center Performance Reports

Backend Read Response Backend Write Response Overall Backend Response Backend Read Transfer Size Backend Write Transfer Size Overall Backend Transfer Size Disk Utilization Percentage Sequential I/O Percentage Average response time in milliseconds for read operations to the backend storage. Average response time in milliseconds for write operations to the backend storage. This time might include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend read and write response times during the The average transfer size in kilobytes for reads to the backend storage during the The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for Backend Reads and Writes during the Average disk utilization during the This is also the utilization of the RAID array, because the activity is uniform across the array. Percentage of the I/O during the sample interval which the storage believes to be sequential. This is detected by the storage algorithms. By Managed Disk Group report The By Managed Disk Group report (report metrics are shown in Table 6) is an SVC specific report. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 6 By Managed Disk Group report Subsystem Managed Disk Group Interval Read I/O Rate (overall) Write I/O Rate (overall) Total I/O Rate (overall) Read Data Rate Write Data Rate Name of the SVC cluster. The Managed Disk Group ID. Start time of the Length of sample interval in seconds. Average number of read operations per second for the Applies to most subsystems. Average number of write operations per second for the Applies to most subsystems. Average number of reads and writes per second for the Applies to most subsystems. Average read data rate in megabytes per second during the Average write data rate in megabytes per second during the Performance Metrics in TotalStorage Productivity Center Performance Reports 25

Total Data Rate Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Backend Read I/O Rate Backend Write I/O Rate Total Backend I/O Rate Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Backend Read Response Backend Write Response Overall Backend Response Read Queue Write Queue Overall Queue Average total (read + write) data rate in megabytes per second during the Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the Average transfer size in kilobytes for writes during the Average transfer size in kilobytes for all I/O during the The average read rate in reads per second caused by read misses. This is the read rate to the backend storage for the The average write rate in writes per second caused by front-end write activity. This is the write rate to the backend storage for the These are logical writes. The sum of Backend Read I/O Rate and Backend Write I/O Rate over the Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the Average response time in milliseconds for read operations to the backend storage. Average response time in milliseconds for write operations to the backend storage. This time might include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend Read and Write Response s during the The average number of milliseconds that each read operation during the sample interval spent on the queue before being issued to the backend storage device. The average number of milliseconds that each write operation during the sample interval spent on the queue before being issued to the backend storage device. The weighted average of Read Queue and Write Queue during the 26 Performance Metrics in TotalStorage Productivity Center Performance Reports

Backend Read Transfer Size Backend Write Transfer Size Overall Backend Transfer Size The average transfer size in kilobytes for reads to the backend storage during the The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for Backend Reads and Writes during the By Volume report The By Volume report is an important report. Its metrics are shown in Table 7. This report is available for all Storage Management Initiative - Specification (SMI-S) compliant subsystems, though not all metrics are applicable to all subsystems. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 7 By Volume report metrics Subsystem Volume Interval Read I/O Rate (normal) Read I/O Rate (sequential) Read I/O Rate (overall) Write I/O Rate (normal) Write I/O Rate (sequential) Write I/O Rate (overall) Total I/O Rate (normal) Subsystem name. Volume ID. Start time of the Length of sample interval in seconds. Average number of normal read operations per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential read operations per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of read operations per second for the Applies to most subsystems. Average number of normal write operations per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Average number of sequential write operations per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of write operations per second for the Applies to most subsystems. Average number of normal reads and writes per second for the Normal operations are not sequential, hence random. This metric is only for ESS, DS8000, and DS6000. Performance Metrics in TotalStorage Productivity Center Performance Reports 27

Total I/O Rate (sequential) Total I/O Rate (overall) Read Cache Hit Percentage (normal) Read Cache Hits Percentage (sequential) Read Cache Hits Percentage (overall) Write Cache Hits Percentage (normal) Write Cache Hits Percentage (sequential) Write Cache Hits Percentage (overall) Total Cache Hits Percentage (normal) Total Cache Hits Percentage (sequential) Total Cache Hits Percentage (overall) Read Data Rate Write Data Rate Total Data Rate Read Response Write Response Overall Response Read Transfer Size Write Transfer Size Overall Transfer Size Average number of sequential reads and writes per second for the Sequential I/O is detected by the subsystem. This metric is only for ESS, DS8000, and DS6000. Average number of reads and writes per second for the Applies to most subsystems. Percentage of normal (random) reads that are cache hits during the Only for ESS, DS8000, and DS6000. Percentage of sequential reads that are cache hits in the Only for ESS, DS8000, and DS6000. Percentage of reads during the sample interval that are found in cache. This is an important metric. Percentage of normal (random) writes that are handled in cache. This number needs to be 100%. Only for ESS, DS8000, and DS6000. Percentage of sequential writes that are handled in cache. This number needs to be 100%. Only for ESS, DS8000, and DS6000. Percentage of writes that are handled in cache. This number needs to be 100% for most enterprise storage. Percentage of normal reads and writes that are cache hits during the Percentage of sequential reads and writes that are cache hits during the Weighted average of read cache hits and write cache hits. Average read data rate in megabytes per second during the Average write data rate in megabytes per second during the Average total (read + write) data rate in megabytes per second during the Average response time in milliseconds for reads during the For this report, this is an average of read hits in cache as well as read misses. Average response time in milliseconds for writes during the Average response time in milliseconds for all I/O in the sample interval, including both cache hits as well as misses to backend storage if required. Average transfer size in kilobytes for reads during the Average transfer size in kilobytes for writes during the Average transfer size in kilobytes for all I/O during the 28 Performance Metrics in TotalStorage Productivity Center Performance Reports

Record Mode Read I/O Rate Record Mode Read Cache Hit Percentage Disk to Cache Transfer Rate Cache to Disk Transfer Rate Write-cache Delay Percentage Write-cache Delay I/O Rate Readahead Percentage of Cache Hits Dirty Write Percentage of Cache Hits Write Cache Overflow Percentage Write Cache Overflow I/O Rate Write Cache Flush-through Percentage Write Cache Flush-through I/O Rate Write Cache Write-through Percentage Write Cache Write-through I/O Rate Global Mirror Write I/O Rate Global Mirror Overlapping Write Percentage Global Mirror Overlapping Write I/O Rate This is the rate in I/O/sec for a special kind of read activity detected by ESS, DS8000, and DS6000. Only the requested data is managed in cache rather than a full track or most of a track of data. Read Hit percentage for the special class of reads mentioned above. ESS, DS8000, and DS6000 only. Average number of track transfers per second from disk to cache during the Average number of track transfers per second from cache to disk during the Percentage of all I/O operations that were delayed due to write-cache space constraints or other conditions during the Only writes can be delayed, but the percentage is of all I/O. The rate of I/O (actually writes) that are delayed during the sample interval because of write cache. For SVC, an obscure measurement of cache hits involving data that has been prestaged for one reason or another. For SVC, the percentage of write cache hits which modified only data that was already marked dirty in the cache; rewritten data. This is an obscure measurement of how effectively writes are coalesced before destaging. For SVC, the percentage of write operations that were delayed due to lack of write-cache space during the For SVC, the average rate per second of write operations that were delayed due to lack of write-cache space during the For SVC, the percentage of write operations that were processed in Flush-through write mode during the sample For SVC, the average rate per second of tracks processed in Flush-through write mode during the For SVC, the percentage of write operations that were processed in Write-through write mode during the sample For SVC, the average number of tracks per second that were processed in Write-through write mode during the For SVC, the rate in writes per second issued to the secondary site for Global Mirror during the For SVC, the percentage of writes during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. For SVC, the average rate in writes per second during the sample interval, for which the write operations at the primary site for Global Mirror have overlapping write domains. Performance Metrics in TotalStorage Productivity Center Performance Reports 29

Peak Read Response Peak Write Response Global Mirror Secondary Write Lag For SVC, the peak read response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the peak write response time in milliseconds observed during the At the end of each sample interval, this value is reset to zero. For SVC, the number of additional milliseconds it took to service each secondary write operation for Global Mirror, over and above the time needed to service the primary writes during the By Managed Disk report The By Managed Disk report is an SVC specific report. The metrics are shown in Table 8. Critical and universal metrics are in bold. Less important metrics are in normal font, and difficult to interpret metrics are in italics. Table 8 By Managed Disk report metrics Subsystem Managed Disk Interval Read I/O Rate (overall) Write I/O Rate (overall) Total I/O Rate (overall) Backend Read Data Rate Backend Write Data Rate Total Backend Data Rate Backend Read Response Backend Write Response Overall Backend Response Name of the SVC. Name of the Managed Disk. Start time of the Length of sample interval in seconds. Average number of read operations per second for the Applies to most subsystems. Average number of write operations per second for the Applies to most subsystems. Average number of reads and writes per second for the Applies to most subsystems. Average number of megabytes per second read from backend storage during the Average number of megabytes per second written to backend storage during the Sum of the Backend Read and Write Data Rates for the Average response time in milliseconds for read operations to the backend storage. Average response time in milliseconds for write operations to the backend storage. This time might include several physical I/O operations, depending on the type of RAID architecture. The weighted average of Backend Read and Write Response s during the 30 Performance Metrics in TotalStorage Productivity Center Performance Reports

Read Queue Write Queue Overall Queue Backend Read Transfer Size Backend Write Transfer Size Overall Backend Transfer Size The average number of milliseconds that each read operation during the sample interval spent on the queue before being issued to the backend storage device. The average number of milliseconds that each write operation during the sample interval spent on the queue before being issued to the backend storage device. The weighted average of Read Queue and Write Queue during the The average transfer size in kilobytes for reads to the backend storage during the The average transfer size in kilobytes for data written to the backend storage during the Weighted average transfer size in kilobytes for Backend Reads and Writes during the By Port report for storage The By Port report for storage metrics are shown in Table 9. Table 9 By Port report for storage metrics Subsystem Port WWPN Interval Port Send I/O Rate Port Receive I/O Rate Total Port I/O Rate Port Send Data Rate Port Receive Data Rate Total Port Data Rate Storage subsystem. Port ID. Worldwide port name (WWPN) for the port. Interval start time. Number of seconds in the Average number of I/O operations per second for send operations, for a particular port during the sample Average number of I/O operations per second for receive operations, for a particular port during the Average number of I/O operations per second for send and receive operations, for a particular port during the Average number of megabytes per second that were transferred for send (server read) operations, for a particular port during the Average number of megabytes per second that were transferred for receive (server write) operations, for a particular port during the Average number of megabytes per second for send and receive operations during the Performance Metrics in TotalStorage Productivity Center Performance Reports 31

Port Send Response Port Receive Response Total Port Response Port Send Transfer Size Port Receive Transfer Size Total Port Transfer Size Port to Host Send I/O Rate Port to Host Receive I/O Rate Total Port to Host I/O Rate Port to Disk Send I/O Rate Port to Disk Receive I/O Rate Total Port to Disk I/O Rate Port to Local Node Send I/O Rate Port to Local Node Receive I/O Rate Total Port to Local Node I/O Rate Port to Remote Node Send I/O Rate Port to Remote Node Receive I/O Rate Total Port to Remote Node I/O Rate Average number of milliseconds that it took to service each send (server read) operation during the sample Average number of milliseconds that it took to service each receive (server write) operation during the sample Average number of milliseconds that it took to service each send and receive operation during the sample Average number of KB sent per I/O by a particular port. Average number of KB received per I/O by a particular port during the Average number of KB transferred per I/O by a particular port during the Average rate per second of port send to host (server) during the Average rate per second of port receive operations from host (server) during the Average of port send and receive I/O rate during the For SVC, the average rate per second of port send to backend storage during the For SVC, the average rate per second of port receive operations from backend storage during the sample For SVC, the average of port to disk send and port to disk receive rates during the For SVC, the average rate per second at which a port sends I/O to other nodes in the local cluster during the For SVC, the average rate at which a port receives I/O from other nodes in the local cluster during the sample For SVC, the average of port to local node send and receive rates during the For SVC, the average number of exchanges (I/Os) per second sent to nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the average number of exchanges (I/Os) per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the average of port to remote node send and receive I/O per second during the 32 Performance Metrics in TotalStorage Productivity Center Performance Reports

Port to Host Send Data Rate Port to Host Receive Data Rate Total Port to Host Data Rate Port to Disk Send Data Rate Port to Disk Receive Data Rate Total Port to Disk Data Rate Port to Local Node Send Data Rate Port to Local Node Receive Data Rate Total Port to Local Node Data Rate Port to Remote Node Send Data Rate Port to Remote Node Receive Data Rate Total Port to Remote Node Data Rate For SVC, the average megabytes per second of port send to host (server) during the For SVC, the average megabytes per second of port receive operations from host (server) during the sample For SVC, average of port send and receive megabytes per second during the For SVC, the average megabytes per second of port send to backend storage during the For SVC, the average megabytes per second of port receive operations from backend storage during the For SVC, the average of port to disk send and port to disk receive megabytes per second during the sample For SVC, the average megabytes per second at which a port sends I/O to other nodes in the local cluster during the For SVC, the average megabytes per second at which a port receives I/O from other nodes in the local cluster during the For SVC, the average of port to local node send and receive megabytes per second during the sample For SVC, the average number of megabytes per second sent to nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the average number of megabytes per second received from nodes in the remote SVC cluster during the Typically, some form of remote mirroring. For SVC, the average of port to remote node send and receive megabytes per second during the sample By Port report for Fabric The metrics in the By Port report for Fabric are shown in Table 10. Note that not all the metrics are supported by all vendor CIM agents. Table 10 By Port: Fabric report metrics Switch Port WWPN Switch name. Port ID. WWPN for the port. Performance Metrics in TotalStorage Productivity Center Performance Reports 33

Interval Port Send Packet Rate Port Receive Packet Rate Total Port Packet Rate Port Send Data Rate Port Receive Data Rate Interval start time. Number of seconds in the Average number of packets per second for send operations, for a particular port during the Average number of packets per second for receive operations, for a particular port during the Average number of packets per second for send and receive operations, for a particular port during the sample Average number of megabytes (2^20 bytes) per second that were transferred for send (write) operations, for a particular port during the Average number of megabytes (2^20 bytes) per second that were transferred for receive (read) operations, for a particular port during the Total Port Data Rate Average number of megabytes (2^20 bytes) per second that were transferred for send and receive operations, for a particular port during the Port Peak Send Data Rate Port Peak Receive Data Rate Port Send Packet Size Port Receive Packet Size Overall Port Packet Size Error Frame Rate Dumped Frame Rate Link Failure Rate Loss of Sync Rate Peak number of megabytes (2^20 bytes) per second that were sent by a particular port during the Peak number of megabytes (2^20 bytes) per second that were received by a particular port during the sample Average number of KB sent per packet by a particular port during the Average number of KB received per packet by a particular port during the Average number of KB transferred per packet by a particular port during the The average number of frames per second that were received in error during the The average number of frames per second that were lost due to a lack of available host buffers during the sample The average number of link errors per second during the The average number of times per second that synchronization was lost during the Loss of Signal Rate The average number of times per second that the signal was lost during the CRC Error Rate The average number of frames received per second in which the cyclic redundancy check (CRC) in the frame did not match the CRC computed by the receiver during the 34 Performance Metrics in TotalStorage Productivity Center Performance Reports

Short Frame Rate Long Frame Rate Encoding Disparity Error Rate Discarded Class3 Frame Rate F-BSY Frame Rate F-RJT Frame Rate The average number of frames received per second that were shorter than 28 octets (24 header + 4 CRC) not including any SOF/EOF bytes during the The average number of frames received per second that were longer than 2140 octets (24 header + 4 CRC + 2112 data) not including any SOF/EOF bytes during the sample The average number of disparity errors received per second during the The average number of class-3 frames per second that were discarded during the The average number of F-BSY frames per second during the The average number of F-RJT frames per second during the About this IBM Redpaper author Mary Lovelace is a Consulting IT Specialist at the International Technical Support Organization. She has more than 20 years of experience with IBM in large systems, storage, and storage networking product education, system engineering and consultancy, and systems support. She has written many IBM Redbooks publications about IBM TotalStorage Productivity Center and z/os storage products. Thanks to Brian Smith for allowing us to convert his materials into this Redpaper and his continued support of this project. Performance Metrics in TotalStorage Productivity Center Performance Reports 35

36 Performance Metrics in TotalStorage Productivity Center Performance Reports

Notices This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-ibm product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing, IBM Corporation, North Castle Drive, Armonk, NY 10504-1785 U.S.A. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to non-ibm Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product and use of those Web sites is at your own risk. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Information concerning non-ibm products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to non-ibm products. Questions on the capabilities of non-ibm products should be addressed to the suppliers of those products. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. COPYRIGHT LICENSE: This information contains sample application programs in source language, which illustrate programming techniques on various operating platforms. You may copy, modify, and distribute these sample programs in any form without payment to IBM, for the purposes of developing, using, marketing or distributing application programs conforming to the application programming interface for the operating platform for which the sample programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. Copyright International Business Machines Corporation 2008. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. 37

Send us your comments in one of the following ways: Use the online Contact us review Redbooks form found at: ibm.com/redbooks Send your comments in an email to: redbooks@us.ibm.com Mail your comments to: IBM Corporation, International Technical Support Organization Dept. HYTD Mail Station P099 2455 South Road Poughkeepsie, NY 12601-5400 U.S.A. Redpaper Trademarks The following terms are trademarks of the International Business Machines Corporation in the United States, other countries, or both: Redbooks (logo) z/os DS6000 DS8000 IBM Redbooks TotalStorage Other company, product, or service names may be trademarks or service marks of others. 38 Performance Metrics in TotalStorage Productivity Center Performance Reports