CIT 470: Advanced Network and System Administration. Topics. Performance Monitoring. Performance Monitoring



Similar documents
CIT 668: System Architecture. Performance Testing

Topics. CIT 470: Advanced Network and System Administration. Why Monitoring? Why Monitoring? Historical Monitoring Processes. Historical Monitoring

Facultat d'informàtica de Barcelona Univ. Politècnica de Catalunya. Administració de Sistemes Operatius. System monitoring

CIT 470: Advanced Network and System Administration. Topics. Why Monitoring? System Monitoring

Extreme Linux Performance Monitoring Part II

20 Command Line Tools to Monitor Linux Performance

These sub-systems are all highly dependent on each other. Any one of them with high utilization can easily cause problems in the other.

Performance monitoring. in the GNU/Linux environment. Linux is like a wigwam - no Windows, no Gates, Apache inside!

Release February 2005

Optimizing Linux Performance

IBM Tivoli Monitoring Version 6.3 Fix Pack 2. Infrastructure Management Dashboards for Servers Reference

System Resources. To keep your system in optimum shape, you need to be CHAPTER 16. System-Monitoring Tools IN THIS CHAPTER. Console-Based Monitoring

System performance monitoring in RTMT

Windows Server Performance Monitoring

Best Practices for Deploying & Tuning Oracle Database 12c on RHEL6

About Forum Howtos & FAQs Low graphics Shell Scripts RSS/Feed. nixcraft - insight into linux admin work Home > CentOS. by Vivek Gite 134 comments

Release February 2005

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

OS Thread Monitoring for DB2 Server

CS 377: Operating Systems. Outline. A review of what you ve learned, and how it applies to a real operating system. Lecture 25 - Linux Case Study

TOP(1) Linux User s Manual TOP(1)

CPSC 2800 Linux Hands-on Lab #7 on Linux Utilities. Project 7-1

VIRTUALIZATION AND CPU WAIT TIMES IN A LINUX GUEST ENVIRONMENT

Operating System and Process Monitoring Tools

System Administration

Determining the Correct Usage of Swap in Linux * 2.6 Kernels

Operating Systems Concepts: Chapter 7: Scheduling Strategies

Operating Systems. Design and Implementation. Andrew S. Tanenbaum Melanie Rieback Arno Bakker. Vrije Universiteit Amsterdam

Outline. Operating Systems Design and Implementation. Chap 1 - Overview. What is an OS? 28/10/2014. Introduction

Server Management Tools (ASMT)

Storage Performance Testing

Internet Protocols Fall Lectures 7-8 Andreas Terzis

CIT 470: Advanced Network and System Administration. Topics. Help Desk Life. Help Desks

PERFORMANCE TUNING ORACLE RAC ON LINUX

The System Monitor Handbook. Chris Schlaeger John Tapsell Chris Schlaeger Tobias Koenig

Linux Tools for Monitoring and Performance. Khalid Baheyeldin November 2009 KWLUG

Understanding Performance Monitoring

Improving the Database Logging Performance of the Snort Network Intrusion Detection Sensor

How to analyse your system to optimise performance and throughput in IIBv9

Managing your Domino Clusters

CIT 470: Advanced Network and System Administration. Topics. Change Management. Change and Configuration Management

Chapter 10: System monitoring and logging. Chapter 10 System monitoring and logging

Network Attached Storage. Jinfeng Yang Oct/19/2015

Performance Tuning and Optimization for high traffic Drupal sites. Khalid Baheyeldin Drupal Camp, Toronto May 11 12, 2007

SAS Application Performance Monitoring for UNIX

Storage benchmarking cookbook

Capacity planning for IBM Power Systems using LPAR2RRD.

Do it Yourself System Administration

Audit & Tune Deliverables

Scheduling. Yücel Saygın. These slides are based on your text book and on the slides prepared by Andrew S. Tanenbaum

Help! My system is slow!

Allocating Network Bandwidth to Match Business Priorities

Tuning WebSphere Application Server ND 7.0. Royal Cyber Inc.

Topics. CIT 470: Advanced Network and System Administration. Help Desk Life. Types of Help Desks. Scope of Coverage. Help Desk Size.

CIT 470: Advanced Network and System Administration. Topics. Help Desk Life. Help Desks

Web Application s Performance Testing

Popular Unix Performance- Monitoring Tools for Linux CHAPTER

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?

Perfmon counters for Enterprise MOSS

Tech Tip: Understanding Server Memory Counters

The Big Picture. Cache Memory CSE Memory Hierarchy (1/3) Disk

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

Question: 3 When using Application Intelligence, Server Time may be defined as.

Understanding Linux on z/vm Steal Time

The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000

Performance Testing. Configuration Parameters for Performance Testing

Performance Analysis Methods ESX Server 3

SiteCelerate white paper

The IntelliMagic White Paper on: Storage Performance Analysis for an IBM San Volume Controller (SVC) (IBM V7000)

USTM16 Linux System Administration

OPERATING SYSTEMS SCHEDULING

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices

Deploying and Optimizing SQL Server for Virtual Machines

The Self-Tuning Memory Manager (STMM): A Technical White Paper. Authors: Christian Garcia-Arellano Adam Storm Colin Taylor

Storage at a Distance; Using RoCE as a WAN Transport

WEB SERVER MONITORING SORIN POPA

APPENDIX 1 USER LEVEL IMPLEMENTATION OF PPATPAN IN LINUX SYSTEM

What is RAID? data reliability with performance

Introduction. Application Performance in the QLinux Multimedia Operating System. Solution: QLinux. Introduction. Outline. QLinux Design Principles

Chapter 11 I/O Management and Disk Scheduling

Informix Performance Tuning using: SQLTrace, Remote DBA Monitoring and Yellowfin BI by Lester Knutsen and Mike Walker! Webcast on July 2, 2013!

The Moab Scheduler. Dan Mazur, McGill HPC Aug 23, 2013

How To Analyze Performance From A Genetic Algorithm

Operating Systems, 6 th ed. Test Bank Chapter 7

Performance and Tuning Guide. SAP Sybase IQ 16.0

TECHNICAL NOTE. Technical Note P/N REV 03. EMC NetWorker Simplifying firewall port requirements with NSR tunnel Release 8.

Job Aid - Avaya Aura Messaging. Purpose. Restarting software and server. Stopping. Procedure

High-performance Linux cluster monitoring using Java

This presentation explains how to monitor memory consumption of DataStage processes during run time.

Introduction 1 Performance on Hosted Server 1. Benchmarks 2. System Requirements 7 Load Balancing 7

Stop the Guessing. Performance Methodologies for Production Systems. Brendan Gregg. Lead Performance Engineer, Joyent. Wednesday, June 19, 13

Monitoring IBM HMC Server. eg Enterprise v6

Tushar Joshi Turtle Networks Ltd

Monitoring Unix and Windows Servers

Enabling NetFlow on Virtual Switches ESX Server 3.5

Agenda. Capacity Planning practical view CPU Capacity Planning LPAR2RRD LPAR2RRD. Discussion. Premium features Future

Transcription:

CIT 470: Advanced Network and System Administration Performance Monitoring CIT 470: Advanced Network and System Administration Slide #1 Topics 1. Performance monitoring. 2. Performance tuning. 3. CPU 4. Memory 5. Disk 6. Network CIT 470: Advanced Network and System Administration Slide #2 Performance Monitoring Identify which aspect of performance Latency: delay until initial access. Throughput: rate of transfer/processing. Identify which system component CPU Memory Disk Network CIT 470: Advanced Network and System Administration Slide #3 1

Performance Tuning Process 1. Learn the customer s problem. Identify specifically what s wrong. 2. Find the problem s cause and fix it. 1. When does the problem occur? 2. Has anything about the system changed? 3. What critical resource is affecting performance? 3. Have the right tools. Historical monitoring data will show what s normal and identify any trends. CIT 470: Advanced Network and System Administration Slide #4 Experimenter Effect Monitoring the system affects performance. Monitoring tools use system resources. If you ve consistently monitored system, then monitoring won t alter system performance. CIT 470: Advanced Network and System Administration Slide #5 Performance Problem Solutions 1. Get more of needed resource. Ex: Upgrade processor, use striped disk array. 2. Reduce system requirements. Ex: Kill processes, move services to other hosts. 3. Eliminate inefficiency and waste. Ex: Produce a static home page every 15 minutes instead of regenerating each access. 4. Ration resource usage. Ex: Set process priorities with renice. Ex: Limit process resource usage with limit. CIT 470: Advanced Network and System Administration Slide #6 2

uptime Monitoring Processes Provides aggregate data about system load. ps Shows running processes with CPU, mem usage. top Updated list of running processes + summaries. vmstat Summary data about processes and CPU usage. CIT 470: Advanced Network and System Administration Slide #7 Uptime Uptime provides the following data How long system has been running. Number of users logged in. Average number of runnable processes. In last 1, 5, 15 minutes. Want a load average under 3. Uptime example > uptime 17:40 up 126 days, 8:03, 6 users, load average: 1.40, 1.03, 0.55 CIT 470: Advanced Network and System Administration Slide #8 vmstat Number of Runnable and Blocked processes. Memory (virtual, free, buffered, cached) Blocks/second transferred in (bi) and out (bo) Interrupts/sec (in) and context switches/sec (cs) CPU usage by user, system, idle, and waiting. > vmstat 5 4 procs -----------memory---------- ---swap-- -----io---- --system-- ----cpu---- r b swpd free buff cache si so bi bo in cs us sy id wa 0 0 395716 45176 211284 88480 0 0 1 2 1 2 9 3 88 0 0 0 395716 45168 211300 88480 0 0 0 50 1035 1677 0 0 100 0 0 0 395716 45168 211300 88480 0 0 0 0 1040 1670 0 0 99 0 0 0 395716 45168 211300 88480 0 0 0 0 1033 1660 0 0 100 0 CIT 470: Advanced Network and System Administration Slide #9 3

Identifying CPU Shortages 1. Short-term CPU spikes are normal. 2. Consistently high number of runnable processes (r) in vmstat. 3. Consistent high total CPU usage (sy+us). 4. High system time compared to user time and high context switches indicates system is thrashing between processes instead of doing user work. CIT 470: Advanced Network and System Administration Slide #10 Changing Process Priorities Nice values Positive values lower priorities. Negative values increase priorities. If you know a process will be a CPU hog, nice +5 command_name If you detect a CPU hog after it s started, renice 5 PID CIT 470: Advanced Network and System Administration Slide #11 Managing Processes with kill TERM (default) Terminates process execution (Ctrl-c). Processes can catch or ignore signal. KILL (9) Terminates process execution. Processes cannot catch or ignore. Processes waiting on I/O will not die. STOP Suspends process execution until SIGCONT (Ctrl-z). Useful for moving CPU hog out of way temporarily. CIT 470: Advanced Network and System Administration Slide #12 4

Imposing Limits on Processes CPU time Maximum file size Maximum data segment Maximum stack size Maximum physical mem Maximum core size Maximum number procs Maximum virtual mem ulimit t secs ulimit f KB ulimit d KB ulimit s KB ulimit m KB ulimit c KB ulimit u n ulimit v KB CIT 470: Advanced Network and System Administration Slide #13 Monitoring Memory Use free to see how memory is used. System will use most free memory for caching. System will swap out inactive processes. Don t worry until free < 5% of total memory. Use vmstat to detect paging activity. Page out (so) rate greater than 0 consistently. High page in (si) rate, as system uses the paging facility to load programs into memory. CIT 470: Advanced Network and System Administration Slide #14 Managing Memory 1. Improving paging capacity. Add new swapfiles with swapon. Add new swap partitions. 2. Improving paging performance. Use swap partitions instead of swap files. Distribute swap resources across disks. 3. Migrate memory hogs to another host. 4. Add more memory. CIT 470: Advanced Network and System Administration Slide #15 5

Monitoring Disk I/O Use iostat to get per disk statistics. Transactions per second (tps). Blocks read/written per second. Managing disk performance problems. Distribute heavily used data across disks/ctrlers. Get more or faster disks. Use RAID or LVM striping. CIT 470: Advanced Network and System Administration Slide #16 iostat > iostat 2 Linux 2.6.15-23-386 (zim) 03/26/2007 avg-cpu: %user %nice %system %iowait %steal %idle 8.55 0.18 3.22 0.09 0.00 87.96 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn hde 0.69 8.18 9.43 89783416 103565744 hdh 0.15 1.33 3.37 14590831 36969599 hdc 0.00 0.00 0.00 9548 0 avg-cpu: %user %nice %system %iowait %steal %idle 0.17 0.00 0.17 0.00 0.00 99.67 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn hde 0.33 0.00 21.33 0 128 hdh 0.00 0.00 0.00 0 0 hdc 0.00 0.00 0.00 0 0 CIT 470: Advanced Network and System Administration Slide #17 Managing Disk Capacity Detecting disk resource usage. List all partition usage with df h Identify high usage directories with du Summary data: du s Highest usage directories: du -k / sort rn Use find to detect disk hogs. Use find size to search for big files. Use atime +X to identify files that haven t been used in X days. CIT 470: Advanced Network and System Administration Slide #18 6

Managing Disk Shortages 1. Add more disks. 2. Move files to remote fileservers. 3. Eliminate unnecessary files. 4. Compress large infrequently used files. 5. Impose disk quotas on users. Soft limit: can be violated temporarily. Hard limit: cannot be violated. CIT 470: Advanced Network and System Administration Slide #19 Network Statistics > netstat -s Tcp: 294750 active connections openings 18042 passive connection openings 9 failed connection attempts 6195 connection resets received 5 connections established 90553783 segments received 90005258 segments send out 16483 segments retransmited 1389 bad segments received. 15620 resets sent Ip: 91081007 total packets received 6 with invalid headers 28 with invalid addresses 0 forwarded 0 incoming packets discarded 91080973 incoming packets delivered 90418413 requests sent out Udp: 270975 packets received 336 packets to unknown port received. 6 packet receive errors 324228 packets sent CIT 470: Advanced Network and System Administration Slide #20 References 1. Mark Burgess, Principles of System and Network Administration, Wiley, 2000. 2. Aeleen Frisch, Essential System Administration, 3 rd edition, O Reilly, 2002. 3. Mike Loukides and Gian-Paolo D. Musumeci, System Performance Tuning, 2 nd edition, O Reilly, 2003. 4. Evi Nemeth et al, UNIX System Administration Handbook, 3 rd edition, Prentice Hall, 2001. CIT 470: Advanced Network and System Administration Slide #21 7