Oracle Database Capacity Planning



Similar documents
Did someone say the word 'Capacity?

Lesson 4 Measures of Central Tendency

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

2. Filling Data Gaps, Data validation & Descriptive Statistics

Descriptive Statistics

Testing Research and Statistical Hypotheses

Oracle USF

Performance Forecasting - Introduction -

DATABASE ADMINISTRATION (DBA) SERVICES

Oracle Database 12c: Performance Management and Tuning NEW

How To Model A System

THEME: T-ACCOUNTS. By John W. Day, MBA. ACCOUNTING TERM: T-Account

Capacity Analysis Techniques Applied to VMware VMs (aka When is a Server not really a Server?)

Fingerprinting the datacenter: automated classification of performance crises

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

MS SQL Performance (Tuning) Best Practices:

Predictive Analytics And IT Service Management

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

SanDisk Lab Validation: VMware vsphere Swap-to-Host Cache on SanDisk SSDs

Programa de Actualización Profesional ACTI Oracle Database 11g: SQL Tuning Workshop

There are a number of factors that increase the risk of performance problems in complex computer and software systems, such as e-commerce systems.

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Oracle Database 10g. Page # The Self-Managing Database. Agenda. Benoit Dageville Oracle Corporation benoit.dageville@oracle.com

Measurement with Ratios

Myth or Fact: The Diminishing Marginal Returns of Variable Creation in Data Mining Solutions

PULSE. T24 Monitoring Suite

Case Study I: A Database Service

MS EXCHANGE SERVER ACCELERATION IN VMWARE ENVIRONMENTS WITH SANRAD VXL

Means, standard deviations and. and standard errors

Web Server Software Architectures

EMC Business Continuity for Microsoft SQL Server Enabled by SQL DB Mirroring Celerra Unified Storage Platforms Using iscsi

Oracle Database 11g: SQL Tuning Workshop

Rackspace Cloud Databases and Container-based Virtualization

MS-40074: Microsoft SQL Server 2014 for Oracle DBAs

Session 7 Bivariate Data and Analysis

Monitoring HP OO 10. Overview. Available Tools. HP OO Community Guides

BLACKBOARD LEARN TM AND VIRTUALIZATION Anand Gopinath, Software Performance Engineer, Blackboard Inc. Nakisa Shafiee, Senior Software Performance

Monte Carlo analysis used for Contingency estimating.

Point and Interval Estimates

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

Capacity Planning Use Case: Mobile SMS How one mobile operator uses BMC Capacity Management to avoid problems with a major revenue stream

Consolidate by Migrating Your Databases to Oracle Database 11g. Fred Louis Enterprise Architect

Capacity Planning for Hyper-V. Using Sumerian Capacity Planner

Benford s Law and Digital Frequency Analysis

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Capacity Management for Oracle Database Machine Exadata v2

Measures of WFM Team Success

Oracle Enterprise Manager 12c New Capabilities for the DBA. Charlie Garry, Director, Product Management Oracle Server Technologies

WHITE PAPER. SQL Server License Reduction with PernixData FVP Software

OEM s SQL Monitoring Feature. Ken Gottry 5-May-2014

Advanced analytics at your hands

Performance And Scalability In Oracle9i And SQL Server 2000

Personal Financial Literacy

Performance Tuning with Oracle Enterprise Manager Session # S300610

03 The full syllabus. 03 The full syllabus continued. For more information visit PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS

About PivotTable reports

Intelligent Business Operations

Microsoft SQL Server versus IBM DB2 Comparison Document (ver 1) A detailed Technical Comparison between Microsoft SQL Server and IBM DB2

Virtualization. as a key enabler for Cloud OS vision. Vasily Malanin Datacenter Product Management Lead Microsoft APAC

ORACLE ENTERPRISE MANAGER 10 g CONFIGURATION MANAGEMENT PACK FOR ORACLE DATABASE

Introduction to Per Core Licensing and Basic Definitions

Lean Six Sigma Analyze Phase Introduction. TECH QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Cornell Note Taking System (For Lecture or Reading)

CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS

Critical Database. Oracle Enterprise Manager Oracle Open World 2010 Presented dby Venkat Tekkalur. Prem Venkatasamy. Principal Technical Architect

What Is Singapore Math?

Excel 2003 PivotTables Summarizing, Analyzing, and Presenting Your Data

1 Descriptive statistics: mode, mean and median

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

Oracle 11g is by far the most robust database software on the market

Is there any alternative to Exadata X5? March 2015

Using Excel for inferential statistics

Standard Deviation Estimator

CHAPTER 1: SPREADSHEET BASICS. AMZN Stock Prices Date Price

1

Everything you wanted to know about using Hexadecimal and Octal Numbers in Visual Basic 6

VMWare Workstation 11 Installation MICROSOFT WINDOWS SERVER 2008 R2 STANDARD ENTERPRISE ED.

Using Database Diagnostic and Tuning Packs through Oracle Enterprise Manager 12c. Eric Siglin OCM, OCP, CTT+ Senior Oracle DBA

One-Minute Spotlight. The Crystal Ball Forecast Chart

Simple Inventory Management

Wait-Time Analysis Method: New Best Practice for Performance Management

Providing Self-Service, Life-cycle Management for Databases with VMware vfabric Data Director

Managing Capacity Using VMware vcenter CapacityIQ TECHNICAL WHITE PAPER

Contact Center Math:

OPEN LESSON SAMPLE LESSONS FOR THE CLASSROOM FROM LAYING THE FOUNDATION

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Publishing papers in international journals

A cure for Virtual Insanity: A vendor-neutral introduction to virtualization without the hype

WHITE PAPER. Five Steps to Better Application Monitoring and Troubleshooting

Product Review: James F. Koopmann Pine Horse, Inc. Quest Software s Foglight Performance Analysis for Oracle

SQL Sentry Essentials

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

PTC System Monitor Solution Training

OUTLIER ANALYSIS. Data Mining 1

Transcription:

AIOUG Tech Day @ Pune Date: 28 th July, 2012 Oracle Database Capacity Planning How to Scientifically start doing Capacity Planning for an Oracle Database 1

About Me 8 years of experience in Oracle Database Performance Optimization and IT Services/Infrastructure Capacity Planning. Certified Oracle DBA OCP 9i, 10g, 11g, RAC Expert Other Certifications ISO/20000 Certified Auditor, ITIL Practitioner V3, Base SAS Currently working with Barclays Bank PLC as a Capacity Manager When time allows I blog at http://neerajbhatia.wordpress.com/ 2

What is Capacity Planning Capacity Planning is the process of predicting when future load levels will saturate the system and determining the most costeffecting way of delaying system saturation as much as possible - by Daniel A. Menasce and Virgilio A.F. Almeida (Authors of Capacity Planning for Web Services: Metrics, Models & Methods)

Systematic Approach to Capacity Planning 1. State Goals 2. Define the System Configuration 3. List Database Services 4. Identify Right Metrics 5. Collect Data 6. Analyze and Model Data 7. Interpret the Data and Results 8. Present Results 4

Identifying Application Metrics Users of Business Services Business Transactions (ATM Cash withdrawal etc) IT Service (ATM Service) Talk with Business and application teams to identify good set of metrics by which growth and workload of business is measured. Examples Total ATM transactions, Active Users, total login on an online bookstore etc IT Service Components (Application server, Database Application) Underlying Database workload metrics transactions/sec, user calls/sec etc IT Infrastructure (Server, Network components etc) CPU Utilization, Memory Utilization 5

The Future lies in the History "I have seen the future and it is very much like the present, only longer." --Kehlog Albran, The Profit This philosophy is actually a concise description of statistical forecasting. We search for statistical properties of a time series that are constant in time - trends, seasonal patterns, correlations etc. We then predict that these properties will also describe the future. 6

Data History, History, History!!! The precision of forecasted results lies in quantity of historical data; longer the data more the chances of precise results. In small shops AWR schema is the best place to keep the data. Big enterprises generally upload the data to CMIS. Data retention value should be large enough to keep seasonal trend, periodic activities. Retention value of 13 months is recommended in most of the cases. CMIS Database metrics pertaining to Capacity and Performance from AWR Schema and v$sysstat etc Other data in CMIS Business Metrics Future Business Demand Resource Utilization Data Capacity Plans and Reports 7

Time for some Statistics recap Most of the times data show a tendency to group around a central point and this single typical value can be used to describe the entire data set. These measures are Arithmetic Mean and the Median. The Arithmetic Mean (aka Average) is most commonly used metric to summarize the numeric data. It is calculated by summing all the values and then dividing the total number of observations involved. Arithmetic Mean are vulnerable to the extreme points, which are known as Outliers. Let s take an example of hourly CPU Utilization for 6-hours window as: 28%, 31%, 30%, 32%, 95% and 29% Mean CPU Utilization = 41% Eliminating 95% as an Outlier, Mean CPU Utilization = 30% Because it is based on every observation, the arithmetic mean is significantly affected by Outliers. In such cases, only reporting the arithmetic mean may present a distorted representation of what the data are conveying. 8

Median & Percentile The Median is the middle value in the ordered set of data. In case of even number of observations, the Median is calculated by taking the average of two middle observations. Consider our previous example of hourly CPU Utilization, 28%, 29%, 30%, 31%, 32%, 95% (Data sorted in ascending order) Median = Average (30%, 31%) = 30.5% The Median is unaffected by any extreme observations. When summarizing a set of data that has Outliers, you should report the median or better both the mean and median. A Percentile is the value below which a certain percent of observations fall. For example, the 70th percentile is the value below which 70 percent of the observations may be found. Percentile in Oracle Database environment can be used to set Adaptive thresholds for Performance/Workload metrics, analyze and summarize database performance data. MS Excel has inbuilt formula PERCENTILE(Data Array, k) to calculate k th percentile of values. For example to calculate 90 th percentile, use PERCENTILE(Data Array, 0.9) 9

Standard Deviation and Empirical Rule Although the arithmetic Mean give a central point around which all the values tend to cluster, but it doesn t give us a clue about the variation in the data around the mean. To evaluate the fluctuation around the mean a statistic measure called Standard Deviation is used. MS Excel has inbuilt formula STDEV(data array) to quickly calculate the Standard deviation for a given set of numbers. The Standard Deviation measure the average scatter around the mean some observations are larger than the mean while some of them can be lower. The Empirical Rule of statistics states that: About 68.27% of the values lie within 1 standard deviation of the mean. Similarly, 90% of the values lie within 1.645 standard deviations of the mean. Nearly all (95%) of the values lie within 1.960 standard deviations of the mean. Thus the arithmetic mean and the standard deviation usually helps define where majority of the data values are clustering. 10

Little Law s Formulas Service Time Arrival Rate Utilization of a server Number of CPUs Can also re-written as: 11

Case Study: DB Capacity Planning Problem Statement: You are an Oracle DBA. The Business wants to know how much additional workload current database can support AND how much additional capacity (if any) would be required to support double the workload (around 10% per month). Step-1 State Goal: How much load of some particular business activity database can support, before running out of gas? If the workload grows by 100% (around 10% every month), when we will need to add more capacity to the system? Step-2: Define the System Configuration Oracle Database 11.2 running with 8 CPUs and 16 Gigs memory Step-3: List Database Services Database is primarily part of an Order management system in an OLTP environment.

Case Study: DB Capacity Planning Step-4 Identify Right Metrics You had discussion with Business to understand how they measure business growth and came to know that #orders and #order lines give a clear picture to them. Then you discuss with application team to understand which columns in the application tables hold this data. Step-5: Collect Data Further you have agreed with relevant application teams to have these metrics on an hourly basis. You collect CPU utilization data from any monitoring tool installed on the server, native OS tool or from AWR tables ( Host CPU utilization in DBA_HIST_SYSMETRIC_SUMMARY). Step-6: Analyze and Model Data You correlate #Order and #Order lines with the host CPU utilization to see which Application metric is driving the CPU utilization on underlying database. Avg CPU can be explained by the Order Lines/hour 58.71 % of the time.

Case Study: DB Capacity Planning For example - Service Time = (68.37% * 8) / 976 = 0.00560369

Case Study: DB Capacity Planning 70 th Percentile Arrival Rate

Summarizing & Reporting the Findings Effecting Capacity Planning involves not only analyzing the data but also summarizing the key findings and presenting it to the relevant people. Step-8: Present Results Current Database Capacity can support 35% additional order lines workload which means around we are safe for next 3-4 months. We would need 4 additional CPUs to support double Order lines workload. 16

Capacity Planning Tools A tool can help you but that's not the end of the world. For big enterprise environment a sophisticated tool is recommended, which can ease your life. For small shops, MS Excel is a low-cost solution for Performance Visualization, Capacity Planning and Data Analysis. 17

Quotation "I hear and I forget. I see and I remember. I do and I understand." -- Chinese Proverb The best way to learn a subject is to apply the concepts to a real system. The techniques presented in this presentation may appear simple on the surface, their applications to real world may offer a different experience. 18

References Additional Resources: My blog - http://neerajbhatia.wordpress.com/ Email - neeraj.dba@gmail.com