Plotting on Weibull Paper

Size: px
Start display at page:

Download "Plotting on Weibull Paper"

Transcription

1 Plotting on Weibull Paper Fritz Scholz Research and Technology Boeing Information & Support Services August 23, 1996 Abstract This report explains Weibull plotting and its rationale It shows how the two Weibull parameter estimates are easily read off from the Weibull plot The use of Weibull plotting is introduced first in the context of complete samples and then extended to two common forms of censoring: type I or multiple censoring and type II censoring Two blank Weibull plotting templates are provided, one for a two cycle log 10 scale and the other for three cycle log 10 scale on the abscissa The use of a Weibull plot as a diagnostic tool for checking the Weibull assumption underlying the sample is examined critically For small sample sizes, answers, if obtained via this tool, should be taken with caution, since the variability of such Weibull samples is still substantial

2 1 Introduction In characterizing the distribution of life lengths or failure times of certain devices one often employs the Weibull distribution This is mainly due to its weakest link properties, but other reasons are its increasing 1 failure rate with device age and the variety of distribution shapes that the Weibull density offers The increasing failure rate accounts to some extent for fatigue failures Weibull plotting is a graphical method for informally checking on the assumption of Weibull distribution model and also for estimating the two Weibull parameters The method of Weibull plotting is explained and illustrated both for complete samples of failure times as well as for censored samples In the latter case we consider either type I or multiply censored data or type II censored data Type I or multiple censoring occurs typically in field data where either the failure times of devices are observed or their last running time is known, ie, the last time these devices were checked they were still functioning properly The term multiple censoring instead of type I censoring is motivated by the multiple time points at which some censoring event takes place and prevents the observation of the complete life time The censoring time points are extraneous events and independent of the actual device failure times, whether these are observed or not This is in contrast to type II censoring This occurs mostly under laboratory conditions when all n devices are put on test at the same time and one observes the failure times of the first r devices, with the remaining n r devices still running successfully This kind of censoring is useful when one cannot wait until all devices have failed, but wants to guarantee at least so many failures Here the censoring time is not independent of the failure times, since it coincides with the r th smallest failure time It is assumed that the two-parameter Weibull distribution is a reasonable model for describing the variability in the failure time data If T represents the generic failure time of a device, then the distribution function of T is given by ( [ ] ) t β F T (t) =P (T t) =1 exp for t 0 α The parameter α is called the scale parameter or characteristic life The latter term is motivated by the property F T (α) =1 exp( 1) 632, regardless of the shape parameter β There are many ways for estimating the parameters α and β from complete or censored data One of the simplest is through the method of Weibull plotting, which is very popular due to its simplicity, graphical appeal, and its informal check on the Weibull model assumption 1 for Weibull shape parameter β>1 2

3 2 Weibull Plotting and its Basis The basic idea behind Weibull plotting is the relationship between the p-quantiles t p of the Weibull distribution and p for 0 <p<1 The p-quantile t p is defined by the following property ( [ ] ) β tp p = F T (t p )=P(T t p )=1 exp α which leads to or taking decimal logs 2 on both sides t p = α [ log e (1 p)] 1/β y p =log 10 (t p )=log 10 (α)+ 1 β log 10 [ log e (1 p)] (1) Thus log 10 (t p ), when plotted against w(p) =log 10 [ log e (1 p)] should follow a straight line pattern with intercept log 10 (α) andslope1/β Plotting w(p) against y p =log 10 (t p ), as is usually done in a Weibull plot, one should see the following linear relationship w(p) =β [log 10 (t p ) log 10 (α)] (2) with slope β and abscissa intercept log 10 (α) In place of the unknown log 10 -quantiles log 10 (t p ) one uses the corresponding sample quantiles For a complete sample, T 1,,T n, these are obtained by ordering these T i from smallest to largest to get T (1) T (n) and then associate with p i = i/(n +1) the p i -quantile estimate or i th sample quantile T (i) These sample quantiles tend to vary around the respective population quantiles t pi For large sample sample sizes and for p i = i/(n +1) p with 0 <p<1 this variation diminishes (ie, the sample quantile T (i) converges to t p in a sense not made precise here) For p i close to 0 or 1 the sample quantiles T (i) may (or may not) exhibit high variability even in large samples Thus one has to be careful in interpreting extreme sample values in Weibull plots The idea of Weibull plotting for a complete sample is to plot w(p i )=log 10 [ log e (1 p i )] against log 10 (T (i) ) Aside from the variation of the T (i) around t pi one should, according to equation (2), then see a roughly linear pattern This plotting is facilitated by Weibull paper with a log 10 -transformed abscissa with untransformed labels and a transformed ordinate scale given by w(p) =log 10 [ log e (1 p)] 2 The explicit notation log 10 and log e is used to distinguish decimal and natural logs 3

4 with labels in terms of p Sometimes this scale is labeled in percent ( ie, in terms of 100p%) Two blank samples of such Weibull probability paper are given as the first two figures in Appendix A, although they are not labeled as Figure 1 or Figure 2 Figure 1 has two log 10 cycles on the abscissa, facilitating plotting of failure times over two orders of magnitudes and Figure 2 has three cycles on the abscissa, facilitating plotting of failure times over three orders of magnitudes For each plotting point (log 10 (T (i) ),w(p i )) one locates or interpolates the label value of T (i) on the abscissa and the value p i on the ordinate, ie, there is no need for the user to perform the transformations log 10 (T (i) )andw(p i )=log 10 [ log e (1 p i )] An example of a complete sample plotted on Weibull paper is given in Figure 3 of Appendix A Figure 3 shows three lines The dashed line represents the true line corresponding to the Weibull distribution from which the ten values were sampled The other two lines represent a least squares fit (the formulas for which will be given later) and the other corresponds to maximum likelihood estimates (mle) of α and β The process of finding the mle s is complicated and usually accomplished through software Note how susceptible the least squares fit is to the two lower extreme values This is not surprising since the method of least squares, as applied here, treats all data equally It does not know that the data come from a Weibull distribution and represent ordered and thus correlated values The method of maximum likelihood employs the fact that the data come from a Weibull model and knows how to properly weigh the various observations, ie, stragglers as they show up in Figure 3 will not be given undue influence The two blank specimens of Weibull probability paper in Appendix A cover two and three orders of magnitude on the abscissa, namely from 1 to 100 or from 1 to 1000 If the observed life times cover a range from 50 to 4000, one can simply change time units to tens and use the three log 10 cycle paper from 5 to 400, which fits If the ranges are very large, one may have to use Weibull paper covering more orders of magnitude However, there is a simple transformation device around that difficulty It is based on the following power transformation property of the Weibull distribution If T W(α, β) (ie, T has a Weibull distribution with parameters α and β), then T a W(α a,β/a)=w(α,β ) Thus one can always bring the scale of the failure times up or down into the proper range by an appropriate power transformation By eye or by more formal least squares methods one can then fit a line through the plotted points and take it as a proxy or estimate of the true line representing the unknown linear relationship (2) between the quantiles y p and w(p) Here the least squares fit will be 4

5 quite sensitive to variability in the extreme sample values Trying allow for that in fitting by eye will be somewhat subjective Since p =0632 yields w(p) =0orlog 10 (T ) log 10 (α) =0onecanreadoffanestimate of α from the abscissa scale where the fitted line intercepts the ordinate level 0632 For this purpose Weibull paper shows a level line at the ordinate 0632 On the right side scale of the Weibull probability paper one finds the zero value resulting from the transform w(0632) = 0 The scale to the left of the ordinate scale runs from zero to ten and is a nomographic device for reading off the estimated shape parameter associated with the line fitted to the plotted data To obtain it one draws a line parallel to the fitted line going through the solid dot to the right of that shape scale Appendix A contains several examples illustrating this process Some authors suggest to use of p i =(i 3)/(n + 4) in place of p i to characterize the p for which T (i) is the p-quantile For large n there is little difference between the two methods and for small n the inherent variability in Weibull samples makes a preference between the two methods somewhat questionable 3 Plotting with Type I Censoring Under this type of censoring the censoring times and failure times typically intermingle and it is no longer quite so clear for which quantile to consider each failure time as a quantile estimate There are various schemes of dealing with this problem The one presented here is the Herd-Johnson method as given in Nelson (1982) One orders all observations, censored or uncensored, from smallest to largest and one ranks them in reverse order If T (1) T (n) are the ordered observations, their corresponding reverse ranks are (r 1,,r n ) = (n,,1) For the i th failure one computes recursively the reliability R i = r (i) r (i) +1 R i 1 where R 0 = 1 is the reliability or survival probability at time 0 Here the distinction of r (i) and r i is the following Whereas r i = n i + 1 represents the reverse rank of the i th ordered observation (censored or uncensored), the notation r (i) is the reverse rank associated with the i th ordered failure time For example, if the first failure time is preceded by two censoring times, then r 1 = n, r 2 = n 1, r 3 = n 2=r (1) The sample calculations in Table 1 should clarify this further 5

6 In the above recursive formula for R i one can view r (i) /(r (i) +1)asanestimate ofthe conditional probability of survival at the time of the i th failure, since at the i th failure there are r (i) items, one has just failed and the other r (i) 1 survived Thus the proportion of survived items is (r (i) 1)/r (i) Since this can lead to zero values, which cause problems in probability plotting, one modifies this conditional probability estimate to r (i) /(r (i) +1) R i 1 is the estimate of survival prior to i th failure and R i is the estimate of survival after the i th failure As an aside, the proportion (r (i) 1)/r (i) figures strongly in the definition of the survival function estimate due to Kaplan and Meier (1958) This estimate is defined by R i = r (i) 1 r (i) R i 1 with R 0 = 1 Kaplan and Meier show that the step function with steps F i =1 R i at the i th failure time is the nonparametric maximum likelihood estimate of the distribution function of failure times without reference to a Weibull model See Scholz (1980) for a proper definition of maximum likelihood in such wider nonparametric settings Returning to the recursion defining R i,thep i plotting position for the i th failure time is then taken as p i =1 R i Note that the index i here counts consecutively through the failure times No points are plotted corresponding to the censored times If there are no censoring times, this method reduces to the above method of using p i = i/(n + 1) in complete samples and can be viewed as a reason for preferring this method over others An example calculation is presented in Table 1, which is taken from Nelson (1982) The corresponding Weibull plot is illustrated in Figure 4 of Appendix A Note that the time units were viewed in tens since the failure data range is [317, 1100] and a two cycle log 10 Weibull paper was used From the Weibull plot one can read off the following estimates for α and β, namely α 125 and β 199 The line was fitted by the method of least squares To do so one has to work with the transformed failure times Y [i] =log 10 (T [i] ) and with the corresponding values w(p [i] )=log 10 [ log e (1 p [i] )] The square brackets around the subscripts indicate that the numbering is consecutive along the k failures In the example of Table 1 this is along 1, 2,,7, since there are k = 7 failure times The least squares calculations use the following formulas β = ki=1 w(p [i] )(Y [i] Y ) ki=1 (Y [i] Y ) 2 with Y = 1 k 6 k Y [i] i=1

7 Table 1: Winding Data & Herd-Johnson Calculations Reliability/Survival Probability Reverse Failure Rank Cond l Previous Current Prob Time r i r (i) /(r (i) +1) R i 1 = R i p i =1 R i (16/17) 1000 = (15/16) 0941 = (14/15) 0883 = (12/13) 0824 = (11/12) 0761 = (4/5) 0697 = (2/3) 0557 = Running or Censored 7

8 and β log 10 ( α) =w β Y with w = 1 k w(p [i] ) k i=1 The estimates from the least squares calculations are α = 1233 and β =1986 Since the variability is in the Y [i] and not in the w(p [i] )onemaypreferdoingtheleast squares calculations with abscissa and ordinate reversed, ie, according to the model (1) In that case one obtains 1 β = ki=1 (w(p [i] ) w)y [i] ki=1 (w(p [i] ) w) 2 with w = 1 k k w(p [i] ) i=1 and log 10 ( α) =Y w/ β with Y = 1 k Y [i] k i=1 Although the same symbols α and β were used, their values will not be the same under the two least squares approaches In fact, for this last method the least squares estimates are α = 1205 and β =2055 The resulting fitted line is indicated by the dashed line in Figure 4 Also shown is the line resulting from the maximum likelihood estimates with α = 1232 and β = Plotting with Type II Censoring Here the observed data consist of T (1) T (r) and all that is known about the other n r (future) failure times is that they exceed T (r) Since these first r ordered failures are the quantile estimates of t p1,,t pr with p i = i/(n + 1) one can again plot these r points as before on Weibull paper, the only difference being that the n r censored points are not plotted Again one can fit a line by eye or by least squares However here the low extremes will weigh in more heavily since the upper extremes are missing As an example, the six lowest values of the complete sample underlying Figure 3 were taken as a type II censored sample and are plotted in Figure 5 As in Figure 3 the true line, the least squares fit line, and the line corresponding to the maximum likelihood estimates are shown Note that the latter is reasonably close to the true line, whereas the least squares line is led astray substantially 8

9 5 Checking the Weibull Model According to the motivation behind Weibull plotting one expects to see a roughly linear pattern in the plotted points The examples shown so far may have put a damper on this, but that is mainly due to the small sample sizes in those examples Some of the examples were artificially generated from Weibull populations Thus there is no question about their origin However, the example given by Nelson is supposed to be real data and its pattern on Weibull paper looks remarkably close to linear This has been observed in other expositions on Weibull plotting as well and it may raise false expectations in the prospective user When nonlinear patterns are found (in small samples) the user may then be led on false chase for other causes, such as multiple failure modes There may well be such multiple modes, but small samples are not a good starting point to look for these In order to get a sense for the effect of sample size on the variability in linear Weibull patterns eight Weibull samples each were generated at sample sizes n = 10, n = 30, n = 100 and were plotted on an equivalent of Weibull probability paper (without the grid lines, etc), see Figures 6-8 Also shown on each plot is the true line corresponding to the Weibull distribution from which the sample was drawn The sample to sample variability at n =10 is substantial whereas the linearity and proximity to the true line is quite satisfactory at n = 100 References [1] Abernethy, RB, Breneman, JE, Medlin, CH, and Reinman, GL (1983) Weibull Analysis Handbook, Pratt & Whitney Aircraft, Government Products Division, United Technologies, PO Box 2691, West Palm Beach, Florida [2] Kaplan, EL and Meier, P (1958) Nonparametric estimation from incomplete observations J Amer Statist Assoc 53, [3] Nelson, W 1982 Applied Life Data Analysis John Wiley & Sons [4] Scholz, FW (1980) Towards a unified definition of maximum likelihood The Canad J of Statist 8,

10 Appendix A This appendix contains two blank copies of Weibull probability paper (without page number) The first has a two cycle log 10 scale on the abscissa and the second has a three cycle log 10 scale This is followed by several illustrations of Weibull paper with plotted samples Figure 3 illustrates the plotting of a complete Weibull sample with three line superimposed: 1) the true line representing the sampled Weibull population, 2) a least squares fitted line (which is strongly influenced by stragglers), and 3) a line corresponding to maximum likelihood estimates of the paramters Figure 4 illustrates the plotting of a type I censored sample representing some real data taken from Nelson (1982) Aside from the maximum likelihood fitted line there are two types of least squares lines In one the ordinate is regressed on the abscissa and in the other the abscissa is regressed on the ordinate Figure 5 illustrates the plotting of a type II censored sample by taking the six lowest failure times from the complete sample underlying Figure 3 Again the true line, the least squares fitted line and the maximum likelihood fitted lines are shown Figure 6-8 show a collection of eight random samples, each taken from a Weibull population Figure 6 shows those samples of size n = 10, and Figure 7 and Figure 8 show similar plots for sample sizes n = 30andn = 100, respectively On each plot the thick represents the true sampled Weibull population and the thin line is the least squares fitted line 10

11 0 Weibull Probability Paper 2 Cycle Log

12 Weibull Probability Paper 3 Cycle Log

13 Weibull Probability Paper 2 Cycle Log Complete Sample maximum likelihood estimate least squares estimate true line, equation (1) Figure 3 40

14 Figure 4 Weibull Probability Paper 2 Cycle Log Type I Censored Sample maximum likelihood estimate least squares estimate, y vs x least squares estimate, x vs y

15 Weibull Probability Paper 2 Cycle Log Type II Censored Sample maximum likelihood estimate least squares estimate true line, equation (1) Figure 5 40

16 Figure 6: Weibull Probability Plots log10[-ln(1-p)] sample size n = log10(t)

17 Figure 7: Weibull Probability Plots log10[-ln(1-p)] sample size n = log10(t)

18 Figure 8: Weibull Probability Plots log10[-ln(1-p)] sample size n = log10(t)

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Confidence Intervals for Exponential Reliability

Confidence Intervals for Exponential Reliability Chapter 408 Confidence Intervals for Exponential Reliability Introduction This routine calculates the number of events needed to obtain a specified width of a confidence interval for the reliability (proportion

More information

Determination of g using a spring

Determination of g using a spring INTRODUCTION UNIVERSITY OF SURREY DEPARTMENT OF PHYSICS Level 1 Laboratory: Introduction Experiment Determination of g using a spring This experiment is designed to get you confident in using the quantitative

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA

ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA Michael R. Middleton, McLaren School of Business, University of San Francisco 0 Fulton Street, San Francisco, CA -00 -- middleton@usfca.edu

More information

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2 Lesson 4 Part 1 Relationships between two numerical variables 1 Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Review of Fundamental Mathematics

Review of Fundamental Mathematics Review of Fundamental Mathematics As explained in the Preface and in Chapter 1 of your textbook, managerial economics applies microeconomic theory to business decision making. The decision-making tools

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Example G Cost of construction of nuclear power plants

Example G Cost of construction of nuclear power plants 1 Example G Cost of construction of nuclear power plants Description of data Table G.1 gives data, reproduced by permission of the Rand Corporation, from a report (Mooz, 1978) on 32 light water reactor

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Lecture 15 Introduction to Survival Analysis

Lecture 15 Introduction to Survival Analysis Lecture 15 Introduction to Survival Analysis BIOST 515 February 26, 2004 BIOST 515, Lecture 15 Background In logistic regression, we were interested in studying how risk factors were associated with presence

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Experiment #3, Ohm s Law

Experiment #3, Ohm s Law Experiment #3, Ohm s Law 1 Purpose Physics 182 - Summer 2013 - Experiment #3 1 To investigate the -oltage, -, characteristics of a carbon resistor at room temperature and at liquid nitrogen temperature,

More information

Section 3 Part 1. Relationships between two numerical variables

Section 3 Part 1. Relationships between two numerical variables Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.

More information

The KaleidaGraph Guide to Curve Fitting

The KaleidaGraph Guide to Curve Fitting The KaleidaGraph Guide to Curve Fitting Contents Chapter 1 Curve Fitting Overview 1.1 Purpose of Curve Fitting... 5 1.2 Types of Curve Fits... 5 Least Squares Curve Fits... 5 Nonlinear Curve Fits... 6

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Confidence Intervals for Spearman s Rank Correlation

Confidence Intervals for Spearman s Rank Correlation Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence

More information

Correlation key concepts:

Correlation key concepts: CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)

More information

Math 0980 Chapter Objectives. Chapter 1: Introduction to Algebra: The Integers.

Math 0980 Chapter Objectives. Chapter 1: Introduction to Algebra: The Integers. Math 0980 Chapter Objectives Chapter 1: Introduction to Algebra: The Integers. 1. Identify the place value of a digit. 2. Write a number in words or digits. 3. Write positive and negative numbers used

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

EQUATIONS and INEQUALITIES

EQUATIONS and INEQUALITIES EQUATIONS and INEQUALITIES Linear Equations and Slope 1. Slope a. Calculate the slope of a line given two points b. Calculate the slope of a line parallel to a given line. c. Calculate the slope of a line

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components An Application of Weibull Analysis to Determine Failure Rates in Automotive Components Jingshu Wu, PhD, PE, Stephen McHenry, Jeffrey Quandt National Highway Traffic Safety Administration (NHTSA) U.S. Department

More information

Distribution (Weibull) Fitting

Distribution (Weibull) Fitting Chapter 550 Distribution (Weibull) Fitting Introduction This procedure estimates the parameters of the exponential, extreme value, logistic, log-logistic, lognormal, normal, and Weibull probability distributions

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Experiment #1, Analyze Data using Excel, Calculator and Graphs.

Experiment #1, Analyze Data using Excel, Calculator and Graphs. Physics 182 - Fall 2014 - Experiment #1 1 Experiment #1, Analyze Data using Excel, Calculator and Graphs. 1 Purpose (5 Points, Including Title. Points apply to your lab report.) Before we start measuring

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

More information

Analysis of Bayesian Dynamic Linear Models

Analysis of Bayesian Dynamic Linear Models Analysis of Bayesian Dynamic Linear Models Emily M. Casleton December 17, 2010 1 Introduction The main purpose of this project is to explore the Bayesian analysis of Dynamic Linear Models (DLMs). The main

More information

AP Physics 1 and 2 Lab Investigations

AP Physics 1 and 2 Lab Investigations AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks

More information

Applied Reliability Page 1 APPLIED RELIABILITY. Techniques for Reliability Analysis

Applied Reliability Page 1 APPLIED RELIABILITY. Techniques for Reliability Analysis Applied Reliability Page 1 APPLIED RELIABILITY Techniques for Reliability Analysis with Applied Reliability Tools (ART) (an EXCEL Add-In) and JMP Software AM216 Class 4 Notes Santa Clara University Copyright

More information

Objectives. Experimentally determine the yield strength, tensile strength, and modules of elasticity and ductility of given materials.

Objectives. Experimentally determine the yield strength, tensile strength, and modules of elasticity and ductility of given materials. Lab 3 Tension Test Objectives Concepts Background Experimental Procedure Report Requirements Discussion Objectives Experimentally determine the yield strength, tensile strength, and modules of elasticity

More information

RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA

RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA ABSTRACT Random vibration is becoming increasingly recognized as the most realistic method of simulating the dynamic environment of military

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction Binghamton High School Rev.9/21/05 Math 1 September What is the unknown? Model relationships by using Fundamental skills of 2005 variables as a shorthand way Algebra Why do we use variables? What is a

More information

Chapter 6: Constructing and Interpreting Graphic Displays of Behavioral Data

Chapter 6: Constructing and Interpreting Graphic Displays of Behavioral Data Chapter 6: Constructing and Interpreting Graphic Displays of Behavioral Data Chapter Focus Questions What are the benefits of graphic display and visual analysis of behavioral data? What are the fundamental

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Physics Lab Report Guidelines

Physics Lab Report Guidelines Physics Lab Report Guidelines Summary The following is an outline of the requirements for a physics lab report. A. Experimental Description 1. Provide a statement of the physical theory or principle observed

More information

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Introduction to nonparametric regression: Least squares vs. Nearest neighbors Introduction to nonparametric regression: Least squares vs. Nearest neighbors Patrick Breheny October 30 Patrick Breheny STA 621: Nonparametric Statistics 1/16 Introduction For the remainder of the course,

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009

HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. A General Formulation 3. Truncated Normal Hurdle Model 4. Lognormal

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Review of Scientific Notation and Significant Figures

Review of Scientific Notation and Significant Figures II-1 Scientific Notation Review of Scientific Notation and Significant Figures Frequently numbers that occur in physics and other sciences are either very large or very small. For example, the speed of

More information

Experiment 3 Lenses and Images

Experiment 3 Lenses and Images Experiment 3 Lenses and Images Who shall teach thee, unless it be thine own eyes? Euripides (480?-406? BC) OBJECTIVES To examine the nature and location of images formed by es. THEORY Lenses are frequently

More information

Solving Mass Balances using Matrix Algebra

Solving Mass Balances using Matrix Algebra Page: 1 Alex Doll, P.Eng, Alex G Doll Consulting Ltd. http://www.agdconsulting.ca Abstract Matrix Algebra, also known as linear algebra, is well suited to solving material balance problems encountered

More information

GRADES 7, 8, AND 9 BIG IDEAS

GRADES 7, 8, AND 9 BIG IDEAS Table 1: Strand A: BIG IDEAS: MATH: NUMBER Introduce perfect squares, square roots, and all applications Introduce rational numbers (positive and negative) Introduce the meaning of negative exponents for

More information

Applying Statistics Recommended by Regulatory Documents

Applying Statistics Recommended by Regulatory Documents Applying Statistics Recommended by Regulatory Documents Steven Walfish President, Statistical Outsourcing Services steven@statisticaloutsourcingservices.com 301-325 325-31293129 About the Speaker Mr. Steven

More information

Designer: Nathan Kimball. Stage 1 Desired Results

Designer: Nathan Kimball. Stage 1 Desired Results Interpolation Subject: Science, math Grade: 6-8 Time: 4 minutes Topic: Reading Graphs Designer: Nathan Kimball Stage 1 Desired Results Lesson Overview: In this activity students work with the direct linear

More information

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Exact Nonparametric Tests for Comparing Means - A Personal Summary Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

Econometrics Simple Linear Regression

Econometrics Simple Linear Regression Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight

More information

Operation Count; Numerical Linear Algebra

Operation Count; Numerical Linear Algebra 10 Operation Count; Numerical Linear Algebra 10.1 Introduction Many computations are limited simply by the sheer number of required additions, multiplications, or function evaluations. If floating-point

More information

Characterizing Digital Cameras with the Photon Transfer Curve

Characterizing Digital Cameras with the Photon Transfer Curve Characterizing Digital Cameras with the Photon Transfer Curve By: David Gardner Summit Imaging (All rights reserved) Introduction Purchasing a camera for high performance imaging applications is frequently

More information

Chemical Kinetics. 2. Using the kinetics of a given reaction a possible reaction mechanism

Chemical Kinetics. 2. Using the kinetics of a given reaction a possible reaction mechanism 1. Kinetics is the study of the rates of reaction. Chemical Kinetics 2. Using the kinetics of a given reaction a possible reaction mechanism 3. What is a reaction mechanism? Why is it important? A reaction

More information

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions. Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course

More information

Reliability. 26.1 Reliability Models. Chapter 26 Page 1

Reliability. 26.1 Reliability Models. Chapter 26 Page 1 Chapter 26 Page 1 Reliability Although the technological achievements of the last 50 years can hardly be disputed, there is one weakness in all mankind's devices. That is the possibility of failure. What

More information

Lin s Concordance Correlation Coefficient

Lin s Concordance Correlation Coefficient NSS Statistical Software NSS.com hapter 30 Lin s oncordance orrelation oefficient Introduction This procedure calculates Lin s concordance correlation coefficient ( ) from a set of bivariate data. The

More information

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management

More information

Scatter Plot, Correlation, and Regression on the TI-83/84

Scatter Plot, Correlation, and Regression on the TI-83/84 Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page

More information

PLOTTING DATA AND INTERPRETING GRAPHS

PLOTTING DATA AND INTERPRETING GRAPHS PLOTTING DATA AND INTERPRETING GRAPHS Fundamentals of Graphing One of the most important sets of skills in science and mathematics is the ability to construct graphs and to interpret the information they

More information

A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion

A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion Objective In the experiment you will determine the cart acceleration, a, and the friction force, f, experimentally for

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

Solving Systems of Two Equations Algebraically

Solving Systems of Two Equations Algebraically 8 MODULE 3. EQUATIONS 3b Solving Systems of Two Equations Algebraically Solving Systems by Substitution In this section we introduce an algebraic technique for solving systems of two equations in two unknowns

More information

INTRODUCTION TO ERRORS AND ERROR ANALYSIS

INTRODUCTION TO ERRORS AND ERROR ANALYSIS INTRODUCTION TO ERRORS AND ERROR ANALYSIS To many students and to the public in general, an error is something they have done wrong. However, in science, the word error means the uncertainty which accompanies

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Numeracy and mathematics Experiences and outcomes

Numeracy and mathematics Experiences and outcomes Numeracy and mathematics Experiences and outcomes My learning in mathematics enables me to: develop a secure understanding of the concepts, principles and processes of mathematics and apply these in different

More information

Florida Math for College Readiness

Florida Math for College Readiness Core Florida Math for College Readiness Florida Math for College Readiness provides a fourth-year math curriculum focused on developing the mastery of skills identified as critical to postsecondary readiness

More information

Continued Fractions and the Euclidean Algorithm

Continued Fractions and the Euclidean Algorithm Continued Fractions and the Euclidean Algorithm Lecture notes prepared for MATH 326, Spring 997 Department of Mathematics and Statistics University at Albany William F Hammond Table of Contents Introduction

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

Chapter 3. Distribution Problems. 3.1 The idea of a distribution. 3.1.1 The twenty-fold way

Chapter 3. Distribution Problems. 3.1 The idea of a distribution. 3.1.1 The twenty-fold way Chapter 3 Distribution Problems 3.1 The idea of a distribution Many of the problems we solved in Chapter 1 may be thought of as problems of distributing objects (such as pieces of fruit or ping-pong balls)

More information

Lecture 3: Linear methods for classification

Lecture 3: Linear methods for classification Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Cumulative Diagrams: An Example

Cumulative Diagrams: An Example Cumulative Diagrams: An Example Consider Figure 1 in which the functions (t) and (t) denote, respectively, the demand rate and the service rate (or capacity ) over time at the runway system of an airport

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Measurement with Ratios

Measurement with Ratios Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical

More information

"Reliability and MTBF Overview"

Reliability and MTBF Overview "Reliability and MTBF Overview" Prepared by Scott Speaks Vicor Reliability Engineering Introduction Reliability is defined as the probability that a device will perform its required function under stated

More information

Pearson's Correlation Tests

Pearson's Correlation Tests Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

More information

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables

More information

In this chapter, you will learn improvement curve concepts and their application to cost and price analysis.

In this chapter, you will learn improvement curve concepts and their application to cost and price analysis. 7.0 - Chapter Introduction In this chapter, you will learn improvement curve concepts and their application to cost and price analysis. Basic Improvement Curve Concept. You may have learned about improvement

More information

Life Data Analysis using the Weibull distribution

Life Data Analysis using the Weibull distribution RELIABILITY ENGINEERING Life Data Analysis using the Weibull distribution PLOT Seminar October 2008 Ing. Ronald Schop Weibull: Reliability Engineering www.weibull.nl Content Why Reliability Weibull Statistics

More information

Reliability estimators for the components of series and parallel systems: The Weibull model

Reliability estimators for the components of series and parallel systems: The Weibull model Reliability estimators for the components of series and parallel systems: The Weibull model Felipe L. Bhering 1, Carlos Alberto de Bragança Pereira 1, Adriano Polpo 2 1 Department of Statistics, University

More information

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics

More information

Time Series and Forecasting

Time Series and Forecasting Chapter 22 Page 1 Time Series and Forecasting A time series is a sequence of observations of a random variable. Hence, it is a stochastic process. Examples include the monthly demand for a product, the

More information

T O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these

More information

Peer review on manuscript "Predicting environmental gradients with..." by Peer 410

Peer review on manuscript Predicting environmental gradients with... by Peer 410 Peer review on manuscript "Predicting environmental gradients with..." by Peer 410 ADDED INFO ABOUT FEATURED PEER REVIEW This peer review is written by Dr. Richard Field, Associate Professor of Biogeography

More information