1 Overview. Coefficient of Variation. Hervé Abdi

Similar documents
1 Overview. Fisher s Least Significant Difference (LSD) Test. Lynne J. Williams Hervé Abdi

The Kendall Rank Correlation Coefficient

1 Overview and background

The Bonferonni and Šidák Corrections for Multiple Comparisons

The Method of Least Squares

Factor Rotations in Factor Analyses.

Partial Least Squares (PLS) Regression.

12: Analysis of Variance. Introduction

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Multiple Correspondence Analysis

Metric Multidimensional Scaling (MDS): Analyzing Distance Matrices

The correlation coefficient

Study Guide for the Final Exam

Regression Analysis: A Complete Example

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

COEFFICIENT OF CONCORDANCE

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

1.5 Oneway Analysis of Variance

Part 2: Analysis of Relationship Between Two Variables

Means, standard deviations and. and standard errors

Fairfield Public Schools

Sample Size and Power in Clinical Trials

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

CHAPTER 13. Experimental Design and Analysis of Variance

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Population Mean (Known Variance)

Using Excel for inferential statistics

Introduction to Regression and Data Analysis

Week 4: Standard Error and Confidence Intervals

August 2012 EXAMINATIONS Solution Part I

Statistics Review PSY379

UNDERSTANDING THE TWO-WAY ANOVA

Power and sample size in multilevel modeling

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

One-Way Analysis of Variance (ANOVA) Example Problem

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Simple Linear Regression Inference

Partial Least Square Regression PLS-Regression

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Two-sample inference: Continuous data

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

One-Way Analysis of Variance

Hypothesis testing - Steps

Independent t- Test (Comparing Two Means)

Simple Second Order Chi-Square Correction

Factors affecting online sales

Confidence Intervals for the Difference Between Two Means

Least Squares Estimation

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Quantitative Methods for Finance

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Descriptive Statistics

Introduction to Statistics and Quantitative Research Methods

Chapter 7. One-way ANOVA

3.4 Statistical inference for 2 populations based on two samples

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA

Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Battery Bias

James E. Bartlett, II is Assistant Professor, Department of Business Education and Office Administration, Ball State University, Muncie, Indiana.

Confidence Intervals for One Standard Deviation Using Standard Deviation

Basic Concepts in Research and Data Analysis

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Example: Boats and Manatees

Impact of Skewness on Statistical Power

Regression step-by-step using Microsoft Excel

Lecture Notes Module 1

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Measurement. How are variables measured?

Confidence Intervals for Cp

Randomized Block Analysis of Variance

Module 5: Multiple Regression Analysis

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

The Method of Least Squares

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

2 Sample t-test (unequal sample sizes and unequal variances)

HLA data analysis in anthropology: basic theory and practice

INVESTIGATION OF FALLING BALL VISCOMETRY AND ITS ACCURACY GROUP R1 Evelyn Chou, Julia Glaser, Bella Goyal, Sherri Wykosky

Solution to HW - 1. Problem 1. [Points = 3] In September, Chapel Hill s daily high temperature has a mean

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

AP Physics 1 and 2 Lab Investigations

Fixed-Effect Versus Random-Effects Models

HYPOTHESIS TESTING: POWER OF THE TEST

Module 2 Probability and Statistics

International Statistical Institute, 56th Session, 2007: Phil Everson

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

How To Write A Data Analysis

Simple Regression Theory II 2010 Samuel L. Baker

5.1 Identifying the Target Parameter

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

What are confidence intervals and p-values?

Regression and Correlation

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

Transcription:

In Neil Salkind (Ed.), Encyclopedia of Research Design. Thousand Oaks, CA: Sage. 2010 Coefficient of Variation 1 Overview The coefficient of variation measures the variability of a series of numbers independently of the unit of measurement used for these numbers. In order to do so, the coefficient od variation eliminates the unit of measurement of the standard deviation of a series of numbers by dividing it by the mean of these numbers. The coefficient of variation can be used to compare distributions obtained with different units, such as, for example, the variability of the weights of newborns (measured in grams) with the size of adults (measured in centimeters). The coefficient of variation is meaningful only for measurements with a real zero (i.e., ratio scales ) because the mean is meaningful (i.e., unique) only for these scales. So, for example, it will be meaningless to compute the coefficient of variation of the temperature measured in degrees Fahrenheit, because changing the measurement to degrees Celsius will not change the temperature but will change the value of the coefficient of variation (because the value of zero for Celsius is thirty-two for Fahrenheit and therefore the mean The University of Texas at Dallas Address correspondence to: Program in Cognition and Neurosciences, MS: Gr.4.1, The University of Texas at Dallas, Richardson, TX 75083 0688, USA E-mail: herve@utdallas.edu http://www.utd.edu/ herve

2 Coefficient of Variation of the temperature will change from one scale to the other, whereas the standard deviation will not change). In addition, the values of the measurement used to compute the coefficient of variation are assumed to be always positive or null. The coefficient of variation is primarily a descriptive statistic, but it is amenable to statistical inferences such as null hypothesis testing or confidence intervals. Standard procedures are often very dependent of the normality assumption and current work is exploring alternative procedures which are less dependent on this normality assumption. 2 Coefficient of variation The coefficient of variation denoted C V (or occasionally V ) eliminates the unit of measurement from the standard deviation of a series of number by dividing it by the mean of this series of numbers. Formally, if, for a series of N numbers, the standard deviation and the mean are denoted respectively by S and M, the coefficient of variation is computed as: C V = S M. (1) Often the coefficient of variation is expressed as a percentage which corresponds to the following formula for the coefficient of variation C V = S 100 M. (2) This last formula can be potentially misleading because, as we see later, the value of the coefficient of variation can exceed unity and therefore would create percentages larger than one hundred. Here we will use formula 1 and express C V as a ratio rather than a percentage. 2.1 Range of the coefficient of variation In a finite sample of N non-negative numbers with a real zero, the coefficient of variation can take value between 0 and N 1 (the maximum value of C V is reached when all values but one are equal to zero).

HERVÉ ABDI 3 2.2 Estimation of a population coefficient of variation The coefficient of variation computed on a sample is a biased estimate of the population coefficient of variation denoted γ V. An unbiased estimate of the population coefficient of variation is denoted Ĉ V, it is computed as Ĉ V = (where N is the sample size). 3 Testing the Coefficient of variation ( 1 + 1 ) C V (3) 4N When the coefficient of variation is computed on a sample drawn from a normal population, its standard error denoted σ CV is known and is equal to σ CV = γ V. (4) When γ V is not known (which is, in general, the case), σ CV can be estimated by replacing γ V by its estimation from the sample. Both C V or ĈV can be used for this purpose (ĈV being preferable because it is a better estimate). So σ CV can be estimated as: S CV = C V or Ŝ CV = ĈV. (5) Therefore, under the assumption of normality, the statistic t CV = C V γ V S CV (6) follows a Student distribution with ν = N 1 degrees of freedom. It should be stressed that this test is very sensitive to the normality assumption. Work is still being done to minimize the effect of this assumption. as By rewriting Equation 6, confidence intervals can be computed C V ± t α,ν S CV (7)

4 Coefficient of Variation Table 1 Example for the coefficient of variation. Daily commission (in dollars) of ten salespersons. Saleperson 1 2 3 4 5 6 7 8 9 10 Commission 152 155 164 164 182 221 233 236 245 248 (with t α,ν being the critical value of Student s t for the chosen α level and for ν = N 1 degrees of freedom). Again, because ĈV is a better estimation of γ V than C V, it makes some sense to use ĈV rather than C V. 4 Example Table 1 lists the daily commission in dollars of ten car salespersons. The mean commission is equal to $200 with a standard deviation of $40. This gives a value of the coefficient of variation of: C V = S M = 40 = 0.200, (8) 200 which corresponds to a population estimate of ( Ĉ V = 1 + 1 ) ( C V = 1 + 1 ) 0.200 = 0.205. (9) 4N 4 10 The standard error of the coefficient of variation is estimated as: S CV = C V = 0.200 2 10 = 0.0447 (10) (the value of ŜC V is equal to 0.0458). A t-criterion testing the hypothesis that the population value of the coefficient of variation is equal to zero is equal to: t CV = C V γ V = 0.2000 = 4.47. (11) S CV 0.0447 This value of t CV = 4.47 is larger than the critical value of t α,ν = 2.26 (which is the critical value of a Student s t distribution for α =.05 and ν = 9 degrees of freedom). Therefore we can reject

HERVÉ ABDI 5 the null hypothesis and conclude that γ V is larger than zero. A 95% corresponding confidence interval gives the values of C V ± t α,ν S CV = 0.2000 ± 2.26 0.0447 = 0.200 ± 0.1011 (12) and therefore we conclude that there is a probability of.95 that the value of γ V lays in the interval [0.0989 0.3011]. Related entries Mean, standard deviation, variance, variability. Further readings Abdi, H., Edelman, B., Valentin, D., & Dowling, W.J. (2009). Experimental design and analysis for psychology. Oxford: Oxford University Press. Curto, J.D., & Pinto, J.C. (2009). He coefficient of variation asmptotic distribution in the case of non-iid random variables. Journal of Applied Statistics, 36, 21 32. Nairy, K.S. & Rao, K.N. (2003). Tests of coefficients of variation of normal populations. Commnunications in Statistics Simulation and Computation, 32, 641-661. Martin, J.D., & Gray, L.N. (1971). Measurement of relative variation: Sociological examples. American Sociological Review, 36, 496 502. Sokal, R.R, & Rohlf, F.J. (1995). Biometry (3rd Edition). New York: Freeman.