Measures of Dispersion. Definition. Measures of Dispersion. Measures of Dispersion. Greg C Elvers, Ph.D.

Similar documents
Descriptive Statistics

Measures of Central Tendency and Variability: Summarizing your Data for Others

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Frequency Distributions

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.

Statistics. Measurement. Scales of Measurement 7/18/2012

Exploratory Data Analysis. Psychology 3256

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

MEASURES OF VARIATION

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

CALCULATIONS & STATISTICS

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

6.4 Normal Distribution

Means, standard deviations and. and standard errors

Topic 9 ~ Measures of Spread

Module 3: Correlation and Covariance

Descriptive Statistics and Measurement Scales

Lesson 4 Measures of Central Tendency

Standard Deviation Estimator

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

The correlation coefficient

Lecture 1: Review and Exploratory Data Analysis (EDA)

3: Summary Statistics

1.6 The Order of Operations

Week 3&4: Z tables and the Sampling Distribution of X

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Introduction; Descriptive & Univariate Statistics

Northumberland Knowledge

Week 1. Exploratory Data Analysis

9.1 Measures of Center and Spread

Characteristics of Binomial Distributions

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

2. Filling Data Gaps, Data validation & Descriptive Statistics

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Simple Regression Theory II 2010 Samuel L. Baker

Simple linear regression

3. What is the difference between variance and standard deviation? 5. If I add 2 to all my observations, how variance and mean will vary?

Comparing Means in Two Populations

Descriptive Statistics

Constructing and Interpreting Confidence Intervals

AP * Statistics Review. Descriptive Statistics

Data Exploration Data Visualization

1.5 Oneway Analysis of Variance

How To Write A Data Analysis

CHAPTER THREE. Key Concepts

Independent samples t-test. Dr. Tom Pierce Radford University

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

Midterm Review Problems

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

8. THE NORMAL DISTRIBUTION

Normal distribution. ) 2 /2σ. 2π σ

Mind on Statistics. Chapter 2

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

Descriptive statistics; Correlation and regression

Module 4: Data Exploration

Study Guide for Essentials of Statistics for the Social and Behavioral Sciences by Barry H. Cohen and R. Brooke Lea. Chapter 1

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

6 3 The Standard Normal Distribution

Week 4: Standard Error and Confidence Intervals

Shape of Data Distributions

DESCRIPTIVE STATISTICS & DATA PRESENTATION*

Geostatistics Exploratory Analysis

Variables. Exploratory Data Analysis

z-scores AND THE NORMAL CURVE MODEL

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

Introduction to Risk, Return and the Historical Record

Describing Data: Measures of Central Tendency and Dispersion

Pre-Algebra Academic Content Standards Grade Eight Ohio. Number, Number Sense and Operations Standard. Number and Number Systems

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Advanced Topics in Statistical Process Control

NCSS Statistical Software. One-Sample T-Test

Algebra I Vocabulary Cards

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

TImath.com. F Distributions. Statistics

Reliability Overview

The Perverse Nature of Standard Deviation Denton Bramwell

3 Describing Distributions

Statistical Confidence Calculations

Session 7 Bivariate Data and Analysis

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Ch. 3.1 # 3, 4, 7, 30, 31, 32

Scientific Method. 2. Design Study. 1. Ask Question. Questionnaire. Descriptive Research Study. 6: Share Findings. 1: Ask Question.

Algebra Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Procedure for Graphing Polynomial Functions

Simplifying Square-Root Radicals Containing Perfect Square Factors

Pre-course Materials

Final Exam Practice Problem Answers

Introduction to Matrix Algebra

Category 3 Number Theory Meet #1, October, 2000

Section 1.5 Exponents, Square Roots, and the Order of Operations

Coins, Presidents, and Justices: Normal Distributions and z-scores

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Transcription:

Measures of Dispersion Greg C Elvers, Ph.D. 1 Definition Measures of dispersion are descriptive statistics that describe how similar a set of scores are to each other The more similar the scores are to each other, the lower the measure of dispersion will be The less similar the scores are to each other, the higher the measure of dispersion will be In general, the more spread out a distribution is, the larger the measure of dispersion will be Measures of Dispersion Which of the distributions of scores has the larger dispersion? The upper distribution has more dispersion because the scores are more spread out That is, they are less similar to each other 15 100 75 50 5 0 1 3 5 7 8 9 10 15 100 75 50 5 0 1 3 5 7 8 9 10 3 Measures of Dispersion There are three main measures of dispersion: The range The semi-interquartile range (SIR) Variance / standard deviation

The Range The range is defined as the difference between the largest score in the set of data and the smallest score in the set of data, X L -X S What is the range of the following data: 8 1 9 3 9 The largest score (X L ) is 9; the smallest score (X S ) is 1; the range is X L -X S 9-1 8 5 When To Use the Range The range is used when you have ordinal data or you are presenting your results to people with little or no knowledge of statistics The range is rarely used in scientific work as it is fairly insensitive It depends on only two scores in the set of data, X L and X S Two very different sets of data can have the same range: 1 1 1 1 9 vs 1 3 5 7 9 The Semi-Interquartile Range The semi-interquartile range (or SIR) is defined as the difference of the first and third quartiles divided by two The first quartile is the 5 th percentile The third quartile is the 75 th percentile SIR (Q 3 -Q 1 ) / 7 SIR Example What is the SIR for the data to the right? 5 % of the scores are below 5 5 is the first quartile 5 % of the scores are above 5 5 is the third quartile SIR (Q 3 -Q 1 ) / (5-5) / 10 8 10 1 1 0 30 0 5 5 th %tile 5 75 th %tile 8

When To Use the SIR The SIR is often used with skewed data as it is insensitive to the extreme scores 9 Variance Variance is defined as the average of the square deviations: ( X ) µ σ 10 What Does the Variance Formula First, it says to subtract the mean from each of the scores This difference is called a deviate or a deviation score The deviate tells us how far a given score is from the typical, or average, score Thus, the deviate is a measure of dispersion for a given score 11 What Does the Variance Formula Why can t we simply take the average of the deviates? That is, why isn t variance defined as: ( X ) µ σ This is not the formula for variance! 1

What Does the Variance Formula One of the definitions of the mean was that it always made the sum of the scores minus the mean equal to 0 Thus, the average of the deviates must be 0 since the sum of the deviates must equal 0 To avoid this problem, statisticians square the deviate score prior to averaging them Squaring the deviate score makes all the squared scores positive 13 What Does the Variance Formula Variance is the mean of the squared deviation scores The larger the variance is, the more the scores deviate, on average, away from the mean The smaller the variance is, the less the scores deviate, on average, from the mean 1 Standard Deviation When the deviate scores are squared in variance, their unit of measure is squared as well E.g. If people s weights are measured in pounds, then the variance of the weights would be expressed in pounds (or squared pounds) Since squared units of measure are often awkward to deal with, the square root of variance is often used instead The standard deviation is the square root of variance 15 Standard Deviation Standard deviation variance Variance standard deviation 1

Computational Formula When calculating variance, it is often easier to use a computational formula which is algebraically equivalent to the definitional formula: ( X ) σ ( X ) X µ σ is the population variance, X is a score, µ is the population mean, and is the number of scores 17 Computational Formula Example X X X-µ (X-µ) 9 81 8 1 1 3-1 1 5 5-8 1 1 3-1 1 Σ Σ 30 Σ 0 Σ 1 18 Computational Formula Example σ X 30 30 9 1 ( X ) σ 1 ( X ) µ 19 Variance of a Sample Because the sample mean is not a perfect estimate of the population mean, the formula for the variance of a sample is slightly different from the formula for the variance of a population: s ( X X ) 1 s is the sample variance, X is a score, X is the sample mean, and is the number of scores 0

Measure of Skew Skew is a measure of symmetry in the distribution of scores Positive Skew ormal (skew 0) egative Skew 1 Measure of Skew The following formula can be used to determine skew: 3 X X ( ) ( X X) 3 s Measure of Skew If s 3 < 0, then the distribution has a negative skew If s 3 > 0 then the distribution has a positive skew If s 3 0 then the distribution is symmetrical The more different s 3 is from 0, the greater the skew in the distribution 3 Kurtosis (ot Related to Halitosis) Kurtosis measures whether the scores are spread out more or less than they would be in a normal (Gaussian) distribution Mesokurtic (s 3) Leptokurtic (s > 3) Platykurtic (s < 3)

Kurtosis When the distribution is normally distributed, its kurtosis equals 3 and it is said to be mesokurtic When the distribution is less spread out than normal, its kurtosis is greater than 3 and it is said to be leptokurtic When the distribution is more spread out than normal, its kurtosis is less than 3 and it is said to be platykurtic 5 Measure of Kurtosis The measure of kurtosis is given by: s X X ( X X) s, s 3, & s Collectively, the variance (s ), skew (s 3 ), and kurtosis (s ) describe the shape of the distribution 7