1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers



Similar documents
Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

3: Summary Statistics

Exercise 1.12 (Pg )

Exploratory data analysis (Chapter 2) Fall 2011

a. mean b. interquartile range c. range d. median

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Module 4: Data Exploration

Variables. Exploratory Data Analysis

Chapter 1: Exploring Data

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Topic 9 ~ Measures of Spread

Diagrams and Graphs of Statistical Data

Walk the Line Written by: Maryann Huey Drake University

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

First Midterm Exam (MATH1070 Spring 2012)

AP * Statistics Review. Descriptive Statistics

Box-and-Whisker Plots

Using SPSS, Chapter 2: Descriptive Statistics

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Exploratory Data Analysis

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

How To Write A Data Analysis

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

+ Chapter 1 Exploring Data

Lecture 2. Summarizing the Sample

Shape of Data Distributions

Geostatistics Exploratory Analysis

Summarizing and Displaying Categorical Data

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Exploratory Data Analysis. Psychology 3256

MEASURES OF VARIATION

Data Exploration Data Visualization

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

MTH 140 Statistics Videos

Chapter 7 Section 1 Homework Set A

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

STAT355 - Probability & Statistics

determining relationships among the explanatory variables, and

Northumberland Knowledge

Week 1. Exploratory Data Analysis

AP Statistics Solutions to Packet 1

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Ch. 3.1 # 3, 4, 7, 30, 31, 32

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Basics of Statistics

This curriculum is part of the Educational Program of Studies of the Rahway Public Schools. ACKNOWLEDGMENTS

Mind on Statistics. Chapter 2


RECOMMENDED COURSE(S): Algebra I or II, Integrated Math I, II, or III, Statistics/Probability; Introduction to Health Science

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

AP Statistics: Syllabus 1

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

430 Statistics and Financial Mathematics for Business

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

Measures of Central Tendency and Variability: Summarizing your Data for Others

2 Describing, Exploring, and

Bellwork Students will review their study guide for their test. Box-and-Whisker Plots will be discussed after the test.

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Algebra 1 Course Information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

3.2 Measures of Spread

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study

Descriptive Statistics

Descriptive Statistics

Describing, Exploring, and Comparing Data

Means, standard deviations and. and standard errors

Section 1.3 Exercises (Solutions)

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS

Characteristics of Binomial Distributions

Mean = (sum of the values / the number of the value) if probabilities are equal

Description. Textbook. Grading. Objective

Algebra I Vocabulary Cards

9.1 Measures of Center and Spread

UNIT 1: COLLECTING DATA

Foundation of Quantitative Data Analysis

AP Statistics Solutions to Packet 2

3. There are three senior citizens in a room, ages 68, 70, and 72. If a seventy-year-old person enters the room, the

Descriptive Statistics: Summary Statistics

Descriptive Statistics and Exploratory Data Analysis

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014

Algebra Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

3 Describing Distributions

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

MBA 611 STATISTICS AND QUANTITATIVE METHODS

2-7 Exploratory Data Analysis (EDA)

7 CONTINUOUS PROBABILITY DISTRIBUTIONS

Practice#1(chapter1,2) Name

Chapter 3. The Normal Distribution

Transcription:

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots Describing Quantitative Data with Numbers

1.3 I can n Calculate and interpret measures of center (mean, median) in context. n Calculate and interpret measures of spread (IQR, range, standard deviation) in context. n Identify outliers using the 1.5 IQR rule. n Make a boxplot. n Selecta ppropriate measures of center and spread. n Use appropriate graphs and numerical summaries to compare distributions of quantitative variables.

Example: Amt of fat in McDonald s beef sandwiches 9, 12, 19, 19, 23, 24, 26, 26, 28, 29, 39, 39, 40, 42

Calculations n Measures of Center Mean Median n Measures of Spread Range IQR Standard deviation & variance (later) n Outliers

Measuring Center n Mean average value balance point n Median typical value midpoint n www.whfreeman.com/tps4e Mean and Median Applet n When are the mean and median close to the same?

Skewed Distributions n In a right skewed distribution, where is the mean compared to the median? n In a left skewed distribution, where is the mean compared to the median?

Resistance n Which is more resistant to extreme values, the mean or median?

Check Your Understanding n Pg. 55

Center isn t enough n The correct mean concentration of hair dye isn t good enough if some boxes are extremely weak and others are extremely strong. n The correct mean weight of a football isn t good enough if some are extremely light and some are extremely heavy.

Measuring Spread n Can two dotplots with the same center and same shape, have different spreads? Explain.

Range and Interquartile Range n Range Hi-Low n Interquartile Range IQR = Q3 - Q1 n See pg. 56 n Which is a more resistant measure of spread, range or IQR? Explain.

Outliers n 1.5 x IQR n If an observation falls more than 1.5 x IQR above the third quartile or below the first quartile, it can be considered an outlier. n Outliers may reveal interesting information or they may reveal errors. n In the NASA Nimbus-7 data story, outliers were not the error, they were the message!

Mean or Median? Range or IQR? n The mean is sensitive to a few extreme values while the median is not. n A statistician could have his head in an oven and his feet in ice, and he will say that on average he feels fine. n The range is sensitive to extreme values while the IQR is not.

Boxplots n Make a boxplot without and with a calculator for the McDonald s beef sandwiches data on amount of fat. n Assess the center, spread, symmetry, and skewness from the boxplot. n Boxplots are a way to visualize the 5-number summary. n Boxplots do not show the mode like other graphs.

5-number Summary & Boxplot n Minimum n Q1 n Median n Q3 n Maximum

Comparing Distributions n Boxplots show less detail than histograms or stemplots, so they are best used for side-by-side comparison of more than one distribution.

Comparing Quiz Grades n Class A 10, 5, 6, 5, 6, 7, 8, 5, 6, 2 n Class B 2, 10, 10, 4, 2, 5, 1, 10, 9, 7

Comparing Quiz Grades n Mean n Median n Range n IQR n Outliers? Class A Class B

Comparing Quiz Grades Side-by-Side Boxplots Make boxplots by hand and/or on the calculator. Which class did better?

One more measure Exploring Data- Shape, Outliers, Center, Spread n Graphically Histogram, pie chart, dotplot, stemplot, back to back stemplot, boxplot n Numerically Measures of center mean, mode, median Measures of Spread or Variability range, IQR standard deviation & variance

Standard Deviation n Standard deviation measures spread by looking at how far the observations are from the mean. n Be able to interpret the standard deviation It is roughly the average distance each data value is from the mean of the distribution.

Standard Deviation Formula The standard deviation is the square root of the average squared difference from the mean.

Variance n Variance is the standard deviation squared. n Standard deviation is the square root of the variance.

Find & interpret the standard deviation for Class A and for Class B

Standard deviation with a calculator n Understanding what the standard deviation means is more important than being able to calculate it by hand. n Use s if data is from a sample, and use if data consists of an entire population.

Questions about standard deviation n Standard deviation is a measure of spread about the mean as center, so -When is the standard deviation 0? -What makes the standard deviation larger?

What to use and when Mean with standard deviation (not resistant) Median with range & IQR (resistant) Which is best used to describe symmetric distributions without outliers (such as the Normal distribution)? skewed distributions? *Numerical summaries do not fully describe the shape of a distribution. Always plot your data!

Assignment n Read pg. 50-69 n Do pg. 70 (79, 81, 83, 89, 91, 95, 97, 103, 105, 107-110)