Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions



Similar documents
5.1 Identifying the Target Parameter

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Fairfield Public Schools

MTH 140 Statistics Videos

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Characteristics of Binomial Distributions

Lecture 5 : The Poisson Distribution

Recall this chart that showed how most of our course would be organized:

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

Problem of the Month: Fair Games

Measures of Central Tendency and Variability: Summarizing your Data for Others

How To Write A Data Analysis

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

13.0 Central Limit Theorem

The Math. P (x) = 5! = = 120.

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

z-scores AND THE NORMAL CURVE MODEL

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Chapter 4. Probability and Probability Distributions

Lecture 8. Confidence intervals and the central limit theorem

WHERE DOES THE 10% CONDITION COME FROM?

Pr(X = x) = f(x) = λe λx

Means, standard deviations and. and standard errors

The Binomial Probability Distribution

Introduction to Regression and Data Analysis

WEEK #23: Statistics for Spread; Binomial Distribution

Parametric and Nonparametric: Demystifying the Terms

Week 1. Exploratory Data Analysis

South Carolina College- and Career-Ready (SCCCR) Probability and Statistics

UNIT 1: COLLECTING DATA

CS 147: Computer Systems Performance Analysis

Week 3&4: Z tables and the Sampling Distribution of X

Chapter 5. Discrete Probability Distributions

Study Guide for the Final Exam

4. Continuous Random Variables, the Pareto and Normal Distributions

CALCULATIONS & STATISTICS

Two-sample inference: Continuous data

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Lesson 17: Margin of Error When Estimating a Population Proportion

In the past, the increase in the price of gasoline could be attributed to major national or global

How To Run Statistical Tests in Excel

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Simple Regression Theory II 2010 Samuel L. Baker

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

A Hands-On Exercise Improves Understanding of the Standard Error. of the Mean. Robert S. Ryan. Kutztown University

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

3. Data Analysis, Statistics, and Probability

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

The Assumption(s) of Normality

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Big Ideas in Mathematics

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

1 Prior Probability and Posterior Probability

Simple Linear Regression

8. THE NORMAL DISTRIBUTION

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Conditional Probability, Independence and Bayes Theorem Class 3, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Foundations of Statistics Frequentist and Bayesian

John Kerrich s coin-tossing Experiment. Law of Averages - pg. 294 Moore s Text

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Descriptive Statistics

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

UNIVERSITY OF NAIROBI

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

International College of Economics and Finance Syllabus Probability Theory and Introductory Statistics

6 3 The Standard Normal Distribution

MBA 611 STATISTICS AND QUANTITATIVE METHODS

AMS 5 CHANCE VARIABILITY

Problem of the Month Through the Grapevine

ST 371 (IV): Discrete Random Variables

TEACHER NOTES MATH NSPIRED

Summarizing and Displaying Categorical Data

Chapter 13 Introduction to Linear Regression and Correlation Analysis

6.3 Conditional Probability and Independence

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

COMMON CORE STATE STANDARDS FOR

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.

Experimental Designs (revisited)

Chapter 8: Quantitative Sampling

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Lecture 1: Review and Exploratory Data Analysis (EDA)

Revised Version of Chapter 23. We learned long ago how to solve linear congruences. ax c (mod m)

Hypothesis Testing for Beginners

Descriptive Statistics

Mathematical Finance

STAT 360 Probability and Statistics. Fall 2012

Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

Principle of Data Reduction

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

Binomial Sampling and the Binomial Distribution

Transcription:

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule Cengage Learning Elementary Statistics: Looking at the Big Picture 1

Looking Back: Review 4 Stages of Statistics Data Production (discussed in Lectures 1-4) Displaying and Summarizing (Lectures 5-12) Probability Finding Probabilities (discussed in Lectures 13-14) Random Variables (discussed in Lectures 15-18) Sampling Distributions Proportions Means Statistical Inference Elementary Statistics: Looking at the Big Picture L19.2

Typical Inference Problem If sample of 100 students has 0.13 left-handed, can you believe population proportion is 0.10? Solution Method: Assume (temporarily) that population proportion is 0.10, find probability of sample proportion as high as 0.13. If it s too improbable, we won t believe population proportion is 0.10. Elementary Statistics: Looking at the Big Picture L19.3

Key to Solving Inference Problems For a given population proportion p and sample size n, need to find probability of sample proportion in a certain range: Need to know sampling distribution of. Note: can denote a single statistic or a random variable. Elementary Statistics: Looking at the Big Picture L19.4

Definition Sampling distribution of sample statistic tells probability distribution of values taken by the statistic in repeated random samples of a given size. Looking Back: We summarize a probability distribution by reporting its center, spread, shape. Elementary Statistics: Looking at the Big Picture L19.5

Behavior of Sample Proportion (Review) For random sample of size n from population with p in category of interest, sample proportion has mean p standard deviation shape approximately normal for large enough n Looking Back: Can find normal probabilities using 68-95-99.7 Rule, etc. Elementary Statistics: Looking at the Big Picture L19.6

Rules of Thumb (Review) Population at least 10 times sample size n (formula for standard deviation of approximately correct even if sampled without replacement) np and n(1-p) both at least 10 (guarantees approximately normal) Elementary Statistics: Looking at the Big Picture L19.7

Understanding Dist. of Sample Proportion 3 Approaches: 1. Intuition 2. Hands-on Experimentation 3. Theoretical Results Looking Ahead: We ll find that our intuition is consistent with experimental results, and both are confirmed by mathematical theory. Elementary Statistics: Looking at the Big Picture L19.8

Example: Shape of Underlying Distribution (n=1) Background: Population proportion of blue M&M s is p=1/6=0.17. Question: How does the probability histogram for sample proportions appear for samples of size 1? Response: 5/6 1/6 0 1 Looking Ahead: The shape of the underlying distribution will play a role in the shape of for various sample sizes. for n=1 Elementary Statistics: Looking at the Big Picture L19.10

Example: Sample Proportion as Random Variable Background: Population proportion of blue M&Ms is 0.17. Questions: Is the underlying variable categorical or quantitative? Consider the behavior of sample proportion for repeated random samples of a given size. What type of variable is sample proportion? What 3 aspects of the distribution of sample proportion should we report to summarize its behavior? Responses: Underlying variable Summarize with,, Elementary Statistics: Looking at the Big Picture L19.12

Example: Center, Spread of Sample Proportion Background: Population proportion of blue M&M s is p=1/6=0.17. Question: What can we say about center and spread of for repeated random samples of size n = 25 (a teaspoon)? Response: Center: Some s more than, others less; should balance out so mean of s is p =. Spread of s: s.d. depends on. For n=6, could easily get anywhere from to. For n=25, spread of will be than it is for n = 6. Elementary Statistics: Looking at the Big Picture L19.14

Example: Intuit Shape of Sample Proportion Background: Population proportion of blue M&M s is p=1/6=0.17. Question: What can we say about the shape of for repeated random samples of size n = 25 (a teaspoon)? Response: close to most common, far from in either direction increasingly less likely Elementary Statistics: Looking at the Big Picture L19.16

Example: Sample Proportion for Larger n Background: Population proportion of blue M&M s is p=1/6=0.17. Question: What can we say about center, spread, shape of for repeated random samples of size n = 75 (a Tablespoon)? Response: Center: mean of s should be p = (for any n). Spread of s: compared to n=25, spread for n=75 is Shape: s clumped near 0.17, taper at tails Looking Ahead: Sample size does not affect center but plays an important role in spread and shape of the distribution of sample proportion (also of sample mean). Elementary Statistics: Looking at the Big Picture L19.18

Understanding Sample Proportion 3 Approaches: 1. Intuition 2. Hands-on Experimentation 3. Theoretical Results Looking Ahead: We ll find that our intuition is consistent with experimental results, and both are confirmed by mathematical theory. Elementary Statistics: Looking at the Big Picture L19.19

Central Limit Theorem Approximate normality of sample statistic for repeated random samples of a large enough size is cornerstone of inference theory. Makes intuitive sense. Can be verified with experimentation. Proof requires higher-level mathematics; result called Central Limit Theorem. Elementary Statistics: Looking at the Big Picture L19.20

Center of Sample Proportion (Implications) For random sample of size n from population with p in category of interest, sample proportion has mean p is unbiased estimator of p (sample must be random) Elementary Statistics: Looking at the Big Picture L19.21

Spread of Sample Proportion (Implications) For random sample of size n from population with p in category of interest, sample proportion has mean p standard deviation has less spread for larger samples (population size must be at least 10n) n in denominator Elementary Statistics: Looking at the Big Picture L19.22

Shape of Sample Proportion (Implications) For random sample of size n from population with p in category of interest, sample proportion has mean p standard deviation shape approx. normal for large enough n can find probability that sample proportion takes value in given interval Elementary Statistics: Looking at the Big Picture L19.23

Example: Behavior of Sample Proportion Background: Population proportion of blue M&M s is p=0.17. Question: For repeated random samples of n=25, how does behave? Response: For n=25, has Center: mean Spread: standard deviation Shape: not really normal because Elementary Statistics: Looking at the Big Picture L19.25

Example: Sample Proportion for Larger n Background: Population proportion of blue M&M s is p=0.17. Question: For repeated random samples of n=75, how does behave? Response: For n=75, has Center: mean Spread: standard deviation Shape: approximately normal because Elementary Statistics: Looking at the Big Picture L19.27

68-95-99.7 Rule for Normal R.V. (Review) Sample at random from normal population; for sampled value X (a R.V.), probability is 68% that X is within 1 standard deviation of mean 95% that X is within 2 standard deviations of mean 99.7% that X is within 3 standard deviations of mean Elementary Statistics: Looking at the Big Picture L19.28

68-95-99.7 Rule for Sample Proportion For sample proportions taken at random from a large population with underlying p, probability is 68% that is within 1 of p 95% that is within 2 of p 99.7% that is within 3 of p Elementary Statistics: Looking at the Big Picture L19.29

Example: Sample Proportion for n=75, p=0.17 Background: Population proportion of blue M&Ms is p=0.17. For random samples of n=75, approx. normal with mean 0.17, s.d. Question: What does 68-95-99.7 Rule tell us about behavior of? Response: The probability is approximately 0.68 that is within of : in (0.13, 0.21) 0.95 that is within of : in (0.08, 0.26) 0.997 that is within of : in (0.04, 0.30) Looking Back: We don t use the Rule for n=25 because Elementary Statistics: Looking at the Big Picture L19.31

90-95-98-99 Rule (Review) For standard normal Z, the probability is 0.90 that Z takes a value in interval (-1.645, +1.645) 0.95 that Z takes a value in interval (-1.960, +1.960) 0.98 that Z takes a value in interval (-2.326, +2.326) 0.99 that Z takes a value in interval (-2.576, +2.576) Elementary Statistics: Looking at the Big Picture L19.32

Example: Sample Proportion for n=75, p=0.17 Background: Population proportion of blue M&Ms is p=0.17. For random samples of n=75, approx. normal with mean 0.17, s.d. Question: What does 90-95-98-99 Rule tell about behavior of? Response: The probability is approximately 0.90 that is within (0.043) of 0.17: in (0.10,0.24) 0.95 that is within (0.043) of 0.17: in (0.09,0.25) 0.98 that is within (0.043) of 0.17: in (0.07,0.27) 0.99 that is within (0.043) of 0.17: in (0.06,0.28) Elementary Statistics: Looking at the Big Picture L19.34

Typical Inference Problem (Review) If sample of 100 students has 0.13 left-handed, can you believe population proportion is 0.10? Solution Method: Assume (temporarily) that population proportion is 0.10, find probability of sample proportion as high as 0.13. If it s too improbable, we won t believe population proportion is 0.10. Elementary Statistics: Looking at the Big Picture L19.35

Example: Testing Assumption About p Background: We asked, If sample of 100 students has 0.13 left-handed, can you believe population proportion is 0.10? Questions: What are the mean, standard deviation, and shape of? Is 0.13 improbably high under the circumstances? Can we believe p = 0.10? Response: For p=0.10 and n=100, has mean, s.d. ; shape approx. normal since. According to Rule, the probability is that would take a value of 0.13 (1 s.d. above mean) or more. Since this isn t so improbable,. Elementary Statistics: Looking at the Big Picture L19.37

Lecture Summary (Distribution of Sample Proportion) Typical inference problem Sampling distribution; definition 3 approaches to understanding sampling dist. Intuition Hands-on experiment Theory Center, spread, shape of sampling distribution Central Limit Theorem Role of sample size Applying 68-95-99.7 Rule Elementary Statistics: Looking at the Big Picture L19.38