+ Statistical Methods in

Transcription:

+ Statistical Methods in Practice
STA/MTH 3379

Dr. A. B. W. Manage
Associate Professor of Statistics
Department of Mathematics & Statistics
Sam Houston State University

Discovering Statistics, 2nd Edition, Daniel T. Larose
Chapter 7: Sampling Distributions
Lecture PowerPoint Slides

+ Chapter 7 Overview

7.1 Introduction to Sampling Distributions
7.2 Central Limit Theorem for Means
7.3 Central Limit Theorem for Proportions

+ The Big Picture

Where we are coming from and where we are headed

In Chapters 1–4, we learned ways to describe data sets using numbers, tables, and graphs.

In Chapters 5–6, we learned the tools of probability and probability distributions that allow us to quantify uncertainty.

In Chapter 7, we will discover that seemingly random statistics have predictable behaviors. The special type of distribution we use to describe these behaviors is called the sampling distribution. We will also learn about the most important result in statistical inference, the Central Limit Theorem.

The sampling distributions we learn in this chapter form the basis for the statistical inference we will perform in the rest of the book.

+ 7.1: Introduction to Sampling Distributions

Objectives:
- Explain the sampling distribution of the sample mean.
- Describe the sampling distribution of the sample mean when the population is normal.
- Find probabilities and percentiles for the sample mean when the population is normal.

Sample Mean

In this chapter, we will develop methods that will allow us to quantify the behavior of statistics like the sample mean.

The sampling distribution of the sample mean $\bar{x}$ for a given sample size $n$ consists of the collection of the means of all possible samples of size $n$ from the population.

Example 7.1

For a population of $N = 5$ times (10, 20, 5, 30, and 15 minutes), the population mean is

$$\mu = \frac{\sum x}{N} = \frac{10 + 20 + 5 + 30 + 15}{5} = 16 \text{ minutes}.$$

If we calculate the mean time for every possible sample of three individuals, we get the sampling distribution of the sample mean. The first such sample, for instance, gives

$$\bar{x}_1 = \frac{10 + 20 + 5}{3} \approx 11.67 \text{ minutes}.$$

Sample Mean

When working with sampling distributions, it is important to know the mean and standard deviation.

The mean of the sampling distribution of the sample mean is the value of the population mean $\mu$. That is, $\mu_{\bar{x}} = \mu$.

The standard deviation of the sampling distribution of the sample mean is called the standard error of the mean. It is equal to $\sigma_{\bar{x}} = \sigma/\sqrt{n}$, where $\sigma$ is the population standard deviation and $n$ is the sample size.

Note: because the denominator of the standard error formula is $\sqrt{n}$, the larger the sample size, the tighter the resulting sampling distribution. Larger sample sizes lead to smaller variability, which results in more precise estimation.

Example

According to CanEquity Mortgage Company, the mean age of mortgage applicants in the City of Toronto is 37 years old. Assume that the standard deviation is 6 years. Find the mean and standard deviation for the sampling distribution of the sample mean for the following sample sizes: (a) 4, (b) 100, (c) 225.

In every case the mean is $\mu_{\bar{x}} = \mu = 37$. The standard errors are:

(a) $n = 4$. Then $\sigma_{\bar{x}} = 6/\sqrt{4} = 3$.
(b) $n = 100$. Then $\sigma_{\bar{x}} = 6/\sqrt{100} = 0.6$.
(c) $n = 225$. Then $\sigma_{\bar{x}} = 6/\sqrt{225} = 0.4$.
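The two computations in Section 7.1 can be checked with a short Python sketch. It assumes the five population values in Example 7.1 are 10, 20, 5, 30, and 15 minutes. The names `population`, `sample_means`, and `standard_error` are illustrative and do not come from the slides.

```python
from itertools import combinations
from math import sqrt
from statistics import mean

# Assumed population from Example 7.1 (times in minutes).
population = [10, 20, 5, 30, 15]
mu = mean(population)

# Sampling distribution of the sample mean for n = 3:
# the mean of every possible sample of three individuals.
sample_means = [mean(sample) for sample in combinations(population, 3)]
mean_of_sample_means = mean(sample_means)

print(f"population mean: {mu}")
print(f"number of samples of size 3: {len(sample_means)}")
print(f"mean of all sample means: {mean_of_sample_means:.2f}")


def standard_error(sigma, n):
    """Standard error of the mean, sigma / sqrt(n)."""
    return sigma / sqrt(n)


# Mortgage-applicant example: sigma = 6 years, three sample sizes.
for n in (4, 100, 225):
    print(f"n = {n:3d}: standard error = {standard_error(6, n):.1f}")
```

The mean of the ten sample means equals the population mean of 16 minutes, which illustrates $\mu_{\bar{x}} = \mu$. The loop reproduces the standard errors 3, 0.6, and 0.4.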

Sample Mean for a Normal Population

An important fact should be noted about sample means that are collected from a normal population.

For a normal population, the sampling distribution of the sample mean is distributed as normal $(\mu, \sigma/\sqrt{n})$, where $\mu$ is the population mean and $\sigma$ is the population standard deviation.

Probabilities and Percentiles Using a Sampling Distribution

Since we know the sampling distribution of the sample mean is normal when the population is normally distributed, we can use the techniques of Section 6.5 to answer questions about the means of samples taken from normal populations.

When the sampling distribution of the sample mean is normal, we may standardize to produce the standard normal random variable:

$$Z = \frac{\bar{x} - \mu_{\bar{x}}}{\sigma_{\bar{x}}} = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}}$$

Example

Suppose the quiz scores for a certain instructor are normal (70, 10).
- Find the probability that a randomly chosen student's score will be above 80.
- Find the probability that a sample of 25 quiz scores will have a mean score greater than 80.

Example

Suppose the quiz scores for a certain instructor are normal (70, 10). What two symmetric values contain the middle 90% of all sample means between them? Assume a class size of 25.

The middle 90% will fall between the 5th percentile and the 95th percentile. These percentiles correspond to $Z = -1.645$ and $Z = 1.645$. With $\sigma_{\bar{x}} = 10/\sqrt{25} = 2$:

$$70 - 1.645(2) = 66.71$$
$$70 + 1.645(2) = 73.29$$

+ 7.2: Central Limit Theorem for Means

Objectives:
- Use normal probability plots to assess normality.
- Describe the sampling distribution of sample means for skewed and symmetric populations as the sample size increases.
- Apply the Central Limit Theorem for Means to solve probability questions about the sample mean.
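The quiz-score questions can be worked numerically with Python's standard-library `statistics.NormalDist`. This is a minimal sketch that assumes "normal (70, 10)" means a mean of 70 and a standard deviation of 10. The variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 70, 10, 25

# Distribution of a single score and of the mean of n = 25 scores.
single_score = NormalDist(mu, sigma)
sample_mean = NormalDist(mu, sigma / sqrt(n))  # standard error = 10 / 5 = 2

# P(X > 80) for one randomly chosen student.
p_single = 1 - single_score.cdf(80)

# P(x-bar > 80) for a sample of 25 scores.
p_mean = 1 - sample_mean.cdf(80)

# Symmetric bounds containing the middle 90% of sample means.
low = sample_mean.inv_cdf(0.05)
high = sample_mean.inv_cdf(0.95)

print(f"P(single score > 80) = {p_single:.4f}")
print(f"P(sample mean > 80)  = {p_mean:.2e}")
print(f"middle 90% of sample means: {low:.2f} to {high:.2f}")
```

A single score of 80 is one standard deviation above the mean, but a sample mean of 80 is five standard errors above it, so the second probability is far smaller than the first. The bounds agree with the values 66.71 and 73.29 computed by hand.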

Normal Probability Plots

Much of our analysis requires that the sample data come from a population that is normally distributed. We can use histograms, dotplots, and stem-and-leaf displays to assess normality. But a more precise tool is the normal probability plot of the estimated cumulative normal probabilities against the corresponding data values.

If the points in the normal probability plot either cluster around a straight line or nearly all fall within the curved bounds, then it is likely that the data set is normal. Systematic deviations off the straight line are evidence against the claim that the data set is normal.

Sampling Distribution of x-bar for Skewed Populations

The sampling distribution of sample means for a normal population is also normal. What if the population is not normal?

Regardless of the population, the sampling distribution of the sample mean becomes approximately normal as the sample size gets larger.

Central Limit Theorem for Means

Given a population with mean $\mu$ and standard deviation $\sigma$, the sampling distribution of the sample mean becomes approximately normal $(\mu, \sigma/\sqrt{n})$ as the sample size gets larger, regardless of the shape of the population.

Rule of Thumb: We consider $n \geq 30$ as large enough to apply the Central Limit Theorem for Means for any population.

If the Population is Normal
The sampling distribution of sample means is normal.

If the Population is Non-Normal or Unknown and the Sample Size is At Least 30
The sampling distribution of the sample mean is approximately normal.

If the Population is Non-Normal or Unknown and the Sample Size is Less Than 30
We have insufficient information to conclude that the sampling distribution of the sample mean is either normal or approximately normal.
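A small simulation can illustrate the Central Limit Theorem for Means. The sketch below is not from the slides. It assumes an exponential population with rate 1 (mean 1, standard deviation 1, strongly right-skewed) as a stand-in for "a skewed population", and it uses a hand-written `skewness` helper to measure asymmetry.

```python
import random
from math import sqrt
from statistics import mean, pstdev

random.seed(0)  # fixed seed so the run is repeatable


def simulate_sample_means(n, reps=5000):
    """Draw `reps` samples of size n from an exponential(1) population
    and return the list of their sample means."""
    return [sum(random.expovariate(1.0) for _ in range(n)) / n
            for _ in range(reps)]


def skewness(values):
    """Population skewness: average cubed standardized deviation."""
    m, s = mean(values), pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)


means_small = simulate_sample_means(2)
means_large = simulate_sample_means(30)

print(f"n =  2: skewness of sample means = {skewness(means_small):.2f}")
print(f"n = 30: skewness of sample means = {skewness(means_large):.2f}")
print(f"n = 30: mean of sample means     = {mean(means_large):.3f}")
print(f"n = 30: sd of sample means       = {pstdev(means_large):.3f}")
print(f"theoretical standard error       = {1 / sqrt(30):.3f}")
```

In such a run, the sample means for n = 30 should be much less skewed than those for n = 2, and their standard deviation should sit close to $\sigma/\sqrt{n} = 1/\sqrt{30}$.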

+ 7.3: Central Limit Theorem for Proportions

Objectives:
- Explain the sampling distribution of the sample proportion.
- Apply the Central Limit Theorem for Proportions to solve probability questions about the sample proportion.

Sample Proportion

The sample mean is not the only statistic that can have a sampling distribution. Every statistic has a sampling distribution. One of the most important is the sampling distribution of the sample proportion.

Suppose each individual in a population either has or does not have a particular characteristic. If we take a sample of size $n$ from the population, the sample proportion $\hat{p}$ (read "p-hat") is

$$\hat{p} = \frac{X}{n}$$

where $X$ represents the number of individuals in the sample that have the particular characteristic.

The sampling distribution of the sample proportion $\hat{p}$ for a given sample size $n$ consists of the collection of the sample proportions of all possible samples of size $n$ from the population.

Sample Proportion

The mean of the sampling distribution of the sample proportion is the value of the population proportion $p$. This may be denoted as $\mu_{\hat{p}} = p$.

The standard deviation of the sampling distribution of the sample proportion is called the standard error of the proportion and is found by

$$\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}}$$

where $p$ is the population proportion and $n$ is the sample size.

The sampling distribution of the sample proportion may be considered approximately normal only if both $np \geq 5$ and $n(1 - p) \geq 5$.

The minimum sample size required to produce approximate normality is the larger of either $n_1 = 5/p$ or $n_2 = 5/(1 - p)$.

Example

The National Institutes of Health reported that color blindness linked to the X chromosome afflicts 8% of men. Suppose we take a random sample of 100 men and let $p$ denote the proportion of men in the population who have color blindness linked to the X chromosome. Find $\mu_{\hat{p}}$ and $\sigma_{\hat{p}}$.

$$\mu_{\hat{p}} = p = 0.08$$

$$\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}} = \sqrt{\frac{0.08(1 - 0.08)}{100}} = \sqrt{0.000736} \approx 0.02713$$
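The formulas for the sample proportion can be collected into a few helper functions. This is a sketch using the color-blindness figures (p = 0.08, n = 100). The function names are illustrative, and rounding the minimum sample size up to a whole number is an assumption, since the slides give only the ratios 5/p and 5/(1 − p).

```python
from math import ceil, sqrt


def proportion_mean_and_se(p, n):
    """Mean and standard error of the sampling distribution of p-hat."""
    return p, sqrt(p * (1 - p) / n)


def clt_conditions_met(p, n):
    """True when both np >= 5 and n(1 - p) >= 5."""
    return n * p >= 5 and n * (1 - p) >= 5


def minimum_sample_size(p):
    """Larger of 5/p and 5/(1 - p), rounded up to a whole number
    (the rounding is an assumption, not stated in the slides)."""
    return ceil(max(5 / p, 5 / (1 - p)))


mean_phat, se_phat = proportion_mean_and_se(0.08, 100)
print(f"mean of p-hat: {mean_phat}")
print(f"standard error of p-hat: {se_phat:.5f}")
print(f"normal approximation allowed: {clt_conditions_met(0.08, 100)}")
print(f"minimum sample size for p = 0.08: {minimum_sample_size(0.08)}")
```

For p = 0.08 the binding condition is np ≥ 5, so the minimum sample size comes from 5/p = 62.5, which rounds up to 63.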

Applying the Central Limit Theorem for Proportions

Central Limit Theorem for Proportions

The sampling distribution of the sample proportion $\hat{p}$ follows an approximately normal distribution with mean $\mu_{\hat{p}} = p$ and standard deviation

$$\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}}$$

when both $np \geq 5$ and $n(1 - p) \geq 5$.

When the sampling distribution of the sample proportion is approximately normal, we can standardize to produce the standard normal $Z$:

$$Z = \frac{\hat{p} - \mu_{\hat{p}}}{\sigma_{\hat{p}}} = \frac{\hat{p} - p}{\sqrt{\dfrac{p(1 - p)}{n}}}$$

Example

The Texas Workforce Commission reported that the state unemployment rate in March 2007 was 4.3%. Let $p = 0.043$ represent the population proportion of unemployed workers in Texas. Find the probability that a sample of 117 Texas workers will have a proportion unemployed greater than 9%.

Since $117(0.043) > 5$ and $117(0.957) > 5$, we can apply the Central Limit Theorem for Proportions.

$$Z = \frac{0.09 - 0.043}{\sqrt{\dfrac{0.043(1 - 0.043)}{117}}} \approx 2.51$$

$$P(Z > 2.51) = 1 - 0.9940 = 0.0060$$

Example

The Texas Workforce Commission reported that the state unemployment rate in March 2007 was 4.3%. Let $p = 0.043$ represent the population proportion of unemployed workers in Texas. Find the 99th percentile of sample proportions for $n = 117$.

The Z-value associated with 0.9901 is 2.33.

$$\hat{p} = 2.33(0.01875) + 0.043 \approx 0.0867$$

+ Chapter 7 Overview

7.1 Introduction to Sampling Distributions
7.2 Central Limit Theorem for Means
7.3 Central Limit Theorem for Proportions
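Both Texas unemployment calculations can be reproduced with `statistics.NormalDist` from the Python standard library. This is a minimal sketch with illustrative variable names. It uses exact normal values rather than a rounded Z table, so the results differ slightly in the later decimal places from the hand calculations, which round Z to 2.51 and 2.33.

```python
from math import sqrt
from statistics import NormalDist

p, n = 0.043, 117
standard_normal = NormalDist()  # mean 0, standard deviation 1

# Conditions for the Central Limit Theorem for Proportions.
assert n * p >= 5 and n * (1 - p) >= 5

# Standard error of the sample proportion.
se = sqrt(p * (1 - p) / n)

# P(p-hat > 0.09): standardize, then take the upper tail.
z = (0.09 - p) / se
upper_tail = 1 - standard_normal.cdf(z)

# 99th percentile of sample proportions: p + z_0.99 * standard error.
z_99 = standard_normal.inv_cdf(0.99)
percentile_99 = p + z_99 * se

print(f"standard error = {se:.5f}")
print(f"Z = {z:.2f}, P(p-hat > 0.09) = {upper_tail:.4f}")
print(f"z_0.99 = {z_99:.2f}, 99th percentile = {percentile_99:.4f}")
```

The Z-score rounds to 2.51 and the 99th percentile is about 0.087, in line with the table-based answers.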