MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem



Similar documents
3.4 The Normal Distribution

The Normal Distribution

Stats on the TI 83 and TI 84 Calculator

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

4. Continuous Random Variables, the Pareto and Normal Distributions

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Key Concept. Density Curve

Week 3&4: Z tables and the Sampling Distribution of X

Normal distribution. ) 2 /2σ. 2π σ

6.2 Normal distribution. Standard Normal Distribution:

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

AP Statistics Solutions to Packet 2

6 3 The Standard Normal Distribution

You flip a fair coin four times, what is the probability that you obtain three heads.

Chapter 5: Normal Probability Distributions - Solutions

6.4 Normal Distribution

Math 151. Rumbos Spring Solutions to Assignment #22

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Probability Distributions

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 4. Probability and Probability Distributions

WHERE DOES THE 10% CONDITION COME FROM?

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

The Normal Distribution. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

AP STATISTICS (Warm-Up Exercises)

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

3.2 Measures of Spread

Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

The Normal distribution

AMS 5 CHANCE VARIABILITY

Exercise 1.12 (Pg )

Normal Distribution as an Approximation to the Binomial Distribution

The normal approximation to the binomial

z-scores AND THE NORMAL CURVE MODEL

Measures of Central Tendency and Variability: Summarizing your Data for Others

Normal Probability Distribution

Capital Market Theory: An Overview. Return Measures

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

The Binomial Probability Distribution

5.1 Identifying the Target Parameter

The normal approximation to the binomial

Notes on Continuous Random Variables

The Standard Normal distribution

Lecture 5 : The Poisson Distribution

Interpreting Data in Normal Distributions

Chapter 7 - Practice Problems 1

Variables. Exploratory Data Analysis

Ch. 3.1 # 3, 4, 7, 30, 31, 32

Exact Confidence Intervals

Probability Distribution for Discrete Random Variables

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

An Introduction to Basic Statistics and Probability

Sampling Distributions

Continuous Random Variables

Standard Deviation Estimator

Exploratory Data Analysis. Psychology 3256

CALCULATIONS & STATISTICS

Descriptive Statistics

WISE Sampling Distribution of the Mean Tutorial

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment Statistics and Probability Example Part 1

Statistics 104: Section 6!

Lecture 8. Confidence intervals and the central limit theorem

Lecture 6: Discrete & Continuous Probability and Random Variables

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

MEASURES OF VARIATION

9. Sampling Distributions

Lesson 20. Probability and Cumulative Distribution Functions

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Introduction; Descriptive & Univariate Statistics

Lecture 1: Review and Exploratory Data Analysis (EDA)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) B) C) D) 0.

STAT 200 QUIZ 2 Solutions Section 6380 Fall 2013

MATH 140 Lab 4: Probability and the Standard Normal Distribution

TEACHER NOTES MATH NSPIRED

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Descriptive Statistics

Lesson 7 Z-Scores and Probability

Exploratory Data Analysis

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Joint Exam 1/P Sample Exam 1

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

Unbeknownst to us, the entire population consists of 5 cloned sheep with ages 10, 11, 12, 13, 14 months.

Midterm Review Problems

Practice#1(chapter1,2) Name

Means, standard deviations and. and standard errors

Lecture Notes Module 1

7. Normal Distributions

Probability Distributions

Grade Level Year Total Points Core Points % At Standard %

Descriptive Statistics and Measurement Scales

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

MBA 611 STATISTICS AND QUANTITATIVE METHODS

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

Transcription:

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem Tony Pourmohamad Department of Mathematics De Anza College Spring 2015

Objectives By the end of this set of slides, you should be able to: 1 Understand what the central limit theorem is 2 Recognize the central limit theorem problems 3 Apply and interpret the central limit theorem for means 2 / 20

The Central Limit Theorem The Central Limit Theorem (CLT) is one of the most powerful and useful ideas in all of statistics For this class, we will consider two application of the CLT: 1 CLT for means (or averages) of random variables 2 CLT for sums of random variables Let s start with an example, courtesy of Professor Mo Geraghty http://nebula2.deanza.edu:16080/ mo/holistic/clt.swf Try exploring the following website to better understand the CLT http://spark.rstudio.com/minebocek/clt_mean/ 3 / 20

The Central Limit Theorem So what is happening in the CLT video? 10 Samples 100 Samples Frequency 0 1 2 3 4 Frequency 0 10 20 30 2.5 3.0 3.5 4.0 4.5 5.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 1,000 Samples 10,000 Samples Frequency 0 50 150 250 Frequency 0 500 1000 1500 2 3 4 5 2 3 4 5 4 / 20

The Central Limit Theorem -- Basic Idea Imagine there is some population with a mean µ and standard deviation σ We can collect samples of size n where the value of n is "large enough" We can then calculate the mean of each sample If we create a histogram of those means, then the resulting histogram will look close to being a normal distribution It does not matter what the distribution of the original population is, or whether you even know it. The important fact is the the distribution of the sample means tend to follow the normal distribution 5 / 20

The Central Limit Theorem -- More Formally Suppose that we have a large population with with mean µ and standard deviation σ Suppose that we select random samples of size n items this population Each sample taken from the population has its own average X. The sample average for any specific sample may not equal the population average exactly. 6 / 20

The Central Limit Theorem -- More Formally Continued The sample averages X follow a probability distribution of their own The average of the sample averages is the population average: µ x = µ The standard deviation of the sample averages equals the population standard deviation divided by the square root of the sample size σ x = σ n The shape of the distribution of the sample averages X is normally distributed if the sample size is large enough The larger the sample size, the closer the shape of the distribution of sample averages becomes to the normal distribution This is the Central Limit Theorem! 7 / 20

The Central Limit Theorem -- Case 1 IF a random sample of any size n is taken from a population with a normal distribution with mean and standard deviation σ THEN distribution of the sample mean has a normal distribution with: µ x = µ and σ x = σ n and X N(µ x, σ x ) 8 / 20

The Central Limit Theorem -- Case 1 X ~ N(10, 2) µ X ~ N(10, 2 50) µ 9 / 20

The Central Limit Theorem -- Case 2 IF a random sample of sufficiently large size n is taken from a population with ANY distribution with mean µ and standard deviation THEN the distribution of the sample mean has approximately a normal distribution with: µ x = µ and σ x = σ n and X N(µ x, σ x ) 10 / 20

The Central Limit Theorem -- Case 2 X ~ N(10, 2) µ X ~ N(µ, σ n) µ 11 / 20

The Central Limit Theorem -- Recap 3 important results for the distribution of X 1 The mean stays the same µ x = µ 2 The standard deviation gets smaller σ x = σ n 3 If n is sufficiently large, X has a normal distribution where X N(µ x, σ x ) 12 / 20

What is Large n? How large does the sample size n need to be in order to use the Central Limit Theorem? The value of n needed to be a "large enough" sample size depends on the shape of the original distribution of the individuals in the population If the individuals in the original population follow a normal distribution, then the sample averages will have a normal distribution, no matter how small or large the sample size is If the individuals in the original population do not follow a normal distribution, then the sample averages X become more normally distributed as the sample size grows larger. In this case the sample averages X do not follow the same distribution as the original population 13 / 20

What is Large n? Continued The more skewed the original distribution of individual values, the larger the sample size needed If the original distribution is symmetric, the sample size needed can be smaller Many statistics textbooks use the rule of thumb n 30, considering 30 as the minimum sample size to use the Central Limit Theorem. But in reality there is not a universal minimum sample size that works for all distributions; the sample size needed depends on the shape of the original distribution In this class, we will assume the sample size is large enough for the Central Limit Theorem to be used to find probabilities for X 14 / 20

Calculating Probabilities from a Normal Distribution Here is the general procedure to calculate probabilities from the distribution of the sample mean X 1 You are given an interval in terms of x, i.e. 2 Convert to a z score by using P( X < x) z = x µ σ/ n 3 Look up probability in z table that corresponds to z score, i.e. P(Z < z) This is just the same idea we used in Chapter 6! 15 / 20

Examples Look at Handout #5 on the website 16 / 20

Percentile Calculations Based on the Normal Distribution Here is the general procedure to calculate the value x that corresponds to the P th percentile 1 You are given a probability or percentile desired 2 Look up the z score in table that corresponds to the probability 3 Convert to x by the following formula: x = µ + z ( σ Examples: Look at Handout #5 on the website n ) 17 / 20

Using Your Calculator If you have a graphing calculator, your calculator can calculate all of these probabilities without using a z table If you want to calculate P(a < X < b) follow these steps: 1 Push 2nd, then DISTR 2 Select normalcdf() and then push ENTER 3 Then enter the following: normalcdf(a, b, µ, σ/ n) Question: If X N(0, 1), what is the probability P( 1 < X < 1)? Solution: normalcdf( 1, 1, 0, 1) = 0.6827 68% Question: If X N(10, 2), what is the probability P(7 < X < 9)? Solution: normalcdf(7, 9, 10, 2) = 0.2417 18 / 20

Using Your Calculator If you want to calculate P( X < a) follow these steps: 1 Push 2nd, then DISTR 2 Select normalcdf() and then push ENTER 3 Then enter the following: normalcdf( 10 99, a, µ, σ/ n) Question: If X N(10, 2), what is the probability P( X < 8)? Solution: normalcdf( 10 99, 8, 10, 2) = 0.158656 If you want to calculate P( X > a) follow these steps: 1 Push 2nd, then DISTR 2 Select normalcdf() and then push ENTER 3 Then enter the following: normalcdf(a, 10 99, µ, σ/ n) Question: If X N(10, 2), what is the probability P( X > 9)? Solution: normalcdf(9, 10 99, 10, 2) = 0.691462 19 / 20

Using Your Calculator If you want to calculate the value of X that gives you the P th percentile then follow these steps: 1 Push 2nd, then DISTR 2 Select invnorm() and then push ENTER 3 Then enter the following: invnorm(percentile,µ, σ) Question: If X N(10, 2), what value of X gives us the 25 th percentile? Solution: normalcdf(.25, 10, 2) = 8.65102 Recall: We used the formula x = µ + zσ/ n, so We got -0.67 from the z table x = 10 + ( 0.67)(2) = 8.66 20 / 20