13.0 Central Limit Theorem



Similar documents
AMS 5 CHANCE VARIABILITY

Elementary Statistics and Inference. Elementary Statistics and Inference. 16 The Law of Averages (cont.) 22S:025 or 7P:025.

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.

Chapter 16. Law of averages. Chance. Example 1: rolling two dice Sum of draws. Setting up a. Example 2: American roulette. Summary.

The Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?

We rst consider the game from the player's point of view: Suppose you have picked a number and placed your bet. The probability of winning is

Chapter 16: law of averages

Section 7C: The Law of Large Numbers

The Math. P (x) = 5! = = 120.

Betting systems: how not to lose your money gambling

The overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS NUMBER OF TOSSES

MA 1125 Lecture 14 - Expected Values. Friday, February 28, Objectives: Introduce expected values.

Stat 20: Intro to Probability and Statistics

John Kerrich s coin-tossing Experiment. Law of Averages - pg. 294 Moore s Text

This Method will show you exactly how you can profit from this specific online casino and beat them at their own game.

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

3.2 Roulette and Markov Chains

Probability: The Study of Randomness Randomness and Probability Models. IPS Chapters 4 Sections

Stat 20: Intro to Probability and Statistics

36 Odds, Expected Value, and Conditional Probability

$ ( $1) = 40

Statistics 100A Homework 3 Solutions

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 13. Random Variables: Distribution and Expectation

In the situations that we will encounter, we may generally calculate the probability of an event

Statistics and Random Variables. Math 425 Introduction to Probability Lecture 14. Finite valued Random Variables. Expectation defined

Example. A casino offers the following bets (the fairest bets in the casino!) 1 You get $0 (i.e., you can walk away)

V. RANDOM VARIABLES, PROBABILITY DISTRIBUTIONS, EXPECTED VALUE

Statistics 151 Practice Midterm 1 Mike Kowalski

THE CENTRAL LIMIT THEOREM TORONTO

Section 6.1 Discrete Random variables Probability Distribution

Law of Large Numbers. Alexandra Barbato and Craig O Connell. Honors 391A Mathematical Gems Jenia Tevelev

HONORS STATISTICS. Mrs. Garrett Block 2 & 3

Midterm Exam #1 Instructions:

Second Midterm Exam (MATH1070 Spring 2012)

Probability and Expected Value

Random Variables. Chapter 2. Random Variables 1

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Introductory Probability. MATH 107: Finite Mathematics University of Louisville. March 5, 2014

Solutions: Problems for Chapter 3. Solutions: Problems for Chapter 3

MOVIES, GAMBLING, SECRET CODES, JUST MATRIX MAGIC

MrMajik s Money Management Strategy Copyright MrMajik.com 2003 All rights reserved.

AP Statistics 7!3! 6!

Chapter 4 Lecture Notes

Probability. Sample space: all the possible outcomes of a probability experiment, i.e., the population of outcomes

Session 8 Probability

Chapter 5 Section 2 day f.notebook. November 17, Honors Statistics

Example: Find the expected value of the random variable X. X P(X)

Statistics 100A Homework 4 Solutions

Solution (Done in class)

Unit 19: Probability Models

Betting on Excel to enliven the teaching of probability

Expected Value and the Game of Craps

MONEY MANAGEMENT. Guy Bower delves into a topic every trader should endeavour to master - money management.

Roulette. Math 5 Crew. Department of Mathematics Dartmouth College. Roulette p.1/14

Chapter What is the probability that a card chosen from an ordinary deck of 52 cards is an ace? Ans: 4/52.

Gaming the Law of Large Numbers

7. Normal Distributions

2. How many ways can the letters in PHOENIX be rearranged? 7! = 5,040 ways.

Practical Probability:

How to Beat Online Roulette!

Beating Roulette? An analysis with probability and statistics.

Activities/ Resources for Unit V: Proportions, Ratios, Probability, Mean and Median

WHERE DOES THE 10% CONDITION COME FROM?

Lecture 13. Understanding Probability and Long-Term Expectations

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10

Normal distribution. ) 2 /2σ. 2π σ

High School Statistics and Probability Common Core Sample Test Version 2

Basic Probability. Probability: The part of Mathematics devoted to quantify uncertainty

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

Expected Value. 24 February Expected Value 24 February /19

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

6.042/18.062J Mathematics for Computer Science. Expected Value I

Basic Probability Theory II

Chapter 26: Tests of Significance

Contemporary Mathematics- MAT 130. Probability. a) What is the probability of obtaining a number less than 4?

Contemporary Mathematics Online Math 1030 Sample Exam I Chapters No Time Limit No Scratch Paper Calculator Allowed: Scientific

Homework Assignment #2: Answer Key

Lesson 1. Basics of Probability. Principles of Mathematics 12: Explained! 314

Math Quizzes Winter 2009

Probabilistic Strategies: Solutions

Lecture 8 The Subjective Theory of Betting on Theories

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

Section 6.2 Definition of Probability

WISE Sampling Distribution of the Mean Tutorial

Statistics. Head First. A Brain-Friendly Guide. Dawn Griffiths

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

Ch. 13.3: More about Probability

Week 4: Gambler s ruin and bold play

Worksheet for Teaching Module Probability (Lesson 1)

Exam 3 Review/WIR 9 These problems will be started in class on April 7 and continued on April 8 at the WIR.

AP Stats - Probability Review

Responsible Gambling Education Unit: Mathematics A & B

The Mathematics of Gambling

Recursive Estimation

5 Cumulative Frequency Distributions, Area under the Curve & Probability Basics 5.1 Relative Frequencies, Cumulative Frequencies, and Ogives

Find the indicated probability. 1) If a single fair die is rolled, find the probability of a 4 given that the number rolled is odd.

Lecture 11 Uncertainty

Week 2: Conditional Probability and Bayes formula

Transcription:

13.0 Central Limit Theorem Discuss Midterm/Answer Questions Box Models Expected Value and Standard Error Central Limit Theorem 1

13.1 Box Models A Box Model describes a process in terms of making repeated draws, with replacement, from a box containing numbers. Since draws are made with replacement, the outcomes in a series of draws are independent. The value on the first draw does not affect the value on the second. Box models describe: repeated rolls of a die, repeated tosses of a coin (fair or unfair), drawing a random sample of people and getting their heights or whom they plan to vote for. 2

For a box model, the expected value is the average of the numbers in the box. It is not necessarily one of the numbers in the box For categorical data, such as H or T in coin-tossing, or voting for Bush or Kerry, one converts H and T to say 1 and 0. When sampling from the box, we are averaging zeroes and ones so the result is just a proportion. What about hair colors or eye colors? The Law of Averages says that if one makes many draws from the box and averages the results, that average will converge to the expected value of the box. But the Law of Averages says nothing about the outcome on the next draw. 3

13.2 Expected Value and Standard Error Suppose that a box contains B numbers, X 1,..., X B. Then the expected value of the box is is EV = 1 B X i B and the standard deviation of the box is 1 σ = B [(X 1 EV ) 2 + + (X B EV ) 2 ] ( ) = 1 B Xi 2 EV B 2. i=1 i=1 Arithmetically, this is just the same as calculating the mean and standard deviation for a list of numbers. 4

EV is just a number. It is not random. It is a characteristic of the box. The same is true for σ. The number of elements in the box B has nothing to do with the sample size n that you draw from the box. The mean, X, and standard deviation, sd, for the samples from the box are random. They vary around EV and σ, respectively 5

The standard error for the average of n draws, say X n from a box (with replacement) is: se Xn = σ n. The standard error is the likely size of the difference between the average of n draws from the box and the expected value of the box. Note that as n, the standard error se goes to zero. This is a formal statement of the Law of Averages. It means that the sample average is a good estimate of the average in the box, and the accuracy of the estimate improves as you take more and more draws from the box. 6

The standard error for the sum of n draws from a box (with replacement), n i=1 X i, is: se P n i=1 X i = n σ. Analgously to standard error for averages, the standard error of the sum is the likely size of the difference between the sum of n draws from the box and n times the expected value of the box. Note that as n, this standard error does not go to zero. This means that as the number of draws increases, the likely difference between the sum of the draws and nev gets larger, rather than smaller. Concern about the sum, rather than the average, arises in contexts such as investment or gambling (gambler s ruin), where the total return from multiple trials is important, not the average return. 7

13.3 The Central Limit Theorem The Central Limit Theorem is one of the high-water marks of mathematical thinking. It was worked upon by James Bernoulli, Abraham de Moivre, and Alan Turing. Over the centuries, the theory improved from special cases to a very general rule. Essentially, the Central Limit Theorem allows one to describe how accurately the Law of Averages works. Most people have a good intuitive understanding of the Law of Averages, but in many cases it is important to determine whether a particular size of deviation between the sample mean and the (usually unknown) expected value is probable or improbable. That is, what is the chance that the sample average is more than d away from the true EV? 8

Formally, the Central Limit Theorem for averages says: X EV σ/ n N(0, 1) where X is the average of n draws, EV is the expected value of the box, and σ is the standard deviation of the box. This means that the left-hand side is a random number that is approximately normal with mean zero and standard deviation one. The approximation gets better as n gets larger. Modifications of this formula hold for many other situations, e.g., when there is a little dependence, or when the box changes from draw to draw. 9

A version of the Central Limit holds for sums: Xi nev n σ N(0, 1). Here, X i is just the sum of the n draws from the box. (This should be obvious to everyone.) X EV σ/ n = Xi nev n σ. It is the same random variable! This formula is useful when calculating the chance of winning a given amount of money when gambling, or getting more than a specific score on a test. With these two central limit formulas, one can answer all sorts of practical questions. 10

Problem 1: You want to estimate the average income of people in Durham. You take a random sample of 100 households, and find that X is $42,000 and the sample sd is $5,000. What is the (approximate) probability that the true mean household income in Durham is more than $42,500? What is the box model for this problem? What is the expected value? What is the standard deviation? Note that in order to solve this, we have to assume that mean of the sample is the mean of the box model and that the standard deviation of the sample is equal to the σ of the box. Later in the course we will learn how to correct for the fact that these really are approximations. 11

P[EV > 42, 500] = P[ EV < 42, 500] = P[ X EV < X 42, 500] = P[ X EV σ/ n < X 42, 500 σ/ ] n = P[Z < X 42, 500 σ/ ] n = P[Z < (42, 000 42, 500)/(5000/10)] = P[Z < 1] From the standard normal table, we know this has chance (1/2) (100-68.27) = 15.865%, so the probability of the estimate being too low by $500 is just.15865. 12

Problem 2: You are playing Red and Black in roulette. (A roulette wheel has 38 pockets; 18 are red, 18 are black, and 2 are green the house takes all the money on green). You pick either red or black; if the ball lands in the color you pick, you win a dollar. Otherwise you lose a dollar. Suppose you make 100 plays. What is the chance that you lose $10 or more? What is the box model?. 13

There are 38 objects in the box; 18 are labelled 1 and the 20 are labelled -1. So the expected value of the box is EV = 1 [1 + 1 + + 1 + ( 1) + ( 1) + + ( 1)] 38 = 1 38 [ 2] = 1/19. The standard deviation of the box is σ = 1 ( 38 38 i=1 X 2 i ) EV 2 = 1 ( 1/19) 2 =.998614. 14

The probability of losing more than $10 or more in 100 plays is P[sum < 10] = P[sum nev < 10 nev ] sum nev 10 nev = P[ < ] nσ nσ = P[Z < 10 nev nσ ] = P[Z < [ 10 (100)( 1/19)]/(10.998614)] = P[Z <.47434]. From the standard normal table, the chance of this is about 1/2(100-34.73)%, so the probability is.32635. 15