16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION



Similar documents
Normal distribution. ) 2 /2σ. 2π σ

Math 461 Fall 2006 Test 2 Solutions

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

Normal Distribution as an Approximation to the Binomial Distribution

The Binomial Probability Distribution

The normal approximation to the binomial

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

39.2. The Normal Approximation to the Binomial Distribution. Introduction. Prerequisites. Learning Outcomes

Chapter 5: Normal Probability Distributions - Solutions

You flip a fair coin four times, what is the probability that you obtain three heads.

Sample Term Test 2A. 1. A variable X has a distribution which is described by the density curve shown below:

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Notes on Continuous Random Variables

39.2. The Normal Approximation to the Binomial Distribution. Introduction. Prerequisites. Learning Outcomes

8. THE NORMAL DISTRIBUTION

Sample Questions for Mastery #5

The Normal Distribution

4. Continuous Random Variables, the Pareto and Normal Distributions

Descriptive Statistics

5. Continuous Random Variables

Math 151. Rumbos Spring Solutions to Assignment #22

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

3.4 The Normal Distribution

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

b) All outcomes are equally likely with probability = 1/6. The probabilities do add up to 1, as they must.

Section 6.1 Discrete Random variables Probability Distribution

An Introduction to Basic Statistics and Probability

22. HYPOTHESIS TESTING

Chapter 5: Discrete Probability Distributions

Lecture 5 : The Poisson Distribution

Normal Approximation. Contents. 1 Normal Approximation. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

TEACHER NOTES MATH NSPIRED

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

The normal approximation to the binomial

Stats on the TI 83 and TI 84 Calculator

Characteristics of Binomial Distributions

Week 3&4: Z tables and the Sampling Distribution of X

Lecture 6: Discrete & Continuous Probability and Random Variables

Descriptive Statistics

Chapter 5. Random variables

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Binomial random variables (Review)

STAT 200 QUIZ 2 Solutions Section 6380 Fall 2013

2 Binomial, Poisson, Normal Distribution

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Practice Problems #4

Chapter 4. Probability Distributions

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Binomial Probability Distribution

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Probability Distributions

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment Statistics and Probability Example Part 1

2WB05 Simulation Lecture 8: Generating random variables

WEEK #22: PDFs and CDFs, Measures of Center and Spread

The Standard Normal distribution

The Math. P (x) = 5! = = 120.

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Binomial random variables

Chapter 3 RANDOM VARIATE GENERATION

Chapter 5. Discrete Probability Distributions

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Chapter 4 Lecture Notes

Joint Exam 1/P Sample Exam 1

Math Quizzes Winter 2009

Introduction to the Practice of Statistics Sixth Edition Moore, McCabe Section 5.1 Homework Answers

ST 371 (IV): Discrete Random Variables

Important Probability Distributions OPRE 6301

Section 5 Part 2. Probability Distributions for Discrete Random Variables

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

ECE302 Spring 2006 HW4 Solutions February 6,

Math 431 An Introduction to Probability. Final Exam Solutions

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Point and Interval Estimates

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Sampling Distributions

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Chapter 5 - Practice Problems 1

Lab 11. Simulations. The Concept

Continuous Random Variables

EMPIRICAL FREQUENCY DISTRIBUTION

AMS 5 CHANCE VARIABILITY

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Mind on Statistics. Chapter 8

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS

Lecture 10: Depicting Sampling Distributions of a Sample Proportion

Probability Distributions

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

Chapter 4. Probability and Probability Distributions

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 18. A Brief Introduction to Continuous Probability

Transcription:

6. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION It is sometimes difficult to directly compute probabilities for a binomial (n, p) random variable, X. We need a different table for each value of n, p. If we don't have a table, direct calculations can get cumbersome very quickly. Eg: Compute P(X 00) for n = 50, p = 0.35. For normal random variables, on the other hand, probability calculations are extremely easy; just one table is required. Fortunately, we can approximate the binomial distribution by a normal distribution, with an appropriate choice of µ and σ. To get a feel for why this might work, let's study the Quincunx. The Quincunx is a device invented by Sir Francis Galton in the 800 s which shows empirically that binomial random variables, observed repeatedly, reveal a histogram which looks bell-shaped, as long as the number of trials is not too small. See Quincunx website at: http://www.rand.org/methodology/stat/applets/clt.html

In general, the distribution of a binomial random variable may be accurately approximated by that of a normal random variable, as long as np 5, nq 5, and assuming that a continuity correction is made to account for the fact that we are using a continuous distribution (the normal) to approximate a discrete one (the binomial). For approximating the distribution of X, we will use the normal distribution with mean µ = np, variance σ = npq, where q = p. Why are these reasonable choices of µ, σ? To study the quality of this approximation, visit the Normal Approximation to the Binomial website at: http://www.stat.sc.edu/~west/applets/binomialdemo.html This draws a bar chart of the binomial distribution for a given n, p, and superimposes the approximating normal distribution. Note how skewness increases as p moves away from 0.5. See histograms of number of dark M&Ms and orange M&Ms from M&M Lab. (Separate handout).

If p(x) is the binomial distribution and f (x) is the density of the normal, the approximation is: p( a) b a + a p( x) x= a f ( x) dx b+ a f ( x) dx Thus, the binomial probability p(a) is approximately equal to the probability that a normal RV with mean np and variance npq lies between x = a / and x = a + /. Also, P(a X b) is approximately equal to the area under the normal curve between x = a / and x = b + /. The continuity correction is the use of a /, b + / in the normal approximation. This ensures that probabilities are always approximated by areas under the normal curve. It can dramatically improve the quality of the approximation, even when n is large, so it should be used whenever possible.

In the diagram above, the bars represent the binomial distribution with n = 0, p = 0.5. The superimposed curve is a normal density f(x). The mean of the normal is µ = np = 5, and the standard deviation is σ= 0(.)(.) 05 05 = 58. Suppose we wish to find p(4), the probability that the binomial equals 4. From Table of Appendix B, we get p(4) = 0.3770 0.79 = 0.05. This is the exact probability, but we won t always be so lucky as to have a binomial table for the given n and p. So let s try the normal approximation. Using the normal approximation, we need to calculate the probability that our normal is between 3.5 and 4.5. The corresponding z-scores are (3.5 5)/.58 = 0.95 and (4.5 5)/.58 = 0.3. Thus, the normal approximation to p(4) is Pr(0.3 < Std Normal < 0.95) = 0.389 0.55 = 0.034. This is quite close to the actual value, p(4) = 0.05. If we hadn t used the continuity correction, our approximation to p(4) would be zero, that is, the area under the normal curve between 4 and 4. This would be a very poor approximation indeed!

As the diagram shows, the area under the normal density between 3.5 and 4.5 provides a reasonable approximation to the height of the bar, p(4). This should make it clear why the continuity correction is helpful. Eg: Tomorrow morning s Iberia flight to Madrid can seat 370 passengers. From past experience, Iberia knows that the probability is 0.90 that a given ticket-holder will show up for the flight. They have sold 400 tickets, deliberately overbooking the flight. How confident can Iberia be that no passenger will need to be bumped (denied boarding)? Solution: We will assume that the number (X) of passengers showing up for the flight has a binomial distribution with mean µ µ = (400)(0.9) = 360 and standard deviation (Is this reasonable?) σ= 400(.)(.) 0 9 0 = 6 We want Pr[X 370]. We approximate this by the probability that our normal RV is less than 370.5. This is the probability that a standard normal is less than z = (370.5 360)/6 =.75. So the probability that nobody gets bumped is approximately 0.5 + 0.4599 = 0.9599. (Almost 96%).

Eg: What is the probability that you will win at least $0 after playing 00 games of craps for $ per game? Solution: To win at least $0, you must win at least 55 games. The number (X) of games you will win has a binomial distribution with n = 00, p =.493. Therefore, X has mean µ = 49.3 and standard deviation σ = 00 (.493)(.507) = 5.00. We want Pr[X 55] = Pr[Std. Normal > (54.5 49.3)/5] = Pr[Std. Normal >.04] =.5.3508 =.49. (Just a 5% chance!) [Forecasting Lab Results]