The Binomial Distribution

Similar documents
3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

STAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE

Chapter 5. Random variables

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

Lecture 2 Binomial and Poisson Probability Distributions

Probability Generating Functions

Chapter 5. Discrete Probability Distributions

MAS108 Probability I

Normal distribution. ) 2 /2σ. 2π σ

Random variables, probability distributions, binomial random variable

2. Discrete random variables

ECE302 Spring 2006 HW3 Solutions February 2,

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Normal Distribution as an Approximation to the Binomial Distribution

Lecture 7: Continuous Random Variables

6.2. Discrete Probability Distributions

ST 371 (IV): Discrete Random Variables

CHAPTER 7 SECTION 5: RANDOM VARIABLES AND DISCRETE PROBABILITY DISTRIBUTIONS

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Stats on the TI 83 and TI 84 Calculator

WHERE DOES THE 10% CONDITION COME FROM?

Sample Questions for Mastery #5

Binomial Sampling and the Binomial Distribution

Section 6.1 Discrete Random variables Probability Distribution

PROBABILITY AND SAMPLING DISTRIBUTIONS

2 Binomial, Poisson, Normal Distribution

Chapter 4 Lecture Notes

Chapter 15 Binomial Distribution Properties

Lecture 5 : The Poisson Distribution

ECE302 Spring 2006 HW4 Solutions February 6,

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Math 431 An Introduction to Probability. Final Exam Solutions

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

The normal approximation to the binomial

An Introduction to Basic Statistics and Probability

Lecture 10: Depicting Sampling Distributions of a Sample Proportion

5. Continuous Random Variables

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

2WB05 Simulation Lecture 8: Generating random variables

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

4. Continuous Random Variables, the Pareto and Normal Distributions

Important Probability Distributions OPRE 6301

Math 461 Fall 2006 Test 2 Solutions

0 x = 0.30 x = 1.10 x = 3.05 x = 4.15 x = x = 12. f(x) =

Tenth Problem Assignment

Section 5 Part 2. Probability Distributions for Discrete Random Variables

Practice Problems #4

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

6.041/6.431 Spring 2008 Quiz 2 Wednesday, April 16, 7:30-9:30 PM. SOLUTIONS

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

6 PROBABILITY GENERATING FUNCTIONS

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 2 Solutions

Applied Reliability Applied Reliability

The Binomial Probability Distribution

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Chapter 4. Probability Distributions

Negative Binomial. Paul Johnson and Andrea Vieux. June 10, 2013

e.g. arrival of a customer to a service station or breakdown of a component in some system.

STAT x 0 < x < 1

Notes on Continuous Random Variables

Principle of Data Reduction

Statistics 100A Homework 4 Solutions

The normal approximation to the binomial

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference

6.3 Conditional Probability and Independence

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

Practice problems for Homework 11 - Point Estimation

Probability Calculator

AP Statistics 7!3! 6!

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

The Binomial Distribution

Probability density function : An arbitrary continuous random variable X is similarly described by its probability density function f x = f X

Introduction to Probability

Statistics 100A Homework 3 Solutions

DETERMINE whether the conditions for a binomial setting are met. COMPUTE and INTERPRET probabilities involving binomial random variables

A Tutorial on Probability Theory

Lecture 6: Discrete & Continuous Probability and Random Variables

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 13. Random Variables: Distribution and Expectation

MAT 211 Introduction to Business Statistics I Lecture Notes

Unit 4 The Bernoulli and Binomial Distributions

Joint Exam 1/P Sample Exam 1

E3: PROBABILITY AND STATISTICS lecture notes

Business Statistics 41000: Probability 1

Sums of Independent Random Variables

Homework 4 - KEY. Jeff Brenion. June 16, Note: Many problems can be solved in more than one way; we present only a single solution here.

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

WEEK #23: Statistics for Spread; Binomial Distribution

13.0 Central Limit Theorem

Lecture 8. Confidence intervals and the central limit theorem

Some special discrete probability distributions

Lecture Notes 1. Brief Review of Basic Probability

Chapter 5: Normal Probability Distributions - Solutions

Math/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

Sampling Distributions

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Transcription:

The Binomial Distribution Binomial Experiment An experiment with these characteristics: 1 For some predetermined number n, a sequence of n smaller experiments called trials; 2 Each trial has two outcomes which we call success (S) and failure (F ); 3 Trials are independent; 4 The probability of success is the same in each trial; we denote it p = P(S). 1 / 22 Discrete Random Variables Binomial Distribution

Examples of Binomial Experiments Toss a coin or a thumbtack a fixed number of times; S could be heads or lands point up. Sample 40 people from a large population; S could be born in January. Sampling Without Replacement A bowl contains 5 red M&Ms and 5 green M&Ms. Choose one at random: P(red) = 1/2. Without replacing it, choose another: { 4 if first was red 9 P(red) = 5 if first was green. 9 Not independent, so not a binomial experiment. 2 / 22 Discrete Random Variables Binomial Distribution

Binomial Random Variable X = number of successes in a binomial experiment Binomial Distribution Each outcome of a binomial experiment can be written as a string of n letters, each S or F. The sample space S is the set of all such strings. The event X = x is the subset of strings with exactly x Ss, and therefore (n x) F s. Each such string has probability p x (1 p) n x, and there are ( n x) of them, so P(X = x) = ( n x) p x (1 p) n x. 3 / 22 Discrete Random Variables Binomial Distribution

Shorthand: if X is a binomial random variable with n trials and success probability p, we write X B(n, p). The range of X is {0, 1,..., n}. We have shown that the pmf of X, denoted b(x; n, p), is ( ) n b(x; n, p) = P(X = x) = p x (1 p) n x, x = 0, 1,..., n. x In R n <- 10 p <- 0.25 probs <- dbinom(0:n, n, p) plot(0:n, probs, type = "h") points(0:n, probs) 4 / 22 Discrete Random Variables Binomial Distribution

Expected Value and Variance of X If X Bin(n, p) and q = 1 p, then E(X ) = np, V (X ) = np(1 p) = npq, and σ X = npq. Note that if n = 1, X is a Bernoulli random variable; in that case, we showed earlier that E(X ) = p and V (X ) = pq. 5 / 22 Discrete Random Variables Binomial Distribution

Hypergeometric and Negative Binomial Distributions Hypergeometric Distribution As noted earlier, sampling without replacement from a finite population is not a binomial experiment. A random sample of size n is drawn without replacement from a finite population of N items, of which M are labeled Success and the remaining N M are labeled Failure. 6 / 22 Discrete Random Variables Hypergeometric and Negative Binomial

Then X, the number of successes in the sample, has the hypergeometric distribution with pmf ( M )( N M ) x n x h(x; n, M, N) = ( N n) for max(0, n N + M) x min(n, M). Binomial Approximation If N is much larger than n, sampling with and without replacement are roughly the same, and the hypergeometric distribution is then close to the binomial distribution: ( h(x; n, M, N) b x; n, M ) N 7 / 22 Discrete Random Variables Hypergeometric and Negative Binomial

Negative Binomial Distribution As in a binomial experiment, we carry out a sequence of independent trials, but instead of carrying out a fixed number of trials, we wait until we see a fixed number r of successes. Then X, the number of failures that occurred before stopping, has the negative binomial distribution with pmf ( ) x + r 1 b neg (x; r, p) = p r (1 p) x, x = 0, 1, 2,... r 1 Generalized Negative Binomial Distribution b neg (x; r, p) can also be defined for non-integer r. 8 / 22 Discrete Random Variables Hypergeometric and Negative Binomial

The Poisson Distribution The binomial distribution, and its special case the Bernoulli distribution, are two of the most important discrete distributions. The Poisson distribution is arguably as important. Poisson distribution The random variable X has the Poisson distribution with parameter µ > 0 if its pmf is p(x; µ) = e µ µx, x = 0, 1, 2,... x! 9 / 22 Discrete Random Variables Poisson Distribution

An Approximation The binomial distribution B(n, p), if n is large and p is small, is approximately the same as the Poisson distribution with parameter µ = np. To be precise, if n and p 0 in such a way that np µ > 0, then b(x; n, p) p(x; µ). 10 / 22 Discrete Random Variables Poisson Distribution

Devore s Rule of Thumb The approximation can safely be used when n > 50 and np < 5. What does this mean? max b(x; 50, 0.1) p(x, 5) =.0095 <.01 0 x 50 Perhaps that the difference is always less than.01? This table shows the smallest n that achieves this for various values of µ = np. µ = np 1 2 3 4 5 6 7 8 9 10 Smallest n 19 29 36 42 48 53 58 62 66 70 11 / 22 Discrete Random Variables Poisson Distribution

But for large µ, all the individual probabilities are small. Perhaps we should look at the cdfs: max F b(x; 50, 0.1) F p (x, 5) =.0147 <.015 0 x 50 Perhaps the rule of thumb is that the difference in the cdfs is always less than.015? This table shows the smallest n that achieves this for various values of µ = np. µ = np 1 3 5 7 10 15 20 25 30 40 Smallest n 13 31 50 67 95 140 183 228 271 360 12 / 22 Discrete Random Variables Poisson Distribution

Mean and Variance The Poisson distribution has the interesting property that both its mean and variance are equal to the parameter µ: If X has the Poisson distribution with parameter µ, then For the binomial distribution, E(X ) = V (X ) = µ. V (X ) < E(X ), and for the negative binomial distribution, V (X ) > E(X ). 13 / 22 Discrete Random Variables Poisson Distribution

Poisson Process The Poisson distribution is associated with the Poisson Process, which is a model for a sequence of events that occur at random, without memory. If the events occur in a Poisson Process with rate α (events per unit time), then the number of events in any time interval of length t has the Poisson distribution with parameter αt. The numbers of events in disjoint time intervals are statistically independent. 14 / 22 Discrete Random Variables Poisson Distribution

Bus Arrivals Suppose you arrive at a bus stop, waiting for a bus that runs every 20 minutes. If you know nothing about the schedule, the chance that a bus arrives in the next minute is 1 minute/20 minute 1 =.05. Suppose you are told that the last bus arrived 19 minutes ago. If the buses run on time, the chance that a bus arrives in the next minute is now 1. But if the schedule slips, it s less than certain. If the buses are delayed badly, the information about the last bus is irrelevant, and the buses arrive in a Poisson Process with rate 1/20 = 0.05 buses per minute. 15 / 22 Discrete Random Variables Poisson Distribution

Radioactive Decay A Geiger counter detects the emission of nuclear radiation, for instance from a sample of radioactive material. Emissions are generally assumed to follow a Poisson Process. 16 / 22 Discrete Random Variables Poisson Distribution

Probability Generating Function The probability generating function (pgf) is a useful tool for studying discrete probability distributions. If X is a discrete random variable taking non-negative integer values, its pgf is G(z) = E ( z ) X = p(x)z x. x=0 17 / 22 Discrete Random Variables Probability Generating Function

Properties of the pgf Set z = 1: G(1) = E ( 1 X ) = p(x) = 1. x=0 Differentiate and set z = 1: G (z) = p(x)xz x 1, x=0 so G (1) = p(x)x = µ X. x=0 18 / 22 Discrete Random Variables Probability Generating Function

Binomial Distribution Suppose that X B(n, p); that is, X is the number of successes in n independent trials, with probability p of success in each trial. For i = 1, 2,..., n, write { 1 if trial i is a success X i = 0 if it is a failure. Then X = n X i. i=1 19 / 22 Discrete Random Variables Probability Generating Function

So ) G(z) = E ( z X ) = E ( z X i ) = E ( n i=1 z X i Because the trials are independent, ( n ) E z X i = i=1 n E ( ) z X i. i=1 Because each X i is Bernoulli, E ( z X i ) = z 0 (1 p) + z 1 p = 1 p + pz = q + pz, and so G(z) = (q + pz) n. 20 / 22 Discrete Random Variables Probability Generating Function

By the binomial theorem, so (q + pz) n = n x=0 b(x; n, p) = coefficient of z x = ( ) n (pz) x q n x, x ( ) n p x q n x, 0 x n. x 21 / 22 Discrete Random Variables Probability Generating Function

Poisson Approximation Suppose that n is large, and p = µ/n for some µ > 0. Then G(z) = (1 µ n + µ ) n n z ( ) n µ(z 1) = 1 + n Now e µ(z 1). e µ(z 1) = e µ e µz = e µ x=0 so µ µx b(x; n, p) e x!, the pmf of the Poisson distribution. (µz) x 22 / 22 Discrete Random Variables Probability Generating Function x!