Hypergeometric Distribution

Similar documents
3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

WHERE DOES THE 10% CONDITION COME FROM?

Random variables, probability distributions, binomial random variable

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

ST 371 (IV): Discrete Random Variables

Chapter 5. Random variables

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Chapter 4 Lecture Notes

You flip a fair coin four times, what is the probability that you obtain three heads.

Section 6.1 Discrete Random variables Probability Distribution

STAT x 0 < x < 1

STAT 35A HW2 Solutions

Normal distribution. ) 2 /2σ. 2π σ

6.4 Normal Distribution

Lecture 6: Discrete & Continuous Probability and Random Variables

ECON1003: Analysis of Economic Data Fall 2003 Answers to Quiz #2 11:40a.m. 12:25p.m. (45 minutes) Tuesday, October 28, 2003

Solutions for Review Problems for Exam 2 Math You roll two fair dice. (a) Draw a tree diagram for this experiment.

2WB05 Simulation Lecture 8: Generating random variables

The Binomial Probability Distribution

4. Continuous Random Variables, the Pareto and Normal Distributions

Section 5-3 Binomial Probability Distributions

An Introduction to Basic Statistics and Probability

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Practice Problems #4

STAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE

Chapter 5. Discrete Probability Distributions

Chapter 15 Binomial Distribution Properties

5. Continuous Random Variables

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

Without data, all you are is just another person with an opinion.

Math/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Principle of Data Reduction

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

Characteristics of Binomial Distributions

Normal Distribution as an Approximation to the Binomial Distribution

Sample Questions for Mastery #5

Chapter 6: Point Estimation. Fall Probability & Statistics

PROBABILITY AND SAMPLING DISTRIBUTIONS

Applied Reliability Applied Reliability

Some special discrete probability distributions

Statistics 100A Homework 4 Solutions

Statistics 100A Homework 3 Solutions

6.3 Conditional Probability and Independence

Math 461 Fall 2006 Test 2 Solutions

Probability density function : An arbitrary continuous random variable X is similarly described by its probability density function f x = f X

Math 151. Rumbos Spring Solutions to Assignment #22

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

Review for Test 2. Chapters 4, 5 and 6

Lecture 5 : The Poisson Distribution

Homework 4 - KEY. Jeff Brenion. June 16, Note: Many problems can be solved in more than one way; we present only a single solution here.

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 2 Solutions

ECE302 Spring 2006 HW4 Solutions February 6,

Binomial random variables (Review)

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

Math 3C Homework 3 Solutions

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Joint Exam 1/P Sample Exam 1

Section 6.2 Definition of Probability

Probabilistic Strategies: Solutions

0 x = 0.30 x = 1.10 x = 3.05 x = 4.15 x = x = 12. f(x) =

Statistics 100A Homework 4 Solutions

ACMS Section 02 Elements of Statistics October 28, Midterm Examination II

AP Statistics 7!3! 6!

MAT 211 Introduction to Business Statistics I Lecture Notes

The Binomial Distribution

4/1/2017. PS. Sequences and Series FROM 9.2 AND 9.3 IN THE BOOK AS WELL AS FROM OTHER SOURCES. TODAY IS NATIONAL MANATEE APPRECIATION DAY

DETERMINE whether the conditions for a binomial setting are met. COMPUTE and INTERPRET probabilities involving binomial random variables

TEACHER NOTES MATH NSPIRED

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

Chapter 7 - Roots, Radicals, and Complex Numbers

ACMS Section 02 Elements of Statistics October 28, 2010 Midterm Examination II Answers

Basic Probability Concepts

Tenth Problem Assignment

Probability Distributions

Practice problems for Homework 11 - Point Estimation

Probability Generating Functions

E3: PROBABILITY AND STATISTICS lecture notes

Random Variables. Chapter 2. Random Variables 1

Binomial Sampling and the Binomial Distribution

AP STATISTICS 2010 SCORING GUIDELINES

2 Binomial, Poisson, Normal Distribution

Chapter What is the probability that a card chosen from an ordinary deck of 52 cards is an ace? Ans: 4/52.

Section 6-5 Sample Spaces and Probability

Notes on Continuous Random Variables

Important Probability Distributions OPRE 6301

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Math Quizzes Winter 2009

Statistics 104: Section 6!

Worksheet for Teaching Module Probability (Lesson 1)

ECE302 Spring 2006 HW3 Solutions February 2,

Math 115 Spring 2011 Written Homework 5 Solutions

STAT 319 Probability and Statistics For Engineers PROBABILITY. Engineering College, Hail University, Saudi Arabia

Distinguishing Between Binomial, Hypergeometric and Negative Binomial Distributions

Math 431 An Introduction to Probability. Final Exam Solutions

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

Transcription:

Assume we are drawing cards from a deck of well-shulffed cards with replacement, one card per each draw. We do this 5 times and record whether the outcome is or not. Then this is a binomial experiment. If we do the same thing without replacement, then it is NO LONGER a binomial experiment. However, if we are drawing from 100 decks of cards without replacement and record only the first 5 outcomes, then it is approximately a binomial experiment. What is the exact model for drawing cards without replacement? Liang Zhang (UofU) Applied Statistics I June 23, 2008 1 / 13

1. The population or set to be sampled consists of N individuals, objects, or elements (a finite population). 2. Each individual can be characterized as a success (S) or a failure (F), and there are M successes in the population. 3. A sample of n individuals is selected without replacement in such a way that each subset of size n is equally likely to be chosen. Definition For any experiment which satisfies the above 3 conditions, let X = the number of S s in the sample. Then X is a hypergeometric random variable and we use h(x; n, M, N) to denote the pmf p(x) = P(X = x). Liang Zhang (UofU) Applied Statistics I June 23, 2008 2 / 13

Examples: In the second cards drawing example (without replacement and totally 52 cards), if we let X = the number of s in the first 5 draws, then X is a hypergeometric random variable with n = 5, M = 13 and N = 52. For the pmf, the probability for getting exactly x (x = 0, 1, 2, 3, 4, or 5) s is calculated as following: ( 13 ) ( x 39 ) 5 x p(x) = P(X = x) = ( 52 ) 5 where ( ) ( 13 x is the number of choices for getting x s, 39 5 x) is the number of choices for getting the remaining 5 x non- cards and ( ) 52 5 is the total number of choices for selecting 5 cards from 52 cards. Liang Zhang (UofU) Applied Statistics I June 23, 2008 3 / 13

Examples: For the same experiment (without replacement and totally 52 cards), if we let X = the number of s in the first 20 draws, then X is still a hypergeometric random variable, but with n = 20, M = 13 and N = 52. However, in this case, all the possible values for X is 0, 1, 2,..., 13 and the pmf is where 0 x 13. p(x) = P(X = x) = ( 13 ) ( x 39 ) ( 52 20 20 x ) Liang Zhang (UofU) Applied Statistics I June 23, 2008 4 / 13

Proposition If X is the number of S s in a completely random sample of size n drawn from a population consisting of M S s and (N M) F s, then the probability distribution of X, called the hypergeometric distribution, is given by ( M ) ( x N M ) n x P(X = x) = h(x; n, M, N) = ( N n) for x an integer satisfying max(0, n N + M) x min(n, M). Remark: If n < M, then the largest x is n. However, if n > M, then the largest x is M. Therefore we require x min(n, M). Similarly, if n < N M, then the smallest x is 0. However, if n > N M, then the smallest x is n (N M). Thus x min(0, n N + M). Liang Zhang (UofU) Applied Statistics I June 23, 2008 5 / 13

Example: (Problem 70) An instructor who taught two sections of engineering statistics last term, the first with 20 students and the second with 30, decided to assign a term project. After all projects had been turned in, the instructor randomly ordered them before grading. Consider the first 15 graded projects. a. What is the probability that exactly 10 of these are from the second section? b. What is the probability that at least 10 of these are from the second section? c. What is the probability that at least 10 of these are from the same section? Liang Zhang (UofU) Applied Statistics I June 23, 2008 6 / 13

Proposition The mean and variance of the hypergeometric rv X having pmf h(x; n, M, N) are E(X ) = n M ( ) N n V (X ) = n M ( N N 1 N 1 M ) N Remark: The ratio M N is the proportion of S s in the ( population. ) If we replace M N by p, then we get E(X ) = np and V (X ) = N n N 1 np(1 p). Recall the mean and variance for a binomial rv is np and np(1 p). We see that the mean for binomial and hypergeometric rv s are equal, while the variances differ by the factor (N n)/(n 1). Liang Zhang (UofU) Applied Statistics I June 23, 2008 7 / 13

Example (Problem 70) continued: An instructor who taught two sections of engineering statistics last term, the first with 20 students and the second with 30, decided to assign a term project. After all projects had been turned in, the instructor randomly ordered them before grading. Consider the first 15 graded projects. d. What are the mean value and standard deviation of the number of projects among these 15 that are from the second section? e. What are the mean value and standard deviation of the number of projects not among these 15 that are from the second section? Liang Zhang (UofU) Applied Statistics I June 23, 2008 8 / 13

Negative Binomial Distribution Consider the card drawing example again. This time, we still draw cards from a deck of well-shulffed cards with replacement, one card per each draw. However, we keep drawing until we get 5 s. Let X = the number of draws which do not give us a, then X is NO LONGER a binomial random variable, but a negative binomial random variable. Liang Zhang (UofU) Applied Statistics I June 23, 2008 9 / 13

Negative Binomial Distribution 1. The experiment consists of a sequence of independent trials. 2. Each trial can result in either s success (S) or a failure (F). 3. The probability of success is constant from trial to trial, so P(S on trial i) = p for i = 1, 2, 3,.... 4. The experiment continues (trials are performed) until a total of r successes have been observed, where r is a specified positive integer. Definition For any experiment which satisfies the above 4 conditions, let X = the number of failures that precede thr r th success. Then X is a negative binomial random variable and we use nb(x; r, p) to denote the pmf p(x) = P(X = x). Liang Zhang (UofU) Applied Statistics I June 23, 2008 10 / 13

Negative Binomial Distribution Remark: 1. In some sources, the negative binomial rv is taken to be the number of trials X + r rather than the number of failures. 2. If r = 1, we call X a geometric random variable. The pmf for X is then the familiar one nb(x; 1, p) = (1 p) x p x = 0, 1, 2,... Liang Zhang (UofU) Applied Statistics I June 23, 2008 11 / 13

Negative Binomial Distribution Proposition The pmf of the negative binomial rv X with parameters r = number of S s and p = P(S) is ( ) x + r 1 nb(x; r, p) = p r (1 p) x r 1 Then mean and variance for X are E(X ) = r(1 p) p and V (X ) = r(1 p) p 2, respectively Liang Zhang (UofU) Applied Statistics I June 23, 2008 12 / 13

Negative Binomial Distribution Example: (Problem 78) Individual A has a red die and B has a green die (both fair). If they each roll until they obtain five doubles (1 1, 2 2,..., 6 6), what is the pmf of X = the total number of times a die is rolled? What are E(X ) and V (X )? Liang Zhang (UofU) Applied Statistics I June 23, 2008 13 / 13