Hypergeometric Distributions



Similar documents
Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Determining the sample size

I. Chi-squared Distributions

Hypothesis testing. Null and alternative hypotheses

Math C067 Sampling Distributions

Soving Recurrence Relations

CHAPTER 3 THE TIME VALUE OF MONEY

CS103X: Discrete Structures Homework 4 Solutions

Confidence Intervals for One Mean

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

Modified Line Search Method for Global Optimization

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

1 Computing the Standard Deviation of Sample Means

Measures of Spread and Boxplots Discrete Math, Section 9.4

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

ODBC. Getting Started With Sage Timberline Office ODBC

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

INVESTMENT PERFORMANCE COUNCIL (IPC)

3 Basic Definitions of Probability Theory

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Elementary Theory of Russian Roulette

Engineering Data Management

Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

Chapter 7: Confidence Interval and Sample Size

5: Introduction to Estimation

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return

How To Solve The Homewor Problem Beautifully

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Quadrat Sampling in Population Ecology

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

5.3. Generalized Permutations and Combinations

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized?

Confidence Intervals

1. C. The formula for the confidence interval for a population mean is: x t, which was

G r a d e. 2 M a t h e M a t i c s. statistics and Probability

PSYCHOLOGICAL STATISTICS

2-3 The Remainder and Factor Theorems

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Simple Annuities Present Value.

The Stable Marriage Problem

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

Chapter 7 Methods of Finding Estimators

DAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

Your organization has a Class B IP address of Before you implement subnetting, the Network ID and Host ID are divided as follows:

Lesson 17 Pearson s Correlation Coefficient

Department of Computer Science, University of Otago

A Mathematical Perspective on Gambling

Building Blocks Problem Related to Harmonic Series

Properties of MLE: consistency, asymptotic normality. Fisher information.

Lesson 15 ANOVA (analysis of variance)

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

Desktop Management. Desktop Management Tools

MARTINGALES AND A BASIC APPLICATION

A probabilistic proof of a binomial identity

PERMUTATIONS AND COMBINATIONS

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

Maximum Likelihood Estimators.

Practice Problems for Test 3

Overview of some probability distributions.

Baan Service Master Data Management

Incremental calculation of weighted mean and variance


BINOMIAL EXPANSIONS In this section. Some Examples. Obtaining the Coefficients

NATIONAL SENIOR CERTIFICATE GRADE 12

Present Value Factor To bring one dollar in the future back to present, one uses the Present Value Factor (PVF): Concept 9: Present Value

FM4 CREDIT AND BORROWING

7.1 Finding Rational Solutions of Polynomial Equations

One-sample test of proportions

Predictive Modeling Data. in the ACT Electronic Student Record

Now here is the important step

CHAPTER 3 DIGITAL CODING OF SIGNALS

PENSION ANNUITY. Policy Conditions Document reference: PPAS1(7) This is an important document. Please keep it in a safe place.

5 Boolean Decision Trees (February 11)

BaanERP. BaanERP Windows Client Installation Guide

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE The absolute value of the complex number z a bi is

Time Value of Money. First some technical stuff. HP10B II users

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

Systems Design Project: Indoor Location of Wireless Devices

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

Solving Logarithms and Exponential Equations

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology

Basic Elements of Arithmetic Sequences and Series

% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 99.9%

FOUNDATIONS OF MATHEMATICS AND PRE-CALCULUS GRADE 10

1 Correlation and Regression Analysis

Listing terms of a finite sequence List all of the terms of each finite sequence. a) a n n 2 for 1 n 5 1 b) a n for 1 n 4 n 2

Domain 1: Designing a SQL Server Instance and a Database Solution

Professional Networking

Transcription:

7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you deal cards from a stadard deck, there ca be o repetitios. I such situatios, each selectio reduces the umber of items that could be selected i the ext trial. Thus, the probabilities i these trials are depedet. Ofte we eed to calculate the probability of a specific umber of successes i a give umber of depedet trials. INVESTIGATE & INQUIRE: Choosig a Jury I Otario, a citize ca be called for jury duty every three years. Although most juries have 12 members, those for civil trials i Otario usually require oly 6 members. Suppose a civil-court jury is beig selected from a pool of 18 citizes, 8 of whom are me. Develop a simulatio to determie the probability distributio for the umber of wome selected for this jury. 1. Select a radom-umber geerator to simulate the selectio process. 2. Decide how to simplify the selectio process. Decide, also, whether the full situatio eeds to be simulated or whether a proportio of the trials would be sufficiet. 3. Desig each trial so that it simulates the actual situatio. Esure that each trial is depedet by settig the radom-umber geerator so that there are o repetitios withi each series of trials. 4. Set up a method to record the umber of successes i each experimet. Pool your results with those of other studets i your class, if ecessary. 5. Use the results to estimate the probabilities of x successes (wome) i r trials (selectios of a juror). 6. Reflect o the results. Do they accurately represet the probability of x wome beig selected? 7. Compare your simulatio ad its results with those of your classmates. Which are the better simulatios? Explai why. Data i Actio The cost of ruig the crimial, civil, ad family courts i Otario was about $310 millio for 2001. These courts have the equivalet of 3300 full-time employees. 7.4 Hypergeometric Distributios MHR 397

The simulatio i the ivestigatio models a hypergeometric distributio. Such distributios ivolve a series of depedet trials, each with success or failure as the oly possible outcomes. The probability of success chages as each trial is made. The radom variable is the umber of successful trials i a experimet. Calculatios of probabilities i a hypergeometric distributio geerally require formulas usig combiatios. Example 1 Jury Selectio a) Determie the probability distributio for the umber of wome o a civilcourt jury selected from a pool of 8 me ad 10 wome. b) What is the expected umber of wome o the jury? Solutio 1 Usig Pecil ad Paper a) The selectio process ivolves depedet evets sice each perso who is already chose for the jury caot be selected agai. The total umber of ways the 6 jurors ca be selected from the pool of 18 is (S ) = 18 C 6 = 18 564 There ca be from 0 to 6 wome o the jury. The umber of ways i which x wome ca be selected is 10 C x. The me ca fill the remaiig 6 x positios o the jury i 8 C 6 x ways. Thus, the umber of ways of selectig a jury with x wome o it is 10 C x 8 C 6 x ad the probability of a jury with x wome is P(x) = (x) (S) = 10C C x 8 6 x P(x) 0.4 This combiatio could also be writte as C(18, 6) or. 18 6 Number of Wome, x Probability, P(x) 0 C C 10 0 8 6 = 0.001 51 1 C C 10 1 8 5 = 0.030 17 2 C C 10 2 8 4 = 0.169 68 Probability 0.3 0.2 0.1 0 1 2 3 4 5 6 Number of Wome Jurors, x 3 C C 10 3 8 3 = 0.361 99 4 C C 10 4 8 2 = 0.316 74 5 C C 10 5 8 1 = 0.108 60 6 C C 10 6 8 0 = 0.011 31 398 MHR Probability Distributios

b) E(X ) = 6 x i P(x i ) i=0 = (0)(0.001 51) + (1)(0.030 17) + (2)(0.169 68) + (3)(0.361 99) + (4)(0.316 74) + (5)(0.108 60) + (6)(0.011 31) = 3.333 33 The expected umber of wome o the jury is approximately 3.333. Solutio 2 Usig a Graphig Calculator a) Eter the possible values for x, 0 to 6, i L1. The, eter the formula for P(x) i L2: (10 Cr L1) (8 Cr (6 L1)) (18 Cr 6) b) Calculate xp(x) i L3 usig the formula L1 L2. QUIT to the home scree. You ca fid the expected umber of wome by usig the sum( fuctio i the LIST MATH meu. The expected umber of wome o the jury is approximately 3.333. Solutio 3 Usig a Spreadsheet a) Ope a ew spreadsheet. Create titles x, p(x), ad xp(x) i colums A to C. Eter the values of the radom variable x i colum A, ragig from 0 to 6. Next, use the combiatios fuctio to eter the formula for P(x) i cell B3 ad copy it to cells B4 through B9. 7.4 Hypergeometric Distributios MHR 399

b) Calculate xp(x) i colum C by eterig the formula A3*B3 i cell C3 ad copyig it to cells C4 through C9. The, calculate the expected value usig the SUM fuctio. The expected umber of wome o the jury is approximately 3.333. Solutio 4 Usig Fathom TM Ope a ew Fathom documet. Drag a ew collectio box to the work area ad ame it Number of Wome Jurors. Create seve ew cases. Drag a ew case table to the work area. Create three ew attributes: x, px, ad xpx. Eter the values from 0 to 6 for the x attribute. Right-click o the px attribute, select Edit Formula, ad eter combiatios(10,x)*combiatios(8,6-x)/combiatios(18,6) Similarly, calculate xp(x) usig the formula x*px. Next, double-click o the collectio box to ope the ispector. Select the Measures tab, ad ame a ew measure Ex. Right-click o Ex ad use the sum fuctio to eter the formula sum(x*px). The expected umber of wome o the jury is approximately 3.333. 400 MHR Probability Distributios

You ca geeralize the methods i Example 1 to show that for a hypergeometric distributio, the probability of x successes i r depedet trials is Probability i a Hypergeometric Distributio P(x) = ac C x a r x C, r where a is the umber of successful outcomes amog a total of possible outcomes. Although the trials are depedet, you would expect the average probability of a a success to be the same as the ratio of successes i the populatio,. Thus, the expectatio for r trials would be Expectatio for a Hypergeometric Distributio E(X ) = r a This formula ca be prove more rigorously by some challegig algebraic maipulatio of the terms whe P(x) = C C a x a r x is substituted ito the equatio for the expectatio of ay probability distributio, E(X ) = x i P(x i ). C r i=1 Example 2 Applyig the Expectatio Formula Calculate the expected umber of wome o the jury i Example 1. Solutio E(X ) = r a = 6 10 18 = 3.33 The expected umber of wome jurors is 3.33. Example 3 Expectatio of a Hypergeometric Distributio A box cotais seve yellow, three gree, five purple, ad six red cadies jumbled together. a) What is the expected umber of red cadies amog five cadies poured from the box? b) Verify that the expectatio formula for a hypergeometric distributio gives the same result as the geeral equatio for the expectatio of ay probability distributio. 7.4 Hypergeometric Distributios MHR 401

Solutio a) = 7 + 3 + 5 + 6 r = 5 a = 6 = 21 Usig the expectatio formula for the hypergeometric distributio, E(X ) = r a 5 6 = 21 = 1.4285 Oe would expect to have approximately 1.4 red cadies amog the 5 cadies. b) Usig the geeral formula for expectatio, E(X ) = xp(x) = (0) C C + (1) C C 6 + (2) C C + (3) C C + (4) C C + (5) C C 6 0 15 5 6 1 15 4 6 2 15 3 6 3 15 2 6 4 15 1 5 15 0 = 1.4285 Agai, the expected umber of red cadies is approximately 1.4. Example 4 Wildlife Maagemet I the sprig, the Miistry of the Eviromet caught ad tagged 500 raccoos i a wilderess area. The raccoos were released after beig vacciated agaist rabies. To estimate the raccoo populatio i the area, the miistry caught 40 raccoos durig the summer. Of these 15 had tags. a) Determie whether this situatio ca be modelled with a hypergeometric distributio. b) Estimate the raccoo populatio i the wilderess area. www.mcgrawhill.ca/liks/mdm12 To lear more about samplig ad wildlife, visit Solutio the above web site ad follow the liks. Write a brief descriptio of some of the samplig a) The 40 raccoos captured durig the summer were all differet from each other. I other words, techiques that are used. there were o repetitios, so the trials were depedet. The raccoos were either tagged (a success) or ot (a failure). Thus, the situatio does have all the characteristics of a hypergeometric distributio. b) Assume that the umber of tagged raccoos caught durig the summer is equal to the expectatio for the hypergeometric distributio. You ca substitute the kow values i the expectatio formula ad the solve for the populatio size,. 402 MHR Probability Distributios

Here, the umber of raccoos caught durig the summer is the umber of trials, so r = 40. The umber of tagged raccoos is the umber of successes i the populatio, so a = 500. ra E(X ) =, so 15= 40 500 = 40 500 15 = 1333.3 The raccoo populatio i the wilderess area is approximately 1333. Alteratively, you could assume that the proportio of tagged raccoos amog the sample captured durig the summer correspods to that i the whole populatio. The, 1 5 = 50 0, which gives the same estimate for 40 as the calculatio show above. Key Cocepts A hypergeometric distributio has a specified umber of depedet trials havig two possible outcomes, success or failure. The radom variable is the umber of successful outcomes i the specified umber of trials. The idividual outcomes caot be repeated withi these trials. The probability of x successes i r depedet trials is P(x) = C C a x a r x, where is the populatio size ad a is the umber of successes i the populatio. ra The expectatio for a hypergeometric distributio is E(X ) =. To simulate a hypergeometric experimet, esure that the umber of trials is represetative of the situatio ad that each trial is depedet (o replacemet or resettig betwee trials). Record the umber of successes ad summarize the results by calculatig probabilities ad expectatio. C r Commuicate Your Uderstadig 1. Describe how the graph i Example 1 differs from the graphs of the uiform, biomial, ad geometric distributios. 2. Cosider this questio: What is the probability that 5 people out of a group of 20 are left haded if 10% of the populatio is left-haded? Explai why this situatio does ot fit a hypergeometric model. Rewrite the questio so that you ca use a hypergeometric distributio. 7.4 Hypergeometric Distributios MHR 403

Practise A 1. Which of these radom variables have a hypergeometric distributio? Explai why. a) the umber of clubs dealt from a deck b) the umber of attempts before rollig a six with a die c) the umber of 3s produced by a radomumber geerator d) the umber of defective screws i a radom sample of 20 take from a productio lie that has a 2% defect rate e) the umber of male ames o a page selected at radom from a telephoe book f) the umber of left-haded people i a group selected from the geeral populatio g) the umber of left-haded people selected from a group comprised equally of left-haded ad right-haded people 2. Prepare a table ad a graph of a hypergeometric distributio with a) = 6, r = 3, a = 3 b) = 8, r = 3, a = 5 Apply, Solve, Commuicate B 3. There are five cats ad seve dogs i a pet shop. Four pets are chose at radom for a visit to a childre s hospital. a) What is the probability that exactly two of the pets will be dogs? b) What is the expected umber of dogs chose? 4. Commuicatio Earlier this year, 520 seals were caught ad tagged. O a recet survey, 30 out of 125 seals had bee tagged. a) Estimate the size of the seal populatio. b) Explai why you caot calculate the exact size of the seal populatio. 5. Of the 60 grade-12 studets at a school, 45 are takig Eglish. Suppose that 8 grade-12 studets are selected at radom for a survey. a) Develop a simulatio to determie the probability that 5 of the selected studets are studyig Eglish. b) Use the formulas developed i this sectio to verify your simulatio results. 6. Iquiry/Problem Solvig I a study of Caada geese, 200 of a kow populatio of 1200 geese were caught ad tagged. Later, aother 50 geese were caught. a) Develop a simulatio to determie the expected umber of tagged geese i the secod sample. b) Use the formulas developed i this sectio to verify your simulatio results. 7. Applicatio I a mathematics class of 20 studets, 5 are biligual. If the class is radomly divided ito 4 project teams, a) what is the probability that a team has fewer tha 2 biligual studets? b) what is the expected umber of biligual studets o a team? 8. I a swim meet, there are 16 competitors, 5 of whom are from the Easter Swim Club. a) What is the probability that 2 of the 5 swimmers i the first heat are from the Easter Swim Club? b) What is the expected umber of Easter Swim Club members i the first heat? 9. The door prizes at a dace are four $10 gift certificates, five $20 gift certificates, ad three $50 gift certificates. The prize evelopes are mixed together i a bag, ad five prizes are draw at radom. a) What is the probability that oe of the prizes is a $10 gift certificate? b) What is the expected umber of $20 gift certificates draw? 404 MHR Probability Distributios

10. A 12-member jury for a crimial case will be selected from a pool of 14 me ad 11 wome. a) What is the probability that the jury will have 6 me ad 6 wome? b) What is the probability that at least 3 jurors will be wome? c) What is the expected umber of wome? 11. Seve cards are dealt from a stadard deck. a) What is the probability that three of the seve cards are hearts? b) What is the expected umber of hearts? 12. A bag cotais two red, five black, ad four gree marbles. Four marbles are selected at radom, without replacemet. Calculate a) the probability that all four are black b) the probability that exactly two are gree c) the probability that exactly two are gree ad oe are red d) the expected umbers of red, black, ad gree marbles Kowledge/ Uderstadig ACHIEVEMENT CHECK Thikig/Iquiry/ Problem Solvig Commuicatio Applicatio 13. A calculator maufacturer checks for defective products by testig 3 calculators out of every lot of 12. If a defective calculator is foud, the lot is rejected. a) Suppose 2 calculators i a lot are defective. Outlie two ways of calculatig the probability that the lot will be rejected. Calculate this probability. b) The quality-cotrol departmet wats to have at least a 30% chace of rejectig lots that cotai oly oe defective calculator. Is testig 3 calculators i a lot of 12 sufficiet? If ot, how would you suggest they alter their quality-cotrol techiques to achieve this stadard? Support your aswer with mathematical calculatios. C 14. Suppose you buy a lottery ticket for which you choose six differet umbers betwee 1 ad 40 iclusive. The order of the first five umbers is ot importat. The sixth umber is a bous umber. To wi first prize, all five regular umbers ad the bous umber must match, respectively, the radomly geerated wiig umbers for the lottery. For the secod prize, you must match the bous umber plus four of the regular umbers. a) What is the probability of wiig first prize? b) What is the probability of wiig secod prize? c) What is the probability of ot wiig a prize if your first three regular umbers match wiig umbers? 15. Iquiry/Problem Solvig Uder what coditios would a biomial distributio be a good approximatio for a hypergeometric distributio? 16. Iquiry/Problem Solvig You start at a corer five blocks south ad five blocks west of your fried. You walk orth ad east while your fried walks south ad west at the same speed. What is the probability that the two of you will meet o your travels? 17. A research compay has 50 employees, 20 of whom are over 40 years old. Of the 22 scietists o the staff, 12 are over 40. Compare the expected umbers of older ad youger scietists i a radomly selected focus group of 10 employees. 7.4 Hypergeometric Distributios MHR 405