Conditional Probability, Hypothesis Testing, and the Monty Hall Problem



Similar documents
Probability. a number between 0 and 1 that indicates how likely it is that a specific event or set of events will occur.

Book Review of Rosenhouse, The Monty Hall Problem. Leslie Burkholder 1

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10

Week 2: Conditional Probability and Bayes formula

Math 141. Lecture 2: More Probability! Albyn Jones 1. jones/courses/ Library 304. Albyn Jones Math 141

How To Choose Between A Goat And A Door In A Game Of \"The Black Jackpot\"

STEP 5: Giving Feedback

5544 = = = Now we have to find a divisor of 693. We can try 3, and 693 = 3 231,and we keep dividing by 3 to get: 1

Assertive Communication

Section 2: Ten Tools for Applying Sociology

Probabilities. Probability of a event. From Random Variables to Events. From Random Variables to Events. Probability Theory I

6.3 Conditional Probability and Independence

Monty Hall, Monty Fall, Monty Crawl

R Simulations: Monty Hall problem

Types of Studies. Systematic Reviews and Meta-Analyses

Hypothesis Testing for Beginners

Definition and Calculus of Probability

Introduction to Hypothesis Testing

Conditional Probability, Independence and Bayes Theorem Class 3, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Self-directed learning: managing yourself and your working relationships

Bayesian Tutorial (Sheet Updated 20 March)

Probability definitions

Picture yourself in a meeting. Suppose there are a dozen people

A Few Basics of Probability

1. The RSA algorithm In this chapter, we ll learn how the RSA algorithm works.

Relative and Absolute Change Percentages

The 5 P s in Problem Solving *prob lem: a source of perplexity, distress, or vexation. *solve: to find a solution, explanation, or answer for

A PARENT S GUIDE TO CPS and the COURTS. How it works and how you can put things back on track

The Calculus of Probability

Financial Mathematics

Self-Acceptance. A Frog Thing by E. Drachman (2005) California: Kidwick Books LLC. ISBN Grade Level: Third grade

Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

Lecture 13. Understanding Probability and Long-Term Expectations

Video Poker in South Carolina: A Mathematical Study

FOR MORE, go to Problem Behavior in My Classroom?

3. Mathematical Induction

Chapter 4 COMMUNICATION SKILLS. The difference between verbal and nonverbal communication. The difference between hearing and listening

THE DIMENSION OF A VECTOR SPACE

Writing a Scholarship Essay. Making the essay work for you!

FOR. 14 Recommendations from a Top Futures Broker. Stuart A. Vosk. Center for Futures Education, Inc.

HIPAA RULES AND REGULATIONS

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Companion Animals Amphibians & Reptiles Cats Dogs Horses Veterinary Science

Basic Probability Concepts

4 G: Identify, analyze, and synthesize relevant external resources to pose or solve problems. 4 D: Interpret results in the context of a situation.

Discrete Math in Computer Science Homework 7 Solutions (Max Points: 80)

Introduction to Hypothesis Testing OPRE 6301

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

E3: PROBABILITY AND STATISTICS lecture notes

Determine If An Equation Represents a Function

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

Introduction to Probability

1 if 1 x 0 1 if 0 x 1

Mathematical Induction. Mary Barnes Sue Gordon

Lab 11. Simulations. The Concept

news Membership statistics update Under One Roof is the service for letting affordable rented homes in St Helens. In this issue...

Section 6.2 Definition of Probability

MOST FREQUENTLY ASKED INTERVIEW QUESTIONS. 1. Why don t you tell me about yourself? 2. Why should I hire you?

Preparing for the IELTS test with Holmesglen Institute of TAFE

Average producers can easily increase their production in a larger office with more market share.

Probabilistic Strategies: Solutions

STA 371G: Statistics and Modeling

BBC Learning English Funky Phrasals Dating

Ch. 13.2: Mathematical Expectation

Problem sets for BUEC 333 Part 1: Probability and Statistics

Participants Manual Video Seven The OSCAR Coaching Model

Mathematical goals. Starting points. Materials required. Time needed

Feb 7 Homework Solutions Math 151, Winter Chapter 4 Problems (pages )

Writing Thesis Defense Papers

Bayesian Updating with Discrete Priors Class 11, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Dom Jackson, Web Support Assistant Student Services Information Desk

DOING YOUR BEST ON YOUR JOB INTERVIEW

Instructor s Guide. Quick Interview and Salary Negotiation Video

P1. All of the students will understand validity P2. You are one of the students C. You will understand validity

An edited version of this article was published in the London Review of Books 37 (10), 21 May 2015, p.38 (

Navigate: #God s will. God Has a Blueprint for My Life? Week #1:

Chapter 6 Experiment Process

3.3 Real Zeros of Polynomials

Employment & Support Allowance. Physical Health Grounds

Introductory Probability. MATH 107: Finite Mathematics University of Louisville. March 5, 2014

PREDICTING SUCCESS BY DEFINING CALL CENTER COMPETENCIES Product No

Being Accountable in Work and Life

6.042/18.062J Mathematics for Computer Science. Expected Value I

8 Divisibility and prime numbers

Terminology and Scripts: what you say will make a difference in your success

Teaching & Learning Plans. Plan 1: Introduction to Probability. Junior Certificate Syllabus Leaving Certificate Syllabus

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

A Guide to Cover Letter Writing

Risk, Return and Market Efficiency

Arkansas State PIRC/ Center for Effective Parenting

Cosmological Arguments for the Existence of God S. Clarke

Icebreaker: Fantastic Four

Chapter 6: Probability

Chapter 11 Number Theory

Philosophical argument

Chapter 1 Introduction to Correlation

Transcription:

Conditional Probability, Hypothesis Testing, and the Monty Hall Problem Ernie Croot September 17, 2008 On more than one occasion I have heard the comment Probability does not exist in the real world, and most recently I heard this in the context of plane crashes: If you ask me what the probability that the next plane I get on will crash, I would say probably 1 in 10,000,000; but if you told me that the pilot was drunk, I might say 1 in 1,000,000. The problem here is that there is no the probability by the probability we mean the probability conditioned on the given information, making certain natural assumptions. Thus, as the information and our assumptions change, so will our probability measure. That is the difficulty in applying probability theory to real-world problems: We often have to infer from the data a natural probability measure to use; however, in the purely theoretical world, that measure is usually just given to us. In this short note I will work through two examples in depth to show how conditional probability and Bayes s theorem can be used to solve them: the testing problem, and the Monty Hall problem. 1 Hypothesis Testing If someone tells you that a test for cancer (or alchohol, or drugs, or lies etc.) is 98 percent accurate, it would be wise to ask them what they mean, as the following example will demonstrate: Suppose that a person goes in for a routine medical test, and one of the tests says that he has cancer. The doctor tells him that the test is not foolproof, but is nonethless 98 percent accurate. The patient asks the 1

doctor what he means, and the doctor tells him that if someone with cancer is given the test, it will correctly say cancer 98% of the time, and if someone does not have cancer, the test responds no cancer 98% of the time. Assuming that only 2% of the population has cancer, if a person selected at random from the population tests positive for cancer, how reliable is the test? The answer turns out to be only 50%. Here is a commonsense explanation: Suppose you have 5000 people, 2% or 100 of whom have cancer, and so 4900 of whom do not. If you give these people the test, 2% of those 4900 will test positive for cancer these are what are called false positives. At the same time, 98 of the 100 people who really have cancer will test positive. In total you will get (0.02)4900 + 98 196 test results that read positive ; and, only 98 of these will actually have cancer. Thus, the test is only 50% reliable. In the language of conditional probability and Bayes s theorem, P(cancer tests+) P(tests+ cancer)p(cancer) P(tests+ cancer)p(cancer) + P(tests+ no cancer)p(no cancer) (0.98)(0.02) (0.98)(0.02) + (0.02)(0.98) 1 2. 1.1 Eliminating Suspects In tests of evidence or even medical tests, there is the notion of eliminating suspects as not being the same as pointing the finger of blame. Just to give you an idea of how good a test can be to eliminate suspects, while at the same time being very poor at pointing the finger of blame, let us continue with our cancer test example: Suppose you are given that the test responds negative for cancer. What is the probability that it is correct i.e. that the person really does not have cancer? Applying Bayes s theorem as above, one will quickly discover (try it yourself) that P(no cancer tests ) 99.96%. 2

So, the test is extremely good at screening out people, but very poor at deciding whether a person actually has it! 1.2 Pay attention to how the test is presented As we have seen, if a test is presented to you as being 98% accurate, that may not be at all true, due to a high false positive rate. What one should do if someone advertises a test as 98% accurate is one ask them to be precise, and one should pay close attention to their answer! Also, it would be a good idea to ask them how high the false positive rate is. If the test giver provides you with the following sort of answer, know that it is bogus: The giver of the cancer test tries to demonstrate to you that it is highly effective by taking 10 samples of people with cancer, and 10 samples without, and feeding the samples to the testing device. Upon getting correct results for the 20 samples, the tester tells you, see!... the test is accurate. Why is the test giver s demonstration completely bogus? 2 The Monty Hall Problem Here I will state a variant of the problem, where the math is easier to analyze: There are three doors. Behind two are goats, and behind one is a prize. You begin by picking door 1 as the door you want the host to open. The host looks at you and says Wait a minute. Let me show you what is behind this door, and you see a goat behind one of doors 2 or 3. He then asks if you want to stay with door 1, or switch to the remaining unopened door. It turns out that your chances of winning if you switch are 2/3, making certain natural assumptions, such as that the prize was chosen at random to be behind doors 1,2,3, each with equal probability; and, that if the prize was behind door 1, then the host picked door 2 or 3 with equal probability, for the one to reveal. A quick way to get to this answer of 2/3 is to realize that if after initially selecting door 1 you shut your ears and eyes, and pay no attention to the host (Monty Hall), your chances of winning with door 1 must be 1/3. So, the chances that the remaining door was the winner must be 1 1/3 2/3. In the next subsections we will give alternative ways to solve the problem. 3

2.1 Exhausting the Possibilities One way to solve the problem is to write down the sample space S, and reason for the probability of all the elementary elements; and, then, interpret the event you win if you switch as a subset of S. A natural definition for S is {(1, 2), (1, 3), (2, 3), (3, 2)}, where the first coordinates are the doors that the prize is behind, and the second coordinate is the door Monty decides to reveal. Since it is equally likely that the prize is behind doors 1, 2, or 3, we know that P({(1, 2), (1, 3)}) P({(2, 3)}) P({(3, 2)}) 1 3. Also, we assume that P({(1, 2)}) P({(1, 3)}), as Monty is not picky about whether to reveal door 2 or door 3 when the prize is behind door 1. With these assumptions, it follows that P({(1, 2)}) 1 6 P({(1, 3)}) 1 6 P({(2, 3)}) 1 3 P({(3, 2)}) 1 3. Now, the event You win if you switch to door 2 or 3 that Monty did not reveal is E {(2, 3), (3, 2)}, which has probability P(E) P({(2, 3)}) + P({(3, 2)}) 2 3. 4

A common mistake here is to assume that the probabilities of all the events of S are equal, which would give 1/2 for P(E). This problem is a good example of a case where your probability measure on S is not uniform (like it is for fair die or fair coin experiments)! 2.2 Conditional Probability Approach Here is a somewhat better way to do it using the following fact (which itself is a part of the general Bayes s Theorem): Suppose that E 1, E 2,..., E k are k disjoint events whose union is all of S, the sample space. Then, given an event E we have P(E) P(E E 1 )P(E 1 ) + + P(E E k )P(E k ). The way we apply this theorem is as follows: As in the previous subsection, let E be the event that you win if you switch to the door 2 or 3 that Monty didn t show you; and, let E 1 be the event that the prize is behind door 1, E 2 is the event the prize is behind door 2, and E 3 is the event the prize is behind door 3. Clearly, E 1, E 2, E 3 are disjoint and their union is S. We now have P(E) P(E E 1 )P(E 1 ) + P(E E 2 )P(E 2 ) + P(E E 3 )P(E 3 ). Clearly, P(E E 1 ) 0, P(E 1 ) P(E 2 ) P(E 3 ) 1/3, and P(E E 2 ) P(E E 3 ) 1. So, 2.3 Two Variations P(E) 0 + 1 3 + 1 3 2 3. We consider the following two variations of the problem which look almost identical, but are subtlely different (actually, it isn t that subtle if one thinks about it). Problem 1. Given that the host reveals door 3, what is the probability of winning if you switch? Problem 2. Given that door 3 has a goat, what is the probability of winning if you switch? 5

The conditional information in problem 1 is the set A {(1, 3), (2, 3)}, and the conditional information in problem 2 is the set B {(1, 3), (1, 2), (2, 3)}. So, in problem 2 you are given less information (that is, a larger set of possibilities). The solution to problem 1 is thus P(E A) P(E A) P(A) P({(2, 3)} P(A) 1/3 1/6 + 1/3 2 3. The solution to problem 2 is P(E B) P(E B) P(B) 1/3 1/6 + 1/6 + 1/3 1 2. 6