STA 371G: Statistics and Modeling

Similar documents

Basic Probability. Probability: The part of Mathematics devoted to quantify uncertainty

What Is Probability?

Probability, statistics and football Franka Miriam Bru ckler Paris, 2015.

In the situations that we will encounter, we may generally calculate the probability of an event

Chapter 5 A Survey of Probability Concepts

Homework 8 Solutions

Section 6.2 Definition of Probability

Probability. a number between 0 and 1 that indicates how likely it is that a specific event or set of events will occur.

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

People have thought about, and defined, probability in different ways. important to note the consequences of the definition:

Statistics in Geophysics: Introduction and Probability Theory

Machine Learning.

Bayesian Updating with Discrete Priors Class 11, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Bayesian Tutorial (Sheet Updated 20 March)

Chapter 4: Probability and Counting Rules

CONTINGENCY (CROSS- TABULATION) TABLES

PROBABILITY. Chapter. 0009T_c04_ qxd 06/03/03 19:53 Page 133

Math/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability

1 Introduction to Option Pricing

MAS113 Introduction to Probability and Statistics

Chapter 7 Probability. Example of a random circumstance. Random Circumstance. What does probability mean?? Goals in this chapter

The Calculus of Probability

Math 141. Lecture 2: More Probability! Albyn Jones 1. jones/courses/ Library 304. Albyn Jones Math 141

36 Odds, Expected Value, and Conditional Probability

Concepts of Probability

University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key

V. RANDOM VARIABLES, PROBABILITY DISTRIBUTIONS, EXPECTED VALUE

Probabilities. Probability of a event. From Random Variables to Events. From Random Variables to Events. Probability Theory I

Probability & Probability Distributions

A Few Basics of Probability

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

E3: PROBABILITY AND STATISTICS lecture notes

Chapter 13 & 14 - Probability PART

Week 2: Conditional Probability and Bayes formula

Probability. Sample space: all the possible outcomes of a probability experiment, i.e., the population of outcomes

Definition and Calculus of Probability

Probability and Expected Value

Unit 19: Probability Models

MTH6120 Further Topics in Mathematical Finance Lesson 2

Bayesian Analysis for the Social Sciences

psychology and economics

Probability and statistics; Rehearsal for pattern recognition

Decision making in the presence of uncertainty II

1. Probabilities lie between 0 and 1 inclusive. 2. Certainty implies unity probability. 3. Probabilities of exclusive events add. 4. Bayes s rule.

Remarks on the Concept of Probability

AP Stats - Probability Review

Decision Making Under Uncertainty. Professor Peter Cramton Economics 300

Lecture 13. Understanding Probability and Long-Term Expectations

The "Dutch Book" argument, tracing back to independent work by. F.Ramsey (1926) and B.deFinetti (1937), offers prudential grounds for

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber

Chapter 4 & 5 practice set. The actual exam is not multiple choice nor does it contain like questions.

15-381: Artificial Intelligence. Probabilistic Reasoning and Inference

Teaching & Learning Plans. Plan 1: Introduction to Probability. Junior Certificate Syllabus Leaving Certificate Syllabus

Introductory Probability. MATH 107: Finite Mathematics University of Louisville. March 5, 2014

Practice Midterm IND E 411 Winter 2015 February 2, 2015

6.042/18.062J Mathematics for Computer Science. Expected Value I

P (B) In statistics, the Bayes theorem is often used in the following way: P (Data Unknown)P (Unknown) P (Data)

Fundamentals of Probability

A probability experiment is a chance process that leads to well-defined outcomes. 3) What is the difference between an outcome and an event?

Conditional Probability, Hypothesis Testing, and the Monty Hall Problem

The study of probability has increased in popularity over the years because of its wide range of practical applications.

Comparing & Contrasting. - mathematically. Ways of comparing mathematical and scientific quantities...

AP STATISTICS TEST #2 - REVIEW - Ch. 14 &15 Period:

Homework 3 Solution, due July 16

PROBABILITY PROBABILITY

WORKED EXAMPLES 1 TOTAL PROBABILITY AND BAYES THEOREM

If, under a given assumption, the of a particular observed is extremely. , we conclude that the is probably not

Chapter 5 Section 2 day f.notebook. November 17, Honors Statistics

6.3 Conditional Probability and Independence

Session 8 Probability

Prediction Markets, Fair Games and Martingales

Curriculum Design for Mathematic Lesson Probability

Chapter 4. Probability and Probability Distributions

Decision Making under Uncertainty

Statistics and Data Analysis B

Expected Value. 24 February Expected Value 24 February /19

3. Data Analysis, Statistics, and Probability

STA 256: Statistics and Probability I

Section 6-5 Sample Spaces and Probability

AMS 5 CHANCE VARIABILITY

Chapter 4 - Practice Problems 1

Chapter 4 Decision Analysis

Decision Trees and Networks

Discrete Structures for Computer Science

Algebra 2 C Chapter 12 Probability and Statistics

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS

Binomial Probability Distribution

Review. Bayesianism and Reliability. Today s Class

Introduction to Hypothesis Testing

Practice Problems for Homework #8. Markov Chains. Read Sections Solve the practice problems below.

Lecture 11 Uncertainty

Risk and Uncertainty. Managerial Economics: Economic Tools for Today s Decision Makers, 4/e

CHAPTER 2 Estimating Probabilities

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS INTRODUCTION

When 95% Accurate Isn t Written by: Todd CadwalladerOlsker California State University, Fullerton tcadwall@fullerton.edu

Exam #1 (100 points)

Transcription:

STA 371G: Statistics and Modeling Decision Making Under Uncertainty: Probability, Betting Odds and Bayes Theorem Mingyuan Zhou McCombs School of Business The University of Texas at Austin http://mingyuanzhou.github.io/sta371g 1

Probability: Measuring Uncertainty Probability can be thought of as the mathematical language of uncertainty. The axioms of probability can be stated as The probability of an event E is nonnegative, P(E) 0 If S denotes the set of all possible events, then P(S) = 1 If two events E 1 and E 2 are mutually exclusive, then P(E 1 or E 2 ) = P(E 1 ) + P(E 2 ) 2

Interpretation of Probability Frequency interpretation: P(E) = number of elementary events comprising E total number of elementary events in S Examples: How often would one expect to obtain heads if a coin is tossed repeatedly? Drawing a marble at random with replacement from a box with 30 red and 70 blue marbles, how often would one expect to obtain a red marble? The probability of an event is interpreted as the relative frequency of occurrence of an event to be expected in the long run Law of large numbers: if an experiment is repeated many times under identical conditions, the relative frequency of occurrence for any event is likely to be close to the probability of that event 3

Interpretation of Probability Subjective interpretation: a probability is interpreted as a measure of degree of belief, or as the quantified judgement of a particular individual. Information in the form of observed frequencies might not be available regarding repeated trials under identical conditions. Examples: I am likely to get a score above 90 in the test It will probably rain tomorrow UT football might have a 10% chance to win the national championship next year 4

Probabilities, Lotteries, and Betting Odds Subjective probabilities may vary from person to person for the same event, as different individuals may have different degrees of belief, or judgments. A formal definition of subjective probability can be given in terms of lotteries. Your subjective probability P(E) of an event E is the number P(E) that makes you indifferent between the following two lotteries: Lottery A: Lottery B: You obtain X with probability P(E) You obtain Y with probability 1 P(E) You obtain X if E occurs You obtain Y if E does not occur 5

Probabilities, Lotteries, and Betting Odds Lottery A: You win $100 with probability 40% You win $0 with probability 60% Lottery B: You obtain $100 if it rains tomorrow You obtain $0 if it does not rain tomorrow Lottery A: You loose $10 with probability 70% You loose $20 with probability 30% Lottery B: You loose $10 if it rains tomorrow You loose $20 if it does not rain tomorrow 6

Probabilities, Lotteries, and Betting Odds If the probability of an event is p, then the odds in favor of that event are p to (1 p). If the odds in favor of an event are a to b, then the probability of that event is equal to a/(a + b). If the odds against an event are a to b, then the probability of that event is equal to b/(a + b). Two persons make the following bet: Person A bets $3 that IBM stock price goes up tomorrow Person B bets $2 that IBM stock price does not go up tomorrow If you think this is a fair bet, what are the odds in favor of IBM stock price going up tomorrow? What are the odds against IBM stock price going up tomorrow? What s your probability that IBM stock price goes up tomorrow? 7

Payoff Tables The consequence of the decision can be expressed in a Payoff Table. For example, depending on whether or not it rains, the consequence of your decision on whether or not to carry an umbrella can be summarized in a Payoff Table as: Rain No Rain Carry umbrella $3 $3 Do not carry umbrella $90 $0 Your uncertainty: P(Rain) and P(No Rain) Your decision: Carry umbrella or Do not carry umbrella The payoffs for different decisions may vary from person to person Sources of information: weather report, historical data, yesterday s weather, look out the window before you leave home... 8

Example: Oil Drilling The decision maker must decide whether or not to drill for oil at a particular location. The cost of drilling is $300,000. If the decision is to drill and Oil is found, then the Oil will be worth $1,300,000. If the decision is not to drill, then the drilling rights will be sold for a fixed sum of $100,000 plus another $100,000 contingent upon the presence of oil. Construct the payoff table and discuss how to model the decision maker s uncertainty about the presence of oil. 9

The payoff table can be expressed as: Oil No Oil Drill $1,000,000 $300,000 Do not drill $200,000 $100,000 Decision maker s uncertainty: P(Oil) and P(No Oil) Decision maker s decision: Drill or Do not drill Sources of information: experience on similar oil-drilling locations, geological information, a geologist s assessment What would be your recommendation to the decision maker if the geologist assesses that the odds against oil is 7 to 3? 10

Example: Advertising A decision maker has to decide whether to launch ad campaign. If no ad campaign is launched, then the revenue is $700,000 if the competitor does not introduce a new product and decreases to $100,000 otherwise. If a minor ad campaign at the cost of $150,000 is launched, then the revenue is $750,000 if the competitor does not introduce a new product and decreases to $450,000 otherwise. If a major ad campaign at the cost of $300,000 is launched, then the revenue is $800,000 if the competitor does not introduce a new product and decreases to $700,000 otherwise. Construct the payoff table and discuss how to model the decision maker s uncertainty. 11

The payoff table can be expressed as: Competitor Competitor introduces introduces new product no new product No advertising $100,000 $700,000 Minor ad campaign $300,000 $600,000 Major ad campaign $400,000 $500,000 Decision maker s uncertainty:? Sources of information:? 12

Game Theory In the ad campaign example, the uncertainty involves a competing firm s action. Such situations are often classified under the heading of game theory: In game theory, the state of the world is determined by some opponent who is interested in making the best possible, as opposed to having the state of the world being determined by a disinterested opponent (such as nature). Some examples of realistic game-theoretic situations: A bidding situation: A bargaining situation: A military situation: How to assess your subjective probabilities for an opponent s action in a game-theoretic situation? 13

Game Theory In the ad campaign example, the uncertainty involves a competing firm s action. Such situations are often classified under the heading of game theory: In game theory, the state of the world is determined by some opponent who is interested in making the best possible, as opposed to having the state of the world being determined by a disinterested opponent (such as nature). Some examples of realistic game-theoretic situations: A bidding situation: competing firms bid for a contract and the firm with the lowest bid is awarded the contract. A bargaining situation: A military situation: How to assess your subjective probabilities for an opponent s action in a game-theoretic situation? 14

Game Theory In the ad campaign example, the uncertainty involves a competing firm s action. Such situations are often classified under the heading of game theory: In game theory, the state of the world is determined by some opponent who is interested in making the best possible, as opposed to having the state of the world being determined by a disinterested opponent (such as nature). Some examples of realistic game-theoretic situations: A bidding situation: competing firms bid for a contract and the firm with the lowest bid is awarded the contract. A bargaining situation: two parties attempt to reach an agreement. A military situation: How to assess your subjective probabilities for an opponent s action in a game-theoretic situation? 15

Game Theory In the ad campaign example, the uncertainty involves a competing firm s action. Such situations are often classified under the heading of game theory: In game theory, the state of the world is determined by some opponent who is interested in making the best possible, as opposed to having the state of the world being determined by a disinterested opponent (such as nature). Some examples of realistic game-theoretic situations: A bidding situation: competing firms bid for a contract and the firm with the lowest bid is awarded the contract. A bargaining situation: two parties attempt to reach an agreement. A military situation: each side must consider the possible actions that could be taken by opponents. How to assess your subjective probabilities for an opponent s action in a game-theoretic situation? 16

The Conditional Nature of Probability Conditional probability is an important concept in the theory of probability. Often one is interested in the probability that one event will occur, given that a particular second event has occurred or will occur. P(E 2 E 1 ): the conditional probability of event E 2 given event E 1. 17

Conditional Probability Your (subjective) conditional probability of event E 2 given event E 1 is the number P(E 2 E 1 ) that makes you indifferent between the following two lotteries: Lottery A: If E 1 occurs, you obtain X with probability P(E 2 E 1 ) and you obtain Y with probability 1 P(E 2 E 1 ). Lottery B: If E 1 occurs, you obtain X if E 2 occurs and you obtain Y if E 2 does not occur. Lottery A: If it rains today, you win $100 with probability 1/2 and you win $0 with probability 1/2. Lottery B: If it rains today, you win $100 if it rains tomorrow and you win $0 if it does not rain tomorrow. 18

Conditional Probability It is also possible to consider conditional probability in terms of conditional bets or conditional reference contracts. Person A bets $50 that the Oakland A s wins over half of its games in the coming season. Person B bets $20 that the team does not win over half of its games, with the proviso that the bet is called off if any of a stipulated list of key players of Oakland A s is injured during the course of the season. If you think this is a fair conditional bet, then what s your conditional probability P(Oakland A s wins over half of its games no injuries of key players) =? 19

Tree Diagram Rain Tomorrow = E 2 E 1 and E 2 Rain Today = E 1 E 1 No Rain Tomorrow = Ē2 E 1 and Ē2 Tree No Rain Today = Ē 1 Rain Tomorrow = E 2 Ē 1 and E 2 Ē 1 No Rain Tomorrow = Ē2 Ē 1 and Ē 2 20

P(Ē2 Ē1) = 0.9 Tree Diagram P(E 2 E 1) = 0.3 P(E 1, E 2) = 0.06 P(E 1) = 0.2 E 1 P(Ē2 E1) = 0.7 P(E 1, Ē2) = 0.14 Tree P(Ē 1) = 0.8 P(E 2 Ē1) = 0.1 P(Ē1, E2) = 0.08 Ē 1 P(Ē1, Ē2) = 0.72 21

Joint Probability Table P(E 1, E 2 ) = P(E 2 E 1 )P(E 1 ) E 2 Ē 2 E 1 P(E 1, E 2 ) P(E 1, Ē 2 ) P(E 1 ) Ē 1 P(Ē 1, E 2 ) P(Ē 1, Ē 2 ) P(Ē 1 ) P(E 2 ) P(Ē2) 1 Rain tomorrow No rain tomorrow Rain today 0.14 0.06 0.20 No rain today 0.08 0.72 0.80 0.22 0.78 1 22

Tree Diagram 23

Joint Probability Table Joint Probability Table for Sales (S) and Economy (E) 24

Bayes Theorem Disease testing example... Let D = 1 indicate you have a disease Let T = 1 indicate that you test positive for it If you take the test and the result is positive, you are really interested in the question: What s the chance for a person with a positive test to have the disease? 25

Bayes Theorem 0.019 P(D = 1 T = 1) = (0.019 + 0.0098) = 0.66 P(D = 1 T = 0) =? 26

Bayes Theorem A convenient formula that gives the relationship among various conditional probabilities is Bayes Theorem: P(A B) = P(B A)P(A) P(B) P(A): prior, initial degree of belief in A P(B A): likelihood P(B) = j P(B A = A j)p(a = A j ) P(A B): posterior, degree of belief in A after accounting for evidence B Bayes theorem describes the computation of the posterior probabilities given the prior probabilities and likelihoods. 27

Bayes Theorem In the disease testing example: P(D = 1 T = 1) = P(T =1 D=1)P(D=1) P(T =1) P(D = 1) = 0.02: the initial degree of belief in D = 1 is 2% P(T = 1 D = 1) = 0.95: likelihood P(T = 1) = P(T = 1 D = 1)P(D = 1) + P(T = 1 D = 0)P(D = 0) = 0.0288 P(D = 1 T = 1) = 0.95 0.02 0.0288 = 0.66: given T = 1, the degree of belief in D = 1 becomes 66% P(D = 0 T = 0) =? 28

Frequentist Interpretation In Bayesian interpretation, probability measures a degree of belief. In frequentist interpretation, probability measures a proportion of outcomes. Try to think about this intuitively... imagine you are about to test 100,000 people. we assume that about 2,000 of those have the disease. we also expect 1% of the disease-free people to test positive, ie, 980, and 95% of the sick people to test positive, ie 1,900. So, we expect a total of 2,880 positive tests. Choose one of the 2,880 people at random... what is the probability that he/she has the disease? isn t that the same?! P(D = 1 T = 1) = 1, 900/2, 880 = 0.66 29

Bayes Rule Bayes theorem is also known as Bayes rule or Bayes law Let A 1, A 2,..., A p be the possible outcomes of a random event... Also, let B be the outcome of an event that happened before we get to see A. By Bayes Rule: p(a i B) = p(b A i )p(a i ) p(b A 1 )p(a 1 ) + + p(b A p )p(a p ) 30

Example: Drug Testing Imagine we are trying to test football players for the possible use of steroids. The result of the test is either positive or negative. In general, tests are not perfect and some athletes who are drug free test positive (false positive) while some other drug users test negative (false negatives). Assume that in general you believe that 5% of players use steroids and the probability of a false positive is 3% and of a false negative is 7%. Assume we decide to perform this test on Colt McCoy and he tests positive! Are we sure he is indeed a drug user? 31

Example: Drug Testing We are looking for p(s +), ie, the probability of steroids usage (S) given a positive test result (+) What do we have? p(+ S), p(+ NS), p( S), p( NS), p(s) and p(ns)... right? By Bayes rule: Why? p(s +) = p(+ S)p(S) p(+ S)p(S) + p(+ NS)p(NS) = 0.620 32

Example: Drug Testing Without knowing whether or not one is a drug user what is the marginal probability of a positive test? p(+) p(+) = p(+ S)p(S) + p(+ NS)p(NS) = 0.075 and so p( ) = 0.925 Why? 33

Example: Drug Testing Try to think about this intuitively... imagine you are about to test 100,000 players. You assume that 5,000 of those use steroids. We also expect 3% of the non-users to test positive, ie, 2,850, and 93% of the users to test positive, ie 4,650. So, we expect a total of 7,500 positive tests. Choose one of the 7,500 players at random... what is the probability that he is a drug user? isn t that the same?! p(s +) = 4, 650/7, 500 = 0.62 34

Simpson s Paradox: Berkeley Gender Bias Case UC-Berkeley was one of the first universities to be sued for sexual discrimination. The graduate admission rates in the fall of 1973 were 44% for the male applicants but only 35% for the female applicants. However, examining the admission rates for individual departments did not reveal discrimination against female applicants. Table: Graduate admission Department A B C D E F A-F Male 62% 63% 37% 33% 28% 6% 52% Female 82% 68% 34% 35% 24% 7% 42% 35

Berkeley Gender Bias Case Let A = admitted to Berkeley. In 1973 it was noted that P(A male) = 0.44 while P(A female) = 0.35. Meanwhile, individual departments showed no signs of discrimination. Consider that the chemistry department and the psychology department are the only two options and the following information: Chemistry Psychology P(A female) 0.6 0.3 P(A male) 0.5 0.25 What is going on? 36

Berkeley Gender Bias Case Chemistry Psychology P(A F) 0.6 0.3 P(A M) 0.5 0.25 P(A F) = P(A F, chem)p(chem F) + P(A F, psych)p(psych F) = 0.6q f + 0.3(1 q f ) = 0.35 (hence) q f = 0.167 P(A M) = P(A M, chem)p(chem M) + P(A M, psych)p(psych M) = 0.5q m + 0.25(1 q m ) = 0.44 (hence) q m = 0.76 37

Berkeley Gender Bias Case The explanation for the apparent overall bias was that women have a higher probability of applying to Psychology than to Chemistry (assuming for simplicity that these are the only two options) and overall Psychology has a lower admissions rate! This is a cautionary tale! Before we can act on an apparent association between two variables (for example, sue Berkeley) we need to account for potential lurking variables that are the real cause of the relationship. Again, association is NOT causation! 38

Simpson s Paradox: MBA Promotion Table: Promotion Rate MBA Yes (B) MBA No ( B) First Employer Yes (C) 0.10 0.05 First Employer No ( C) 0.35 0.20 P(Promote B, C) > P(Promote B, C) P(Promote B, C) > P(Promote B, C) Employees with MBA degrees are MORE likely to be promoted. Table: Promotion Rate MBA Yes (B) MBA No ( B) 0.125 0.155 Employees with MBA degrees are LESS likely to be promoted! What s going on? 39

Simpson s Paradox: MBA Promotion P(Promote B) = P(Promote, C B) + P(Promote, C B) = P(Promote B, C)P(C B) + P(Promote B, C)P( C B) = 0.10P(C B) + 0.35(1 P(C B)) = 0.125 => P(C B) = 0.90 P(Promote B) = P(Promote, C B) + P(Promote, C B) = P(Promote B, C)P(C B) + P(Promote B, C)P( C B) = 0.05P(C B) + 0.20(1 P(C B)) = 0.155 => P(C B) = 0.30 40

Simpson s Paradox: MBA Promotion Assuming P(B) = P( B) = 0.5 and Promotion = A, can we find P(B, C A), P( B, C A), P(B, C A) and P( B, C A)? Try the Bayes Rule: where P(B, C A) = How about P(C A) and P(B A)? How about P(A C) and P(A C)? P(A B, C)P(C B)P(B) P(A) P(A) = P(A B, C)P(C B)P(B) + P(A B, C)P(C B)P( B) + P(A B, C)P( C B)P(B) + P(A B, C)P( C B)P( B) 41

Simpson s Paradox Drug A has a higher overall cure rate than Drug B, but has lower cure rates for different disease types. Player A has a higher batting average than Player B for each of the past five years, but has a lower batting average when these five years are combined. Additional examples? 42