Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 4th Class 6/10/10

Similar documents
Chapter 4 & 5 practice set. The actual exam is not multiple choice nor does it contain like questions.

Lesson 1. Basics of Probability. Principles of Mathematics 12: Explained! 314

2. How many ways can the letters in PHOENIX be rearranged? 7! = 5,040 ways.

Chapter 4 - Practice Problems 1

Chapter What is the probability that a card chosen from an ordinary deck of 52 cards is an ace? Ans: 4/52.

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10

V. RANDOM VARIABLES, PROBABILITY DISTRIBUTIONS, EXPECTED VALUE

Chapter 5 A Survey of Probability Concepts

Basic Probability Theory II

Bayesian Tutorial (Sheet Updated 20 March)

Statistics 100A Homework 2 Solutions

Chapter 4 Lecture Notes

Conditional Probability, Independence and Bayes Theorem Class 3, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Probability definitions

Probability --QUESTIONS-- Principles of Math 12 - Probability Practice Exam 1

Session 8 Probability

Introduction to Probability

MA 1125 Lecture 14 - Expected Values. Friday, February 28, Objectives: Introduce expected values.

Week 2: Conditional Probability and Bayes formula

Discrete Structures for Computer Science

Random variables, probability distributions, binomial random variable

Problem of the Month: Fair Games

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 13. Random Variables: Distribution and Expectation

SCHOOL OF ENGINEERING & BUILT ENVIRONMENT. Mathematics

Section 6.2 Definition of Probability

AP Statistics 7!3! 6!

Chapter 4 - Practice Problems 2

AP Stats - Probability Review

Probability: Terminology and Examples Class 2, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

The overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS NUMBER OF TOSSES

36 Odds, Expected Value, and Conditional Probability

Section 6.1 Discrete Random variables Probability Distribution

Decision Making Under Uncertainty. Professor Peter Cramton Economics 300

Problem sets for BUEC 333 Part 1: Probability and Statistics

Statistics and Random Variables. Math 425 Introduction to Probability Lecture 14. Finite valued Random Variables. Expectation defined

(b) You draw two balls from an urn and track the colors. When you start, it contains three blue balls and one red ball.

STAT 35A HW2 Solutions

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Find the indicated probability. 1) If a single fair die is rolled, find the probability of a 4 given that the number rolled is odd.

Random variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.

Name: Date: Use the following to answer questions 2-4:

Bayesian Updating with Discrete Priors Class 11, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Lecture 13. Understanding Probability and Long-Term Expectations

Statistics 100A Homework 3 Solutions

AMS 5 CHANCE VARIABILITY

Ch. 13.3: More about Probability

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

Unit 4 The Bernoulli and Binomial Distributions

Pattern matching probabilities and paradoxes A new variation on Penney s coin game

ACMS Section 02 Elements of Statistics October 28, Midterm Examination II

Math 141. Lecture 2: More Probability! Albyn Jones 1. jones/courses/ Library 304. Albyn Jones Math 141

6.3 Conditional Probability and Independence

Name Please Print MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

An Introduction to Basic Statistics and Probability

PROBABILITY. The theory of probabilities is simply the Science of logic quantitatively treated. C.S. PEIRCE

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Probability: The Study of Randomness Randomness and Probability Models. IPS Chapters 4 Sections

Statistics 100A Homework 4 Solutions

Section 7C: The Law of Large Numbers

ACMS Section 02 Elements of Statistics October 28, 2010 Midterm Examination II Answers

Probability. a number between 0 and 1 that indicates how likely it is that a specific event or set of events will occur.

ST 371 (IV): Discrete Random Variables

In the situations that we will encounter, we may generally calculate the probability of an event

Formula for Theoretical Probability

Definition and Calculus of Probability

Fundamentals of Probability

CHAPTER 2 Estimating Probabilities

PROBABILITY. Chapter. 0009T_c04_ qxd 06/03/03 19:53 Page 133

Book Review of Rosenhouse, The Monty Hall Problem. Leslie Burkholder 1

A review of the portions of probability useful for understanding experimental design and analysis.

psychology and economics

Probabilistic Strategies: Solutions

2 What is Rational: Normative. 4 Lottery & Gambling

STA 371G: Statistics and Modeling

For two disjoint subsets A and B of Ω, say that A and B are disjoint events. For disjoint events A and B we take an axiom P(A B) = P(A) + P(B)

Statistics in Geophysics: Introduction and Probability Theory

Elementary Statistics and Inference. Elementary Statistics and Inference. 16 The Law of Averages (cont.) 22S:025 or 7P:025.

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

6.042/18.062J Mathematics for Computer Science. Expected Value I

Exam 3 Review/WIR 9 These problems will be started in class on April 7 and continued on April 8 at the WIR.

Basic Probability. Probability: The part of Mathematics devoted to quantify uncertainty

Session 7 Bivariate Data and Analysis

HONORS STATISTICS. Mrs. Garrett Block 2 & 3

A probability experiment is a chance process that leads to well-defined outcomes. 3) What is the difference between an outcome and an event?

Activities/ Resources for Unit V: Proportions, Ratios, Probability, Mean and Median

Solutions to Homework 6 Statistics 302 Professor Larget

( ) = 1 x. ! 2x = 2. The region where that joint density is positive is indicated with dotted lines in the graph below. y = x

Review for Test 2. Chapters 4, 5 and 6

Example. A casino offers the following bets (the fairest bets in the casino!) 1 You get $0 (i.e., you can walk away)

E3: PROBABILITY AND STATISTICS lecture notes

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Business Statistics 41000: Probability 1

EXAM. Exam #3. Math 1430, Spring April 21, 2001 ANSWERS

Introduction to Hypothesis Testing

Probability. Sample space: all the possible outcomes of a probability experiment, i.e., the population of outcomes

If, under a given assumption, the of a particular observed is extremely. , we conclude that the is probably not

Random Variables. Chapter 2. Random Variables 1

6th Grade Lesson Plan: Probably Probability

Introductory Probability. MATH 107: Finite Mathematics University of Louisville. March 5, 2014

Transcription:

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis 4th Class 6/10/10 There are two times in a man's life when he should not speculate: when he can't afford it, and when he can. -- Mark Twain summarizing our important results so far: 1. addition law of probability: Pr(A 1 A 2 ) = Pr(A 1 ) + Pr(A 2 ) - Pr(A 1 A 2 ) 2. finding probability of an event by finding the probability of its complement: Pr(E) = 1 - Pr( E ) 3. definition of conditional probability: Pr(A 2 A 1 ) = Pr(A 1 A 2 ) / Pr(A 1 ) 4. multiplication law of probability: Pr(A 1 A 2 ) = Pr(A 2 )Pr(A 1 A 2 ) = Pr(A 1 )Pr(A 2 A 1 ) 5. importance of the assumption of independence for simplifying multiplication and addition laws: if A 1 and A 2 are independent, so: Pr(A 1 A 2 ) = Pr(A 1 ) and Pr(A 2 A 1 ) = Pr(A 2 ), then: Pr(A 1 A 2 ) = Pr(A 1 ) * Pr(A 2 ) Pr(A 1 A 2 ) = Pr(A 1 ) + Pr(A 2 ) - Pr(A 1 ) * Pr(A 2 ) 4-1

clarify the relationship between mutual exclusiveness, where (Pr(A 1 A 2 ) = 0, and independence define statistical dependence as opposite of independence: Pr(A 1 A 2 ) Pr(A 1 ) and Pr(A 2 A 1 ) Pr(A 2 ) if two events are mutually exclusive they must be dependent: Pr(A 2 A 1 ) = (Pr(A 1 A 2 ) / Pr(A 1 ) = 0 Pr(A 1 ) = 0 but 0 Pr(A 2 ) in general so Pr(A 2 A 1 ) Pr(A 2 ) so if two events are independent they must not be mutually exclusive but if two events are dependent they need not be mutually exclusive Let me do a problem to illustrate the three cases that can occur: consider 3 tosses of a fair coin. Eight outcomes are possible: e1 e2 e3 e4 e5 e6 e7 e8 HHH HHT HTH HTT THH THT TTH TTT we can show the sample space graphically: 4-2

e 1 e 2 e 3 e 4 e 5 e 6 e 7 e 8 consider three cases: 1) A1 and A2 are mutually exclusive and therefore dependent A1 = all heads Pr(A1) = 1 8 A2 = all tails Pr(A2) = 1 8 A 1 e 1 e 2 e 3 e 4 e 5 e 6 e 7 e 8 A 2 Pr(A1 A2) = 0 Pr(A2 A1) = 0 2) A1 and A2 are dependent, but not mutually exclusive A1 = all heads Pr(A1) = 1 8 4-3

A2 = at least one head Pr(A2) = 7 8 A 1 e e e e A 1 2 3 4 2 e 5 e 6 e 7 e 8 Pr(A1 A2) = 1 7 Pr(A2 A1) = 1 3) A1 and A2 are independent A1 = first coin is heads Pr(A1) = 1 2 A2 = second coin is heads Pr(A2) = 1 2 A 1 e 1 e 2 e 3 e 4 e 5 e 6 e 7 e 8 A 2 Pr(A1 A2) = 1 2 = Pr(A 1) Pr(A2 A1) = 1 2 = Pr(A 2) 4-4

Now do a few problems to illustrate use of probability: for fans of probability questions, Ask Marilyn has tons of them. for example, from her column of Nov. 14, 1993: Question: There are five cars on display as prizes, and their five ignition keys are in a box. You get to pick one key out of the box and try it in the ignition of one car. If it fits, you win the car. What are your chances of winning a car? answer: 1/5. You may have been tempted to say 1/25, which is the probability that you win the red Ferrari secretly preferred by a member of the audience, but if you prefer the red Ferrari, the probability switches back to 1/5. Question: Consider the following five events in Reno-style Roulette (36 numbers divided evenly between red and black, plus 2 green numbers, 0 and 00): A is the event that the wheel comes up green on the first spin B is the event that the wheel comes up black on the first spin C is the event that the wheel comes up green or black on the first spin D is the event that the wheel comes up green or red on the first spin E is the event that the wheel comes up green on the second spin. a) which two of these events are complements? answer: B and D b) which two of these events are mutually exclusive but not complements? answer: A and B c) which two of these events are independent? answer: E and any of the others d) which one of these events is the union of two of the other four? which two? answer: C is the union of A and B e) which one of these events is the intersection of two of the other four? which two? answer: A is the intersection of C and D Question: Suppose that 33% of all pets wear collars, that 60% of all pets are cats, and that 81% of all pets are cats or wear collars. a) What percentage of pets are cats who wear collars? answer: Pr(A 1 A 2 ) = Pr(A 1 ) + Pr(A 2 ) - Pr(A 1 A 2 ) rearranging, Pr(A 1 A 2 ) = Pr(A 1 ) + Pr(A 2 ) - Pr(A 1 A 2 ) If A 1 is the event of collar-wearing and A 2 is the event of cat-being, then 4-5

Pr(collar-wearing cat) =.33 +.60 -.81 =.12 or 12% b) what percentage of cats wear collars? answer:.12.60 =.20 or 20% rework cats and collars problem showing it in a 2x2 table: cat collar y n y.12.48.60 n.21.19.40.33.67 you only know the.33 and.60 and.81, from these you get the marginal distributions and knowing.81 gets you 1 -.81 =.19 Question: You have a sample containing 40% men and 60% women. Suppose that 6% of the men are colorblind and 1% of the women are colorblind. a) what is the probability that a person selected at random is colorblind? answer: can set up a 2x2 table to show how the sample breaks into 4 groups: sex male female not colorblind 37.6% 59.4% colorblind 2.4% 0.6% so Pr(colorblind) = 3% b) if a person selected at random is colorblind, what is the probability that this person is a man? answer: Pr(man colorblind) = 2.4 3 = 80% Question: In an episode of the TV show Star Trek entitled The Empath, some aliens are holding a few members of the crew of the Starship Enterprise captive. One of these aliens states that the probability is 87% that Dr. McCoy will die, while the probability is 4-6

93% that Mr. Spock will suffer severe brain damage. What is the probability that neither event will occur? Guess what actually does happen. answer: Pr(neither occurs) =.13*.07 =.0091 or.9% Question: On the TV game show Jeopardy on April 3, 1985, the final question had four possible answers. All three contestants had absolutely no idea, so each had to guess at random and independently of the other contestants. What is the probability that at least one contestant would get the right answer? answer: Pr(no one gets it) = ( 3 4 )3 =.42, so Pr at least one gets it = 1 -.42 =.58 or 58% (the question was, In a deck of cards, which one of the four suits does not have a oneeyed face card? The answer is clubs, and none of the contestants got it right.) Important application of probability theory to consideration of rationality, which is after all an underpinning of economic theory: Consider relationship of probability theory to people s intuitive understanding of likelihood Question: Linda is 31, single, outspoken, and very bright. She majored in philosophy in college. As a student, she was deeply concerned with discrimination and other social issues, and participated in anti-nuclear demonstrations. Which statement is more likely: a. Linda is a bank teller b. Linda is a bank teller and active in the feminist movement answer: as a principle of probability, the likelihood of any two uncertain events happening together is always less than the odds of either happening alone, so a. is more likely. Principle: the problem-solving method of representativeness, in which the mind compares an event to a mental model to decide how likely the event is, can lead you astray. Question: In a typical English text, does the letter k appear more often as the first letter in a word or the third letter? 4-7

answer: k appears about twice as often in the third position. Most people judge that k is commoner at the beginning of words because it is easier to recall words that begin with a letter than those that contain it somewhere in the middle Principle: the problem-solving method/mental short cut of availability, which occurs when people judge the likelihood of something happening by how easily they can call other examples of the same thing to mind, can lead you astray. Question: A cab was involved in a hit-and-run accident. Two cab companies serve the city: the Green, which operates 85 percent of the cabs, and the Blue, which operates the remaining 15 percent. A witness identifies the hit-and-run cab as Blue. When the court tests the reliability of the witness under circumstances similar to those on the night of the accident, he correctly identifies the color of a cab 80 percent of the time and misidentifies it the other 20 percent. What s the probability that the cab involved in the accident was Blue, as the witness stated? answer: can show this application of Bayes Theorem using a tree form:.80 green =.68.85 green so 71% green.20 blue =.17.20 green =.03.15 blue and 29% blue.80 blue =.12 100% of sample So Pr(blue blue ) =.12.29 =.41 or 41% Principle: it is important to consider the base rate, or background data, against which the probability of an event must be judged (i.e. need to apply Bayes Theorem and take prior probability into account). Bayes Theorem 4-8

idea is to take into account prior knowledge of the distribution of outcomes along with data regarding outcomes in arriving at a posterior probability for an event Let s do another problem before proceeding to a formal statement of it: Question: Suppose that 80% of Econ 300 students study for their first quiz, that 90% of those who study pass the quiz, and that 40% of those who do not study pass the quiz. A student chosen at random is found to have failed: what is the probability that this student studied? answer:.90 pass =.72.80 study so 80% pass.10 fail =.08.40 pass =.08.20 don t study and 20% fail.60 fail =.12 100% of sample So Pr(studied failed) =.08.20 =.40 or 40% this is a very old result, Rev. Thomas Bayes (1702-61) having had this result published posthumously in 1763. formal statement of Bayes Theorem: Let the events A 1,, A k form a partition of the space S such that Pr(A j ) > 0 for j = 1,,k, and let B be any event such that Pr(B) > 0. Then for i = 1,,k, Pr(A i B) = k j =1 Pr(A i )Pr(B A i ) Pr(A j ) Pr(B A j ) Proof: by the definition of conditional probability, Pr(A i B) = Pr(A i B) / Pr(B) 4-9

of which the numerator Pr(A i B) = Pr(A i )Pr(B A i ) and the denominator Pr(B) = k Pr(A j )Pr(B A j ). Q.E.D. j=1 Problem: three different machines, M 1, M 2, and M 3, are used to produce a large batch of manufactured items. Suppose that 20% were produced by M 1, 30% by M 2, and 50% by M 3. Suppose further that 1% of the items produced by M 1 are defective, 2% of those produced by M 2 are defective, and 3% of those produced by M 3 are defective. Finally, suppose that one item is selected at random from the entire batch and is found to be defective. Determine the probability that this item was produced by machine M 2. answer: Let A i be the event that the selected item was produced by machine M i (i = 1,2,3) and let B be the event that the selected item is defective. We are attempting to calculate the conditional probability Pr(A 2 B). From the problem s conditions, Pr(A 1 ) =.2, Pr(A 2 ) =.3, Pr(A 3 ) =.5 Also, the probabilities Pr(B A i ) that an item produced by machine M i is defective are: Pr(B A 1 ) =.01, Pr(B A 2 ) =.02, Pr(B A 3 ) =.03 Then it follows from Bayes Theorem that: Pr(A 2 B) = 3 j =1 Pr(A 2 )Pr(B A 2 ) Pr(A j ) Pr(B A j ) = (.3)(.02) (.2)(.01) + (.3)(.02) + (.5)(.03) =.006.002 +.006 +.015 = 6 23 =.26 Note that Pr(A 2 B) is lower than Pr(A 2 ). A probability like Pr(A 2 ) is often called the prior probability that the selected item has been produced by machine M 2 and a probability like Pr(A 2 B) is called the posterior probability that the selected item was produced by M 2 because it is the probability of this event after it is known that the selected item is defective. Probability distributions change our terminology slightly now a discrete random variable X takes on various values x with probabilities specified by its probability distribution p(x), so Pr(X=1) = p(1) 4-10

reduction of sample space: care about numbers, not order of events, for most problems can add up relevant probabilities p(x) to answer new questions: e.g., Pr(X < 2) = p(0) + p(1) can graph x vs. p(x) x on horizontal axis, p(x) on vertical axis [draw graph], where the heights of the lines at each discrete value of x sum up to 1 (so p(x) is bounded between 0 and 1 inclusive) We now distinguish between the mean and variance of a random variable from its probability distribution and the mean and variance of a sample of observations. The mean is known more generally as the first moment and the variance is known as the second moment: mean: X = x f n sample moments population moments µ = xp(x) variance: s 2 MSD = (x X) 2 f n σ 2 = (x µ) 2 p(x) Note. occasionally one also uses the third moment, also known as skewness, which tells whether there is more weight in the left or right tail (if positive, the distribution is skewed to the right, i.e., has a longer right tail; if negative, it is skewed to the left, i.e., has a longer left tail): (x µ) 3 p(x) ; and the fourth moment, also known as kurtosis, which tells how much weight is in the tail as opposed to the center of the distribution (gives an idea of flatness vs. spikeness of distribution the closer to 0, the more lighttailed it is): (x µ) 4 p(x). These are shown in the population moment form; they have sample moment analogies as well. platykurtic thinner tails (flatter); mesokurtic like a normal distribution; leptokurtic thicker tails (more spiked) Important simplification of the calculation of population variance: σ 2 = x 2 p(x) - µ 2 4-11

We will also start using the expected value symbol now: for the random variable X, µ X = mean of X = average of X = expected value of X, or E(X) so E(X) = xp(x) This carries over into functions of X: if G = g(x), then E(G) = E[g(X)] = g(x)p(x) note one possible function G = g(x) = (X - µ) 2 so E[(X - µ) 2 ] = (x µ) 2 p(x) = σ 2 X and σ 2 can be calculated as σ 2 = E(X 2 ) - µ 2 [see handout for full proof] If G is a linear function of X: G = a + bx then the formulas for µ G and σ G are simple: µ G = a + bµ X σ G = b σ X If G is nonlinear, have to use the general formulas to calculate µ G and σ G : µ G = g(x)p(x) 4-12

σ 2 G = (g(x) µ G ) 2 p(x) For tomorrow, finish reading Chapter 4. 4-13

Elementary Rules of Expectations 1. E(k) = k (where k is any number; i.e., the expectation of a constant is the constant) 2. E(k*X) = k*e(x) (where k is any number) 3. E(X + Y) = E(X) + E(Y) Using these rules one can show that: σ2 = E[(X - µ)2] = E(X2) - µ2 Proof: E[(X - µ)2] = E[(X - µ)*x - (X - µ)*µ] = E[X2 - µ*x - X*µ + µ2] = E(X2) - E(µ*X) - E(X*µ) + E(µ2) [Rule 3] = E(X2) - µ*e(x) - µ*e(x) + µ2 [by Rules 1 and 2 and the fact that µ is a constant] = E(X2) - µ2 - µ2 + µ2 [since E(X) = µ] = E(X2) - µ2 4-14