1 Probability, Conditional Probability and Bayes Formula
|
|
|
- Ross Hart
- 9 years ago
- Views:
Transcription
1 ISyE8843A, Brani Vidakovic Handout 1 1 Probability, Conditional Probability and Bayes Formula The intuition of chance and probability develops at very early ages. 1 However, a formal, precise definition of the probability is elusive. If the experiment can be repeated potentially infinitely many times, then the probability of an event can be defined through relative frequencies. For instance, if we rolled a die repeatedly, we could construct a frequency distribution table showing how many times each face came up. These frequencies (n i ) can be expressed as proportions or relative frequencies by dividing them by the total number of tosses n : f i = n i /n. If we saw six dots showing on 107 out of 600 tosses, that face s proportion or relative frequency is f 6 = 107/600 = As more tosses are made, we expect the proportion of sixes to stabilize around 1 6. Famous Coin Tosses: Buffon tossed a coin 4040 times. Heads appeared 2048 times. K. Pearson tossed a coin times and times. The heads appeared 6019 times and 12012, respectively. For these three tosses the relative frequencies of heads are , ,and What if the experiments can not be repeated? For example what is probability that Squiki the guinea pig survives its first treatment by a particular drug. Or the experiment of you taking ISyE8843 course in Fall It is legitimate to ask for the probability of getting a grade of an A. In such cases we can define probability subjectively as a measure of strength of belief. Figure 1: A gem proof condition 1913 Liberty Head nickel, one of only five known and the finest of the five. Collector Jay Parrino of Kansas City bought the elusive nickel for a record $1,485,000, the first and only time an American coin has sold for over $1 million. Tutubalin s Problem. In a desk drawer in the house of Mr Jay Parrino of Kansas City there is a coin, 1913 Liberty Head nickel. What is the probability that the coin is heads up? The symmetry properties of the experiment lead to the classical definition of probability. An ideal die is symmetric. All sides are equiprobable. The probability of 6, in our example is a ratio of the number of favorable outcomes (in our example only one favorable outcome, namely, 6 itself) and the number of all possible outcomes, 1/ Piaget, J. and Inhelder B. The Origin of the Idea of Chance in Children, W. W. Norton & Comp., N.Y. 2 This definition is attacked by philosophers because of the fallacy called circulus vitiosus. One defines the notion of probability supposing that outcomes are equiprobable. 1
2 (Frequentist) An event s probability is the proportion of times that we expect the event to occur, if the experiment were repeated a large number of times. (Subjectivist) A subjective probability is an individual s degree of belief in the occurrence of an event. (Classical) An event s probability is the ratio of the number of favorable outcomes and possible outcomes in a (symmetric) experiment. Term Description Example Experiment Sample space Event Probability Phenomenon where outcomes are uncertain Set of all outcomes of the experiment A collection of outcomes; a subset of S A number between 0 and 1 assigned to an event. Single throws of a six-sided die S = {1, 2, 3, 4, 5, 6}, (1, 2, 3, 4, 5, or 6 dots show) A = {3} (3 dots show), B = {3, 4, 5, or 6} (3, 4, 5, or 6 dots show) or at least three dots show P (A) = 1 6. P (B) = 4 6. Sure event occurs every time an experiment is repeated and has the probability 1. Sure event is in fact the sample space S. An event that never occurs when an experiment is performed is called impossible event. The probability of an impossible event, denoted usually by is 0. For any event A, the probability that A will occur is a number between 0 and 1, inclusive: 0 P (A) 1, P ( ) = 0, P (S) = 1. The intersection (product) A B of two events A and B is an event that occurs if both events A and B occur. The key word in the definition of the intersection is and. In the case when the events A and B are independent the probability of the intersection is the product of probabilities: P (A B) = P (A)P (B). Example: The outcomes of two consecutive flips of a fair coin are independent events. Events are said to be mutually exclusive if they have no outcomes in common. In other words, it is impossible that both could occur in a single trial of the experiment. For mutually exclusive events holds P (A B) = P ( ) = 0. 2
3 In the die-toss example, events A = {3} and B = {3, 4, 5, 6} are not mutually exclusive, since the outcome {3} belongs to both of them. On the other hand, the events A = {3} and C = {1, 2} are mutually exclusive. The union A B of two events A and B is an event that occurs if at least one of the events A or B occur. The key word in the definition of the union is or. For mutually exclusive events, the probability that at least one of them occurs is P (A C) = P (A) + P (C) For example, if the probability of event A = {3} is 1/6, and the probability of the event C = {1, 2} is 1/3, then the probability of A or C is P (A C) = P (A) + P (C) = 1/6 + 1/3 = 1/2. The additivity property is valid for any number of mutually exclusive events A 1, A 2, A 3,... : P (A 1 A 2 A 3... ) = P (A 1 ) + P (A 2 ) + P (A 3 ) +... What is P (A B) if the events A and B are not mutually exclusive. For any two events A and B, the probability that either A or B will occur is given by the inclusion-exclusion rule P (A B) = P (A) + P (B) P (A B) If the events A abd B are exclusive, then P (A B) = 0, and we get the familiar formula P (A B) = P (A) + P (B). The inclusion-exclusion rule can be generalized to unions of arbitrary number of events. For example, for three events A, Ba and C, the rule is: P (A B C) = P (A) + P (B) + P (C) P (A B) P (A C) P (B C) + P (A B C). For every event defined on S, we can define a counterpart-event called its complement. The complement A c of an event A consists of all outcomes that are in S, but are not in A. The key word in the definition of an complement is not. In our example, A c consists of the outcomes: {1, 2, 3, 4, 5}. The events A and A c are mutually exclusive by definition. Consequently, P (A A c ) = P (A) + P (A c ) Since we also know from the definition of A c that it includes all the events in the sample space, S, that are not in A, so P (A) + P (A c ) = P (S) = 1 For any complementary events A and A c, P (A) + P (A c ) = 1, P (A) = 1 P (A c ), P (A c ) = 1 P (A) 3
4 These equations simplify solutions of some probability problems. If P (A c ) is easier to calculate than P (A), then P (A c ) and equations above let us obtain P (A) indirectly. This and some other properties of probability are summarized in table below. Property Notation If event S will always occur, its probability is 1. P (S) = 1 If event will never occur, its probability is 0. P ( ) = 0 Probabilities are always between 0 and 1, inclusive 0 P (A) 1 If A, B, C,... are all mutually exclusive then P (A B C... ) can be found by addition. If A and B are mutually exclusive then P (A B) can be found by addition. Addition rule: The general addition rule for probabilities Since A and A c are mutually exclusive and between them include all possible outcomes, P (A A c ) is 1. P (A B C... ) = P (A) + P (B) + P (C) +... P (A B) = P (A) + P (B) P (A B) = P (A) + P (B) P (A B) P (A A c ) = P (A) + P (A c ) = P (S) = 1, and P (A c ) = 1 P (A) 2 Conditional Probability and Independence A conditional probability is the probability of one event if another event occurred. In the die-toss example, the probability of event A, three dots showing, is P (A) = 1 6 on a single toss. But what if we know that event B, at least three dots showing, occurred? Then there are only four possible outcomes, one of which is A. The probability of A = {3} is 1 4, given that B = {3, 4, 5, 6} occurred. The conditional probability of A given B is written P (A B). P (A B) = P (A B) P (B) Event A is independent of B if the conditional probability of A given B is the same as the unconditional probability of A. That is, they are independent if P (A B) = P (A) In the die-toss example, P (A) = 1 6 and P (A B) = 1 4, so the events A and B are not independent. 4
5 The probability that two events A and B will both occur is obtained by applying the multiplication rule: P (A B) = P (A)P (B A) = P (B)P (A B) where P (A B) (P (B A)) means the probability of A given B (B given A). For independent events only, the equation in the box simplifies to P (A B) = P (A)P (B). Prove P (A 1 A 2... A n ) = P (A 1 A 2... A n ) P (A 2 A 3... A n )... P (A n 1 A n ) P (A n ). Example: Let the experiment involves a random draw from a standard deck of 52 playing cards. Define events A and B to be the card is and the card is queen. Are the events A and B independent? By definition, P (A&B) = P (Q ) = This is the product of P ( ) = 52 and P (Q) = 52, and A and B in question are independent. The intuition barely helps here. Pretend that from the original deck of cards 2 is excluded prior to the experiment. Now the events A and B become dependent since P (A) P (B) = = P (A&B). The multiplication rule tells us how to find probabilities for composite event (A B). The probability of (A B) is used in the general addition rule for finding the probability of (A B). Rule Definitions The conditional probability of A given B is the probability of event A, if event B occurred. A is independent of B if the conditional probability of A given B is the same as the unconditional probability of A. Multiplication rule: The general multiplication rule for probabilities For independent events only, the multiplication rule is simplified. Notation P (A B) P (A B) = P (A) P (A B) = P (A)P (B A) = P (B)P (A B) P (A B) = P (A)P (B) 3 Pairwise and Global Independence If three events A, B, and C are such that any of the pairs are exclusive, i.e., AB =, AC = or BC =, then the events are mutually exclusive, i.e, ABC =. In terms of independence the picture is different. Even if the events are pairwise independent for all three pairs A, B; A, C; and B, C, i.e., P (AB) = P (A)P (B), P (AC) = P (A)P (C), and P (BC) = P (B)P (C), they may be dependent in the totality, P (ABC) P (A)P (B)P (C). Here is one example. 5
6 The four sides of a tetrahedron (regular three sided pyramid with 4 sides consisting of isosceles triangles) are denoted by 2, 3, 5 and 30, respectively. If the tetrahedron is rolled the number on the basis is the outcome of interest. Let the three events are A-the number on the base is even, B-the number is divisible by 3, and C-the number is divisible by 5. The events are pairwise independent, but in totality dependent. Algebra is clear here, but what is the intuition. The trick is that the events AB, AC, BC and ABC all coincide. That is P (A BC) = 1 although P (A B) = P (A C) = P (A). The concept of independence (dependence) is not transitive. At first glance that may seem not correct, as one may argue as follows, If event A depends on B, and event B depends on C, then event A should depend on the event C. A simple exercise proves the above statement wrong. Take a standard deck of 52 playing cards and replace the Q with Q. Thus the deck now still has 52 cards, two Q and no Q. From such a deck draw a card at random and consider three events: A-the card is queen, B-the card is red, and C- the card is. It ie easy to see that A and B are dependent since P (A&B) = 3/52 P (A) P (B) = 4/52 27/52. The events B and C are dependent as well since the event C is contained in B, and P (BC) = P (C) P (B) P (C). However, the events A and C are independent, since P (AC) = P ( Q) = 1/52 = P (A)P (C) = 13/53 4/52. 4 Total Probability Events H 1, H 2,..., H n form a partition of the sample space S if (i) They are mutually exclusive (H i H j =, i j) and (ii) Their union is the sample space S, n i=1 H i = S. The events H 1,..., H n are usually called hypotheses and from their definition follows that P (H 1 ) + + P (H n ) = 1 (= P (S)). Let the event of interest A happens under any of the hypotheses H i with a known (conditional) probability P (A H i ). Assume, in addition, that the probabilities of hypotheses H 1,..., H n are known. Then P (A) can be calculated using the total probability formula. Total Probability Formula. P (A) = P (A H 1 )P (H 1 ) + + P (A H n )P (H n ). The probability of A is the weighted average of the conditional probabilities P (A H i ) with weights P (H i ). Stanley. Stanley takes an oral exam in statistics by answering 3 questions from an examination card drawn at random from the set of 20 cards. There are 8 favorable cards (Stanley knows answers on all 3 questions). Stanley will get a grade A if he answers all 3 questions. What is the probability for Stanley to get an A if he draws the card (a) first 6
7 (b) second (c) third? Solution: Denote with A the event that Stanley draws a favorable card (and consequently gets an A). (i) If he draws the card first, then clearly P (A) = 8/20 = 2/5. (ii) If Stanley draws the second, then one card was taken by the student before him. That first card taken might have been favorable (hypothesis H 1 ) or unfavorable (hypothesis H 2 ). Obviously, the hypotheses H 1 and H 2 partition the sample space since no other type of cards is possible, in this context. Also, the probabilities of H 1 and H 2 are 8/20 and 12/20, respectively. Now, after one card has been taken Stanley draws the second. If H 1 had happened, probability of A is 7/19, and if H 2 had happened, the probability of A is 8/19. Thus, P (A H 1 ) = 7/19 and P (A H 2 ) = 8/19. By the total probability formula, P (A) = 7/19 8/20 + 8/19 12/20 = 8/20 = 2/5. (iii) Stanley has the same probability of getting an A after two cards have been already taken. The hypotheses are H 1 ={ both cards taken favorable }, H 2 ={ exactly one card favorable }, and H 3 ={ none of the cards taken favorable }. P (H 1 ) = 8/20 7/19, P (H 3 ) = 12/20 11/19. and P (H 2 ) = 1 P (H 1 ) P (H 3 ). Next, P (A H 1 ) = 6/18, P (A H 2 ) = 7/18, and P (A H 3 ) = 8/18. Therefore, P (A) = 6/18 7/19 8/20+ 7/ /18 11/19 12/20 = 12/20. Moral: Stanley s lack on the exam does not depend on the order in drawing the examination card. Two-headed coin Out of 100 coins one has heads on both sides. One coin is chosen at random and flipped two times. What is the probability to get (a) two heads? (b) two tails? Solution: (a) Let A be the event that two heads are obtained. Denote by H 1 the event (hypothesis) that a fair coin was chosen. The hypothesis H 2 = H1 c is the event that the two-headed coin was chosen. P (A) = P (A H 1 )P (H 1 ) + P (A H 2 )P (H 2 ) = 1/4 99/ /100 = 103/400 = (b) Exercise. [Ans ] 5 Bayes Formula Recall that multiplication rule claims: P (AH) = P (A)P (H A) = P (H)P (A H). This simple identity is the essence of Bayes Formula. 7
8 Bayes Formula. Let the event of interest A happens under any of hypotheses H i with a known (conditional) probability P (A H i ). Assume, in addition, that the probabilities of hypotheses H 1,..., H n are known (prior probabilities). Then the conditional (posterior) probability of the hypothesis H i, i = 1, 2,..., n, given that event A happened, is where P (H i A) = P (A H i)p (H i ), P (A) P (A) = P (A H 1 )P (H 1 ) + + P (A H n )P (H n ). Assume that out of N coins in a box, one has heads at both sides. Such two-headed coin can be purchased in Spencer stores. Assume that a coin is selected at random from the box, and without inspecting it, flipped k times. All k times the coin landed up heads. What is the probability that two headed coin was selected? Denote with A k the event that randomly selected coin lands heads up k times. The hypotheses are H 1 -the coin is two headed, and H 2 the coin is fair. It is easy to see that P (H 1 ) = 1/N and P (H 2 ) = (N 1)/N. The conditional probabilities are P (A k H 1 ) = 1 for any k, and P (A k H 2 ) = 1/2 k. By total probability formula, and P (A k ) = 2k + N 1 2 k, N P (H 1 A k ) = 2 k 2 k + N 1. For N = 1, 000, 000 and k = 1, 2,..., 30 the graph of posterior probabilities is given in Figure 2 It is interesting that our prior probability P (H 1 ) = jumps to posterior probability of , after observing 30 heads in a row. The matlab code bayes1 1.m producing Figure 2 is given in the Programs/Codes on the GTBayes Page. Prosecutor s Fallacy The prosecutor s fallacy is a fallacy commonly occurring in criminal trials but also in other various arguments involving rare events. It consists of subtle exchanging of P (A B) for P (B A). A zealous prosecutor has collected an evidence, say fingerprint match, and has an expert testify that the probability of finding this evidence if the accused were innocent is tiny. The fallacy is committed the prosecutor proceeds to claim that the probability of the accused being innocent is comparably tiny. Why is this incorrect? Suppose there is a one-in-a-million chance of a match given that the accused is innocent. The prosector deduces that means there is only a one-in-a-million chance of innocence. But in a community of 10 million people, one expects 10 matches, and the accused is just one of those ten. That would indicate only a one-in-ten chance of guilt, if no other evidence is available. 8
9 posterior probability number of flips Figure 2: Posterior probability of the two-headed coin for N = 1, 000, 000 if k heads appeared. The fallacy thus lies in the fact that the a priori probability of guilt is not taken into account. If this probability is small, then the only effect of the presented evidence is to increase that probability somewhat, but not necessarily dramatically. As a real-life example, consider the case of Sally Clark [1], who was accused in 1998 of having killed her first child at 11 weeks of age, then conceived another child and killed it at 8 weeks of age. The prosecution had expert witness testify that the probability of two children dying from sudden infant death syndrome is about 1 in 73 million. To provide proper context for this number, the probability of a mother killing one child, conceiving another and killing that one too, should have been estimated and compared to the 1 in 73 million figure, but it wasn t. Ms. Clark was convicted in 1999, resulting in a press release by the UK Royal Statistical Society which pointed out the mistake. Sally Clark s conviction was eventually quashed on other grounds on appeal on 29th January In another scenario, assume a burglary has been committed in a town, and 10,000 men in the town have their fingerprints compared to a sample from the crime. One of these men has matching fingerprint, and at his trial, it is testified that the probability that two fingerprint profiles match by chance is only 1 in 20,000. This does not mean the probability that the suspect is innocent is 1 in 20,000. Since 10,000 men were taken fingerprints, there were 10,000 opportunities to find a match by chance; the probability that there was at least ) ( 1 ) 0 ( , 20000) about 39% considerably more than 1 in ) ( 1 ) 1 ( ) one fingerprint match is 1 ( ,000. (The probability that exactly one of the 10,000 men has a match is ( or about 30%, which certainly casts a reasonable doubt.) Return to our example with N coins. Assume that out of N = coins in a box, one has heads at both sides. Such two-headed coin is guilty. Assume that a coin is selected at random from the box, and without inspecting it, flipped k = 15 times. All k = 15 times the coin landed up heads. Based on this evidence the prosecutor claims the selected coin is guilty since if it was innocent, observed k = 15 heads in a row will appear with probability But the probability that the guilty coin was really selected is 0.03 and prosecutor is accusing an innocent coin with probability of /2 15 approximately In legal terms, the prosecutor is operating in terms of a presumption of guilt, something which is contrary to the normal presumption of innocence where a person is assumed to be innocent unless found guilty. 9
10 Two Masked Robbers. Two masked robbers try to rob a crowded bank during the lunch hour but the teller presses a button that sets off an alarm and locks the front door. The robbers realizing they are trapped, throw away their masks and disappear into the chaotic crowd. Confronted with 40 people claiming they are innocent, the police gives everyone a lie detector test. Suppose that guilty people are detected with probability 0.85 and innocent people appear to be guilty with probability What is the probability that Mr. Smith was one of the robbers given that the lie detector says he is? Guessing. Subject in an experiment are told that either a red or a green light will flash. Each subject is to guess which light will flash. The subject is told that the probability of a red light is 0.7, independent of guesses. Assume that the subject is a probability matcher- that is, guesses red with probability.70 and green with probability.30. (i) What is the probability that the subject guesses correctly? (ii) Given that a subject guesses correctly, what is the probability that the light flashed red? False Positives. False positives are a problem in any kind of test: no test is perfect, and sometimes the test will incorrectly report a positive result. For example, if a test for a particular disease is performed on a patient, then there is a chance (usually small) that the test will return a positive result even if the patient does not have the disease. The problem lies, however, not just in the chance of a false positive prior to testing, but determining the chance that a positive result is in fact a false positive. As we will demonstrate, using Bayes theorem, if a condition is rare, then the majority of positive results may be false positives, even if the test for that condition is (otherwise) reasonably accurate. Suppose that a test for a particular disease has a very high success rate: if a tested patient has the disease, the test accurately reports this, a positive, 99% of the time (or, with probability 0.99), and if a tested patient does not have the disease, the test accurately reports that, a negative, 95% of the time (i.e. with probability 0.95). Suppose also, however, that only 0.1% of the population have that disease (i.e. with probability 0.001). We now have all the information required to calculate the probability that, given the test was positive, that it is a false positive. Let D be the event that the patient has the disease, and P be the event that the test returns a positive result. The probability of a true positive is P (D P ) = , and hence the probability of a false positive is about ( ) = Despite the apparent high accuracy of the test, the incidence of the disease is so low (one in a thousand) that the vast majority of patients who test positive (98 in a hundred) do not have the disease. Nonetheless, this is 20 times the proportion before we knew the outcome of the test! The test is not useless, and retesting may improve the reliability of the result. In particular, a test must be very reliable in reporting a negative result when the patient does not have the disease, if it is to avoid the problem of false positives. In mathematical terms, this would ensure that the second term in the denominator of the above calculation is small, relative to the first term. For example, if the test reported a negative result in patients without the disease with probability 0.999, then using this value in the calculation yields a probability of a false positive of roughly
11 Multiple Choice. A student answers a multiple choice examination question that has 4 possible answers. Suppose that the probability that the student knows the answer to the question is 0.80 and the probability that the student guesses is If student guesses, probability of correct answer is (i) What is the probability that the fixed question is answered correctly? (ii) If it is answered correctly what is the probability that the student really knew the correct answer. Manufacturing Bayes. Factory has three types of machines producing an item. Probabilities that the item is I quality f it is produced on i-th machine are given in the following table: machine probability of I quality The total production is done 30% on type I machine, 50% on type II, and 20% on type III. One item is selected at random from the production. (i) What is the probability that it is of I quality? (ii) If it is of first quality, what is the probability that it was produced on the machine I? Two-headed coin. 4 One out of 1000 coins has two tails. The coin is selected at random out of these 1000 coins and flipped 5 times. If tails appeared all 5 times, what is the probability that the selected coin was two-tailed? Kokomo, Indiana. In Kokomo, IN, 65% are conservatives, 20% are liberals and 15% are independents. Records show that in a particular election 82% of conservatives voted, 65% of liberals voted and 50% of independents voted. If the person from the city is selected at random and it is learned that he/she did not vote, what is the probability that the person is liberal? Inflation and Unemployment. Businesses commonly project revenues under alternative economic scenarios. For a stylized example, inflation could be high or low and unemployment could be high or low. There are four possible scenarios, with the assumed probabilities: Scenario Inflation Unemployment Probability 1 high high high low low high low low 0.24 (i) What is the probability of high inflation? (ii) What is the probability of high inflation if unemployment is high? (iii) Are inflation and unemployment independent? 11
12 Information Channel. One of three words AAAA, BBBB, and CCCC is transmitted via an information channel. The probabilities of these words are 0.3, 0.5, and 0.2, respectively. Each letter is transmitted independently of the other letters and it is received correctly with probability 0.6. Since the channel is not perfect, the letter can change to one of the other two letters with equal probabilities of 0.2. What is the probability that word AAAA had been submitted if word ABCA was received. Jim Albert s Question. An automatic machine in a small factory produces metal parts. Most of the time (90% by long records), it produces 95% good parts and the remaining have to be scrapped. Other times, the machine slips into a less productive mode and only produces 70% good parts. The foreman observes the quality of parts that are produced by the machine and wants to stop and adjust the machine when she believes that the machine is not working well. Suppose that the first dozen parts produced are given by the sequence s u s s s s s s s u s u where s satisfactory and u unsatisfactory. After observing this sequence, what is the probability that the machine is in its good state? If the foreman wishes to stop the machine when the probability of good state is under 0.7, when should she stop? Medical Doctors and Probabilistic Reasoning. The following problem was posed by Casscells, Schoenberger, and Grayboys (1978) to 60 students and staff at Harvard Medical School: If a test to detect a disease whose prevalence is 1/1000 has a false positive rate of 5%, what is the chance that a person found to have a positive result actually has the disease, assuming you know nothing about the person s symptoms or signs? Assuming that the probability of a positive result given the disease is 1, the answer to this problem is approximately 2%. Casscells et al. found that only 18% of participants gave this answer. The modal response was 95%, presumably on the supposition that, because an error rate of the test is 5%, it must get 95% of results correct. Let s Make a Deal. In the popular television game show Let s Make a Deal, Monty Hall is the master of ceremonies. At certain times during the show, a contestant is allowed to choose one of three identical doors A, B, C, behind only one of which is a valuable prize (a new car). After the contestant picks a door (say, door A), Monty Hall opens another door and shows the contestant that there is no prize behind that door. (Monty Hall knows where the prize is and always chooses a door where there is no prize.) He then asks the contestant whether he or she wants to stick with their choice of door or switch to the remaining unopened door. Should the contestant switch doors? Does it matter? A federalist paper resolved? Suppose that a work, the author of which is known to be either Madson or Hamilton, contains a certain key phrase. Suppose further that this phrase occurs in 60% of the papers known to have been written by Madison, but in only 20% of those by Hamilton. Finally, suppose a historian gives subjective probability.3 to the event that the author is Madison (and consequently.7 to the complementary event that the author is Hamilton). Compute the historian s posterior probability for the event that the author is Madison. 12
13 References [1] Batt, J. Stolen Innocence: A Mother s Fight for Justice. The Authorised Story of Sally Clark, Ebury Press. [2] Barbeau, E. (1993). The Problem of the Car and Goats, CMJ, 24:2, p. 149 [3] Casscells, W., Schoenberger, A., and Grayboys, T. (1978). Interpretation by physicians of clinical laboratory results. New England Journal of Medicine, 299, [4] Gillman, L. (1992). The Car and the Goats,AMM 99:1, p. 3 [5] Selvin, S. (1975). A Problem in Probability, American Statistician, 29:1, p
6.3 Conditional Probability and Independence
222 CHAPTER 6. PROBABILITY 6.3 Conditional Probability and Independence Conditional Probability Two cubical dice each have a triangle painted on one side, a circle painted on two sides and a square painted
Week 2: Conditional Probability and Bayes formula
Week 2: Conditional Probability and Bayes formula We ask the following question: suppose we know that a certain event B has occurred. How does this impact the probability of some other A. This question
E3: PROBABILITY AND STATISTICS lecture notes
E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................
Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10
CS 70 Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10 Introduction to Discrete Probability Probability theory has its origins in gambling analyzing card games, dice,
Conditional Probability, Independence and Bayes Theorem Class 3, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom
Conditional Probability, Independence and Bayes Theorem Class 3, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Know the definitions of conditional probability and independence
AP Stats - Probability Review
AP Stats - Probability Review Multiple Choice Identify the choice that best completes the statement or answers the question. 1. I toss a penny and observe whether it lands heads up or tails up. Suppose
Probability. a number between 0 and 1 that indicates how likely it is that a specific event or set of events will occur.
Probability Probability Simple experiment Sample space Sample point, or elementary event Event, or event class Mutually exclusive outcomes Independent events a number between 0 and 1 that indicates how
Section 6-5 Sample Spaces and Probability
492 6 SEQUENCES, SERIES, AND PROBABILITY 52. How many committees of 4 people are possible from a group of 9 people if (A) There are no restrictions? (B) Both Juan and Mary must be on the committee? (C)
A Few Basics of Probability
A Few Basics of Probability Philosophy 57 Spring, 2004 1 Introduction This handout distinguishes between inductive and deductive logic, and then introduces probability, a concept essential to the study
Conditional Probability, Hypothesis Testing, and the Monty Hall Problem
Conditional Probability, Hypothesis Testing, and the Monty Hall Problem Ernie Croot September 17, 2008 On more than one occasion I have heard the comment Probability does not exist in the real world, and
Book Review of Rosenhouse, The Monty Hall Problem. Leslie Burkholder 1
Book Review of Rosenhouse, The Monty Hall Problem Leslie Burkholder 1 The Monty Hall Problem, Jason Rosenhouse, New York, Oxford University Press, 2009, xii, 195 pp, US $24.95, ISBN 978-0-19-5#6789-8 (Source
Chapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
Basic Probability Concepts
page 1 Chapter 1 Basic Probability Concepts 1.1 Sample and Event Spaces 1.1.1 Sample Space A probabilistic (or statistical) experiment has the following characteristics: (a) the set of all possible outcomes
Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom
Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior
Lecture Note 1 Set and Probability Theory. MIT 14.30 Spring 2006 Herman Bennett
Lecture Note 1 Set and Probability Theory MIT 14.30 Spring 2006 Herman Bennett 1 Set Theory 1.1 Definitions and Theorems 1. Experiment: any action or process whose outcome is subject to uncertainty. 2.
Probability definitions
Probability definitions 1. Probability of an event = chance that the event will occur. 2. Experiment = any action or process that generates observations. In some contexts, we speak of a data-generating
Lecture 13. Understanding Probability and Long-Term Expectations
Lecture 13 Understanding Probability and Long-Term Expectations Thinking Challenge What s the probability of getting a head on the toss of a single fair coin? Use a scale from 0 (no way) to 1 (sure thing).
Introduction to Hypothesis Testing
I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true
Fundamentals of Probability
Fundamentals of Probability Introduction Probability is the likelihood that an event will occur under a set of given conditions. The probability of an event occurring has a value between 0 and 1. An impossible
Probabilistic Strategies: Solutions
Probability Victor Xu Probabilistic Strategies: Solutions Western PA ARML Practice April 3, 2016 1 Problems 1. You roll two 6-sided dice. What s the probability of rolling at least one 6? There is a 1
36 Odds, Expected Value, and Conditional Probability
36 Odds, Expected Value, and Conditional Probability What s the difference between probabilities and odds? To answer this question, let s consider a game that involves rolling a die. If one gets the face
Bayesian Updating with Discrete Priors Class 11, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom
1 Learning Goals Bayesian Updating with Discrete Priors Class 11, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1. Be able to apply Bayes theorem to compute probabilities. 2. Be able to identify
Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
Introduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
Lecture 1 Introduction Properties of Probability Methods of Enumeration Asrat Temesgen Stockholm University
Lecture 1 Introduction Properties of Probability Methods of Enumeration Asrat Temesgen Stockholm University 1 Chapter 1 Probability 1.1 Basic Concepts In the study of statistics, we consider experiments
Lab 11. Simulations. The Concept
Lab 11 Simulations In this lab you ll learn how to create simulations to provide approximate answers to probability questions. We ll make use of a particular kind of structure, called a box model, that
Math/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability
Math/Stats 425 Introduction to Probability 1. Uncertainty and the axioms of probability Processes in the real world are random if outcomes cannot be predicted with certainty. Example: coin tossing, stock
Math 141. Lecture 2: More Probability! Albyn Jones 1. [email protected] www.people.reed.edu/ jones/courses/141. 1 Library 304. Albyn Jones Math 141
Math 141 Lecture 2: More Probability! Albyn Jones 1 1 Library 304 [email protected] www.people.reed.edu/ jones/courses/141 Outline Law of total probability Bayes Theorem the Multiplication Rule, again Recall
Basic Probability Theory II
RECAP Basic Probability heory II Dr. om Ilvento FREC 408 We said the approach to establishing probabilities for events is to Define the experiment List the sample points Assign probabilities to the sample
Mathematical goals. Starting points. Materials required. Time needed
Level S2 of challenge: B/C S2 Mathematical goals Starting points Materials required Time needed Evaluating probability statements To help learners to: discuss and clarify some common misconceptions about
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1
BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine
People have thought about, and defined, probability in different ways. important to note the consequences of the definition:
PROBABILITY AND LIKELIHOOD, A BRIEF INTRODUCTION IN SUPPORT OF A COURSE ON MOLECULAR EVOLUTION (BIOL 3046) Probability The subject of PROBABILITY is a branch of mathematics dedicated to building models
8 Divisibility and prime numbers
8 Divisibility and prime numbers 8.1 Divisibility In this short section we extend the concept of a multiple from the natural numbers to the integers. We also summarize several other terms that express
Basic Probability. Probability: The part of Mathematics devoted to quantify uncertainty
AMS 5 PROBABILITY Basic Probability Probability: The part of Mathematics devoted to quantify uncertainty Frequency Theory Bayesian Theory Game: Playing Backgammon. The chance of getting (6,6) is 1/36.
Chapter 3. Distribution Problems. 3.1 The idea of a distribution. 3.1.1 The twenty-fold way
Chapter 3 Distribution Problems 3.1 The idea of a distribution Many of the problems we solved in Chapter 1 may be thought of as problems of distributing objects (such as pieces of fruit or ping-pong balls)
Definition and Calculus of Probability
In experiments with multivariate outcome variable, knowledge of the value of one variable may help predict another. For now, the word prediction will mean update the probabilities of events regarding the
Chapter 6: Probability
Chapter 6: Probability In a more mathematically oriented statistics course, you would spend a lot of time talking about colored balls in urns. We will skip over such detailed examinations of probability,
MAS113 Introduction to Probability and Statistics
MAS113 Introduction to Probability and Statistics 1 Introduction 1.1 Studying probability theory There are (at least) two ways to think about the study of probability theory: 1. Probability theory is a
Introduction to Probability
3 Introduction to Probability Given a fair coin, what can we expect to be the frequency of tails in a sequence of 10 coin tosses? Tossing a coin is an example of a chance experiment, namely a process which
Hypothesis testing. c 2014, Jeffrey S. Simonoff 1
Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there
WORKED EXAMPLES 1 TOTAL PROBABILITY AND BAYES THEOREM
WORKED EXAMPLES 1 TOTAL PROBABILITY AND BAYES THEOREM EXAMPLE 1. A biased coin (with probability of obtaining a Head equal to p > 0) is tossed repeatedly and independently until the first head is observed.
Probability and Expected Value
Probability and Expected Value This handout provides an introduction to probability and expected value. Some of you may already be familiar with some of these topics. Probability and expected value are
Random variables, probability distributions, binomial random variable
Week 4 lecture notes. WEEK 4 page 1 Random variables, probability distributions, binomial random variable Eample 1 : Consider the eperiment of flipping a fair coin three times. The number of tails that
Ch. 13.3: More about Probability
Ch. 13.3: More about Probability Complementary Probabilities Given any event, E, of some sample space, U, of a random experiment, we can always talk about the complement, E, of that event: this is the
Probability Using Dice
Using Dice One Page Overview By Robert B. Brown, The Ohio State University Topics: Levels:, Statistics Grades 5 8 Problem: What are the probabilities of rolling various sums with two dice? How can you
PROBABILITY SECOND EDITION
PROBABILITY SECOND EDITION Table of Contents How to Use This Series........................................... v Foreword..................................................... vi Basics 1. Probability All
Independent samples t-test. Dr. Tom Pierce Radford University
Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of
Bayesian Tutorial (Sheet Updated 20 March)
Bayesian Tutorial (Sheet Updated 20 March) Practice Questions (for discussing in Class) Week starting 21 March 2016 1. What is the probability that the total of two dice will be greater than 8, given that
The study of probability has increased in popularity over the years because of its wide range of practical applications.
6.7. Probability. The study of probability has increased in popularity over the years because of its wide range of practical applications. In probability, each repetition of an experiment is called a trial,
6. Let X be a binomial random variable with distribution B(10, 0.6). What is the probability that X equals 8? A) (0.6) (0.4) B) 8! C) 45(0.6) (0.
Name: Date:. For each of the following scenarios, determine the appropriate distribution for the random variable X. A) A fair die is rolled seven times. Let X = the number of times we see an even number.
AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
6th Grade Lesson Plan: Probably Probability
6th Grade Lesson Plan: Probably Probability Overview This series of lessons was designed to meet the needs of gifted children for extension beyond the standard curriculum with the greatest ease of use
Cosmological Arguments for the Existence of God S. Clarke
Cosmological Arguments for the Existence of God S. Clarke [Modified Fall 2009] 1. Large class of arguments. Sometimes they get very complex, as in Clarke s argument, but the basic idea is simple. Lets
Probabilities. Probability of a event. From Random Variables to Events. From Random Variables to Events. Probability Theory I
Victor Adamchi Danny Sleator Great Theoretical Ideas In Computer Science Probability Theory I CS 5-25 Spring 200 Lecture Feb. 6, 200 Carnegie Mellon University We will consider chance experiments with
Chapter 4: Probability and Counting Rules
Chapter 4: Probability and Counting Rules Learning Objectives Upon successful completion of Chapter 4, you will be able to: Determine sample spaces and find the probability of an event using classical
Section 5-3 Binomial Probability Distributions
Section 5-3 Binomial Probability Distributions Key Concept This section presents a basic definition of a binomial distribution along with notation, and methods for finding probability values. Binomial
Descriptive Statistics and Measurement Scales
Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample
6.042/18.062J Mathematics for Computer Science. Expected Value I
6.42/8.62J Mathematics for Computer Science Srini Devadas and Eric Lehman May 3, 25 Lecture otes Expected Value I The expectation or expected value of a random variable is a single number that tells you
ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015
ECON 459 Game Theory Lecture Notes Auctions Luca Anderlini Spring 2015 These notes have been used before. If you can still spot any errors or have any suggestions for improvement, please let me know. 1
How To Find The Sample Space Of A Random Experiment In R (Programming)
Probability 4.1 Sample Spaces For a random experiment E, the set of all possible outcomes of E is called the sample space and is denoted by the letter S. For the coin-toss experiment, S would be the results
Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products
Chapter 3 Cartesian Products and Relations The material in this chapter is the first real encounter with abstraction. Relations are very general thing they are a special type of subset. After introducing
2. How many ways can the letters in PHOENIX be rearranged? 7! = 5,040 ways.
Math 142 September 27, 2011 1. How many ways can 9 people be arranged in order? 9! = 362,880 ways 2. How many ways can the letters in PHOENIX be rearranged? 7! = 5,040 ways. 3. The letters in MATH are
Chapter 11 Number Theory
Chapter 11 Number Theory Number theory is one of the oldest branches of mathematics. For many years people who studied number theory delighted in its pure nature because there were few practical applications
Problem of the Month: Fair Games
Problem of the Month: The Problems of the Month (POM) are used in a variety of ways to promote problem solving and to foster the first standard of mathematical practice from the Common Core State Standards:
A natural introduction to probability theory. Ronald Meester
A natural introduction to probability theory Ronald Meester ii Contents Preface v 1 Experiments 1 1.1 Definitions and examples........................ 1 1.2 Counting and combinatorics......................
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency
Chapter 4 - Practice Problems 2
Chapter - Practice Problems 2 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the indicated probability. 1) If you flip a coin three times, the
1 Combinations, Permutations, and Elementary Probability
1 Combinations, Permutations, and Elementary Probability Roughly speaking, Permutations are ways of grouping things where the order is important. Combinations are ways of grouping things where the order
Homework 3 (due Tuesday, October 13)
Homework (due Tuesday, October 1 Problem 1. Consider an experiment that consists of determining the type of job either blue-collar or white-collar and the political affiliation Republican, Democratic,
INCIDENCE-BETWEENNESS GEOMETRY
INCIDENCE-BETWEENNESS GEOMETRY MATH 410, CSUSM. SPRING 2008. PROFESSOR AITKEN This document covers the geometry that can be developed with just the axioms related to incidence and betweenness. The full
Introduction to Probability
Massachusetts Institute of Technology Course Notes 0 6.04J/8.06J, Fall 0: Mathematics for Computer Science November 4 Professor Albert Meyer and Dr. Radhika Nagpal revised November 6, 00, 57 minutes Introduction
Lesson 1. Basics of Probability. Principles of Mathematics 12: Explained! www.math12.com 314
Lesson 1 Basics of Probability www.math12.com 314 Sample Spaces: Probability Lesson 1 Part I: Basic Elements of Probability Consider the following situation: A six sided die is rolled The sample space
1. The Fly In The Ointment
Arithmetic Revisited Lesson 5: Decimal Fractions or Place Value Extended Part 5: Dividing Decimal Fractions, Part 2. The Fly In The Ointment The meaning of, say, ƒ 2 doesn't depend on whether we represent
WHERE DOES THE 10% CONDITION COME FROM?
1 WHERE DOES THE 10% CONDITION COME FROM? The text has mentioned The 10% Condition (at least) twice so far: p. 407 Bernoulli trials must be independent. If that assumption is violated, it is still okay
7.S.8 Interpret data to provide the basis for predictions and to establish
7 th Grade Probability Unit 7.S.8 Interpret data to provide the basis for predictions and to establish experimental probabilities. 7.S.10 Predict the outcome of experiment 7.S.11 Design and conduct an
3. Mathematical Induction
3. MATHEMATICAL INDUCTION 83 3. Mathematical Induction 3.1. First Principle of Mathematical Induction. Let P (n) be a predicate with domain of discourse (over) the natural numbers N = {0, 1,,...}. If (1)
CHAPTER 2. Logic. 1. Logic Definitions. Notation: Variables are used to represent propositions. The most common variables used are p, q, and r.
CHAPTER 2 Logic 1. Logic Definitions 1.1. Propositions. Definition 1.1.1. A proposition is a declarative sentence that is either true (denoted either T or 1) or false (denoted either F or 0). Notation:
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
3 Some Integer Functions
3 Some Integer Functions A Pair of Fundamental Integer Functions The integer function that is the heart of this section is the modulo function. However, before getting to it, let us look at some very simple
Chapter 4 - Practice Problems 1
Chapter 4 - Practice Problems SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. ) Compare the relative frequency formula
So let us begin our quest to find the holy grail of real analysis.
1 Section 5.2 The Complete Ordered Field: Purpose of Section We present an axiomatic description of the real numbers as a complete ordered field. The axioms which describe the arithmetic of the real numbers
PART 3 MODULE 3 CLASSICAL PROBABILITY, STATISTICAL PROBABILITY, ODDS
PART 3 MODULE 3 CLASSICAL PROBABILITY, STATISTICAL PROBABILITY, ODDS PROBABILITY Classical or theoretical definitions: Let S be the set of all equally likely outcomes to a random experiment. (S is called
Elasticity. I. What is Elasticity?
Elasticity I. What is Elasticity? The purpose of this section is to develop some general rules about elasticity, which may them be applied to the four different specific types of elasticity discussed in
In the situations that we will encounter, we may generally calculate the probability of an event
What does it mean for something to be random? An event is called random if the process which produces the outcome is sufficiently complicated that we are unable to predict the precise result and are instead
How To Choose Between A Goat And A Door In A Game Of \"The Black Jackpot\"
Appendix D: The Monty Hall Controversy Appendix D: The Monty Hall Controversy - Page 1 Let's Make a Deal Prepared by Rich Williams, Spring 1991 Last Modified Fall, 2004 You are playing Let's Make a Deal
Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 2
CS 70 Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 2 Proofs Intuitively, the concept of proof should already be familiar We all like to assert things, and few of us
Clock Arithmetic and Modular Systems Clock Arithmetic The introduction to Chapter 4 described a mathematical system
CHAPTER Number Theory FIGURE FIGURE FIGURE Plus hours Plus hours Plus hours + = + = + = FIGURE. Clock Arithmetic and Modular Systems Clock Arithmetic The introduction to Chapter described a mathematical
Bayesian probability theory
Bayesian probability theory Bruno A. Olshausen arch 1, 2004 Abstract Bayesian probability theory provides a mathematical framework for peforming inference, or reasoning, using probability. The foundations
Ch. 13.2: Mathematical Expectation
Ch. 13.2: Mathematical Expectation Random Variables Very often, we are interested in sample spaces in which the outcomes are distinct real numbers. For example, in the experiment of rolling two dice, we
PYTHAGOREAN TRIPLES KEITH CONRAD
PYTHAGOREAN TRIPLES KEITH CONRAD 1. Introduction A Pythagorean triple is a triple of positive integers (a, b, c) where a + b = c. Examples include (3, 4, 5), (5, 1, 13), and (8, 15, 17). Below is an ancient
Answer Key for California State Standards: Algebra I
Algebra I: Symbolic reasoning and calculations with symbols are central in algebra. Through the study of algebra, a student develops an understanding of the symbolic language of mathematics and the sciences.
The result of the bayesian analysis is the probability distribution of every possible hypothesis H, given one real data set D. This prestatistical approach to our problem was the standard approach of Laplace
Hands-On Math Algebra
Hands-On Math Algebra by Pam Meader and Judy Storer illustrated by Julie Mazur Contents To the Teacher... v Topic: Ratio and Proportion 1. Candy Promotion... 1 2. Estimating Wildlife Populations... 6 3.
Chapter 4 & 5 practice set. The actual exam is not multiple choice nor does it contain like questions.
Chapter 4 & 5 practice set. The actual exam is not multiple choice nor does it contain like questions. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Probability & Probability Distributions
Probability & Probability Distributions Carolyn J. Anderson EdPsych 580 Fall 2005 Probability & Probability Distributions p. 1/61 Probability & Probability Distributions Elementary Probability Theory Definitions
Recursive Estimation
Recursive Estimation Raffaello D Andrea Spring 04 Problem Set : Bayes Theorem and Bayesian Tracking Last updated: March 8, 05 Notes: Notation: Unlessotherwisenoted,x, y,andz denoterandomvariables, f x
Contemporary Mathematics- MAT 130. Probability. a) What is the probability of obtaining a number less than 4?
Contemporary Mathematics- MAT 30 Solve the following problems:. A fair die is tossed. What is the probability of obtaining a number less than 4? What is the probability of obtaining a number less than
