Stat 20: Intro to Probability and Statistics



Similar documents
Stat 20: Intro to Probability and Statistics

The Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?

AMS 5 CHANCE VARIABILITY

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.

Chapter 20: chance error in sampling

Elementary Statistics and Inference. Elementary Statistics and Inference. 16 The Law of Averages (cont.) 22S:025 or 7P:025.

Lab 11. Simulations. The Concept

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MATH 140 Lab 4: Probability and the Standard Normal Distribution

$ ( $1) = 40

The overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS NUMBER OF TOSSES

MA 1125 Lecture 14 - Expected Values. Friday, February 28, Objectives: Introduce expected values.

13.0 Central Limit Theorem

Section 7C: The Law of Large Numbers

Statistics and Random Variables. Math 425 Introduction to Probability Lecture 14. Finite valued Random Variables. Expectation defined

Thursday, October 18, 2001 Page: 1 STAT 305. Solutions

6.3 Conditional Probability and Independence

Mathematical Expectation

Chapter 4 Lecture Notes

Lecture 13. Understanding Probability and Long-Term Expectations

Section 6.2 Definition of Probability

2. How many ways can the letters in PHOENIX be rearranged? 7! = 5,040 ways.

Chapter 4 & 5 practice set. The actual exam is not multiple choice nor does it contain like questions.

That s Not Fair! ASSESSMENT #HSMA20. Benchmark Grades: 9-12

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Week 4: Standard Error and Confidence Intervals

Hoover High School Math League. Counting and Probability

STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science

Expected Value and the Game of Craps

Probability Distribution for Discrete Random Variables

Betting systems: how not to lose your money gambling

Decimals and Percentages

Example. A casino offers the following bets (the fairest bets in the casino!) 1 You get $0 (i.e., you can walk away)

Contemporary Mathematics Online Math 1030 Sample Exam I Chapters No Time Limit No Scratch Paper Calculator Allowed: Scientific

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Homework Assignment #2: Answer Key

Contemporary Mathematics- MAT 130. Probability. a) What is the probability of obtaining a number less than 4?

Ch. 13.2: Mathematical Expectation

Bayesian Tutorial (Sheet Updated 20 March)

Probability, statistics and football Franka Miriam Bru ckler Paris, 2015.

Lesson 1. Basics of Probability. Principles of Mathematics 12: Explained! 314

In the situations that we will encounter, we may generally calculate the probability of an event

Algebra 2 C Chapter 12 Probability and Statistics

AP Stats - Probability Review

Introductory Probability. MATH 107: Finite Mathematics University of Louisville. March 5, 2014

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS

Sample Term Test 2A. 1. A variable X has a distribution which is described by the density curve shown below:

V. RANDOM VARIABLES, PROBABILITY DISTRIBUTIONS, EXPECTED VALUE

Mathematical goals. Starting points. Materials required. Time needed

MTH6120 Further Topics in Mathematical Finance Lesson 2

COMMON CORE STATE STANDARDS FOR

Chapter What is the probability that a card chosen from an ordinary deck of 52 cards is an ace? Ans: 4/52.

Chapter 16. Law of averages. Chance. Example 1: rolling two dice Sum of draws. Setting up a. Example 2: American roulette. Summary.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Section 5-3 Binomial Probability Distributions

Hooray for the Hundreds Chart!!

Sue Fine Linn Maskell

36 Odds, Expected Value, and Conditional Probability

Solution. Solution. (a) Sum of probabilities = 1 (Verify) (b) (see graph) Chapter 4 (Sections ) Homework Solutions. Section 4.

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Chapter 4 - Practice Problems 1

Problem sets for BUEC 333 Part 1: Probability and Statistics

PROBABILITY SECOND EDITION

Exploratory Data Analysis. Psychology 3256

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

STAT 35A HW2 Solutions

Complement. If A is an event, then the complement of A, written A c, means all the possible outcomes that are not in A.

6th Grade Lesson Plan: Probably Probability

An Introduction to Basic Statistics and Probability

4. Continuous Random Variables, the Pareto and Normal Distributions

Basic Probability. Probability: The part of Mathematics devoted to quantify uncertainty

Chapter 16: law of averages

1.6 The Order of Operations

High School Statistics and Probability Common Core Sample Test Version 2

Economics 1011a: Intermediate Microeconomics

John Kerrich s coin-tossing Experiment. Law of Averages - pg. 294 Moore s Text

Concepts of Probability

What is the purpose of this document? What is in the document? How do I send Feedback?

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

Probability: The Study of Randomness Randomness and Probability Models. IPS Chapters 4 Sections

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MAKING MATH MORE FUN BRINGS YOU FUN MATH GAME PRINTABLES FOR HOME OR SCHOOL

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Unit 19: Probability Models

Choice Under Uncertainty

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Intro to Simulation (using Excel)

Minimax Strategies. Minimax Strategies. Zero Sum Games. Why Zero Sum Games? An Example. An Example

Section 6.1 Discrete Random variables Probability Distribution

We rst consider the game from the player's point of view: Suppose you have picked a number and placed your bet. The probability of winning is

Chapter 5. Discrete Probability Distributions

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Ch. 13.3: More about Probability

6.042/18.062J Mathematics for Computer Science. Expected Value I

Is it possible to beat the lottery system?

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Two-sample inference: Continuous data

Part III. Lecture 3: Probability and Stochastic Processes. Stephen Kinsella (UL) EC4024 February 8, / 149

Statistics 151 Practice Midterm 1 Mike Kowalski

Transcription:

Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014

By the end of this lecture... You will be able to: Determine what we expect the sum of draws from a box to be, and how far off we will likely be Quickly calculate the SD of a list with only two kinds of numbers Easily calculate probabilities for sums of draws Use a box model to address more kinds of problems, e.g. counting the number of 6 s shown in a series of throws 2 / 28

Recap: Box Models Box models are useful in analyzing games of chance Draw a box Indicate the number and kind of tickets Indicate the number and kind of draws Indicate what is done with each ticket Examined minimum and maximum of sum of draws 3 / 28

Example 1: Box Model Have a box with three tickets a 1, a 2, and a 3 Draw 5 times, with replacement Add together the values seen on each ticket What is the sum of the draws? How much does each draw contribute to the sum? What can we reasonably expect the sum to be? 4 / 28

The Expected Value The Expected Value (EV) for the sum of the draws from the box is # of draws average of the box The sum of draws from a box (with replacement) should be somewhere around the expected value. 5 / 28

Example 2: Rolling Dice You are playing a dice game. It costs $1 per play. You roll the dice, and if it is an even number, you win $3. If it is odd you win nothing. About how much do you expect to win or lose in 50 plays? 1 Draw a box model, indicating the number and kind of tickets 2 Indicate the number and kind of draws 3 Indicate what is done with each ticket 4 Answer the question above 6 / 28

Example 3: Coin Flipping You are playing a coin flipping game. It costs $1 to play. You flip the two coins, and if there is at least one head showing, you win $2. Otherwise, you win nothing. True or False, and Explain: If you play 30 times, you will definitely win $10. 7 / 28

Chance Error Variation around expected value is due to chance error chance error = # observed # expected If I actually win $5, what is my chance error? What if I lose $15? How big is my chance error likely to be? 8 / 28

Standard Error The Standard Error (SE) for the sum of the draws from the box is # of draws SD of the box The sum of draws from a box (with replacement) should be somewhere around the expected value, give or take a SE. 9 / 28

Example 4: Two Boxes What kind of variability do we expect from the sum of 5 draws, with replacement, from a box with: 1 A single 1 and a single 3? 2 A single 1 and a single 10? Calculate the EV and SE of both of these situations. 10 / 28

SD Shortcut Obviously calculating a lot of SDs: 1 Find average 2 Find the deviations from average 3 Square the deviations from average 4 Average the squared deviations 5 Take the square root 11 / 28

SD Shortcut (cont.) If there are only two types of tickets in the box (or only two types of numbers in the list): 1 Call the larger number the big # and the smaller number the small # 2 Call the fraction of larger numbers b.f. and the fraction of smaller numbers s.f. 3 Calculate SD = (big # small #) b.f. s.f. 12 / 28

SD Shortcut (cont.) Let s look at a box with three 2 s and two 1 s avg = 1.6 (2 1.6) sd = 2 + (2 1.6) 2 + (2 1.6) 2 + (1 1.6) 2 + (1 1.6) 2 5 (0.4) 2 + (0.4) 2 + (0.4) 2 + ( 0.6) 2 + ( 0.6) 2 = = 0.49 sd = (2 1) = 0.49 3 5 2 5 5 13 / 28

Lists vs. Chance Processes List of numbers (tickets in a box), all values are known mean = average = sum of values, divided by number of values; the typical size of an entry/ticket SD = standard deviation = square root of average of deviations from mean; the typical size of the deviation from the mean in a single entry The typical entry in a list is around average, give or take a standard deviation or so. 14 / 28

Lists vs. Chance Processes (cont.) Chance process (draws from a box), values are unknown EV for sum of draws with replacement = number of draws times average of box; typical size of the sum of draws with replacement SE for sum of draws with replacement = standard error = square root of number of draws times SD of box; typical size of deviation from EV in a single sum of draws The sum of draws with replacement is around expected value, give or take a standard error or so. 15 / 28

Example 5: Drawing from a Box 50 draws are taken, with replacement, from a box with 1 each of the following: 1, 2, 3, 6, 8 1 Calculate the expected value and standard error for the sum of the draws. 2 The sum of the draws will be around, give or take or so. 3 Someone actually makes 50 draws with replacement. You are asked to guess what the sum is. Do you think your guess is off by about 2, 12, or 20? 4 You are told that 175 is the sum. Fill in the following: (a) expected value = (b) observed value = (c) chance error = (d) standard error = 16 / 28

Interesting Question: What is the chance that using the box above (1 each of: 1, 2, 3, 6, 8 ), the sum of 1000 draws is between 3900 and 4100? Could you find a similar probability for a much smaller number of draws? Recalling the frequency definition of probability, could you find this probability? 17 / 28

Interesting Question: (cont.) Draw 1000 tickets with replacement, calculate the sum of the tickets Do this 10 times, record the proportion of sums (out of 10) that are between 3900 and 4100 Do this 100 times, record the proportion of sums (out of 100) that are between 3900 and 4100 Relative Proportion Between 3900 and 4100 0.75 0.80 0.85 0.90 Relative Proportion of Observed Sums of 1000 Draws Between 3900 and 4100 0 2000 4000 6000 8000 10000 Number of Observed Sums 18 / 28

Interesting Question: (cont.) Draw 1000 tickets with replacement, calculate the sum of the tickets Do this 200 times, record the proportion of sums (out of 200) that are between 3900 and 4100 Do this times, record the proportion of sums that are between 3900 and 4100 Relative Proportion Between 3900 and 4100 0.75 0.80 0.85 0.90 Relative Proportion of Observed Sums of 1000 Draws Between 3900 and 4100 0 2000 4000 6000 8000 10000 Number of Observed Sums 19 / 28

Interesting Question: (cont.) Histogram of 10,000 Observed Sums of 1,000 Draws From The Box Draw 1000 tickets with replacement, calculate the sum of the tickets Do this 10,000 times, make a histogram of the sum The histogram looks normal Density 0.000 0.001 0.002 0.003 0.004 3700 3800 3900 4000 4100 4200 4300 Sum of 1,000 Draws 20 / 28

Interesting Question: (cont.) We can use the approximate normality of this curve to calculate the chance that the sum of 1,000 draws is between 3900 and 4100. z = value of sum expected value of sum standard error of sum Use the normal table to find the chance that the sum of 1000 draws from the box ( 1, 2, 3, 6, 8 ) is between 3900 and 4100. In general, the normal curve can be used to calculate probabilities for sums of random draws with replacement from a box, when the number of draws is large 21 / 28

Example 6: Using the Normal Curve A fair die is thrown 200 times. 1 Calculate the expected value and standard error for the sum of the throws 2 The sum of the throws will be around, give or take or so. 3 Find the probability that the sum of the throws is greater than 647. 22 / 28

Example 7: Counting the Evens A fair die is thrown 600 times. 1 The sum of the throws will be around 2100, give or take 42 or so. 2 The sum of evens thrown will be around, give or take or so. 3 The number of evens thrown will be around, give or take or so. 23 / 28

Example 7: Counting the Evens (cont.) Let s simplify, and assume a fair die is thrown 5 times. I roll 6 2 4 5 3 If I am making a sum of throws, I will add: 6 + 2 + 4 + 5 + 3 = 20 If I am making a sum of evens thrown, I will add: 6 + 2 + 4 + 0 + 0 = 12 If I am counting the number of evens thrown, I will add: 1 + 1 + 1 + 0 + 0 = 3 Each process can be written as a sum! 24 / 28

Example 7: Counting the Evens (cont.) Strategy for adding only certain things or counting number of things: 1 Make the box describing the basic chance process 2 Formulate your desired quantity as a sum 3 Change the value of the tickets (but not the number of tickets!) to add as appropriate 25 / 28

Example 7: Counting the Evens (cont.) A fair die is thrown 600 times. Box is ( 1, 2, 3, 4, 5, 6 ). Have 600 throws, with replacement 1 Sum of throws is like sum of draws from box above, usual EV and SE formulas apply 2 Sum of evens: each even drawn adds itself to the sum. Each odd drawn adds 0 to the sum. Box becomes ( 0, 2, 0, 4, 0, 6 ). Sum of evens is like sum of draws from this box, usual EV and SE formulas apply 3 Number of evens: each even drawn adds 1 to the count (sum). Each odd drawn adds 0 to the count (sum). Box becomes ( 0, 1, 0, 1, 0, 1 ). Number of evens is like sum of draws from this box, usual EV and SE formulas apply. 26 / 28

In a 0-1 Box The 1 s represent the event(s) that we wish to count, the 0 s represent the event(s) that we do not wish to count The EV for the sum = number of draws average of box. But what is the average of the box? The SE for the sum = number of draws SD of box. But what is the SD of the box? The normal curve can be used to calculate chances for sums of draws: new avg = EV for sum, new SD = SE for sum 27 / 28

Important Takeaways The EV is the sum that we expect to see The SE is the amount we expect a particular sum to be off from the EV, due to chance error There is a shortcut for calculating the SD of a list with only two kinds of numbers The normal curve can be used to calculate chances for sums of draws: new avg = EV for sum, new SD = SE for sum Changing the box helps address a lot of problems either to add only certain kinds of tickets, or to count the number of a certain event Next time: Why use the normal curve to calculate probabilities? 28 / 28