Stat 20: Intro to Probability and Statistics



Similar documents
Stat 20: Intro to Probability and Statistics

AMS 5 CHANCE VARIABILITY

Elementary Statistics and Inference. Elementary Statistics and Inference. 16 The Law of Averages (cont.) 22S:025 or 7P:025.

The Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.

13.0 Central Limit Theorem

Chapter 16: law of averages

Chapter 20: chance error in sampling

We rst consider the game from the player's point of view: Suppose you have picked a number and placed your bet. The probability of winning is

$ ( $1) = 40

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

John Kerrich s coin-tossing Experiment. Law of Averages - pg. 294 Moore s Text

Chapter 16. Law of averages. Chance. Example 1: rolling two dice Sum of draws. Setting up a. Example 2: American roulette. Summary.

MA 1125 Lecture 14 - Expected Values. Friday, February 28, Objectives: Introduce expected values.

Section 7C: The Law of Large Numbers

Chapter 26: Tests of Significance

Section 6.1 Discrete Random variables Probability Distribution

Probability. Sample space: all the possible outcomes of a probability experiment, i.e., the population of outcomes

Statistics 151 Practice Midterm 1 Mike Kowalski

The overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS NUMBER OF TOSSES

Statistics 100A Homework 3 Solutions

Statistics 100A Homework 4 Solutions

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

AP Statistics 7!3! 6!

Example. A casino offers the following bets (the fairest bets in the casino!) 1 You get $0 (i.e., you can walk away)

V. RANDOM VARIABLES, PROBABILITY DISTRIBUTIONS, EXPECTED VALUE

Week 5: Expected value and Betting systems

Lecture 13. Understanding Probability and Long-Term Expectations

Example: Find the expected value of the random variable X. X P(X)

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

WHERE DOES THE 10% CONDITION COME FROM?

Chapter 5 Section 2 day f.notebook. November 17, Honors Statistics

You can place bets on the Roulette table until the dealer announces, No more bets.

Probability: The Study of Randomness Randomness and Probability Models. IPS Chapters 4 Sections

3.2 Roulette and Markov Chains

Chapter 4 Lecture Notes

Choice Under Uncertainty

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 13. Random Variables: Distribution and Expectation

Introductory Probability. MATH 107: Finite Mathematics University of Louisville. March 5, 2014

Roulette Wheel Testing. Report on Stages 3.2 and 3.3 of NWML/GBGB Project Proposal

2. How many ways can the letters in PHOENIX be rearranged? 7! = 5,040 ways.

Random Variables. Chapter 2. Random Variables 1

Worksheet for Teaching Module Probability (Lesson 1)

This Method will show you exactly how you can profit from this specific online casino and beat them at their own game.

STAT 35A HW2 Solutions

STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science

Introduction to Game Theory IIIii. Payoffs: Probability and Expected Utility

Unit 4 The Bernoulli and Binomial Distributions

Lab 11. Simulations. The Concept

Basic Probability Theory II

Near Misses in Bingo

CS170 Lab 11 Abstract Data Types & Objects

Lesson 1. Basics of Probability. Principles of Mathematics 12: Explained! 314

Betting on Excel to enliven the teaching of probability

The Mathematics of Gambling

A Non-Linear Schema Theorem for Genetic Algorithms

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

EC313 - VHDL State Machine Example

= = 106.

Chapter 5. Discrete Probability Distributions

Chapter 4 & 5 practice set. The actual exam is not multiple choice nor does it contain like questions.

Family Connection by Naviance

Ch. 13.3: More about Probability

Random Fibonacci-type Sequences in Online Gambling

Lotto Master Formula (v1.3) The Formula Used By Lottery Winners

X: Probability:

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

Thursday, November 13: 6.1 Discrete Random Variables

Expected Value. 24 February Expected Value 24 February /19

The Math. P (x) = 5! = = 120.

Why is Insurance Good? An Example Jon Bakija, Williams College (Revised October 2013)

MONEY MANAGEMENT. Guy Bower delves into a topic every trader should endeavour to master - money management.

Statistics and Random Variables. Math 425 Introduction to Probability Lecture 14. Finite valued Random Variables. Expectation defined

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Find the indicated probability. 1) If a single fair die is rolled, find the probability of a 4 given that the number rolled is odd.

Two-sample inference: Continuous data

Table 1: Field Experiment Dependent Variable Probability of Donation (0 to 100)

Probability and Expected Value

Second Midterm Exam (MATH1070 Spring 2012)

Exam. Name. How many distinguishable permutations of letters are possible in the word? 1) CRITICS

Introduction to Discrete Probability. Terminology. Probability definition. 22c:19, section 6.x Hantao Zhang

Introduction to Ethics for Health Care Aides Online course presented by the Manitoba Provincial Health Ethics Network Frequently asked questions

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

In the situations that we will encounter, we may generally calculate the probability of an event

Rules of core casino games in Great Britain

Statistics. Head First. A Brain-Friendly Guide. Dawn Griffiths

Solution (Done in class)

TABLE OF CONTENTS. ROULETTE FREE System # ROULETTE FREE System #

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

College Scorecard data demonstrates that UC is a good investment not only for students and families, but the federal government and the state

Week 4: Standard Error and Confidence Intervals

Northumberland Knowledge

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Orange High School. Year 7, Mathematics Assignment 2

Mathematics Content: Pie Charts; Area as Probability; Probabilities as Percents, Decimals & Fractions

Money Borrowing money

Conditional Probability, Independence and Bayes Theorem Class 3, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Betting systems: how not to lose your money gambling

Transcription:

Stat 20: Intro to Probability and Statistics Lecture 18: Simple Random Sampling Tessa L. Childers-Day UC Berkeley 24 July 2014

By the end of this lecture... You will be able to: Draw box models for real-world scenarios Find EVs and SEs for percentages (as opposed to sums) Explain the difference between drawing with and without replacement (and how that affects EVs, SEs, and use of the normal) 2 / 23

Recap: Box Models Box models are useful in analyzing games of chance Draw a box model to describe a process (sums or classify/count) Draw randomly, with replacement, from the box Calculate EV for sum and SE for sum Use normal curve to calculate probabilities of certain outcomes. Why? 3 / 23

Example: Roulette You are playing roulette, a game in which a ball has equal chances of landing on one of 18 red spaces, 18 black spaces, or 2 green spaces. What does the box look like? Now, I want to know what the chance is of having the ball land on a red space 20 or more times in 35 plays. Draw a box model to describe the game (sums or classify/count) Draw randomly, with replacement, from the box Calculate EV for sum and SE for sum Use normal curve to calculate probability 4 / 23

Other Box Models Box models can also be useful in analyzing the ways that other chance processes work Surveys, studies, experiments, etc. What is our box? How do we draw from it? How is this different from the other box models we ve seen? 5 / 23

Example: Loan Debt You are interested in estimating the percentage of students at Cal who have taken out student loans. Here, and in general: Draw a box model to describe the population Draw randomly, without replacement, from the box Calculate EV for percent and SE for percent Use normal curve to calculate probability 6 / 23

Chance Variability Say that the population is divided evenly among class rank. Will my sample reflect this? Why or why not? If I take one sample, put every ticket back, and draw another sample, will they match? Is this a problem? 7 / 23

The Expected Value Recall, that when drawing with replacement, EV for sum = # of draws average of box. But our box is made of only 0 s and 1 s, so EV for # of 1 s = number of draws proportion of 1 s. 8 / 23

The Expected Value (cont.) A percentage = # of something total # 100, so EV for # of 1 s EV for percent of 1 s = 100 # of draws number of draws proportion of 1 s = 100 # of draws = proportion of 1 s 100 = percentage of 1 s This is when we draw with replacement. 9 / 23

The Expected Value (cont.) When drawing without replacement EV for percent of 1 s = percentage of 1 s. Intuitively, it makes sense that we expect to see a representative number of 1 s, since this is a good sampling method 10 / 23

The Standard Error Recall, that when drawing with replacement, SE for sum = # of draws SD of box. But our box is made of only 0 s and 1 s, so SE for # of 1 s = # of draws (1 0) (proportion of 1 s ) ( ) proportion. of 0 s 11 / 23

The Standard Error (cont.) A percentage = # of something total # 100, so SE for # of 1 s SE for percent of 1 s = 100 # of draws (proportion ) ( ) number of draws proportion = 100 # of draws of 1 s of 0 s ( ) ( ) proportion proportion = of 1 s of 0 s number of draws 100 This is when we draw with replacement. 12 / 23

The Standard Error (cont.) SE for percent of 1 s = ( ) ( ) proportion proportion of 1 s of 0 s number of draws 100 We can see that: Increasing the number of trials: increases the SE for the sum by a factor of the square root decreases the SE for % by a factor of the square-root As with our previous SE s, this tells us about how far off a draw will be from the EV 13 / 23

The Standard Error (cont.) When drawing without replacement SE for percent of 1 s without replacement = correction factor SE for percent of 1 s. with replacement Where correction factor = population size sample size population size 1 Intuitively, it makes sense that we must relate the sample size and the population size 14 / 23

The Correction Factor correction factor = population size sample size population size 1 What happens if our sample is small compared to the population? What if it is large? 15 / 23

The Correction Factor (cont.) Corection Factor 0.0 0.2 0.4 0.6 0.8 1.0 Pop Size = 100,000 Pop Size = 10,000 Pop Size = 4000 Pop Size = 3000 Pop Size = 2000 Pop Size = 1,000 0 1000 2000 3000 4000 Sample Size 16 / 23

Comparison Sum of draws from a box, with replacement Percent of 1 s from a 0-1 box, with replacement EV for sum = # of draws avg. of box EV for % = percent of 1 s in the box SE for sum = # of draws SD of box SE for % ( ) ( ) proportion proportion = of 1 s of 0 s number of draws 100 17 / 23

Comparison (cont.) Percent of 1 s from a 0-1 box, with replacement Percent of 1 s from a 0-1 box, without replacement EV for % = percent of 1 s in the box EV for % = percent of 1 s in the box SE for % ( ) ( ) proportion proportion = of 1 s of 0 s number of draws 100 SE for % = correction factor ( ) ( ) proportion proportion of 1 s of 0 s number of draws 100 18 / 23

Examples Say we want to find the chance of having a roulette ball land on a red space 20 or more times in 35 plays. Draw a box, indicate number and kind of tickets, number of draws, kind of draws Calculate EV for sum Calculate SE for sum Use normal curve 19 / 23

Examples (cont.) Say we want to find the chance of having a roulette ball land on a red space 57% or more of the time, in 35 plays. Draw a box, indicate number and kind of tickets, number of draws, kind of draws Calculate EV for percent Calculate SE for percent Use normal curve 20 / 23

Examples (cont.) Assume that there are 30,000 students at UC Berkeley, and that 65% of them have some student loan debt. We want to find the chance of having 67% or more of those sampled have student loan debt, in a sample of size 1,000. Draw a box, indicate number and kind of tickets, number of draws, kind of draws Calculate EV for percent Calculate SE for percent Use normal curve 21 / 23

Examples (cont.) I m interested in looking at student loan debt at other colleges, besides UC Berkeley. So, I expand my survey to include UCLA and Pomona College (a small, liberal arts university). In both places, I will take a sample of 2% of the students, in order to estimate the percentage of students with loan debt. Other things being equal: 1 The accuracy to be expected at UCLA is about the same as the accuracy to be expected at Pomona. 2 The accuracy to be expected at UCLA is quite a bit higher than at Pomona. 3 The accuracy to be expected at UCLA is quite a bit lower than at Pomona. Explain. 22 / 23

Important Takeaways Box models can be used for real world problems, not just gambling Box = population, Draws from box without replacement = sample (SRS) EV, and SE for percent of 1 s drawn from a 0-1 box Correction factor for drawing without replacement Probability histogram still normally distributed We can use the correction factor, combined with the normal curve, to find probabilities under the normal curve (approximate probabilities from the probability histogram) Next time: What if we don t know what s in the box? 23 / 23