WEEK #23: Statistics for Spread; Binomial Distribution




Goals:
Study measures of spread, such as interquartile range, variance, and standard deviation.
Introduce standard distributions, including the binomial distribution.

Textbook reading for Week #23: Study Adler Sections 6.9 and 7.4

Standard Distributions

We have seen distributions, both discrete and continuous, that originate with data from experiments, or with formulas we made up to fit an example. Many common and important classes of experiments have well-understood distributions. We will start with the binomial distribution.

Coin Flipping

The canonical example for the binomial distribution is a sequence of coin flips.

Compute the probability distribution for the number of heads flipped, if we flip a coin three times. The coin is loaded, so the probability of heads on each flip is only 0.4 (the probability of tails is 0.6).

The Binomial Distribution

Consider the scenario of repeatedly running a two-outcome experiment, e.g. coin flipping, where each outcome is either heads or tails, or selecting alleles from parents, where each outcome is either B or b. (These two-outcome experiments are sometimes called Bernoulli trials.)

We arbitrarily label one of the outcomes as a success (e.g. heads, B allele). If the probability of a successful outcome in each experiment is p, then what is the probability of getting exactly k successes in n trials?
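For the three-flip exercise above, one way to see where the distribution comes from is to list every heads/tails sequence and add up the probabilities by number of heads. The following is only a sketch of that brute-force approach; the function name `heads_distribution` is made up for illustration and is not part of the notes.

```python
from itertools import product

def heads_distribution(n_flips, p_heads):
    """Total the probability of every heads/tails sequence by its number of heads."""
    dist = {k: 0.0 for k in range(n_flips + 1)}
    for seq in product("HT", repeat=n_flips):
        prob = 1.0
        for flip in seq:
            # Flips are independent, so the sequence probability is a product.
            prob *= p_heads if flip == "H" else 1 - p_heads
        dist[seq.count("H")] += prob
    return dist

dist = heads_distribution(3, 0.4)  # loaded coin from the exercise
for k, prob in dist.items():
    print(f"P({k} heads) = {prob:.3f}")
# P(0 heads) = 0.216
# P(1 heads) = 0.432
# P(2 heads) = 0.288
# P(3 heads) = 0.064
```

Enumeration is only practical for small numbers of flips, since the number of sequences doubles with each flip; that is the motivation for a counting formula.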

We note that there is an essential counting step in computing these probabilities: how many ways can there be k successful outcomes out of n trials? Fortunately, this is a well-known question with a well-understood solution, and the calculation is built into most scientific calculators:

Number of ways to choose k successes in n tries = "n choose k":

$$\binom{n}{k} = \frac{n!}{k!\,(n-k)!}$$

On a Casio 991 model calculator, you can compute this with the <SHIFT> key combination or the nCr button.

Use the binomial distribution to find the probability that you will roll exactly 3 ones while rolling a fair six-sided die 10 times.

Consider a multi-genic phenotype, for which the visible effect of the genotype depends on the number of B copies a plant has in total over 12 different loci (24 possible B copies in total). Find the probability that a new offspring has a total of 16 B alleles out of the possible 24, given that B alleles are distributed with p = 0.5 in the population at all loci.
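The two exercises above can be checked numerically. Each particular arrangement of k successes and n - k failures has probability p^k (1 - p)^(n - k), and "n choose k" counts how many such arrangements exist. The sketch below assumes Python 3.8 or later for `math.comb`; the helper name `binomial_probability` is illustrative only.

```python
from math import comb

def binomial_probability(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    # Number of arrangements, times the probability of any one arrangement.
    return comb(n, k) * p**k * (1 - p)**(n - k)

ways = comb(10, 3)                                # arrangements of 3 ones in 10 rolls
p_three_ones = binomial_probability(3, 10, 1 / 6)  # die exercise
p_sixteen_b = binomial_probability(16, 24, 0.5)    # allele exercise

print(ways)                    # 120
print(round(p_three_ones, 4))  # about 0.155
print(round(p_sixteen_b, 4))   # about 0.0438
```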

We can now formalize these calculations by defining the binomial distribution.

Binomial Distribution

If a trial has two outcomes, and each trial is independent, the probability of k successful outcomes in n trials is given by

$$b(k; n, p) = \binom{n}{k}\, p^{k} (1-p)^{n-k}$$

where p is the probability of success in each trial.

Properties of the Binomial Distribution

If you flip 100 coins, intuitively how many do you expect to come up heads?

If the coins were loaded, so that the probability of heads was only 0.1 instead of 0.5, intuitively how many heads would you expect out of 100 tosses?

Mean of the Binomial Distribution

The mean of a binomial distribution, b(k; n, p), is given by

$$E(b) = np$$

Variance of the Binomial Distribution

$$\sigma^2 = \mathrm{Var}(b(k; n, p)) = np(1-p)$$

Standard Deviation of the Binomial Distribution

$$\sigma = \mathrm{Std.\,Dev}(b(k; n, p)) = \sqrt{np(1-p)}$$
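The mean and variance formulas can be checked against the definition of the distribution by summing over every possible number of successes. This is a numerical check rather than a derivation, and the names `binomial_pmf` and `summarize` are assumptions made for this sketch.

```python
from math import comb, sqrt

def binomial_pmf(k, n, p):
    """b(k; n, p): probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def summarize(n, p):
    """Mean, variance, and standard deviation computed directly from the pmf."""
    pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]
    mean = sum(k * pk for k, pk in enumerate(pmf))
    variance = sum((k - mean) ** 2 * pk for k, pk in enumerate(pmf))
    return mean, variance, sqrt(variance)

for p in (0.5, 0.1):
    mean, variance, sd = summarize(100, p)
    # Compare with the closed forms n*p and n*p*(1-p).
    print(p, round(mean, 6), round(variance, 6), round(sd, 6))
```

For 100 flips the sums agree with np and np(1 - p): a mean of 50 and variance of 25 for the fair coin, and a mean of 10 and variance of 9 for the coin with p = 0.1.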

Histograms of the Binomial Distribution

Knowing the binomial distribution function, it is straightforward to compute the probability of each number of successes, and so to draw a graph of the entire distribution.

What is the range of the number of successes in n trials? How many possible outcomes does that entail?

Consider the distribution of the number of heads turning up in 10 flips of a fair coin. Sketch the distribution you would expect for the total number of heads out of 10.

[Blank axes for sketching: number of heads, 0 to 10]

Consider the distribution of the number of heads turning up in 10 flips of a loaded coin, where heads have only a 0.1 probability each flip. Sketch the distribution you would expect for the total number of heads out of 10.

[Blank axes for sketching: number of heads, 0 to 10]
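To compare a hand sketch with the computed distribution, the probabilities can be printed as a rough text histogram. This is a minimal sketch; the function `text_histogram` and its `width` scaling parameter are invented for illustration.

```python
from math import comb

def binomial_pmf(k, n, p):
    """b(k; n, p): probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def text_histogram(n, p, width=50):
    """One line per possible number of successes (0 through n, so n + 1 lines)."""
    lines = []
    for k in range(n + 1):
        prob = binomial_pmf(k, n, p)
        bar = "#" * round(prob * width)  # bar length proportional to probability
        lines.append(f"{k:2d} {prob:.3f} {bar}")
    return "\n".join(lines)

print(text_histogram(10, 0.5))  # fair coin: symmetric about 5
print()
print(text_histogram(10, 0.1))  # loaded coin: piled up near 0 and 1
```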

Here are some other histograms of the binomial distribution, as we change the probability of each individual success (p) and the number of trials (n).

[Figure: grid of binomial histograms, with columns p = 0.1, p = 0.3, p = 0.5 and rows n = 10, n = 20, n = 50, n = 100; the horizontal axis of each panel runs from 0 to n.]

Comment on the patterns you see in the distributions.

Normal Approximation to the Binomial Distribution

From the histograms of the binomial distribution, it seems that for large n values the binomial distribution starts to look a lot like a normal, Gaussian, or bell-curve distribution. One commonly referenced rule of thumb is:

A binomial distribution will be approximately normal in shape if both np and n(1 - p) are above 10.

Relate this observation back to the previous histograms.

Beyond its mathematical interest (why does the binomial look like the normal distribution?), we can take advantage of well-understood properties of the normal distribution in analyzing binomial data.

Theorem: For a normal distribution, the probability of an outcome within ±2 standard deviations of the mean is 95% (rounded).

[Figure: standard normal density curve; horizontal axis from -4 to 4, vertical axis from 0 to 0.4.]

Under the assumption that some binomial distributions approximate the normal distribution, express this theorem as it applies to binomial distributions.
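The rule of thumb and the ±2 standard deviation statement can be explored numerically by adding up the exact binomial probabilities that fall within two standard deviations of the mean. The sketch below is one way to do that; the function names are hypothetical, and the comparison points (n = 100 with p = 0.5, n = 10 with p = 0.1) are chosen here only as examples.

```python
from math import comb, sqrt

def binomial_pmf(k, n, p):
    """b(k; n, p): probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def meets_rule_of_thumb(n, p):
    """True when both np and n(1 - p) are above 10."""
    return n * p > 10 and n * (1 - p) > 10

def prob_within_two_sd(n, p):
    """Exact binomial probability of landing within 2 standard deviations of np."""
    mean = n * p
    sd = sqrt(n * p * (1 - p))
    low, high = mean - 2 * sd, mean + 2 * sd
    return sum(binomial_pmf(k, n, p) for k in range(n + 1) if low <= k <= high)

for n, p in ((100, 0.5), (10, 0.1)):
    print(n, p, meets_rule_of_thumb(n, p), round(prob_within_two_sd(n, p), 3))
```

When the rule of thumb is met, the exact probability should land close to 95%. Because the binomial is discrete, the match is not exact, and it shifts depending on whether the interval endpoints fall on whole numbers.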

Example: A drug is undergoing re-evaluation by Health Canada for effectiveness as an anti-fungal treatment. The manufacturer claims the drug is effective 60% of the time in killing off the fungus. Health Canada tracks 100 patients who are treated.

Sketch the probability distribution for the number of patients who are cured by the treatment, assuming the manufacturer's claims are true.

[Blank axes for sketching: number of patients cured, 0 to 100]

In this trial, only 53% of the patients are cured by the drug. Comment on how much you can trust the claimed 60% cure rate.
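One way to approach this question, assuming the normal approximation applies (np = 60 and n(1 - p) = 40 are both above 10), is to measure how many standard deviations the observed 53 cures lie from the mean expected under the manufacturer's claim. The helper name below is made up for this sketch.

```python
from math import sqrt

def sd_units_from_mean(observed, n, p):
    """Mean, standard deviation, and distance of `observed` from the mean in SD units."""
    mean = n * p
    sd = sqrt(n * p * (1 - p))
    return mean, sd, (observed - mean) / sd

mean, sd, z = sd_units_from_mean(53, 100, 0.6)
print(f"mean = {mean:.0f}, sd = {sd:.2f}, distance = {z:.2f} sd")
print(f"2-sd range: {mean - 2 * sd:.1f} to {mean + 2 * sd:.1f}")
```

Here 53 cures is about 1.4 standard deviations below the expected 60, inside the ±2 standard deviation range of roughly 50 to 70, so by this criterion the result is not unusual enough to contradict the 60% claim.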

Doubts have been raised in other countries about the same drug, so a larger trial is commissioned with 1,000 patients.

Sketch the probability distribution for the number of patients cured in this trial, assuming again a 60% curative probability for each patient.

[Blank axes for sketching: number of patients cured, 0 to 1000]

Again, only 53% of the patients are cured by the drug. Is your conclusion the same as or different from the last example, and why?
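To compare the two trials directly, the ±2 standard deviation range can be expressed as a cure rate instead of a count, by dividing the binomial standard deviation by n. The following sketch again relies on the normal approximation, and `two_sd_rate_interval` is an illustrative name.

```python
from math import sqrt

def two_sd_rate_interval(n, p):
    """Range of observed success rates within 2 SD of p, for n trials."""
    # SD of the count is sqrt(n p (1 - p)); dividing by n gives the SD of the rate.
    half_width = 2 * sqrt(p * (1 - p) / n)
    return p - half_width, p + half_width

for n in (100, 1000):
    low, high = two_sd_rate_interval(n, 0.6)
    inside = low <= 0.53 <= high
    print(f"n = {n}: {low:.3f} to {high:.3f}, 53% inside: {inside}")
```

With 100 patients the range is about 0.502 to 0.698, which contains 53%. With 1,000 patients it narrows to about 0.569 to 0.631, and 53% falls well outside it, so the same observed rate is much stronger evidence against the 60% claim in the larger trial.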