Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Similar documents
Random variables, probability distributions, binomial random variable

You flip a fair coin four times, what is the probability that you obtain three heads.

4. Continuous Random Variables, the Pareto and Normal Distributions

Probability density function : An arbitrary continuous random variable X is similarly described by its probability density function f x = f X

Important Probability Distributions OPRE 6301

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Probability Distributions

The Normal Distribution. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

Chapter 4 Lecture Notes

MAS108 Probability I

Math 431 An Introduction to Probability. Final Exam Solutions

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Chapter 5. Random variables

Normal distribution. ) 2 /2σ. 2π σ

Statistics 100A Homework 4 Solutions

Normal Approximation. Contents. 1 Normal Approximation. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College

An Introduction to Basic Statistics and Probability

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Math 151. Rumbos Spring Solutions to Assignment #22

Lecture 8: More Continuous Random Variables

ECE302 Spring 2006 HW4 Solutions February 6,

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Math 461 Fall 2006 Test 2 Solutions

Stats on the TI 83 and TI 84 Calculator

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference

Review of Random Variables

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

Lecture 8. Confidence intervals and the central limit theorem

Joint Exam 1/P Sample Exam 1

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Introduction to Probability

0 x = 0.30 x = 1.10 x = 3.05 x = 4.15 x = x = 12. f(x) =

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Notes on Continuous Random Variables

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 2 Solutions

STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Exploratory Data Analysis

Probability Distributions

5. Continuous Random Variables

Section 5 Part 2. Probability Distributions for Discrete Random Variables

WHERE DOES THE 10% CONDITION COME FROM?

Chapter 9 Monté Carlo Simulation

e.g. arrival of a customer to a service station or breakdown of a component in some system.

The normal approximation to the binomial

The normal approximation to the binomial

The Math. P (x) = 5! = = 120.

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability Distribution for Discrete Random Variables

3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

ECE302 Spring 2006 HW3 Solutions February 2,

Probability. Distribution. Outline

Order Statistics. Lecture 15: Order Statistics. Notation Detour. Order Statistics, cont.

WEEK #22: PDFs and CDFs, Measures of Center and Spread

6 PROBABILITY GENERATING FUNCTIONS

The Binomial Probability Distribution

6.4 Normal Distribution

Chapter 5. Discrete Probability Distributions

10.1. Solving Quadratic Equations. Investigation: Rocket Science CONDENSED

The Exponential Distribution

ECE302 Spring 2006 HW5 Solutions February 21,

Random variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Chapter 4. Probability Distributions

LOGNORMAL MODEL FOR STOCK PRICES

Sums of Independent Random Variables

Approximation of Aggregate Losses Using Simulation

Example 1: Dear Abby. Stat Camp for the Full-time MBA Program

WEEK #23: Statistics for Spread; Binomial Distribution

TEACHER NOTES MATH NSPIRED

Stat 20: Intro to Probability and Statistics

Chapter 3 RANDOM VARIATE GENERATION

AMS 5 CHANCE VARIABILITY

Master s Theory Exam Spring 2006

SUNY ECC. ACCUPLACER Preparation Workshop. Algebra Skills

PROBABILITY AND SAMPLING DISTRIBUTIONS

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

The Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?

ST 371 (IV): Discrete Random Variables

13.0 Central Limit Theorem

TABLE OF CONTENTS. 4. Daniel Markov 1 173

Normal Distribution as an Approximation to the Binomial Distribution

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Lecture Notes 1. Brief Review of Basic Probability

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

Statistical Distributions in Astronomy

The Normal Distribution

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Binomial Distribution n = 20, p = 0.3

Negative Binomial. Paul Johnson and Andrea Vieux. June 10, 2013

Lecture 6: Discrete & Continuous Probability and Random Variables

Transformations and Expectations of random variables

Math 0980 Chapter Objectives. Chapter 1: Introduction to Algebra: The Integers.

Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution

2 Binomial, Poisson, Normal Distribution

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

Transcription:

1.1.2 Normal distribution 1.1.3 Approimating binomial distribution by normal 2.1 Central Limit Theorem Prof. Tesler Math 283 October 22, 214 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 1 / 36

Normal distribution a.k.a. Bell curve and Gaussian distribution The normal distribution is a continuous distribution. Parameters: µ = mean (center) σ = standard deviation (width) ( ) PDF: f X () = 1 σ ep ( µ)2 for < <. 2π 2σ 2 Normal distribution N(2, 5): µ = 2, σ = 5..4.8 Normal µ µ ± σ 1 2 3 4 The normal distribution is symmetric about = µ, so median = mean = µ. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 2 / 36

Applications of normal distribution Applications Many natural quantities are modelled by it: e.g., a histogram of the heights or weights of everyone in a large population often follows a normal distribution. Many distributions such as binomial, Poisson,... are closely approimated by it when the parameters are large enough. Sums and averages of huge quantities of data are often modelled by it. Coverage in DNA sequencing Illumina GA II sequencing of E. coli at 6 coverage. Chitsaz et al. (211), Nature Biotechnology % of positions with coverage..2.4.6.8 1. Empirical distribution of coverage 2 4 6 8 1 Coverage Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 3 / 36

Cumulative distribution function It turns out that the integral for total probability does equal 1: ) 1 ( σ 2π ep ( µ)2 2σ 2 d = 1 However, it s a tricky integral; see a Calculus tetbook in the section on double integrals in polar coordinates. The cumulative distribution function is the integral F X () = P(X ) = ) 1 ( σ 2π ep (t µ)2 2σ 2 dt This integral cannot be done symbollically in terms of the usual functions (polynomials, eponentials, logs, trig functions, etc.). It can be done via numerical integration or Taylor series. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 4 / 36

Standard normal distribution Standard normal distribution N(, 1): µ =, σ = 1 CDF of standard normal distribution..2.4 Normal µ µ ± σ cdf..4.8 Normal µ µ ± σ 4 2 2 4 4 2 2 4 z The standard normal distribution is the normal distribution for µ =, σ = 1. Use the variable name Z: PDF: φ(z) = f Z (z) = e z2 /2 2π for < z < CDF: Φ(z) = F Z (z) = P(Z z) = 1 z e t2 /2 dt 2π The integral can t be done in terms of ordinary functions, but it can be done using numerical methods. In the past, people used lookup tables. We ll use functions for it in Matlab and R. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 5 / 36 z

Matlab and R commands For the standard normal: Φ(1.96).975 Φ 1 (.975) 1.96 Matlab: normcdf(1.96) norminv(.975) R: pnorm(1.96) qnorm(.975) We will see shortly how to convert between an arbitrary normal distribution (any µ, σ) and the standard normal distribution. The commands above allow additional arguments to specify µ and σ, e.g., normcdf(1.96,,1). R also can work with the right tail directly: pnorm(1.96, lower.tail = FALSE).975 qnorm(.975, lower.tail = FALSE) 1.96 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 6 / 36

Standard normal distribution areas.4 Standard Normal Curve.3.2.1!5 a b 5 z The area between z = a and z = b is P(a Z b) = 1 2π b a e t2 /2 dt = Φ(b) Φ(a) P(1.51 Z 1.62) = Φ(1.62) Φ(1.51) =.9474.9345 =.129 Matlab: normcdf(1.62) normcdf(1.51) R: pnorm(1.62) pnorm(1.51) Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 7 / 36

Standard normal distribution symmetries of areas.4 Area! on the right "#$ &'()*!*+,*-.(*/(-.2 12 "#! z!!2 2 z "!%!!! " %! Area right of z is P(Z > z) = 1 Φ(z). By symmetry, the area left of z and the area right of z are equal: Φ( z) = 1 Φ(z) Φ( 1.51) = 1 Φ(1.51) = 1.9345 =.655 Area between z = ±1.51 is Φ(1.51) Φ( 1.51) = 2Φ(1.51) 1.869 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 8 / 36

Central area Area between z = ±1 is 68.27%. Area between z = ±2 is 95.45%. Area between z = ±3 is 99.73%..4.2 Area! split half on each tail!z!/2 z!/2!2 z 2 Find the center part containing 95% of the area Put 2.5% of the area at the upper tail, 2.5% at the lower tail, and 95% in the middle. The value of z putting 2.5% at the top gives Φ(z) = 1.25 =.975. Notation: z.25 = 1.96. The area between z = ±1.96 is about 95%. For 99% in the middle,.5% on each side, use z.5 2.58. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 9 / 36

Areas on normal curve for arbitrary µ, σ P(a X b) = b a ) 1 ( σ 2π ep ( µ)2 2σ 2 d Substitute z = µ σ (or = σz + µ) into the integral to turn it into the standard normal integral: ( a µ P X µ b µ ) ( a µ = P Z b µ ) σ σ σ σ σ ( ) ( ) b µ a µ = Φ Φ σ σ The z-score of is z = µ σ. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 1 / 36

Binomial distribution Compute P(43 X 51) when n = 6, p = 3/4 Binomial: n = 6, p = 3/4 k P(X = k) = ( ) 6 k (.75) k (.25) 6 k 43.9562 44.1183 45.11822 46.11565 47.1335 48.8397 49.6169 5.471 51.2395 Total.7544 Mean µ = np = 6(3/4) = 45 Standard deviation σ= np(1 p) = 6(3/4)(1/4) = 11.25 3.35411966 Mode (k with ma ) np + p = 6(3/4) + (3/4) = 45 3 4 = 45 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 11 / 36

Mode of a distribution The mode of random variable X is the value k at which the is maimum. Mode of binomial distribution when < p < 1 The mode is (n + 1)p. Eception: If (n + 1)p is an integer then (n + 1)p and (n + 1)p 1 are tied as the mode. The mode is within 1 of the mean np. When np is an integer, the mode equals the mean. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 12 / 36

Binomial and normal distributions Binomial Normal approimation to binomial k P(X = k).15 Binomial: n=6, p=3/4 Binomial P(43! X! 51) 43.9562 44.1183 Normal: µ=45,"=3.35 45.11822.1 46.11565 47.1335 48.8397 49.6169.5 5.471 51.2395 Total.7544 41 43 45 47 49 51 53 55 P(X = k) shown as a rectangle: height P(X = k), etent k ± 1/2. The binomial distribution is only defined at the integers, and is very close to the normal distribution shown. We will approimate the probability P(43 X 51) we had above by the corresponding one for the normal distribution. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 13 / 36

Normal approimation to binomial, step 1 Compute corresponding parameters We want to approimate P(a X b) in a binomial distribution. We ll use n = 6, p = 3/4 and approimate P(43 X 51). Determine µ, σ: µ = np = 6(3/4) = 45 σ = np(1 p) = 11.25 3.354 The normal distribution with those same values of µ, σ is a good approimation to the binomial distribution provided µ ± 3σ are both between and n. Check: µ 3σ 45 3(3.354) = 34.938 µ + 3σ 45 + 3(3.354) = 55.62 are both between and 6, so we may proceed. Note: Some applications are more strict and may require µ ± 5σ or more to be between and n. Since µ + 5σ 61.771, this would fail at that level of strictness. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 14 / 36

Normal approimation to binomial, step 2 Continuity correction Normal approimation to binomial.15.1.5 Binomial: n=6, p=3/4 Binomial P(43! X! 51) Normal: µ=45,"=3.35 The binomial distribution is discrete (X = integers) but the normal distribution is continuous. 41 43 45 47 49 51 53 55 The sum P(X = 43) + + P(X = 51) has 9 terms, corresponding to the area of the 9 rectangles in the picture. The area under the normal distribution curve from 42.5 X 51.5 approimates the area of those rectangles. Change P(43 X 51) to P(42.5 X 51.5). Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 15 / 36

Normal approimation to binomial, steps 3 4 3. Convert to z-scores 4. Use the normal distribution to approimately evaluate it For random variable X with mean µ and standard deviation σ, The z-score of a value is z = E(X) SD(X) The random variable Z is Z = X E(X) SD(X) Convert to z-scores: P(42.5 X 51.5) = P = µ σ. = X µ σ. ( 42.5 45 11.25 X 45 11.25 ) 51.5 45 11.25 = P(.7453559926 Z 1.937925581) Approimate this by the standard normal distribution cdf: Φ(1.937925581) Φ(.7453559926).7456555785 This is close to the true answer (apart from rounding errors) P(43 X 51) =.7544 we got from the binomial distribution. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 16 / 36

Estimating fraction of successes instead of number of successes What is the value of p in the binomial distribution? Estimate it: flip a coin n times and divide the # heads by n. Let X = binomial distribution for n flips, probability p of heads. Let X = X/n be the fraction of flips that are heads. X is discrete, with possible values n, 1 n, 2 n,..., n n {(. n ) P(X = k n ) = P(X = k) = k p k (1 p) n k for k =, 1,..., n; otherwise. Mean E(X) = E(X/n) = E(X)/n = np/n = p. Variance Var(X) = Var ( ) X n = Var(X) n 2 Standard deviation SD(X) = p(1 p)/n. = np(1 p) n 2 = p(1 p) n. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 17 / 36

Normal approimation for fraction of successes n flips, probability p of heads, X=observed fraction of heads Mean E(X) = p Variance Var(X) = p(1 p)/n Standard deviation SD(X) = p(1 p)/n The Z transformation of X is Z = X E(X) SD(X) = X p p(1 p)/n and value X = has z-score z = p p(1 p)/n. For k heads in n flips, The z-score of X = k is z 1 = k np. np(1 p) The z-score of X = k/n is z 2 = (k/n) p p(1 p)/n. These are equal! Divide the numerator and denominator of z 1 by n to get z 2. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 18 / 36

Normal approimation for fraction of successes For n = 6 flips of a coin with p = 3 4, we ll estimate P ( 43 6 The eact answer equals P(43 X 51).7544. X 51 6). Step 1: Determine mean and SD E(X) = p =.75 SD(X) = p(1 p)/n = (.75)(.25)/6 =.3125.559 Verify approimation is valid: Mean ± 3 SD between and 1 Mean 3 SD =.58229 Mean + 3 SD =.9177 Both are between and 1. Step 2: Continuity correction P ( ) ( 43 51 6 X 6 = P 42.5 6 Step 3: z-scores X 51.5 6 Step 4: Evaluate approimate answer using normal distribution Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 19 / 36 )

Normal approimation for fraction of successes P ( 43 6 ) ( 51 X 6 = P 42.5 6 X 51.5 6 = P(.7833 X.85833) (.7833 E(X) = P SD(X) ) X E(X) SD(X) = P (.7833.75.559 Z.85833.75.559 = P(.74535 Z 1.93792).85833 E(X) SD(X) ) ) = Φ(1.93792) Φ(.74535).74565 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 2 / 36

Mean and SD of sums and averages of i.i.d. random variables Let X 1,..., X n be n i.i.d. (independent identically distributed) random variables, each with mean µ and standard deviation σ. Let S n = X 1 + + X n be their sum and X n = (X 1 + + X n )/n = S n /n be their average. Means: Sum: E(S n ) = E(X 1 ) + + E(X n ) = n E(X 1 ) = nµ Avg: E(X n ) = E(S n /n) = nµ/n = µ Variance: Sum: Var(S n ) = Var(X 1 ) + + Var(X n ) = n Var(X 1 ) = nσ 2 Avg: Var(X n ) = Var(S n )/n 2 = nσ 2 /n 2 = σ 2 /n Standard deviation: Sum: SD(S n ) = σ n Avg: SD(X n ) = σ/ n Terminology for different types of standard deviation The standard deviation (SD) of a trial (each X i ) is σ The standard error (SE) of the sum is σ n The standard error (SE) of the average is σ/ n Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 21 / 36

Theorem (Central Limit Theorem abbreviated CLT) For n i.i.d. random variables X 1,..., X n with sum S n = X 1 + + X n and average X n = S n /n, and any real numbers a < b, P ( a S n nµ σ n ) b = P ( a X ) n µ σ/ n b Φ(b) Φ(a) if n is large enough. As n, the approimation becomes eact equality. 1.5 Binomial n=1,p=.75; µ=.75,!=.43.5 1 Binomial n=6,p=.75; µ=45.,!=3.35.15.1.5 2 4 6 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 22 / 36

Interpretation of Central Limit Theorem As n increases, the more and more closely resembles a normal distribution. However, the is defined as in-between the red points shown, if it s a discrete distribution. The cdfs are approimately equal everywhere on the continuum. Probabilities of intervals for sums or averages of enough i.i.d. variables can be approimately evaluated using the normal distribution. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 23 / 36

Repeated rolls of a die One roll: µ = 3.5, σ = 35/12 1.71.3 Average of 1 roll of die; µ=3.5,!=1.71.2 Average of 2 rolls of die; µ=3.5,!=1.21.2.15.1.1.5 2 4 6 2 4 6.15.1.5 Average of 3 rolls of die; µ=3.5,!=.99.3.2.1 Average of 1 rolls of die; µ=3.5,!=.17 Die average Normal dist. µ µ±! 2 4 6 2 4 6 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 24 / 36

Repeated rolls of a die Find n so that at least 95% of the time, the average of n rolls of a die is between 3 and 4. ( ) P(3 X 4) = P 3 µ σ/ X µ n σ/ 4 µ n σ/ n Plug in µ = 3.5 and σ = 35/12. ( P(3 X 4) = P 1/2 Z 35/(12n) ) 1/2 35/(12n) Recall the center 95% of the area on the standard normal curve is between z = ±1.96. 1/2 1.96 n (1.96) 2 35/12 44.81 35/(12n) (1/2) 2 n is an integer so n 45 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 25 / 36

Sawtooth distribution (made up as demo) One trial: µ = 4, σ 2.24 Average of 1 trial; µ=4.,!=2.24 Average of 2 trials; µ=4.,!=1.58.25.2.15.1.5 2 4 6 8 Average of 3 trials; µ=4.,!=1.29.15.1.5 2 4 6 8 Average of 1 trials; µ=4.,!=.22.1.5 2 4 6 8.2.15.1.5 2 4 6 8 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 26 / 36

Binomial distribution (n, p) A Bernoulli trial is to flip a coin once and count the number of heads, { 1 probability p; X 1 = probability 1 p. Mean E(X 1 ) = p, standard deviation SD(X 1 ) = p(1 p). The binomial distribution is the sum of n i.i.d. Bernoulli trials. Mean µ = np, standard deviation σ = np(1 p). The binomial distribution is approimated pretty well by the normal distribution when µ ± 3σ are between and n. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 27 / 36

Binomial distribution (n, p) One flip: µ = p =.75, σ = p(1 p) =.1875.433 Binomial n=1,p=.75; µ=.75,!=.43 1.5 Binomial n=6,p=.75; µ=4.5,!=1.6.4.2.5 1 Binomial n=3,p=.75; µ=22.5,!=2.37.2.1 1 2 3 2 4 6 Binomial n=6,p=.75; µ=45.,!=3.35.15.1.5 2 4 6 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 28 / 36

Poisson distribution (µ or µ = λ d) Mean: µ (same as the Poisson parameter) Standard deviation: σ = µ. It is approimated pretty well by the normal distribution when µ 5. The reason the Central Limit Theorem applies is that a Poisson distribution with parameter µ equals the sum of n i.i.d. Poissons with parameter µ/n. The Poisson distribution has infinite range =, 1, 2,... and the normal distribution has infinite range < < (reals). Both are truncated in the plots. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 29 / 36

Poisson distribution (µ) Poisson µ=1;!=1..4.2.2.1 Poisson µ=6;!=2.45 5 1.8.6.4.2 Poisson µ=3;!=5.48 2 4 6 8 5 1 15.6.4.2 Poisson µ=6;!=7.75 5 1 15 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 3 / 36

Geometric and negative binomial distributions Geometric distribution (p) X is the number of flips { until the first heads, (1 p) 1 p if = 1, 2, 3,... ; p X () = otherwise. The plot doesn t resemble the normal distribution at all. Mean: µ = 1/p Negative binomial distribution (r, p) Standard deviation: σ = 1 p/p r = 1 is same as geometric distribution. r > 2: The has a bell -like shape, but is not close to the normal distribution unless r is very large. Mean: µ = r/p Standard deviation: σ = r(1 p)/p Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 31 / 36

Geometric and negative binomial distributions Heads with probability p =.1 Geometric p=.1; µ=1.,!=9.49.1.5 Neg. bin. r=6,p=.1; µ=6.,!=23.24.2.1.5 1 2 3 Neg. bin. r=3,p=.1; µ=3.,!=51.96.1 1 2 3 4 4 2 5 1 Neg. bin. r=6,p=.1; µ=6.,!=73.48 6 1!3 2 4 6 8 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 32 / 36

Eponential and gamma distributions Eponential distribution (λ) The eponential distribution doesn t resemble the normal distribution at all. Mean: µ = 1/λ Standard deviation: σ = 1/λ Gamma distribution (r, λ) The gamma distribution for r = 1 is the eponential distribution. The gamma distribution for r > 1 does have a bell -like shape, but it is not close to the normal distribution until r is very large. There is a generalization to allow r to be real numbers, not just integers. Mean: µ = r/λ Standard deviation: σ = r/λ Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 33 / 36

Eponential and gamma distributions Rate λ =.1 Eponential!=.1; µ=1.,"=1..1.5 Gamma r=6,p=.1; µ=6.,!=24.49.2.1 6 4 2 1 2 3 4 Gamma r=3,p=.1; µ=3.,!=54.77 8 1!3 1 2 3 4 4 2 5 1 Gamma r=6,p=.1; µ=6.,!=77.46 6 1!3 2 4 6 8 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 34 / 36

Geometric/Negative binomial vs. Eponential/Gamma p = λ gives same means for geometric and eponential. p = 1 e λ gives same eponential decay rate for both geometric and eponential distributions. 1 e λ λ when λ is small. This corespondence carries over to the gamma and negative binomial distributions. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 35 / 36

Geometric/negative binomial vs. Eponential/gamma This is for p =.1 vs. λ =.1; a better fit for λ =.1 would be p = 1 e λ.95 Geometric p=.1; µ=1.,!=9.49 Eponential!=.1; µ=1.,"=1..1.5.1.5.5 1 2 3 Neg. bin. r=3,p=.1; µ=3.,!=51.96.1 1 2 3 4 6 4 2 1 2 3 4 Gamma r=3,p=.1; µ=3.,!=54.77 8 1!3 1 2 3 4 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / October 22, 214 36 / 36