Sampling Distributions



Similar documents
Normal Distribution as an Approximation to the Binomial Distribution

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

WHERE DOES THE 10% CONDITION COME FROM?

1 Sufficient statistics

WEEK #23: Statistics for Spread; Binomial Distribution

Chapter 4. Probability and Probability Distributions

The Normal distribution

4. Continuous Random Variables, the Pareto and Normal Distributions

The Binomial Probability Distribution

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

The Math. P (x) = 5! = = 120.

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

5.1 Identifying the Target Parameter

You flip a fair coin four times, what is the probability that you obtain three heads.

An Introduction to Basic Statistics and Probability

Characteristics of Binomial Distributions

Statistics 104: Section 6!

Normal distribution. ) 2 /2σ. 2π σ

Descriptive Statistics

EMPIRICAL FREQUENCY DISTRIBUTION

Homework 4 - KEY. Jeff Brenion. June 16, Note: Many problems can be solved in more than one way; we present only a single solution here.

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

Math 461 Fall 2006 Test 2 Solutions

Lecture 5 : The Poisson Distribution

6 PROBABILITY GENERATING FUNCTIONS

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

Exploratory Data Analysis

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Chapter 5: Normal Probability Distributions - Solutions

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

STAT 830 Convergence in Distribution

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

TEACHER NOTES MATH NSPIRED

Lecture 10: Depicting Sampling Distributions of a Sample Proportion

6.4 Normal Distribution

Probability Calculator

Unit 4 The Bernoulli and Binomial Distributions

Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

The Normal Distribution

PROBABILITY AND SAMPLING DISTRIBUTIONS

Notes on Continuous Random Variables

Lecture 8. Confidence intervals and the central limit theorem

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

1 Prior Probability and Posterior Probability

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

The Binomial Distribution

2WB05 Simulation Lecture 8: Generating random variables

STAT 360 Probability and Statistics. Fall 2012

Random variables, probability distributions, binomial random variable

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

Introduction to General and Generalized Linear Models

Important Probability Distributions OPRE 6301

THE NUMBER OF GRAPHS AND A RANDOM GRAPH WITH A GIVEN DEGREE SEQUENCE. Alexander Barvinok

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Section 5 Part 2. Probability Distributions for Discrete Random Variables

MBA 611 STATISTICS AND QUANTITATIVE METHODS

2. Discrete random variables

ST 371 (IV): Discrete Random Variables

ECE302 Spring 2006 HW4 Solutions February 6,

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Statistics 100A Homework 4 Solutions

AFM Ch.12 - Practice Test

Lesson 4 Measures of Central Tendency

Topic 8. Chi Square Tests

10.2 Series and Convergence

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 12: June 22, Abstract. Review session.

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

Principle of Data Reduction

Bootstrap Example and Sample Code

Chapter 5. Random variables

Practice problems for Homework 11 - Point Estimation

Generating Random Data. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

The Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

2 Binomial, Poisson, Normal Distribution

Hypothesis Testing for Beginners

The Normal Distribution

Math 151. Rumbos Spring Solutions to Assignment #22

Lecture 6: Discrete & Continuous Probability and Random Variables

9.2 Summation Notation

COMP 250 Fall 2012 lecture 2 binary representations Sept. 11, 2012

5.4 Solving Percent Problems Using the Percent Equation

Week 3&4: Z tables and the Sampling Distribution of X

Lesson 17: Margin of Error When Estimating a Population Proportion

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment Statistics and Probability Example Part 1

ECE302 Spring 2006 HW3 Solutions February 2,

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

X: Probability:

1.5 Oneway Analysis of Variance

BODY OF KNOWLEDGE CERTIFIED SIX SIGMA YELLOW BELT

Covariance and Correlation

Binomial Distribution n = 20, p = 0.3

Pr(X = x) = f(x) = λe λx

Transcription:

Sampling Distributions You have seen probability distributions of various types. The normal distribution is an example of a continuous distribution that is often used for quantitative measures such as weights, heights, etc. The binomial distribution is an example of a discrete distribution because the possible outcomes are only a discrete set of values,, 1,, n. The value of the binomial random variable is the number of successes out of a random sample of n trials, in which the probability of success on a particular trail is π.

The Binomial Distribution as a Sampling Distributions The nature of the binomial distribution makes it a sampling distribution. In other words, the value of the binomial random variable is derived from a sample of size n from a population. You can think of the population as being a set containing s and 1 s, with a 1 representing a success and representing a failure. (The term success does not necessarily imply something good, nor does failure imply something bad.) The sample is obtained as a sequence of n trials, with each trial being a draw from the population of s and 1 s. (Such trials are called Bernoulli trials.) On the first trial, you draw a value from {, 1}, with P(1)=π. Then you draw again in the same way, and do this repeatedly a total of n times. So your sample will be a set such as {1, 1,, 1,, 1} in the case for n=6. This set has 4 1 s and 2 s, so the value of the binomial random variable is y=4. The value of the binomial distribution is the sum of the outcomes from the trials; that is, the number of 1 s.

The Binomial Distribution as a Sampling Distribution Recall that the binomial probability formula is where P(y successes in n trials) = n! (1 ) y!( n y)! π π y n y and π = probability of success on single trial n = number or trials. Mean of the binomial distribution: nπ Variance of the binomial distribution: nπ (1 π ) This probability is derived from the sampling distribution of the number of successes that would result from a very large number of samples.

The Binomial distribution as a Sampling Distribution Consider a population that consists 6% of 1 s and 4% of s. If you draw a value at random, you get a 1 with probability.6 and a with probability.4. Such a draw would constitute a Bernoulli trial with P(1)=.6. Suppose you draw a sample of size n and add up the value you obtain. This would give you a binomial random variable. Now suppose you do this again and again for a very large number of samples. The conceptual results make up a conceptual population. Here are the histograms that correspond to the population for values of n equal to 1, 2, 3, 6, 1, and 2. Notice how the shapes of the histograms change as n increases. The distributions become more mound-shaped and symmetric.

Histotram of B(1,.6) Distribution 7 6 5 4 3 2 1 1 Histogram of B(2,.6) Distribution.6.5.4.3.2.1 1 2 Histogram of B(3,.6) Distribution.5.4.3.2.1 1 2 3

Histogram of B(6,.6) Distribution.35.3.25.2.15.1.5 1 2 3 4 5 6 Histogram of B(1,.6) Distribution.3.25.2.15.1.5 1 2 3 4 5 6 7 8 9 1 Histogram of.2.15.1.5 1 2 3 4 5 6 7 8 9 1 11 12 13 14 15 16 17 18 19 2

The Binomial Sampling Distribution and Statistical Inference Here are the values of P(y=k) and P(y<=k) for n=2 and k=6-17 (P(y=k)<.5 for n<6 or n>17): 6 7 8 9 1 11 12 13 14 15 16 17 P(y=k).5.15.35.71.117.16.18.165.124.75.35.12 P(y k).6.21.57.128.245.44.584.75.874.949.984.996 Remember that these are the probabilities for π=.6. Suppose you are sampling from a distribution with unknown π. If you drew sample with y=12, would you have reason to doubt that π=.6? (Note: y/n=.6) If you drew sample with y=1, would you have reason to doubt that π=.6? (Note: y/n=.5) If you drew sample with y=6, would you have reason to doubt that π=.6? (Note: y/n=.3)

The Binomial Sampling Distribution and Statistical Inference Here are the numeric values of P(y=k) for n=1, 3 decimal places: 1 2 3 4 5 6 7 8 9 1 P(y= k)..2.11.42.111.21.251.215.121.4.6 P(y k)..2.12.55.166.367.618.833.954.994 1. Remember that these are the probabilities for π=.6. Suppose you are sampling from a distribution with unknown π. If you drew sample with y=6, would you have reason to doubt that π=.6? (Note: y/n=.6) If you drew sample with y=5, would you have reason to doubt that π=.6? (Note: y/n=.5) If you drew sample with y=3, would you have reason to doubt that π=.6? (Note: y/n=.3)

Normal Distributions and Sampling Distributions One of the most useful statistics is the sample mean, y. Statistical inference based on y is derived from its sampling distribution. If the population from which the sample was obtained is normally distributed with mean µ and variance σ 2, then the sampling distribution of y is normal. Moreover, the mean of the sampling has mean µ and variance σ 2 /n. If 2 2 yi ~ N ( µ, σ ), i=1,,n, then y ~ N( µσ, / n).

Normal Distributions and Sampling Distributions If the distribution from which the samples were obtained is not normal, then the sampling distribution of y is only approximately normal, but the distribution becomes more nearly normal as n increases. This is the Central limit Theorem. Central Limit Theorem: If y is a random variable with mean µ and variance σ 2, then the sampling distribution of y is approximately normal with mean µ and variance σ 2 /n.