An Introduction to Basic Statistics and Probability



Similar documents
Chapter 5. Random variables

Notes on Continuous Random Variables

Lecture 6: Discrete & Continuous Probability and Random Variables

5. Continuous Random Variables

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

The Binomial Probability Distribution

Normal distribution. ) 2 /2σ. 2π σ

Math 151. Rumbos Spring Solutions to Assignment #22

Chapter 4. Probability and Probability Distributions

ST 371 (IV): Discrete Random Variables

Chapter 4 Lecture Notes

The Normal Distribution. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

4. Continuous Random Variables, the Pareto and Normal Distributions

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

Probability Distribution for Discrete Random Variables

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber

Pr(X = x) = f(x) = λe λx

University of California, Los Angeles Department of Statistics. Random variables

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

Normal Distribution as an Approximation to the Binomial Distribution

Random variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.

Probability Distributions

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

WHERE DOES THE 10% CONDITION COME FROM?

M 1313 Review Test 4 1

WEEK #22: PDFs and CDFs, Measures of Center and Spread

EMPIRICAL FREQUENCY DISTRIBUTION

Bayes Theorem. Bayes Theorem- Example. Evaluation of Medical Screening Procedure. Evaluation of Medical Screening Procedure

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Week 3&4: Z tables and the Sampling Distribution of X

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

Exploratory Data Analysis

Point and Interval Estimates

Sample Questions for Mastery #5

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 18. A Brief Introduction to Continuous Probability

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Practice problems for Homework 11 - Point Estimation

Joint Exam 1/P Sample Exam 1

Section 6.1 Discrete Random variables Probability Distribution

Math 461 Fall 2006 Test 2 Solutions

Section 5.1 Continuous Random Variables: Introduction

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment Statistics and Probability Example Part 1

Random variables, probability distributions, binomial random variable

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) B) C) D) 0.

STAT 830 Convergence in Distribution

2 Binomial, Poisson, Normal Distribution

Lecture 7: Continuous Random Variables

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

東 海 大 學 資 訊 工 程 研 究 所 碩 士 論 文

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

Chapter 5: Normal Probability Distributions - Solutions

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability Distributions

Section 6.2 Definition of Probability

Section 5 Part 2. Probability Distributions for Discrete Random Variables

Chapter 4. Probability Distributions

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Key Concept. Density Curve

Stats on the TI 83 and TI 84 Calculator

Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution

Lecture 10: Depicting Sampling Distributions of a Sample Proportion

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

6.4 Normal Distribution

Dongfeng Li. Autumn 2010

Sample Term Test 2A. 1. A variable X has a distribution which is described by the density curve shown below:

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Math 370, Spring 2008 Prof. A.J. Hildebrand. Practice Test 1 Solutions

MBA 611 STATISTICS AND QUANTITATIVE METHODS

PROBABILITY AND SAMPLING DISTRIBUTIONS

Lecture 5 : The Poisson Distribution

1 Sufficient statistics

Chapter 5 Discrete Probability Distribution. Learning objectives

Lesson 20. Probability and Cumulative Distribution Functions

Probability for Estimation (review)

You flip a fair coin four times, what is the probability that you obtain three heads.

Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

2.6. Probability. In general the probability density of a random variable satisfies two conditions:

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Example 1: Dear Abby. Stat Camp for the Full-time MBA Program

Math 431 An Introduction to Probability. Final Exam Solutions

Random Variables. Chapter 2. Random Variables 1

Unit 4 The Bernoulli and Binomial Distributions

THE BINOMIAL DISTRIBUTION & PROBABILITY

AP Statistics Solutions to Packet 2

Sampling Distributions

Binomial Distribution n = 20, p = 0.3

Lecture 8: More Continuous Random Variables

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

Transcription:

An Introduction to Basic Statistics and Probability Shenek Heyward NCSU An Introduction to Basic Statistics and Probability p. 1/4

Outline Basic probability concepts Conditional probability Discrete Random Variables and Probability Distributions Continuous Random Variables and Probability Distributions Sampling Distribution of the Sample Mean Central Limit Theorem An Introduction to Basic Statistics and Probability p. 2/4

Idea of Probability Chance behavior is unpredictable in the short run, but has a regular and predictable pattern in the long run. The probability of any outcome of a random phenomenom is the proportion of times the outcome would occur in a very long series of repetitions. An Introduction to Basic Statistics and Probability p. 3/4

Terminology Sample Space - the set of all possible outcomes of a random phenomenon Event - any set of outcomes of interest Probability of an event - the relative frequency of this set of outcomes over an infinite number of trials Pr(A) is the probability of event A An Introduction to Basic Statistics and Probability p. 4/4

Example Suppose we roll two die and take their sum S = {2, 3, 4, 5,.., 11, 12} Pr(sum = 5) = 4 36 Because we get the sum of two die to be 5 if we roll a (1,4),(2,3),(3,2) or (4,1). An Introduction to Basic Statistics and Probability p. 5/4

Notation Let A and B denote two events. A B is the event that either A or B or both occur. A B is the event that both A and B occur simultaneously. The complement of A is denoted by A. A is the event that A does not occur. Note that Pr(A) = 1 Pr(A). An Introduction to Basic Statistics and Probability p. 6/4

Definitions A and B are mutually exclusive if both cannot occur at the same time. A and B are independent events if and only if Pr(A B) = Pr(A) Pr(B). An Introduction to Basic Statistics and Probability p. 7/4

Laws of Probability Multiplication Law: If A 1,,A k are independent events, then Pr(A 1 A 2 A k ) = Pr(A 1 ) Pr(A 2 ) Pr(A k ). Addition Law: If A and B are any events, then Pr(A B) = Pr(A) + Pr(B) Pr(A B) Note: This law can be extended to more than 2 events. An Introduction to Basic Statistics and Probability p. 8/4

Conditional Probability The conditional probability of B given A Pr(B A) = Pr(A B) Pr(A) A and B are independent events if and only if Pr(B A) = Pr(B) = Pr(B A) An Introduction to Basic Statistics and Probability p. 9/4

Random Variable A random variable is a variable whose value is a numerical outcome of a random phenomenon Usually denoted by X, Y or Z. Can be Discrete - a random variable that has finite or countable infinite possible values Example: the number of days that it rains yearly Continuous - a random variable that has an (continuous) interval for its set of possible values Example: amount of preparation time for the SAT An Introduction to Basic Statistics and Probability p. 10/4

Probability Distributions The probability distribution for a random variable X gives the possible values for X, and the probabilities associated with each possible value (i.e., the likelihood that the values will occur) The methods used to specify discrete prob. distributions are similar to (but slightly different from) those used to specify continuous prob. distributions. An Introduction to Basic Statistics and Probability p. 11/4

Probability Mass Function f(x) - Probability mass function for a discrete random variable X having possible values x 1,x 2, f(x i ) = Pr(X = x i ) is the probability that X has the value x i Properties 0 f(x i ) 1 i f(x i) = f(x 1 ) + f(x 2 ) + = 1 f(x i ) can be displayed as a table or as a mathematical function An Introduction to Basic Statistics and Probability p. 12/4

Probability Mass Function Example: (Moore p. 244)Suppose the random variable X is the number of rooms in a randomly chosen owner-occupied housing unit in Anaheim, California. The distribution of X is: Rooms X 1 2 3 4 5 6 7 Probability.083.071.076.139.210.224.197 An Introduction to Basic Statistics and Probability p. 13/4

Parameters vs. Statistics A parameter is a number that describes the population. Usually its value is unknown. A statistic is a number that can be computed from the sample data without making use of any unknown parameters. In practice, we often use a statistic to estimate an unknown parameter. An Introduction to Basic Statistics and Probability p. 14/4

Parameter vs. Statistic Example For example, we denote the population mean by µ, and we can use the sample mean x to estimate µ. Suppose we wanted to know the average income of households in NC. To estimate this population mean income µ, we may randomly take a sample of 1000 households and compute their average income x and use this as an estimate for µ. An Introduction to Basic Statistics and Probability p. 15/4

Expected Value Expected Value of X or (population) mean µ = E(X) = R x i Pr(X = x i ) = R x i f(x i ), i=1 i=1 where the sum is over R possible values. R may be finite or infinite. Analogous to the sample mean x Represents the "average" value of X An Introduction to Basic Statistics and Probability p. 16/4

Variance (Population) variance σ 2 = V ar(x) R = (x i µ) 2 Pr(X = x i ) i=1 R = x 2 i Pr(X = x i ) µ 2 i=1 Represents the spread, relative to the expected value, of all values with positive probability The standard deviation of X, denoted by σ, is the square root of its variance. An Introduction to Basic Statistics and Probability p. 17/4

Room Example For the Room example, find the following E(X) V ar(x) Pr [a unit has at least 5 rooms] An Introduction to Basic Statistics and Probability p. 18/4

Binomial Distribution Structure Two possible outcomes: Success (S) and Failure (F). Repeat the situation n times (i.e., there are n trials). The "probability of success," p, is constant on each trial. The trials are independent. An Introduction to Basic Statistics and Probability p. 19/4

Binomial Distribution Let X = the number of S s in n independent trials. (X has values x = 0, 1, 2,,n) Then X has a binomial distribution with parameters n and p. The binomial probability mass function is ( ) n Pr(X = x) = p x (1 p) n x, x = 0, 1, 2,,n x Expected Value: µ = E(X) = np Variance: σ 2 = V ar(x) = np(1 p) An Introduction to Basic Statistics and Probability p. 20/4

Example Example: (Moore p.306) Each child born to a particular set of parents has probability 0.25 of having blood type O. If these parents have 5 children, what is the probability that exactly 2 of them have type O blood? Let X= the number of boys ( ) 5 Pr(X = 2) = f(2) = (.25) 2 (.75) 3 =.2637 2 An Introduction to Basic Statistics and Probability p. 21/4

Example What is the expected number of children with type O blood? µ = 5(.25) = 1.25 What is the probability of at least 2 children with type O blood? Pr(X 2) = 5 k=2 = 1 ( ) 5 (.25) k (.75) 5 k k 1 k=0 =.3671875 ( ) 5 (.25) k (.75) 5 k k An Introduction to Basic Statistics and Probability p. 22/4

Continuous Random Variable f(x) - Probability density function for a continuous random variable X Properties f(x) 0 f(x)dx = 1 P[a X b] = b a f(x)dx Important Notes P[a X a] = a a f(x)dx = 0 This implies that P[X = a] = 0 P[a X b] = P[a < X < b] An Introduction to Basic Statistics and Probability p. 23/4

Summarizations Summarizations for continuous prob. distributions Mean or Expected Value of X µ = EX = xf(x)dx Variance σ 2 = V arx = = (x EX) 2 f(x)dx x 2 f(x)dx (EX) 2 An Introduction to Basic Statistics and Probability p. 24/4

Example Let X represent the fraction of the population in a certain city who obtain the flu vaccine. { 2x when 0 x 1, f(x) = 0 otherwise. Find P(1/4 X 1/2) P(1/4 X 1/2) = = 1/2 1/4 1/2 1/4 f(x)dx 2xdx = 3/16 An Introduction to Basic Statistics and Probability p. 25/4

Example f(x) = Find P(X 1/2) Find EX Find V arx { 2x when 0 x 1, 0 otherwise. An Introduction to Basic Statistics and Probability p. 26/4

Normal Distribution Most widely used continuous distribution Also known as the Gaussian distribution Symmetric An Introduction to Basic Statistics and Probability p. 27/4

Normal Distribution Probability density function EX = µ V arx = σ 2 f(x) = 1 ] [ σ 2π exp (x µ)2 2σ 2 Notation: X N(µ,σ 2 ) means that X is normally distributed with mean µ and variance σ 2. An Introduction to Basic Statistics and Probability p. 28/4

Standard Normal Distribution A normal distribution with mean 0 and variance 1 is called a standard normal distribution. Standard normal probability density function f(x) = 1 [ x 2] exp 2π 2 Standard normal cumulative probability function Let Z N(0, 1) Φ(z) = P(Z z) Symmetry property Φ( z) = 1 Φ(z) An Introduction to Basic Statistics and Probability p. 29/4

Standardization Standardization of a Normal Random Variable Suppose X N(µ,σ 2 ) and let Z = X µ σ. Then Z N(0, 1). If X N(µ,σ 2 ), what is P(a < X < b)? Form equivalent probability in terms of Z : ( a µ P(a < X < b) = P < Z < b µ ) σ σ Use standard normal tables to compute latter probability. An Introduction to Basic Statistics and Probability p. 30/4

Standardization Example. (Moore pp.65-67) Heights of Women Suppose the distribution of heights of young women are normally distributed with µ = 64 and σ 2 = 2.7 2 What is the probability that a randomly selected young woman will have a height between 60 and 70 inches? ( 60 64 Pr(60 < X < 70) = Pr 2.7 < Z < = Pr( 1.48 < Z < 2.22) = Φ(2.22) Φ( 1.48) =.9868.0694 =.9174 ) 70 64 2.7 An Introduction to Basic Statistics and Probability p. 31/4

Sampling Distribution of X A natural estimator for the population mean µ is the sample mean n X i X = n. i=1 Consider x to be a single realization of a random variable X over all possible samples of size n. The sampling distribution of X is the distribution of values of x over all possible samples of size n that could be selected from the population. An Introduction to Basic Statistics and Probability p. 32/4

Expected Value of X The average of the sample means (x s) when taken over a large number of random samples of size n will approximate µ. Let X 1,,X n be a random sample from some population with mean µ. Then for the sample mean X, E(X) = µ. X is an unbiased estimator of µ. An Introduction to Basic Statistics and Probability p. 33/4

Standard Error of X Let X 1,,X n be a random sample from some population with mean µ. and variance σ 2. The variance of the sample mean X is given by V ar(x) = σ 2 /n. The standard deviation of the sample mean is given by σ/ n. This quantity is called the standard error (of the mean). An Introduction to Basic Statistics and Probability p. 34/4

Standard Error of X The standard error σ/ n is estimated by s/ n. The standard error measures the variability of sample means from repeated samples of size n drawn from the same population. A larger sample provides a more precise estimate X of µ An Introduction to Basic Statistics and Probability p. 35/4

Sampling Distribution of X Let X 1,,X n be a random sample from a population that is normally distributed with mean µ and variance σ 2. Then the sample mean X is normally distributed with mean µ and variance σ 2 /n. That is X N(µ,σ 2 /n). An Introduction to Basic Statistics and Probability p. 36/4

Central Limit Theorem Let X 1,,X n be a random sample from any population with mean µ and variance σ 2. Then the sample mean X is approximately normally distributed with mean µ and variance σ 2 /n. An Introduction to Basic Statistics and Probability p. 37/4

ata Sampled from Uniform Distribution The following is a distribution of X when we take samples of size 1. An Introduction to Basic Statistics and Probability p. 38/4

Example The following is a distribution of X when we take samples of size 10. An Introduction to Basic Statistics and Probability p. 39/4

References Moore, David S., "The Basic Practice of Statistics." Third edition. W.H. Freeman and Company. New York. 2003 Weems, Kimberly. SIBS Presentation, 2005. An Introduction to Basic Statistics and Probability p. 40/4