Random Variable: A function that assigns numerical values to all the outcomes in the sample space.

Similar documents
Chapter 5. Random variables

3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

WHERE DOES THE 10% CONDITION COME FROM?

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

2. Discrete random variables

Chapter 4 Lecture Notes

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Chapter 5. Discrete Probability Distributions

Chapter 4. Probability Distributions

Section 6.1 Discrete Random variables Probability Distribution

Random variables, probability distributions, binomial random variable

Probability Distribution for Discrete Random Variables

Lecture 6: Discrete & Continuous Probability and Random Variables

Stat 515 Midterm Examination II April 6, 2010 (9:30 a.m. - 10:45 a.m.)

You flip a fair coin four times, what is the probability that you obtain three heads.

Normal distribution. ) 2 /2σ. 2π σ

2 Binomial, Poisson, Normal Distribution

Random variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.

Notes on Continuous Random Variables

Math 431 An Introduction to Probability. Final Exam Solutions

STAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

ST 371 (IV): Discrete Random Variables

6.2. Discrete Probability Distributions

Section 5 Part 2. Probability Distributions for Discrete Random Variables

Chapter 5 Discrete Probability Distribution. Learning objectives

Practice Problems #4

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

The Binomial Probability Distribution

Binomial random variables (Review)

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Section 5-3 Binomial Probability Distributions

Binomial Probability Distribution

Homework 4 - KEY. Jeff Brenion. June 16, Note: Many problems can be solved in more than one way; we present only a single solution here.

An Introduction to Basic Statistics and Probability

Normal Distribution as an Approximation to the Binomial Distribution

5. Continuous Random Variables

Math 461 Fall 2006 Test 2 Solutions

Probability Calculator

CHAPTER 7 SECTION 5: RANDOM VARIABLES AND DISCRETE PROBABILITY DISTRIBUTIONS

Sample Questions for Mastery #5

STAT x 0 < x < 1

Joint Exam 1/P Sample Exam 1

Some special discrete probability distributions

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Random Variables. Chapter 2. Random Variables 1

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

The normal approximation to the binomial

University of California, Los Angeles Department of Statistics. Random variables

ECE302 Spring 2006 HW3 Solutions February 2,

Lecture 5 : The Poisson Distribution

Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution

The normal approximation to the binomial

The Binomial Distribution

0 x = 0.30 x = 1.10 x = 3.05 x = 4.15 x = x = 12. f(x) =

6 PROBABILITY GENERATING FUNCTIONS

BINOMIAL DISTRIBUTION

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Tenth Problem Assignment

Chapter 4. Probability and Probability Distributions

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

ECON1003: Analysis of Economic Data Fall 2003 Answers to Quiz #2 11:40a.m. 12:25p.m. (45 minutes) Tuesday, October 28, 2003

Important Probability Distributions OPRE 6301

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Binomial random variables

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Applied Reliability Applied Reliability

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

MAS108 Probability I

6.041/6.431 Spring 2008 Quiz 2 Wednesday, April 16, 7:30-9:30 PM. SOLUTIONS

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

ECE302 Spring 2006 HW4 Solutions February 6,

Chapter 5: Normal Probability Distributions - Solutions

4. Continuous Random Variables, the Pareto and Normal Distributions

Statistics 100A Homework 7 Solutions

Stat 704 Data Analysis I Probability Review

ECE302 Spring 2006 HW5 Solutions February 21,

SCHOOL OF ENGINEERING & BUILT ENVIRONMENT. Mathematics

Statistics 100A Homework 4 Solutions

The Binomial Distribution. Summer 2003

Probability: Terminology and Examples Class 2, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

International Examinations. Advanced Level Mathematics Statistics 2 Steve Dobbs and Jane Miller

Example. A casino offers the following bets (the fairest bets in the casino!) 1 You get $0 (i.e., you can walk away)

Unit 4 The Bernoulli and Binomial Distributions

Statistics 100A Homework 4 Solutions

Chapter 9 Monté Carlo Simulation

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Exploratory Data Analysis

Chapter 5: Discrete Probability Distributions

Characteristics of Binomial Distributions

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 5 Discrete Probability Distributions

Section 5.1 Continuous Random Variables: Introduction

Chapter 5 - Practice Problems 1

Math/Stats 342: Solutions to Homework

Transcription:

STAT 509 Section 3.2: Discrete Random Variables Random Variable: A function that assigns numerical values to all the outcomes in the sample space. Notation: Capital letters (like Y) denote a random variable. Lowercase letters (like y) denote possible values of the random variable. Discrete Random Variable : A numerical r.v. that takes on a countable number of values (there are gaps in the range of possible values). Examples: 1. Number of phone calls received in a day by a company 2. Number of heads in 5 tosses of a coin Continuous Random Variable : A numerical r.v. that takes on an uncountable number of values (possible values lie in an unbroken interval). Examples: 1. Length of nails produced at a factory 2. Time in 100-meter dash for runners Other examples?

The probability distribution of a random variable is a graph, table, or formula which tells what values the r.v. can take and the probability that it takes each of those values. Example 1: A design firm submits bids for four projects. Let Y = number of successful bids. y 0 1 2 3 4 P(y) 0.06 0.35 0.43 0.15 0.01 Example 2: Toss 2 coins. The r.v. Y = number of heads showing. y 0 1 2 P(y) ¼ ½ ¼ Graph for Example 1: For any probability distribution: (1) P(y) is between 0 and 1 for any value of y. (2) P ( y) = 1. That is, the sum of the probabilities for y all possible y values is 1.

Example 3: P(y) = y / 10 for y = 1, 2, 3, 4. Valid Probability Distribution? Property 1? Property 2? Cumulative Distribution Function: If Y is a random variable, then the cumulative distribution function (cdf) is denoted by F(y). F(y) = P(Y < y) cdf for r.v. in Example 1? Graph of cdf:

Expected Value of a Discrete Random Variable The expected value of a r.v. is its mean (i.e., the mean of its probability distribution). For a discrete r.v. Y, the expected value of Y, denoted or E(Y), is: = E(Y) = yp ( y) y where represents a summation over all values of Y. y Recall Example 3: = Here, the expected value of y is Recall Example 1: What is the expected number of successful design bids? E(Y) = So on average, a firm in this situation would win bids. The expected value does not have to be a possible value of the r.v. --- it s an average value.

Variance of a Discrete Random Variable The variance 2 is the expected value of the squared deviation from the mean ; that is, 2 = E[(Y ) 2 ]. Shortcut formula: 2 = [ y where y 2 = (y ) 2 P(y) 2 y P( y) ] 2 represents a summation over all values of y. Example 3: Recall = 3 for this r.v. y 2 P(y) = Thus 2 = Note that the standard deviation of the r.v. is the square root of 2. So for Example 3, = Example 1: Recall = 1.7 for this r.v. y 2 P(y) = Thus 2 = and =

Some Rules for Expected Values and Variances If c is a constant number, E(c) = c E(cY) = ce(y) If X and Y are two random variables, E(X + Y) = E(X) + E(Y) And: var(c) = 0 var(cy) = c 2 var(y) If X and Y are two independent random variables, var(x + Y) = var(x) + var(y) The Binomial Random Variable Many experiments have responses with 2 possibilities (Yes/No, Pass/Fail). Certain experiments called binomial experiments yield a type of r.v. called a binomial random variable. Characteristics of a binomial experiment: (1) The experiment consists of a number (denoted n) of identical trials. (2) There are only two possible outcomes for each trial denoted Success (S) or Failure (F) (3) The probability of success (denoted p) is the same for each trial. (Probability of failure = q = 1 p.) (4) The trials are independent.

Then the binomial r.v. (denoted Y) is the number of successes in the n trials. Example 1: A factory makes 25 bricks in an hour. Suppose typically 5% of all bricks produced are nonconforming. Let Y = total number of nonconforming bricks. Then Y is Example 2: A student randomly guesses answers on a multiple choice test with 3 questions, each with 4 possible answers. Y = number of correct answers. Then Y is What is the probability distribution for Y in this case? Outcome y P(outcome)

Probability Distribution of Y y P(y) General Formula: (Binomial Probability Distribution) (n = number of trials, p = probability of success.) The probability there will be exactly y successes is: p(y) = n y where n = n choose y y = n! y! (n y)! p y q n y (y = 0, 1, 2,, n) Here, 0! = 1, 1! = 1, 2! = 2 1 = 2, 3! = 3 2 1 = 6, etc. Example 1(a): Out of 25 bricks produced, what is the probability of exactly 2 nonconforming bricks?

Example 1(b): Out of 25 bricks produced, what is the probability of at least 1 nonconforming brick? Example 1(c): Out of 25 bricks produced, what is the probability of at least 2 nonconforming bricks? The mean (expected value) of a binomial r.v. is = np. The variance of a binomial r.v. is 2 = npq. The standard deviation of a binomial r.v. is = Example: What is the mean number of nonconforming bricks that we would expect out of the 25 produced? = np = What is the standard deviation of this binomial r.v.?

Finding Binomial Probabilities using R Hand calculations of binomial probabilities can be tedious at times. Example 1(c): Out of 25 bricks produced, what is the probability of at least 6 nonconforming bricks? > sum(dbinom(6:25, size = 25, prob = 0.05)) [1] 0.001212961 Example 4: Historically, 10% of homes in Florida have radon levels higher than that recommended by EPA. In a random sample of 20 homes, find the probability that exactly 3 have radon levels higher than the EPA recommendation. In a random sample of 20 homes, find the probability that more than 4 have radon levels higher than the EPA recommendation. In a random sample of 20 homes, find the probability that between 2 and 5 have radon levels higher than the EPA recommendation.

Sampling without replacement: The Hypergeometric Distribution If we take a sample (without replacement) of size n from a finite collection of N objects (r of which are successes ), then the number of successes Y in our sample is not binomial. Why not? It follows a hypergeometric distribution. The probability function of Y is: P(y) = r N r y n y N n for y = 0, 1, 2,, n The mean (expected value) of Y is = E(Y) = Example: Suppose a factory makes 100 filters per day, 5 of which are defective. If we randomly sample (without replacement) 15 of these filters, what is the expected number of defective filters in the sample? n r N What is the probability that exactly 1 of the sampled filters will be defective?

Poisson Random Variables The Poisson distribution can be used to model the number of events occurring in a continuous time or space. Number of telephone calls received per hour Number of claims received per day by an insurance company Number of accidents per month at an intersection Number of breaks per 1000 meters of copper wire. The mean number of events (per 1-unit interval) for a Poisson distribution is denoted. Which values can a Poisson r.v. take? Probability distribution for Y (if Y is Poisson with mean ) P(y) = y e y! (for y = 0, 1, 2, ) Mean of Poisson probability distribution: Variance of Poisson probability distribution:

If we record the number of occurrences Y in t units of time or space, then the probability distribution for Y is: P(y) = ( t) y e t y! (for y = 0, 1, 2, ) Example 1(a): Historically a process has averaged 2.6 breaks in the insulation per 1000 meters of wire. What is the probability that 1000 meters of wire will have 1 or fewer breaks in insulation? Example 1(b): What is the probability that 3000 meters of wire will have 1 or fewer breaks in insulation? Now, E(Y) = t =

Finding Poisson Probabilities using R Hand calculations of Poisson probabilities can be tedious at times. Example 1(c): What is the probability that 1000 meters of wire will have between 3 and 7 breaks in insulation? > sum(dpois(3:7, lambda = 2.6)) [1] 0.4762367 Example 1(d): What is the probability that 2000 meters of wire will have at least 4 breaks in insulation? 1 - sum(dpois(0:3, lambda = 2.6 * 2)) Conditions for a Poisson Process 1) Areas of inspection are independent of one another. 2) The probability of the event occurring at any particular point in space or time is negligible. 3) The mean remains constant over all areas/intervals of inspection. Example 2: Suppose we average 5 radioactive particles passing a counter in 1 millisecond. What is the probability that exactly 10 particles will pass in the next three milliseconds? 10 or fewer particles?