Random variables, probability distributions, expected value, variance, binomial distribution, Gaussian distribution (normal distribution)

Similar documents
Math 151. Rumbos Spring Solutions to Assignment #22

An Introduction to Basic Statistics and Probability

Lecture 8. Confidence intervals and the central limit theorem

4. Continuous Random Variables, the Pareto and Normal Distributions

Lecture 6: Discrete & Continuous Probability and Random Variables

You flip a fair coin four times, what is the probability that you obtain three heads.

The Binomial Distribution

Joint Exam 1/P Sample Exam 1

Math 431 An Introduction to Probability. Final Exam Solutions

2. Discrete random variables

Chapter 5. Random variables

Unit 4 The Bernoulli and Binomial Distributions

Probability Distribution for Discrete Random Variables

University of California, Los Angeles Department of Statistics. Random variables

Random variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Binomial Probability Distribution

Section 5.1 Continuous Random Variables: Introduction

Section 5-3 Binomial Probability Distributions

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 18. A Brief Introduction to Continuous Probability

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

Normal distribution. ) 2 /2σ. 2π σ

Binomial random variables

Some special discrete probability distributions

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

Math Quizzes Winter 2009

2 Binomial, Poisson, Normal Distribution

Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution

The normal approximation to the binomial

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

ECE302 Spring 2006 HW3 Solutions February 2,

ECE 316 Probability Theory and Random Processes

Chapter 4 Lecture Notes

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Chapter 4. Probability and Probability Distributions

Econ 132 C. Health Insurance: U.S., Risk Pooling, Risk Aversion, Moral Hazard, Rand Study 7

The normal approximation to the binomial

Ch5: Discrete Probability Distributions Section 5-1: Probability Distribution

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

6 PROBABILITY GENERATING FUNCTIONS

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

Homework 4 - KEY. Jeff Brenion. June 16, Note: Many problems can be solved in more than one way; we present only a single solution here.

Mathematical Expectation

The Binomial Probability Distribution

Important Probability Distributions OPRE 6301

Feb 7 Homework Solutions Math 151, Winter Chapter 4 Problems (pages )

Hypothesis Testing for Beginners

Business Statistics 41000: Probability 1

Probability: Terminology and Examples Class 2, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 13. Random Variables: Distribution and Expectation

Math 461 Fall 2006 Test 2 Solutions

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Practice problems for Homework 11 - Point Estimation

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Statistics 100A Homework 4 Solutions

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Section 5 Part 2. Probability Distributions for Discrete Random Variables

Binomial Sampling and the Binomial Distribution

Bayes and Naïve Bayes. cs534-machine Learning

Solution. Solution. (a) Sum of probabilities = 1 (Verify) (b) (see graph) Chapter 4 (Sections ) Homework Solutions. Section 4.

Example. A casino offers the following bets (the fairest bets in the casino!) 1 You get $0 (i.e., you can walk away)

Math/Stats 342: Solutions to Homework

8. THE NORMAL DISTRIBUTION

Lecture 7: Continuous Random Variables

MAS108 Probability I

Chapter 6: Point Estimation. Fall Probability & Statistics

Sample Questions for Mastery #5

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

10.2 Series and Convergence

Chapter 5: Discrete Probability Distributions

STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science

Stat 20: Intro to Probability and Statistics

Chapter 5. Discrete Probability Distributions

Practice Problems #4

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

Discrete Math in Computer Science Homework 7 Solutions (Max Points: 80)

Aggregate Loss Models

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Normal Distribution as an Approximation to the Binomial Distribution

Covariance and Correlation

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

Notes on Continuous Random Variables

Lecture 5 : The Poisson Distribution

Review of Random Variables

Binomial random variables (Review)

WEEK #23: Statistics for Spread; Binomial Distribution

STAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE

Stats on the TI 83 and TI 84 Calculator

Sample Term Test 2A. 1. A variable X has a distribution which is described by the density curve shown below:

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Week 4: Standard Error and Confidence Intervals

Lecture 8: More Continuous Random Variables

ECON1003: Analysis of Economic Data Fall 2003 Answers to Quiz #2 11:40a.m. 12:25p.m. (45 minutes) Tuesday, October 28, 2003

Mathematics of Risk. Introduction. Case Study #1 Personal Auto Insurance Pricing. Mathematical Concepts Illustrated. Background

Transcription:

Random variables, probability distributions, expected value, variance, binomial distribution, Gaussian distribution (normal distribution) Relevant Readings: Table 5.2, Section 5.3 up through 5.3.4 in Mitchell CS495 - Machine Learning, Fall 2009

Prob and stats Probability, statistics, and sampling theory aren t only just useful in ML. These are generally useful in computer science (such is in the study of randomized algorithms)

Prob and stats Probability, statistics, and sampling theory aren t only just useful in ML. These are generally useful in computer science (such is in the study of randomized algorithms)

Some basics A random variable is a numerical outcome of some random experiment

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random()

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die A random variable s behavior is governed its probability distribution

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die A random variable s behavior is governed its probability distribution The probability distribution is a specification for the likelihood of the random variable taking on different values

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die A random variable s behavior is governed its probability distribution The probability distribution is a specification for the likelihood of the random variable taking on different values Example: For rolling a die, the probability distribution is uniform, meaning that each possibility (1, 2, 3, 4, 5, 6) has equal probability of coming up (1/6).

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die A random variable s behavior is governed its probability distribution The probability distribution is a specification for the likelihood of the random variable taking on different values Example: For rolling a die, the probability distribution is uniform, meaning that each possibility (1, 2, 3, 4, 5, 6) has equal probability of coming up (1/6). Random variables can be discrete or continuous

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die A random variable s behavior is governed its probability distribution The probability distribution is a specification for the likelihood of the random variable taking on different values Example: For rolling a die, the probability distribution is uniform, meaning that each possibility (1, 2, 3, 4, 5, 6) has equal probability of coming up (1/6). Random variables can be discrete or continuous The probability that random variable X takes on the value x is denoted Pr(X = x)

Some basics A random variable is a numerical outcome of some random experiment It can be thought of as an unknown value that changes every time it is observed Similar to Math.random() Example: The result of rolling a die A random variable s behavior is governed its probability distribution The probability distribution is a specification for the likelihood of the random variable taking on different values Example: For rolling a die, the probability distribution is uniform, meaning that each possibility (1, 2, 3, 4, 5, 6) has equal probability of coming up (1/6). Random variables can be discrete or continuous The probability that random variable X takes on the value x is denoted Pr(X = x)

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2]

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)?

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero.

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)?

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero.

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)?

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8 So then for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8 So then for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly Think of it this way: what s the probability of throwing a dart and hitting exactly the right spot on the wall?

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8 So then for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly Think of it this way: what s the probability of throwing a dart and hitting exactly the right spot on the wall? Zero.

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8 So then for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly Think of it this way: what s the probability of throwing a dart and hitting exactly the right spot on the wall? Zero. But after it s thrown, it actually hit some exact spot.

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8 So then for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly Think of it this way: what s the probability of throwing a dart and hitting exactly the right spot on the wall? Zero. But after it s thrown, it actually hit some exact spot. Therefore zero-probability doesn t necessarily mean impossible

Continuous probability distributions Consider a uniform continuous random variable X over the range [0,2] What is Pr(X = 1/2)? Zero. What is Pr(X = 1/4)? Zero. What is Pr(1/4 X 1/2)? 1/8 So then for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly Think of it this way: what s the probability of throwing a dart and hitting exactly the right spot on the wall? Zero. But after it s thrown, it actually hit some exact spot. Therefore zero-probability doesn t necessarily mean impossible

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n The expected value (aka. mean) of a random variable Y is E[Y ] = i y i Pr(Y = y i ), where the sum is over all possible outcomes

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n The expected value (aka. mean) of a random variable Y is E[Y ] = i y i Pr(Y = y i ), where the sum is over all possible outcomes Just a weighted average (the continuous case would be an integral).

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n The expected value (aka. mean) of a random variable Y is E[Y ] = i y i Pr(Y = y i ), where the sum is over all possible outcomes Just a weighted average (the continuous case would be an integral). The variance of a random variable Y is Var(Y ) = E[(Y E[Y ]) 2 ]

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n The expected value (aka. mean) of a random variable Y is E[Y ] = i y i Pr(Y = y i ), where the sum is over all possible outcomes Just a weighted average (the continuous case would be an integral). The variance of a random variable Y is Var(Y ) = E[(Y E[Y ]) 2 ] This is a measure of how much Y deviates from the mean

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n The expected value (aka. mean) of a random variable Y is E[Y ] = i y i Pr(Y = y i ), where the sum is over all possible outcomes Just a weighted average (the continuous case would be an integral). The variance of a random variable Y is Var(Y ) = E[(Y E[Y ]) 2 ] This is a measure of how much Y deviates from the mean The standard deviation of a random variable Y is simply Var(Y )

Some basic notions Summation notation: n i=0 a i means a 0 + a 1 + + a n The expected value (aka. mean) of a random variable Y is E[Y ] = i y i Pr(Y = y i ), where the sum is over all possible outcomes Just a weighted average (the continuous case would be an integral). The variance of a random variable Y is Var(Y ) = E[(Y E[Y ]) 2 ] This is a measure of how much Y deviates from the mean The standard deviation of a random variable Y is simply Var(Y )

Discrete uniform distribution Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3,..., m}

Discrete uniform distribution Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3,..., m} What is E[X ]?

Discrete uniform distribution Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3,..., m} What is E[X ]? (m + 1)/2

Discrete uniform distribution Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3,..., m} What is E[X ]? (m + 1)/2 What is Var[X ]?

Discrete uniform distribution Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3,..., m} What is E[X ]? (m + 1)/2 What is Var[X ]? (m 2 1)/12

Discrete uniform distribution Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3,..., m} What is E[X ]? (m + 1)/2 What is Var[X ]? (m 2 1)/12

Continuous uniform distribution Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b]

Continuous uniform distribution Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b] What is E[X ]?

Continuous uniform distribution Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b] What is E[X ]? (a + b)/2

Continuous uniform distribution Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b] What is E[X ]? (a + b)/2 What is Var[X ]?

Continuous uniform distribution Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b] What is E[X ]? (a + b)/2 What is Var[X ]? (b a) 2 /12

Continuous uniform distribution Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b] What is E[X ]? (a + b)/2 What is Var[X ]? (b a) 2 /12

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads?

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution?

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete Suppose X is a random variable with binomial probability distribution

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete Suppose X is a random variable with binomial probability distribution What is E[X ]?

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete Suppose X is a random variable with binomial probability distribution What is E[X ]? np

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete Suppose X is a random variable with binomial probability distribution What is E[X ]? np What is Var[X ]?

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete Suppose X is a random variable with binomial probability distribution What is E[X ]? np What is Var[X ]? np(1 p)

Binomial distribution Imagine an unfair coin n times that has probability p of landing on heads A binomial distribution is: how many times did we get heads? Is this a discrete or continuous distribution? Discrete Suppose X is a random variable with binomial probability distribution What is E[X ]? np What is Var[X ]? np(1 p)

Gaussian (normal) distribution If you add lots of random variables together, the sum is in itself a random variable whose probability distribution tends to look like a Gaussian.

Gaussian (normal) distribution If you add lots of random variables together, the sum is in itself a random variable whose probability distribution tends to look like a Gaussian. Thus, Gaussian distributions show up a lot in nature

Gaussian (normal) distribution If you add lots of random variables together, the sum is in itself a random variable whose probability distribution tends to look like a Gaussian. Thus, Gaussian distributions show up a lot in nature It s a bell shaped curve

Gaussian (normal) distribution If you add lots of random variables together, the sum is in itself a random variable whose probability distribution tends to look like a Gaussian. Thus, Gaussian distributions show up a lot in nature It s a bell shaped curve The probability distribution for a Gaussian with mean µ and variance σ 2 is defined as:

Gaussian (normal) distribution If you add lots of random variables together, the sum is in itself a random variable whose probability distribution tends to look like a Gaussian. Thus, Gaussian distributions show up a lot in nature It s a bell shaped curve The probability distribution for a Gaussian with mean µ and variance σ 2 is defined as: e (x µ)2 /(2σ 2) /(σ 2π)

Gaussian (normal) distribution If you add lots of random variables together, the sum is in itself a random variable whose probability distribution tends to look like a Gaussian. Thus, Gaussian distributions show up a lot in nature It s a bell shaped curve The probability distribution for a Gaussian with mean µ and variance σ 2 is defined as: e (x µ)2 /(2σ 2) /(σ 2π)

Application Suppose we are programming Naive Bayes and want to handle a continuous variable

Application Suppose we are programming Naive Bayes and want to handle a continuous variable One way we could do this: measure the mean and variance of the relevant examples, model them with a probability distribution (such as a Gaussian), and use that distribution to help determine whether yes or no is more likely for the given instance

Application Suppose we are programming Naive Bayes and want to handle a continuous variable One way we could do this: measure the mean and variance of the relevant examples, model them with a probability distribution (such as a Gaussian), and use that distribution to help determine whether yes or no is more likely for the given instance