Discrete Probability Distributions Continuous Probability Distributions

Similar documents
MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Numerical integration of a function known only through data points

Continuous Random Variables

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

AP Statistics Solutions to Packet 2

Normal distribution. ) 2 /2σ. 2π σ

Probability. Distribution. Outline

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Probability Distributions

Statistics Revision Sheet Question 6 of Paper 2

What Does the Normal Distribution Sound Like?

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Chapter 3 RANDOM VARIATE GENERATION

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

3.4 The Normal Distribution

TEACHER NOTES MATH NSPIRED

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Distribution is a χ 2 value on the χ 2 axis that is the vertical boundary separating the area in one tail of the graph from the remaining area.

AMS 5 CHANCE VARIABILITY

The Normal Distribution

6.4 Normal Distribution

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

4. Continuous Random Variables, the Pareto and Normal Distributions

Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab

WEEK #22: PDFs and CDFs, Measures of Center and Spread

Graphs. Exploratory data analysis. Graphs. Standard forms. A graph is a suitable way of representing data if:

An Introduction to Basic Statistics and Probability

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Descriptive Statistics and Measurement Scales

Bar Charts, Histograms, Line Graphs & Pie Charts

Chapter 4 - Lecture 1 Probability Density Functions and Cumul. Distribution Functions

The Normal Distribution

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Stats on the TI 83 and TI 84 Calculator

CALCULATING THE AREA OF A FLOWER BED AND CALCULATING NUMBER OF PLANTS NEEDED

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

6 3 The Standard Normal Distribution

x 2 + y 2 = 1 y 1 = x 2 + 2x y = x 2 + 2x + 1

Unit 7: Normal Curves

Data Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

MBA 611 STATISTICS AND QUANTITATIVE METHODS

Jitter Measurements in Serial Data Signals

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

The Normal Distribution. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Week 4: Standard Error and Confidence Intervals

Descriptive Statistics

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

GeoGebra Statistics and Probability

Notes on Continuous Random Variables

6.1. The Exponential Function. Introduction. Prerequisites. Learning Outcomes. Learning Style

Interpreting Data in Normal Distributions

THE BINOMIAL DISTRIBUTION & PROBABILITY

Drawing a histogram using Excel

Introduction to MATLAB IAP 2008

Example 1. so the Binomial Distrubtion can be considered normal

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Fairfield Public Schools

MATHEMATICS FOR ENGINEERING INTEGRATION TUTORIAL 3 - NUMERICAL INTEGRATION METHODS

MA107 Precalculus Algebra Exam 2 Review Solutions

Curve Fitting, Loglog Plots, and Semilog Plots 1

Math Quizzes Winter 2009

Graphical Integration Exercises Part Four: Reverse Graphical Integration

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 4 (Calculator) Monday 5 March 2012 Afternoon Time: 1 hour 45 minutes

Scatter Plots with Error Bars

DATA INTERPRETATION AND STATISTICS

Variables. Exploratory Data Analysis

Paper Reference. Ruler graduated in centimetres and millimetres, protractor, compasses, pen, HB pencil, eraser, calculator. Tracing paper may be used.

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS

R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol

Probability Distributions

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Triangular Distributions

Exploratory Data Analysis

Probability, Mean and Median

Lecture 6: Discrete & Continuous Probability and Random Variables

Mathematical Conventions Large Print (18 point) Edition

Chapter 2: Frequency Distributions and Graphs

7 CONTINUOUS PROBABILITY DISTRIBUTIONS

CALCULATIONS & STATISTICS

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Introduction to Quantitative Methods

Statistics, Probability and Noise

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Lab 11. Simulations. The Concept

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Lecture 7: Continuous Random Variables

Random Variables. Chapter 2. Random Variables 1

3D Drawing. Single Point Perspective with Diminishing Spaces

MEASURES OF VARIATION

PLOTTING DATA AND INTERPRETING GRAPHS

Power (kw) vs Time (hours)

FREE FALL. Introduction. Reference Young and Freedman, University Physics, 12 th Edition: Chapter 2, section 2.5

Statistics 151 Practice Midterm 1 Mike Kowalski

Lecture 5 : The Poisson Distribution

Transcription:

Section 6 1A: Continuous Probability Distributions Discrete Probability Distributions Chapter 5 developed the mathematics of Discrete Probability Distributions. Discrete Probability Distributions have a fixed number n of values for the discrete random variable x. These probability distributions are represented by a table or a histogram graph with rectangles. The width of each rectangle is one. The height of each rectangle is P(x) and represents the probability for each of the different values of the random variable x. The area of each rectangle is the width of the rectangle (1) times the height of the rectangle P(x). The area of each rectangle is P(x). The total of all the P(x) values is 1 so the area of all the rectangles in a Discrete Probability Histogram must also total 1. A Discrete Probability Distribution Table A Discrete Probability Distribution Histogram x P(x) P(x).40 0.10.30 1.20 2.40.20 3.20.10 4.10 The total of all the P(x) values total 1 The area of each rectangle is P(x). P(x) =.10 +.20 +.40 +.20 +.10 =1 The area of all the rectangles must total 1 Continuous Probability Distributions This chapter will introduce the concept of a Continuous Probability Distribution. These distributions will not be represented by probability tables or histograms. These distributions will be represented by smooth continuous curves. The area in yellow represents the total of all the separate values of P(x) so the total area must be 1. The area under the curve is used to find P(x) The entire area under the curve must total 1 Section 6 1A Page 1 of 8 2012 Eitel

Continuous Probability Distributions Discrete Probability Distributions have a discrete random variable x. The discrete random variable x is represented by the whole numbers on the interval that x is defined for Continuous Probability Distributions have a continuous random variable x. The continuous random variable x takes on every real number on the interval that x is defined for. Since the real numbers are continuous on any interval the random variable x is continuous. Continuous random variables are used to represent data that is measured on a continuous scale: weight, time, length, etc.. Discrete Probability Distributions are represented by a histogram graph of rectangles. The area of each rectangle is P(x) and the total area of all the rectangles is 1. Continuous Probability Distributions are far more difficult to describe. The graph of a Continuous Probability Distributions is an unbroken curve above an x axis. The x axis represents the continuous real values for x. No rectangle can be drawn to represent a single value of x as it would have to be the width of a single point which is zero. The point on the curve above a value for x is not P(x). In fact, for any single value of x, P(x) will be infinitely small with a round off value of 0+. The total area under the curve represents the total of all the infinite number of values of P(x) and that total is 1. One way to try to understand a Continuous Probability Distribution is to start with a Discrete Probability Distribution and begin to transform it into a Continuous Probability Distribution One way to look at the transition from discrete distributions to continuous distributions is to think of the following process. Start with a histogram of n rectangles The width of each rectangle is one, the height of each rectangle is P(x) and the total area of all of the rectangles is 1. Now split each rectangle into 2 equal widths so that the width of each rectangle is now.5. Continue splitting the rectangles over and over. This process will lead to the width of each rectangle becoming very small. As the rectangles become thinner and thinner the graph starts to look like one continuous set of thin lines. The final graph is called a Continuous Probability Distribution. Section 6 1A Page 2 of 8 2012 Eitel

Discrete to Continuous Example A Discrete Probability Distribution The same Probability Distribution with with 5 values for x between 0 and x with 10 values for x between 0 and x The area of all 5 rectangles = 1 The area of all 10 rectangles = 1 The same Probability Distribution with The same Probability Distribution with with 20 values for x between 0 and x with 40 values for x between 0 and x The area of all 20 rectangles = 1 The area of all 40 rectangles = 1 If we continue to split each rectangle over and over we would have an infinite number of very thin rectangles from x = 0 to x = 5. The area of each rectangle will get closer and closer to zero as we add more and more thinner and thinner rectangles to fill the area from x = 0 to x = 5. While the area of each rectangle gets closer and closer to zero the total of all of the rectangles will still remain 1. Section 6 1A Page 3 of 8 2012 Eitel

Changing An infinite Number of Discrete Rectangles into a Continuous Curve If we continue to split each rectangle over and over again we would have an infinite number of very, very, very, very thin rectangles from x = 0 to x = 5. In fact the width of each rectangle would get closer and closer to 0. If we place one point at the top of each of the infinitely many thin rectangles and erase the rectangles themselves we would get a continuous curve above a horizontal x axis from 0 to 5. The points on the x axis stand for the values of the continuous random variable x. The red curve is a continuous curve and the area under the curve represents a Continuous Probability Distribution. Another name for a Continuous Probability Distribution is a Probability Density Function or a PDF. As the number of rectangles between 0 and 5 get larger and larger the width of each rectangle gets smaller and smaller. If we place one point at the top of each of the infinitely many rectangles and erase the rectangles themselves we would get a continuous curve above a horizontal x axis from 0 to 5. Relative Frequency Section 6 1A Page 4 of 8 2012 Eitel x

A Discrete Probability Distribution A Continuous Probability Distribution with 40 values of x from 0 to 5 with continuous x values from 0 to 5 The total area for the 40 rectangles is 1 The total area under the curve is 1 A Discrete Probability Distribution The values on the x axis are discrete (often the whole numbers) with a gap between each number) A table can display each x and P(x) The graph is a histogram of n rectangles The value of each P(x) can be calculated The total area of all n rectangles is 1 A Continuous Probability Distribution The values for x on the x axis are continuous (no gaps between the real numbers) No table can represent the infinite values of x The graph is an unbroken curve above the x axis The value of each P(x) is 0+ The total of the entire area under the curve is 1 The probability for a range of discrete x values from x = a to x = b is found by adding up the P(x) values for each x in the range from a to b P(a) + P(a +1) + P(a +1) +... + P(b) The probability for a range of continuous x values from x = a to x = b is found by finding the area under the curve from a to b by integral calculus. b a f (x) Section 6 1A Page 5 of 8 2012 Eitel

What is a Continuous Probability Distribution good for if P(x) = 0+ for every value of x The area under the curve is used to find the Probability for a range of x values? Since continuous probability functions are defined for an infinite number of points over a continuous interval, the value of P(x) for any single value of x is always 0+. Probabilities for continuous probability functions are measured over intervals, not single points. That is, the area under the continuous curve between two distinct points defines the probability for that interval. P(2.4 < x< 4.2) A Continuous Probability Distribution that represents a relative frequency curve for an infinite data set of values from x = 0 to x = 5 is displayed below. If we select one data bit from the data set the probability that it will be a value between 2.4 and 4.2 is written as P(2.4 < x < 4.2). To find this probability we would need to find the area under the curve between x = 2.4 to x = 4.2 We would use integral calculus 4.2 f(x) 2.4 to find the area that represents P(2.4 < x < 4.2) Section 6 1A Page 6 of 8 2012 Eitel

P( x < 1.8) If we select one data bit from the data set the probability that it will be a value less than 1.8 is written as P( x< 1.8). To find this probability we would need to find the area under the curve to the left of 1.8 We would use integral calculus 1.8 0 f(x) to find the area that represents P(x < 1.8) P( x > 3.4) If we select one data bit from the data set the probability that it will be a value more than 3.4 is written as P( x > 3.4). To find this probability we would need to find the area under the curve to the right of 3.4 We would use integral calculus 5 3.4 to find the area that represents P(x > 3.4) Section 6 1A Page 7 of 8 2012 Eitel f(x)

What is a Continuous Probability Distribution good for if we need to use integral Calculus to find the Probability for a range of x values? That is a great question. While integral calculus is the basis for finding the area under a curve, it is not required that you know this level of math to find the probabilities of certain Continuous Probability Distributions. A small number of different continuous distributions can be used to answer the probability questions that we will be asked in the rest of this course. The area under the curve for a range of x values has been calculated and put into a table for each of these 4 distributions. You will lean to read the area under the curve for a given range of values from these tables and use that information to answer the probability questions that a student will encounter in an general education level statistics course. We will focus on the following 4 Continuous Probability Distributions for the rest of this course. There are many other continuous probability distributions but these 4 form the basis for almost all introductory statistics courses. They allow us to answer most of the important probability questions that a student will encounter. The 4 basic Continuous Probability Distributions for this course Each Distribution has a table to help you find the areas in yellow The Standard Normal Probability Distribution (z) The t Probability Distribution Mean = 0 Standard Deviation = 1 z Mean = 0 Standard Deviation = varies t The Chi Squared Probability Distribution χ 2 The F Probability Distribution 0 χ 2 0 F Section 6 1A Page 8 of 8 2012 Eitel