7.1: Normal Distributions

Similar documents
Key Concept. Density Curve

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

6.4 Normal Distribution

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Section 1.3 Exercises (Solutions)

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

AP Statistics Solutions to Packet 2

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

4. Continuous Random Variables, the Pareto and Normal Distributions

6 3 The Standard Normal Distribution

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

The Normal Distribution

MBA 611 STATISTICS AND QUANTITATIVE METHODS

You flip a fair coin four times, what is the probability that you obtain three heads.

Probability. Distribution. Outline

Exercise 1.12 (Pg )

Simple linear regression

What Does the Normal Distribution Sound Like?

8. THE NORMAL DISTRIBUTION

Descriptive Statistics

Week 3&4: Z tables and the Sampling Distribution of X

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Characteristics of Binomial Distributions

Continuous Random Variables

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Interpreting Data in Normal Distributions

Unit 7: Normal Curves

Review of Random Variables

3.2 LOGARITHMIC FUNCTIONS AND THEIR GRAPHS. Copyright Cengage Learning. All rights reserved.

Coins, Presidents, and Justices: Normal Distributions and z-scores

MEASURES OF VARIATION

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Chapter 3 RANDOM VARIATE GENERATION

Probability Distributions

WEEK #22: PDFs and CDFs, Measures of Center and Spread

Lesson 20. Probability and Cumulative Distribution Functions

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

7. Normal Distributions

Lesson 4 Measures of Central Tendency

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

AP STATISTICS REVIEW (YMS Chapters 1-8)

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

6.2 Normal distribution. Standard Normal Distribution:

CURVE FITTING LEAST SQUARES APPROXIMATION

Summarizing and Displaying Categorical Data

How To Write A Data Analysis

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Chapter 7 Section 7.1: Inference for the Mean of a Population

Equations. #1-10 Solve for the variable. Inequalities. 1. Solve the inequality: Solve the inequality: 4 0

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

3: Summary Statistics

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Lesson 7 Z-Scores and Probability

Review of Fundamental Mathematics

Solutions to Homework 6 Statistics 302 Professor Larget

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Variables. Exploratory Data Analysis

THE BINOMIAL DISTRIBUTION & PROBABILITY

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Gamma Distribution Fitting

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

4.3 Areas under a Normal Curve

Section 5 Part 2. Probability Distributions for Discrete Random Variables

The Standard Normal distribution

Descriptive Statistics and Measurement Scales

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Normal distribution. ) 2 /2σ. 2π σ

Name: Date: Use the following to answer questions 2-3:

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Common Tools for Displaying and Communicating Data for Process Improvement

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Descriptive Statistics

Drawing a histogram using Excel

An Introduction to Basic Statistics and Probability

Regression III: Advanced Methods

FINAL EXAM SECTIONS AND OBJECTIVES FOR COLLEGE ALGEBRA

Gestation Period as a function of Lifespan

z-scores AND THE NORMAL CURVE MODEL

Diagrams and Graphs of Statistical Data

7.1 Graphs of Quadratic Functions in Vertex Form

TEACHER NOTES MATH NSPIRED

Statistics Revision Sheet Question 6 of Paper 2

Frequency Distributions

Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data

Means, standard deviations and. and standard errors

3.4 The Normal Distribution

SPC Response Variable

Geostatistics Exploratory Analysis

Lecture 1: Review and Exploratory Data Analysis (EDA)

Transcription:

In Chapter 7: Chapter 7: Normal Probability Distributions 7.1 Normal Distributions 7.2 Determining Normal Probabilities 7.3 Finding Values That Correspond to Normal Probabilities 7.4 Assessing Departures from Normality October 09 7: Normal Probability Distributions 1 7: Normal Probability Distributions 2 7.1: Normal Distributions This pdf is widely useful the most popular distribution for continuous random variables First described de Moivre in 1733 Elaborated in 1812 by Laplace Describes some natural phenomena and (more importantly) the sampling characteristics of totals and means 7: Normal Probability Distributions 3 Normal Probability Density Function Recall: continuous random variables are described with probability density function (pdfs) curves Normal pdfs are recognized by their typical bell-shape Figure: Age distribution of a pediatric population with overlying Normal pdf 7: Normal Probability Distributions 4 Area Under the Curve Top Figure: The darker bars of the histogram correspond to ages 9 (~40% of distribution) Bottom Figure: shaded area under the curve (AUC) corresponds to ages 9 (~40% of area) f ( x) 2 1 x 2 7: Normal Probability Distributions 5 1 2 e Parameters μ and σ Normal pdfs have two parameters μ - expected value ( mean mu ) σ - standard deviation (sigma) μ controls location σ controls spread 7: Normal Probability Distributions 6

Mean and Standard Deviation of Normal Density μ σ 7: Normal Probability Distributions 7 Standard Deviation σ Points of inflections one σ below and above μ Practice sketching the Normal curves Feel inflection points (where slopes change) Label horizontal axis with X-value standard deviation landmarks 7: Normal Probability Distributions 8 68-95-99.7 Rule for Normal Distributions 68% of the AUC within ±1σ of μ 95% of the AUC within ±2σ of μ 99.7% of the AUC within ±3σ of μ 7: Normal Probability Distributions 10 Example: 68-95-99.7 Rule Wechsler adult intelligence scores: Normally distributed with μ = 100 and σ = 15; X ~ N(100, 15) 68% of scores within μ ± σ = 100 ± 15 = 85 to 115 95% of scores within μ ±2σ = 100 ± (2)(15) = 70 to 130 99.7% of scores in μ ±3σ = 100 ± (3)(15) = 55 to 145 7: Normal Probability Distributions 11 Symmetry in the Tails Because the Normal curve is symmetrical and the total AUC is exactly 1 Example: Male Height Male height: Normal with μ = 70.0 and σ = 2.8 68% within μ ± σ = 70.0 2.8 = 67.2 to 72.8 32% in tails (below 67.2 and above 72.8 ) 16% below 67.2 and 16% above 72.8 (symmetry) 95% we can easily determine the AUC in tails 7: Normal Probability Distributions 12 7: Normal Probability Distributions 13

Reexpression of Non-Normal Variables Many variables are not Normal We can often re-express non-normal variables with mathematical transformations to make them more Normal Example of mathematical transforms: logarithmic exponential square roots Review logarithmic transformations 7: Normal Probability Distributions 14 Logarithms Logarithms are exponents of their base Common log (base 10) Base 10 log function log(10 0 ) = 0 log(10 1 ) = 1 log(10 2 ) = 2 Natural ln (base e) ln(e 0 ) = 0 ln(e 1 ) = 1 7: Normal Probability Distributions 15 Example: Logarithmic Reexpression Prostate Specific Antigen (PSA) used to screen for prostate cancer Not Normally distributed, but its logarithm is Normal ln(psa) ~N(μ = 0.3, σ = 0.8) 95% of ln(psa) within = μ ±2σ = 0.3 ± (2)(0.8) = 1.9 to 1.3 Taking exponents of range In non-diseased populations: 95% in range 0.15 and 3.67 2.5% are greater than 3.67 use 3.67 as screening cutoff 7.2: Determining Normal Probabilities When value do not fall directly on σ landmarks: 1. State the problem 2. Standardize the value (z score) 3. Sketch and shade the curve 4. Use Table B 7: Normal Probability Distributions 16 7: Normal Probability Distributions 17 Step 1: State the Problem What percentage of human gestations are less than 40 weeks? Let X gestation length Research suggests: X ~ N(39, 2) weeks Pr(X 40) =? Step 2: Standardize Standard Normal variable Z a Normal random variable with μ = 0 and σ = 1, i.e., Z ~ N(0,1) Use Table B to determine cumulative probabilities for Z variables 7: Normal Probability Distributions 18 7: Normal Probability Distributions 19

Example: A Z variable of 1.96 has cumulative probability 0.9750. 7: Normal Probability Distributions 20 Step 2 (cont.) Turn value into z score: z x The z-score = no. of σ-units above (positive z) or below (negative z) mean μ For example, the value 40 from 40 39 z 0.5 2 X ~ N(39,2) has 7: Normal Probability Distributions 21 Steps 3 & 4: Sketch & Table B 3. Sketch 4. Use Table B to lookup Pr(Z 0.5) = 0.6915 Probabilities Between Points a represents a lower boundary b represents an upper boundary Pr(a Z b) = Pr(Z b) Pr(Z a) 7: Normal Probability Distributions 22 7: Normal Probability Distributions 23 Between Two Points Pr(-2 Z 0.5) = Pr(Z 0.5) Pr(Z -2).6687 =.6915.0228.6687.6915 See p. 144 in text.0228-2 0.5 0.5-2 7.3 Values Corresponding to Normal Probabilities 1. State the problem 2. Find Z-score corresponding to percentile (Table B) 3. Sketch 4. Unstandardize: x z p 7: Normal Probability Distributions 24 7: Normal Probability Distributions 25

z percentiles Use Table B to look up z score with cumulative probability p Look inside the table for the closest cumulative probability entry Trace the z score row and column labels 7: Normal Probability Distributions 26 e.g., The 97.5 th percentile (cumulative probability).9750 corresponds to z = 1.96 Notation: Let z p represents the z score with cumulative probability p, e.g., z.975 = 1.96 7: Normal Probability Distributions 27 Step 1: State Problem Example: What gestational length is smaller than 97.5% of gestations? Let X represent gestations length Recall: X ~ N(39, 2) A gestation that is smaller than.975 of gestations has a umulative probability of.025 (draw curve) 7: Normal Probability Distributions 28 Example: Step 2 (z percentile) Less than 97.5% (right tail) = greater than 2.5% (left tail). z lookup: z.025 = 1.96 z.00.01.02.03.04.05.06.07.08.09 1.9.0287.0281.0274.0268.0262.0256.0250.0244.0239.0233 7: Normal Probability Distributions 29 3. Sketch 7.4 Assessing Departures from Normality Approximately Normal histogram Same distribution on Normal Q-Q Plot 4. Unstandardize x 39 ( 1.96)(2) 35.08 35 The 2.5 th percentile is 35 weeks 7: Normal Probability Distributions 30 Normal distributions adhere to diagonal line on Q-Q plot 7: Normal Probability Distributions 31

Negative Skew Positive Skew Negative skew shows upward curve on Q-Q plot 7: Normal Probability Distributions 32 Positive skew shows downward curve on Q-Q plot 7: Normal Probability Distributions 33 Same data as prior slide with logarithmic transformation Leptokurtotic Leptokurtotic distribution show S-shape on Q-Q plot The log transform 7: Normal Probability Normalize Distributions the skew 34 7: Normal Probability Distributions 35