STAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I)

Similar documents
AP Statistics Solutions to Packet 2

Exercise 1.12 (Pg )

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

First Midterm Exam (MATH1070 Spring 2012)

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Unit 7: Normal Curves

The Normal Distribution

Variables. Exploratory Data Analysis

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Chapter 3. The Normal Distribution

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

AP * Statistics Review. Descriptive Statistics

Exploratory Data Analysis

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Lesson 4 Measures of Central Tendency

Exploratory data analysis (Chapter 2) Fall 2011

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

Descriptive Statistics

MEASURES OF VARIATION

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Descriptive Statistics

8. THE NORMAL DISTRIBUTION

Section 1.3 Exercises (Solutions)

3.4 The Normal Distribution

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Diagrams and Graphs of Statistical Data

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

MTH 140 Statistics Videos

Lecture 1: Review and Exploratory Data Analysis (EDA)

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

z-scores AND THE NORMAL CURVE MODEL

Ch. 3.1 # 3, 4, 7, 30, 31, 32

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Mind on Statistics. Chapter 2

Probability Distributions

Exploratory Data Analysis. Psychology 3256

Interpreting Data in Normal Distributions

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Data Exploration Data Visualization

6.2 Normal distribution. Standard Normal Distribution:

Statistics. Measurement. Scales of Measurement 7/18/2012

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Statistical Data analysis With Excel For HSMG.632 students

6.4 Normal Distribution

4.3 Areas under a Normal Curve

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Chapter 1: Exploring Data

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

Descriptive Statistics and Measurement Scales

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Geostatistics Exploratory Analysis

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

a. mean b. interquartile range c. range d. median

Statistics 151 Practice Midterm 1 Mike Kowalski

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Lecture 2. Summarizing the Sample

3.2 Measures of Spread

THE BINOMIAL DISTRIBUTION & PROBABILITY

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Module 4: Data Exploration

3: Summary Statistics

The Normal distribution

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

Probability. Distribution. Outline

Algebra I Vocabulary Cards

Chapter 4. Probability and Probability Distributions

Means, standard deviations and. and standard errors

6 3 The Standard Normal Distribution

EXPLORING SPATIAL PATTERNS IN YOUR DATA

Northumberland Knowledge

Box-and-Whisker Plots

Summarizing and Displaying Categorical Data

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014

The Normal Distribution

How To Write A Data Analysis

An Introduction to Basic Statistics and Probability

EMPIRICAL FREQUENCY DISTRIBUTION

Statistics Revision Sheet Question 6 of Paper 2

2.6. Probability. In general the probability density of a random variable satisfies two conditions:

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

DATA INTERPRETATION AND STATISTICS

Chapter 2 Data Exploration

Dongfeng Li. Autumn 2010

+ Chapter 1 Exploring Data

What Does the Normal Distribution Sound Like?

Week 1. Exploratory Data Analysis

CALCULATIONS & STATISTICS

Basics of Statistics

How To Test For Significance On A Data Set

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Lesson 7 Z-Scores and Probability

Normal distribution. ) 2 /2σ. 2π σ

Transcription:

The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 5: Density Curves and Normal Distributions (I) 9/12/06 Lecture 5 1

A problem about Standard Deviation A variable has 5 values. The mean is 125 and the standard deviation is 25. If a sixth number of 125 is added in, what is the new standard deviation? 9/12/06 Lecture 5 2

A Tool Kit for Describing Dist. of Single Quan. Var. Plot your data: Stemplot, histogram, time plot, boxplot Look for overall pattern (shapes) + striking deviations (outliers) Calculate appropriate numerical summaries about center and spread Mean, median, mode Range, percentiles, quartiles, standard deviation Five-number summary, boxplot IQR and outliers Make things easier with a smooth curve 9/12/06 Lecture 5 3

9/12/06 Lecture 5 4

Density Curve To calculate probabilities, we define a probability density function f(x). The curve that plots f(x) is called the corresponding density curve. f(x) satisfies: f(x) >= 0; The total area under the curve representing f(x) equals 1. 9/12/06 Lecture 5 5

Density Curves Describe the overall shape of distributions Idealized mathematical models for distributions Show patterns that are accurate enough for practical purposes Always on or above the horizontal axis The total area under the curve is exactly 1 Areas under the curve represent relative frequencies of observations 9/12/06 Lecture 5 6

9/12/06 Lecture 5 7

Histograms vs. Density Curves Histograms show either frequencies (counts) or relative frequencies (proportions) in each class interval. Density curves show the proportion of observations in any region by areas under the curve. You can think of density curves as refined histograms when there are huge amount of data. 9/12/06 Lecture 5 8

Histogram vs. Density Curve 9/12/06 Lecture 5 9

Center of a Density Curve The mode of a distribution is the point where the curve is the highest. Highest Point. The median is the point where half of the area under the curve lies on the left and the other half on the right. Equal Areas Point. The mean µ is the point at which the curve would balance if made out of solid material. Balance Point. 9/12/06 Lecture 5 10

Mean of a Density Curve 9/12/06 Lecture 5 11

Spread of a Density Curve Quartiles can be found by dividing the area under the curve into four equal parts ¼ of the area is to the left of the 1st quartile, Q 1 ¾ of the area is to the left of the 3 rd quartile, Q 3 The standard deviation of a density curve is denoted by σ. Not easy to calculate 9/12/06 Lecture 5 12

9/12/06 Lecture 5 13

Normal Distribution Pictorially speaking, a Normal Distribution is a distribution that has a symmetric, unimodal and bell-shaped density curve. The mean and standard deviation completely specify the curve. The mean, median, and mode are the same. 9/12/06 Lecture 5 14

The height of a normal density curve at any point x is given by f 1 1 x ( 2 σ ( ) x = σ σ 2 π µ σ e is the mean is the standard deviation µ ) 2 µ 9/12/06 Lecture 5 15

Change of Curvature 9/12/06 Lecture 5 16

Example: The normal distribution is the most important distribution in Statistics. Typical normal curves with different sigma (standard deviation) values are shown below. 9/12/06 Lecture 5 17

Examples with approximate Normal distributions Height Weight IQ scores Standardized test scores Body temperature Repeated measurement of same quantity 9/12/06 Lecture 5 18

The 68-95-99.7 Rule µ σ 2σ 3σ 9/12/06 Lecture 5 19

The 68-95-99.7 Rule 9/12/06 Lecture 5 20

Example: Young Women s Height The heights of young women are approximately normal with mean = 64.5 inches and std.dev. = 2.5 inches. 9/12/06 Lecture 5 21

Example: Young Women s Height % of young women between 62 and 67? % of young women lower than 62 or taller than 67? % between 59.5 and 62? % taller than 68.25? 9/12/06 Lecture 5 22

Standardizing and z-scores an observation x comes from a distribution with mean µ and standard deviation σ The standardized value of x is defined as which is also called a z-score. z = σ A z-score indicates how many standard deviations the original observation is away from the mean, and in which direction. Mean and S.D. of the distribution of z? x µ, 9/12/06 Lecture 5 23

Example: Young Women s Height The heights of young women are approximately normal with mean = 64.5 inches and std.dev. = 2.5 inches. In our class, there is a female student who is 68.25 inches tall, what is her z-score? 9/12/06 Lecture 5 24

Effects of Standardizing Standardizing is a linear transformation. What are a and b? Effects on shape, center and spread. The standardized values for any distribution always have mean 0 and standard deviation 1. Linear transformation: normal into normal. 9/12/06 Lecture 5 25

The standard normal distribution The standard normal distribution is the normal dist. with mean 0 and standard deviation 1, denoted as N(0,1). N(0,1) can be treated as a benchmark. Any normal distribution can be related to N(0,1) by a linear transformation. Z: N(0,1) What is the distribution for X=a+bZ? 9/12/06 Lecture 5 26

Take Home Message Density curve Center, spread Normal distributions and normal curves The 68-95-99.7 rule for normal distributions Standardizing observations The standard normal distribution 9/12/06 Lecture 5 27