Normal Distribution Lecture Notes



Similar documents
MBA 611 STATISTICS AND QUANTITATIVE METHODS

Elementary Statistics

Fairfield Public Schools

Descriptive Statistics

6. Decide which method of data collection you would use to collect data for the study (observational study, experiment, simulation, or survey):

Math 108 Exam 3 Solutions Spring 00

Chapter 4. Probability and Probability Distributions

Sampling Procedures Y520. Strategies for Educational Inquiry. Robert S Michael

AP Statistics Solutions to Packet 2

Week 3&4: Z tables and the Sampling Distribution of X

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Course Syllabus MATH 110 Introduction to Statistics 3 credits

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Foundation of Quantitative Data Analysis

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) B) C) D) 0.

Chapter 1: The Nature of Probability and Statistics

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Interpreting Data in Normal Distributions

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

SAMPLING DISTRIBUTIONS

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

Section 6.2 Definition of Probability

Lesson 9 Hypothesis Testing

Non-Parametric Tests (I)

Statistics Class Level Test Mu Alpha Theta State 2008

Math 210 Lecture Notes: Ten Probability Review Problems

Statistics: Descriptive Statistics & Probability

Chapter 3. The Normal Distribution

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Practice#1(chapter1,2) Name

MTH 140 Statistics Videos

Interpret Box-and-Whisker Plots. Make a box-and-whisker plot

Mind on Statistics. Chapter 2

3. Data Analysis, Statistics, and Probability

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Northumberland Knowledge

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

II. DISTRIBUTIONS distribution normal distribution. standard scores

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

5.1 Identifying the Target Parameter

Chapter 1: Data and Statistics GBS221, Class January 28, 2013 Notes Compiled by Nicolas C. Rouse, Instructor, Phoenix College

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

Lesson 4 Measures of Central Tendency

Lecture 10: Depicting Sampling Distributions of a Sample Proportion

= N(280, )

Lecture 1: Review and Exploratory Data Analysis (EDA)

Statistics 2014 Scoring Guidelines

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Sample Term Test 2A. 1. A variable X has a distribution which is described by the density curve shown below:

4. Continuous Random Variables, the Pareto and Normal Distributions

The Normal Distribution

Box-and-Whisker Plots

The Math. P (x) = 5! = = 120.

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

RUTHERFORD HIGH SCHOOL Rutherford, New Jersey COURSE OUTLINE STATISTICS AND PROBABILITY

Descriptive Statistics

Determine whether the data are qualitative or quantitative. 8) the colors of automobiles on a used car lot Answer: qualitative

This curriculum is part of the Educational Program of Studies of the Rahway Public Schools. ACKNOWLEDGMENTS

Discovering Math: Using and Collecting Data Teacher s Guide

AP Statistics: Syllabus 1

What is Statistic? OPRE 6301

Observation in Research

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

Chapter 2: Descriptive Statistics

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Lecture Notes Module 1

Math 251, Review Questions for Test 3 Rough Answers

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

DATA COLLECTION AND ANALYSIS

What is the purpose of this document? What is in the document? How do I send Feedback?

Answers: a to b to 92.94

AP Statistics Chapters Practice Problems MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Probability Distributions

5) The table below describes the smoking habits of a group of asthma sufferers. two way table ( ( cell cell ) (cell cell) (cell cell) )

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Chapter 7 Probability. Example of a random circumstance. Random Circumstance. What does probability mean?? Goals in this chapter

Contemporary Mathematics Online Math 1030 Sample Exam I Chapters No Time Limit No Scratch Paper Calculator Allowed: Scientific

6.2 Normal distribution. Standard Normal Distribution:

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

The Normal Distribution

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

The Binomial Probability Distribution

Street Address: 1111 Franklin Street Oakland, CA Mailing Address: 1111 Franklin Street Oakland, CA 94607

Midterm Review Problems

Mind on Statistics. Chapter 8

Survey Research. Classifying surveys on the basis of their scope and their focus gives four categories:

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

Comparing Sets of Data Grade Eight

Transcription:

Normal Distribution Lecture Notes Professor Richard Blecksmith richard@math.niu.edu Dept. of Mathematical Sciences Northern Illinois University Math 101 Website: http://math.niu.edu/ richard/math101 Section 2 Website: http://math.niu.edu/ richard/math101/fall06 1. Normal Distribution Curve 2.5% 13.5% µ 2σ µ σ 34% 34% 2.5% 13.5% µ µ + σ µ + 2σ In a normal distribution Fact 1. Center = mean = median Fact 2. The data lies equally distributed on each side of the center. 50% of the data lies to the left of µ and 50% of the data lies to the right of µ. 2. The 68 95 99 Rule Fact 3. 68% of the data lies within 1 standard deviation of the mean 95% of the data lies within 2 standard deviations of the mean 99% of the data lies within 3 standard deviations of the mean 1

2 3. Standardizing Data Given normally distributed data, with mean µ and starndard deviation σ. If x is a data point, we wish to know: how many standard deviations is x to the right (or left) of the center? That is, x = µ + z σ. Solve for z. µ + z σ = x z σ = x µ z = (x µ)/σ 4. The z Rule Original Data Value x Standardized Data Value z = (x µ)/σ A negative value of z represents a data point to the left of the center A positive value of z represents a data point to the right of center 5. Example from Text (page 51) The lifetime of 20,000 flashlight batteries are normally distributed, with a mean of µ = 370 days and a standard deviation of σ = 30 days. 1. What percentage of the batteries are expected to last more than 340 days? Solution: z = (x µ)/σ = (340 370)/30 = 1.00 Look up z = 1 in the chart. (The negative means that this value occurs one standard deviation to the left of the center µ.) The corresponding P value is 34.1%.

3 6. Draw the picture µ 1.00σ 34.1 µ 50 The answer is 34.1 + 50 = 84.1%. 7. Question 2 2. How many batteries can be expected to last less than 325 days? Solution: Work with percentages. z = (x µ)/σ = (325 370)/30 = 1.50 Look up z = 1.5 in the chart. The corresponding P value is 43.3%. 8. Draw the picture µ 1.50σ 43.3 µ Fifty percent of the data lies to the left of the center. Since 43.3% lies between µ 1.50σ and the center µ, the percentage to the left of µ 1.50σ is 50.0 43.3 = 6.7% The final answer is: 6.7 percent of 20,000 =.067 20, 000 = 1340 9. SAT Example In 2001 a total of 1,276,320 college-bound students took the SAT exam.

4 The mean and standard deviation of the test scores was µ = 506 and σ = 111. 68% of the students fall within 1 standard deviation of the mean, that is in the range µ σ = 506 111 = 395 to µ+σ = 506+111 = 617. 95% of the students fall within 2 standard deviations of the mean, that is in the range µ 2σ = 506 222 = 284 to µ + 2σ = 506 + 222 = 728. Where is the cutoff between the first and second Quartile? 10. SAT Example Cont d We want P = 25%. The (3-digit) chart shows the z-value corresponding to P =.25 is z =.675. This means that 25% of the data occurs before you get within.675 standard deviations of µ (on the left). Another 25% lies between µ.675σ and µ itself. So the first quartile occurs at Q 1 = µ.675σ = 506 (.675)111 = 431 It turns out Q 1 was exactly 430. The third quartile occurs at Q 1 = µ +.675σ = 506 + (.675)111 = 581 11. Draw the Picture 2001 SAT Scores 25% µ 0.675σ Q 1 = 431 25% 25% µ 506 25% µ + 0.675σ Q 3 = 581 I.4 Sampling Lecture Notes

5 12. Statistical Thinking Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H. G. Wells, author of War of the Worlds Definition: Statistics is the science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively evaluated. 13. Three Phases of Statistics Collect the data Analyze the data order the data graphical displays numerical calculations (such as mean and standard dev) Interpret the results use proper statistical techniques to substantiate or refute hypothesized statements match data to the appropriate technique determine whether the proper assumptions are satisfied 14. Two types of statistics Descriptive statistics summarize and describe a characteristic for some group Inferential statistics estimate, infer, predict, or conclude something about a larger group 15. Examples Descriptive Batting Average Yards Per Carry Test Scores Inferential Polls Medical Studies Market Surveys

6 16. Two types of data Quantitative data values recorded on a natural numerical scale Qualitative data classified into categories 17. Quantitative Data Weight of subjects in medical sample Height of buildings in Chicago Temperatures per day at Antarctica Weather Station 18. Qualitative Data Gender of subjects in medical sample Political affilation of respondents in a poll survey Class (fresh, soph, jr, sr) of Math 101 students 19. Vocabulary The population is the entire set of objects (people or things) under consideration. A sample is a subset of the population that is available for the analysis. A bias is a favoring of certain outcomes over others. A census collects data from each member of the population. A statistic is a statement of numerical information about a sample. A parameter is a statement of numerical information about a population. 20. Census versus Sample Would you use a census or a sample to determine the following: Project the winner of an election Calculate a baseball player s batting average

7 Predict whether it will rain tomorrow Test whether the soup is too salty Calculate Shaq s free throw average Use a market study to determine a new flavor of toothpaste Report the Dow Jones Average Generalize a medical study to other groups The average score on the first test 21. Dealing with bias Bias in some form occurs in the collecting of most, if not all, sets of data. The bias may come from the portion of the population surveyed the phrasing of the questions 22. Examples Dewey defeats Truman projection of Chicago Tribune based on 1948 telephone poll Are you in favor of Illinois banning cell phones in cars? Dial *91 on your cellular phone to vote. Do you feel budget cuts are more important than humanitarian programs that would need to be cut to obtain a balanced budget? Judgement Sample 23. Methods for Choosing Samples

8 Use the opinion of person(s) deemed qualified to choose members of the sample. Example: to investigate study habits of atheletes, ask their coaches and teachers. Simple Random Selection Use random numbers to select the sample. Page 315 Random Digit Table: 72985547555515086461 Stratefied Sampling Divide the population into relatively homogenous groups, draw a sample from each group, and take their union. 24. Goals of a good sample from the correct population chosen in an unbiased way large enough to reflect total population 25. Normal Distribution of Random Events Toss a coin 100 times and count the number of heads. How many heads would you expect? about 50 exactly 50 It does not seem reasonable that the count will be exactly 50. We would not be surprised if the number of heads turned out to be 48 or 51 or even 55. We would be surprised to see 80 heads, and would begin to suspect that the coin was not fair. 26. Coin Toss Data Experiment: A coin is tossed n = 100 times.

9 The experiment is repeated 1000 times. Here are the results: 27. Frequency Table: No. of Heads Heads Freq Heads Freq Heads Freq 1 0 45 54 58 27. 0 46 49 59 19 34 0 47 54 60 11 35 2 48 66 61 11 36 2 49 89 62 5 37 2 50 70 63 4 38 2 51 77 64 2 39 5 52 85 65 0 40 14 53 62 66 0 41 16 54 57 67 1 42 25 55 52 68 0 43 30 56 40. 0 44 31 57 36 100 0 28. Mean and Standard Deviation mean = 50.296 stand dev = 5.100

10 29. Coin Toss Histogram 30 40 50 60 70 30. Sampling Distributions If we could examine all possible samples of size n of a population, then the frequency distribution of the means of these samples is normally distributed. µ = the mean over the entire population σ = the standard deviation over the entire population x = the mean of the sampling distribution σ x = the standard deviation of the sampling distribution Rule 1. x = µ 31. Two Rules Rule 2. σ x = σ n We are assuming in Rule 2 that the size of the entire population is much larger than the sample size n.