(positive, not very strong) (perfectly negatively correlated)

Size: px
Start display at page:

Download "(positive, not very strong) (perfectly negatively correlated)"

Transcription

1 HOMEWORK PROBLEMS Part II. Chapter 8 -- Review Exercises Statistics Dr. McGahagan Review Exercises (p ) Problem 3. Men always marry women exactly 10 percent shorter. Let HH = height of husband and HW = height of wife Therefore, HW = a HH will be the equation of the line enabling us to predict the height of the wife, and every point representing a pair of husbands and wives will be exactly on that line -- a man 60 inches tall will have a wife 54 inches tall, a man 70 inches tall will have a wife 63 inches tall, a man 500 inches tall will have a wife 450 inches tall, and so on. The correlation coefficient will be + 1. IMPORTANT LESSON: the correlation coefficient does not give the slope of the line, but is determined by the presence or absence of scatter about the line. Problem 5. Guess the correlation. Choices: -0.5, 0.0, 0.3, 0.6, 0.95 (a) Between freshman and sophomore GPA. Positive but certainly not perfect. Many students who have initial trouble improve in second year. Best guess: 0.6 [incidentally, this is on the order of the correlation between SAT scores and freshman grades] (b) Between freshman and senior grades: Still positive, but with more time comes more variation. Best guess: 0.30 (c) Between weight and length of 2x4 pieces of pine. Best guess: 0.95 There is still likely to be variation if moisture conditions or number of knotholes differ. Problem 7. Associate scatter diagrams with correlation coefficients. Grid of correlation coefficients corresponding with the plots should be: (positive, not very strong) (perfectly negatively correlated) (negative, pretty strong) (positive and very strong) (no real pattern, and not (weak, but pretty definitely negative; even very clear that the compare with the plot to the left relation is positive) to decide which of the final two is which)

2 Problem 8. Human Growth Study. Plot of heights at age 18 against height at age 4 shows fairly strong but not perfect correlation. Average height at age 4 seems about 41 inches -- at about the point where the two lines cross. SD of height at age 4 = about 1.5 inches, because 41 +/- 2 (1.5) will cover almost all points (put an index card vertically at 44 and 38 inches to see this) Average height at age 18 seems about 71 inches (again the point at which the two lines cross). SD of height at age 18 = about 2.5 inches, because 71 +/- 2 (2.5) will cover almost all points (put an index card horizontally at 66 and 76 inches to see this). The correlation coefficient is about 0.75 to high but not perfect correlation. The SD line will be the solid line; we will soon meet the dashed line as the regression line. If you xerox the page (on higher magnification) and draw the horizontal and vertical lines as above, you will see that the SD box corners should pretty much fall on the solid line. Problem 9. Correlation coefficient calculation exercise. Mean SD Corr. (a) X Y 3 2 (b) X Y 2 1 (c) X (note that Y = 2 X) Y 4 2 Computer notes: (bind x (list )) and X will stay the same for all three problems. (bind y (list...)) will change with each problem. (plot y x) and use the PLOT menu to adjust the axis range so X and Y start with zero. You can jitter the plots with the capital J key, distinguishing overlapping points. Problem 11. Quiz in statistics With 10 questions on a quiz, and an average number of 6.4 right, and a SD of 2.0 for the right answers, we note that, letting R = number of right answers and W = number of wrong answers, W = 10 - R There cannot possibly be any point off the SD line, so the correlation coefficient will be minus 1. Simulation: (bind right (rnd 20 10)) Create 50 uniform random integers from 0 to 10, using the Mersenne Twister (bind wrong (- 10 right)) (stats right wrong) (corr right wrong) (plot right wrong) (abline 10-1)

3 Chapter 9 -- Review Exercises Statistics Dr. McGahagan Problem 2. True or False? (a) FALSE. If correlation coefficient is -0.8, below-average values of the dependent variable are associated with ABOVE average values of the independent variable. Try (scatter -0.8) and compare to (scatter 0.8) (b) FALSE. If y is always less than x, the correlation coefficient may be positive or negative. Try (bind x (list )), (bind y (/ x 10)) (< y x) = (T T T T T) so y is always less than X. (corr x y) = 1.00 Then (bind y (- (/ x 10))) and repeat: (< y x) = (T T T T T) so y is always less than X. (corr x y) = Problem 5. Can X and Y be perfectly correlated? Given the following sets of numbers, can you fill in the final number to ensure rho = + 1? (a) X = ( ) Y = ( ) All points given lie along a line through the points (1, 1) and (2, 3) The slope of the line is + 2, so the equation of the line would be Y = a + 2 X To fit the other points, we must have: 1 = a + 2 (1) so a = -1 3 = a + 2 (2) so a = -1 The required point will be Y = (4) = 7 You may check your solution by defining the lists: (bind y (list )) (bind x (list )) Then (corr x y) = will show that you have solved the problem (b) X = ( ) Y = ( ) The first two points lie on a line through the points (1, 1) and (2, 3) The equation of that line was Y = X, as found in part (a) The third point (3, 4) does NOT lie on that line: Y = (3) = 5, so an X value of 3 corresponds to a value of 5, not 4. It is therefore impossible for all 4 points to line on the same line.

4 Problem 9. Student evaluations of assistants and exam performance. Calculate the correlations: (bind assistant (list )) (bind course (list )) (bind final (list )) The correlations are: (corr assistant course) = ; not much relation. (b) is true. (corr assistant final ) = definite NEGATIVE correlation. Hence (a) is false. The more students liked the assistant, the worse they did on the final. Hypothesis: the well-liked assistants were entertaining, but not demanding. (corr course final) = Positive relation here; though it is unclear from the correlation whether students put forward effort because they liked the course, or liked the course because they realized they were doing well in it. Problem 12. Education of husbands and wives Correlation of about 0.8, at a guess. (a) Vertical/horizontal stripes occur because few people have, and even fewer report, years of school -- the data are discrete, not continuous. (b) Few points appear because they overlap. If the graph were in EcLS, the capital J key would jitter the points (add some random noise so they did not overlap). (c) Shaded areas on which plot indicate that: (i) Wife completed exactly 16 years of schooling. Plot C (ii) Wife completed more schooling than husband. No plot shows this; given the actual plot, you could draw a 45-degree line (y = x) with the command (abline 0 1). Points above and to the left of that line would indicate wife's education > husband's education. (iii) Husband completed more than 16 years of schooling. Plot B. (iv) Husband completed exactly 12 years, and wife completed fewer than 12 years. Plot A.

5 Review Questions -- Chapter Regression Statistics Dr. McGahagan * Problem 4. Educational levels of husband and wife. Average ed. level = 12 years, SD = 3 years for both; correlation coefficient = 0.5 (a) If husband has 18 years of schooling, he is (18-12) / 3 = 2 SD above average. The wife would be expected to be 0.5 * 2 SD = 1 SD above average, or to have 15 years of education. (b) If wife has 15 years of schooling, she is 1 SD above average. Her husband would be expected to be half a standard deviation above average, that is to have = 13.5 years of education. (c) Isn't this strange? Well-educated men marry above average, but less well educated women, but can it be also true that the women marry still less educated men? While very well educated men or women might prefer to marry someone as well educated as themselves, the marriage pool doesn't include many of them, and chances are they will on average marry someone a bit less highly educated. Note the "on average" -- it will be the somewhat unusual very well educated man or woman who marries someone more educated than themselves, but "somewhat unusual" does not mean "non-existent". * Problem 6. Which line is the regression line? The dotted, roughly 45 degree line is the SD line. Regression lines are always flatter than the SD line, when the response variable (the dependent variable) is on the vertical axis. The dashed line tells you the expected value of y given x; the solid line indicates the expected value of X given Y (note that you have to interchange the axes so that the response variable is on the vertical axis). * Problem 9. Percentile rank and correlations. Correlation of midterm scores and final scores is 0.50 (a) Percentile rank on midterm = 5 percent. SD below mean on midterm: (normal-quant 0.05) = or find 90 in the area column of the normal table (5 percent on either side). The area percent is defined by a z-score of +/ Expected SD below mean on final: 0.5 * = This translates into an area defined by +/ of between and 60.47, let's say a 60 percent central area, so that 20 percent is below and 20 percent above This means the average student at the 5th percentile could be predicted to be at the 20th percentile on the final. In EcLS: (normal-cdf (* rho (normal-quant pctile1)) = (normal-cdf (* 0.5 (normal-quant 0.05)) =.2054 (b) Percentile rank on midterm: Since 20 percent rank higher, we must first look for central area of 60, that is defined by about +/ SDs Score will be expected to be 0.5 * 0.85 = SDs, which gives a central area of between and in the table -- say one third of the total area for simplicity. About a third will be above SDs, so the expected percentile rank is 2/3 = 67th percentile. In EcLS: (normal-cdf (* 0.5 (normal-quant 0.80)) = or the 66th percentile. (c) and (d). Whether we know the student is average at the midterm or simply assume it because we have no further information, we expect the student to be average on the final. * Problem 10. Regression effect. Expect the student at the 40th percentile to improve somewhat. Exactly how much depends on the correlation coefficient. See Exercise set C, problem 2 for a similar problem.

6 Review Questions -- Chapter RMS Error * Problem 4. Midterm and Final Scores, part I. Average grade on midterm = 50; SD on midterm = 25 Average grade on final = 55; SD on final = 15 Correlation of midterm and final = 0.60 (a) For about one-third of students, prediction of final will be off by more than --- RMSEs. For about 2/3 of the students, the prediction will be off by less than the same unknown value. But we do know that a central area of about 2/3 will be from - 1 to + 1; so the blank should be filled by 1 RMSE.This is given by the problem 1 formula as (sqrt (1-0.36)) * 15 = 0.8 * 15 = 12 points. (b) Given a midterm of 80, the student scored (80-50) / 25 = 30 / 25 = 1.2 SDs above the mean for the midterm. He is therefore expected to score 0.6 * 1.2 = 0.72 SDs above the mean on the final. The standardized score of the final will have been computed as (X - 55) / 15 = 0.72; hence X = * 15 or we would predict a score of In our notation, E [final midterm = 80] = 65.8 (c) This prediction will, if all assumptions are met (normal distribution of residuals, no pattern to residuals), have a 50 percent chance of being within 0.7 RMSEs (see the normal table; look for central area of 51.61). That is, it has an even chance of falling within 0.7 * 12 = 8.4 points of the predicted value. * Problem 5. Midterm and Final Scores, Part II. (a) Assuming normality, the percentage of students scoring above 80 on the final can be found without regression: Z-score of 80 points = (80-55) / 15 = 25 / 15 = 5/3 = 1.67 Corresponding central area (for Z = 1.65, closest value in table) = percent Two tail area = = 9.89 percent, so one tail area (above 80 points) = percent. (b) Given a score of 80 on the midterm, the expected score on the final is 65.8 (see last problem). RMSE is 12 points, so a score of 80 would be ( ) / 12 = 1.18 RMSEs above the mean. The central area for Z = +/- 1.2 (closest to 1.18) is 76.99, so about 23 percent of the points would be outside this area, or half that = 12.5 points above 80. * Problem 7. Correlation and regression (Extended) There is a correlation between math and physics test scores; both have mean of 60 and equal SDs. We would expect students who scored an above average 75 on the math test to score above average on the physics test as well -- but, due to the regression effect, not as much above average. So the expected score will be more than 60 but less than 75. Suppose the SDs of each test are 20, and the correlation coefficient 0.90 Then the score of 75 was (75-60) / 20 = 15 /20 = 0.75 SDs above average on the math test. We would expect a score of 0.9 * 0.75 = SDs above average on the physics test. This Z-score would have been computed as; = (X - 60) / 20 so X - 60 = * 20 = 13.5 and X = 73.5 Note that due to the high correlation, we don't expect the new score to be much less

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade Statistics Quiz Correlation and Regression -- ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements

More information

. 58 58 60 62 64 66 68 70 72 74 76 78 Father s height (inches)

. 58 58 60 62 64 66 68 70 72 74 76 78 Father s height (inches) PEARSON S FATHER-SON DATA The following scatter diagram shows the heights of 1,0 fathers and their full-grown sons, in England, circa 1900 There is one dot for each father-son pair Heights of fathers and

More information

Continuing, we get (note that unlike the text suggestion, I end the final interval with 95, not 85.

Continuing, we get (note that unlike the text suggestion, I end the final interval with 95, not 85. Chapter 3 -- Review Exercises Statistics 1040 -- Dr. McGahagan Problem 1. Histogram of male heights. Shaded area shows percentage of men between 66 and 72 inches in height; this translates as "66 inches

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

Homework 11. Part 1. Name: Score: / null

Homework 11. Part 1. Name: Score: / null Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2 Lesson 4 Part 1 Relationships between two numerical variables 1 Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables

More information

Descriptive statistics; Correlation and regression

Descriptive statistics; Correlation and regression Descriptive statistics; and regression Patrick Breheny September 16 Patrick Breheny STA 580: Biostatistics I 1/59 Tables and figures Descriptive statistics Histograms Numerical summaries Percentiles Human

More information

Section 3 Part 1. Relationships between two numerical variables

Section 3 Part 1. Relationships between two numerical variables Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.

More information

EQUATIONS and INEQUALITIES

EQUATIONS and INEQUALITIES EQUATIONS and INEQUALITIES Linear Equations and Slope 1. Slope a. Calculate the slope of a line given two points b. Calculate the slope of a line parallel to a given line. c. Calculate the slope of a line

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

Applied Data Analysis. Fall 2015

Applied Data Analysis. Fall 2015 Applied Data Analysis Fall 2015 Course information: Labs Anna Walsdorff anna.walsdorff@rochester.edu Tues. 9-11 AM Mary Clare Roche maryclare.roche@rochester.edu Mon. 2-4 PM Lecture outline 1. Practice

More information

CURVE FITTING LEAST SQUARES APPROXIMATION

CURVE FITTING LEAST SQUARES APPROXIMATION CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship

More information

Correlation key concepts:

Correlation key concepts: CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Solutions to Homework 6 Statistics 302 Professor Larget

Solutions to Homework 6 Statistics 302 Professor Larget s to Homework 6 Statistics 302 Professor Larget Textbook Exercises 5.29 (Graded for Completeness) What Proportion Have College Degrees? According to the US Census Bureau, about 27.5% of US adults over

More information

table to see that the probability is 0.8413. (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: 60 38 = 1.

table to see that the probability is 0.8413. (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: 60 38 = 1. Review Problems for Exam 3 Math 1040 1 1. Find the probability that a standard normal random variable is less than 2.37. Looking up 2.37 on the normal table, we see that the probability is 0.9911. 2. Find

More information

The Normal Distribution

The Normal Distribution Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution

More information

Statistics E100 Fall 2013 Practice Midterm I - A Solutions

Statistics E100 Fall 2013 Practice Midterm I - A Solutions STATISTICS E100 FALL 2013 PRACTICE MIDTERM I - A SOLUTIONS PAGE 1 OF 5 Statistics E100 Fall 2013 Practice Midterm I - A Solutions 1. (16 points total) Below is the histogram for the number of medals won

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

Draft 1, Attempted 2014 FR Solutions, AP Statistics Exam

Draft 1, Attempted 2014 FR Solutions, AP Statistics Exam Free response questions, 2014, first draft! Note: Some notes: Please make critiques, suggest improvements, and ask questions. This is just one AP stats teacher s initial attempts at solving these. I, as

More information

Relationships Between Two Variables: Scatterplots and Correlation

Relationships Between Two Variables: Scatterplots and Correlation Relationships Between Two Variables: Scatterplots and Correlation Example: Consider the population of cars manufactured in the U.S. What is the relationship (1) between engine size and horsepower? (2)

More information

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there is a relationship between variables, To find out the

More information

Stata Walkthrough 4: Regression, Prediction, and Forecasting

Stata Walkthrough 4: Regression, Prediction, and Forecasting Stata Walkthrough 4: Regression, Prediction, and Forecasting Over drinks the other evening, my neighbor told me about his 25-year-old nephew, who is dating a 35-year-old woman. God, I can t see them getting

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

F.IF.7b: Graph Root, Piecewise, Step, & Absolute Value Functions

F.IF.7b: Graph Root, Piecewise, Step, & Absolute Value Functions F.IF.7b: Graph Root, Piecewise, Step, & Absolute Value Functions F.IF.7b: Graph Root, Piecewise, Step, & Absolute Value Functions Analyze functions using different representations. 7. Graph functions expressed

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles. Math 1530-017 Exam 1 February 19, 2009 Name Student Number E There are five possible responses to each of the following multiple choice questions. There is only on BEST answer. Be sure to read all possible

More information

17. SIMPLE LINEAR REGRESSION II

17. SIMPLE LINEAR REGRESSION II 17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.

More information

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Unit 9 Describing Relationships in Scatter Plots and Line Graphs Unit 9 Describing Relationships in Scatter Plots and Line Graphs Objectives: To construct and interpret a scatter plot or line graph for two quantitative variables To recognize linear relationships, non-linear

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Updates to Graphing with Excel

Updates to Graphing with Excel Updates to Graphing with Excel NCC has recently upgraded to a new version of the Microsoft Office suite of programs. As such, many of the directions in the Biology Student Handbook for how to graph with

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

9. Sampling Distributions

9. Sampling Distributions 9. Sampling Distributions Prerequisites none A. Introduction B. Sampling Distribution of the Mean C. Sampling Distribution of Difference Between Means D. Sampling Distribution of Pearson's r E. Sampling

More information

Describing Relationships between Two Variables

Describing Relationships between Two Variables Describing Relationships between Two Variables Up until now, we have dealt, for the most part, with just one variable at a time. This variable, when measured on many different subjects or objects, took

More information

containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics.

containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics. Getting Correlations Using PROC CORR Correlation analysis provides a method to measure the strength of a linear relationship between two numeric variables. PROC CORR can be used to compute Pearson product-moment

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Worksheet A5: Slope Intercept Form

Worksheet A5: Slope Intercept Form Name Date Worksheet A5: Slope Intercept Form Find the Slope of each line below 1 3 Y - - - - - - - - - - Graph the lines containing the point below, then find their slopes from counting on the graph!.

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Chapter 11: r.m.s. error for regression

Chapter 11: r.m.s. error for regression Chapter 11: r.m.s. error for regression Context................................................................... 2 Prediction error 3 r.m.s. error for the regression line...............................................

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Dealing with Data in Excel 2010

Dealing with Data in Excel 2010 Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540

Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540 To complete this technology assignment, you should already have created a scatter plot for your data on your calculator and/or in Excel. You could do this with any two columns of data, but for demonstration

More information

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Statistics: Correlation Richard Buxton. 2008. 1 Introduction We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Do

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Statistics 151 Practice Midterm 1 Mike Kowalski

Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Plot the following two points on a graph and draw the line that passes through those two points. Find the rise, run and slope of that line.

Plot the following two points on a graph and draw the line that passes through those two points. Find the rise, run and slope of that line. Objective # 6 Finding the slope of a line Material: page 117 to 121 Homework: worksheet NOTE: When we say line... we mean straight line! Slope of a line: It is a number that represents the slant of a line

More information

Name: Date: Use the following to answer questions 2-3:

Name: Date: Use the following to answer questions 2-3: Name: Date: 1. A study is conducted on students taking a statistics class. Several variables are recorded in the survey. Identify each variable as categorical or quantitative. A) Type of car the student

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b.

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b. PRIMARY CONTENT MODULE Algebra - Linear Equations & Inequalities T-37/H-37 What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of

More information

The normal approximation to the binomial

The normal approximation to the binomial The normal approximation to the binomial The binomial probability function is not useful for calculating probabilities when the number of trials n is large, as it involves multiplying a potentially very

More information

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables

More information

1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability).

1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability). Examples of Questions on Regression Analysis: 1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability). Then,. When

More information

Second Midterm Exam (MATH1070 Spring 2012)

Second Midterm Exam (MATH1070 Spring 2012) Second Midterm Exam (MATH1070 Spring 2012) Instructions: This is a one hour exam. You can use a notecard. Calculators are allowed, but other electronics are prohibited. 1. [60pts] Multiple Choice Problems

More information

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5 Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression

More information

AP * Statistics Review. Descriptive Statistics

AP * Statistics Review. Descriptive Statistics AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

PLOTTING DATA AND INTERPRETING GRAPHS

PLOTTING DATA AND INTERPRETING GRAPHS PLOTTING DATA AND INTERPRETING GRAPHS Fundamentals of Graphing One of the most important sets of skills in science and mathematics is the ability to construct graphs and to interpret the information they

More information

Writing the Equation of a Line in Slope-Intercept Form

Writing the Equation of a Line in Slope-Intercept Form Writing the Equation of a Line in Slope-Intercept Form Slope-Intercept Form y = mx + b Example 1: Give the equation of the line in slope-intercept form a. With y-intercept (0, 2) and slope -9 b. Passing

More information

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

More information

6.2 Normal distribution. Standard Normal Distribution:

6.2 Normal distribution. Standard Normal Distribution: 6.2 Normal distribution Slide Heights of Adult Men and Women Slide 2 Area= Mean = µ Standard Deviation = σ Donation: X ~ N(µ,σ 2 ) Standard Normal Distribution: Slide 3 Slide 4 a normal probability distribution

More information

Problem of the Month: Fair Games

Problem of the Month: Fair Games Problem of the Month: The Problems of the Month (POM) are used in a variety of ways to promote problem solving and to foster the first standard of mathematical practice from the Common Core State Standards:

More information

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000

Lecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000 Lecture 4 Nancy Pfenning Stats 000 Chapter 7: Probability Last time we established some basic definitions and rules of probability: Rule : P (A C ) = P (A). Rule 2: In general, the probability of one event

More information

Unit 7: Normal Curves

Unit 7: Normal Curves Unit 7: Normal Curves Summary of Video Histograms of completely unrelated data often exhibit similar shapes. To focus on the overall shape of a distribution and to avoid being distracted by the irregularities

More information

Mean, Median, Standard Deviation Prof. McGahagan Stat 1040

Mean, Median, Standard Deviation Prof. McGahagan Stat 1040 Mean, Median, Standard Deviation Prof. McGahagan Stat 1040 Mean = arithmetic average, add all the values and divide by the number of values. Median = 50 th percentile; sort the data and choose the middle

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

Measurement with Ratios

Measurement with Ratios Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical

More information

INTRODUCTION TO MULTIPLE CORRELATION

INTRODUCTION TO MULTIPLE CORRELATION CHAPTER 13 INTRODUCTION TO MULTIPLE CORRELATION Chapter 12 introduced you to the concept of partialling and how partialling could assist you in better interpreting the relationship between two primary

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Lesson 26: Reflection & Mirror Diagrams

Lesson 26: Reflection & Mirror Diagrams Lesson 26: Reflection & Mirror Diagrams The Law of Reflection There is nothing really mysterious about reflection, but some people try to make it more difficult than it really is. All EMR will reflect

More information

USING EXCEL ON THE COMPUTER TO FIND THE MEAN AND STANDARD DEVIATION AND TO DO LINEAR REGRESSION ANALYSIS AND GRAPHING TABLE OF CONTENTS

USING EXCEL ON THE COMPUTER TO FIND THE MEAN AND STANDARD DEVIATION AND TO DO LINEAR REGRESSION ANALYSIS AND GRAPHING TABLE OF CONTENTS USING EXCEL ON THE COMPUTER TO FIND THE MEAN AND STANDARD DEVIATION AND TO DO LINEAR REGRESSION ANALYSIS AND GRAPHING Dr. Susan Petro TABLE OF CONTENTS Topic Page number 1. On following directions 2 2.

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

Homework 8 Solutions

Homework 8 Solutions Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.

More information

TEACHER NOTES MATH NSPIRED

TEACHER NOTES MATH NSPIRED Math Objectives Students will understand that normal distributions can be used to approximate binomial distributions whenever both np and n(1 p) are sufficiently large. Students will understand that when

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus 1. Introduction Facebook is a social networking website with an open platform that enables developers to extract and utilize user information

More information

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares Linear Regression Chapter 5 Regression Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). We can then predict the average response for all subjects

More information

Common Core State Standards for Mathematics Accelerated 7th Grade

Common Core State Standards for Mathematics Accelerated 7th Grade A Correlation of 2013 To the to the Introduction This document demonstrates how Mathematics Accelerated Grade 7, 2013, meets the. Correlation references are to the pages within the Student Edition. Meeting

More information

3.2. Solving quadratic equations. Introduction. Prerequisites. Learning Outcomes. Learning Style

3.2. Solving quadratic equations. Introduction. Prerequisites. Learning Outcomes. Learning Style Solving quadratic equations 3.2 Introduction A quadratic equation is one which can be written in the form ax 2 + bx + c = 0 where a, b and c are numbers and x is the unknown whose value(s) we wish to find.

More information

7 Relations and Functions

7 Relations and Functions 7 Relations and Functions In this section, we introduce the concept of relations and functions. Relations A relation R from a set A to a set B is a set of ordered pairs (a, b), where a is a member of A,

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

Elasticity. I. What is Elasticity?

Elasticity. I. What is Elasticity? Elasticity I. What is Elasticity? The purpose of this section is to develop some general rules about elasticity, which may them be applied to the four different specific types of elasticity discussed in

More information

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple. Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

More information

Foundations for Functions

Foundations for Functions Activity: TEKS: Overview: Materials: Grouping: Time: Crime Scene Investigation (A.2) Foundations for functions. The student uses the properties and attributes of functions. The student is expected to:

More information

Chapter 3. The Normal Distribution

Chapter 3. The Normal Distribution Chapter 3. The Normal Distribution Topics covered in this chapter: Z-scores Normal Probabilities Normal Percentiles Z-scores Example 3.6: The standard normal table The Problem: What proportion of observations

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION MATHEMATICS A. Monday, January 27, 2003 1:15 to 4:15 p.m.

The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION MATHEMATICS A. Monday, January 27, 2003 1:15 to 4:15 p.m. The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION MATHEMATICS A Monday, January 27, 2003 1:15 to 4:15 p.m., only Print Your Name: Print Your School s Name: Print your name and the

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information