Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation



Similar documents
Lecture 13/Chapter 10 Relationships between Measurement (Quantitative) Variables

Relationships Between Two Variables: Scatterplots and Correlation

Homework 8 Solutions

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Univariate Regression

Describing Relationships between Two Variables

Homework 11. Part 1. Name: Score: / null

Name: Date: Use the following to answer questions 2-3:

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

Section 3 Part 1. Relationships between two numerical variables

The Big Picture. Correlation. Scatter Plots. Data

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Exercise 1.12 (Pg )

Chapter 7: Simple linear regression Learning Objectives

Plot the following two points on a graph and draw the line that passes through those two points. Find the rise, run and slope of that line.

Correlation and Regression

2. Simple Linear Regression

The correlation coefficient

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Part 1: Background - Graphing

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

FREE FALL. Introduction. Reference Young and Freedman, University Physics, 12 th Edition: Chapter 2, section 2.5

Correlation key concepts:

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

table to see that the probability is (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: = 1.

Slope-Intercept Equation. Example

AP Statistics. Chapter 4 Review

ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

Graphing Linear Equations

Father s height (inches)

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b.

Linear Equations. Find the domain and the range of the following set. {(4,5), (7,8), (-1,3), (3,3), (2,-3)}

Scatter Plot, Correlation, and Regression on the TI-83/84

Diagrams and Graphs of Statistical Data

Worksheet A5: Slope Intercept Form

Simple linear regression

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Using Excel for Statistical Analysis

(Least Squares Investigation)

3.2 LOGARITHMIC FUNCTIONS AND THEIR GRAPHS. Copyright Cengage Learning. All rights reserved.

MTH 140 Statistics Videos

Module 3: Correlation and Covariance

AP Statistics Ch 3 Aim 1: Scatter Diagrams

Lesson Using Lines to Make Predictions

Chapter 9 Descriptive Statistics for Bivariate Data

with functions, expressions and equations which follow in units 3 and 4.

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

Statistics. Measurement. Scales of Measurement 7/18/2012

Simple Predictive Analytics Curtis Seare

Chapter 7 Scatterplots, Association, and Correlation

Chapter 11: r.m.s. error for regression

Descriptive Statistics

WEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y X

Straightening Data in a Scatterplot Selecting a Good Re-Expression Model

The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Linear Regression. use waist

Linear Models in STATA and ANOVA

Scatter Plots with Error Bars

A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion

1 of 7 9/5/2009 6:12 PM

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Pennsylvania System of School Assessment

Session 7 Bivariate Data and Analysis

Describing, Exploring, and Comparing Data

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Copyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared.

MA.8.A.1.2 Interpret the slope and the x- and y-intercepts when graphing a linear equation for a real-world problem. Constant Rate of Change/Slope

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

The importance of graphing the data: Anscombe s regression examples

Introduction to Quantitative Methods

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Motion Graphs. It is said that a picture is worth a thousand words. The same can be said for a graph.

Transforming Bivariate Data

II. DISTRIBUTIONS distribution normal distribution. standard scores

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

PLOTTING DATA AND INTERPRETING GRAPHS

Dealing with Data in Excel 2010

Algebra II EOC Practice Test

Simple Linear Regression

2.5 Transformations of Functions

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

1.3 LINEAR EQUATIONS IN TWO VARIABLES. Copyright Cengage Learning. All rights reserved.

Algebra II End of Course Exam Answer Key Segment I. Scientific Calculator Only

Algebra 1 Course Information

Florida Math for College Readiness

Part 1 : 07/27/10 21:30:31

Review of Fundamental Mathematics

ANALYSIS OF TREND CHAPTER 5

Data Mining Part 5. Prediction

Transcription:

Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation Display and Summarize Correlation for Direction and Strength Properties of Correlation Regression Line Cengage Learning Elementary Statistics: Looking at the Big Picture 1

Looking Back: Review 4 Stages of Statistics Data Production (discussed in Lectures 1-4) Displaying and Summarizing Single variables: 1 cat,1 quan (discussed Lectures 5-8) Relationships between 2 variables: Categorical and quantitative (discussed in Lecture 9) Two categorical (discussed in Lecture 10) Two quantitative Probability Statistical Inference Elementary Statistics: Looking at the Big Picture L11.2

Example: Two Single Quantitative Variables Background: Data on male students heights and weights: Question: What do these tell us about the relationship between male height and weight? Response: Elementary Statistics: Looking at the Big Picture L11.5

Definition Scatterplot displays relationship between 2 quantitative variables: Explanatory variable (x) on horizontal axis Response variable (y) on vertical axis Elementary Statistics: Looking at the Big Picture L11.6

Example: Explanatory/Response Roles Background: We re interested in the relationship between male students heights and weights. Question: Which variable should be graphed along the horizontal axis of the scatterplot? Response: Elementary Statistics: Looking at the Big Picture L11.8

Definitions Form: relationship is linear if scatterplot points cluster around some straight line Direction: relationship is positive if points slope upward left to right negative if points slope downward left to right Elementary Statistics: Looking at the Big Picture L11.9

Example: Form and Direction Background: Scatterplot displays relationship between male students heights and weights. Question: What are the form and direction of the relationship? Response: Form is direction is Elementary Statistics: Looking at the Big Picture L11.11

Strength of a Linear Relationship Strong: scatterplot points tightly clustered around a line Explanatory value tells us a lot about response Weak: scatterplot points loosely scattered around a line Explanatory value tells us little about response Elementary Statistics: Looking at the Big Picture L11.12

Example: Relative Strengths Background: Scatterplots display: mothers ht. vs. fathers ht. (left) males wt. vs. ht. (middle) mothers age vs. fathers age (right): Question: How do relationships strengths compare? (Which is strongest, which is weakest?) Response: Strongest is on, weakest is on Elementary Statistics: Looking at the Big Picture L11.14

Example: Negative Relationship Background: Scatterplot displays price vs. age for 14 used Pontiac Grand Am s. Questions: Why should we expect the relationship to be negative? Does it appear linear? Is it weak or strong? Responses: Elementary Statistics: Looking at the Big Picture L11.16

Definition Correlation r: tells direction and strength of linear relation between 2 quantitative variables Direction: r is positive for positive relationship negative for negative relationship zero for no relationship Strength: r is between -1 and +1; it is close to 1 in absolute value for strong relationship close to 0 in absolute value for weak relationship close to 0.5 in absolute value for moderate relationship Elementary Statistics: Looking at the Big Picture L11.17

Example: Extreme Values of Correlation Background: Scatterplots show relationships (left) Price per kilogram vs. price per pound for groceries (middle) Used cars age vs. year made (right) Students final exam score vs. order handed in Question: Correlations (scrambled) are -1, 0, +1. Which goes with each scatterplot? Response: left r = ; middle r = ; right r = Elementary Statistics: Looking at the Big Picture L11.20

Example: Relative Strengths Background: Scatterplots display: mothers ht. vs. fathers ht. (left) males wt. vs. ht. (middle) mothers age vs. fathers age (right): Question: Which graphs go with which correlation: r = 0.23, r = 0.78, r = 0.65? Response: left r = ; middle r = ; right r = Elementary Statistics: Looking at the Big Picture L11.22

Example: Imperfect Relationships Background: For 50 states, % voting Republican vs. % Democrat in 2000 presidential election had r = -0.96. Questions: Why should we expect the relationship to be negative? Why is it imperfect? Responses: Negative: Imperfect: Elementary Statistics: Looking at the Big Picture L11.24

More about Correlation r Tells direction and strength of linear relation between 2 quantitative variables A strong curved relationship may have r close to 0 Correlation not appropriate for categorical data Unaffected by roles explanatory/response Unaffected by change of units Overstates strength if based on averages Elementary Statistics: Looking at the Big Picture L11.25

Example: Correlation when Roles are Switched Background: Male students wt vs ht (left) or ht vs wt (right): Questions: How do directions and strengths compare, left vs. right? How do correlations r compare, left vs. right? Responses: Elementary Statistics: Looking at the Big Picture L11.27

Example: Correlation when Units are Changed Background: For male students plot Left: wt (lbs) vs. ht (in) or Right: wt (kg) vs. ht (cm) Question: How do directions, strengths, and r compare, left vs. right? Response: Elementary Statistics: Looking at the Big Picture L11.30

Example: Correlation Based on Averages Background: For male students plot Left: wt. vs. ht. or Right: average wt. vs. ht. Question:Which one has r =+0.87? (other r =+0.65) Response: Plot on has r = +0.87 (stronger). Elementary Statistics: Looking at the Big Picture L11.33

Least Squares Regression Line If form appears linear, then we picture points clustered around a straight line. Questions (Rhetorical): 1. Is there only one best line? 2. If so, how can we find it? 3. If found, how can we use it? Responses: (in reverse order) 3. If found, can use line to make predictions. Elementary Statistics: Looking at the Big Picture L11.35

Least Squares Regression Line Response: 3. If found, can use line to make predictions. Write equation of line : Explanatory value is Predicted response is y-intercept is Slope is and use the line to predict a response for any given explanatory value. Elementary Statistics: Looking at the Big Picture L11.36

Least Squares Regression Line If form appears linear, then we picture points clustered around a straight line. Questions: 1. Is there only one best line? 2. If so, how can we find it? 3. If found, how can we use it? Predictions Response: 2. Find line that makes best predictions. Elementary Statistics: Looking at the Big Picture L11.37

Least Squares Regression Line Response: 2. Find line that makes best predictions: Minimize sum of squared residuals (prediction errors). Resulting line called least squares line or regression line. A Closer Look: The mathematician Sir Francis Galton called it the regression line because of the regression to mediocrity seen in any imperfect relationship: besides responding to x, we see y tending towards its average value. Elementary Statistics: Looking at the Big Picture L11.38

Least Squares Regression Line If form appears linear, then we picture points clustered around a straight line. Questions: 1. Is there only one best line? 2. If so, how can we find it? Minimize errors 3. If found, how can we use it? Predictions Response: 1. Methods of calculusunique best line Elementary Statistics: Looking at the Big Picture L11.39

Least Squares Regression Line If form appears linear, then we picture points clustered around a straight line. Questions: 1. Is there only one best line? 2. If so, how can we find it? 3. If found, how can we use it? Response: 1. Best line has Elementary Statistics: Looking at the Big Picture L11.40

Example: Least Squares Regression Line Background: Car-buyer wants to know if $4,000 is a fair price for an 8-yr-old Grand Am; uses software to regress price on age for 14 used Grand Am s: Price =14,686.3-1,287.64Age Question: How can she use the line? Response: Predict for x=8,. Elementary Statistics: Looking at the Big Picture L11.42

Lecture Summary (Quantitative Relationships; Correlation) Display with scatterplot Summarize with form, direction, strength Correlation r tells direction and strength Properties of r Unaffected by explanatory/response roles Unaffected by change of units Overstates strength if based on averages Least squares regression line for predictions Elementary Statistics: Looking at the Big Picture L11.43