Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers



Similar documents
Homework 11. Part 1. Name: Score: / null

Section 3 Part 1. Relationships between two numerical variables

Univariate Regression

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

Using Excel for Statistical Analysis

Simple Regression Theory II 2010 Samuel L. Baker

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

2. Simple Linear Regression

Scatter Plot, Correlation, and Regression on the TI-83/84

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

The correlation coefficient

Chapter 7: Simple linear regression Learning Objectives

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation

Correlation key concepts:

Zeros of a Polynomial Function

Relationships Between Two Variables: Scatterplots and Correlation

Pearson s Correlation Coefficient

Module 3: Correlation and Covariance

Elasticity. I. What is Elasticity?

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Because the slope is, a slope of 5 would mean that for every 1cm increase in diameter, the circumference would increase by 5cm.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Pennsylvania System of School Assessment

Statistics. Measurement. Scales of Measurement 7/18/2012

You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week. Week

Session 7 Bivariate Data and Analysis

Lean Six Sigma Analyze Phase Introduction. TECH QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

with functions, expressions and equations which follow in units 3 and 4.

Introduction to Regression and Data Analysis

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

Moderator and Mediator Analysis

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Simple linear regression

13. Poisson Regression Analysis

Introduction to Quantitative Methods

Father s height (inches)

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

Georgia Standards of Excellence Curriculum Map. Mathematics. GSE 8 th Grade

Hedge Effectiveness Testing

table to see that the probability is (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: = 1.

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Module 5: Multiple Regression Analysis

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability).

CALCULATIONS & STATISTICS

Econometrics Simple Linear Regression

Example: Boats and Manatees

Premaster Statistics Tutorial 4 Full solutions

Linear Models in STATA and ANOVA

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

Homework #1 Solutions

The Correlation Coefficient

II. DISTRIBUTIONS distribution normal distribution. standard scores

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Means, standard deviations and. and standard errors

Common Core Unit Summary Grades 6 to 8

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Zeros of Polynomial Functions

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared.

Indiana State Core Curriculum Standards updated 2009 Algebra I

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

y = a + bx Chapter 10: Horngren 13e The Dependent Variable: The cost that is being predicted The Independent Variable: The cost driver

SPSS Guide: Regression Analysis

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Common Core State Standards for Mathematics Accelerated 7th Grade

Course Outlines. 1. Name of the Course: Algebra I (Standard, College Prep, Honors) Course Description: ALGEBRA I STANDARD (1 Credit)

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

11. Analysis of Case-control Studies Logistic Regression

Assessment Schedule 2013

TIME SERIES ANALYSIS & FORECASTING

Method To Solve Linear, Polynomial, or Absolute Value Inequalities:

CORRELATION ANALYSIS

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Financial Risk Management Exam Sample Questions/Answers

AP Physics 1 and 2 Lab Investigations


Introduction to Basic Reliability Statistics. James Wheeler, CMRP

5. Linear Regression

Part 1 Expressions, Equations, and Inequalities: Simplifying and Solving

DATA INTERPRETATION AND STATISTICS

Transforming Bivariate Data

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b.

March 29, S4.4 Theorems about Zeros of Polynomial Functions

Regression and Correlation

Getting Correct Results from PROC REG

Simple Linear Regression, Scatterplots, and Bivariate Correlation

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

Nonlinear Regression Functions. SW Ch 8 1/54/

Transcription:

Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables are correlated, knowing the value of one variable allows you to predict the value of the other variable Perfect Correlation When two variables are perfectly correlated, knowing the value of one variable allows you to exactly predict the value of the other variable Perfect Correlation For example, if the only thing that determined your was the amount of time that you studied, then the two would be perfectly correlated If you know the value of one variable, you can exactly determine the value of the other variable 5 Study Hours per Day

Perfect Correlation That is, all the variability in one variable is explained by the variability in the other variable 5 Study Hours per Day 5 Perfect Correlations Few, if any, psychological variables are perfectly correlated with each other Many non-psychological variables do have a perfect correlation E.g. Time since the beginning of class and the time remaining in the class are perfectly correlated What are other examples of perfectly correlated variables? 6 Less Than Perfect Correlations Even if two variables are correlated, most of the time you cannot perfectly predict the value of one variable given the other E.g., other variables besides amount of time spent studying influence your Some of the variability is people s is due to the amount of time spent studying, but not all the variability is due to it 7 Less Than Perfect Correlations Study Time IQ 8..5. 5 8.5 5. 5.5 5 Study Hours per Day 8

Less Than Perfect Correlations With a less than perfect correlation, we can no longer perfectly predict the value of one variable given the other variable We cannot explain all the variability in one variable with the variability in the other variable 9 The Correlation Coefficient Correlation coefficients tell us how perfectly two (or more) variables are related to each other They can also be used to determine how much variability in one variable is explainable by variation in the other variable. Pearson s Product Moment Correlation Coefficient Pearson s product moment correlation coefficient, or Pearson s r, for short is a very common measure of how strongly two variables are related to each other Pearson s r must lie in the range of - to + inclusive Interpretation of Pearson s r To interpret Pearson s r, you must consider two parts of it: The sign of r The magnitude, or absolute value of r

The Sign of r When r is greater than (I.e. its sign is positive) the variables are said to have a direct relation In a direct relation, as the value of one variable increases, the value of the other variable also tends to increase 6 Study Hours per Night The Sign of r When r is less than (i.e., its sign is negative) the variables are said to have an indirect relation In an indirect relation, as the value of one variable increases, the value of the other variable tends to decrease 6 Hours of TV per Night Is the Sign of r + or -? As the number of cigarettes smoked per day increases, tends to decrease As the number of cats in a farm yard increases, the number of mice tends to decrease As the weight of a cat increases, the length of its whiskers tends to increase 5 Is the Sign of r + or -? Create two examples of correlations and determine if the sign of r is positive or negative 6

The Magnitude of r The magnitude refers to the size of the correlation coefficient ignoring the sign of r The magnitude is equivalent to taking the absolute value of r The larger the magnitude of r is, the more perfectly the two variables are related to each other The smaller the magnitude of r is, the less perfectly the two variables are related to each other r = 7 When r equals., there is a perfect correlation between the variables Knowing the value of one variable exactly predicts the value of the other variable Hours..8.6... 5 6 Minutes 8 r = When r equals, either the assumptions of correlation have been violated or there is no relation between the two variables The points in a scatter plot with r = will tend to form a circular cluster 5 - -5 5-5 - 9 The larger the magnitude or r is, the more the scatter plot s points will tend to cluster tightly about a line < r < < r <

Magnitude of r Cohen (988) recommends the following values of r for small, medium, and large effects Correlation Small Medium Negative -.9 to -. -.9 to -. Positive. to.9. to.9 Large -. to -.5.5 to. Magnitude of r List a couple of pairs of variables and guess whether the magnitude of r is closer to or closer to Pearson s r Pearson s r makes several assumptions about the data When these assumptions are violated, r must be interpreted with extreme caution Assumptions: Linear relation Non-truncated range Sufficiently large sample size Linear Relation Pearson s r, in its simplest form, only works for variables that are linearly related That is, the equation that allows us to predict the value of one variable from the value of the other is a line: Y = slope * X + intercept Always look at the scatter plot to determine if the two variables are approximately linearly related

Linear Relation If the variables are not linearly related, Pearson s r will indicate a smaller relation than actually exists Often, non-linear relations can be transformed into linear ones by taking the appropriate mathematical transformation 5 Square Root of Y Transformation 6 5 6 8 8 7 6 5 6 8 6 Non-Truncated Range A truncated range occurs when the range of one of the variables is very small When the range is truncated, Pearson s r will indicate a smaller relation between the variables than what actually exists Once a range truncation occurs, there is little that you can do; be careful not to design studies that will lead to a truncated range 7 Truncated Range A linear relation clearly exists in this data Consider only the data in the square (thereby truncating the range) Is the linear relation as clear as it was? No College..5..5..5...5..5..5. High School 8

Sample Size If the size of the sample is too small, relations can appear due to chance These relations disappear when a larger sample is considered Too large of a sample can make near correlations statistically significant, even though they have very little explanatory power 9 Sample Size The magnitude of r does not depend on sample size The likelihood of finding a statistically significant r does depend on sample size The sample should be large enough to generalize to the population of interest