
Two-Variable Statistics

Correlation
A relationship between two variables. As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down).

Positive Correlation
As one variable goes up, so does the other.
Examples of positive correlations:
- cost of using a computer printer and number of pages printed
- number of free throws a basketball player makes and number of times the player is fouled
- time required for a reading assignment and number of pages assigned
- children's age and the number of words in their vocabulary

Negative Correlation
The variables move in opposite directions. As one goes up, the other goes down (and vice versa).
Examples of negative correlations:
- farm acres in Iowa planted to corn and acres planted to other crops
- power of a microwave oven and the time it takes to boil a cup of water
- number of checkouts open at Wal-Mart and how long it takes to check out (at a given time of day)
- number of raffle tickets sold and your probability of winning the raffle

In some cases there can be no correlation:
- college students' height and their grade point averages
- outside temperature and number of problems assigned in a class

IMPORTANT: Just because there is a correlation between two things, it does NOT mean the first thing causes the second. It just means there's a relationship. Something else could be affecting both of the things we are studying.
- This is called a lurking, confounding, or extraneous variable.

FOR INSTANCE, there is a strong correlation between children's shoe size and their vocabulary. BUT having big feet doesn't CAUSE kids to have a bigger vocabulary. Age is the lurking variable: older children tend to have both bigger feet and bigger vocabularies.

Correlations can be shown using a scatterplot. This is literally a graph of the points (x, y) for the values in the relationship.

[Figures: scatterplots illustrating positive correlation, negative correlation, strong correlation, and weak correlation]

In a perfect correlation, all the points would be in a straight line. If there is no correlation, the points would be all over the place.

We are studying linear correlation, which means we are looking at patterns that are close to being in a line. It is possible to have other patterns that don't happen to approximate a line.

From our point of view here, we would say there is no correlation for such nonlinear patterns.

We typically measure the strength of a correlation with a statistic called r.
- r is the correlation coefficient (or Pearson's product-moment correlation coefficient).
- r is always a number between -1 and 1.
- If r = 0, we say there is NO correlation.
- If r = 1 or r = -1, we say there is a PERFECT correlation.
- Usually r is a fraction (normally written as a decimal). Typically, values of r with absolute value less than ½ are considered weak correlations and values with absolute value above ½ are considered strong correlations.
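To make the statistic r concrete, here is a minimal Python sketch of the standard formula for Pearson's correlation coefficient. The handout itself only uses a calculator, so the function name `pearson_r` and the sample data (pages assigned and minutes of reading time) are made up for illustration.

```python
import math

def pearson_r(xs, ys):
    """Return Pearson's correlation coefficient r for paired data."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 pairs")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    # Note: this divides by zero if either variable never changes
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: pages assigned vs. minutes needed to read them
pages = [10, 20, 30, 40, 50]
minutes = [25, 41, 68, 79, 105]
r = pearson_r(pages, minutes)
print(f"r = {r:.3f}")  # close to 1, so a strong positive correlation
```

Each x value must stay paired with its own y value, just as the pairs must be kept together when entering data on a calculator.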

How to find r

On a TI-83 (or 84):

FIRST, one time before you do any two-variable problems:
1. Hit 2nd 0 (CATALOG).
2. Scroll through the catalog until you find DiagnosticOn.
3. Hit ENTER twice.
Once you've done that, your calculator will be set.

For each individual problem:
1. STAT, EDIT
2. Enter the first variable in L1 and the second variable in L2. (It is important that each pair of data be kept paired.)
3. 2nd MODE (QUIT)
4. STAT, CALC
5. Choose #4, LinReg(ax+b).
6. One of the statistics on the read-out is r.

If you don't have a calculator that will find r, here's a quick way to find a rough estimate of its value:
1. On graph paper, graph the points corresponding to the data in your problem.
2. Draw a rectangle that surrounds all the points.
3. Measure the length and width of the rectangle.
4. r ≈ 1 - w/l, where w is the width and l is the length of the rectangle. (Make r negative if the points slope downward.)

Significance Tests for Correlation

QUESTION: Is this a significant positive (or negative) correlation?

HYPOTHESES:
H1: the correlation in the whole population is significantly greater than 0.
H0: the correlation isn't significant.

- You find the critical value in the Pearson Product Moment table on the handout. Use the 1-tail test for the given level of significance.
- You compute r (the test statistic) with your calculator or by graphing, as we learned last week. (In reality, in most cases it is given in the problem.)
- As always, if the absolute value of the calculated value is more than the critical value, the result is significant.

Coefficient of Determination
r² tells what amount of the change in the second variable can be predicted from the change in the first variable. For example, if r = 0.7, then r² = 0.49, so 49% of the change in y can be predicted from the change in x.
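The arithmetic in this section can be sketched in a few lines of Python. All three function names are hypothetical. The rectangle measurements are made up, and the critical value is a placeholder, not a number from the Pearson table; in a real problem you would look it up on the handout.

```python
def rough_r(width, length):
    """Rectangle estimate of the size of r: 1 - w/l (w is the shorter side)."""
    if length <= 0 or width < 0 or width > length:
        raise ValueError("expect 0 <= width <= length and length > 0")
    return 1 - width / length

def coefficient_of_determination(r):
    """Return r squared, the share of change in y predictable from x."""
    return r ** 2

def is_significant(r, critical_value):
    """Compare the absolute value of r with a critical value from the table."""
    return abs(r) > critical_value

# Made-up rectangle: 2 units wide and 8 units long
print(rough_r(2, 8))  # 0.75

# The handout's example: r = 0.7
print(f"{coefficient_of_determination(0.7):.0%}")  # 49%

# 0.5 is a placeholder critical value, NOT taken from the real table
print(is_significant(0.7, 0.5))  # True
```

The significance check takes the critical value as an input on purpose: it depends on the sample size and significance level, so it has to come from the table each time.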

Linear Regression: using a line to make predictions with 2-variable data

The regression line is essentially the average of the data. The idea is a line that runs through the center of all the points, as close as possible to every point.

[Figure: Regression Line]

Regression lines are usually written in the form y = mx + b, where x and y mean the predicted values of x and y.

On graphing calculators, the slope and y-intercept of the regression line are easily calculated. These may be given in either the form ax + b or a + bx, depending on the calculator. Many software programs, such as Microsoft Excel, will also compute regression lines. In general we will be using, but not computing, regression lines in this class.

In a regression line:
- Slope (# by x) represents the rate of change in the variables.
- y-intercept (# by itself) is usually the initial value of the second variable.

IMPORTANT: Regression equations will give reasonable (but not exact) estimates for values near the data used in the sample. A regression estimate is essentially a point estimate of a parameter. For values far away from those used in the sample, regression equations are not usually very accurate (and often give silly answers).

Typical Problem: The New York City police noticed that as the number of officers on the street increased, the crime rate decreased. In a certain precinct, there was a linear correlation with the following regression equation:

y = 135 - 7x, where x is the number of officers and y is the number of crimes committed each day.

a. How many crimes can be expected with 12 officers on the street?
b. How many officers would be needed to reduce the number of crimes to 20?
c. How many crimes could be expected if there were 25 officers on the street? (Note that this is a very large number of officers, far more than are involved in either previous problem.) The equation gives a silly answer here (a negative number of crimes) because 25 officers is nowhere close to the numbers used in the original sample.
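The three parts can be worked through with a short Python sketch. It assumes the regression equation reads y = 135 - 7x; the minus sign is inferred from the statement that crime decreases as the number of officers increases. The function and constant names are hypothetical.

```python
INTERCEPT = 135.0  # predicted crimes per day with zero officers
SLOPE = -7.0       # change in crimes per day for each added officer

def predicted_crimes(officers):
    """Evaluate y = 135 - 7x for a given number of officers x."""
    return INTERCEPT + SLOPE * officers

def officers_needed(crimes):
    """Solve y = 135 - 7x for x, given a target number of crimes y."""
    return (crimes - INTERCEPT) / SLOPE

a = predicted_crimes(12)  # 135 - 84 = 51 crimes
b = officers_needed(20)   # 115 / 7, about 16.4, so 17 whole officers
c = predicted_crimes(25)  # 135 - 175 = -40, an impossible value

print(a, round(b, 1), c)
```

Part (c) comes out negative, which shows why a regression equation should not be trusted far outside the range of the original data.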