Elementary Statistics. Scatter Plot, Regression Line, Linear Correlation Coefficient, and Coefficient of Determination



Similar documents
Straightening Data in a Scatterplot Selecting a Good Re-Expression Model

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Regression and Correlation

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b.

Correlation key concepts:

Scatter Plot, Correlation, and Regression on the TI-83/84

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

TIME SERIES ANALYSIS & FORECASTING

Grade level: secondary Subject: mathematics Time required: 45 to 90 minutes

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

2. Simple Linear Regression

HIBBING COMMUNITY COLLEGE COURSE OUTLINE

Homework 11. Part 1. Name: Score: / null

Pennsylvania System of School Assessment

Relationships Between Two Variables: Scatterplots and Correlation

table to see that the probability is (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: = 1.

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

Example: Boats and Manatees

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Chapter 7: Simple linear regression Learning Objectives

Part Three. Cost Behavior Analysis

Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation

Tutorial on Using Excel Solver to Analyze Spin-Lattice Relaxation Time Data

Calculator Notes for the TI-Nspire and TI-Nspire CAS

with functions, expressions and equations which follow in units 3 and 4.

3.3. Solving Polynomial Equations. Introduction. Prerequisites. Learning Outcomes

Simple linear regression

Module 3: Correlation and Covariance

MATH 095, College Prep Mathematics: Unit Coverage Pre-algebra topics (arithmetic skills) offered through BSE (Basic Skills Education)

Univariate Regression

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

Brunswick High School has reinstated a summer math curriculum for students Algebra 1, Geometry, and Algebra 2 for the school year.

Section 3 Part 1. Relationships between two numerical variables

CRLS Mathematics Department Algebra I Curriculum Map/Pacing Guide

3.1 Solving Systems Using Tables and Graphs

Florida Math for College Readiness

Georgia Standards of Excellence Curriculum Map. Mathematics. GSE 8 th Grade

( ) FACTORING. x In this polynomial the only variable in common to all is x.

Algebra 1 Course Information

Data Mining Part 5. Prediction

Algebra 1 If you are okay with that placement then you have no further action to take Algebra 1 Portion of the Math Placement Test

Linear Equations. Find the domain and the range of the following set. {(4,5), (7,8), (-1,3), (3,3), (2,-3)}

Springfield Technical Community College School of Mathematics, Sciences & Engineering Transfer

The importance of graphing the data: Anscombe s regression examples

THE LEAST SQUARES LINE (other names Best-Fit Line or Regression Line )

What are the place values to the left of the decimal point and their associated powers of ten?

Chapter 9 Descriptive Statistics for Bivariate Data

CORRELATION ANALYSIS

Common Core Unit Summary Grades 6 to 8

Using Microsoft Excel to Plot and Analyze Kinetic Data

The Big Picture. Correlation. Scatter Plots. Data

Equations. #1-10 Solve for the variable. Inequalities. 1. Solve the inequality: Solve the inequality: 4 0

CENTRAL TEXAS COLLEGE SYLLABUS FOR DSMA 0306 INTRODUCTORY ALGEBRA. Semester Hours Credit: 3

3.2. Solving quadratic equations. Introduction. Prerequisites. Learning Outcomes. Learning Style

17. SIMPLE LINEAR REGRESSION II

MBA Jump Start Program

Father s height (inches)

Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th

Worksheet A5: Slope Intercept Form

G r a d e 1 0 I n t r o d u c t i o n t o A p p l i e d a n d P r e - C a l c u l u s M a t h e m a t i c s ( 2 0 S ) Final Practice Exam

Regression Analysis: A Complete Example

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Math Review. for the Quantitative Reasoning Measure of the GRE revised General Test

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

PCHS ALGEBRA PLACEMENT TEST

1.2 GRAPHS OF EQUATIONS. Copyright Cengage Learning. All rights reserved.

Unit 3: Day 2: Factoring Polynomial Expressions

LAYOUT OF THE KEYBOARD

Describing Relationships between Two Variables

Dealing with Data in Excel 2010

Algebra and Geometry Review (61 topics, no due date)

Simple Linear Regression Inference

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Exercise 1.12 (Pg )

Algebra II A Final Exam

7.1 Graphs of Quadratic Functions in Vertex Form

A synonym is a word that has the same or almost the same definition of

Calibration and Linear Regression Analysis: A Self-Guided Tutorial

Tennessee Department of Education. Task: Sally s Car Loan

NSM100 Introduction to Algebra Chapter 5 Notes Factoring

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

Manhattan Center for Science and Math High School Mathematics Department Curriculum

Mario Guarracino. Regression

You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week. Week

A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion

Module 5: Statistical Analysis

The Point-Slope Form

THE COST OF COLLEGE EDUCATION PROJECT PACKET

Monday 28 January 2013 Morning

Point Biserial Correlation Tests

Higher Education Math Placement

Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data

ALGEBRA 2: 4.1 Graph Quadratic Functions in Standard Form

The Correlation Coefficient

Diagrams and Graphs of Statistical Data

Simple Regression Theory II 2010 Samuel L. Baker

COWLEY COUNTY COMMUNITY COLLEGE REVIEW GUIDE Compass Algebra Level 2

Econometrics Simple Linear Regression

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Transcription:

Scatter Plot, Regression Line, Linear Correlation Coefficient, and Coefficient of Determination

What is a Scatter Plot? A Scatter Plot is a plot of ordered pairs (x, y) where the horizontal axis is used for the x variable and the vertical axis is used for the y variable. How is Scatter Plot helpful? The pattern of the plotted points in a Scatter Plot will enable us to see whether there is a relationship between the two variables.

Example: The study time and midterm exam score for a random sample of 10 students in an elementary statistics course are shown in the following table. Student A B C D E F G H I J Study Time; x 3 4 4 5 6 6 8 9 10 9 (Hours) Midterm Score; y 57 65 72 74 70 80 85 90 97 92 Draw the scatter plot.

Solution: We plot the ordered-pair (3, 57) for student A, ordered-pair (4, 65) for student B, and so on to draw the Scatter Plot.

What is a Regression Equation? The Regression Equation algebraically describes the best linear relationship between two variables x and y. The Regression Equation is usually written in the following form. ŷ = a + bx What is the Regression Line? The Regression Line is the graph of the Regression Equation.

How do we compute a and b? Compute x, y, and xy. Compute x 2, and y 2. Now we use the formulas below. b = n( xy) ( x)( y) n( x 2 ) ( x) 2 and a = ( y)( x 2 ) ( x)( xy) n( x 2 ) ( x) 2

Example: The study time and midterm exam score for a random sample of 10 students in an elementary statistics course are shown in the following table. Student A B C D E F G H I J Study Time; x 3 4 4 5 6 6 8 9 10 9 (Hours) Midterm Score; y 57 65 72 74 70 80 85 90 97 92 Find the equation of the regression line in which the x variable is the study time, and y variable is the midterm score. Draw the regression line and scatter plot in the same coordinate system.

Solution: We first find and verify that x = 64, y = 782, xy = 5277, x 2 = 464, y 2 = 62632, and then we apply these values in the the formula b = n( xy) ( x)( y) n( x 2 ) ( x) 2 = 10(5277) (64)(782) 10(464) (64) 2 = 2722 544 5.004

Solution Continued: and a = ( y)( x 2 ) ( x)( xy) n( x 2 ) ( x) 2 = (782)(464) (64)(5277) 10(464) (64) 2 = 25120 544 46.176 So the equation of the regression line is ŷ = a + bx = 46.176 + 5.004x

Solution Continued: Here is the graph of the regression line as well as the scatter plot.

What is a Correlation? A Correlation between two variables is when there is an apparent association between the values of one variable with the corresponding values from the other variable. What is the Linear Correlation Coefficient? The Linear Correlation Coefficient is a numerical value that measures the strength of the linear correlation between the paired x and y for all values in the sample. We denote this value by r.

What are the properties of r? 1 r 1 It is not designed to measure the strength of a nonlinear relationship. It is very sensitive and changes value if the sample contains any outliers. The Linear Correlation Coefficient is considered significant when r is fairly close to 1.

How do we compute r? Compute x, y, and xy. Compute x 2, and y 2. Now we use the formula r = n( xy) ( x)( y) n( x 2 ) ( x) 2 n( y 2 ) ( y) 2 It is worth noting that r is usually calculated with a computer software or a calculator.

Example: The study time and midterm exam score for a random sample of 10 students in an elementary statistics course are shown in the following table. Student A B C D E F G H I J Study Time; x 3 4 4 5 6 6 8 9 10 9 (Hours) Midterm Score; y 57 65 72 74 70 80 85 90 97 92 Find the value of linear correlation coefficient r.

Solution: We first find and verify that x = 64, y = 782, xy = 5277, x 2 = 464, y 2 = 62632, and then we apply these values in the the formula r = = = n( xy) ( x)( y) n( x 2 ) ( x) 2 n( y 2 ) ( y) 2 10(5277) (64)(782) 10(464) (64) 2 10(62632) (782) 2 2722 544 14796 0.959

What is the Coefficient of Determination? The Coefficient of Determination is a numerical value usually provided in percentage that indicates what percentage of the dependent variable y is explained by the independent variable x. We denote this value by r 2. How do we compute r 2? We simply square the value of r and then convert it to a percentage by moving the decimal point two places to the right.

Example: The study time and midterm exam score for a random sample of 10 students in an elementary statistics course are shown in the following table. Student A B C D E F G H I J Study Time; x 3 4 4 5 6 6 8 9 10 9 (Hours) Midterm Score; y 57 65 72 74 70 80 85 90 97 92 Find the value of coefficient of determination r 2 and explain what this number describes in the context of this example.

Solution: We have already used this example and found the value of the linear correlation coefficient r. We got r 0.959. Now we square this number to get the coefficient of determination. r 2 = (0.959) 2 = 0.919681 0.920 92.0% So 92.0% of the midterm scores are explained by the study time.