Chapter 5. Regression

Size: px
Start display at page:

Download "Chapter 5. Regression"

Transcription

1 Topics covered in this chapter: Chapter 5. Regression Adding a Regression Line to a Scatterplot Regression Lines and Influential Observations Finding the Least Squares Regression Model Adding a Regression Line to a Scatterplot Example 5.: Does fidgeting keep you slim? The Problem: Why is it that some people find it easy to stay slim? Do people with larger increases in nonexercise activity (NEA) tend to gain less fat? NEA change in calories is the explanatory variable and Fat Gain in kg is the response variable. Make a scatterplot of fat gain and NEA change.. Open the data set eg05-0.por. 2. Construct a scatterplot of the data. Refer to the example in Chapter 4 for the instructions. 3. Add a least squares regression line to the scatterplot: a. Double-click on the graph and the Chart Editor opens up. b. Go to the Elements menu and click on Fit Line at Total. The following screen appears: 50

2 5 Chapter 5 c. Select Linear and click Apply. d. Click the Close button. e. Once everything has been done in the Chart Editor, close the editor by clicking the X at the upper right corner of the screen. The resulting scatterplot with the regression line will be in the output:

3 Regression 52 Regression Lines and Influential Observations Example (Apply Your Knowledge 5.0): Make a Scatterplot for the Data in Exercise 5.4 and Fit Regression Lines to Determine Which New Point is More Influential The Problem: Using the body mass data found in Exercise 5.4, make a scatterplot of the data and add two new points, adding one point at a time. Point A: mass 42 kg, metabolic rate 500 calories. Point B: mass 70 kg, metabolic rate 400 calories.. Prepare the data. a. Open the data set ex05-04.por. b. Click on Transform; scroll down and select Compute Variable. c. Under the Target Variable box, type MassPlusOne. d. Click on Mass and then click the arrow to the right of the box. This moves Mass into the Numeric Expression box. e. Click OK. f. Repeat the same process using target variable RatePlusOne which will be based on Rate. g. Repeat the process again, creating variables MassPlusTwo and RatePlusTwo. h. Now the Data View screen will have six columns of data. i. At the bottom of the RatePlusOne and MassPlusOne columns, add the values for point A. Put 42 in row 5 in the MassPlusOne column and 500 in row 5 of the RatePlusOne column.

4 53 Chapter 5 j. Add the values for point A to row 5 of the RatePlusTwo and MassPlusTwo columns as well. k. Type the values for point B in row 6 of the RatePlusTwo and MassPlusTwo columns. 2. Construct a scatterplot of the data which includes all three data sets: the original 2 women, the original women plus Point A, and the original women plus Point B: a. Click on Graphs, scroll down to Legacy Dialogs and then click on Scatter/Dot. b. Click to select Overlay Scatter. c. Click Define. d. The following screen appears and we have to place each Y-X pair into the Y- X Pairs box.

5 Regression 54 e. Click RatePlusTwo and click on the arrow to the left of the Y-X Pairs box. f. Click MassPlusTwo and click on the arrow to the left of the Y-X Pairs box. g. Click RatePlusOne and click on the arrow to the left of the Y-X Pairs box. h. Click MassPlusOne and click on the arrow to the left of the Y-X Pairs box. i. Click Rate and click on the arrow to the left of the Y-X Pairs box. j. Click Mass and click on the arrow to the left of the Y-X Pairs box. k. If the pairs are in X-Y order, click the swap pair (double arrow) button.

6 55 Chapter 5 l. Click OK. The scatterplot will appear in the output window. Notice that points A and B are a different color from the rest of the points. 3. Follow the steps in Example 5. above to add least squares regression lines to the scatterplot. Finding the Least Squares Regression Model Example (Apply Your Knowledge 5.0) (Continued): Analyze the data in Exercise 5.4 using Linear Regression. Open the data set ex05-04.por. 2. Construct a scatterplot of the data and fit regression line. Refer Chapter 4 and Example 5. to construct a scatterplot with a regression fit line. 3. Analyze the data using regression: a. Go to the Analyze menu, scroll down to Regression, and select Linear. The following screen should appear:

7 Regression 56 b. Put Rate in the Dependent Variable box and Mass in the Independent Variable box. Change any default options that you want, and click OK.

8 57 Chapter 5 The above four boxes will appear in the output window. The least squares regression equation along with r, the correlation coefficient, may be found in the fourth and final box labeled Coefficients. The least squares equation is in the form y ˆ = a + bx. The estimates of a, the y-intercept, and b, the slope, can both be found in the first column labeled B. The y-intercept is in the row labeled (Constant) and the slope is in the row labeled Mass. For this example, the least squares equation is yˆ = x, where ŷ is the predicted value of the metabolic rate and x is the body mass used for the prediction. The value of the correlation coefficient can be found under the column labeled Beta. The correlation coefficient is.876, which indicates a strong, positive linear association between metabolic rate and body mass. Chapter 5 Exercises 5.3 Ebola and gorillas. 5. Outsourcing by airlines Keeping water clean Our brains don t like losses Managing diabetes. 5.4 Managing diabetes, continued. 5.5 Beavers and beetles Saving energy with solar panels.

9 336 Chapter 5 SPSS Solutions 5.3 Open data file ex We can find the mean and standard deviation for both variables using Analyze, Descriptive Statistics, Descriptives. Click to enter both variables, then OK. Descriptive Statistics N Minimum Maximum Mean Std. Deviation Ranges Days Valid N (listwise) 6 To find the correlation, use Analyze, Correlate, Bivariate. Again, click to enter both variables and OK. Correlations Ranges Days Ranges Days Pearson Correlation ** Sig. (2-tailed).002 N Pearson Correlation.962 **.000 Sig. (2-tailed).002 N **. Correlation is significant at the 0.0 level (2-tailed). sy 6.33 To hand calculate the slope, we have b= r =.962 =.263. Similarly, we s x.378 find the intercept as a = y bx = *3.5 = To check this against the software calculation, compute the regression using Analyze, Regression, Linear. Click to enter Days as the Dependent variable and Ranges as the Independent variable. OK performs the calculations. We see our results agree to within rounding. Model Unstandardized Coefficients Coefficients B Std. Error Beta (Constant) t Sig. Ranges a. Dependent Variable: Days

10 The scatterplot was created in the solution to Exercise 4.5. See that solution for details. We ll compute the correlation and regression with all the data points, then delete Hawaiian and recomputed to see its influence. Click Analyze, Regression, Linear. (There is also an option of Analyze, Correlate, Bivariate that computes the correlation but using this would mean you still have to perform the regression.) Click to enter Delay as the Dependent and Outsource as the Independent. OK performs the regression. Model R R Square Adjusted R Square Std. Error of the Estimate.476 a a. Predictors: (Constant), Outsource (Constant) Outsource a. Dependent Variable: Delay The regression equation becomes DelayPct = *OutsourcePct. The correlation is r = Based on this line, an airline with 76% outsourcing should have about *76 = 34.% delays. Now, delete the Hawaiian Airlines data values. Use the same set of commands to find the new equation. Model R R Square Adjusted R Square Std. Error of the Estimate.484 a a. Predictors: (Constant), Outsource

11 338 (Constant) Outsource a. Dependent Variable: Delay Hawaiian was influential for the regression, but not for the correlation. The correlation increased from to The regression equation changed from DelayPct = *OutsourcePct to DelayPct = *OutsourcePct. The equation that included Hawaiian Airlines predicts 34.% delayed flights for the airline with 76% outsourcing. The prediction from the regression that did not include Hawaiian is 29.8% delayed flights a decrease of 4.3% To make a scatterplot of these data, click Graph, Legacy Dialogs, Scatter/Dot. The default is Simple. Click Define to continue on to the plot definition dialog box. Click to Enter Absorb as the Y variable and Nitrates as the X variable. Click Titles to give your graph a title, then Continue and OK to generate the graph. The plot is extremely linear. To find the correlation, use Analyze, Correlate, Bivariate. Click to enter both variable names and OK.

12 339 Correlations Nitrate Absorb Nitrate Absorb Pearson Correlation ** Sig. (2-tailed).000 N Pearson Correlation.000 **.000 Sig. (2-tailed).000 N **. Correlation is significant at the 0.0 level (2-tailed). The correlation is.000, which is perfect; this calibration will not need to be done again. To find the equation that estimates nitrate levels, use Analyze, Regression, Linear. Click to enter Nitrate as the Response and Absorb as the Predictor. Since we also want to predict nitrate level for an absorbence of 40, add a new observation in absorbence of 40 without a Nitrate value (this will be ignored when computing the regression). Now, click Save, and put a check in the Unstandardized Predicted Values box. (Constant) Absorb a. Dependent Variable: Nitrate The estimated nitrate level for an absorbence of 40 is *40 = Predictions should be very accurate due to the strength of the linear relationship. Returning to the data worksheet, we find that the predicted nitrate level with an absorbence of 40 is

13 Open data file ex The scatterplot was created in the solution to Exercise 4.29; see that solution if you need help recreating this. We ll calculate the regression without and with the outlier. Since the outlier seems to follow the pattern of the rest of the data (making the relationship appear stronger), we expect that point to have little impact on the regression equation, but to increase the correlation (and r 2 ). We first calculate with the outlier included using Analyze, Regression, Linear using Behave as the Dependent variable and Neural as the Independent to find the following results. Model R R Square Adjusted R Square Std. Error of the Estimate.849 a a. Predictors: (Constant), Neural (Constant) Neural a. Dependent Variable: Behave Now, delete the outlier (the last observation) and recompute the regression to find the following: Model R R Square Adjusted R Square Std. Error of the Estimate.70 a a. Predictors: (Constant), Neural (Constant) Neural a. Dependent Variable: Behave With the outlier included, the correlation is r = (the square root of 0.72). With the outlier deleted, the correlation has weakened to r = Notice that the outlier has little

14 34 impact on the regression equation: with the point included, we have BehaviorLoss = * NeuralLoss; with the point deleted, the equation is BehaviorLoss = * NeuralLoss Open data file ta Use Graphs, Legacy Dialogs, Scatter/Dot to create the scatterplot, click to enter Fpg as the Y variable and Hba as the X variable. Click Titles to give your graph an appropriate title; Continue and OK generates the graph. The data point furthest to the right (subject 8) and in the top center (subject 5) are outliers. We ll first calculate the regression with all the data values; then remove subject 8 and recompute. Finally, we ll add back subject 8 and remove subject 5. Use Analyze, Regression, Linear to fit the model. Click to enter Fpg as the Dependent variable and Hba as the Independent variable. OK performs the calculations. Model R R Square Adjusted R Square Std. Error of the Estimate.482 a a. Predictors: (Constant), Hba

15 342 (Constant) Hba a. Dependent Variable: Fpg With all the data, we have the equation Fpg = *Hba. The correlation is r = Delete subject 8 in the worksheet, and return to the regression dialog box to recompute. Model R R Square Adjusted R Square Std. Error of the Estimate.384 a a. Predictors: (Constant), Hba (Constant) Hba a. Dependent Variable: Fpg With subject 8 removed, we have Fpg = *Hba. The correlation has lowered to r = Finally, add subject 8 (9.3, 255) back in and delete subject 5 (move the cursor to each value and press the delete key). Model R R Square Adjusted R Square Std. Error of the Estimate.568 a a. Predictors: (Constant), Hba Model Unstandardized Coefficients Coefficients B Std. Error Beta (Constant) t Sig. Hba a. Dependent Variable: Fpg

16 343 With all but subject 5, we have Fpg = *Hba and the correlation has increased to r = Deleting subject 8 weakened the relationship. This is because this data point was far to the right in the graph (extreme in its x value); visually, one s eyes follow through to points like these they are influential in fitting a model. Subject 5 was relatively in the middle of the x range but was clearly unusually high in his/her Fpg level. This type of point really adds only scatter to a plot, so deleting this increased the correlation. 5.4 We can recreate the scatterplot of all the data and add the three regression lines into the plot using the Chart Editor. If you have just completed Exercise 5.39, add subject 5 s data back into the worksheet; otherwise, open data file ta Create the scatterplot as was done in Exercise Double-click in the graph to bring up the Chart Editor. Click Elements, Fit Line at Total to add the regression line for the entire data set. This also brings up a Properties Box. Close this box. To add the other two lines, click Options, Reference Line from Equation. Type one of the equations into the Custom Equation box, then Apply and Close. Repeat this to add the last equation. Close the Chart Editor.

17 344 The separation between the lines becomes apparent at the right side of the plot. This is due to the influence of Subject 8 at the far right. 5.5 Open data file ex05-5. First, create a scatterplot of the data using Graphs, Scatter/Dot. Enter Stumps as the X variable and Larvae as the Y variable. Give your graph an appropriate title using Titles. We see that these data indicate that there are more beetle larvae with more stumps. Use Analyze, Regression, Linear to fit the line using Stumps as the Independent and Larvae as the Dependent. Model R R Square Adjusted R Square Std. Error of the Estimate.98 a a. Predictors: (Constant), Stumps

18 345 (Constant) stumps a. Dependent Variable: larvae The regression equation is Larvae = * Stumps. The relationship is strong; the regression model explains 84.3% of the variability in larvae (the correlation is r = r =.843 =0.98). These data support the beavers benefit beetles idea Open data for ex Since the data in this file have all the observations in each column with the before and after indicators in the variable named Data, we ll modify the regression dialog box input to fit each line separately. Click Analyze, Regression, Linear. Enter DD as the Independent and Gas as the Dependent variables. Now, Click to enter Data in the Selection Variable box, then Rule. Type in before, then Continue and OK.

19 346 Model Data = before (Selected) R Square R Adjusted R Square Std. Error of the Estimate.995 a a. Predictors: (Constant), DD,b (Constant) DD a. Dependent Variable: Gas b. Selecting only cases for which Data = before The regression equation before the solar panels is Gas = *DD. The relationship is strong r = This regression predicts *45 = hundred cubic feet of gas for a 45 degree-day day. Now, repeat the regression, selecting the after data. R Data = after Adjusted R Std. Error of the Model (Selected) R Square Square Estimate.99 a a. Predictors: (Constant), DD,b (Constant) DD a. Dependent Variable: Gas b. Selecting only cases for which Data = after The after regression equation is Gas = *DD. This is also a very strong relationship; we have r = This regression predicts *45 = 7.98 hundred cubic feet of gas for a 45 degree-day day. The solar panels saved about 60 cubic feet of gas usage.

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Simple Linear Regression, Scatterplots, and Bivariate Correlation 1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.

More information

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices: Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Formula for linear models. Prediction, extrapolation, significance test against zero slope. Formula for linear models. Prediction, extrapolation, significance test against zero slope. Last time, we looked the linear regression formula. It s the line that fits the data best. The Pearson correlation

More information

5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.

5. Correlation. Open HeightWeight.sav. Take a moment to review the data file. 5. Correlation Objectives Calculate correlations Calculate correlations for subgroups using split file Create scatterplots with lines of best fit for subgroups and multiple correlations Correlation The

More information

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

There are six different windows that can be opened when using SPSS. The following will give a description of each of them. SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet

More information

PERFORMING REGRESSION ANALYSIS USING MICROSOFT EXCEL

PERFORMING REGRESSION ANALYSIS USING MICROSOFT EXCEL PERFORMING REGRESSION ANALYSIS USING MICROSOFT EXCEL John O. Mason, Ph.D., CPA Professor of Accountancy Culverhouse School of Accountancy The University of Alabama Abstract: This paper introduces you to

More information

WEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y 7.1 1 8.92 X 10.0 0.0 16.0 10.0 1.6

WEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y 7.1 1 8.92 X 10.0 0.0 16.0 10.0 1.6 WEB APPENDIX 8A Calculating Beta Coefficients The CAPM is an ex ante model, which means that all of the variables represent before-thefact, expected values. In particular, the beta coefficient used in

More information

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

Activity 6 Graphing Linear Equations

Activity 6 Graphing Linear Equations Activity 6 Graphing Linear Equations TEACHER NOTES Topic Area: Algebra NCTM Standard: Represent and analyze mathematical situations and structures using algebraic symbols Objective: The student will be

More information

Tutorial for the TI-89 Titanium Calculator

Tutorial for the TI-89 Titanium Calculator SI Physics Tutorial for the TI-89 Titanium Calculator Using Scientific Notation on a TI-89 Titanium calculator From Home, press the Mode button, then scroll down to Exponential Format. Select Scientific.

More information

Calibration and Linear Regression Analysis: A Self-Guided Tutorial

Calibration and Linear Regression Analysis: A Self-Guided Tutorial Calibration and Linear Regression Analysis: A Self-Guided Tutorial Part 1 Instrumental Analysis with Excel: The Basics CHM314 Instrumental Analysis Department of Chemistry, University of Toronto Dr. D.

More information

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red. Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red. 1. How to display English messages from IBM SPSS Statistics

More information

Absorbance Spectrophotometry: Analysis of FD&C Red Food Dye #40 Calibration Curve Procedure

Absorbance Spectrophotometry: Analysis of FD&C Red Food Dye #40 Calibration Curve Procedure Absorbance Spectrophotometry: Analysis of FD&C Red Food Dye #40 Calibration Curve Procedure Note: there is a second document that goes with this one! 2046 - Absorbance Spectrophotometry. Make sure you

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different)

Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different) Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different) Spreadsheets are computer programs that allow the user to enter and manipulate numbers. They are capable

More information

Multiple Regression. Page 24

Multiple Regression. Page 24 Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)

More information

Forecasting in STATA: Tools and Tricks

Forecasting in STATA: Tools and Tricks Forecasting in STATA: Tools and Tricks Introduction This manual is intended to be a reference guide for time series forecasting in STATA. It will be updated periodically during the semester, and will be

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Using Microsoft Excel to Plot and Analyze Kinetic Data

Using Microsoft Excel to Plot and Analyze Kinetic Data Entering and Formatting Data Using Microsoft Excel to Plot and Analyze Kinetic Data Open Excel. Set up the spreadsheet page (Sheet 1) so that anyone who reads it will understand the page (Figure 1). Type

More information

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

More information

Scatter Plot, Correlation, and Regression on the TI-83/84

Scatter Plot, Correlation, and Regression on the TI-83/84 Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page

More information

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate 1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,

More information

Correlation and Regression Analysis: SPSS

Correlation and Regression Analysis: SPSS Correlation and Regression Analysis: SPSS Bivariate Analysis: Cyberloafing Predicted from Personality and Age These days many employees, during work hours, spend time on the Internet doing personal things,

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

Regression Clustering

Regression Clustering Chapter 449 Introduction This algorithm provides for clustering in the multiple regression setting in which you have a dependent variable Y and one or more independent variables, the X s. The algorithm

More information

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your

More information

SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard

SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard p. 1 SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard The following tutorial is a guide to some basic procedures in SPSS that will be useful as you complete your data assignments for PPA 722. The purpose

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

A Guide to Using Excel in Physics Lab

A Guide to Using Excel in Physics Lab A Guide to Using Excel in Physics Lab Excel has the potential to be a very useful program that will save you lots of time. Excel is especially useful for making repetitious calculations on large data sets.

More information

The importance of graphing the data: Anscombe s regression examples

The importance of graphing the data: Anscombe s regression examples The importance of graphing the data: Anscombe s regression examples Bruce Weaver Northern Health Research Conference Nipissing University, North Bay May 30-31, 2008 B. Weaver, NHRC 2008 1 The Objective

More information

Assignment objectives:

Assignment objectives: Assignment objectives: Regression Pivot table Exercise #1- Simple Linear Regression Often the relationship between two variables, Y and X, can be adequately represented by a simple linear equation of the

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Statistics: Correlation Richard Buxton. 2008. 1 Introduction We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Do

More information

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs Using Excel Jeffrey L. Rummel Emory University Goizueta Business School BBA Seminar Jeffrey L. Rummel BBA Seminar 1 / 54 Excel Calculations of Descriptive Statistics Single Variable Graphs Relationships

More information

Lin s Concordance Correlation Coefficient

Lin s Concordance Correlation Coefficient NSS Statistical Software NSS.com hapter 30 Lin s oncordance orrelation oefficient Introduction This procedure calculates Lin s concordance correlation coefficient ( ) from a set of bivariate data. The

More information

An SPSS companion book. Basic Practice of Statistics

An SPSS companion book. Basic Practice of Statistics An SPSS companion book to Basic Practice of Statistics SPSS is owned by IBM. 6 th Edition. Basic Practice of Statistics 6 th Edition by David S. Moore, William I. Notz, Michael A. Flinger. Published by

More information

Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540

Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540 To complete this technology assignment, you should already have created a scatter plot for your data on your calculator and/or in Excel. You could do this with any two columns of data, but for demonstration

More information

Lab 1: The metric system measurement of length and weight

Lab 1: The metric system measurement of length and weight Lab 1: The metric system measurement of length and weight Introduction The scientific community and the majority of nations throughout the world use the metric system to record quantities such as length,

More information

Copyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7

Copyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7 Using Your TI-83/84/89 Calculator: Linear Correlation and Regression Dr. Laura Schultz Statistics I This handout describes how to use your calculator for various linear correlation and regression applications.

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the

More information

Dealing with Data in Excel 2010

Dealing with Data in Excel 2010 Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing

More information

January 26, 2009 The Faculty Center for Teaching and Learning

January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Regression and Correlation

Regression and Correlation Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis

More information

SPSS Manual for Introductory Applied Statistics: A Variable Approach

SPSS Manual for Introductory Applied Statistics: A Variable Approach SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All

More information

A Basic Guide to Analyzing Individual Scores Data with SPSS

A Basic Guide to Analyzing Individual Scores Data with SPSS A Basic Guide to Analyzing Individual Scores Data with SPSS Step 1. Clean the data file Open the Excel file with your data. You may get the following message: If you get this message, click yes. Delete

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

USING EXCEL ON THE COMPUTER TO FIND THE MEAN AND STANDARD DEVIATION AND TO DO LINEAR REGRESSION ANALYSIS AND GRAPHING TABLE OF CONTENTS

USING EXCEL ON THE COMPUTER TO FIND THE MEAN AND STANDARD DEVIATION AND TO DO LINEAR REGRESSION ANALYSIS AND GRAPHING TABLE OF CONTENTS USING EXCEL ON THE COMPUTER TO FIND THE MEAN AND STANDARD DEVIATION AND TO DO LINEAR REGRESSION ANALYSIS AND GRAPHING Dr. Susan Petro TABLE OF CONTENTS Topic Page number 1. On following directions 2 2.

More information

Table of Contents. Preface

Table of Contents. Preface Table of Contents Preface Chapter 1: Introduction 1-1 Opening an SPSS Data File... 2 1-2 Viewing the SPSS Screens... 3 o Data View o Variable View o Output View 1-3 Reading Non-SPSS Files... 6 o Convert

More information

Scatter Plots with Error Bars

Scatter Plots with Error Bars Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each

More information

Introduction Course in SPSS - Evening 1

Introduction Course in SPSS - Evening 1 ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/

More information

Microsoft Excel. Qi Wei

Microsoft Excel. Qi Wei Microsoft Excel Qi Wei Excel (Microsoft Office Excel) is a spreadsheet application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot

More information

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

Directions for Frequency Tables, Histograms, and Frequency Bar Charts Directions for Frequency Tables, Histograms, and Frequency Bar Charts Frequency Distribution Quantitative Ungrouped Data Dataset: Frequency_Distributions_Graphs-Quantitative.sav 1. Open the dataset containing

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

hp calculators HP 50g Trend Lines The STAT menu Trend Lines Practice predicting the future using trend lines

hp calculators HP 50g Trend Lines The STAT menu Trend Lines Practice predicting the future using trend lines The STAT menu Trend Lines Practice predicting the future using trend lines The STAT menu The Statistics menu is accessed from the ORANGE shifted function of the 5 key by pressing Ù. When pressed, a CHOOSE

More information

Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Excel Tutorial. Bio 150B Excel Tutorial 1

Excel Tutorial. Bio 150B Excel Tutorial 1 Bio 15B Excel Tutorial 1 Excel Tutorial As part of your laboratory write-ups and reports during this semester you will be required to collect and present data in an appropriate format. To organize and

More information

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5 Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression

More information

Data analysis and regression in Stata

Data analysis and regression in Stata Data analysis and regression in Stata This handout shows how the weekly beer sales series might be analyzed with Stata (the software package now used for teaching stats at Kellogg), for purposes of comparing

More information

Introduction to SPSS 16.0

Introduction to SPSS 16.0 Introduction to SPSS 16.0 Edited by Emily Blumenthal Center for Social Science Computation and Research 110 Savery Hall University of Washington Seattle, WA 98195 USA (206) 543-8110 November 2010 http://julius.csscr.washington.edu/pdf/spss.pdf

More information

Curve Fitting in Microsoft Excel By William Lee

Curve Fitting in Microsoft Excel By William Lee Curve Fitting in Microsoft Excel By William Lee This document is here to guide you through the steps needed to do curve fitting in Microsoft Excel using the least-squares method. In mathematical equations

More information

Figure 1. An embedded chart on a worksheet.

Figure 1. An embedded chart on a worksheet. 8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the

More information

0 Introduction to Data Analysis Using an Excel Spreadsheet

0 Introduction to Data Analysis Using an Excel Spreadsheet Experiment 0 Introduction to Data Analysis Using an Excel Spreadsheet I. Purpose The purpose of this introductory lab is to teach you a few basic things about how to use an EXCEL 2010 spreadsheet to do

More information

1.1. Simple Regression in Excel (Excel 2010).

1.1. Simple Regression in Excel (Excel 2010). .. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under

More information

Summary of important mathematical operations and formulas (from first tutorial):

Summary of important mathematical operations and formulas (from first tutorial): EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a

More information

A full analysis example Multiple correlations Partial correlations

A full analysis example Multiple correlations Partial correlations A full analysis example Multiple correlations Partial correlations New Dataset: Confidence This is a dataset taken of the confidence scales of 41 employees some years ago using 4 facets of confidence (Physical,

More information

Using Excel for Statistical Analysis

Using Excel for Statistical Analysis Using Excel for Statistical Analysis You don t have to have a fancy pants statistics package to do many statistical functions. Excel can perform several statistical tests and analyses. First, make sure

More information

Estimating a market model: Step-by-step Prepared by Pamela Peterson Drake Florida Atlantic University

Estimating a market model: Step-by-step Prepared by Pamela Peterson Drake Florida Atlantic University Estimating a market model: Step-by-step Prepared by Pamela Peterson Drake Florida Atlantic University The purpose of this document is to guide you through the process of estimating a market model for the

More information

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS

More information

Data Analysis for Marketing Research - Using SPSS

Data Analysis for Marketing Research - Using SPSS North South University, School of Business MKT 63 Marketing Research Instructor: Mahmood Hussain, PhD Data Analysis for Marketing Research - Using SPSS Introduction In this part of the class, we will learn

More information

Microsoft Excel Tutorial

Microsoft Excel Tutorial Microsoft Excel Tutorial by Dr. James E. Parks Department of Physics and Astronomy 401 Nielsen Physics Building The University of Tennessee Knoxville, Tennessee 37996-1200 Copyright August, 2000 by James

More information

Activity 5. Two Hot, Two Cold. Introduction. Equipment Required. Collecting the Data

Activity 5. Two Hot, Two Cold. Introduction. Equipment Required. Collecting the Data . Activity 5 Two Hot, Two Cold How do we measure temperatures? In almost all countries of the world, the Celsius scale (formerly called the centigrade scale) is used in everyday life and in science and

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

FREE FALL. Introduction. Reference Young and Freedman, University Physics, 12 th Edition: Chapter 2, section 2.5

FREE FALL. Introduction. Reference Young and Freedman, University Physics, 12 th Edition: Chapter 2, section 2.5 Physics 161 FREE FALL Introduction This experiment is designed to study the motion of an object that is accelerated by the force of gravity. It also serves as an introduction to the data analysis capabilities

More information

When to use Excel. When NOT to use Excel 9/24/2014

When to use Excel. When NOT to use Excel 9/24/2014 Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual

More information

Interactive Excel Spreadsheets:

Interactive Excel Spreadsheets: Interactive Excel Spreadsheets: Constructing Visualization Tools to Enhance Your Learner-centered Math and Science Classroom Scott A. Sinex Department of Physical Sciences and Engineering Prince George

More information

Statgraphics Getting started

Statgraphics Getting started Statgraphics Getting started The aim of this exercise is to introduce you to some of the basic features of the Statgraphics software. Starting Statgraphics 1. Log in to your PC, using the usual procedure

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

Chapter 9 Descriptive Statistics for Bivariate Data

Chapter 9 Descriptive Statistics for Bivariate Data 9.1 Introduction 215 Chapter 9 Descriptive Statistics for Bivariate Data 9.1 Introduction We discussed univariate data description (methods used to eplore the distribution of the values of a single variable)

More information

How to make a line graph using Excel 2007

How to make a line graph using Excel 2007 How to make a line graph using Excel 2007 Format your data sheet Make sure you have a title and each column of data has a title. If you are entering data by hand, use time or the independent variable in

More information

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS. SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed

More information

Can SAS Enterprise Guide do all of that, with no programming required? Yes, it can.

Can SAS Enterprise Guide do all of that, with no programming required? Yes, it can. SAS Enterprise Guide for Educational Researchers: Data Import to Publication without Programming AnnMaria De Mars, University of Southern California, Los Angeles, CA ABSTRACT In this workshop, participants

More information

SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list

More information

SPSS Notes (SPSS version 15.0)

SPSS Notes (SPSS version 15.0) SPSS Notes (SPSS version 15.0) Annie Herbert Salford Royal Hospitals NHS Trust July 2008 Contents Page Getting Started 1 1 Opening SPSS 1 2 Layout of SPSS 2 2.1 Windows 2 2.2 Saving Files 3 3 Creating

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

Working with SQL Server Integration Services

Working with SQL Server Integration Services SQL Server Integration Services (SSIS) is a set of tools that let you transfer data to and from SQL Server 2005. In this lab, you ll work with the SQL Server Business Intelligence Development Studio to

More information

Microsoft Office Access 2007 Basics

Microsoft Office Access 2007 Basics Access(ing) A Database Project PRESENTED BY THE TECHNOLOGY TRAINERS OF THE MONROE COUNTY LIBRARY SYSTEM EMAIL: TRAININGLAB@MONROE.LIB.MI.US MONROE COUNTY LIBRARY SYSTEM 734-241-5770 1 840 SOUTH ROESSLER

More information

Calculator Notes for the TI-Nspire and TI-Nspire CAS

Calculator Notes for the TI-Nspire and TI-Nspire CAS CHAPTER 11 Calculator Notes for the Note 11A: Entering e In any application, press u to display the value e. Press. after you press u to display the value of e without an exponent. Note 11B: Normal Graphs

More information

How To Use Spss

How To Use Spss 1: Introduction to SPSS Objectives Learn about SPSS Open SPSS Review the layout of SPSS Become familiar with Menus and Icons Exit SPSS What is SPSS? SPSS is a Windows based program that can be used to

More information

The North Carolina Health Data Explorer

The North Carolina Health Data Explorer 1 The North Carolina Health Data Explorer The Health Data Explorer provides access to health data for North Carolina counties in an interactive, user-friendly atlas of maps, tables, and charts. It allows

More information

An introduction to IBM SPSS Statistics

An introduction to IBM SPSS Statistics An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive

More information