5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.



Similar documents
Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

10. Comparing Means Using Repeated Measures ANOVA

4. Descriptive Statistics: Measures of Variability and Central Tendency

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

SPSS Explore procedure

Data exploration with Microsoft Excel: analysing more than one variable

SPSS Manual for Introductory Applied Statistics: A Variable Approach

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

2: Entering Data. Open SPSS and follow along as your read this description.

How To Use Spss

January 26, 2009 The Faculty Center for Teaching and Learning

containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics.

This activity will show you how to draw graphs of algebraic functions in Excel.

Scatter Plots with Error Bars

The Dummy s Guide to Data Analysis Using SPSS

Statgraphics Getting started

Introduction Course in SPSS - Evening 1

Gestation Period as a function of Lifespan

Directions for using SPSS

When to use Excel. When NOT to use Excel 9/24/2014

An introduction to IBM SPSS Statistics

Describing, Exploring, and Comparing Data

Excel -- Creating Charts

Linear Models in STATA and ANOVA

CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08

Main Effects and Interactions

Updates to Graphing with Excel

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Chapter 23. Inferences for Regression

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

SAS Analyst for Windows Tutorial

Data analysis and regression in Stata

Excel Tutorial. Bio 150B Excel Tutorial 1

PURPOSE OF GRAPHS YOU ARE ABOUT TO BUILD. To explore for a relationship between the categories of two discrete variables

Using SPSS, Chapter 2: Descriptive Statistics

Petrel TIPS&TRICKS from SCM

How To Run Statistical Tests in Excel

Excel Charts & Graphs

Microsoft Excel 2010 Charts and Graphs

To Begin Customize Office

SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard

Psyc 250 Statistics & Experimental Design. Correlation Exercise

7. Comparing Means Using t-tests.

SPSS Introduction. Yi Li

SPSS Workbook 1 Data Entry : Questionnaire Data

Homework 11. Part 1. Name: Score: / null

Microsoft Word 2011: Create a Table of Contents

INTRODUCTORY LAB: DOING STATISTICS WITH SPSS 21

Scientific Graphing in Excel 2010

SPSS Step-by-Step Tutorial: Part 1

Factor Analysis Using SPSS

Data analysis process

Chapter 5 Analysis of variance SPSS Analysis of variance

Client Marketing: Sets

MetroBoston DataCommon Training

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Dealing with Data in Excel 2010

Modifying Colors and Symbols in ArcMap

Microsoft Excel 2013 Tutorial

SPSS for Simple Analysis

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

Simple Tricks for Using SPSS for Windows

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

Appendix III: SPSS Preliminary

Graphing in excel on the Mac

Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

8. Comparing Means Using One Way ANOVA

Chapter 2: Descriptive Statistics

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Charting LibQUAL+(TM) Data. Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University

SPSS Guide: Regression Analysis

Introduction to SPSS 16.0

Getting Started with Excel Table of Contents

ITS Training Class Charts and PivotTables Using Excel 2007

Psych. Research 1 Guide to SPSS 11.0

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

PowerPoint Tables / Charts / Graphs

SAS/GRAPH 9.2 ODS Graphics Editor. User s Guide

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

An SPSS companion book. Basic Practice of Statistics

Using Excel for inferential statistics

Creating Graphs. Learning Objective-To create graphs that show data.

EHR: Scavenger Hunt I

Module 3: Correlation and Covariance

SPSS Tests for Versions 9 to 13

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise #5 Analysis of Time of Death Data for Soldiers in Vietnam

USING SPSS FOR DESCRIPTIVE ANALYSIS

Intermediate PowerPoint

General instructions for the content of all StatTools assignments and the use of StatTools:

Creating an Excel XY (Scatter) Plot

Creating tables of contents and figures in Word 2013

CONSTRUCTING SINGLE-SUBJECT REVERSAL DESIGN GRAPHS USING MICROSOFT WORD : A COMPREHENSIVE TUTORIAL

Using Excel in Research. Hui Bian Office for Faculty Excellence

McAfee Endpoint Encryption Reporting Tool

Data Analysis Tools. Tools for Summarizing Data

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16

A Picture Really Is Worth a Thousand Words

Transcription:

5. Correlation Objectives Calculate correlations Calculate correlations for subgroups using split file Create scatterplots with lines of best fit for subgroups and multiple correlations Correlation The first inferential statistic we will focus on is correlation. As noted in the text, correlation is used to test the degree of association between variables. All of the inferential statistics commands in SPSS are accessed from the Analyze menu. Let s open SPSS and replicate the correlation between height and weight presented in the text. Open HeightWeight.sav. Take a moment to review the data file. Under Analyze, select Correlate/Bivariate. Bivariate means we are examining the simple association between 2 variables. In the dialog box, select height and weight for Variables. Select Pearson for Correlation Coefficients since the data are continuous. The default for Tests of Significance is Two-tailed. You could change it to Onetailed if you have a directional hypothesis. Selecting Flag significant correlations means that the significant correlations will be noted in the output by asterisks. This is a nice feature. Then click Options.

ow you can see how descriptive statistics are built into other menus. Select Means and standard deviations under Statistics. Missing Values are important. In large data sets, pieces of data are often missing for some variables. For example I may run correlations between height, weight, and blood pressure. One subject may be missing blood pressure data. If I check Exclude cases listwise, SPSS will not include that person s data in the correlation between height and weight, even though those data are not missing. If I check Exclude cases pairwise, SPSS will include that person s data to calculate any correlations that do not involved blood pressure. In this case, the person s data would still be reflected in the correlation between height and weight. You have to decide whether or not you want to exclude cases that are missing any data from all analyses. (ormally it is much safer to go with listwise deletion, even though it will reduce your sample size.) In this case, it doesn t matter because there are no missing data. Click Continue. When you return to the previous dialog box, click Ok. The output follow. Correlations Descriptive Statistics HEIGHT WEIGHT Mean Std. Dev iation 68.72 3.66 92 145.15 23.74 92 HEIGHT WEIGHT Correlations **. Correlation is signif icant at the 0.01 lev el (2-tailed). HEIGHT WEIGHT 1.000.785**..000 92 92.785** 1.000.000. 92 92 otice, the correlation coefficient is.785 and is statistically significant, just as reported in the text. In the text, Howell made the point that heterogeneous samples affect correlation coefficients. In this example, we included both males and females. Let s examine the correlation separately for males and females as was done in the text.

Subgroup Correlations We need to get SPSS to calculate the correlation between height and weight separately for males and females. The easiest way to do this is to split our data file by sex. Let s try this together. In the Data Editor window, select Data/Split file. Select Organize output by groups and Groups Based on Gender. This means that any analyses you specify will be run separately for males and females. Then, click Ok. otice that the order of the data file has been changed. It is now sorted by Gender, with males at the top of the file. ow, select Analyze/Correlation/Bivariate. The same variables and options you selected last time are still in the dialog box. Take a moment to check to see for yourself. Then, click Ok. The output follow broken down by males and females. Correlations SEX = Male HEIGHT WEIGHT a. SEX = Male Descriptive Statistics a Mean Std. Dev iation 70.75 2.58 57 158.26 18.64 57

SEX = Female HEIGHT WEIGHT a. SEX = Female Descriptive Statistics a Mean Std. Dev iation 65.40 2.56 35 123.80 13.37 35 As before, our results replicate those in the text. The correlation between height and weight is stronger for males than females. ow let s see if we can create a more complicated scatterplot that illustrates the pattern of correlation for males and females on one graph. First, we need to turn off split file. Select Data/Split file from the Data Editor window. Then select Analyze all cases, do not compare groups and click Ok. ow, we can proceed. Scatterplots of Data by Subgroups Select Graphs/Legacy/Scatter. Then, select Simple and click Define.

To be consistent with the graph in the text book, select weight as the Y Axis and height as the X Axis. Then, select sex for Set Markers by. This means SPSS will distinguish the males dots from the female dots on the graph. Then, click Ok. When your graph appears, you will see that the only way males and females are distinct from one another is by color. This distinction may not show up well, so let s edit the graph. Double click the graph to activate the Chart Editor. Then double click on one of the female dots on the plot. SPSS will highlight them. (I often have trouble with this. If it selects all the points, click again on a female one. That should do it.) Then click the Marker menu. Select Click on the Chart/Options. circle under Marker Type and chose a Fill color. Then click Apply. Then click on the male dots, and select the open circle in Marker Type and click Apply. Then, close the dialog box. The resulting graph should look just like the one in the textbook. I would like to alter our graph to include the line of best fit for both groups. Under Elements, select Fit Line at Subgroups. Then select Linear and click Continue. (I had to select something else and then go back to Linear to highlight the Apply button.) The resulting graph follows. I think it looks pretty good.

Edit the graph to suit your style as you learned in Chapter 3 (e.g., add a title, change the axes titles and legend). This more complex scatterplot nicely illustrates the difference in the correlation between height and weight for males and females. Let s move on to a more complicated example. Overlay Scatterplots Another kind of scatterplot that might be useful is one that displays the association between different independent variables with the same dependant variable. Above, we compared the same correlation for different groups. This time, we want to compare different correlations. Let s use the course evaluation example from the text. It looks like expected grade is more strongly related to ratings of fairness of the exam than ratings of instructor knowledge is related to the exam. I d like to plot both correlations. I can reasonably plot them on the same graph since all of the questions were rated on the same scale. Open courseevaluation.sav. You do not need to save HeightWeight.sav since you did not change it. So click o.

First, let s make sure the correlations reported in the text are accurate. Click Analyze/Correlation/Bivariate and select all of the variables. Click Ok. The output follow. Do they agree with the text? OVERALL TEACH EXAM KOWLEDG GRADE EROLL **. Correlation is signif icant at the 0.01 lev el (2-tailed). *. Correlation is signif icant at the 0.05 lev el (2-tailed). Correlations OVERALL TEACH EXAM KOWLEDG GRADE EROLL 1.000.804**.596**.682**.301* -.240..000.000.000.034.094.804** 1.000.720**.526**.469** -.451**.000..000.000.001.001.596**.720** 1.000.451**.610** -.558**.000.000..001.000.000.682**.526**.451** 1.000.224 -.128.000.000.001..118.376.301*.469**.610**.224 1.000 -.337*.034.001.000.118..017 -.240 -.451** -.558** -.128 -.337* 1.000.094.001.000.376.017. ow, let s make our scatterplot. Select Graphs/Legacy/Scatter. Then select Overlay and click Define. Click on exam and grade and shift them into Y-X Pairs. Then click on exam and knowledge and click them into Y-X pairs. Since exam is the commonality between both pairs, I d like it to be on the Y axis. If it is not listed as Y, highlight the pair and click on the two-headed arrow. It will reverse the ordering. Exam should then appear first for both. Then, click Ok.

As in the previous example, the dots are distinguished by color. Double click the graph and use the Marker icon to make them more distinct as you learned above. Also use the Elements menu to Fit line at total. It will draw a line for each set of data. ote that the axes are not labeled. You could label the Y Axis Grade. But you could not label the X axis because it represents two different variables-exam and knowledge. That is why the legend is necessary. (If you figure out how to label that axis, please let me know. It should be so easy.) As you can see, the association between expected grade and fairness of the exam is stronger than the correlation between instructor s knowledge and the fairness of the exam. ow, you should have the tools necessary to calculate Person Correlations and to create various scatterplots that compliment those correlations. Complete the following exercises to help you internalize these steps. Exercises Exercises 1 through 3 are based on appendixd.sav. 1. Calculate the correlations between Add symptoms, IQ, GPA, and English grade twice, once using a one-tailed test and once using a two-tailed test. Does this make a difference? Typically, when would this make a difference.

2. Calculate the same correlations separately for those who did and did not drop out, using a two-tailed test. Are they similar or different? 3. Create a scatterplot illustrating the correlation between IQ score and GPA for those who did and did not drop out. Be sure to include the line of best fit for each group. 4. Open courseevaluation.sav. Create a scatterplot for fairness of exams and teacher skills and exam and instructor knowledge on one graph. Be sure to include the lines of best fit. Describe your graph.