Statistical Control using Partial and Semi-partial (Part) Correlations



Similar documents
Comparing a Multiple Regression Model Across Groups

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

SPSS Guide: Regression Analysis

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Multiple Regression. Page 24

Power Analysis for Correlation & Multiple Regression

Moderator and Mediator Analysis

INTRODUCTION TO MULTIPLE CORRELATION

Simple Linear Regression, Scatterplots, and Bivariate Correlation

A full analysis example Multiple correlations Partial correlations

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

SPSS Explore procedure

Eight things you need to know about interpreting correlations:

Using Excel for Statistical Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Simple linear regression

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

STUDENT ATTITUDES TOWARD WEB-BASED COURSE MANAGEMENT SYSTEM FEATURES

An Introduction to Path Analysis. nach 3

Correlational Research

Correlation and Regression Analysis: SPSS

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

The Dummy s Guide to Data Analysis Using SPSS

Binary Logistic Regression

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Chapter 7: Simple linear regression Learning Objectives

Validity of Selection Criteria in Predicting MBA Success

Multiple Regression: What Is It?

Linear functions Increasing Linear Functions. Decreasing Linear Functions

Reliability Analysis

Univariate Regression

Linear Models in STATA and ANOVA

5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.

Application of Predictive Model for Elementary Students with Special Needs in New Era University

Example of Including Nonlinear Components in Regression

Factor Analysis - SPSS

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Introduction to Linear Regression

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models

Chapter 23. Inferences for Regression

Independent samples t-test. Dr. Tom Pierce Radford University

Introduction to proc glm

January 26, 2009 The Faculty Center for Teaching and Learning

Advice for Applying to Grad School in Economics

GMAC. Predicting Success in Graduate Management Doctoral Programs

Introduction to Linear Regression

Moderation. Moderation

Didacticiel - Études de cas

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

USING SPSS FOR DESCRIPTIVE ANALYSIS

Title: Lending Club Interest Rates are closely linked with FICO scores and Loan Length

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Pearson s Correlation

CALCULATIONS & STATISTICS

Solving Linear Equations - General Equations

How to Get More Value from Your Survey Data

To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.

Module 3: Correlation and Covariance

Module 5: Multiple Regression Analysis

Updates to Graphing with Excel

Multiple Regression Using SPSS

Chapter 7. Comparing Means in SPSS (t-tests) Compare Means analyses. Specifically, we demonstrate procedures for running Dependent-Sample (or

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Chapter 7 Factor Analysis SPSS

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Father s height (inches)

II. DISTRIBUTIONS distribution normal distribution. standard scores

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING

Multicollinearity Richard Williams, University of Notre Dame, Last revised January 13, 2015

Test Bias. As we have seen, psychological tests can be well-conceived and well-constructed, but

Premaster Statistics Tutorial 4 Full solutions

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared.

4. Descriptive Statistics: Measures of Variability and Central Tendency

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

SPSS Guide How-to, Tips, Tricks & Statistical Techniques

Adverse Impact and Test Validation Book Series: Multiple Regression. Introduction. Comparison of Compensation using

Nonparametric statistics and model selection

Mixed 2 x 3 ANOVA. Notes

Homework 8 Solutions

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Module 5: Statistical Analysis

Statistics. Measurement. Scales of Measurement 7/18/2012

The University of Texas at Austin School of Social Work SOCIAL WORK STATISTICS

Chapter 2 Quantitative, Qualitative, and Mixed Research

The 3-Level HLM Model

Chapter One Love Is the Foundation. For Group Discussion. Notes

Chapter 15. Mixed Models Overview. A flexible approach to correlated data.

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

RESEARCH METHODS IN I/O PSYCHOLOGY

Module 4 - Multiple Logistic Regression

Transcription:

Statistical Control using Partial and Semi-partial (Part) A study of performance in graduate school produced the following correlations. 1st year graduate gpa -- criterion variable undergraduate gpa full scale IQ score number weekly study hours Pearson Pearson Pearson Pearson 1st year graduate gpa number -- criterion undergrad full scale weekly study variable uate gpa IQ score hours 1.642.612.268..000.000.003.642 1.464.532.000..000.000.612.464 1.465.000.000..000.268.532.465 1.003.000.000. ot surprisingly graduate grades were positively correlated with all three of these likely predictors -- undergraduate grades, IQ and number of hours studied each week. However the researcher wanted to explore these relationships further. Partial The first question was whether there was a relationship between graduate and undergraduate grades after controlling both for IQ. Since there is no way to randomly assign folks to and manipulate their IQ, statistical control must be applied. This partial correlation is easily obtained using SPSS. Analyze Partial Move the "core variables" into the "Variables" window and the control variable into the "Controlling for" window.

PARTIAL CORR /VARIABLES=GRADGPA UGRADGPA BY iq. variables being correlated BY control(s) We get Contr IQ GRADGPA UGRADGPA GRADGPA UGRADGPA 1.000.511..000 0 119.511 1.000.000. 119 0 There is a substantial, significant positive correlation between the two types of grades, even after controlling both for IQ. otice that the partial correlation is somewhat smaller than the simple correlation. This suggests that part of the simple correlation is due to each of the variables being related to IQ. When IQ is "removed" from the grades, their relationship is somewhat weaker. This is a common result when partialing. A second question was whether there was a relationship between graduate grades and study time after controlling both for IQ. The results were Contr IQ GRADGPA UGRADGPA GRADGPA UGRADGPA 1.000 -.024..797 0 119 -.024 1.000.797. 119 0 While there was a significant positive simple correlation between graduate grades and study time, the correlation "disappears" when both variables are controlled for IQ. Both grade variables are substantially correlated with IQ, so "removing" IQ from both takes away most of the apparent relationship between them. A third question involved the multiple partial correlation of the two grade variables controlling both for IQ and study time. However, after careful consideration the researcher decided that this wouldn't make sense. The study time variable referred to study time during graduate school. So, it didn't make sense to control undergraduate grades for graduate study time! Always carefully consider whether your intended partialing makes sense -- especially consider the temporal aspects of the data collection. Semi-partial (Part) A modification of the third question occurred to the researcher. What is the correlation between the two types of grades controlling graduate grades for graduate study time? This is a semi-partial (part) correlation question and makes sense than the partial version mentioned earlier. We can't get a matrix of semi-partial (or multiple semi-partial) correlations from SPSS, but we can obtain these correlations via multiple regression analyses.

Analyze Regression Linear The semi-partial we are after is rugradgpa(gradgpa.stdyhrs) Be sure to put the control variable and the variable being controlled into the "Independent(s)" window and the other variable into the "Dependent" window. Click "Statistics" and be sure that "Part and partial correlations" is checked. REGRESSIO /STATISTICS COEFF OUTS R AOVA ZPP /DEPEDET ugradgpa /METHOD=ETER gradgpa stdyhrs ZPP gets the partial and semi-partial (part) correlations put the variable not being controlled as the criterion put both the correlate and the control in as predictor Here are the pertinent parts of the resulting output The number we need to answer the research question is.519 -- the semi-partial (part) correlation between undergraduate gpa and graduate gpa, controlling graduate gpa for study time. The t-test of the related regression weight (B) provides a significance test for this semi-partial correlation. So we can see that there is a substantial positive part correlation between these two types of grades -- folks with better undergraduate grades tend to also have better graduate grades, even after controlling the latter for number of study hours. Model 1 (Constant) 1st year graduate gpa -- criterion variable number weekly study hours Unstand ardized Coefficie nts a. Dependent Variable: undergraduate gpa Coefficients a Standar dized Coeffici ents B Beta t Sig. -2.964-8.269.000 Zeroorder Partial Part.902.538 8.455.000.642.613.519.388.388 6.085.000.532.487.373

Multiple Semi-partial (Part) The last research question was whether graduate grades and study time were correlated after controlling the later for IQ and undergraduate grades. This is a multiple semi-partial (part) question. We'd put graduate grades in the "Dependent" window and study hours, IQ and undergraduate grades in the "Independent(s)" window, and get REGRESSIO /STATISTICS COEFF OUTS R AOVA ZPP /DEPEDET ugradgpa /METHOD=ETER stdyhrs iq gradgpa put the variable not being controlled as the criterion put both correlate and controls in as predictors Model 1 (Constant) number weekly study hours full scale IQ score undergraduate gpa Unstanda rdized Coefficien ts Coefficients a Standar dized Coefficie nts B Beta t Sig. 3.299 92.0.000 -.147 -.245-3.32.001.268 -.292 -.198.281.469 6.638.000.612.521.397.332.555 7.523.000.642.569.450 a. Dependent Variable: 1st year graduate gpa -- criterion variable Zeroorder Partial Part The number we want is -.198 -- the multiple semi-partial (part) correlation between graduate grades and study time, controlling the latter for IQ and undergraduate grades. The t-test tells us this correlation is significant. What's interesting here is that this multiple semi-partial correlation is negative, whereas the simple correlation between graduate grades and study time is positive (r=.268). So, study time is positively correlated with graduate grades, but the part of study time that is independent of IQ and undergraduate grades is negatively correlated with graduate grades. A substantive interpretation of this might be that the graduate students with lower IQ and poorer graduate grades do more studying to maintain their graduate grades. You should notice that the correlations involved in this last question are in the.2-.25 range -- if we had had a smaller sample (say, by 1/2) we'd have "missed" the whole issue!!

Customized Control There are occasions when we want to control the X and the Y variables for different other variables. While simple to do, it is important to have a solid a priori good reason to do this. Probably the most meaningful situation is when Y is controlled for a psychometric confound and X is controlled for a design confound. Using these variables (pretending for the moment that the # hours refers to undergraduate), imagine that we want to examine the causal relationship between the causal variable how much is learned in undergraduate school and the effect variable how much is learned in graduate school. We have UGPA and GGPA, however we suspect that the bivariate correlation between the two (r =.642) doesn t tell the whole story. We can t randomly assign and manipulated UGPA. Based on this, we might want to control it for # hours of study (certainly not the only confound involved, but remember, the game is more complex models are more accurate, on average ). Also, it is known that GGPA (perhaps more than other levels of scholastic performance) is related to intelligence. Given this, we might decide that we want to control GGPA for IQ. Thus, we want the correlation between the part of GGPA that is not IQ and the part of UGPA that is not # study hours. Getting these residualizations is simple with SPSS. We ll just 1. Get a model with IQ as the predictor and GGPA as the criterion keeping each participant s residual 2. Get a model with #hrs as the predictor and UGPA as the criterion keeping each participant s residual 3. Correlate the residuals (remember that is really -3, so the p-value of this correlation is bit small) Analyze Regression Linear Here s an example, we would do two of these, the other with UGPA and hrstudy Then we get the correlation between the two residuals REGRESSIO /DEPEDET gradgpa /METHOD=ETER iq /SAVE RESID. REGRESSIO /DEPEDET ugradgpa /METHOD=ETER stdyhrs /SAVE RESID. CORR VARIABLES = RES_1 RES_2. Here s the resulting dataset and correlations Unstandardiz d Residual Unstandardiz d Residual Unstandardiz Unstandardiz ed Residual ed Residual Pearson Correlatio 1.426..000. 120 Pearson Correlatio.426 1.000. 120. We d conclude the part of grad GPA that isn t IQ and the part of undergrad GPA that isn t study hours are related!