# [This document contains corrections to a few typos that were found on the version available through the journal s web page]

Save this PDF as:

Size: px
Start display at page:

Download "[This document contains corrections to a few typos that were found on the version available through the journal s web page]"

## Transcription

1 Online supplement to Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67, DOI: /bmsp [This document contains corrections to a few typos that were found on the version available through the journal s web page] This document contains instructions for the implementation of the method described in Hayes and Preacher (2014) using Mplus as well as using the PROCESS and MEDIATE macros for SPSS and SAS. Following the code, various miscellaneous issues and extensions are addressed, including interpretation of model coefficients using sequential group coding, accounting for random measurement error, dealing with confounds statistically, and models with multiple mediators. Mplus Code Corresponding to the Web Portal Customization Example Any structural equation modeling program can produce estimates of the coefficients in a mediation model. Mplus offers features such as bootstrap confidence intervals for indirect effects and inferential tests for functions of parameters that make it a particularly good choice for the kind of analysis we describe in the manuscript. Importantly, the constraints of the freely available demonstration version of Mplus (available from do not preclude its use for estimation of mediation models with a single mediator and a categorical independent variable with as many as three levels. The code below implements the method described in the manuscript and can easily be adapted to mediation analysis with multiple mediators, latent variables, or an independent variable with more than three levels. DATA: FILE is c:\sri.txt; VARIABLE: NAMES are cond custom attitude inter; USEVARIABLES are attitude inter d1 d2;!indicator coding DEFINE: if (cond eq 1) then d1 = 0; if (cond eq 1) then d2 = 0; if (cond eq 2) then d1 = 1; if (cond eq 2) then d2 = 0; if (cond eq 3) then d1 = 0; if (cond eq 3) then d2 = 1;!model definition MODEL: inter ON d1 (a1) d2 (a2); attitude ON inter (b) d1 (cp1) d2 (cp2);!relative indirect effects; MODEL INDIRECT: attitude IND inter d1; attitude IND inter d2; MODEL CONSTRAINT: new (tot1 tot2); tot1=a1*b+cp1; tot2=a2*b+cp2; 1

2 The resulting output is below. This output was used to construct parts of Table 2 in the manuscript. MODEL RESULTS Two-Tailed Estimate S.E. Est./S.E. P-Value INTER ON D D ATTITUDE ON INTER D D Intercepts ATTITUDE INTER Residual Variances ATTITUDE INTER New/Additional Parameters TOT TOT TOTAL, TOTAL INDIRECT, SPECIFIC INDIRECT, AND DIRECT EFFECTS Effects from D1 to ATTITUDE Two-Tailed Estimate S.E. Est./S.E. P-Value Sum of indirect Specific indirect ATTITUDE INTER D Effects from D2 to ATTITUDE Sum of indirect Specific indirect ATTITUDE INTER D

3 For contrast coding as described in the text, replace the DEFINE section above with!orthogonal contrast coding DEFINE: if (cond eq 1) then d1 = ; if (cond eq 1) then d2 = 0; if (cond eq 2) then d1 = 0.333; if (cond eq 2) then d2 = -0.5; if (cond eq 3) then d1 = 0.333; if (cond eq 3) then d2 = 0.5; For sequential coding as discussed later in this supplement, replace the DEFINE section of the core program with!sequential coding DEFINE: if (cond eq 1) then d1 = 0; if (cond eq 1) then d2 = 0; if (cond eq 2) then d1 = 1; if (cond eq 2) then d2 = 0; if (cond eq 3) then d1 = 1; if (cond eq 3) then d2 = 1; To generate 95% and 99% bias corrected bootstrap confidence intervals for relative indirect effects (as well as all other parameter estimates), add the lines below to the program. For percentile confidence intervals, change bcbootstrap below to bootstrap. ANALYSIS: bootstrap = 10000; OUTPUT: cinterval (bcbootstrap); Estimation using PROCESS for SPSS and SAS PROCESS is a freely-available regression-based path analysis macro for both SPSS and SAS that estimates the model coefficients in mediation and moderation models of various forms while also providing modern inferential methods for inference about indirect effects including bootstrap confidence intervals. Its use in mediation analysis is described in Hayes (2013) along with documentation of its many features, and can be downloaded from [web address withheld for peer review]) One documented limitation of PROCESS is that only a single X variable can be specified in a mediation model, and it must be either dichotomous or continuous. However, with the strategic use of covariates, manual construction of the indicator codes prior to execution, and multiple executions of the macro, PROCESS can estimate a model as in Figure 2 of the manuscript. The results generated by PROCESS will be identical to what Mplus generates, with the exception of standard errors which will tend to be slightly smaller than OLS standard errors in smaller samples. These differences in standard errors dissipate rapidly as sample size increases. The example SPSS PROCESS code and output below corresponds to the analysis of the web portal customization study using indicator coding of customization condition. Variables named ATTITUDE and INTER contain measurements of attitudes toward the web portal and perceived interactivity, respectively, and variable COND codes experimental condition (1 = control, 2 = moderate customization, 3 = high customization). Because PROCESS allows only a single independent variable that must be either dichotomous or continuous, it must be tricked into estimating a model with a multicategorical independent variable. This is done by running PROCESS k 1 times, where k is the number of levels of the independent variable, and using k 1 group codes constructed prior the execution of PROCESS. At each run, one of the group codes is used as X 3

4 and the others as covariate(s), with the code serving as X being swapped with a covariate at subsequent PROCESS runs. So that the same bootstrap samples are used in consecutive executions, the random number generator should be seeded using the seed = command, with the same seed used time. This seed can be chosen arbitrarily. This code first constructs two dummy variables coding experimental condition with the control condition (cond = 1) as the reference category. The following PROCESS command then executes a mediation model with the first dummy variable as X and the other as a covariate. This generates estimates of a 1, a 2, b, c 1, c 2, c' 1, and c' 2 corresponding to the values in Table 2 of the manuscript, as well as a bias-corrected bootstrap confidence interval for a 1 b based on 10,000 bootstrap samples. The summary table at the end includes the three effects of X, which in this case are the relative total, direct, and indirect effects for moderate customization relative to the control condition (c 1, and c' 1, and a 1 b), in that order. compute d1=(cond=2). compute d2=(cond=3). process vars=attitude inter d1 d2/y=attitude/m=inter/x=d1/total=1/ model=4/boot=10000/seed=3423. Model = 4 Y = attitude X = d1 M = inter Statistical Controls: CONTROL= d2 Sample size 60 ************************************************************************** Outcome: inter Model Summary R R-sq F df1 df2 p Model coeff se t p LLCI ULCI constant d d ************************************************************************** Outcome: attitude Model Summary R R-sq F df1 df2 p Model coeff se t p LLCI ULCI constant inter d d ************************** TOTAL EFFECT MODEL **************************** 4

5 Outcome: attitude Model Summary R R-sq F df1 df2 p Model coeff se t p LLCI ULCI constant d d ***************** TOTAL, DIRECT, AND INDIRECT EFFECTS ******************** Total effect of X on Y Effect SE t p LLCI ULCI Direct effect of X on Y Effect SE t p LLCI ULCI Indirect effect of X on Y Effect Boot SE BootLLCI BootULCI inter ******************** ANALYSIS NOTES AND WARNINGS ************************* Number of bootstrap samples for bias corrected bootstrap confidence intervals: Level of confidence for all confidence intervals in output: Missing from the output above is the relative indirect effect for high customization relative to none (a 2 b) along with a bootstrap confidence interval for inference. The code below generates this relative indirect effect by switching d1 and d2 in the x= specification. Most of the output is identical to the code generated by the command above, so that output is suppressed by using the detail=0 option. Using the same random number seed as in the prior run of PROCESS produces a bootstrap confidence interval based on the same set of bootstrap samples. The effects for X in this summary table are the relative total, direct, and indirect effects for high customization relative to the control condition (c 2, c' 2, and a 2 b), in that order. process vars=attitude inter d1 d2/y=attitude/m=inter/x=d2/total=1/ model=4/boot=10000/seed=3423/detail=0. Model = 4 Y = attitude X = d2 M = inter Statistical Controls: CONTROL= d1 Sample size 60 ***************** TOTAL, DIRECT, AND INDIRECT EFFECTS ******************** 5

6 Total effect of X on Y Effect SE t p LLCI ULCI Direct effect of X on Y Effect SE t p LLCI ULCI Indirect effect of X on Y Effect Boot SE BootLLCI BootULCI inter ******************** ANALYSIS NOTES AND WARNINGS ************************* Number of bootstrap samples for bias corrected bootstrap confidence intervals: Level of confidence for all confidence intervals in output: The SPSS compute commands above generate indicator codes with the control group as the reference group. The commands to generate the contrast codes used in the example analysis would be if (cond=1) d1 = if (cond=1) d2 = 0. if (cond=2) d1 = if (cond=2) d2 = if (cond=3) d1 = if (cond=3) d2 = 0.5. For the sequential coding example described below, the following SPSS commands construct the sequential codes: compute d1 = (cond > 1). compute d2 = (cond > 2). The PROCESS macro is available for SAS but requires PROC IML. The command structure is very similar to the SPSS version, but the construction of group codes requires commands that are different than those used in SPSS. The SAS code below conducts the example analysis using indicator coding of groups, assuming the data reside in a SAS data file named web : data web;set web;d1=(cond=2);d2=(cond=3);run; %process (data=web,vars=attitude inter d1 d2,y=attitude,m=inter,x=d1, total=1,model=4,boot=10000,seed=3423); %process (data=web,vars=attitude inter d1 d2,y=attitude,m=inter,x=d2, total=1,model=4,boot=10000,seed=3423,detail=0); For the contrast codes corresponding to the example analysis in this paper, change the DATA line to read: data web;set web; if (cond=1) then do;d1=-0.667;d2=0;end; if (cond=2) then do;d1=0.333;d2=-0.5;end; if (cond=3) then do;d1=0.333;d2=0.5;end; run; 6

7 For the sequential codes described in the example below, the DATA line should read data web;set web;d1=(cond>1);d2=(cond>2);run; Estimation using MEDIATE for SPSS MEDIATE is a freely available SPSS macro (downloadable from [web address blinded for review]) that facilitates the estimation of mediation models with multicategorical independent variables along with the ability to generate bootstrap confidence intervals for indirect effects. It is very limited in its features relative to PROCESS, but it does have one handy option that automates the construction of codes for a categorical independent variable. The code and output below corresponds to the analysis of the web portal customization study using indicator coding of customization condition. Variables named ATTITUDE and INTER contain measurements of attitudes toward the web portal and perceived interactivity, respectively, and variable COND codes experimental condition (1 = control, 2 = moderate customization, 3 = high customization). The catx=1 option specifies indicator coding and sets the control condition as the reference group. See the documentation for additional information about the MEDIATE macro and its options. mediate y=attitude/x=cond/m=inter/samples=10000/total=1/catx=1. Run MATRIX procedure: VARIABLES IN THE FULL MODEL: Y = attitude M1 = inter X = cond CODING OF CATEGORICAL X FOR ANALYSIS: cond D1 D ***************************************************************************** OUTCOME VARIABLE: attitude MODEL SUMMARY (TOTAL EFFECTS MODEL) R R-sq Adj R-sq F df1 df2 p MODEL COEFFICIENTS (TOTAL EFFECTS MODEL) Coeff. s.e. t p Constant D D ***************************************************************************** OUTCOME VARIABLE: inter MODEL SUMMARY R R-sq Adj R-sq F df1 df2 p

8 MODEL COEFFICIENTS Coeff. s.e. t p Constant D D ***************************************************************************** OUTCOME VARIABLE: attitude MODEL SUMMARY R R-sq adj R-sq F df1 df2 p MODEL COEFFICIENTS Coeff. s.e. t p Constant inter D D TEST OF HOMOGENEITY OF REGRESSION (X*M INTERACTION) R-sq F df1 df2 p inter ***************************************************************************** INDIRECT EFFECT(S) THROUGH: inter Effect SE(boot) LLCI ULCI D D ********************* ANALYSIS NOTES AND WARNINGS ************************* NOTE: Indicator coding is used for categorical X Number of samples used for indirect effect confidence intervals: Level of confidence for confidence intervals: Bias corrected bootstrap confidence intervals for indirect effects are printed in output Sequential Coding of Groups In the web portal customization study, the three levels of the manipulation can be rank ordered with respect to degree of customization (none, moderate, or high). When the categories of a multicategorical predictor can be so ordered, sequential coding can be useful. With sequential codes, the relative direct and indirect effects can be interpreted as the effects of membership in one group relative to the group one step sequentially lower in the ordered system. Darlington (1990, pp ) describes sequential coding for a categorical variable with any number of ordered categories. With only three groups, the coding is relatively 8

9 simple. For the control condition (the lowest level of customization), D 1 and D 2 are set to 0, for the moderately customized condition (the next highest level of customization), D 1 = 1, D 2 = 0, and for the highest level of customization, D 1 = D 2 = 1. Estimating the coefficients in Equations 6, 7, and 8 in the manuscript yields the following results: i 1 = 4.250, i 2 = 2.810, i 3 = 4.335, a 1 = 1.575, p = 0.001; a 2 = 0.675, p = 0.166; b = 0.359, p < 0.001; c' 1 = 1.105, p = 0.003; c' 2 = 1.053, p =.002; c 1 = 1.670, p <.001; c 2 = 1.295, p < As with the other two methods of coding groups described in the manuscript, the resulting models reproduce the group means on M as well as Y (adjusted and unadjusted). The relative indirect effects are still estimated as products of coefficients. The a 1 coefficients quantify the mean differences in perceived interactivity between the moderate customization and control condition (a 1 ) and between the high and moderate customization conditions (a 2 ). That is, and a 1 = M moderate M control a 2 = M high M moderate When a 1 and a 2 are multiplied by the effect of interactivity on attitudes, holding customization condition constant (b = 0.359), the result is the relative indirect effects of customization on attitudes through perceived interactivity: and a 1 b = 1.575(0.359) = a 2 b = 0.675(0.359) = The relative indirect effect a 1 b estimates the indirect effect of moderate customization relative to none through perceived interactivity on attitudes. Those who browsed using a moderately customized portal had attitudes that were units more favorable on average (with a 95% bias-corrected bootstrap confidence interval from to 1.255) than those assigned to the noncustomized portal condition as a result of this indirect mechanism linking customization to attitudes through perceived interactivity. The relative indirect effect a 2 b estimates the indirect effect of high relative to moderate customization through perceived interactivity. Browsing with a highly customized portal resulted in attitudes that were units more favorable on average than browsing using a moderately customized portal as a result of this indirect mechanism linking customization to attitudes through perceived interactivity. Zero cannot be rejected as a plausible value for this indirect effect, as a 95% bias-corrected bootstrap confidence interval straddled zero ( to 0.710). Using this coding system, the relative direct effect c' 1 corresponds to the effect of moderate customization on attitudes relative to none, independent of perceived interactivity, and the relative direct effect c' 2 is the effect of high customization relative to moderate customization. This corresponds to differences between the adjusted means: * moderate * control c' 1 = Y Y = = * * c' 2 = Y high Y moderate = = As when other coding systems are used, the relative total effects can be estimated using Equation 8 in the manuscript or by adding the relative direct and indirect effects. With sequential coding, c 1 estimates the mean difference in attitude between the moderately customized and control groups, and c 2 estimates the mean difference in attitude between the highly customized and moderately customized groups. That is, 9

10 c 1 = Y moderate Y control c 2 = Y high Y moderate As both effects are positive, this suggests attitudes increase in favorability as customization increases. Finally, notice that as with indicator or contrast coding, the relative total effects partition cleanly into the relative direct and relative indirect effects: c 1 = c' 1 + a 1 b = = and c 2 = c' 2 + a 2 b = = Random Measurement Error The example analyses in the manuscript and this supplement ignore the potential influence of random measurement error in X, M, or Y. In experiments, and even when X is an observed categorical variable, measurement error in X is often negligible to nonexistent unless the categories were constructed through some kind of artificial categorization of a continuum or there is some ambiguity or subjectivity in the decision as to which category a particular case in the data belongs. But M and/or Y may and often do contain some random measurement error, such as when they are sum scores from a psychological test, personality inventory, or attitude scale. If M, Y, or both is measured with error, the result is bias in the estimation of the effects of X, reduced statistical power, or both (see, e.g., Darlington, 1990, pp ; Ledgerwood & Shrout, 2012). The method described in the manuscript can easily be extended using Mplus or another SEM program using single indicator latent variables with reliability-weighted errors (see e.g., Kline, 2005) or latent variable model with a measurement model component that links the latent variable causally to its indicators. Both approaches potentially reduce at least some of the deleterious effects of random measurement error. As with any measurement model, the researcher should ascertain whether the measurement model for the latent variable(s) satisfies various criteria for claiming good fit, for direct and indirect effects linking latent variables that are not modeled well have little substantive meaning. For discussions of latent variable mediation analysis, see Cheung and Lau (2008), Coffman and MacCallum (2005), Lau and Cheung (2012), and MacKinnon (2008). Multiple Mediators The approach we have illustrated for estimating relative indirect and direct effects can be extended to models with any number (m) of mediators operating in parallel. Figure S1 depicts a model with m proposed mediators and a multicategorical X with k categories. The relative total effects, c i, can be estimated if desired using Equation 8 in the manuscript, whereas the relative indirect and direct effects are pieced together from parameter estimates from m + 1 linear models, one for each of the m mediators and one for Y: M j = i 1j + a 1j D 1 + a 2j D a (k-1)j D k-1 + e Mj Y = i 2 + c' 1 D 1 + c' 2 D c' k-1 D k-1 + b 1 M 1 + b 2 M b m M m + e Y (S1) (S2) The same relationships among relative total, indirect, and direct effects exist in multiple-mediator models as in single-mediator models. The relative total effect for D i can be partitioned into the relative direct effect for D i plus the sum of the relative specific indirect effects for D i, a i1 b 1 + a i2 b a im b m. That is, m c c a b (S3) i i j 1 ij j This last term in Equation S3 is the relative total indirect effect of D i. Each relative specific indirect effect quantifies the component of the relative total indirect effect that is carried uniquely through that mediator. Inferential tests of relative specific indirect effects can be undertaken just as described in the manuscript, and these would typically be the focus of a mediation analysis. The Mplus code above can be modified without 10

11 difficulty to include multiple mediators, and the PROCESS and MEDIATE procedures for SPSS and SAS allow for multiple mediators operating in parallel in this fashion. See the documentation. Covariates and Confounds In a mediation model, the interpretation of an indirect effect as a causal one assumes that the mediator M is causally located between X and Y. That is, it is assumed that X causes M and M causes Y. When X is experimentally manipulated and sound experimental procedures are followed, a causal association between X and M and between X and Y is established by showing that the k groups differ on M and Y on average. Of course, as many others have emphasized before us (e.g., Bullock et al., 2010; Hayes, 2013; Mathieu, DeShon, & Bergh, 2008; Stone-Romero & Rosopa, 2010), this does not establish that M causes Y. It could be that Y causes M or that M and Y are spuriously associated (both are caused by some variable W) or epiphenomenally associated (M is correlated with the true intermediary variable W). If X is not experimentally manipulated, such threats to causal inference also exist in the interpretation of the association between X and M as well. Spuriousness and epiphenomenality, as alternative explanations at least with respect to a given competing variable W, can be accounted for in a mediation model by including W as an additional predictor or covariate in the models of M and Y. For example, Equations 1, 2, and 3 in the manuscript with the inclusion of W as a covariate would be M = i 1 + ax + d 1 W + e M Y = i 2 + c'x + bm + d 2 W + e Y Y = i 3 + cx + d 3 W+ e Y (S4) (S5) (S6) The addition of covariates is simple in any OLS regression program; covariates can be added to each of the ON statements in the Mplus code above, and the PROCESS and MEDIATE macros also accept covariates. References Bullock, J. G., Green, D. P., & Ha, S. E. (2010). Yes, but what is the mechanism? (Don t expect an easy answer). Journal of Personality and Social Psychology, 98, Cheung, G. W., & Lau, R. S. (2008). Testing mediation and suppression effects of latent variables: Bootstrapping with structural equation models. Organizational Research Methods, 11, Coffman, D. L., & MacCallum, R. C. (2005). Using parcels to convert path models into latent variable models. Multivariate Behavioral Research, 40, Darlington, R. B. (1990). Regression and linear models. New York: McGraw-Hill. Hayes, A. F. (2013). An introduction to mediation, moderation, and conditional process analysis. New York: The Guilford Press. Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67, Kline, R. B. (2005). Principles and practices of structural equation modeling (2 nd Ed.). New York: Guilford Press. Lau, R. S., & Cheung, G. W. (2012). Estimating and comparing specific mediation effects in complex latent variable models. Organizational Research Methods, 15, Ledgerwood, A., & Shrout, P. E. (2011). The trade-off between accuracy and precision in latent variable models of mediation processes. Journal of Personality and Social Psychology, 101, MacKinnon, D. P. (2008). An introduction to statistical mediation analysis. New York: Routledge. Mathieu, J. E., DeShon, R. P., & Bergh, D. D. (2008). Mediational inferences in organizational research. Organizational Research Methods, 11,

12 Preacher, K. J., & Hayes, A. F. (2008). Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models. Behavior Research Methods, 40, Stone-Romero, E. F., & Rosopa, P. J. (2010). Research design options for testing mediation models and their implications for facets of validity. Journal of Managerial Psychology, 25,

13 Figure S1. A multiple mediation model in path diagram form corresponding to a model with an independent variable X with k categories and m mediators operating in parallel. When estimating using a structural equation modeling program, it is recommended that the covariance between mediator errors be freely estimated (see e.g., Preacher and Hayes, 2008). 13

### Moderation. Moderation

Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

### Supplementary PROCESS Documentation

Supplementary PROCESS Documentation This document is an addendum to Appendix A of Introduction to Mediation, Moderation, and Conditional Process Analysis that describes options and output added to PROCESS

### Statistical mediation analysis with a multicategorical independent variable

451 British Journal of Mathematical and Statistical Psychology (2014), 67, 451 470 2013 The British Psychological Society Expert Tutorial Statistical mediation analysis with a multicategorical independent

### Testing mediation using bootstrapping in SPSS

To fully understand the following procedures read these key papers Hayes, A. F. (2009). Beyond Baron and Kenny: Statistical Mediation Analysis in the New Millennium. Communication Monographs, 76(4), 408-420.

### Model Templates for PROCESS for SPSS and SAS c 2013-2015 Andrew F. Hayes and The Guilford Press. Model 1. Conceptual Diagram. Statistical Diagram.

c 203-205 Andrew F. Hayes and The Guilford Press Model M b e M b 2 b 3 M Conditional effect of on = b + b 3 M c 203-205 Andrew F. Hayes and The Guilford Press Model Multicategorical M with k categories

### Mediation with Dichotomous Outcomes

April 19, 2013 This is a draft document to assist researchers. Please do not cite or quote without the author s permission. I thank both Nathaniel R. Herr and Craig Nathanson who suggested corrections

### Hacking PROCESS for Estimation and Probing of Linear Moderation of Quadratic Effects and Quadratic Moderation of Linear Effects

Hacking PROCESS for Estimation and Probing of Linear Moderation of Quadratic Effects and Quadratic Moderation of Linear Effects Andrew F. Hayes The Ohio State University Unpublished White Paper, DRAFT

### Guidelines for the Investigation of Mediating Variables in Business Research

J Bus Psychol (2012) 27:1 14 DOI 10.1007/s10869-011-9248-z Guidelines for the Investigation of Mediating Variables in Business Research David P. MacKinnon Stefany Coxe Amanda N. Baraldi Published online:

### Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building

### CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

### Using Minitab for Regression Analysis: An extended example

Using Minitab for Regression Analysis: An extended example The following example uses data from another text on fertilizer application and crop yield, and is intended to show how Minitab can be used to

### Regression. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Class: Date: Regression Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Given the least squares regression line y8 = 5 2x: a. the relationship between

### Correlational Research

Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

### Categorical Variables in Regression: Implementation and Interpretation By Dr. Jon Starkweather, Research and Statistical Support consultant

Interpretation and Implementation 1 Categorical Variables in Regression: Implementation and Interpretation By Dr. Jon Starkweather, Research and Statistical Support consultant Use of categorical variables

### , then the form of the model is given by: which comprises a deterministic component involving the three regression coefficients (

Multiple regression Introduction Multiple regression is a logical extension of the principles of simple linear regression to situations in which there are several predictor variables. For instance if we

### e = random error, assumed to be normally distributed with mean 0 and standard deviation σ

1 Linear Regression 1.1 Simple Linear Regression Model The linear regression model is applied if we want to model a numeric response variable and its dependency on at least one numeric factor variable.

### CHAPTER 3 COMMONLY USED STATISTICAL TERMS

CHAPTER 3 COMMONLY USED STATISTICAL TERMS There are many statistics used in social science research and evaluation. The two main areas of statistics are descriptive and inferential. The third class of

### Running head: ASSUMPTIONS IN MULTIPLE REGRESSION 1. Assumptions in Multiple Regression: A Tutorial. Dianne L. Ballance ID#

Running head: ASSUMPTIONS IN MULTIPLE REGRESSION 1 Assumptions in Multiple Regression: A Tutorial Dianne L. Ballance ID#00939966 University of Calgary APSY 607 ASSUMPTIONS IN MULTIPLE REGRESSION 2 Assumptions

### AP Statistics 2002 Scoring Guidelines

AP Statistics 2002 Scoring Guidelines The materials included in these files are intended for use by AP teachers for course and exam preparation in the classroom; permission for any other use must be sought

### 1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand

### Class 6: Chapter 12. Key Ideas. Explanatory Design. Correlational Designs

Class 6: Chapter 12 Correlational Designs l 1 Key Ideas Explanatory and predictor designs Characteristics of correlational research Scatterplots and calculating associations Steps in conducting a correlational

### LEARNING OBJECTIVES SCALES OF MEASUREMENT: A REVIEW SCALES OF MEASUREMENT: A REVIEW DESCRIBING RESULTS DESCRIBING RESULTS 8/14/2016

UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION LEARNING OBJECTIVES Contrast three ways of describing results: Comparing group percentages Correlating scores Comparing group means Describe

### The Trade-Off Between Accuracy and Precision in Latent Variable Models of Mediation Processes

Journal of Personality and Social Psychology 2011 American Psychological Association 2011, Vol. 101, No. 6, 1174 1188 0022-3514/11/\$12.00 DOI: 10.1037/a0024776 The Trade-Off Between Accuracy and Precision

### DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

### " Y. Notation and Equations for Regression Lecture 11/4. Notation:

Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

### CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships

### Overview of Factor Analysis

Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

### HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

### MS&E 226: Small Data. Lecture 17: Additional topics in inference (v1) Ramesh Johari

MS&E 226: Small Data Lecture 17: Additional topics in inference (v1) Ramesh Johari ramesh.johari@stanford.edu 1 / 34 Warnings 2 / 34 Modeling assumptions: Regression Remember that most of the inference

### Comparing Conditional Effects in Moderated Multiple Regression: Implementation using PROCESS for SPSS and SAS

Comparing Conditional Effects in Moderated Multiple Regression: Implementation using PROCESS for SPSS and SAS Andrew F. Hayes The Ohio State University Department of Psychology This document describes

### How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power

STRUCTURAL EQUATION MODELING, 9(4), 599 620 Copyright 2002, Lawrence Erlbaum Associates, Inc. TEACHER S CORNER How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power Linda K. Muthén

### Ordinal Regression. Chapter

Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

### PROCESS: A Versatile Computational Tool for Observed Variable Mediation, Moderation, and Conditional Process Modeling 1

How to cite this document: Hayes, A. F. (2012). PROCESS: A versatile computational tool for observed variable mediation, moderation, and conditional process modeling [White paper]. Retrieved from http://www.afhayes.com/

### Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction

### How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power Linda K. Muthén & Bengt O. Muthén Published online: 19 Nov 2009.

This article was downloaded by: [University of California, Los Angeles (UCLA)] On: 19 February 2014, At: 05:09 Publisher: Routledge Informa Ltd Registered in England and Wales Registered Number: 1072954

### 1.1. Simple Regression in Excel (Excel 2010).

.. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under

### Multivariate Analysis. Overview

Multivariate Analysis Overview Introduction Multivariate thinking Body of thought processes that illuminate the interrelatedness between and within sets of variables. The essence of multivariate thinking

### Multiple Regression: What Is It?

Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

### Multiple Linear Regression in Data Mining

Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple

### Introduction to Data Analysis in Hierarchical Linear Models

Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM

### Introduction to Longitudinal Data Analysis

Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction

### Regression Analysis: A Complete Example

Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

### SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

### Rens van de Schoot a b, Peter Lugtig a & Joop Hox a a Department of Methods and Statistics, Utrecht

This article was downloaded by: [University Library Utrecht] On: 15 May 2012, At: 01:20 Publisher: Psychology Press Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office:

### 1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

### Effect Size Calculations and Elementary Meta-Analysis

Effect Size Calculations and Elementary Meta-Analysis David B. Wilson, PhD George Mason University August 2011 The End-Game Forest-Plot of Odds-Ratios and 95% Confidence Intervals for the Effects of Cognitive-Behavioral

### Applications of Structural Equation Modeling in Social Sciences Research

American International Journal of Contemporary Research Vol. 4 No. 1; January 2014 Applications of Structural Equation Modeling in Social Sciences Research Jackson de Carvalho, PhD Assistant Professor

### MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

### Bivariate Analysis. Correlation. Correlation. Pearson's Correlation Coefficient. Variable 1. Variable 2

Bivariate Analysis Variable 2 LEVELS >2 LEVELS COTIUOUS Correlation Used when you measure two continuous variables. Variable 2 2 LEVELS X 2 >2 LEVELS X 2 COTIUOUS t-test X 2 X 2 AOVA (F-test) t-test AOVA

### When to Use a Particular Statistical Test

When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the

### Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

### Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.

### Psychology 205: Research Methods in Psychology

Psychology 205: Research Methods in Psychology Using R to analyze the data for study 2 Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 38 Outline 1 Getting ready

### CHAPTER 6 EXAMPLES: GROWTH MODELING AND SURVIVAL ANALYSIS

Examples: Growth Modeling And Survival Analysis CHAPTER 6 EXAMPLES: GROWTH MODELING AND SURVIVAL ANALYSIS Growth models examine the development of individuals on one or more outcome variables over time.

### Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives

### E205 Final: Version B

Name: Class: Date: E205 Final: Version B Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The owner of a local nightclub has recently surveyed a random

### Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

### CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES

Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical

### Perform hypothesis testing

Multivariate hypothesis tests for fixed effects Testing homogeneity of level-1 variances In the following sections, we use the model displayed in the figure below to illustrate the hypothesis tests. Partial

### Moderator and Mediator Analysis

Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,

### Regression in ANOVA. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Regression in ANOVA James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 30 Regression in ANOVA 1 Introduction 2 Basic Linear

### Simple Linear Regression Inference

Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

### PSYCH 6811 Statistical Methods in Psychology II Spring 2015. Course Description

PSYCH 6811 Statistical Methods in Psychology II Spring 2015 Lecture : MW 9:35 to 10:55 in PS 35 Lab : Th 9:35 to 10:55 / 11:10 to 12:30 in PS 22 Instructor : Dr. Andrew F. Hayes (hayes.338, LZ 230) Teaching

### Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

### Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com

SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING

### What is correlational research?

Key Ideas Purpose and use of correlational designs How correlational research developed Types of correlational designs Key characteristics of correlational designs Procedures used in correlational studies

### 11/20/2014. Correlational research is used to describe the relationship between two or more naturally occurring variables.

Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

### Structural Equation Models for Comparing Dependent Means and Proportions. Jason T. Newsom

Structural Equation Models for Comparing Dependent Means and Proportions Jason T. Newsom How to Do a Paired t-test with Structural Equation Modeling Jason T. Newsom Overview Rationale Structural equation

### Module 5: Multiple Regression Analysis

Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

### MS 2007-0081-RR Booil Jo. Supplemental Materials (to be posted on the Web)

MS 2007-0081-RR Booil Jo Supplemental Materials (to be posted on the Web) Table 2 Mplus Input title: Table 2 Monte Carlo simulation using externally generated data. One level CACE analysis based on eqs.

### 2015 TUHH Online Summer School: Overview of Statistical and Path Modeling Analyses

: Overview of Statistical and Path Modeling Analyses Prof. Dr. Christian M. Ringle (Hamburg Univ. of Tech., TUHH) Prof. Dr. Jӧrg Henseler (University of Twente) Dr. Geoffrey Hubona (The Georgia R School)

### Chapter 1 Introduction. 1.1 Introduction

Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations

### Research Methods & Experimental Design

Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

### Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics

### How To Use A Monte Carlo Study To Decide On Sample Size and Determine Power

How To Use A Monte Carlo Study To Decide On Sample Size and Determine Power Linda K. Muthén Muthén & Muthén 11965 Venice Blvd., Suite 407 Los Angeles, CA 90066 Telephone: (310) 391-9971 Fax: (310) 391-8971

### Can Mediation be Tested with Cross-sectional Studies?

Can Mediation be Tested with Cross-sectional Studies? Mediation Mediation occurs when a variable (IV) affects another variable (DV) wholly or partly via its effect on another variable (MV) 1 Mediation

### It Takes Four to Tango: Why a Variable Cannot Be a Mediator and a Moderator at the Same Time. Johann Jacoby and Kai Sassenberg

A MEDIATOR CANNOT ALSO BE A MODERATOR 1 It Takes Four to Tango: Why a Variable Cannot Be a Mediator and a Moderator at the Same Time Johann Jacoby and Kai Sassenberg Knowledge Media Research Center Tübingen

### Introduction Types of mediation Causal steps approach SPSS example Miscellaneous topics Proportion mediated Sobel test. Mediation analysis

Mediation analysis Seminar General Statistics Matthijs J Warrens GION Multiple regression analysis 1 dependent variable (INT) Several independent variables 1, 2, 3 (INT) (predictors) 1 2 3 Research question

### Using Correlation and Regression: Mediation, Moderation, and More

Using Correlation and Regression: Mediation, Moderation, and More Part 2: Mediation analysis with regression Claremont Graduate University Professional Development Workshop August 22, 2015 Dale Berger,

### 11. Analysis of Case-control Studies Logistic Regression

Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

### Resampling: Bootstrapping and Randomization. Presented by: Jenn Fortune, Alayna Gillespie, Ivana Pejakovic, and Anne Bergen

Resampling: Bootstrapping and Randomization Presented by: Jenn Fortune, Alayna Gillespie, Ivana Pejakovic, and Anne Bergen Outline 1. The logic of resampling and bootstrapping 2. Bootstrapping: confidence

### Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.

Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged

### Ph.D. Workshop. Advanced Methods in Structural Equation Modeling Mediation and Moderation Analysis

Ph.D. Workshop Advanced Methods in Structural Equation Modeling Mediation and Moderation Analysis 15 th 17 th December 2015 Course content and aims: Empirical research in various disciplines (e.g., management

### Introduction to Structural Equation Modeling (SEM) Day 4: November 29, 2012

Introduction to Structural Equation Modeling (SEM) Day 4: November 29, 202 ROB CRIBBIE QUANTITATIVE METHODS PROGRAM DEPARTMENT OF PSYCHOLOGY COORDINATOR - STATISTICAL CONSULTING SERVICE COURSE MATERIALS

### CORRELATION AND SIMPLE REGRESSION ANALYSIS USING SAS IN DAIRY SCIENCE

CORRELATION AND SIMPLE REGRESSION ANALYSIS USING SAS IN DAIRY SCIENCE A. K. Gupta, Vipul Sharma and M. Manoj NDRI, Karnal-132001 When analyzing farm records, simple descriptive statistics can reveal a

### The primary goal of this thesis was to understand how the spatial dependence of

5 General discussion 5.1 Introduction The primary goal of this thesis was to understand how the spatial dependence of consumer attitudes can be modeled, what additional benefits the recovering of spatial

### 10. Analysis of Longitudinal Studies Repeat-measures analysis

Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.

### Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and

### Chapter 4 and 5 solutions

Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

### Two SPSS programs for interpreting multiple regression results. Universitat Rovira i Virgili, Tarragona, Spain

Behavior Research Methods 2010, 42 (1), 29-35 doi:10.3758/brm.42.1.29 Two SPSS programs for interpreting multiple regression results Urbano Lorenzo-Seva, Pere J. Ferrando, and Eliseo Chico Universitat

### Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models.

Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Dr. Jon Starkweather, Research and Statistical Support consultant This month

### Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting

### Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group

Introduction to Multilevel Modeling Using HLM 6 By ATS Statistical Consulting Group Multilevel data structure Students nested within schools Children nested within families Respondents nested within interviewers

### Main Effects and Interactions

Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly

### 12/31/2016. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Understand when to use multiple Understand the multiple equation and what the coefficients represent Understand different methods

### Basic Statistcs Formula Sheet

Basic Statistcs Formula Sheet Steven W. ydick May 5, 0 This document is only intended to review basic concepts/formulas from an introduction to statistics course. Only mean-based procedures are reviewed,

Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén