[This document contains corrections to a few typos that were found on the version available through the journal s web page]
|
|
|
- May Andrews
- 9 years ago
- Views:
Transcription
1 Online supplement to Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67, DOI: /bmsp [This document contains corrections to a few typos that were found on the version available through the journal s web page] This document contains instructions for the implementation of the method described in Hayes and Preacher (2014) using Mplus as well as using the PROCESS and MEDIATE macros for SPSS and SAS. Following the code, various miscellaneous issues and extensions are addressed, including interpretation of model coefficients using sequential group coding, accounting for random measurement error, dealing with confounds statistically, and models with multiple mediators. Mplus Code Corresponding to the Web Portal Customization Example Any structural equation modeling program can produce estimates of the coefficients in a mediation model. Mplus offers features such as bootstrap confidence intervals for indirect effects and inferential tests for functions of parameters that make it a particularly good choice for the kind of analysis we describe in the manuscript. Importantly, the constraints of the freely available demonstration version of Mplus (available from do not preclude its use for estimation of mediation models with a single mediator and a categorical independent variable with as many as three levels. The code below implements the method described in the manuscript and can easily be adapted to mediation analysis with multiple mediators, latent variables, or an independent variable with more than three levels. DATA: FILE is c:\sri.txt; VARIABLE: NAMES are cond custom attitude inter; USEVARIABLES are attitude inter d1 d2;!indicator coding DEFINE: if (cond eq 1) then d1 = 0; if (cond eq 1) then d2 = 0; if (cond eq 2) then d1 = 1; if (cond eq 2) then d2 = 0; if (cond eq 3) then d1 = 0; if (cond eq 3) then d2 = 1;!model definition MODEL: inter ON d1 (a1) d2 (a2); attitude ON inter (b) d1 (cp1) d2 (cp2);!relative indirect effects; MODEL INDIRECT: attitude IND inter d1; attitude IND inter d2; MODEL CONSTRAINT: new (tot1 tot2); tot1=a1*b+cp1; tot2=a2*b+cp2; 1
2 The resulting output is below. This output was used to construct parts of Table 2 in the manuscript. MODEL RESULTS Two-Tailed Estimate S.E. Est./S.E. P-Value INTER ON D D ATTITUDE ON INTER D D Intercepts ATTITUDE INTER Residual Variances ATTITUDE INTER New/Additional Parameters TOT TOT TOTAL, TOTAL INDIRECT, SPECIFIC INDIRECT, AND DIRECT EFFECTS Effects from D1 to ATTITUDE Two-Tailed Estimate S.E. Est./S.E. P-Value Sum of indirect Specific indirect ATTITUDE INTER D Effects from D2 to ATTITUDE Sum of indirect Specific indirect ATTITUDE INTER D
3 For contrast coding as described in the text, replace the DEFINE section above with!orthogonal contrast coding DEFINE: if (cond eq 1) then d1 = ; if (cond eq 1) then d2 = 0; if (cond eq 2) then d1 = 0.333; if (cond eq 2) then d2 = -0.5; if (cond eq 3) then d1 = 0.333; if (cond eq 3) then d2 = 0.5; For sequential coding as discussed later in this supplement, replace the DEFINE section of the core program with!sequential coding DEFINE: if (cond eq 1) then d1 = 0; if (cond eq 1) then d2 = 0; if (cond eq 2) then d1 = 1; if (cond eq 2) then d2 = 0; if (cond eq 3) then d1 = 1; if (cond eq 3) then d2 = 1; To generate 95% and 99% bias corrected bootstrap confidence intervals for relative indirect effects (as well as all other parameter estimates), add the lines below to the program. For percentile confidence intervals, change bcbootstrap below to bootstrap. ANALYSIS: bootstrap = 10000; OUTPUT: cinterval (bcbootstrap); Estimation using PROCESS for SPSS and SAS PROCESS is a freely-available regression-based path analysis macro for both SPSS and SAS that estimates the model coefficients in mediation and moderation models of various forms while also providing modern inferential methods for inference about indirect effects including bootstrap confidence intervals. Its use in mediation analysis is described in Hayes (2013) along with documentation of its many features, and can be downloaded from [web address withheld for peer review]) One documented limitation of PROCESS is that only a single X variable can be specified in a mediation model, and it must be either dichotomous or continuous. However, with the strategic use of covariates, manual construction of the indicator codes prior to execution, and multiple executions of the macro, PROCESS can estimate a model as in Figure 2 of the manuscript. The results generated by PROCESS will be identical to what Mplus generates, with the exception of standard errors which will tend to be slightly smaller than OLS standard errors in smaller samples. These differences in standard errors dissipate rapidly as sample size increases. The example SPSS PROCESS code and output below corresponds to the analysis of the web portal customization study using indicator coding of customization condition. Variables named ATTITUDE and INTER contain measurements of attitudes toward the web portal and perceived interactivity, respectively, and variable COND codes experimental condition (1 = control, 2 = moderate customization, 3 = high customization). Because PROCESS allows only a single independent variable that must be either dichotomous or continuous, it must be tricked into estimating a model with a multicategorical independent variable. This is done by running PROCESS k 1 times, where k is the number of levels of the independent variable, and using k 1 group codes constructed prior the execution of PROCESS. At each run, one of the group codes is used as X 3
4 and the others as covariate(s), with the code serving as X being swapped with a covariate at subsequent PROCESS runs. So that the same bootstrap samples are used in consecutive executions, the random number generator should be seeded using the seed = command, with the same seed used time. This seed can be chosen arbitrarily. This code first constructs two dummy variables coding experimental condition with the control condition (cond = 1) as the reference category. The following PROCESS command then executes a mediation model with the first dummy variable as X and the other as a covariate. This generates estimates of a 1, a 2, b, c 1, c 2, c' 1, and c' 2 corresponding to the values in Table 2 of the manuscript, as well as a bias-corrected bootstrap confidence interval for a 1 b based on 10,000 bootstrap samples. The summary table at the end includes the three effects of X, which in this case are the relative total, direct, and indirect effects for moderate customization relative to the control condition (c 1, and c' 1, and a 1 b), in that order. compute d1=(cond=2). compute d2=(cond=3). process vars=attitude inter d1 d2/y=attitude/m=inter/x=d1/total=1/ model=4/boot=10000/seed=3423. Model = 4 Y = attitude X = d1 M = inter Statistical Controls: CONTROL= d2 Sample size 60 ************************************************************************** Outcome: inter Model Summary R R-sq F df1 df2 p Model coeff se t p LLCI ULCI constant d d ************************************************************************** Outcome: attitude Model Summary R R-sq F df1 df2 p Model coeff se t p LLCI ULCI constant inter d d ************************** TOTAL EFFECT MODEL **************************** 4
5 Outcome: attitude Model Summary R R-sq F df1 df2 p Model coeff se t p LLCI ULCI constant d d ***************** TOTAL, DIRECT, AND INDIRECT EFFECTS ******************** Total effect of X on Y Effect SE t p LLCI ULCI Direct effect of X on Y Effect SE t p LLCI ULCI Indirect effect of X on Y Effect Boot SE BootLLCI BootULCI inter ******************** ANALYSIS NOTES AND WARNINGS ************************* Number of bootstrap samples for bias corrected bootstrap confidence intervals: Level of confidence for all confidence intervals in output: Missing from the output above is the relative indirect effect for high customization relative to none (a 2 b) along with a bootstrap confidence interval for inference. The code below generates this relative indirect effect by switching d1 and d2 in the x= specification. Most of the output is identical to the code generated by the command above, so that output is suppressed by using the detail=0 option. Using the same random number seed as in the prior run of PROCESS produces a bootstrap confidence interval based on the same set of bootstrap samples. The effects for X in this summary table are the relative total, direct, and indirect effects for high customization relative to the control condition (c 2, c' 2, and a 2 b), in that order. process vars=attitude inter d1 d2/y=attitude/m=inter/x=d2/total=1/ model=4/boot=10000/seed=3423/detail=0. Model = 4 Y = attitude X = d2 M = inter Statistical Controls: CONTROL= d1 Sample size 60 ***************** TOTAL, DIRECT, AND INDIRECT EFFECTS ******************** 5
6 Total effect of X on Y Effect SE t p LLCI ULCI Direct effect of X on Y Effect SE t p LLCI ULCI Indirect effect of X on Y Effect Boot SE BootLLCI BootULCI inter ******************** ANALYSIS NOTES AND WARNINGS ************************* Number of bootstrap samples for bias corrected bootstrap confidence intervals: Level of confidence for all confidence intervals in output: The SPSS compute commands above generate indicator codes with the control group as the reference group. The commands to generate the contrast codes used in the example analysis would be if (cond=1) d1 = if (cond=1) d2 = 0. if (cond=2) d1 = if (cond=2) d2 = if (cond=3) d1 = if (cond=3) d2 = 0.5. For the sequential coding example described below, the following SPSS commands construct the sequential codes: compute d1 = (cond > 1). compute d2 = (cond > 2). The PROCESS macro is available for SAS but requires PROC IML. The command structure is very similar to the SPSS version, but the construction of group codes requires commands that are different than those used in SPSS. The SAS code below conducts the example analysis using indicator coding of groups, assuming the data reside in a SAS data file named web : data web;set web;d1=(cond=2);d2=(cond=3);run; %process (data=web,vars=attitude inter d1 d2,y=attitude,m=inter,x=d1, total=1,model=4,boot=10000,seed=3423); %process (data=web,vars=attitude inter d1 d2,y=attitude,m=inter,x=d2, total=1,model=4,boot=10000,seed=3423,detail=0); For the contrast codes corresponding to the example analysis in this paper, change the DATA line to read: data web;set web; if (cond=1) then do;d1=-0.667;d2=0;end; if (cond=2) then do;d1=0.333;d2=-0.5;end; if (cond=3) then do;d1=0.333;d2=0.5;end; run; 6
7 For the sequential codes described in the example below, the DATA line should read data web;set web;d1=(cond>1);d2=(cond>2);run; Estimation using MEDIATE for SPSS MEDIATE is a freely available SPSS macro (downloadable from [web address blinded for review]) that facilitates the estimation of mediation models with multicategorical independent variables along with the ability to generate bootstrap confidence intervals for indirect effects. It is very limited in its features relative to PROCESS, but it does have one handy option that automates the construction of codes for a categorical independent variable. The code and output below corresponds to the analysis of the web portal customization study using indicator coding of customization condition. Variables named ATTITUDE and INTER contain measurements of attitudes toward the web portal and perceived interactivity, respectively, and variable COND codes experimental condition (1 = control, 2 = moderate customization, 3 = high customization). The catx=1 option specifies indicator coding and sets the control condition as the reference group. See the documentation for additional information about the MEDIATE macro and its options. mediate y=attitude/x=cond/m=inter/samples=10000/total=1/catx=1. Run MATRIX procedure: VARIABLES IN THE FULL MODEL: Y = attitude M1 = inter X = cond CODING OF CATEGORICAL X FOR ANALYSIS: cond D1 D ***************************************************************************** OUTCOME VARIABLE: attitude MODEL SUMMARY (TOTAL EFFECTS MODEL) R R-sq Adj R-sq F df1 df2 p MODEL COEFFICIENTS (TOTAL EFFECTS MODEL) Coeff. s.e. t p Constant D D ***************************************************************************** OUTCOME VARIABLE: inter MODEL SUMMARY R R-sq Adj R-sq F df1 df2 p
8 MODEL COEFFICIENTS Coeff. s.e. t p Constant D D ***************************************************************************** OUTCOME VARIABLE: attitude MODEL SUMMARY R R-sq adj R-sq F df1 df2 p MODEL COEFFICIENTS Coeff. s.e. t p Constant inter D D TEST OF HOMOGENEITY OF REGRESSION (X*M INTERACTION) R-sq F df1 df2 p inter ***************************************************************************** INDIRECT EFFECT(S) THROUGH: inter Effect SE(boot) LLCI ULCI D D ********************* ANALYSIS NOTES AND WARNINGS ************************* NOTE: Indicator coding is used for categorical X Number of samples used for indirect effect confidence intervals: Level of confidence for confidence intervals: Bias corrected bootstrap confidence intervals for indirect effects are printed in output Sequential Coding of Groups In the web portal customization study, the three levels of the manipulation can be rank ordered with respect to degree of customization (none, moderate, or high). When the categories of a multicategorical predictor can be so ordered, sequential coding can be useful. With sequential codes, the relative direct and indirect effects can be interpreted as the effects of membership in one group relative to the group one step sequentially lower in the ordered system. Darlington (1990, pp ) describes sequential coding for a categorical variable with any number of ordered categories. With only three groups, the coding is relatively 8
9 simple. For the control condition (the lowest level of customization), D 1 and D 2 are set to 0, for the moderately customized condition (the next highest level of customization), D 1 = 1, D 2 = 0, and for the highest level of customization, D 1 = D 2 = 1. Estimating the coefficients in Equations 6, 7, and 8 in the manuscript yields the following results: i 1 = 4.250, i 2 = 2.810, i 3 = 4.335, a 1 = 1.575, p = 0.001; a 2 = 0.675, p = 0.166; b = 0.359, p < 0.001; c' 1 = 1.105, p = 0.003; c' 2 = 1.053, p =.002; c 1 = 1.670, p <.001; c 2 = 1.295, p < As with the other two methods of coding groups described in the manuscript, the resulting models reproduce the group means on M as well as Y (adjusted and unadjusted). The relative indirect effects are still estimated as products of coefficients. The a 1 coefficients quantify the mean differences in perceived interactivity between the moderate customization and control condition (a 1 ) and between the high and moderate customization conditions (a 2 ). That is, and a 1 = M moderate M control a 2 = M high M moderate When a 1 and a 2 are multiplied by the effect of interactivity on attitudes, holding customization condition constant (b = 0.359), the result is the relative indirect effects of customization on attitudes through perceived interactivity: and a 1 b = 1.575(0.359) = a 2 b = 0.675(0.359) = The relative indirect effect a 1 b estimates the indirect effect of moderate customization relative to none through perceived interactivity on attitudes. Those who browsed using a moderately customized portal had attitudes that were units more favorable on average (with a 95% bias-corrected bootstrap confidence interval from to 1.255) than those assigned to the noncustomized portal condition as a result of this indirect mechanism linking customization to attitudes through perceived interactivity. The relative indirect effect a 2 b estimates the indirect effect of high relative to moderate customization through perceived interactivity. Browsing with a highly customized portal resulted in attitudes that were units more favorable on average than browsing using a moderately customized portal as a result of this indirect mechanism linking customization to attitudes through perceived interactivity. Zero cannot be rejected as a plausible value for this indirect effect, as a 95% bias-corrected bootstrap confidence interval straddled zero ( to 0.710). Using this coding system, the relative direct effect c' 1 corresponds to the effect of moderate customization on attitudes relative to none, independent of perceived interactivity, and the relative direct effect c' 2 is the effect of high customization relative to moderate customization. This corresponds to differences between the adjusted means: * moderate * control c' 1 = Y Y = = * * c' 2 = Y high Y moderate = = As when other coding systems are used, the relative total effects can be estimated using Equation 8 in the manuscript or by adding the relative direct and indirect effects. With sequential coding, c 1 estimates the mean difference in attitude between the moderately customized and control groups, and c 2 estimates the mean difference in attitude between the highly customized and moderately customized groups. That is, 9
10 c 1 = Y moderate Y control c 2 = Y high Y moderate As both effects are positive, this suggests attitudes increase in favorability as customization increases. Finally, notice that as with indicator or contrast coding, the relative total effects partition cleanly into the relative direct and relative indirect effects: c 1 = c' 1 + a 1 b = = and c 2 = c' 2 + a 2 b = = Random Measurement Error The example analyses in the manuscript and this supplement ignore the potential influence of random measurement error in X, M, or Y. In experiments, and even when X is an observed categorical variable, measurement error in X is often negligible to nonexistent unless the categories were constructed through some kind of artificial categorization of a continuum or there is some ambiguity or subjectivity in the decision as to which category a particular case in the data belongs. But M and/or Y may and often do contain some random measurement error, such as when they are sum scores from a psychological test, personality inventory, or attitude scale. If M, Y, or both is measured with error, the result is bias in the estimation of the effects of X, reduced statistical power, or both (see, e.g., Darlington, 1990, pp ; Ledgerwood & Shrout, 2012). The method described in the manuscript can easily be extended using Mplus or another SEM program using single indicator latent variables with reliability-weighted errors (see e.g., Kline, 2005) or latent variable model with a measurement model component that links the latent variable causally to its indicators. Both approaches potentially reduce at least some of the deleterious effects of random measurement error. As with any measurement model, the researcher should ascertain whether the measurement model for the latent variable(s) satisfies various criteria for claiming good fit, for direct and indirect effects linking latent variables that are not modeled well have little substantive meaning. For discussions of latent variable mediation analysis, see Cheung and Lau (2008), Coffman and MacCallum (2005), Lau and Cheung (2012), and MacKinnon (2008). Multiple Mediators The approach we have illustrated for estimating relative indirect and direct effects can be extended to models with any number (m) of mediators operating in parallel. Figure S1 depicts a model with m proposed mediators and a multicategorical X with k categories. The relative total effects, c i, can be estimated if desired using Equation 8 in the manuscript, whereas the relative indirect and direct effects are pieced together from parameter estimates from m + 1 linear models, one for each of the m mediators and one for Y: M j = i 1j + a 1j D 1 + a 2j D a (k-1)j D k-1 + e Mj Y = i 2 + c' 1 D 1 + c' 2 D c' k-1 D k-1 + b 1 M 1 + b 2 M b m M m + e Y (S1) (S2) The same relationships among relative total, indirect, and direct effects exist in multiple-mediator models as in single-mediator models. The relative total effect for D i can be partitioned into the relative direct effect for D i plus the sum of the relative specific indirect effects for D i, a i1 b 1 + a i2 b a im b m. That is, m c c a b (S3) i i j 1 ij j This last term in Equation S3 is the relative total indirect effect of D i. Each relative specific indirect effect quantifies the component of the relative total indirect effect that is carried uniquely through that mediator. Inferential tests of relative specific indirect effects can be undertaken just as described in the manuscript, and these would typically be the focus of a mediation analysis. The Mplus code above can be modified without 10
11 difficulty to include multiple mediators, and the PROCESS and MEDIATE procedures for SPSS and SAS allow for multiple mediators operating in parallel in this fashion. See the documentation. Covariates and Confounds In a mediation model, the interpretation of an indirect effect as a causal one assumes that the mediator M is causally located between X and Y. That is, it is assumed that X causes M and M causes Y. When X is experimentally manipulated and sound experimental procedures are followed, a causal association between X and M and between X and Y is established by showing that the k groups differ on M and Y on average. Of course, as many others have emphasized before us (e.g., Bullock et al., 2010; Hayes, 2013; Mathieu, DeShon, & Bergh, 2008; Stone-Romero & Rosopa, 2010), this does not establish that M causes Y. It could be that Y causes M or that M and Y are spuriously associated (both are caused by some variable W) or epiphenomenally associated (M is correlated with the true intermediary variable W). If X is not experimentally manipulated, such threats to causal inference also exist in the interpretation of the association between X and M as well. Spuriousness and epiphenomenality, as alternative explanations at least with respect to a given competing variable W, can be accounted for in a mediation model by including W as an additional predictor or covariate in the models of M and Y. For example, Equations 1, 2, and 3 in the manuscript with the inclusion of W as a covariate would be M = i 1 + ax + d 1 W + e M Y = i 2 + c'x + bm + d 2 W + e Y Y = i 3 + cx + d 3 W+ e Y (S4) (S5) (S6) The addition of covariates is simple in any OLS regression program; covariates can be added to each of the ON statements in the Mplus code above, and the PROCESS and MEDIATE macros also accept covariates. References Bullock, J. G., Green, D. P., & Ha, S. E. (2010). Yes, but what is the mechanism? (Don t expect an easy answer). Journal of Personality and Social Psychology, 98, Cheung, G. W., & Lau, R. S. (2008). Testing mediation and suppression effects of latent variables: Bootstrapping with structural equation models. Organizational Research Methods, 11, Coffman, D. L., & MacCallum, R. C. (2005). Using parcels to convert path models into latent variable models. Multivariate Behavioral Research, 40, Darlington, R. B. (1990). Regression and linear models. New York: McGraw-Hill. Hayes, A. F. (2013). An introduction to mediation, moderation, and conditional process analysis. New York: The Guilford Press. Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67, Kline, R. B. (2005). Principles and practices of structural equation modeling (2 nd Ed.). New York: Guilford Press. Lau, R. S., & Cheung, G. W. (2012). Estimating and comparing specific mediation effects in complex latent variable models. Organizational Research Methods, 15, Ledgerwood, A., & Shrout, P. E. (2011). The trade-off between accuracy and precision in latent variable models of mediation processes. Journal of Personality and Social Psychology, 101, MacKinnon, D. P. (2008). An introduction to statistical mediation analysis. New York: Routledge. Mathieu, J. E., DeShon, R. P., & Bergh, D. D. (2008). Mediational inferences in organizational research. Organizational Research Methods, 11,
12 Preacher, K. J., & Hayes, A. F. (2008). Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models. Behavior Research Methods, 40, Stone-Romero, E. F., & Rosopa, P. J. (2010). Research design options for testing mediation models and their implications for facets of validity. Journal of Managerial Psychology, 25,
13 Figure S1. A multiple mediation model in path diagram form corresponding to a model with an independent variable X with k categories and m mediators operating in parallel. When estimating using a structural equation modeling program, it is recommended that the covariance between mediator errors be freely estimated (see e.g., Preacher and Hayes, 2008). 13
Moderation. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
Supplementary PROCESS Documentation
Supplementary PROCESS Documentation This document is an addendum to Appendix A of Introduction to Mediation, Moderation, and Conditional Process Analysis that describes options and output added to PROCESS
Statistical mediation analysis with a multicategorical independent variable
451 British Journal of Mathematical and Statistical Psychology (2014), 67, 451 470 2013 The British Psychological Society Expert Tutorial Statistical mediation analysis with a multicategorical independent
Model Templates for PROCESS for SPSS and SAS c 2013-2015 Andrew F. Hayes and The Guilford Press. Model 1. Conceptual Diagram. Statistical Diagram.
c 203-205 Andrew F. Hayes and The Guilford Press Model M b e M b 2 b 3 M Conditional effect of on = b + b 3 M c 203-205 Andrew F. Hayes and The Guilford Press Model Multicategorical M with k categories
Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA
Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or
Correlational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
Overview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
1.1. Simple Regression in Excel (Excel 2010).
.. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under
Regression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
Multiple Linear Regression in Data Mining
Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple
Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression
Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction
PSYCH 6811 Statistical Methods in Psychology II Spring 2015. Course Description
PSYCH 6811 Statistical Methods in Psychology II Spring 2015 Lecture : MW 9:35 to 10:55 in PS 35 Lab : Th 9:35 to 10:55 / 11:10 to 12:30 in PS 22 Instructor : Dr. Andrew F. Hayes (hayes.338, LZ 230) Teaching
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Introduction to Data Analysis in Hierarchical Linear Models
Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM
Multiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
Multivariate Analysis. Overview
Multivariate Analysis Overview Introduction Multivariate thinking Body of thought processes that illuminate the interrelatedness between and within sets of variables. The essence of multivariate thinking
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)
Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
When to Use a Particular Statistical Test
When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the
Psychology 205: Research Methods in Psychology
Psychology 205: Research Methods in Psychology Using R to analyze the data for study 2 Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 38 Outline 1 Getting ready
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
It Takes Four to Tango: Why a Variable Cannot Be a Mediator and a Moderator at the Same Time. Johann Jacoby and Kai Sassenberg
A MEDIATOR CANNOT ALSO BE A MODERATOR 1 It Takes Four to Tango: Why a Variable Cannot Be a Mediator and a Moderator at the Same Time Johann Jacoby and Kai Sassenberg Knowledge Media Research Center Tübingen
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com
SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
PROCESS: A Versatile Computational Tool for Observed Variable Mediation, Moderation, and Conditional Process Modeling 1
How to cite this document: Hayes, A. F. (2012). PROCESS: A versatile computational tool for observed variable mediation, moderation, and conditional process modeling [White paper]. Retrieved from http://www.afhayes.com/
Rens van de Schoot a b, Peter Lugtig a & Joop Hox a a Department of Methods and Statistics, Utrecht
This article was downloaded by: [University Library Utrecht] On: 15 May 2012, At: 01:20 Publisher: Psychology Press Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office:
Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.
Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged
Moderator and Mediator Analysis
Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,
SPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
2015 TUHH Online Summer School: Overview of Statistical and Path Modeling Analyses
: Overview of Statistical and Path Modeling Analyses Prof. Dr. Christian M. Ringle (Hamburg Univ. of Tech., TUHH) Prof. Dr. Jӧrg Henseler (University of Twente) Dr. Geoffrey Hubona (The Georgia R School)
Applications of Structural Equation Modeling in Social Sciences Research
American International Journal of Contemporary Research Vol. 4 No. 1; January 2014 Applications of Structural Equation Modeling in Social Sciences Research Jackson de Carvalho, PhD Assistant Professor
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
The primary goal of this thesis was to understand how the spatial dependence of
5 General discussion 5.1 Introduction The primary goal of this thesis was to understand how the spatial dependence of consumer attitudes can be modeled, what additional benefits the recovering of spatial
Introduction to Structural Equation Modeling (SEM) Day 4: November 29, 2012
Introduction to Structural Equation Modeling (SEM) Day 4: November 29, 202 ROB CRIBBIE QUANTITATIVE METHODS PROGRAM DEPARTMENT OF PSYCHOLOGY COORDINATOR - STATISTICAL CONSULTING SERVICE COURSE MATERIALS
Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection
Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Part 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
Ph.D. Workshop. Advanced Methods in Structural Equation Modeling Mediation and Moderation Analysis
Ph.D. Workshop Advanced Methods in Structural Equation Modeling Mediation and Moderation Analysis 15 th 17 th December 2015 Course content and aims: Empirical research in various disciplines (e.g., management
Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
Main Effects and Interactions
Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly
Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group
Introduction to Multilevel Modeling Using HLM 6 By ATS Statistical Consulting Group Multilevel data structure Students nested within schools Children nested within families Respondents nested within interviewers
Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models.
Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Dr. Jon Starkweather, Research and Statistical Support consultant This month
Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)
1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting
Mediation Analysis in Social Psychology: Current Practices and New Recommendations
Social and Personality Psychology Compass 5/6 (2011): 359 371, 10.1111/j.1751-9004.2011.00355.x Mediation Analysis in Social Psychology: Current Practices and New Recommendations Derek D. Rucker 1 *, Kristopher
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
10. Analysis of Longitudinal Studies Repeat-measures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
Using Correlation and Regression: Mediation, Moderation, and More
Using Correlation and Regression: Mediation, Moderation, and More Part 2: Mediation analysis with regression Claremont Graduate University Professional Development Workshop August 22, 2015 Dale Berger,
Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)
Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and
Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
The Basic Two-Level Regression Model
2 The Basic Two-Level Regression Model The multilevel regression model has become known in the research literature under a variety of names, such as random coefficient model (de Leeuw & Kreft, 1986; Longford,
From the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
Chapter 4 and 5 solutions
Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,
Chapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software
STATA Tutorial Professor Erdinç Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software 1.Wald Test Wald Test is used
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES
Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical
Schools Value-added Information System Technical Manual
Schools Value-added Information System Technical Manual Quality Assurance & School-based Support Division Education Bureau 2015 Contents Unit 1 Overview... 1 Unit 2 The Concept of VA... 2 Unit 3 Control
Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480
1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
Chapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
Latent Class Regression Part II
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Elements of statistics (MATH0487-1)
Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -
lavaan: an R package for structural equation modeling
lavaan: an R package for structural equation modeling Yves Rosseel Department of Data Analysis Belgium Utrecht April 24, 2012 Yves Rosseel lavaan: an R package for structural equation modeling 1 / 20 Overview
SEM Analysis of the Impact of Knowledge Management, Total Quality Management and Innovation on Organizational Performance
2015, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com SEM Analysis of the Impact of Knowledge Management, Total Quality Management and Innovation
Multivariate Logistic Regression
1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation
1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ
STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material
Chapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA
Examples: Mixture Modeling With Longitudinal Data CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA Mixture modeling refers to modeling with categorical latent variables that represent subpopulations
TIMSS 2011 User Guide for the International Database
TIMSS 2011 User Guide for the International Database Edited by: Pierre Foy, Alka Arora, and Gabrielle M. Stanco TIMSS 2011 User Guide for the International Database Edited by: Pierre Foy, Alka Arora,
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Yanchun Xu, Andrius Kubilius Joint Commission on Accreditation of Healthcare Organizations,
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
Ordinal Regression. Chapter
Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe
Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.
Consider a study in which How many subjects? The importance of sample size calculations Office of Research Protections Brown Bag Series KB Boomer, Ph.D. Director, [email protected] A researcher conducts
A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA
123 Kwantitatieve Methoden (1999), 62, 123-138. A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA Joop J. Hox 1 ABSTRACT. When we deal with a large data set with missing data, we have to undertake
COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-
Multiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
Everything You Wanted to Know about Moderation (but were afraid to ask) Jeremy F. Dawson University of Sheffield
Everything You Wanted to Know about Moderation (but were afraid to ask) Jeremy F. Dawson University of Sheffield Andreas W. Richter University of Cambridge Resources for this PDW Slides SPSS data set SPSS
This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.
One-Degree-of-Freedom Tests Test for group occasion interactions has (number of groups 1) number of occasions 1) degrees of freedom. This can dilute the significance of a departure from the null hypothesis.
Presentation Outline. Structural Equation Modeling (SEM) for Dummies. What Is Structural Equation Modeling?
Structural Equation Modeling (SEM) for Dummies Joseph J. Sudano, Jr., PhD Center for Health Care Research and Policy Case Western Reserve University at The MetroHealth System Presentation Outline Conceptual
MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING
Interpreting Interaction Effects; Interaction Effects and Centering Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Models with interaction effects
