Common method bias in PLS-SEM: A full collinearity assessment approach
|
|
|
- Gerald Murphy
- 9 years ago
- Views:
Transcription
1 Common method bias in PLS-SEM: A full collinearity assessment approach Ned Kock Full reference: Kock, N. (2015). Common method bias in PLS-SEM: A full collinearity assessment approach. International Journal of e-collaboration, 11(4), Abstract We discuss common method bias in the context of structural equation modeling employing the partial least squares method (PLS-SEM). Two datasets were created through a Monte Carlo simulation to illustrate the discussion: one contaminated by common method bias, and the other not contaminated. A practical approach is presented for the identification of common method bias based on variance inflation factors generated via a full collinearity test. Our discussion builds on an illustrative model in the field of e-collaboration, with outputs generated by the software WarpPLS. We demonstrate that the full collinearity test is successful in the identification of common method bias with a model that nevertheless passes standard convergent and discriminant validity assessment criteria based on a confirmation factor analysis. KEYWORDS: Partial Least Squares; Structural Equation Modeling; Common Method Bias; Monte Carlo Simulation 1
2 Introduction The method of path analysis has been developed by Wright (1934; 1960) to study causal assumptions in the field of evolutionary biology (Kock, 2011), and now provides the foundation on which structural equation modeling (SEM) rests. Both path analysis and SEM rely on the creation of models expressing causal relationships through links among variables. Two main types of SEM exist today: covariance-based and PLS-based SEM. While the former relies on the minimization of differences between covariance matrices, the latter employs the partial least squares method (PLS) developed by Herman Wold (Wold, 1980). PLS-based SEM is often referred to simply as PLS-SEM, and is widely used in the field of e-collaboration and many other fields. Regardless of SEM flavor, models expressing causal assumptions include latent variables. These latent variables are measured indirectly through other variables generally known as indicators (Maruyama, 1998; Mueller, 1996). Indicator values are usually obtained from questionnaires where answers are provided on numeric scales, of which the most commonly used are Likert-type scales (Cohen et al., 2003). Using questionnaires answered on Likert-type scales constitutes an integral part of an SEM study s measurement method. Common method bias is a phenomenon that is caused by the measurement method used in an SEM study, and not by the network of causes and effects among latent variables in the model being studied. We provide a discussion of common method bias in PLS-SEM, and of a method for its identification based on full collinearity tests (Kock & Lynn, 2012). Our discussion builds on an illustrative model in the field of e-collaboration, with outputs from the software WarpPLS, version 5.0 (Kock, 2015). The algorithm used to generate latent variable scores based on indicators was PLS Mode A, employing the path weighting scheme. While this is the algorithm-scheme combination most commonly used in PLS-SEM, it is by no means the only combination available. The recent emergence of factor-based PLS-SEM algorithms further broadened the space of existing combinations (Kock, 2014). We created two datasets based on a Monte Carlo simulation (Robert & Casella, 2005; Paxton et al., 2001). One of the two datasets was contaminated by common method bias; the other was not. We demonstrate that the full collinearity test is successful in the identification of common method bias with a model that nevertheless passes standard validity assessment criteria based on a confirmation factor analysis. In our discussion all variables are assumed to be standardized; i.e., scaled to have a mean of zero and standard deviation of one. This has no impact on the generality of the discussion. Standardization of any variable is accomplished by subtraction of its mean and division by its standard deviation. A standardized variable can be rescaled back to its original scale by reversing these operations. What is common method bias? Common method bias, in the context of PLS-SEM, is a phenomenon that is caused by the measurement method used in an SEM study, and not by the network of causes and effects in the model being studied. For example, the instructions at the top of a questionnaire may influence the answers provided by different respondents in the same general direction, causing the 2
3 indicators to share a certain amount of common variation. Another possible cause of common method bias is the implicit social desirability associated with answering questions in a questionnaire in a particularly way, again causing the indicators to share a certain amount of common variation. A mathematical understanding of common method bias can clarify some aspects of its nature. The adoption of an illustrative model can help reduce the level of abstraction of a mathematical exposition. Therefore, our discussion is based on the illustrative model depicted in Figure 1, which is inspired by an actual empirical study in the field of e-collaboration (Kock, 2005; 2008; Kock & Lynn, 2012). The illustrative model incorporates three latent variables, each measured through six indicators. It assumes that the unit of analysis is the firm. Figure 1. Illustrative model The latent variables are: collaborative culture (F ), the perceived degree to which a firm s culture promotes continuous collaboration among its members to improve the firm s productivity and the quality of the firm s products; e-collaboration technology use (F ), the perceived degree of use of e-collaboration technologies by the members of a firm; and competitive advantage (F ), the perceived degree of competitive advantage that a firm possesses when compared with firms that compete with it. Mathematically, if our model were not contaminated with common method bias, each of the six indicators x would be derived from its latent variable F (of which there are three in the model) according to (1), where: λ is the loading of indicator x on F, θ is the standardized indicator error term, and ω is the weight of θ with respect to x. x = λ F + ω θ, i = 1 3, j = 1 6. (1) Since θ and F are assumed to be uncorrelated, the value of ω in this scenario can be easily obtained as: ω = 1 λ. If our model were contaminated with common method bias, each of the six indicators x would be derived from its latent variable F according to (2), where: M is a standardized variable 3
4 that represents common method variation, and ω is the common method weight (a.k.a. common method loading, or the positive square root of the common method variance). x = λ F + ω M + ω θ, i = 1 3, j = 1 6. (2) In this scenario, the value of ω can be obtained as: ω = 1 λ ω. In (2) we assume that the common method weight ω is the same for all indicators. An alternative perspective assumes that the common method weight ω is not the same for all indicators, varying based on a number of factors. Two terms are used to refer to these different perspectives, namely congeneric and noncongeneric, although there is some confusion in the literature as to which term refers to what perspective. Note that the term ω M introduces common variation that is shared by all indicators in the model. Since latent variables aggregate indicators in PLS-SEM, this shared variation has the effect of artificially increasing the level of collinearity among latent variables. As we will see later, this also has the predictable effect of artificially increasing path coefficients. Data used in the analysis We created two datasets of 300 rows of data, equivalent to 300 returned questionnaires, with answers provided on Likert-type scales going from 1 to 7. This was done based on a Monte Carlo simulation (Robert & Casella, 2005; Paxton et al., 2001). The data was created for the three latent variables and the eighteen indicators (six per latent variable) in our illustrative model. Using this method we departed from a true model, which is a model for which we know the nature and magnitude of all of the relationships among variables beforehand. One of the two datasets was contaminated by common method bias; the other was not. In both datasets path coefficients and loadings were set as follows: β = β = β =.45. λ =.7, i = 1 3, j = 1 6. That is, all path coefficients were set as. 45 and all indicator loadings as. 7. In the dataset contaminated by common method bias, the common method weight was set to a value slightly lower than the indicator loadings: ω =.6. In Monte Carlo simulations where samples of finite size are created, true sample coefficients vary. Usually true sample coefficients vary according to a normal distribution centered on the true population value. Given this, and since we created a single sample of simulated data, our true sample coefficients differed from the true population coefficients. Nevertheless, when we compared certain coefficients obtained via a PLS-SEM analysis for the two datasets, with and without contamination, the effects of common method bias became 4
5 visible. This is particularly true for path coefficients, which tend to be inflated by common method bias. As noted earlier, path coefficient inflation is a predictable outcome of shared variation among latent variables. Path coefficient inflation Table 1 shows the path coefficients for the models not contaminated by common method bias (No CMB) and contaminated (CMB). As we can see, all three path coefficients were greater in the model contaminated by common method bias. The differences among path coefficients ranged from a little over 20 to nearly 40 percent. Table 1. Path coefficients β 21 β 31 β 32 No CMB CMB Note: CMB = common method bias. This path coefficient inflation effect is one of the key reasons why researchers are concerned about common method bias, as it may cause type I errors (false positives). Nevertheless, common method bias may also be associated with path coefficient deflation, potentially leading to type II errors (false negatives). As we can see, the inflation effect can lead to marked differences in path coefficients. In the case of the path coefficient β, the difference is of approximately percent. As noted earlier, path coefficient inflation occurs because common variation is introduced, being shared by all indicators in the model. As latent variables aggregate indicators, they also incorporate the common variation, leading to an increase in the level of collinearity among latent variables. Greater collinearity levels in turn lead to inflated path coefficients. One of the goals of a confirmatory factor analysis is to assess two main types of validity in a model: convergent and discriminant validity. Acceptable convergent validity occurs when indicators load strongly on their corresponding latent variables. Acceptable discriminant validity occurs when the correlations among a latent variable and other latent variables in a model are lower than a measure of communality among the latent variable indicators. Given these expectations underlying acceptable convergent and discriminant validity, one could expect that a confirmatory factor analysis would allow for the identification of common method bias. In fact, many researchers in the past have proposed the use of confirmatory factor analysis as a more desirable alternative to Harman s single-factor test a widely used common method bias test that relies on exploratory factor analysis. Unfortunately, as we will see in the next section, conducting a confirmatory factor analysis is not a very effective way of identifying common method bias. Models may pass criteria for acceptable convergent and discriminant validity, and still be contaminated by common method bias. Confirmatory factor analysis Table 2 is a combined display showing loadings and cross-loadings. Loadings, shown in shaded cells, are unrotated. Cross-loadings are oblique-rotated. Acceptable convergent validity would normally be assumed if the loadings were all above a certain threshold, typically.5. As we 5
6 can see, all loadings pass this test. This is the case for both models, with and without common method bias contamination. That is, both models present acceptable convergent validity. These results highlight one interesting aspect of the common method bias phenomenon in the context of PLS-SEM. There appears to be a marked inflation in loadings, similarly to what was observed for path coefficients. Since convergent validity relies on the comparison of loadings against a fixed threshold, then it follows that common method bias would tend to artificially increase the level of convergent validity of a model. Table 3 shows correlations among latent variables and square roots of average variances extracted (AVEs). The latter are shown in shaded cells, along diagonals. Acceptable discriminant validity would typically be assumed if the number in the diagonal cell for each column is greater than any of the other numbers in the same column. Table 2. Assessing convergent validity No CMB CMB F 1 F 2 F 3 F 1 F 2 F 3 x x x x x x x x x x x x x x x x x x Notes: CMB = common method bias; loadings are unrotated and cross-loadings are oblique-rotated; loadings shown in shaded cells. Table 3. Assessing discriminant validity No CMB CMB F 1 F 2 F 3 F 1 F 2 F 3 F F F Notes: Square roots of average variances extracted (AVEs) shown on shaded diagonal. That is, if the square root of the AVE for a given latent variable is greater than any correlation involving that latent variable, and this applies to all latent variables in a model, then the model presents acceptable discriminant validity. As we can see, this is the case for both of our models, 6
7 with and without common method bias contamination. Both models can thus be assumed to display acceptable discriminant validity. Here we see another interesting aspect of the common method bias phenomenon in the context of PLS-SEM. While correlations among latent variables increase, the same happens with the AVEs. This simultaneous increase in correlations and AVEs is what undermines the potential of a discriminant validity check in the identification of common method bias. In summary, two key elements of a traditional confirmatory factor analysis are a convergent validity test and a discriminant vadity test. According to our analysis, neither test seems to be very effective in the identification of common method bias. An analogous analysis was conducted by Kock & Lynn (2012), which prompted them to offer the full collinearity test as an effective alternative for the identification of common method bias. The full collinearity test Collinearity has classically been defined as a predictor-predictor phenomenon in multiple regression models. In this traditional perspective, when two or more predictors measure the same underlying construct, or a facet of such construct, they are said to be collinear. This definition is restricted to classic, or vertical, collinearity. Lateral collinearity is defined as a predictor-criterion phenomenon, whereby a predictor variable measures the same underlying construct, or a facet of such construct, as a variable to which it points in a model. The latter is the criterion variable in the predictor-criterion relationship of interest. Kock & Lynn (2012) proposed the full collinearity test as comprehensive procedure for the simultaneous assessment of both vertical and lateral collinearity (see, also, Kock & Gaskins, 2014). Through this procedure, which is fully automated by the software WarpPLS, variance inflation factors (VIFs) are generated for all latent variables in a model. The occurrence of a VIF greater than 3.3 is proposed as an indication of pathological collinearity, and also as an indication that a model may be contaminated by common method bias. Therefore, if all VIFs resulting from a full collinearity test are equal to or lower than 3.3, the model can be considered free of common method bias. Table 4 shows the VIFs obtained for all the latent variables in both of our models, based on a full collinearity test. As we can see, the model contaminated with common method bias includes a latent variable with VIF greater than 3.3, which is shown in a shaded cell. That is, the common method bias test proposed by Kock & Lynn (2012), based on the full collinearity test procedure, seems to succeed in the identification of common method bias. Table 4. Full collinearity VIFs F 1 F 2 F 3 No CMB CMB Note: CMB = common method bias. While it is noteworthy that the full collinearity test was successful in the identification of common method bias in a situation where a confirmation factor analysis was not, this success is not entirely surprising given our previous discussion based on the mathematics underlying common method bias. That discussion clearly points at an increase in the overall level of 7
8 collinearity in a model, corresponding to an increase in the full collinearity VIFs for the latent variables in the model, as a clear outcome of common method bias. Discussion and conclusion There is disagreement among methodological researchers about the nature of common method bias, how it should be addressed, and even whether it should be addressed at al. Richardson et al. (2009) discuss various perspectives about common method bias, including the perspective put forth by Spector (1987) that common method bias is an urban legend. Assuming that the problem is real, what can we do to avoid common method bias in the first place? A seminal source in this respect is Podsakoff et al. (2003), who provide a number of suggestions on how to avoid the introduction of common method bias during data collection. Our discussion focuses on the identification of common method bias based on full collinearity assessment, whereby a model is checked for the existence of both vertical and lateral collinearity (Kock & Gaskins, 2014; Kock & Lynn, 2012). If we find evidence of common method bias, is there anything we can do to eliminate or at least reduce it? The answer is arguably yes, and, given the focus of our discussion, the steps discussed by Kock & Lynn (2012) for dealing with collinearity are an obvious choice: indicator removal, indicator re-assignment, latent variable removal, latent variable aggregation, and hierarchical analysis. Readers are referred to that publication for details on how and when to implement these steps. Full collinearity VIFs tend to increase with model complexity, in terms of number of latent variables in the model, because: (a) the likelihood that questions associated with different indicators will overlap in perceived meaning goes up as the size of a questionnaire increases, which should happen as the number of constructs covered grows; and (b) the likelihood that latent variables will overlap in terms of the facets of the constructs to which they refer goes up as more latent variables are added to a model. Models found in empirical research studies in the field of e-collaboration typically contain more than three latent variables. This applies to many other fields where path analysis and SEM are employed. Therefore, we can reasonably conclude that our illustration of the full collinearity test of common method bias discussed here is conservative in its demonstration of the likely effectiveness of the test in actual empirical studies. Kock & Lynn (2012) pointed out that classic PLS-SEM algorithms are particularly effective at reducing model-wide collinearity, because those algorithms maximize the variance explained in latent variables by their indicators. Such maximization is due in part to classic PLS-SEM algorithms not modeling measurement error, essentially assuming that it is zero. As such, the indicators associated with a latent variable always explain 100 percent of the variance in the latent variable. Nevertheless, one of the key downsides of classic PLS-SEM algorithms is that path coefficients tend to be attenuated (Kock, 2015b). In a sense, they reduce collinearity levels too much. The recently proposed factor-based PLS-SEM algorithms (Kock, 2014) address this problem. Given this, one should expect the use of factor-based PLS-SEM algorithms to yield slightly higher full collinearity VIFs than classic PLS-SEM algorithms, with those slightly higher VIFs being a better reflection of the true values. Consequently, the VIF threshold used in common method bias tests should arguably be somewhat higher than 3.3 when factor-based PLS-SEM algorithms are used. In their discussion of possible thresholds, Kock & Lynn (2012) note that a VIF of 5 could be employed when algorithms that incorporate measurement error are used. Even though they made this remark in 8
9 reference to covariance-based SEM algorithms, the remark also applies to factor-based PLS- SEM algorithms, as both types of algorithms incorporate measurement error. Our goal here is to help empirical researchers who need practical and straightforward methodological solutions to assess the overall quality of their measurement frameworks. To that end, we discussed and demonstrated a practical approach whereby researchers can conduct common method bias assessment based on a full collinearity test of a model. Our discussion was illustrated with outputs of the software WarpPLS (Kock, 2015), in the context of e-collaboration research. Nevertheless, our discussion arguably applies to any field where path analysis and SEM can be used. Acknowledgments The author is the developer of the software WarpPLS, which has over 7,000 users in more than 33 different countries at the time of this writing, and moderator of the PLS-SEM distribution list. He is grateful to those users, and to the members of the PLS-SEM distribution list, for questions, comments, and discussions on topics related to SEM and to the use of WarpPLS. References Cohen, J., Cohen, P., West, S.G., & Aiken, L.S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences. Mahwah, N.J.: L. Erlbaum Associates. Kock, N. (2005). What is e-collaboration. International Journal of E-collaboration, 1(1), 1-7. Kock, N. (2008). E-collaboration and e-commerce in virtual worlds: The potential of Second Life and World of Warcraft. International Journal of e-collaboration, 4(3), Kock, N. (2011). A mathematical analysis of the evolution of human mate choice traits: Implications for evolutionary psychologists. Journal of Evolutionary Psychology, 9(3), Kock, N. (2014). A note on how to conduct a factor-based PLS-SEM analysis. Laredo, TX: ScriptWarp Systems. Kock, N. (2015). WarpPLS 5.0 User Manual. Laredo, TX: ScriptWarp Systems. Kock, N. (2015b). One-tailed or two-tailed P values in PLS-SEM? International Journal of e- Collaboration, 11(2), 1-7. Kock, N., & Gaskins, L. (2014). The mediating role of voice and accountability in the relationship between Internet diffusion and government corruption in Latin America and Sub-Saharan Africa. Information Technology for Development, 20(1), Kock, N., & Lynn, G.S. (2012). Lateral collinearity and misleading results in variance-based SEM: An illustration and recommendations. Journal of the Association for Information Systems, 13(7), Maruyama, G.M. (1998). Basics of structural equation modeling. Thousand Oaks, CA: Sage Publications. Mueller, R.O. (1996). Basic principles of structural equation modeling. New York, NY: Springer. Paxton, P., Curran, P.J., Bollen, K.A., Kirby, J., & Chen, F. (2001). Monte Carlo experiments: Design and implementation. Structural Equation Modeling, 8(2),
10 Podsakoff, P.M., MacKenzie, S.B., Lee, J.Y., & Podsakoff, N.P. (2003). Common method biases in behavioral research: A critical review of the literature and recommended remedies. Journal of Applied Psychology, 88(5), Richardson, H.A., Simmering, M.J., & Sturman, M.C. (2009). A tale of three perspectives: Examining post hoc statistical techniques for detection and correction of common method variance. Organizational Research Methods, 12(4), Robert, C.P., & Casella, G. (2005). Monte Carlo statistical methods. New York, NY: Springer. Spector, P.E. (1987). Method variance as an artifact in self-reported affect and perceptions at work: Myth or significant problem? Journal of Applied Psychology, 72(3), Wold, H. (1980). Model construction and evaluation when theoretical knowledge is scarce. In J. Kmenta and J. B. Ramsey (Eds.), Evaluation of econometric models (pp ). Waltham, MA: Academic Press. Wright, S. (1934). The method of path coefficients. The Annals of Mathematical Statistics, 5(3), Wright, S. (1960). Path coefficients and path regressions: Alternative or complementary concepts? Biometrics, 16(2),
Advanced mediating effects tests, multi-group analyses, and measurement model assessments in PLS-based SEM
Advanced mediating effects tests, multi-group analyses, and measurement model assessments in PLS-based SEM Ned Kock Full reference: Kock, N. (2014). Advanced mediating effects tests, multi-group analyses,
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA ABSTRACT The decision of whether to use PLS instead of a covariance
Applications of Structural Equation Modeling in Social Sciences Research
American International Journal of Contemporary Research Vol. 4 No. 1; January 2014 Applications of Structural Equation Modeling in Social Sciences Research Jackson de Carvalho, PhD Assistant Professor
Chapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
Constructing a TpB Questionnaire: Conceptual and Methodological Considerations
Constructing a TpB Questionnaire: Conceptual and Methodological Considerations September, 2002 (Revised January, 2006) Icek Ajzen Brief Description of the Theory of Planned Behavior According to the theory
Overview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
Multivariate Analysis of Variance (MANOVA)
Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction
This chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
Multiple Imputation for Missing Data: A Cautionary Tale
Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust
UNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
Exploratory Factor Analysis
Introduction Principal components: explain many variables using few new variables. Not many assumptions attached. Exploratory Factor Analysis Exploratory factor analysis: similar idea, but based on model.
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In
INTRODUCTION TO MULTIPLE CORRELATION
CHAPTER 13 INTRODUCTION TO MULTIPLE CORRELATION Chapter 12 introduced you to the concept of partialling and how partialling could assist you in better interpreting the relationship between two primary
Canonical Correlation Analysis
Canonical Correlation Analysis LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the similarities and differences between multiple regression, factor analysis,
How to report the percentage of explained common variance in exploratory factor analysis
UNIVERSITAT ROVIRA I VIRGILI How to report the percentage of explained common variance in exploratory factor analysis Tarragona 2013 Please reference this document as: Lorenzo-Seva, U. (2013). How to report
Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
Introduction to Fixed Effects Methods
Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed
Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS)
Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS) April 30, 2008 Abstract A randomized Mode Experiment of 27,229 discharges from 45 hospitals was used to develop adjustments for the
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016
and Principal Components Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 Agenda Brief History and Introductory Example Factor Model Factor Equation Estimation of Loadings
Empirical Methods in Applied Economics
Empirical Methods in Applied Economics Jörn-Ste en Pischke LSE October 2005 1 Observational Studies and Regression 1.1 Conditional Randomization Again When we discussed experiments, we discussed already
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
1 Teaching notes on GMM 1.
Bent E. Sørensen January 23, 2007 1 Teaching notes on GMM 1. Generalized Method of Moment (GMM) estimation is one of two developments in econometrics in the 80ies that revolutionized empirical work in
Reliability Analysis
Measures of Reliability Reliability Analysis Reliability: the fact that a scale should consistently reflect the construct it is measuring. One way to think of reliability is that other things being equal,
An introduction to Value-at-Risk Learning Curve September 2003
An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk
Test Bias. As we have seen, psychological tests can be well-conceived and well-constructed, but
Test Bias As we have seen, psychological tests can be well-conceived and well-constructed, but none are perfect. The reliability of test scores can be compromised by random measurement error (unsystematic
UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)
UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design
Restaurant Tips 1. Restaurant Tips and Service Quality: A Weak Relationship or Just Weak Measurement? Michael Lynn. School of Hotel Administration
Restaurant Tips 1 Running Head: RESTAURANT TIPS AND SERVICE Restaurant Tips and Service Quality: A Weak Relationship or Just Weak Measurement? Michael Lynn School of Hotel Administration Cornell University
Exploratory Factor Analysis
Exploratory Factor Analysis ( 探 索 的 因 子 分 析 ) Yasuyo Sawaki Waseda University JLTA2011 Workshop Momoyama Gakuin University October 28, 2011 1 Today s schedule Part 1: EFA basics Introduction to factor
Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business
Factor Analysis Advanced Financial Accounting II Åbo Akademi School of Business Factor analysis A statistical method used to describe variability among observed variables in terms of fewer unobserved variables
Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS
Sensitivity Analysis 3 We have already been introduced to sensitivity analysis in Chapter via the geometry of a simple example. We saw that the values of the decision variables and those of the slack and
Moderation. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
PRINCIPAL COMPONENT ANALYSIS
1 Chapter 1 PRINCIPAL COMPONENT ANALYSIS Introduction: The Basics of Principal Component Analysis........................... 2 A Variable Reduction Procedure.......................................... 2
Moderator and Mediator Analysis
Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,
Correlational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
Do Commodity Price Spikes Cause Long-Term Inflation?
No. 11-1 Do Commodity Price Spikes Cause Long-Term Inflation? Geoffrey M.B. Tootell Abstract: This public policy brief examines the relationship between trend inflation and commodity price increases and
FACTOR ANALYSIS NASC
FACTOR ANALYSIS NASC Factor Analysis A data reduction technique designed to represent a wide range of attributes on a smaller number of dimensions. Aim is to identify groups of variables which are relatively
Introduction to Principal Components and FactorAnalysis
Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Common factor analysis
Common factor analysis This is what people generally mean when they say "factor analysis" This family of techniques uses an estimate of common variance among the original variables to generate the factor
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
Partial Least Squares (PLS) Regression.
Partial Least Squares (PLS) Regression. Hervé Abdi 1 The University of Texas at Dallas Introduction Pls regression is a recent technique that generalizes and combines features from principal component
IMPACT OF TRUST, PRIVACY AND SECURITY IN FACEBOOK INFORMATION SHARING
IMPACT OF TRUST, PRIVACY AND SECURITY IN FACEBOOK INFORMATION SHARING 1 JithiKrishna P P, 2 Suresh Kumar R, 3 Sreejesh V K 1 Mtech Computer Science and Security LBS College of Engineering Kasaragod, Kerala
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling Background Bryan Orme and Rich Johnson, Sawtooth Software March, 2009 Market segmentation is pervasive
Imputing Missing Data using SAS
ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are
2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Chapter 6: The Information Function 129. CHAPTER 7 Test Calibration
Chapter 6: The Information Function 129 CHAPTER 7 Test Calibration 130 Chapter 7: Test Calibration CHAPTER 7 Test Calibration For didactic purposes, all of the preceding chapters have assumed that the
Trust, Job Satisfaction, Organizational Commitment, and the Volunteer s Psychological Contract
Trust, Job Satisfaction, Commitment, and the Volunteer s Psychological Contract Becky J. Starnes, Ph.D. Austin Peay State University Clarksville, Tennessee, USA [email protected] Abstract Studies indicate
Testing for Granger causality between stock prices and economic growth
MPRA Munich Personal RePEc Archive Testing for Granger causality between stock prices and economic growth Pasquale Foresti 2006 Online at http://mpra.ub.uni-muenchen.de/2962/ MPRA Paper No. 2962, posted
A Brief Introduction to Factor Analysis
1. Introduction A Brief Introduction to Factor Analysis Factor analysis attempts to represent a set of observed variables X 1, X 2. X n in terms of a number of 'common' factors plus a factor which is unique
A Review of Methods. for Dealing with Missing Data. Angela L. Cool. Texas A&M University 77843-4225
Missing Data 1 Running head: DEALING WITH MISSING DATA A Review of Methods for Dealing with Missing Data Angela L. Cool Texas A&M University 77843-4225 Paper presented at the annual meeting of the Southwest
JungHoon (Jay) Lee Hospitality Management and Dietetics Kansas State University. and
Effects of Workplace Friendship on Employee Job Satisfaction, Organizational Citizenship Behavior, Turnover Intention, Absenteeism, and Task Performance JungHoon (Jay) Lee Hospitality Management and Dietetics
WHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Practical Guide to the Simplex Method of Linear Programming
Practical Guide to the Simplex Method of Linear Programming Marcel Oliver Revised: April, 0 The basic steps of the simplex algorithm Step : Write the linear programming problem in standard form Linear
Introduction to Data Analysis in Hierarchical Linear Models
Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM
Calculating VaR. Capital Market Risk Advisors CMRA
Calculating VaR Capital Market Risk Advisors How is VAR Calculated? Sensitivity Estimate Models - use sensitivity factors such as duration to estimate the change in value of the portfolio to changes in
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
Multiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
Structural Equation Modelling (SEM)
(SEM) Aims and Objectives By the end of this seminar you should: Have a working knowledge of the principles behind causality. Understand the basic steps to building a Model of the phenomenon of interest.
The Method of Least Squares
Hervé Abdi 1 1 Introduction The least square methods (LSM) is probably the most popular technique in statistics. This is due to several factors. First, most common estimators can be casted within this
Chapter 7 Factor Analysis SPSS
Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
MANUFACTURING AND CONTINUOUS IMPROVEMENT AREAS USING PARTIAL LEAST SQUARE PATH MODELING WITH MULTIPLE REGRESSION COMPARISON
MANUFACTURING AND CONTINUOUS IMPROVEMENT AREAS USING PARTIAL LEAST SQUARE PATH MODELING WITH MULTIPLE REGRESSION COMPARISON Carlos Monge 1, Jesús Cruz Álvarez 2, Jesús Fabián López 3 Abstract: Structural
Work in Progress # 140. Torsten J. Gerpott/Nima Ahmadi
Torsten J. Gerpott/Nima Ahmadi Composite Indices for the Evaluation of a Country s Information Technology Development Level: Extensions of the IDI of the ITU Work in Progress # 140 December 2014 Abstract
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Decision Analysis. Here is the statement of the problem:
Decision Analysis Formal decision analysis is often used when a decision must be made under conditions of significant uncertainty. SmartDrill can assist management with any of a variety of decision analysis
Pearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
Validation of the Core Self-Evaluations Scale research instrument in the conditions of Slovak Republic
Validation of the Core Self-Evaluations Scale research instrument in the conditions of Slovak Republic Lenka Selecká, Jana Holienková Faculty of Arts, Department of psychology University of SS. Cyril and
Factor Analysis and Structural equation modelling
Factor Analysis and Structural equation modelling Herman Adèr Previously: Department Clinical Epidemiology and Biostatistics, VU University medical center, Amsterdam Stavanger July 4 13, 2006 Herman Adèr
Organisation Profiling and the Adoption of ICT: e-commerce in the UK Construction Industry
Organisation Profiling and the Adoption of ICT: e-commerce in the UK Construction Industry Martin Jackson and Andy Sloane University of Wolverhampton, UK [email protected] [email protected] Abstract:
TIME SERIES ANALYSIS AS A MEANS OF MANAGERIA DECISION MAKING IN MANUFACTURING INDUSTRY
TIME SERIES ANALYSIS AS A MEANS OF MANAGERIA DECISION MAKING IN MANUFACTURING INDUSTRY 1 Kuranga L.J, 2 Ishola James.A, and 3 Ibrahim Hamzat G. 1 Department of Statistics Kwara State Polytechnic Ilorin,Nigeria
MATHEMATICS AS THE CRITICAL FILTER: CURRICULAR EFFECTS ON GENDERED CAREER CHOICES
MATHEMATICS AS THE CRITICAL FILTER: CURRICULAR EFFECTS ON GENDERED CAREER CHOICES Xin Ma University of Kentucky, Lexington, USA Using longitudinal data from the Longitudinal Study of American Youth (LSAY),
Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection
Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.
Understanding the Impact of Weights Constraints in Portfolio Theory
Understanding the Impact of Weights Constraints in Portfolio Theory Thierry Roncalli Research & Development Lyxor Asset Management, Paris [email protected] January 2010 Abstract In this article,
Exploratory Factor Analysis
Exploratory Factor Analysis Definition Exploratory factor analysis (EFA) is a procedure for learning the extent to which k observed variables might measure m abstract variables, wherein m is less than
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
How to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint
RESEARCH METHODS IN I/O PSYCHOLOGY
RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 01/17/11 [Arthur]
Understanding Power and Rules of Thumb for Determining Sample Sizes
Tutorials in Quantitative Methods for Psychology 2007, vol. 3 (2), p. 43 50. Understanding Power and Rules of Thumb for Determining Sample Sizes Carmen R. Wilson VanVoorhis and Betsy L. Morgan University
A COMPARISON ANALYSIS ON THE INTENTION TO CONTINUED USE OF A LIFELONG LEARNING WEBSITE
International Journal of Electronic Business Management, Vol. 10, No. 3, pp. 213-223 (2012) 213 A COMPARISON ANALYSIS ON THE INTENTION TO CONTINUED USE OF A LIFELONG LEARNING WEBSITE Hsiu-Li Liao * and
Analysis of Bayesian Dynamic Linear Models
Analysis of Bayesian Dynamic Linear Models Emily M. Casleton December 17, 2010 1 Introduction The main purpose of this project is to explore the Bayesian analysis of Dynamic Linear Models (DLMs). The main
Partial Estimates of Reliability: Parallel Form Reliability in the Key Stage 2 Science Tests
Partial Estimates of Reliability: Parallel Form Reliability in the Key Stage 2 Science Tests Final Report Sarah Maughan Ben Styles Yin Lin Catherine Kirkup September 29 Partial Estimates of Reliability:
Premaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
LINKING BUSINESS ANALYTICS TO DECISION MAKING EFFECTIVENESS: A PATH MODEL ANALYSIS
LINKING BUSINESS ANALYTICS TO DECISION MAKING EFFECTIVENESS: A PATH MODEL ANALYSIS Key words: Business analytics; Information processing capability; Decision-making effectiveness; Information processing
STA 4107/5107. Chapter 3
STA 4107/5107 Chapter 3 Factor Analysis 1 Key Terms Please review and learn these terms. 2 What is Factor Analysis? Factor analysis is an interdependence technique (see chapter 1) that primarily uses metric
Impact of Knowledge Management and Organizational Learning on Different Dimensions of Organizational Performance: A Case Study of Asian Food Industry
Impact of Knowledge Management and Organizational Learning on Different Dimensions of Organizational : A Case Study of Asian Food Industry Huang Hui 1, Che Wan Jasimahbt Wan Mohamed Radzi 1, HashemSalarzadeh
Partial Least Squares Structural Equation Modeling (PLS-SEM) Techniques Using SmartPLS
Partial Least Squares Structural Equation Modeling (PLS-SEM) Techniques Using SmartPLS Ken Kwong-Kay Wong SmartPLS is one of the prominent software applications for Partial Least Squares Structural Equation
Analyzing Structural Equation Models With Missing Data
Analyzing Structural Equation Models With Missing Data Craig Enders* Arizona State University [email protected] based on Enders, C. K. (006). Analyzing structural equation models with missing data. In G.
INTEREST RATES AND FX MODELS
INTEREST RATES AND FX MODELS 8. Portfolio greeks Andrew Lesniewski Courant Institute of Mathematical Sciences New York University New York March 27, 2013 2 Interest Rates & FX Models Contents 1 Introduction
Lasso on Categorical Data
Lasso on Categorical Data Yunjin Choi, Rina Park, Michael Seo December 14, 2012 1 Introduction In social science studies, the variables of interest are often categorical, such as race, gender, and nationality.
