Indices of Model Fit STRUCTURAL EQUATION MODELING 2013

Similar documents
Applications of Structural Equation Modeling in Social Sciences Research

Evaluating the Fit of Structural Equation Models: Tests of Significance and Descriptive Goodness-of-Fit Measures

Goodness of fit assessment of item response theory models

Introduction to Path Analysis

SPSS and AMOS. Miss Brenda Lee 2:00p.m. 6:00p.m. 24 th July, 2015 The Open University of Hong Kong

I n d i a n a U n i v e r s i t y U n i v e r s i t y I n f o r m a t i o n T e c h n o l o g y S e r v i c e s

lavaan: an R package for structural equation modeling

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Confirmatory factor analysis in MPlus

Presentation Outline. Structural Equation Modeling (SEM) for Dummies. What Is Structural Equation Modeling?

SPSS Guide: Regression Analysis

This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.

Psychology 405: Psychometric Theory Homework on Factor analysis and structural equation modeling

Simple Linear Regression Inference

Evaluating Goodness-of-Fit Indexes for Testing Measurement Invariance

Descriptive Statistics

Rens van de Schoot a b, Peter Lugtig a & Joop Hox a a Department of Methods and Statistics, Utrecht

Multivariate Normal Distribution

Specification of Rasch-based Measures in Structural Equation Modelling (SEM) Thomas Salzberger

SEM Analysis of the Impact of Knowledge Management, Total Quality Management and Innovation on Organizational Performance

USING MULTIPLE GROUP STRUCTURAL MODEL FOR TESTING DIFFERENCES IN ABSORPTIVE AND INNOVATIVE CAPABILITIES BETWEEN LARGE AND MEDIUM SIZED FIRMS

Chapter 6: Multivariate Cointegration Analysis

Use of deviance statistics for comparing models

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

The Technology Acceptance Model with Online Learning for the Principals in Elementary Schools and Junior High Schools

Overview of Factor Analysis

How To Know If A Model Is True Or False

Mplus Tutorial August 2012

Use of structural equation modeling in operations management research: Looking back and forward

11. Analysis of Case-control Studies Logistic Regression

Comparison of Estimation Methods for Complex Survey Data Analysis

Chapter 7: Simple linear regression Learning Objectives

Part 2: Analysis of Relationship Between Two Variables

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure

Multivariate Logistic Regression

August 2012 EXAMINATIONS Solution Part I

How To Check For Differences In The One Way Anova

Logistic Regression.

The Latent Variable Growth Model In Practice. Individual Development Over Time

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models

SAS Software to Fit the Generalized Linear Model

Factors Affecting Demand Management in the Supply Chain (Case Study: Kermanshah Province's manufacturing and distributing companies)

HLM software has been one of the leading statistical packages for hierarchical

Regression Analysis: A Complete Example

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

MAGNT Research Report (ISSN ) Vol.2 (Special Issue) PP:

Structural Equation Modelling (SEM)

Week TSX Index

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

Rank-Based Non-Parametric Tests

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

This content downloaded on Tue, 19 Feb :28:43 PM All use subject to JSTOR Terms and Conditions

Factorial Invariance in Student Ratings of Instruction

How To Understand Multivariate Models

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Gerry Hobbs, Department of Statistics, West Virginia University

Simple Second Order Chi-Square Correction

Factors That Improve the Quality of Information Technology and Knowledge Management System for SME(s) in Thailand

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Analyzing Structural Equation Models With Missing Data

Recall this chart that showed how most of our course would be organized:

International Statistical Institute, 56th Session, 2007: Phil Everson

Multivariate Analysis of Variance (MANOVA): I. Theory

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

THE UNIVERSITY OF CHICAGO, Booth School of Business Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Homework Assignment #2

Technical report. in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE

Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study

Binary Logistic Regression

Using Stata for Categorical Data Analysis

One-Way Analysis of Variance (ANOVA) Example Problem

Statistical Functions in Excel

Multiple Linear Regression in Data Mining

Penalized regression: Introduction

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Chapter 15. Mixed Models Overview. A flexible approach to correlated data.

The lavaan tutorial. Yves Rosseel Department of Data Analysis Ghent University (Belgium) June 28, 2016

Section 13, Part 1 ANOVA. Analysis Of Variance

Longitudinal Data Analysis

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

CHAPTER 5 COMPARISON OF DIFFERENT TYPE OF ONLINE ADVERTSIEMENTS. Table: 8 Perceived Usefulness of Different Advertisement Types

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

2. What is the general linear model to be used to model linear trend? (Write out the model) = or

NCSS Statistical Software

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Normality Testing in Excel

The Relationship Between Epistemological Beliefs and Self-regulated Learning Skills in the Online Course Environment

Transcription:

Indices of Model Fit STRUCTURAL EQUATION MODELING 2013

Indices of Model Fit A recommended minimal set of fit indices that should be reported and interpreted when reporting the results of SEM analyses: 1. Model chi-square 2. Steiger-Lind root mean square error of approximation (RMSEA; Steiger, 1990) with its 90% confidence interval 3. Bentler comparative fit index (CFI; Bentler, 1990) 4. Standardized root mean square residual (SRMR)

Model Chi-square The model chi-square is the most basic fit statistic, and it is reported in virtually all reports of SEM analyses Most if not all other fit indices include the model chi-square, so it is a key ingredient. It equals the product (N-1) F ML where N-1 are the sample degrees of freedom and F ML is the value of the statistical criterion (fitting function) minimized in ML estimation. In large samples and assuming multivariate normality, (N-1 F ML is distributed as a Pearson chi-square statistic with degrees of freedom equal to df M

Model Chi-square (continued) Recall that df M is the difference between the number of observations and the number of free model parameters. The statistic (N-1) F ML under the assumptions just stated is referred to here as χ 2 M. The term χ 2 M is also known as the likelihood ratio chi-square or generalized likelihood ratio. The value of χ 2 M for a just identified model generally equals zero and has no degrees of freedom If χ 2 M = 0, the model perfectly fits the data

Model Chi-square (continued) As the value of χ 2 M increases, the fit of an overidentified model becomes increasingly worse In the sense of χ 2 M is actually a badness of fit index because the higher its value, the worse the model s correspondence to the data For an overidentified model (i.e. df M 0), χ 2 M tests the null hypothesis that the model is correct (i.e., it has perfect fit in the population) Therefore, it is the failure to reject the null hypothesis that supports the researcher s model

Model Chi-sqs Reported by LISREL Minimum Fit Function Chi-Square Located in table of fit indices in the output (n-1)(minimum VALUE OF FIT FUNCTION) Fit function is maximum likelihood Normal Theory Weighted Least Squares Chi-Square This appears underneath the path diagram and output (n-1)(minimum VALUE OF THE WEIGHTED LEAST SQUARES FIT FUNCTION) Fit function uses weights to correct for non-normality

Chi-Square Statistic a function of sample size The latter means that if the sample size is large, the value of χ 2 M may lead to rejection of the model even though differences between observed and predicted covariances are slight Thus, rejection of basically any overidentified model based on χ 2 M requires only a sufficiently large sample

Chi-Sq/DF Some researchers divide the model chi-square by its degrees of freedom to reduce the effect of sample size This generally results in a lower value called the normed chisquare: NC=χ 2 M/df M However, there is no clear-cut guideline about what value of the NC is minimally acceptable For example, Bollen (1989) notes that values of the NC of 2.0, 3.0, or even as high as 5.0 have been recommended as indicated reasonable fit

Root Mean Square Error of Approximation (RMSEA) The RMSEA has the following characteristics. It: 1. Is a parsimony-adjusted index in that its formula includes a built-in correction for model complexity (i.e. It favors simpler models) 2. Also takes sample size into account 3. Is usually reported in computer output with a confidence interval, which explicitly acknowledges that the RMSEA is subject to sampling error as are all fit indices The RMSEA measures error of approximation, and for this reason it is sometimes referred to as a population-based index The RMSEA is a badness of fit index in that a value of 0 indicates the best fit and higher values indicate worse fit

The formula is RMSEA (continued) RMSEA = RMSEA estimates the amount of error of approximation per model degree of freedom and takes sample size into account RMSEA = 0 says only that χ 2 M = df M, not that χ 2 M perfect) = 0 (i.e., fit is Rules of thumb by Browne and Cudeck (1993): RMSEA.05 indicates close approximate fit Values between.05 and.08 suggest reasonable error of approximation RMSEA 1 suggests poor fit

Comparative Fit Index (CFI) The CFI is one of a class of fit statistics known as incremental or comparative fit indices, which are among the most widely used in SEM All such indexes assess the relative improvement in fit of the researcher s model compared with a baseline model The latter is typically the independence model also called the null model which assumes zero population covariances among the observed variables A rule of thumb for the CFI and other incremental indices is that values greater than roughly.90 may indicate reasonably good fit of the researcher s model (Hu & Beltler, 1999) Similar to the RMSEA, however, CFI = 1.00 means only that χ 2 M, df M not that the model has perfect fit

Standardized Root Mean Square Residual (SRMR) The root mean square residual (RMSR) is a measure of the mean absolute value of the covariance residuals Perfect model fit is indicated by RMR = 0, and increasingly higher values indicate worse fit (i.e., badness of fit index) One problem with the RMR is that because it is computed with unstandardized variables, its range depends on the scales of the observed variables The SRMR is based on transforming both the sample covariance matrix and the predicted covariance matrix into correlation matrices The SRMR is thus a measure of the mean absolute correlation residual, the overall difference between the observed and predicted correlations Values of the SRMR less than.10 are generally considered favorable.

Testing Nested Models Two path models are hierarchical or nested if one is a subset of the other (e.g. a path is dropped from Model A to form Model B) There are two general contexts in which hierarchical path models are compared: 1. Model trimming: Paths are eliminated from a model (i.e. it is simplified) 2. Model building: Paths are added to a model (i.e. it is made more complex) As a model is trimmed, its fit to the data usually becomes progressively worse (e.g. χ 2 M increases). As a model is built, its fit to the data usually becomes progressively better (e.g. χ 2 M decreases) The goal is to find a parsimonious model that still fits the data reasonably well.

Delta Chi-Square Test The Chi-Square difference statistic, df D, can be used to test the statistical significance of the decrement in overall fit as paths are deleted or the improvement in fit as paths are added The statistic χ 2 D is just the difference between the χ 2 M values of two hierarchical models estimated with the same data Its degrees of freedom, df D equal the difference between the two respective values of df M.

Comparing Non-Nested Models Sometimes researchers specify alternative models that are not hierarchically related Although the values of χ 2 M from two different nonhierarchical models can still be compared, the difference between them cannot be interpreted as a test statistic This is where a predictive fit index, such as the Akaike information criterion (AIC) comes in handy A predictive fit index assesses model fit in hypothetical replication samples the same size and randomly drawn from the same population as the researcher s original sample Specifically, the model in a set of competing nonhierarchical models estimated with the same data with the smallest AIC is preferred. This is the model with the best predicted fit in a hypothetical replication sample considering the degree of fit in the actual sample and parsimony compared with the other models