SHORT COURSE ON MPLUS Getting Started with Mplus HANDOUT


Instructor: Cathy Zimmer

INTRODUCTION

a) Who am I? Who are you?
b) Overview of Course
   i) Mplus capabilities
   ii) The Mplus interface and Mplus files
       Data files, program files, analysis output, diagram files
   iii) Programming in Mplus
       Basic commands and options
   iv) Example
c) This course is designed to provide a basic introduction to Mplus and will get you started working in the program. The Odum Institute has documentation and consultation resources for specific problems or help with intermediate and advanced usage of Mplus.

**The Mplus website: let's see what is there! (Use Ctrl-F to search.)

Mplus CAPABILITIES

a) Mplus estimates a variety of models for continuous and categorical observed variables as well as continuous and categorical latent (unobserved) variables
b) Types of Analysis
   i) Linear regression (continuous outcomes)
   ii) Probit regression (binary and ordered outcomes)
   iii) Logistic regression (binary, ordered, and unordered outcomes)
   iv) Loglinear modeling/Poisson or negative binomial regression (count)
   v) Path analysis
   vi) Exploratory and confirmatory factor analysis
   vii) Structural equation modeling
   viii) Mixture modeling
   ix) Latent class analysis
   x) Growth modeling
   xi) Multiple group analysis
   xii) Multilevel or hierarchical modeling
   xiii) Discrete and continuous time survival analysis
   xiv) Monte Carlo studies (data simulation)
c) Features
   i) Missing data
   ii) Sampling/frequency weighting and complex sample analysis (stratification, clustering)
   iii) Multiple imputation and analyzing multiple data sets from multiple imputation
   iv) Modeling with means, intercepts, and thresholds
   v) Random intercepts and slopes
   vi) Latent variable interactions and non-linear factor analysis
   AND MORE!

THE Mplus INTERFACE AND Mplus FILES

a) To open Mplus:
   i) Double click on the Mplus icon on your desktop in the Quantitative Statistics folder.
   ii) Or go to Start > All Programs > Mplus > Mplus Editor
b) The Program Window is where you write your Mplus programs and then save them.
c) Opening an existing program file (.inp extension): this is a text file
d) Save data files in the same directory as program files (.dat or .csv extension)
   - Text/ASCII file
   - The data must reside in an external file
   - Data must be numeric except for missing value flags; there are NO variable names in the data set (names are described in the program file)
   - Mplus accepts no more than 500 variables in a data set
e) The data file can contain individual level data or summary data in the form of a covariance or correlation matrix with or without means and/or standard deviations.
   - Individual data can be in fixed or free format. (Free format is the default.)
   - Summary data (matrix data) must be in free format
   - Free format requires a comma, space, or tab delimiter.
f) Run Mplus program files by clicking on the blue RUN icon on the toolbar. Mplus automatically saves an output file that contains analysis results. Mplus applies the same name as the program file (.out extension). The output file contains:
   - A summary of the analysis specified: this shows how Mplus has interpreted the input from your program
   - A summary of the analysis: this shows which estimator and data file were used in the analysis along with iteration and convergence information
   - A summary of analysis results: this provides tests of model fit, parameter estimates, standard errors, z-statistics, standardized parameter estimates, etc.
   - Other information can be included in the output file if you specify this in your program
   - Output files also contain error messages and other notes. It is very important to read through the entire output since these messages are not always easy to see
g) Mplus has very limited data management capabilities. Most users clean, code, and format data in another package like SAS, Stata, or SPSS.

** Detour to learn stata2mplus command!

PROGRAMMING IN Mplus:

a) There are 10 programming commands altogether
b) Not all commands are required to run an analysis. However, the data and variable commands are required for every analysis
c) The commands may come in any order
d) All commands must begin on a new line and be followed by a colon
e) Semicolons separate command options (There can be more than one option per line.)
f) Lines in the program file cannot exceed 80 columns
g) User comments can be included and are preceded by an exclamation point
h) The keywords IS, ARE and = are interchangeable (Except with the define command.)
i) A hyphen (-) can be used to indicate a list of variables or numbers
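The rules above can be seen together in a small program. This is only a sketch: the file name mydata.dat and the variable names id, y1-y4, x1, and x2 are hypothetical and are not part of the course data.

```
! Comments start with an exclamation point
TITLE:    Syntax rules sketch with hypothetical names
DATA:     FILE IS mydata.dat;            ! each command is followed by a colon
VARIABLE: NAMES ARE id y1-y4 x1 x2;      ! y1-y4 stands for y1 y2 y3 y4
          USEVARIABLES = y1-y4 x1 x2;    ! = used in place of IS or ARE
          MISSING ARE .;                 ! options end with a semicolon
MODEL:    y1-y4 ON x1 x2;
```

Only the DATA and VARIABLE commands are required here; TITLE is optional, and every line stays well under 80 columns.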

Commands and what they do:

   Title        Gives identifying title to the analysis.
   Data         Identifies the location and name of the data set to be analyzed.
   Variable     Names and describes the variables in the data set to be analyzed.
   Define       Provides the ability to transform existing variables and to create new variables.
   Analysis     Describes the type of analysis to be performed.
   Model        Describes the model to be estimated.
   Output       Specifies options to customize the output.
   Savedata     Saves the analysis data and/or model results in ASCII files.
   Plot         Requests graphical displays of data and analysis results.
   Montecarlo   Defines the specifications of a Monte Carlo analysis.

The TITLE command:

Allows you to specify a title for the analysis in your program. This title is printed on the output file. The title command is optional.

   TITLE: Clinton Thermometer Regression

The DATA command:

This command identifies the location of the data set to be analyzed, describes the format and type of data in the data set, and specifies the number of observations for summary form data sets. The data command is required and the file option is required.

   DATA: FILE IS clinton data.dat;
         FORMAT IS free;
         TYPE IS individual;

The example above specifies a data file called clinton data.dat (which is in the I:\ Mplus zimmer directory). It specifies that the data is in free format and is individual level data (not summary data).

Data command options include:

   File            name and location of the data file
   Format          data file format
   Type            type of data file
   Nobservations   number of observations
   Ngroups         number of groups
   Variances       check for zero variances

The file option is required. The format and type options are optional. The default format is free and the default type is individual (so we did not really need them in the above example). Also, you do not need a directory specification if the data is in the same directory as the program file. With fixed data files, a FORTRAN-like format statement must be included on the format option.

Data types available under the type option:

   Individual      data matrix with observations on rows and variables on columns
   Covariance      lower triangular covariance matrix
   Correlation     lower triangular correlation matrix
   Fullcov         full covariance matrix
   Fullcorr        full correlation matrix
   Means           means
   Stdeviations    standard deviations
   Montecarlo      list of data sets
   Imputation      list of data sets

Means and standard deviations are combined with correlation matrix data.

When using summary data files, the nobservations option is required. This option tells Mplus how many observations are in the analysis. The ngroups option is used when doing a multiple group analysis. The variances option is used to check that the analysis variables do not have zero variance.

The VARIABLE command:

This command names and describes the variables in the data set to be analyzed. This command is required and the names option is required. The options under the variable command allow you to subset the data set on the observations used or the variables used in an analysis, to specify missing values, to indicate categorical dependent variables, and to identify variables with a special function (e.g., a weight variable).
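As a sketch of how the DATA and VARIABLE commands fit together when the input is summary data rather than individual data, consider the program below. The file name summary.dat, the variable names y, x1, and x2, and the sample size of 500 are all made up for illustration.

```
! Hypothetical free-format file summary.dat for three variables:
!   line 1:    the three means
!   line 2:    the three standard deviations
!   lines 3-5: lower triangle of the correlation matrix
TITLE:    Regression from summary data (sketch)
DATA:     FILE IS summary.dat;
          TYPE IS MEANS STDEVIATIONS CORRELATION;
          NOBSERVATIONS = 500;    ! required for summary data; value is made up
VARIABLE: NAMES ARE y x1 x2;
MODEL:    y ON x1 x2;
```

Because the file holds no raw observations, the nobservations option is the only way Mplus learns the sample size.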

   VARIABLE: NAMES ARE clinton dem ind follow liberal moderate badoff attend
                       age married educ faminc male white;
             USEVARIABLES ARE clinton age married educ faminc male white;
             MISSING ARE .;

In the NAMES option, variable names must be listed in the order that they appear in the data set! The example above names 14 variables, but subsets 7 of the 14 variables for the analysis. Missing values are identified as a period (.) for all variables.

Variable command options include:

   Names             names of variables in the data set
   Useobservations   selects observations
   Usevariables      variables to be analyzed
   Missing           indicates missing values for each variable (any numeric value or period, asterisk, or blank)
   Categorical       names of categorical dependent variables
   Nominal           names of unordered categorical dependent variables
   Count             names of count dependent variables
   Censored          names of censored dependent variables

Variables with Special Functions

   Nominal           names of unordered categorical variables
   Count             names of count variables
   Grouping          name of grouping variable
   Idvariable        name of an ID variable
   Centering         variables to be centered and method of centering

Complex sample options

   Cluster           names variable containing cluster information
   Strata            names variable containing stratification information
   Weight            names variable containing the case or sampling weight information

The DEFINE command:

Allows you to transform existing variables and to create new variables. The define command is optional.

   DEFINE: lginc = log(faminc);
           newvar = follow + attend;

The example above creates a new variable called lginc by taking the natural (base e) log of the variable faminc. It also creates newvar, which is the sum of the follow and attend variables. There are many other ways to mathematically transform variables and to recode variables under the define command. Defined variables must be listed last in the USEVARIABLES option. We will not be using the define command in our regression example today.

The ANALYSIS command:

This command is used to describe the type of analysis, the statistical estimator, the matrix to be analyzed, and the specifics of computation. The analysis command does not need to be used if you want to use the program defaults.

   ANALYSIS: TYPE IS general;
             ESTIMATOR = ML;

The example above tells Mplus that we want to do a general analysis using the maximum likelihood estimator, analyzing the covariance matrix. (All of the options in the example above are the defaults and not necessary in the program.)

The two important options under the analysis command are type and estimator. The other options are used to specify computational procedures.

Analysis types available under the type option:

   Mixture     mixture modeling
   Twolevel    multilevel modeling
   EFA         exploratory factor analysis
   Logistic    logistic regression
   General     all other analyses (the default). This includes models with relationships among observed variables, among continuous latent variables, and among observed and continuous latent variables.

Some common sub-options for a general type analysis include:

   Basic           sample and descriptive statistics
   Missing         allows analysis of missing data
   Meanstructure   allows estimation of means, thresholds, and intercepts
   Complex         allows for estimation of data that are clustered

The estimator option specifies the estimator to be used in the analysis. The default estimator differs depending on the type of analysis and the measurement of the dependent variable.

Available estimators:

   ML      maximum likelihood
   MLM     maximum likelihood, robust standard errors, & mean adjusted chi-square test statistic
   MLMV    maximum likelihood, robust standard errors, & mean and variance adjusted chi-square test statistic
   MLR     maximum likelihood with robust standard errors
   MLF     maximum likelihood w/ first order derivative standard errors
   WLS     weighted least squares
   WLSM    weighted least squares, robust standard errors, & mean adjusted chi-square test statistic
   WLSMV   weighted least squares, robust standard errors, & mean and variance adjusted chi-square test statistic
   GLS     generalized least squares
   ULS     unweighted least squares

There are many computational options under the analysis command, including numerical integration, bootstrapping, random starts, etc.

The matrix option allows you to specify whether a covariance or correlation matrix is to be analyzed. Some types of analyses require a certain type of matrix.

The MODEL command:

The model command describes the specific model to be estimated. The components of the model include (1) the measurement model for indicators of continuous latent variables, (2) the measurement model for indicators of categorical latent variables, and (3) the structural model relating latent variables.

   MODEL: clinton ON age married educ faminc male white;

The model command has three key words:

   BY ("measured by")
      Used to describe regression relationships in the measurement model. Defines the continuous latent variables in the model.

   ON ("Y regressed on X", as in linear regression)
      Used to describe the regression relationships in the structural model involving the continuous and categorical latent variables (and observed variables).

   WITH ("correlated with")
      Used to describe correlation (covariance) relationships in the measurement and structural models.

An example of specifying a more complex model:

[Path diagram: observed variables x1-x5 predict the latent variables Fa and Fb, which in turn predict the latent variable Fc. Fa is measured by y1a-y4a, Fb by y1b-y3b, and Fc by y1c-y4c.]

The measurement models (and the BY keyword):

   Fa BY y1a y2a y3a y4a;
   Fb BY y1b y2b y3b;
   Fc BY y1c y2c y3c y4c;

Measurement model defaults:

1. The factor loadings on the right side of the BY statement are freely estimated, except for the first variable (here y1a, y1b, and y1c), which has a factor loading fixed to 1.
2. The start value for factor loadings is 1.
3. Residual variances are estimated.
4. Residual covariances are fixed to zero.

The structural model (and the ON keyword):

   Fa Fb ON x1 x2 x3 x4 x5;
   Fc ON Fa Fb;

Structural model defaults:

1. The regression coefficients are freely estimated.
2. The start value for regression coefficients is 0.
3. Residual variances of latent variables are estimated.
4. Residuals of the dependent latent variables are correlated if they do not influence any other variables in the model.

Covariances (and the WITH keyword):

   x1 WITH x2 x3 x4 x5;
   x2 WITH x3 x4 x5;
   x3 WITH x4 x5;
   x4 WITH x5;

Covariances can be specified:
   - among independent (observed or latent) variables
   - among residuals of dependent (observed or latent) variables

The model command is also used to:
   - Provide information about means, variances and covariances of observed and latent variables
   - Specify the scales of unobserved variables
   - Fix and free parameters
   - Constrain parameters to be equal
   - Assign start values

Variances/Residual Variances/Means/Intercepts/Thresholds:

Variances are estimated for independent variables and residual variances are estimated for dependent variables. (Not the case for categorical latent variables.)
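Putting the BY and ON statements from this example into one place, the MODEL command might look like the sketch below. It assumes the variable names from the diagrammed example and ends every statement with a semicolon; the WITH statements are left out to keep it short.

```
MODEL:  ! measurement models
        Fa BY y1a y2a y3a y4a;
        Fb BY y1b y2b y3b;
        Fc BY y1c y2c y3c y4c;
        ! structural model
        Fa Fb ON x1 x2 x3 x4 x5;
        Fc ON Fa Fb;
```

The latent variables Fa, Fb, and Fc do not appear in the NAMES option; they come into existence through the BY statements.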

Fixing and Freeing Parameters and Assigning Start Values:

   Fa BY y1a* y2a y3a*0.5 y4a@1;

Here the factor loading of y1a is freed with the default start value of one (no longer fixed to 1), the y3a loading has a start value of 0.5, and the y4a loading is fixed to one.

The asterisk (*) will free a parameter and assign a starting value for the estimation of that parameter. You can assign start values to variances, means, thresholds, and scales.

The @ symbol will fix any parameter at the given value.

Constraining parameter values to be equal:

   Fa ON x1 x2 x3 x4 x5 (1);
   Fb ON x1 x2 x3 x4 x5 (1);

Here the regression parameters are constrained to be equal for the two latent variables Fa and Fb. Place the same number in parentheses following the parameters that are to be held equal. This convention can be used for all parameters.

The OUTPUT command:

Allows you to specify extra output not included by default. The output command is optional.

   OUTPUT: sampstat cinterval standardized;

The example above outputs sample statistics, confidence intervals, and standardized coefficients in addition to the normal output information.

Options for the output command are listed below. Some are inappropriate for certain types of analysis and cannot be used.

   Sampstat             sample statistics
   Modindices           modification indices, expected parameter change indices
   Standardized         three types of standardized coefficients and R-square values
   Residual             model estimated means, variances, and covariances and the differences between these and observed sample statistics
   Cinterval            95% and 99% confidence intervals for all of the parameter estimates
   Patterns             summary of missing data patterns
   Fscoefficient        factor score coefficients and factor score posterior covariance matrix
   Tech1 through Tech9  various technical outputs

See the User's Guide for even more options!

The SAVEDATA command:

Allows you to save data or output from an analysis in ASCII files for later use. You can output the following: individual level analysis data, sample correlation and covariance matrix, analysis results, tech3, and tech4. The savedata command is optional.

   SAVEDATA: FILE IS analysis.dat;
             SAMPLE IS sample.dat;
             RESULTS ARE results.dat;

The example above outputs three ASCII files. The first file contains the individual level analysis data, the second file contains the sample covariance matrix, and the third file contains analysis results.

The DIAGRAMMER:

You can draw your model and have an Mplus program generated from it, OR have Mplus produce a diagram of your model that you can edit and save AFTER running the model. To get the diagram, click on the Diagram drop-down menu, and then click on View diagram. The diagram is automatically saved, but you can save it under another name if you choose. Let me show you how to use it.

EXAMPLE FOR PRACTICE: Clinton Thermometer Regression

   TITLE:    Clinton Thermometer Regression
   DATA:     FILE IS clinton data.dat;
             FORMAT IS free;
             TYPE IS individual;
   VARIABLE: NAMES ARE clinton dem ind follow liberal moderate badoff attend
                       age married educ faminc male white;
             USEVARIABLES ARE clinton age married educ faminc male white;
             MISSING ARE .;
   !Comment out the define command because the operations don't make sense, just examples
   !define:
   !  lginc = log(faminc);
   !  newvar = follow + attend;
   ANALYSIS: TYPE IS general;
             ESTIMATOR = ML;
   MODEL:    clinton ON age married educ faminc male white;
   OUTPUT:   sampstat cinterval standardized;
   SAVEDATA: FILE IS analysis.dat;
             SAMPLE IS sample.dat;
             RESULTS ARE results.dat;

Data description for example data and program files:

Data file is clinton data.dat
Full regression program is clinton regress.inp

Data come from the 1998 National Election Study. There are 1281 observations.

Variables

 1. clinton      Clinton feeling thermometer (0 to 100, not favorable to favorable)
 2. democrat     party identification (1=Democrat, 0=not Democrat)
 3. independent  party identification (1=Independent, 0=not Independent)
 4. follow       follow government and public affairs (1="hardly at all" to 4="most of the time")
 5. liberal      liberal/conservative self placement (1=liberal, 0=not liberal)
 6. moderate     liberal/conservative self placement (1=moderate, 0=not moderate)
 7. badoff       how much better or worse off than last year (1="much better" to 5="much worse")
 8. attend       how often attend religious service (1="every week" to 5="never")
 9. age          age in years
10. married      marital status (1=married, 0=not married)
11. educ         school completed (1="8th grade or less" to 7="advanced degree")
12. faminc       family income (1="less than $2,999" to 24="$105,000 or more")
13. male         sex (1=male, 0=female)
14. white        race (1=white, 0=other race/ethnicity)
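As a possible follow-up exercise with these data, the sketch below shows the define command in use. The variable agesq (age squared) is a hypothetical addition that is not in the course files; note that it is listed last in USEVARIABLES, as defined variables must be.

```
TITLE:    Clinton Thermometer Regression with a quadratic age term
DATA:     FILE IS clinton data.dat;
VARIABLE: NAMES ARE clinton dem ind follow liberal moderate badoff attend
                    age married educ faminc male white;
          USEVARIABLES ARE clinton age married educ faminc male white
                    agesq;          ! defined variables go last
          MISSING ARE .;
DEFINE:   agesq = age*age;          ! hypothetical new variable
MODEL:    clinton ON age agesq married educ faminc male white;
OUTPUT:   sampstat standardized;
```

The format, type, and analysis options are omitted because the defaults (free format, individual data, general analysis) already apply.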


More information

IBM SPSS Missing Values 22

IBM SPSS Missing Values 22 IBM SPSS Missing Values 22 Note Before using this information and the product it supports, read the information in Notices on page 23. Product Information This edition applies to version 22, release 0,

More information

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.

More information

Imputing Missing Data using SAS

Imputing Missing Data using SAS ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are

More information

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

More information

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand

More information

Factor Analysis. Chapter 420. Introduction

Factor Analysis. Chapter 420. Introduction Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.

More information

From the help desk: Bootstrapped standard errors

From the help desk: Bootstrapped standard errors The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

More information

Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group

Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group Introduction to Multilevel Modeling Using HLM 6 By ATS Statistical Consulting Group Multilevel data structure Students nested within schools Children nested within families Respondents nested within interviewers

More information

Craig K. Enders Arizona State University Department of Psychology craig.enders@asu.edu

Craig K. Enders Arizona State University Department of Psychology craig.enders@asu.edu Craig K. Enders Arizona State University Department of Psychology craig.enders@asu.edu Topic Page Missing Data Patterns And Missing Data Mechanisms 1 Traditional Missing Data Techniques 7 Maximum Likelihood

More information

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your

More information

SAS Analyst for Windows Tutorial

SAS Analyst for Windows Tutorial Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating

More information

Ordinal Regression. Chapter

Ordinal Regression. Chapter Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

More information

Handling attrition and non-response in longitudinal data

Handling attrition and non-response in longitudinal data Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein

More information

Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise #5 Analysis of Time of Death Data for Soldiers in Vietnam

Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise #5 Analysis of Time of Death Data for Soldiers in Vietnam Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise #5 Analysis of Time of Death Data for Soldiers in Vietnam Objectives: 1. To use exploratory data analysis to investigate

More information

An Introduction to SPSS. Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman

An Introduction to SPSS. Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman An Introduction to SPSS Workshop Session conducted by: Dr. Cyndi Garvan Grace-Anne Jackman Topics to be Covered Starting and Entering SPSS Main Features of SPSS Entering and Saving Data in SPSS Importing

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Introduction to Longitudinal Data Analysis

Introduction to Longitudinal Data Analysis Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction

More information
