861 Example SPLH. 5 page 1. prefer to have. New data in. SPSS Syntax FILE HANDLE. VARSTOCASESS /MAKE rt. COMPUTE mean=2. COMPUTE sal=2. END IF.
|
|
|
- Britton Lucas
- 10 years ago
- Views:
Transcription
1 SPLH 861 Example 5 page 1 Multivariate Models for Repeated Measures Response Times in Older and Younger Adults These data were collected as part of my masters thesis, and are unpublished in this form (to see the way I d prefer to have analyzed the data, see Hoffman & Rovine, 2007 Behavior Research Methods). The outcome was the log-transformed mean per conditionn of response time to detect changes in driving scenes that were either of low/high meaningfulness to driving or low/high visual salience (i.e., a 2x2 repeated measures design). This sample includes 97 younger adults (age range= 18 32) and 59 older adults (age range = 63 86). We will specify piecewise effects of age that create mean differences between youngerr and older adults as well as the effect of age within the older adults. Original data in multivariate format (was one row per person, outcomes in separate columns): New data in stacked format (one row per outcome per person) after transformation code below: SPSS Syntax for Stacking into Univariate (now one row per outcome per person): * Define location of files used in code below. FILE HANDLE filesave /NAME = "C:\Dropbox\14_SPLH861\861_Example5". * Import example 5 multivariate data into work library and stack it. GET FILE = "filesave/spss_example5.sav". DATASET NAME Example5 WINDOW=FRONT. VARSTOCASESS /MAKE rt FROM rt11 rt12 rt21 rt22 /INDEX = condition (4) /KEEP = ALL. * Create condition variables. DO IF (condition=1). COMPUTE mean=1. COMPUTE sal=1. END IF. DO IF (condition=2). COMPUTE mean=1. COMPUTE sal=2. END IF. DO IF (condition=3). COMPUTE mean=2.
2 SPLH 861 Example 5 page 2 COMPUTE sal=1. END IF. DO IF (condition=4). COMPUTE mean=2. COMPUTE sal=2. END IF. * Label new stacked variables. VARIABLE LABELS condition "condition: Index for Outcome (1-4)" mean "Meaning (1=Low, 2=High)" sal "Salience (1=Low, 2=High)" rt "rt: Combined Response Time across Conditions". * Create value labels for conditon variables. VALUE LABELS mean sal 1 "1Low" 2 "2High". * Create variables for analysis. COMPUTE logrt=ln(rt). IF (old=0) yrs65=0. IF (old=1) yrs65=age-65. * Label new variables. VARIABLE LABELS logrt "logrt: Natural Log of Response Time" yrs65 "yrs65: Age in Older Adult Group (0=65)". EXECUTE. STATA Syntax for Stacking into Univariate (now one row per outcome per person): * Defining global variable for file location to be replaced in code below global filesave "C:\Dropbox\14_SPLH861\861_Example5" * Import example 5 multivariate data into work library and stack it * List multivariate variables first, i(personid) j(condition) use "$filesave\stata_example5.dta", clear reshape long rt, i(personid) j(condition) * Create condition variables gen mean=1 gen sal=1 recode mean (1=2) if condition==21 recode mean (1=2) if condition==22 recode sal (1=2) if condition==12 recode sal (1=2) if condition==22 * Label new stacked variables label variable condition "condition: Index for Outcome" label variable mean "Meaning (1=Low, 2=High)" label variable sal "Salience (1=Low, 2=High)" label variable rt "rt: Combined Response Time across Conditions" * Create value labels for condition variables label define fcondition 1 "1Low" 2 "2High" label values mean sal fcondition * Create variables for analysis gen logrt=ln(rt) gen yrs65=0 replace yrs65=age-65 if old==1 * Label new variables label variable logrt "logrt: Natural Log of Response Time" label variable yrs65 "yrs65: Age in Older Adult Group (0=65)" SAS Syntax for Stacking into Univariate (now one row per outcome per person): * Import example 5 multivariate data into work library and stack it; DATA work.example5; SET filesave.sas_example5; condition=1; mean=1; sal=1; rt=rt11; OUTPUT; * Low meaning, low salience; condition=2; mean=1; sal=2; rt=rt12; OUTPUT; * Low meaning, high salience; condition=3; mean=2; sal=1; rt=rt21; OUTPUT; * High meaning, low salience; condition=4; mean=2; sal=2; rt=rt22; OUTPUT; * High meaning, high salience; * Label new stacked variables;
3 SPLH 861 Example 5 page 3 LABEL condition= "condition: Index for Outcome (1-4)" mean= "Meaning (1=Low, 2=High)" sal= "Salience (1=Low, 2=High)" rt= "rt: Combined Response Time across Conditions"; * Drop old multivariate outcomes; DROP rt11--rt22; RUN; * Create format (like value label) to use for condition variables; PROC FORMAT; VALUE fcondition 1="1Low" 2="2High"; RUN; * Create variables for analysis; DATA work.example5; SET work.example5; * Log RT to improve residual normality; logrt=log(rt); * Format condition variables; FORMAT mean sal fcondition.; * Create piecewise slope for age; IF old=0 THEN yrs65=0; ELSE IF old=1 THEN yrs65=age-65; * Label new variables; LABEL logrt= "logrt: Natural Log of Response Time" yrs65= "yrs65: Age in Older Adult Group (0=65)"; RUN; Empty Multivariate Model Predicting Log RT: This model predicts the RT in condition c for person i Although this model doesn t look empty, it is each outcome has its own mean with no other predictors. Condition means are thus created by: Low Salience High Salience Low Meaning High Meaning Let s start with the answer key model for the variance: An unstructured R matrix in which all variances and covariances across the four outcomes are estimated separately ( multivariate ANOVA): ECHO 'SPSS Empty Multivariate Model: RT Mean Differences for Meaning by Salience;'. ECHO 'Unstructured R Matrix'. MIXED logrt BY PersonID condition mean sal SPSS: /PRINT = R provides R /METHOD = REML /PRINT = SOLUTION TESTCOV R matrix, but RCORR is not available /FIXED = mean sal mean*sal /REPEATED = condition COVTYPE(UN) SUBJECT(PersonID). display as result "STATA Empty Multivariate Model:" display as result "RT Mean Differences for Meaning by Salience" display as result "Unstructured R Matrix" mixed logrt ib(last).mean##ib(last).sal, /// personid:, noconstant variance reml /// residuals(unstructured,t(condition)), estat ic, n(156), estat wcorrelation, covariance, estat wcorrelation, estimates store UN STATA: estat ic provides AIC and BIC, where n( ) provides sample size (# persons) to be used in BIC estat wcorrelation, covariance R matrix estat wcorrelation RCORR matrix TITLE1 "SAS Empty Multivariate Model: RT Mean Differences for Meaning by Salience"; TITLE2 "Unstructured R Matrix"; PROC MIXED DATA=work.Example5 COVTEST NOCLPRINT NAMELEN=100 IC METHOD=REML; CLASS PersonID condition mean sal; MODEL logrt = mean sal@2 / SOLUTION DDFM=Satterthwaite; REPEATED condition / R RCORR TYPE=UN SUBJECT=PersonID; SAS: R and RCORR to show in output RUN; TITLE1; TITLE2;
4 SPLH 861 Example 5 page 4 SAS Output from Unstructured R Matrix model: Estimated R Matrix for PersonID 1 Row Col1 Col2 Col3 Col This R matrix holds the variances and covariances across conditions. Given complete data, it will exactly match those in original data (although complete data is not required). Do the variances appear to differ across conditions? Estimated R Correlation Matrix for PersonID 1 Row Col1 Col2 Col3 Col This RCORR matrix holds the correlations across conditions. Given complete data, it will exactly match those in the original data (although complete data is not required). Do the correlations appear to differ across conditions? Fit Statistics -2 Res Log Likelihood AIC (smaller is better) AICC (smaller is better) BIC (smaller is better) This is the sum of the individual log-likelihoods multiplied by 2. It is the best possible fit for the model for the variance. Now let s see if we could have used a simpler model: Compound Symmetry, in which all variances are predicted to be equal and all covariances are predicted to be equal, too ( Univariate ANOVA): ECHO 'SPSS Empty Multivariate Model: RT Mean Differences for Meaning by Salience;'. ECHO 'Compound Symmetry R Matrix'. MIXED logrt BY PersonID condition mean sal /METHOD = REML /PRINT = SOLUTION TESTCOV R /FIXED = mean sal mean*sal /REPEATED = condition COVTYPE(CS) SUBJECT(PersonID). display as result "STATA Empty Multivariate Model:" display as result "RT Mean Differences for Meaning by Salience" display as result "Compound Symmetry R Matrix" mixed logrt ib(last).mean##ib(last).sal, /// personid:, noconstant variance reml /// residuals(exchangeable,t(condition)), estat ic, n(156), estat wcorrelation, covariance, estat wcorrelation, estimates store CS lrtest UN CS TITLE1 "SAS Empty Multivariate Model: RT Mean Differences for Meaning by Salience"; TITLE2 "Compound Symmetry R Matrix"; PROC MIXED DATA=work.Example5 COVTEST NOCLPRINT NAMELEN=100 IC METHOD=REML; CLASS PersonID condition mean sal; MODEL logrt = mean sal@2 / SOLUTION DDFM=Satterthwaite; REPEATED condition / R RCORR TYPE=CS SUBJECT=PersonID; RUN; TITLE1; TITLE2; SAS Output from Compound Symmetry R Matrix model: Estimated R Matrix for PersonID 1 Row Col1 Col2 Col3 Col This R matrix now predicts the residual variance to be regardless of condition. Part of it (0.1460) is due to mean RT differences across persons, and the rest ( = 0.056) is from within-condition residual variation.
5 SPLH 861 Example 5 page 5 Estimated R Correlation Matrix for PersonID 1 Row Col1 Col2 Col3 Col Covariance Parameter Estimates Standard Cov Parm Subject Estimate Error Value Pr Z CS PersonID <.0001 Residual <.0001 Z This RCORR matrix now predicts the residual correlation to be regardless of condition. This table gives the separately estimated parameters that create the R matrix pattern. Do NOT use these p-values! Fit Statistics -2 Res Log Likelihood AIC (smaller is better) AICC (smaller is better) BIC (smaller is better) Does this CS model with only 2 parameters fit worse than the UN model with 10 parameters (1 for each possible variance and covariance; 2LL = 336.6)? 2ΔLL (8) = = 35, p <.001, so yes, CS fits worse (UN fits better) Now let s examine the main and interactive effects of age group and age in the older group on RT using an unstructured R matrix for the variance and covariance across the meaning*salience conditions. Note that interactions of age group by years over 65 are NOT included (and are not logically possible)! ECHO 'SPSS Conditional Multivariate Model: Add Age Group and Years over 65;'. ECHO 'Unstructured R Matrix'. MIXED logrt BY PersonID condition mean sal WITH old yrs65 SPSS: BY = categorical, WITH = continuous /METHOD = REML No fixed effect interaction shortcuts /PRINT = SOLUTION TESTCOV R /FIXED = mean sal mean*sal old old*mean old*sal old*mean*sal yrs65 yrs65*mean yrs65*sal yrs65*mean*sal /REPEATED = condition COVTYPE(UN) SUBJECT(PersonID). display as result "STATA Conditional Multivariate Model:" display as result "Add Age Group and Years over 65" display as result "Unstructured R Matrix" mixed logrt ib(last).mean##ib(last).sal##old /// ib(last).mean##ib(last).sal##yrs65, /// personid:, noconstant variance reml /// residuals(unstructured,t(condition)), estat ic, n(156), estat wcorrelation, covariance, estat wcorrelation, STATA: i. = categorical, c. = continuous ## estimates all possible interaction and lower-order main effects SAS: CLASS = categorical (default is continuous) estimates all interaction and lower-order main effects up to order specified TITLE1 "SAS Conditional Multivariate Model: Add Age Group and Years over 65"; TITLE2 "Unstructured R Matrix"; PROC MIXED DATA=work.Example5 COVTEST NOCLPRINT NAMELEN=100 IC METHOD=REML; CLASS PersonID condition mean sal; MODEL logrt = mean sal old@3 mean sal yrs65@3 / SOLUTION DDFM=Satterthwaite; REPEATED condition / R RCORR TYPE=UN SUBJECT=PersonID; RUN; TITLE1; TITLE2;
6 SPLH 861 Example 5 page 6 Relevant SAS Output, treating meaning and salience as categorical but old and yrs65 as continuous so that it will not marginalize across age in estimating marginal effects of meaning and salience: Type 3 Tests of Fixed Effects Num Den Effect DF DF Chi-Square F Value Pr > ChiSq Pr > F mean <.0001 <.0001 sal <.0001 <.0001 mean*sal Because old and yrs65 are continuous, these are the effects for younger adults. old <.0001 <.0001 old*mean old*sal old*mean*sal These are how the meaning and salience effects DIFFER in the older adult group (conditional at age 65 years). yrs <.0001 <.0001 yrs65*mean yrs65*sal yrs65*mean*sal These are how the meaning and salience effects DIFFER per additional year of age in the older adult group. Based on these results, it appears we can remove some fixed effects, starting with yrs65*mean*sal. The two-way interactions of yrs65*mean and yrs65*sal were still not significant, so those were removed, leaving only the significant main effect of yrs65. Here is the reduced model (in which the highest-order interaction is significant): 65 ECHO 'SPSS Reduced Conditional Multivariate Model: Years over 65 as Main Effect;'. ECHO 'Unstructured R Matrix'. MIXED logrt BY PersonID condition mean sal WITH old yrs65 SPSS: EMMEANS gives conditional means, /METHOD = REML /PRINT = SOLUTION TESTCOV R TEST gets slopes for age group per condition /FIXED = mean sal mean*sal old old*mean old*sal old*mean*sal yrs65 /REPEATED = condition COVTYPE(UN) SUBJECT(PersonID) /EMMEANS = TABLES(mean*sal) COMPARE(mean) WITH(old=0 yrs65=0) /EMMEANS = TABLES(mean*sal) COMPARE(mean) WITH(old=1 yrs65=0) /EMMEANS = TABLES(mean*sal) COMPARE(mean) WITH(old=1 yrs65=10) /EMMEANS = TABLES(mean*sal) COMPARE(sal) WITH(old=0 yrs65=0) /EMMEANS = TABLES(mean*sal) COMPARE(sal) WITH(old=1 yrs65=0) /EMMEANS = TABLES(mean*sal) COMPARE(sal) WITH(old=1 yrs65=10) /TEST= "Old: Low Mean, Low Sal" old 1 mean*old 1 0 sal*old 1 0 mean*sal*old /TEST= "Old: Low Mean, High Sal" old 1 mean*old 1 0 sal*old 0 1 mean*sal*old /TEST= "Old: High Mean, Low Sal" old 1 mean*old 0 1 sal*old 1 0 mean*sal*old /TEST= "Old: High Mean, High Sal" old 1 mean*old 0 1 sal*old 0 1 mean*sal*old display as result "STATA Reduced Conditional Multivariate Model:" display as result "Years over 65 as Main Effect" display as result "Unstructured R Matrix" mixed logrt ib(last).mean##ib(last).sal##old yrs65, /// personid:, noconstant variance reml /// residuals(unstructured,t(condition)), estat ic, n(156), estat wcorrelation, covariance, estat wcorrelation,
7 SPLH 861 Example 5 page 7 margins ib(last).mean#ib(last).sal, at(c.old=0 c.yrs65=0) margins ib(last).mean#ib(last).sal, at(c.old=1 c.yrs65=0) STATA: margins gets margins ib(last).mean#ib(last).sal, at(c.old=1 c.yrs65=10) conditional means, margins ib(last).mean@ib(last).sal, at(c.old=0 c.yrs65=0) lincom gets slopes for margins ib(last).mean@ib(last).sal, at(c.old=1 c.yrs65=0) age group per condition margins ib(last).mean@ib(last).sal, at(c.old=1 c.yrs65=10) margins ib(last).sal@ib(last).mean, at(c.old=0 c.yrs65=0) margins ib(last).sal@ib(last).mean, at(c.old=1 c.yrs65=0) margins ib(last).sal@ib(last).mean, at(c.old=1 c.yrs65=10) lincom c.old*1 + i1.mean#c.old*1 + i1.sal#c.old*1 + i1.mean#i1.sal#c.old*1 lincom c.old*1 + i1.mean#c.old*1 + i2.sal#c.old*1 + i1.mean#i2.sal#c.old*1 lincom c.old*1 + i2.mean#c.old*1 + i1.sal#c.old*1 + i2.mean#i1.sal#c.old*1 lincom c.old*1 + i2.mean#c.old*1 + i2.sal#c.old*1 + i2.mean#i2.sal#c.old*1 TITLE1 "SAS Reduced Conditional Multivariate Model: Years over 65 as Main Effect"; TITLE2 "Unstructured R Matrix"; PROC MIXED DATA=work.Example5 COVTEST NOCLPRINT NAMELEN=100 IC METHOD=REML; CLASS PersonID condition mean sal; MODEL logrt = mean sal old@3 yrs65 / SOLUTION DDFM=Satterthwaite; REPEATED condition / R RCORR TYPE=UN SUBJECT=PersonID; * Getting condition means and simple effect tests at different ages; LSMEANS mean*sal / AT (old yrs65)=(0 0) SLICE=mean SLICE=sal; * For YA; LSMEANS mean*sal / AT (old yrs65)=(1 0) SLICE=mean SLICE=sal; * For age 65; LSMEANS mean*sal / AT (old yrs65)=(1 10) SLICE=mean SLICE=sal; * For age 75; * Getting age group differences per condition -- need all terms with old slope in them; ESTIMATE "Old: Low Mean, Low Sal" old 1 mean*old 1 0 sal*old 1 0 mean*sal*old ; ESTIMATE "Old: Low Mean, High Sal" old 1 mean*old 1 0 sal*old 0 1 mean*sal*old ; ESTIMATE "Old: High Mean, Low Sal" old 1 mean*old 0 1 sal*old 1 0 mean*sal*old ; ESTIMATE "Old: High Mean, High Sal" old 1 mean*old 0 1 sal*old 0 1 mean*sal*old ; RUN; TITLE1; TITLE2; Relevant SAS Output: Meaning Solution for Fixed Effects Salience Rows with 0 s and dots are redundant effects not estimated for the reference group (YA in the high high condition) (1=Low, (1=Low, Standard Effect 2=High) 2=High) Estimate Error DF t Value Pr > t Intercept <.0001 mean 1Low mean 2High sal 1Low <.0001 sal 2High mean*sal 1Low 1Low mean*sal 1Low 2High mean*sal 2High 1Low mean*sal 2High 2High old <.0001 old*mean 1Low old*mean 2High old*sal 1Low old*sal 2High old*mean*sal 1Low 1Low old*mean*sal 1Low 2High old*mean*sal 2High 1Low old*mean*sal 2High 2High How years of age adjusts the intercept in older adults (is same for all conditions) yrs
8 SPLH 861 Example 5 page 8 Type 3 Tests of Fixed Effects Num Den Effect DF DF Chi-Square F Value Pr > ChiSq Pr > F mean <.0001 <.0001 sal <.0001 <.0001 mean*sal old <.0001 <.0001 old*mean old*sal old*mean*sal Because old and yrs65 are continuous, these are the effects for younger adults. These are how the meaning and salience effects DIFFER in the older adult group. Estimates Standard Label Estimate Error DF t Value Pr > t Old: Low Mean, Low Sal <.0001 Old: Low Mean, High Sal <.0001 Old: High Mean, Low Sal <.0001 Old: High Mean, High Sal <.0001 Meaning Least Squares Means Salience (1=Low, (1=Low, Standard Effect 2=High) 2=High) old yrs65 Estimate Error DF t Value Pr > t mean*sal 1Low 1Low <.0001 mean*sal 1Low 2High <.0001 mean*sal 2High 1Low <.0001 mean*sal 2High 2High <.0001 mean*sal 1Low 1Low <.0001 mean*sal 1Low 2High <.0001 mean*sal 2High 1Low <.0001 mean*sal 2High 2High <.0001 mean*sal 1Low 1Low <.0001 mean*sal 1Low 2High <.0001 mean*sal 2High 1Low <.0001 mean*sal 2High 2High <.0001 Meaning Salience Tests of Effect Slices These are the simple slopes for how the YA and OA groups differ in mean RT for each condition. These are the conditional means per condition for each level of age requested (YA, 65, and 75). These are the simple effects of condition within each level of age requested. Note they are same within the 65- and 75-year-olds. (1=Low, (1=Low, Num Den Effect 2=High) 2=High) old yrs65 DF DF F Value Pr > F mean*sal 1Low <.0001 mean*sal 2High <.0001 mean*sal 1Low <.0001 mean*sal 2High mean*sal 1Low <.0001 mean*sal 2High <.0001 mean*sal 1Low mean*sal 2High mean*sal 1Low <.0001 mean*sal 2High <.0001 mean*sal 1Low mean*sal 2High
SAS Syntax and Output for Data Manipulation:
Psyc 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (in preparation) chapter 5. We will be examining
This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.
One-Degree-of-Freedom Tests Test for group occasion interactions has (number of groups 1) number of occasions 1) degrees of freedom. This can dilute the significance of a departure from the null hypothesis.
Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure
Technical report Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure Table of contents Introduction................................................................ 1 Data preparation
Family economics data: total family income, expenditures, debt status for 50 families in two cohorts (A and B), annual records from 1990 1995.
Lecture 18 1. Random intercepts and slopes 2. Notation for mixed effects models 3. Comparing nested models 4. Multilevel/Hierarchical models 5. SAS versions of R models in Gelman and Hill, chapter 12 1
An Introduction to Modeling Longitudinal Data
An Introduction to Modeling Longitudinal Data Session I: Basic Concepts and Looking at Data Robert Weiss Department of Biostatistics UCLA School of Public Health [email protected] August 2010 Robert Weiss
Random effects and nested models with SAS
Random effects and nested models with SAS /************* classical2.sas ********************* Three levels of factor A, four levels of B Both fixed Both random A fixed, B random B nested within A ***************************************************/
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 1 DAVID C. HOWELL 4/26/2010
MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 1 DAVID C. HOWELL 4/26/2010 FOR THE SECOND PART OF THIS DOCUMENT GO TO www.uvm.edu/~dhowell/methods/supplements/mixed Models Repeated/Mixed Models for
Milk Data Analysis. 1. Objective Introduction to SAS PROC MIXED Analyzing protein milk data using STATA Refit protein milk data using PROC MIXED
1. Objective Introduction to SAS PROC MIXED Analyzing protein milk data using STATA Refit protein milk data using PROC MIXED 2. Introduction to SAS PROC MIXED The MIXED procedure provides you with flexibility
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
Electronic Thesis and Dissertations UCLA
Electronic Thesis and Dissertations UCLA Peer Reviewed Title: A Multilevel Longitudinal Analysis of Teaching Effectiveness Across Five Years Author: Wang, Kairong Acceptance Date: 2013 Series: UCLA Electronic
Multilevel Modeling Tutorial. Using SAS, Stata, HLM, R, SPSS, and Mplus
Using SAS, Stata, HLM, R, SPSS, and Mplus Updated: March 2015 Table of Contents Introduction... 3 Model Considerations... 3 Intraclass Correlation Coefficient... 4 Example Dataset... 4 Intercept-only Model
Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts, Procedures and Illustrations
Research Article TheScientificWorldJOURNAL (2011) 11, 42 76 TSW Child Health & Human Development ISSN 1537-744X; DOI 10.1100/tsw.2011.2 Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts,
SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout
Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS
Main Effects and Interactions
Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly
Individual Growth Analysis Using PROC MIXED Maribeth Johnson, Medical College of Georgia, Augusta, GA
Paper P-702 Individual Growth Analysis Using PROC MIXED Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Individual growth models are designed for exploring longitudinal data on individuals
E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F
Random and Mixed Effects Models (Ch. 10) Random effects models are very useful when the observations are sampled in a highly structured way. The basic idea is that the error associated with any linear,
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
Technical report. in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE
Linear mixedeffects modeling in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE Table of contents Introduction................................................................3 Data preparation for MIXED...................................................3
Chapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data.
Chapter 15 Mixed Models A flexible approach to correlated data. 15.1 Overview Correlated data arise frequently in statistical analyses. This may be due to grouping of subjects, e.g., students within classrooms,
9.2 User s Guide SAS/STAT. The MIXED Procedure. (Book Excerpt) SAS Documentation
SAS/STAT 9.2 User s Guide The MIXED Procedure (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation for the complete
Generalized Mixed Models for Binomial Longitudinal Outcomes (% Correct) using PROC GLIMMIX
Hoffman Psyc 945 Example 6c Generalized Mixed Models for Binomial Longitudinal Outcomes (% Correct) using PROC GLIMMIX The data for this example are based on the publication below, which examined annual
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
Applied Longitudinal Data Analysis: An Introductory Course
Applied Longitudinal Data Analysis: An Introductory Course Emilio Ferrer UC Davis The Risk and Prevention in Education Sciences (RPES) Curry School of Education - UVA August 2005 Acknowledgments Materials
Interaction effects and group comparisons Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015
Interaction effects and group comparisons Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Note: This handout assumes you understand factor variables,
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
Assignments Analysis of Longitudinal data: a multilevel approach
Assignments Analysis of Longitudinal data: a multilevel approach Frans E.S. Tan Department of Methodology and Statistics University of Maastricht The Netherlands Maastricht, Jan 2007 Correspondence: Frans
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
SUGI 29 Statistics and Data Analysis
Paper 194-29 Head of the CLASS: Impress your colleagues with a superior understanding of the CLASS statement in PROC LOGISTIC Michelle L. Pritchard and David J. Pasta Ovation Research Group, San Francisco,
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
Using Excel for Statistical Analysis
Using Excel for Statistical Analysis You don t have to have a fancy pants statistics package to do many statistical functions. Excel can perform several statistical tests and analyses. First, make sure
ln(p/(1-p)) = α +β*age35plus, where p is the probability or odds of drinking
Dummy Coding for Dummies Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health ABSTRACT There are a number of ways to incorporate categorical variables into
SPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
Use of deviance statistics for comparing models
A likelihood-ratio test can be used under full ML. The use of such a test is a quite general principle for statistical testing. In hierarchical linear models, the deviance test is mostly used for multiparameter
An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA
ABSTRACT An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA Often SAS Programmers find themselves in situations where performing
An introduction to using Microsoft Excel for quantitative data analysis
Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing
data visualization and regression
data visualization and regression Sepal.Length 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 I. setosa I. versicolor I. virginica I. setosa I. versicolor I. virginica Species Species
A Brief Introduction to SPSS Factor Analysis
A Brief Introduction to SPSS Factor Analysis SPSS has a procedure that conducts exploratory factor analysis. Before launching into a step by step example of how to use this procedure, it is recommended
Introducing the Multilevel Model for Change
Department of Psychology and Human Development Vanderbilt University GCM, 2010 1 Multilevel Modeling - A Brief Introduction 2 3 4 5 Introduction In this lecture, we introduce the multilevel model for change.
Chapter 29 The GENMOD Procedure. Chapter Table of Contents
Chapter 29 The GENMOD Procedure Chapter Table of Contents OVERVIEW...1365 WhatisaGeneralizedLinearModel?...1366 ExamplesofGeneralizedLinearModels...1367 TheGENMODProcedure...1368 GETTING STARTED...1370
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
ANOVA. February 12, 2015
ANOVA February 12, 2015 1 ANOVA models Last time, we discussed the use of categorical variables in multivariate regression. Often, these are encoded as indicator columns in the design matrix. In [1]: %%R
HLM software has been one of the leading statistical packages for hierarchical
Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)
This chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
Logistic (RLOGIST) Example #3
Logistic (RLOGIST) Example #3 SUDAAN Statements and Results Illustrated PREDMARG (predicted marginal proportion) CONDMARG (conditional marginal proportion) PRED_EFF pairwise comparison COND_EFF pairwise
How to set the main menu of STATA to default factory settings standards
University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
Multivariate Analysis. Overview
Multivariate Analysis Overview Introduction Multivariate thinking Body of thought processes that illuminate the interrelatedness between and within sets of variables. The essence of multivariate thinking
Moderation. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
Overview of Methods for Analyzing Cluster-Correlated Data. Garrett M. Fitzmaurice
Overview of Methods for Analyzing Cluster-Correlated Data Garrett M. Fitzmaurice Laboratory for Psychiatric Biostatistics, McLean Hospital Department of Biostatistics, Harvard School of Public Health Outline
A Short Guide to R with RStudio
Short Guides to Microeconometrics Fall 2013 Prof. Dr. Kurt Schmidheiny Universität Basel A Short Guide to R with RStudio 1 Introduction 2 2 Installing R and RStudio 2 3 The RStudio Environment 2 4 Additions
Quick Start to Data Analysis with SAS Table of Contents. Chapter 1 Introduction 1. Chapter 2 SAS Programming Concepts 7
Chapter 1 Introduction 1 SAS: The Complete Research Tool 1 Objectives 2 A Note About Syntax and Examples 2 Syntax 2 Examples 3 Organization 4 Chapter by Chapter 4 What This Book Is Not 5 Chapter 2 SAS
Using PROC MIXED in Hierarchical Linear Models: Examples from two- and three-level school-effect analysis, and meta-analysis research
Using PROC MIXED in Hierarchical Linear Models: Examples from two- and three-level school-effect analysis, and meta-analysis research Sawako Suzuki, DePaul University, Chicago Ching-Fan Sheu, DePaul University,
One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
How To Model A Series With Sas
Chapter 7 Chapter Table of Contents OVERVIEW...193 GETTING STARTED...194 TheThreeStagesofARIMAModeling...194 IdentificationStage...194 Estimation and Diagnostic Checking Stage...... 200 Forecasting Stage...205
5. Multiple regression
5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Yanchun Xu, Andrius Kubilius Joint Commission on Accreditation of Healthcare Organizations,
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
January 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
Using An Ordered Logistic Regression Model with SAS Vartanian: SW 541
Using An Ordered Logistic Regression Model with SAS Vartanian: SW 541 libname in1 >c:\=; Data first; Set in1.extract; A=1; PROC LOGIST OUTEST=DD MAXITER=100 ORDER=DATA; OUTPUT OUT=CC XBETA=XB P=PROB; MODEL
Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com
SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.
Estimating Multilevel Models using SPSS, Stata, SAS, and R. JeremyJ.Albright and Dani M. Marinova. July 14, 2010
Estimating Multilevel Models using SPSS, Stata, SAS, and R JeremyJ.Albright and Dani M. Marinova July 14, 2010 1 Multilevel data are pervasive in the social sciences. 1 Students may be nested within schools,
8. Comparing Means Using One Way ANOVA
8. Comparing Means Using One Way ANOVA Objectives Calculate a one-way analysis of variance Run various multiple comparisons Calculate measures of effect size A One Way ANOVA is an analysis of variance
Statistics and Pharmacokinetics in Clinical Pharmacology Studies
Paper ST03 Statistics and Pharmacokinetics in Clinical Pharmacology Studies ABSTRACT Amy Newlands, GlaxoSmithKline, Greenford UK The aim of this presentation is to show how we use statistics and pharmacokinetics
Using Computer Programs for Quantitative Data Analysis
1 Using Computer Programs for Quantitative Data Analysis Brian V. Carolan The Graduate School's Graduate Development Conference, 2 Slides available at: http://www.briancarolan.org/site/courses.html 3 Overview
Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:
Profile Analysis Introduction Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: ) Comparing the same dependent variables
Using Stata for Categorical Data Analysis
Using Stata for Categorical Data Analysis NOTE: These problems make extensive use of Nick Cox s tab_chi, which is actually a collection of routines, and Adrian Mander s ipf command. From within Stata,
Penalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
VI. Introduction to Logistic Regression
VI. Introduction to Logistic Regression We turn our attention now to the topic of modeling a categorical outcome as a function of (possibly) several factors. The framework of generalized linear models
Indices of Model Fit STRUCTURAL EQUATION MODELING 2013
Indices of Model Fit STRUCTURAL EQUATION MODELING 2013 Indices of Model Fit A recommended minimal set of fit indices that should be reported and interpreted when reporting the results of SEM analyses:
Appendix 1: Estimation of the two-variable saturated model in SPSS, Stata and R using the Netherlands 1973 example data
Appendix 1: Estimation of the two-variable saturated model in SPSS, Stata and R using the Netherlands 1973 example data A. SPSS commands and corresponding parameter estimates Copy the 1973 data from the
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, VP, Fleet Bank ABSTRACT Data Mining is a new term for the common practice of searching through
Module 5: Introduction to Multilevel Modelling SPSS Practicals Chris Charlton 1 Centre for Multilevel Modelling
Module 5: Introduction to Multilevel Modelling SPSS Practicals Chris Charlton 1 Centre for Multilevel Modelling Pre-requisites Modules 1-4 Contents P5.1 Comparing Groups using Multilevel Modelling... 4
A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA
123 Kwantitatieve Methoden (1999), 62, 123-138. A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA Joop J. Hox 1 ABSTRACT. When we deal with a large data set with missing data, we have to undertake
Paper D10 2009. Ranking Predictors in Logistic Regression. Doug Thompson, Assurant Health, Milwaukee, WI
Paper D10 2009 Ranking Predictors in Logistic Regression Doug Thompson, Assurant Health, Milwaukee, WI ABSTRACT There is little consensus on how best to rank predictors in logistic regression. This paper
Didacticiel - Études de cas
1 Topic Linear Discriminant Analysis Data Mining Tools Comparison (Tanagra, R, SAS and SPSS). Linear discriminant analysis is a popular method in domains of statistics, machine learning and pattern recognition.
Logistic Regression. http://faculty.chass.ncsu.edu/garson/pa765/logistic.htm#sigtests
Logistic Regression http://faculty.chass.ncsu.edu/garson/pa765/logistic.htm#sigtests Overview Binary (or binomial) logistic regression is a form of regression which is used when the dependent is a dichotomy
Data Analysis in SPSS. February 21, 2004. If you wish to cite the contents of this document, the APA reference for them would be
Data Analysis in SPSS Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Heather Claypool Department of Psychology Miami University
IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.
Factor Analysis in SPSS To conduct a Factor Analysis, start from the Analyze menu. This procedure is intended to reduce the complexity in a set of data, so we choose Data Reduction from the menu. And the
Modeling Lifetime Value in the Insurance Industry
Modeling Lifetime Value in the Insurance Industry C. Olivia Parr Rud, Executive Vice President, Data Square, LLC ABSTRACT Acquisition modeling for direct mail insurance has the unique challenge of targeting
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Simple Linear Regression, Scatterplots, and Bivariate Correlation
1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.
BIOL 933 Lab 6 Fall 2015. Data Transformation
BIOL 933 Lab 6 Fall 2015 Data Transformation Transformations in R General overview Log transformation Power transformation The pitfalls of interpreting interactions in transformed data Transformations
Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group
Introduction to Multilevel Modeling Using HLM 6 By ATS Statistical Consulting Group Multilevel data structure Students nested within schools Children nested within families Respondents nested within interviewers
7 Generalized Estimating Equations
Chapter 7 The procedure extends the generalized linear model to allow for analysis of repeated measurements or other correlated observations, such as clustered data. Example. Public health of cials can
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
Cool Tools for PROC LOGISTIC
Cool Tools for PROC LOGISTIC Paul D. Allison Statistical Horizons LLC and the University of Pennsylvania March 2013 www.statisticalhorizons.com 1 New Features in LOGISTIC ODDSRATIO statement EFFECTPLOT
TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2
About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing
Introduction to Data Analysis in Hierarchical Linear Models
Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM
Statistics, Data Analysis & Econometrics
Using the LOGISTIC Procedure to Model Responses to Financial Services Direct Marketing David Marsh, Senior Credit Risk Modeler, Canadian Tire Financial Services, Welland, Ontario ABSTRACT It is more important
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
