Latent class mixed models for longitudinal data. (Growth mixture models)



Similar documents
LCMM: a R package for the estimation of latent class mixed models for Gaussian, ordinal, curvilinear longitudinal data and/or time-to-event data

Introduction to mixed model and missing data issues in longitudinal studies

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

An Introduction to Latent Class Growth Analysis and Growth Mixture Modeling

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA

Web-based Supplementary Materials for Bayesian Effect Estimation. Accounting for Adjustment Uncertainty by Chi Wang, Giovanni

Nominal and ordinal logistic regression

STA 4273H: Statistical Machine Learning

Missing data and net survival analysis Bernard Rachet

Christfried Webers. Canberra February June 2015

Linear Classification. Volker Tresp Summer 2015

Basics of Statistical Machine Learning

Lecture 3: Linear methods for classification

Linear Models for Classification

Classification Problems

PATTERN MIXTURE MODELS FOR MISSING DATA. Mike Kenward. London School of Hygiene and Tropical Medicine. Talk at the University of Turku,

Reject Inference in Credit Scoring. Jie-Men Mok

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression

CS 688 Pattern Recognition Lecture 4. Linear Models for Classification

Introduction to General and Generalized Linear Models

171:290 Model Selection Lecture II: The Akaike Information Criterion

Overview. Longitudinal Data Variation and Correlation Different Approaches. Linear Mixed Models Generalized Linear Mixed Models

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

The Probit Link Function in Generalized Linear Models for Data Mining Applications

Problem of Missing Data

Prediction for Multilevel Models

A Latent Variable Approach to Validate Credit Rating Systems using R

1 Prior Probability and Posterior Probability

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Statistics Graduate Courses

Exact Inference for Gaussian Process Regression in case of Big Data with the Cartesian Product Structure

Imputation of missing data under missing not at random assumption & sensitivity analysis

Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS

Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts, Procedures and Illustrations

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

Standard errors of marginal effects in the heteroskedastic probit model

Electronic Theses and Dissertations UC Riverside

A Basic Introduction to Missing Data

A hidden Markov model for criminal behaviour classification

Credit Risk Models: An Overview

Multivariate Logistic Regression

Pattern Analysis. Logistic Regression. 12. Mai Joachim Hornegger. Chair of Pattern Recognition Erlangen University

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Handling attrition and non-response in longitudinal data

Analysis of Correlated Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Analyzing Structural Equation Models With Missing Data

Extreme Value Modeling for Detection and Attribution of Climate Extremes

Elements of statistics (MATH0487-1)

Dealing with Missing Data

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Statistical Machine Learning

A Composite Likelihood Approach to Analysis of Survey Data with Sampling Weights Incorporated under Two-Level Models

SAS Software to Fit the Generalized Linear Model

Model based clustering of longitudinal data: application to modeling disease course and gene expression trajectories

Introduction to Longitudinal Data Analysis

Interpretation of Somers D under four simple models

Bayesian Statistics in One Hour. Patrick Lam

Capabilities of R Package mixak for Clustering Based on Multivariate Continuous and Discrete Longitudinal Data

Linear Discrimination. Linear Discrimination. Linear Discrimination. Linearly Separable Systems Pairwise Separation. Steven J Zeil.

Bayesian Approaches to Handling Missing Data

Introduction to Structural Equation Modeling (SEM) Day 4: November 29, 2012

Illustration (and the use of HLM)

A Mixed Model Approach for Intent-to-Treat Analysis in Longitudinal Clinical Trials with Missing Values

Latent Class Regression Part II

Review of the Methods for Handling Missing Data in. Longitudinal Data Analysis

HLM software has been one of the leading statistical packages for hierarchical

E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F

Statistical modelling with missing data using multiple imputation. Session 4: Sensitivity Analysis after Multiple Imputation

Fitting Subject-specific Curves to Grouped Longitudinal Data

Multilevel Modeling of Complex Survey Data

R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models

These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop

Course 4 Examination Questions And Illustrative Solutions. November 2000

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

The Latent Variable Growth Model In Practice. Individual Development Over Time

Handling missing data in Stata a whirlwind tour

Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study

Biostatistics Short Course Introduction to Longitudinal Studies

Multilevel Models for Longitudinal Data. Fiona Steele

Models for Longitudinal and Clustered Data

Least Squares Estimation

Department of Epidemiology and Public Health Miller School of Medicine University of Miami

Multiple Choice Models II

Ordinal Regression. Chapter

Assignments Analysis of Longitudinal data: a multilevel approach

Multilevel Modelling of medical data

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

SF2940: Probability theory Lecture 8: Multivariate Normal Distribution

Transcription:

Latent class mixed models for longitudinal data (Growth mixture models) Cécile Proust-Lima & Hélène Jacqmin-Gadda Department of Biostatistics, INSERM U897, University of Bordeaux II INSERM workshop 205 - june 2010 - Saint Raphael

Analysis of change over time Linear mixed model (LMM) for describing change over time : - Correlation between repeated measures (random-effects) - Single mean profile of trajectory Restricted to homogeneous populations Yet, frequently populations are heterogeneous : Latent group structure linked to a behavior, a disease,... - Trajectories of disability before death (Gill, 2010) - Trajectories of alcohol use in young adults (Muthén, 1999) - Cognitive declines in the elderly (Proust-Lima, 2009a) - Progression of prostate cancer after treatment (Proust-Lima, 2009b)

PSA trajectories after radiation therapy (1) 3 2.5 2 log(psa+0.1) 1.5 1 0.5 0-0.5-1 0 1 2 3 4 5 6 7 8 9 time since end of RT

PSA trajectories after radiation therapy (2) 3 2.5 2 log(psa+0.1) 1.5 1 0.5 0-0.5-1 0 1 2 3 4 5 6 7 8 9 time since end of RT

Trajectories of verbal fluency in the elderly (1) 40 35 30 Isaacs Set Test 25 20 15 10 5 0 0 2 4 6 8 10 12 14 time since entry in the cohort

Trajectories of verbal fluency in the elderly (2) 40 35 30 Isaacs Set Test 25 20 15 10 5 0 0 2 4 6 8 10 12 14 time since entry in the cohort

Latent class mixed models / growth mixture models / heterogeneous mixed models Accounts for both individual variability and latent group structure (Verbeke,1996 ; Muthén, 1999) Extension of LMM to account for heterogeneity Extension of LCGA to account for individual variability Two submodels : - Probability of latent class membership - Class-specific trajectory both according to covariates/predictors

Outline of the talk Methodology - Latent class linear mixed model - Estimation methods - Posterior classification - Goodness of fit evaluation Application - PAQUID cohort of cognitive aging - Heterogeneous profiles of verbal fluency and predictors Discussion

Probability of latent class membership Population of N subjects (subscript i, i = 1,..., N) G latent homogeneous classes (subscript g, g = 1,..., G) Discrete latent variable c i for the latent group structure : c i = g if subject i belongs to class g (g = 1,..., G) Every subject belongs to only one latent class Probability of latent class membership explained according to covariates X 1i (multinomial logistic regression) : π ig = P(c i = g X 1i ) = eξ 0g+X 1i ξ 1g G l=1 eξ 0l+X 1i ξ 1l with ξ 0G = 0 and ξ 1g = 0 i.e. class G = reference class

Class-specific LMM : notations and simple example Change over time of a longitudinal outcome Y : - Y ij repeated measure for subject i at occasion j, j = 1,..., n i - t ij time of measurement at occasion j, j = 1,..., n i Number & times of measurements can differ across subjects Linear change over time (without adjustment for covariates) : Y ij ci =g = u 0ig + u 1ig t ij +ǫ ij Class-specific random-effects (u 0ig, u 1ig ) N((µ 0g,µ 1g ), B g ) - µ 0g and µ 1g class-specific mean intercept and slope - B g class-specific variance-covariance (usually B g = B or B g = w 2 gb) Independent errors of measurement ǫ ij N(0,σ 2 ǫ)

Class-specific LMM : General formulation Y ij ci =g = Z iju ig + X 2ijβ + X 3ijγ g +ǫ ij Z ij, X 2ij, X 3ij : 3 different vectors of covariates without overlap Z ij vector of time functions : Z ij = (1, t ij, tij 2, t3 ij,...) for polynomial shapes Z ij = (B 1 (t ij ),..., B K (t ij )) for shapes approximated by splines Z ij = (f 1 (t ij ),..., f K (t ij )) for shapes defined by a set of K parametric functions X 2ij set of covariates with common effects over classes β X 3ij set of covariates with class-specific effects γ g

Estimation of the parameters Estimation of θ G for a fixed number of latent classes G : in a Bayesian framework (Komarek, 2009 ; Elliott, 2005) in a maximum likelihood framework (Verbeke, 1996 ; Muthén, 1999, 2004 ; Proust, 2005) N G L(θ G ) = ln P(c i = g X 1i,θ G ) φ ig (Y i c i = g; X 2i, X 3i, Z i,θ G ) i=1 g=1 with X 2i, X 3i, Z i : matrices of n i row vectors X 2ij, X 3ij, Z ij resp. φ ig pdf of MVN(X 2i β + X 3i γ g + Z i µ g, Z i B g Z i +σ2 ǫ I ni )

Estimation of LCLMM Estimation of ˆθ G for a fixed G Multiple possible maxima grid of initial values Selection of the optimal number of latent classes : - Bayesian Information Criterion (BIC) (Bauer, 2003) - Deviance Information Criterion (DIC, DIC 3,etc) (Celeux, 2006) - Other possible tests (Lo, 2001 ; Nylund, 2007) Programs available : - Mplus (Muthén, 2001) - R function GLMM MCMC in mixak package (Komarek, 2009) - R function hlme in lcmm package (Proust-Lima, 2010) - GLLAMM in Stata (Rabe-Hesketh, 2005)

Posterior classification Posterior probability of class membership For a subject i in latent class g : ˆπ ig = P(c i = g X i, Y i,ˆθ) = P(c i = g X 1i,ˆθ)φ ig (Y i c i = g,θ) G l=1 P(c i = l X 1i,ˆθ)φ il (Y i c i = l, X 2i, X 3i, Z i,ˆθ) Posterior classification : ĉ i = argmax g (ˆπ ig ) Class in which the subject has the highest posterior probability

Goodness-of-fit 1 : Classification Is the classification very discriminative? Or is it ambiguous? Table of posterior classification : Final Mean of the probabilities of belonging to each class class 1... g... G 1 1 N N1 1 i=1 N ˆπ 1 i1... N1 i=1 1 N ˆπ 1 ig... N1 i=1 1 N ˆπ ig 1...... 1 Ng g N g i=1 N ˆπ 1 Ng i1... i=1 g N ˆπ 1 Ng ig... i=1 g N ˆπ ig g.. G N G 1 NG i=1 N i1 G... 1 NG i=1 N πˆ ig... G.... 1 N G NG i=1 ˆπ ig

Goodness-of-fit 2 : Longitudinal predictions Does the longitudinal model correctly fit the data? Class-specific marginal (M) & subject-specific (SS) predictions : - Ŷ (M) ijg - Ŷ (SS) ijg = X T 2ijˆβ + X T 3ijˆγ g + Z T ijˆµ g = X T 2ijˆβ + X T 3ijˆγ g + Z T ijˆµ g + Z T ij ûig with bayes estimates û ig = ω 2 gbz T i V 1 ig (Y i X 2iˆβ + X3iˆγ g + Z iˆµ g ) Weighted average over classes (and corresponding residuals) : Ŷ (M) ij Ŷ (SS) ij = G g=1 πˆ igŷ(m) ijg πigŷ(ss) ijg = G g=1 ˆ ˆR (M) ij ˆR (SS) ij = Y ij Ŷ (M) ij = Y ij Ŷ (SS) ij

4 kinds of possible analyses in LCLMM 1. Exploration of unconditioned and unadjusted trajectories - no covariates in the LMM & the class-membership model raw heterogeneity 2. Exploration of unconditioned adjusted trajectories - covariates in the LMM residual heterogeneity after adjustment for known factors of change over time 3. Exploration of conditioned unadjusted trajectories - covariates in the class-membership model heterogeneity explained by targeted factors 4. Exploration of conditioned and adjusted trajectories - covariates in the class-membership model & the LMM residual heterogeneity explained by targeted factors

PAQUID cohort Population-based prospective cohort of cognitive aging - 3777 subjects of 65 years and older in South West France (random selection from electoral rolls) - Follow-up every 2-3 years : T0 T1 T3 T5 T8 T10 T13 T15 T17 At each visit : - Neuropsychological battery - Two phase diagnosis of dementia - & Information about health, activities, etc

Verbal fluency trajectories Verbal fluency impaired early in pathological aging (Amieva, 2008) description of heterogeneous trajectories - Verbal fluency measured by IST (Isaacs Set Test) - Quadratic trajectory according to time from entry - Patients included :. Not initially demented. At least one measure at IST in T0 - T17 - Covariates of interest :. First evaluation effect (adjustment in the LMM). Gender, education, age at entry (classes predictors)

Heterogeneity predicted by gender, education and age Estimation for a varying number of latent classes : G p L BIC Frequency of the latent classes (%) 1 2 3 4 5 1 11-40651.3 81392.4 100 2 18-40104.8 80356.6 51.3 48.7 3 25-40015.7 80235.6 29.6 50.6 19.8 4 32-39941.3 80144.0 30.7 47.6 3.3 18.4 5 39-39922.2 80163.0 31.4 1.8 31.4 20.7 14.7 number of parameters

Trajectories of verbal fluency in the elderly 40 35 30 25 IST 20 15 10 5 0 class 1 (30.7%) class 2 (47.6%) class 3 (3.3%) class 4 (18.4%) 0 2 4 6 8 10 12 14 16 years from entry in the cohort

Predictors of class-membership predictor class estimate SE p-value male 1 0.051 0.219 0.817 male 2 0.115 0.197 0.561 male 3 0.530 0.291 0.068 male 4 0 education+ 1 5.150 0.587 <0.0001 education+ 2 1.742 0.298 <0.0001 education+ 3 4.379 1.158 0.0001 education+ 4 0 age at entry 1-0.447 0.046 <0.0001 age at entry 2-0.215 0.027 <0.0001 age at entry 3-0.173 0.040 <0.0001 age at entry 4 0

Posterior classification table Final Number of Mean of the class-membership classif. subjects (%) probabilities in class (in %) : 1 2 3 4 1 1083 (30.7%) 81.1 14.4 4.5 <0.1 2 1679 (47.6%) 9.7 74.0 5.5 10.8 3 117 (3.3%) 10.8 20.4 67.7 1.1 4 648 (18.4%) <0.2 19.2 0.7 80.1

Weighted marginal predictions and observations IST 40 35 30 25 20 15 Class 1 10 mean observation 5 95%CI mean prediction 0 0 2 4 6 8 10 12 14 16 time from entry IST Class 2 40 35 30 25 20 15 10 5 0 0 2 4 6 8 10 12 14 16 time from entry IST 40 35 30 25 20 15 10 5 Class 3 0 0 2 4 6 8 10 12 14 16 time from entry IST Class 4 40 35 30 25 20 15 10 5 0 0 2 4 6 8 10 12 14 16 time from entry

Advantages of LCLMM - Accounts for 2 sources of variability. individual variability through random-effects inference possible. latent group structure mean profiles of trajectory (different from LCGA) - MAR assumption for missing data and dropout - Individually varying time (age / exact follow-up) - Includes flexibly covariates : different questions addressed

Limits of LCLMM - Starting values & local solutions (Hipp, 2006). vary the starting values extensively. compare various solutions to determine the stability of the model. assess the frequency of the solution - Interpretation of the latent classes (Bauer, 2003 + discutants) Flexible model that can fit better homogeneous populations relevant assumption of latent groups evaluation of goodness-of-fit - Linear models for Gaussian outcomes only same extension for nonlinear mixed models same extension for multivariate mixed models

References - Amieva H, Le Goff M, Millet X, et al. (2008). Annals of Neurology, 64, 492 :8 - Bauer DJ, Curran PJ (2003). Psychol Meth,8(3), 338 :63 (+ discutants 364 :93) - Elliott MR, Ten Have TR, Gallo J, et al. (2005). Biostatistics, 6, 119 :43. - Gill TM, Gahbauer EA, Han L, Allore HG (2010). The New England journal of medicine, 362, 1173 :80 - Hipp JR, Bauer DJ (2006). Psychological methods, 11(1), 36 :53 - Komarek A (2009). Computational Statistics and Data Analysis, 53(12), 3932 :47 - Lo Y, Mendell NR, Rubin DB (2001). Biometrika,88(3),767 :78 - Muthén B0, Shedden K (1999). Biometrics,55(2),463 :9 - Muthén LK, Muthén B0 (2004). Mplus user s guide. www.statmodel.com - Nylund KL, Asparouhov T, Muthén BO (2007). Structural Equation Modeling,14,535 :69 - Proust C, Jacqmin-Gadda H (2005). Computer Methods and Programs in Biomedicine,78,165 :73 - Proust-Lima C, Joly P, Jacqmin-Gadda H (2009a). Computational Statistics and Data Analysis,53,1142 :54 - Proust-Lima C, Taylor J (2009b). Biostatistics,10,535 :49 - Rabe-Hesketh S, Skrondal A, Pickles A (2004). Psychometrika, 69, 167 :90 - Verbeke G, Lesaffre E (1996). Journal of the American Statistical Association,91(433),217 :21