Methods for Analyzing Longitudinal Health Survey Data. Theory and Applications
|
|
|
- Rolf Jefferson
- 10 years ago
- Views:
Transcription
1 Methods for Analyzing Longitudinal Health Survey Data Theory and Applications
2 Mary E. Thompson University of Waterloo
3 Acknowledgments
4 Acknowledgments The support of the following agencies is acknowledged: Natural Sciences and Engineering Council of Canada (NSERC) National Program on Complex Data Structures (NPCDS) Canadian Institutes of Health Research (CIHR) Transdisciplinary Tobacco Use Research Center (TTURC) at Roswell Park National Institutes of Health (NIH)
5 Outline I. Observational studies and causality II. Overview of models for longitudinal survey data III. Adapting the models and methods to complex survey data IV. Scan of software
6 Part I: Observational Studies and Causality The nature of causality Experimentation and observational studies Replication and the multiphasic approach Role of theory Role of temporal order
7 Reference Marini, M.M. and Singer, B. (1988) Causality in the Social Sciences. Sociological Methodology 18, M & S survey the literature on the ontology and epistemology of causality in the social sciences
8
9 The nature of causality Hume, D. (writing ): The idea of causality arises from the empirical relations of contiguity, temporal succession, and constant conjunction; regularity of association, together with a necessary connection. M&S P
10 The nature of causality Causality is directional: in some sense the cause must exist before the effect Causality is an if-then relationship
11 The nature of causality A mathematical function y=f(x), where x can be assigned (conceptually) there would be more snow in Denver if the Rocky Mountains were lower An algorithm, with output determinable from input through a sequence of instructions
12 The nature of causality A process: a chain of operations/mappings, where input may be assignable at a number of places; intervention; surgery (Pearl, 2000) A number of philosophers have argued that to provide the connection between cause and effect which Hume called the cement of the universe, we must analyze causal relationships in terms of a causal process that connects the two events and explains their relationship. M&S P.361
13 The nature of causality Causality as information transmission: Salmon, W.C. (1984) proposes that we view causal processes as the means by which structure and order are propagated or transmitted from one space-time region of the universe to other times and places. An intervention at a particular point in the process transforms it in a way that persists from that point on. M&S P. 362.
14 The nature of causality The life sciences show us how a stochastic element might enter the sequence of operations:
15 The nature of causality Fisher (1932, 1934): deterministic causality is unintelligent Stochasticity is the creative element in evolutionary change: Only in an indeterministic system has the notion of [causality] restored to it that creative element, that sense of bringing something to pass which otherwise would not have been, which is essential to its commonplace meaning.
16 The nature of causality Holders to determinism need statistical models: In general, the language of causation is more likely to be used when causal laws are molar, or stated in terms of large or complex objects. These laws usually involve delayed causation, mediated via causal chains that operate through time The observation of molar relationships tends to be contingent upon many conditions. Until these conditions are more fully known, molar causal laws will be highly fallible, and hence, probabilistic. M&S P. 359.
17 The nature of causality When we say smoking causes lung cancer we mean smoking seems quite regularly to increase the chances of incurring lung cancer, and we understand something of the process or mechanism If we understood and could see the mechanism fully, we could know which smokers are destined to incur the disease, or how to prevent it
18 Causality in the social sciences Similar discourse The system (person or group) is complex, intelligent How can some factor be said to cause behaviour which is at least partly a matter of will? It is difficult to keep from influencing the processes we are trying to understand
19 Conditioning Wilson, S. E. and Howell, B. L. (2003) Do panel surveys make people sick? Arthritis trends in the Health and Retirement Study Shows prevalence in age group increasing in panel cohorts (longitudinal samples) much more than in general population as indicated by NHIS; increase not accounted for by question differences or attrition bias
20 How do we discern a causal process? Everyday experimentation:
21 How do we discern a causal process? Accidental experiment: Landry, D. W. and Oliver, J.A. (2004) Insights into shock. Scientific American 290(2), In 1997 a serendipitous observation changed the entire direction of our work.
22 How do we discern a causal process? Vasopressin restores blood pressure: It is thought that vasopressin reduces nitric oxide s dilating effects on arterioles and blocks ATP-sensitive potassium channels, allowing the calcium channels to open and the [arteriole muscle] cell to contract.
23 How do we discern a causal process? Menthol cigarettes are no more harmful than other cigarettes, but smokers of menthol cigarettes have more difficulty quitting Pletcher et al (2006) Longitudinal survey 1535 adults followed over 15 years (1200 completed)
24 The randomized experiment Allows us to discern regularity and if-then even under complexity, fallibility An association is observed between X and Y. Either (i) X has caused Y or (ii) Y has caused X or (iii) a third variable Z has caused both. If the value of X has been assigned randomly to units, then we can rule out (ii) and (iii). Causation is established.
25 Observational studies Cannot assign treatments, at least not completely Cannot establish causation Can measure association Can include elements which help in understanding the association
26 Observational studies How can we understand an association? Can we exploit the association? Can we alter the association?
27 Replication and the multiphasic approach In the absence of a randomized experiment, we need a variety of (replicable) approaches. Example: smoking and heart disease (Doll and Hill) Age-adjusted mortality rates were higher for smokers than for non-smokers Light smokers and ex-smokers had ageadjusted mortality rates between those of nonsmokers and heavy smokers.
28 Replication and the multiphasic approach Framingham heart study: 1948, random sample of 2/3 of adults aged in Framingham, Massachusetts; s.s Physical examinations and interviews Offspring sample (and spouses) 5124 began in grandchildren (study genetic influences on cardiovascular disease)
29 Replication and the multiphasic approach Rosenbaum, P. R. (2002) Observational Studies. 2nd edition. Springer-Verlag. when constructing a causal hypothesis one should envisage as many consequences of its truth as possible, and plan observational studies to discover whether each of these consequences is found to hold. (Cochran, quoting Fisher)
30 Role of longitudinal survey Diggle, Liang and Zeger (1994), 1.4 Y ij = β C x i1 + β L (x ij x i1 ) + ε ij, j =1,...,n i i =1,...,m Even when β C = β L, longitudinal studies tend to be more powerful than cross-sectional studies. The basis of inference about β C is a comparison of individuals with a particular value of x to others with a different value
31 Role of longitudinal survey Diggle, Liang and Zeger (1994), 1.4 (Y ij Y i1 ) = β L (x ij x i1 ) + ε ij ε i1 βl In contrast, the parameter is estimated by comparing a person s response at two [or more] times, assuming x changes with time. In a longitudinal study, each person can be thought of as serving as his or her own control.
32 Role of longitudinal survey A temporal ordering A chance to observe change Opportunity to control for a number of possible confounders (Z variables) But: There may be unmeasured confounders
33 Role of longitudinal survey Observations at earlier and later times are snapshots of a complex unfolding process Watching Batman at age 5 may predict school suspension at age 15 Illuminating the process requires other research designs
34 Television and aggression Bushman, B. J. (1995) J. of Personality and Social Psychology 69, Psych 101 students high trait aggressive individuals are more susceptible to the effects of violent media than are low trait aggressive individuals because they possess a relatively large network of aggressive associations that can be activated by violent cues (Conjectures that prior exposure to TV violence may have built up the network.)
35 How can we discern causal processes? To make valid causal inferences about the actions of individuals will require far more direct questioning of individuals and the mounting of longitudinal studies with successive waves of data collection spaced at short intervals. M&S P. 401
36 Role of longitudinal survey Covariation between changes in the values of X and Y Possibility of combining observation with intervention More potential if interviews come with physical, genetic data; subject attributions Broad representativity (more chance of full variety of treatments, more chance of adjustment for confounders; more observation of rare occurrences)
37 Importance of attention to sampling In measuring association, the sampling plan defines the population to which the measurement refers Achieved sampling plan can affect the strength of the part of the association we see, e.g. in a case where non-response is higher in the upper classes and the lower classes than the middle classes.
38 Part II: Overview of models for longitudinal survey data Observational plans Response variables Explanatory variables Marginal models Random effects models Transition models Survival and event history models
39 Part II: Overview of models cont d Missing data: censoring, omission, attrition Cohort and time-in-sample effects
40 Observational plans Continuous (day, hour, minute) Discrete, regular Discrete, irregular Single panel Cohorts Prospective, retrospective
41 Example: NPHS Begun in with 17,000 respondents across Canada Waves every two years; sample sizes Ideal for estimating the effects of risk factors on illness in medium and longer term Useful for examining precursors of risk factors
42 Example: NPHS Shields (1999) Health Reports No relationship between working hours and daily smoking in 1994/95, when other factors such as age and education were taken into account For both sexes, changing from standard to longer working hours between 1994/95 and 1996/97 was significantly associated with an increase in smoking during the period
43 Example: NESARC National Epidemiologic Survey on Alcohol and Related Conditions Waves in and Wave 1 sample: 43,093 completed interviews To increase understanding of the natural history of alcohol use disorders and associated disabilities To identify factors that impact on their remission, chronicity, stability and initiation
44 Example: Dunedin Study 1037 babies enrolled in 1982 Assessed ages 3, 5, 7, 9, 11,13, 15, 21, were assessed at age 26 in Questionnaires on all aspects of lives, and physical examinations DNA with permission Caspi et al (2002)
45 Example: Dunedin study Maltreatment antisocial problems MAOA deficiency is a moderator of that relationship
46 Example: ITC Surveys International Tobacco Control Policy Evaluation Project Longitudinal survey of adult smokers across several countries Approximately annual waves Replenishment: new cohort each wave approximately the size of those lost to attrition
47 Longitudinal response variables of interest Continuous or categorical (discrete) Categorical: binary, ordinal, nominal, count Development, growth, progress Transition, change Trajectories Repeated measures Event times
48 Explanatory variables Fixed Time dependent Internal, external Individual, population level
49 Example: repeated measures Indonesian Children s Health Study (Diggle, Liang and Zeger 1994) Subsample of 275 preschoolers Examined quarterly for up to 6 visits Dependent variable presence of respiratory infection (Y t : t =1,...,6) Main explanatory variable xerophthalmia (vitamin A deficiency) Estimate increase in risk of respiratory infection for children who are vitamin A deficient, controlling for other factors.
50 Marginal model E(Y it ) = μ it log it (μ it ) = log μ it = log P (Y it = 1) 1 μ it P (Y it = 0) = x it T β Var(Y it ) = μ it (1 μ it ) Corr(Y is,y it ) = α
51 Results Generalized estimating equation (GEE) Xerophthalmia increases the log odds of respiratory infection (RI) by 0.64, controlling for gender, height for age, seasonality of RI, age at entry Y t 1 Associations between and are taken into account by estimating a covariance matrix Y t
52 Transitional model H it = history of Y μ c it = E(Y it H it ) Var(Y it H it ) = v c it = v(μ c it )φ logit(μ c it ) = x itt β ** (H it )
53 Results from a transitional model Y t 1 Y t and are strongly associated Y t 1 For cases where = 0, xerophthalmia increases the log odds of RI at time t by 0.78, controlling for age and season Y t 1 For cases where = 1, xerophthalmia increases the log odds of RI at time t by 0.88, controlling for age and season
54 Comments Results (non-significant) are very similar, from marginal model, random effects model (not shown) and transitional model Transitional model assists causal interpretation The conditioned association is not affected by the fact that xerophthalmia is to start with higher among those that are frequently ill Results change somewhat, conditioning on Y t 1 and Y t 2
55 Example: intervention impact International Tobacco Control (ITC) Ireland Survey Interviews of 755 adult smokers in Ireland, 411 in UK, before and after smoke-free law in Ireland Y t =1 if in favor of ban in pubs, =0 if not, time x = indicator for country Fundamental quantities: π ij x = P(Y 1 = i, Y 2 = j x) t
56 Example: intervention impact
57 Model for intervention impact GEE parameterization logit[p(y t =1 x) = α + βx + γt + δ t x Corr(Y 1,Y 2 ) = ρ 5 parameters; the key one is Advantages: Matches prevalence plots Methodology well accepted δ
58 Transition parameterization Initial probabilities Transition probabilities x π i. logit( p x 01 ) = η + ςx logit(p x 10 ) = μ + νx p ij x Up to 6 parameters Key parameter is or ς, ς ν
59 Transition parameterization Advantages Corresponds to a probability model Estimation and interpretation very simple Incorporation of missingness is natural Incorporation of design information is straightforward.
60 Example: longitudinal mediational model ITC Four Country Survey Interviews of adult smokers in Canada,US, UK and Australia Longitudinal with replenishment (2000 smokers or ex-smokers) per country per wave
61 Example: longitudinal mediational model Variables at wave t: Q t H t C t L t P t : quit intention : concern about harm to health : concern about cost : attention to warning labels : attention to price
62 Mediational model Tier 1 Tier 2 Outcome L H Q Model P C L t L t 1 ; P t P t 1 ; H t H t 1,L t,l t 1,P t,p t 1 ; C t C t 1,L t,l t 1,P t,p t 1 ; Q t Q t 1, other variables
63 Mediational model Four waves of data, Canada and Australia Level of L is higher in Canada In both countries, marginal level of health concern is in steady state π i 0.65 Health concern transition probabilities p , p Estimated transition matrix varies with (L t 1,L t ) L t 1 = L t =1 is associated with increase in p 01 for H over two transition periods.
64 Example: survival models Time of death Time of menarche Time of school dropout Time of smoking cessation
65 Survival models Continuous: the exact time of an event is known or knowable for each subject, e.g. time of death Discrete: the time is best thought of as discrete, e.g. month after pregnancy
66 Survivor function, hazard Discrete case: S(j) = probability of surviving beyond time j h(j) = P(T=j T j)= probability of experiencing the event at j, given it is not experienced by time j-1
67 Survivor function, hazard Continuous case: S(t) = P(T > t) = exp{ h(u)du} Hazard function is a transition rate,e.g. life-to-death h(t)dt = P(t < T t + dt T > t) Theory for incorporating right censoring and attrition t 0
68 Example: latent mechanism Cirrhosis trial (Skrondal and Rabe-Hesketh 2004) 488 patients with liver cirrhosis Randomized to treatment with hormone prednisone or to placebo, indicator x Responses: Survival time to death or censoring (lost to follow-up or alive at the end of the observation period) Repeated measurements of prothrobin, a biochemical marker of liver functioning
69 Semiparametric model Proportional hazards model for patient Partial likelihood R(t (r) ) h j (t) = h 0 (t) exp(ν j ) = the set of patients still at risk at the r-th failure time j PL = r exp(ν r ) j R(t( r ) ) exp(ν j )
70 Cirrhosis trial Time at i-th measurement for patient j is Measurement model relating observed marker (2) to latent marker : η ij y ij = β 0 + η (2) ij + ε ij, ε ij ~ N(0,θ) Structural model for latent marker η (2) ij = γ 1 t ij + γ 2 x j + η (3) j, η (3) j ~ N(0,ψ) Random intercept linear growth model t ij
71 Cirrhosis trial Hazard model lnh rj = lnh 0 rj + λη (2) rj + α 4 x j lnh 0 Baseline hazard a cubic polynomial in t
72 Cirrhosis trial Reduced form hazard model lnh rj = lnh 0 (3) rj + [λγ 2 + α 4 ]x j + λη j where α 4 direct effect λγ 2 indirect effect Effect of latent marker on hazard: λ
73 Cirrhosis trial Results: Direct treatment effect: ˆ α 4 = 0.18, Indirect treatment effect (via latent marker): ˆ λ ˆ γ 2 = 0.25, 95%CI :(0.08,0.41) Total treatment effect: ˆ λ ˆ γ 2 + ˆ α 4 = 0.07, 95%CI :( 0.42,0.06) 95%CI :( 0.22,0.35) Treatment directly helpful in reducing hazard, but with negative side effect on liver functioning
74 Cohort and time in sample effects Context: survey has panels or cohorts recruited at different waves Repeated measures have different means and variances, depending on cohort Within a cohort, measures show unexplained trends
75 Example: ITC four country survey Design of the ITC four-country survey Longitudinal within countries: Canada, U.S., U.K., Australia Approximately annual waves: Wave 1 in late 2002, Wave 4 in Fall 2005 International comparisons RDD telephone survey Stratified random sampling with approximately proportional allocation Retention per wave has varied between 80% and 60% (lowest for U.S.) Sample lost is replenished with fresh cohort at each wave, with the same design as the initial one
76 Example: ITC 4C Noticed Information About Dangers of Smoking
77 Missing data Item non-response Loss to follow-up (attrition) MCAR, MAR, informative missingness Discarding Imputation Weighting Modelling Full Information Maximum Likelihood
78 Part III: Adaptations to complex surveys Features: Longitudinal survey designs Survey weights Effects of design on parameter interpretations Effects of design on precision Models with complex likelihoods
79 Example: ITC South East Asia Sampling design in Thailand and Malaysia: Stratification into regions (TH), zones (MY) Two stage sampling of households in large urban areas Three stage sampling of households in rural areas Households sampled from 125 clusters in each country
80 Survey Weights: Definitions initial weight equal to the inverse of the inclusion probability of the unit final weight initial weight adjusted for nonresponse, poststratification and/or benchmarking interpreted as the number of units in the population that the sample unit represents
81 Example: ITC South-East Asia In Malaysia, survey weights for adults are calibrated by age, sex and ethnicity within zones In Thailand, survey weights for adult smokers are calibrated to assumed numbers of smokers in regions, by age and sex
82 Effects of design on parameter interpretations SRS supports iid (independent and identical distribution) assumption the assumption is not supported in complex surveys because of correlations induced by the sampling design or because of the population structure blindly applying standard programs to the analysis can lead to incorrect results
83 Example: ITC South-East Asia Models estimated with design information and without tend to be different We could interpret this as meaning that the corresponding parameters belong to different actual or hypothetical populations
84 General Effect of Complex Surveys on Precision stratification decreases variability (more precise than SRS) clustering increases variability (less precise than SRS) overall, the multistage design has the effect of increasing variability (less precise than SRS)
85 Example: ITC South-East Asia Effects of design on precision: Estimates of proportions show a large design effect Similarly for estimates of logistic regression intercepts Estimates of logistic regression slopes tend to show very little design effect
86 Longitudinal survey design effects Skinner and Vieira (2005) Design effects tend to be larger for parameters of longitudinal models than for parameters of cross-sectional models
87 Estimation of Variance or Precision variance estimation with complex multistage cluster sample design: exact formula for variance estimation is often too complex; use of an approximate approach required NOTE: taking account of the design in variance estimation is as crucial as using the sampling weights for the estimation of a statistic
88 Some Approximate Methods Taylor series methods Resampling methods Balanced Repeated Replication (BRR) Jackknife Bootstrap
89 Assumptions The resulting distribution of a test statistic is based on having a large sample size with the following properties the total number of first stage sampled clusters (or primary sampling units) is assumed large the primary sample size in each stratum is small but the number of strata is large OR the number of primary units in a stratum is large no survey weight is disproportionately large
90 Possible Violations of Assumptions a large-scale survey was done but inferences are desired for small subpopulations stratification in which a few strata (or just one) have very large sampling fractions compared to the rest of the strata the sampling design was uneven, resulting in large variability in the sampling weights
91 Resampling Methods the variance of an estimated parameter can be estimated by taking a large number of independent samples from the original sample each new sample, called a resample, is used to estimate the parameter the variability among the resulting estimates is used to estimate the variance of the full-sample estimate covariance between two different parameter estimates is obtained from the covariance in replicates resampling methods differ in the way the resamples are built
92 Estimating function paradigm Census estimating equation: N φ i (y i ;θ) = 0 i=1 e.g. logistic regression: N x i (y i μ i ) = 0, μ i = e xit β /(1+ e xit β ) i=1
93 Estimating functions paradigm Sample estimating equation: φ s = w i φ i = 0 i s e.g. logistic regression: w i x i (y i μ i ) = 0, μ i = e xit β /(1+ e xit β ) i s
94 Estimating function paradigm Approximate pivot: OR ˆ θ s θ D s 1 φ s, φ s v(φ s ) D s = φ s θ v( ˆ θ s ) = ˆ D s 1 ν(φ s )( ˆ D s 1 ) T
95 Survival models Continuous time, proportional hazards model Complex survey, sampling at discrete times References: Binder (1992) Biometrika Lin (2000) Biometrika Lawless and Boudreau (2006) Canadian Journal of Statistics Rubin-Bleuer (2006)
96 Interaction of sampling and censoring Lawless (2003) If censoring is not informative, we condition on being present at each epoch Weighted Kaplan-Meier estimate assuming T and censoring time C are independent and each iid given : x ˆ λ (t; x ) = i s w i (t) 1 = π(x i ) P(C i t x i,r i ) R i w i (t)d i (t) i s an indicator for inclusion of i in s w i (t)y i (t)
97 Ignoring unobserved heterogeneity Muthén and Masyn (2004) Baseline hazard probabilities biased downward Time-independent covariate effects underestimated Spurious time-dependent effects for observed variables
98 Models with complex likelihoods Measurement error models Models incorporating dropout, missingness Partial likelihood models Multilevel models
99 Adapting the paradigm Replace census sums by weighted sample sums E.g. multilevel model: m logl(θ) = w j log (exp{ w i j j=1 n j i=1 log f (y ij x ij,η j ;θ)}).φ(η j x ij ;ψ)dη j
100 Weighting and replication Construct an artificial population by inflating the sample Perform the census analysis and adjust -- or repeatedly resample from the artificial population
101 Resampling methods Resampling can mirror the original design Artificial population construction and resampling can be captured in bootstrap weights
102 Latent transition models Collins, L. M. and Wugalter, S. E. (1992) Measurement error, measurement model E.g. state=quit intention classification, measured through multiple indicators (questionnaire items, etc.) Hidden Markov model methods Complex survey extension: pseudolikelihood Resampling to estimate standard errors cf Patterson, Dayton, Graubard (2002): latent class analysis in complex surveys
103 Example: linear growth curve analysis Llabre et al (2004) Recovery from stressors might be useful for understanding the relation between hostility and hypertension or CPD 167 adults aged 25-54, 74 women, 93 men Psychosocial variable: Cook-Medley Hostility Inventory Baseline systolic blood pressure (SBP)
104 Example: an LGC analysis Speech tasks (responding to wrongful accusation of shoplifting) Cold pressor (foot in icy water) SBP taken part way into tasks; 3 recovery readings 2 minutes apart SBP equation Y ij = π 0 j + π 1 j t ij + π 2 j t ij 2 + r ij
105 Example: an LGC analysis Individual coefficient dependence on hostility π 0 j = β 00 + β 01 h j + u 0 j π 1 j = β 10 + β 11 h j + u 1 j π 2 j = β 20 + β 21 h j + u 2 j Fit using Mplus, code provided Hostility predicted the coefficients for the speech stressor, not the cold pressor
106 Example: An LGC Analysis Multi-group analysis: allow parameters for men and women to be different, then constrain them to be equal, and test whether the fit deteriorates
107 Part IV: Software scan (partial) SUDAAN SAS SPSS Splus and R Stata, gllamm Mplus WinBUGS HLM,MlWin, LISREL, AMOS,
108 Checklist Designs accommodated Point estimation methods Variance estimation methods Combinability with bootstrap Treatment of subpopulations Treatment of lonely PSUs Treatment of missing data Numerical methods; accuracy; speed
109 Models available Logistic regression Poisson regression Survivor function estimation Proportional hazards models GEE QIF Mixed models; GLMM State space models
110 STATA defining the sampling design: svyset example svyset [pweight=indiv_wt], strata(newstrata) psu(ea) vce(linear) output: pweight: indiv_wt VCE: linearized Strata 1: newstrata SU 1: ea FPC 1: <zero>
111 R: survey package define the sampling design: svydesign wk1de<svydesign(id=~ea,strata=~newstrata,weight= ~indiv_wt,nest=t,data=work1) output > summary(wk1de) Stratified 1 - level Cluster Sampling design With (1860) clusters. svydesign(id = ~ea, strata = ~newstrata, weight = ~indiv_wt, nest = T, data = work1)
112 Syntax STATA: svy: estimate Example: least squares estimation svyset [pweight=indiv_wt], strata(newstrata) psu(ea) svy: regress dbmi bmi R: svy***(*, design, data=,...) Example: least squares estimation wk2de<svydesign(id=~ea,strata=~newstrata,weight=~indi v_wt,nest=t,data=work2) svyglm(dbmi~bmi, data=work2,design=wk2de)
113 GEE: Generalized Estimating Equations Dependent or response variable well-being measured on a 0 to 10 scale Independent or explanatory variables Has responsibility for a child under age 12 (yes = 1, no = 2) gender( male = 1, female = 2 marital status (married = 1, separated = 2, divorced = 3, never married = 5 [widowed removed from the dataset]) employment status (employed = 1, unemployed = 2, family care = 3) Stata syntax tsset pid year, yearly xi: xtgee wellbe i.mlstat i.job i.child i.sex [pweight = axrwght], family(poisson) link(identity) corr(exchangeable)
114 GEE Results Semi-robust wellbe Coef. Std. Err. z P> z [95% Conf. Interval] _Imlstat_ _Imlstat_ _Imlstat_ _Ichild_ _Ijobc_ _Ijobc_ _cons
115 For each type of marital status Married Semi-robust wellbe Coef. Std. Err. z P> z [95% Conf. Interval] _Ichild_ _Ijobc_ _Ijobc_ _cons Separated or divorced _Ichild_ _Ijobc_ _Ijobc_ _cons Never married _Ichild_ _Ijobc_ _Ijobc_ _cons
116 Random effects, categorical data, large complex surveys Grilli and Pratesi (2004) Weighted estimation in multilevel ordinal and binary models in the presence of informative sampling designs SAS proc nlmixed Accommodation of weights through a replication option
117 Haynes et al (2005): HILDA survey 3755 women followed for 3 waves Employment states: ft, pt, not_e Time dependent explanatory variable: log π itj = X it β j + α ij, j =1,2 π it 3 gllamm (AGQ) and WinBUGS (MCMC) gave similar results, both very slow (56 hours vs 42 hours)
118 Other reviews Links from urvey-soft/ Skrondal and Rabe-Hesketh (2004) Chantal, K. and Suchindran, C. (2005)
Survey Data Analysis in Stata
Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP Stata Conference DC 2009 J. Pitblado (StataCorp) Survey Data Analysis DC 2009 1 / 44 Outline 1 Types of
Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD
Tips for surviving the analysis of survival data Philip Twumasi-Ankrah, PhD Big picture In medical research and many other areas of research, we often confront continuous, ordinal or dichotomous outcomes
Survey Data Analysis in Stata
Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP 2009 Canadian Stata Users Group Meeting Outline 1 Types of data 2 2 Survey data characteristics 4 2.1 Single
Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.
Analyzing Complex Survey Data: Some key issues to be aware of Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 24, 2015 Rather than repeat material that is
Sampling Error Estimation in Design-Based Analysis of the PSID Data
Technical Series Paper #11-05 Sampling Error Estimation in Design-Based Analysis of the PSID Data Steven G. Heeringa, Patricia A. Berglund, Azam Khan Survey Research Center, Institute for Social Research
Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS
Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS Philip Merrigan ESG-UQAM, CIRPÉE Using Big Data to Study Development and Social Change, Concordia University, November 2103 Intro Longitudinal
Overview. Longitudinal Data Variation and Correlation Different Approaches. Linear Mixed Models Generalized Linear Mixed Models
Overview 1 Introduction Longitudinal Data Variation and Correlation Different Approaches 2 Mixed Models Linear Mixed Models Generalized Linear Mixed Models 3 Marginal Models Linear Models Generalized Linear
The American Cancer Society Cancer Prevention Study I: 12-Year Followup
Chapter 3 The American Cancer Society Cancer Prevention Study I: 12-Year Followup of 1 Million Men and Women David M. Burns, Thomas G. Shanks, Won Choi, Michael J. Thun, Clark W. Heath, Jr., and Lawrence
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses
Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses G. Gordon Brown, Celia R. Eicheldinger, and James R. Chromy RTI International, Research Triangle Park, NC 27709 Abstract
Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
Problem of Missing Data
VASA Mission of VA Statisticians Association (VASA) Promote & disseminate statistical methodological research relevant to VA studies; Facilitate communication & collaboration among VA-affiliated statisticians;
Life Table Analysis using Weighted Survey Data
Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using
Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer
Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Patricia A. Berglund, Institute for Social Research - University of Michigan Wisconsin and Illinois SAS User s Group June 25, 2014 1 Overview
Statistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.
An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao
Study Design and Statistical Analysis
Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing [email protected]
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing [email protected] IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way
Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data
Youth Risk Behavior Survey (YRBS) Software for Analysis of YRBS Data CONTENTS Overview 1 Background 1 1. SUDAAN 2 1.1. Analysis capabilities 2 1.2. Data requirements 2 1.3. Variance estimation 2 1.4. Survey
The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data ABSTRACT INTRODUCTION SURVEY DESIGN 101 WHY STRATIFY?
The SURVEYFREQ Procedure in SAS 9.2: Avoiding FREQuent Mistakes When Analyzing Survey Data Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health, ABSTRACT
Chapter 19 Statistical analysis of survey data. Abstract
Chapter 9 Statistical analysis of survey data James R. Chromy Research Triangle Institute Research Triangle Park, North Carolina, USA Savitri Abeyasekera The University of Reading Reading, UK Abstract
Software for Analysis of YRBS Data
Youth Risk Behavior Surveillance System (YRBSS) Software for Analysis of YRBS Data June 2014 Where can I get more information? Visit www.cdc.gov/yrbss or call 800 CDC INFO (800 232 4636). CONTENTS Overview
Handling missing data in Stata a whirlwind tour
Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled
Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University
Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University 1 Outline Missing data definitions Longitudinal data specific issues Methods Simple methods Multiple
A Population Based Risk Algorithm for the Development of Type 2 Diabetes: in the United States
A Population Based Risk Algorithm for the Development of Type 2 Diabetes: Validation of the Diabetes Population Risk Tool (DPoRT) in the United States Christopher Tait PhD Student Canadian Society for
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program
Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department of Mathematics and Statistics Degree Level Expectations, Learning Outcomes, Indicators of
THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH
HENK SWINKELS (STATISTICS NETHERLANDS) BRUCE JONAS (US NATIONAL CENTER FOR HEALTH STATISTICS) JAAP VAN DEN BERG (STATISTICS NETHERLANDS) THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH IN THE
Organizing Your Approach to a Data Analysis
Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize
10. Analysis of Longitudinal Studies Repeat-measures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA
m ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA Jiahe Qian, Educational Testing Service Rosedale Road, MS 02-T, Princeton, NJ 08541 Key Words" Complex sampling, NAEP data,
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
Introduction to mixed model and missing data issues in longitudinal studies
Introduction to mixed model and missing data issues in longitudinal studies Hélène Jacqmin-Gadda INSERM, U897, Bordeaux, France Inserm workshop, St Raphael Outline of the talk I Introduction Mixed models
Mortality Assessment Technology: A New Tool for Life Insurance Underwriting
Mortality Assessment Technology: A New Tool for Life Insurance Underwriting Guizhou Hu, MD, PhD BioSignia, Inc, Durham, North Carolina Abstract The ability to more accurately predict chronic disease morbidity
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)
How to choose an analysis to handle missing data in longitudinal observational studies
How to choose an analysis to handle missing data in longitudinal observational studies ICH, 25 th February 2015 Ian White MRC Biostatistics Unit, Cambridge, UK Plan Why are missing data a problem? Methods:
Complex Survey Design Using Stata
Complex Survey Design Using Stata 2010 This document provides a basic overview of how to handle complex survey design using Stata. Probability weighting and compensating for clustered and stratified samples
INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES
INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES Hukum Chandra Indian Agricultural Statistics Research Institute, New Delhi-110012 1. INTRODUCTION A sample survey is a process for collecting
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
A Composite Likelihood Approach to Analysis of Survey Data with Sampling Weights Incorporated under Two-Level Models
A Composite Likelihood Approach to Analysis of Survey Data with Sampling Weights Incorporated under Two-Level Models Grace Y. Yi 13, JNK Rao 2 and Haocheng Li 1 1. University of Waterloo, Waterloo, Canada
Randomized trials versus observational studies
Randomized trials versus observational studies The case of postmenopausal hormone therapy and heart disease Miguel Hernán Harvard School of Public Health www.hsph.harvard.edu/causal Joint work with James
Department of Epidemiology and Public Health Miller School of Medicine University of Miami
Department of Epidemiology and Public Health Miller School of Medicine University of Miami BST 630 (3 Credit Hours) Longitudinal and Multilevel Data Wednesday-Friday 9:00 10:15PM Course Location: CRB 995
Biostatistics: Types of Data Analysis
Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics [email protected] http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS
Gamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
Guide to Biostatistics
MedPage Tools Guide to Biostatistics Study Designs Here is a compilation of important epidemiologic and common biostatistical terms used in medical research. You can use it as a reference guide when reading
Introduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
Chapter 11 Introduction to Survey Sampling and Analysis Procedures
Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter Table of Contents OVERVIEW...149 SurveySampling...150 SurveyDataAnalysis...151 DESIGN INFORMATION FOR SURVEY PROCEDURES...152
An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China
An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China Ding Ding, San Diego State University, San Diego, CA ABSTRACT Finding predictors
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional
A Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
Northumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
Redistributional impact of the National Health Insurance System
Redistributional impact of the National Health Insurance System in France: A microsimulation approach Valrie Albouy (INSEE) Laurent Davezies (INSEE-CREST-IRDES) Thierry Debrand (IRDES) Brussels, 4-5 March
CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
Missing Data & How to Deal: An overview of missing data. Melissa Humphries Population Research Center
Missing Data & How to Deal: An overview of missing data Melissa Humphries Population Research Center Goals Discuss ways to evaluate and understand missing data Discuss common missing data methods Know
Missing data and net survival analysis Bernard Rachet
Workshop on Flexible Models for Longitudinal and Survival Data with Applications in Biostatistics Warwick, 27-29 July 2015 Missing data and net survival analysis Bernard Rachet General context Population-based,
Multiple Imputation for Missing Data: A Cautionary Tale
Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust
CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA
Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Assignments Analysis of Longitudinal data: a multilevel approach
Assignments Analysis of Longitudinal data: a multilevel approach Frans E.S. Tan Department of Methodology and Statistics University of Maastricht The Netherlands Maastricht, Jan 2007 Correspondence: Frans
Elementary Statistics
Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,
VI. Introduction to Logistic Regression
VI. Introduction to Logistic Regression We turn our attention now to the topic of modeling a categorical outcome as a function of (possibly) several factors. The framework of generalized linear models
Case-Control Studies. Sukon Kanchanaraksa, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
A course on Longitudinal data analysis : what did we learn? COMPASS 11/03/2011
A course on Longitudinal data analysis : what did we learn? COMPASS 11/03/2011 1 Abstract We report back on methods used for the analysis of longitudinal data covered by the course We draw lessons for
2009 Mississippi Youth Tobacco Survey. Office of Health Data and Research Office of Tobacco Control Mississippi State Department of Health
9 Mississippi Youth Tobacco Survey Office of Health Data and Research Office of Tobacco Control Mississippi State Department of Health Acknowledgements... 1 Glossary... 2 Introduction... 3 Sample Design
National Longitudinal Study of Adolescent Health. Strategies to Perform a Design-Based Analysis Using the Add Health Data
National Longitudinal Study of Adolescent Health Strategies to Perform a Design-Based Analysis Using the Add Health Data Kim Chantala Joyce Tabor Carolina Population Center University of North Carolina
Generalized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
Data Mining Introduction
Data Mining Introduction Bob Stine Dept of Statistics, School University of Pennsylvania www-stat.wharton.upenn.edu/~stine What is data mining? An insult? Predictive modeling Large, wide data sets, often
[This document contains corrections to a few typos that were found on the version available through the journal s web page]
Online supplement to Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67,
Regression Modeling Strategies
Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions
Two Tools for the Analysis of Longitudinal Data: Motivations, Applications and Issues
Two Tools for the Analysis of Longitudinal Data: Motivations, Applications and Issues Vern Farewell Medical Research Council Biostatistics Unit, UK Flexible Models for Longitudinal and Survival Data Warwick,
How to use SAS for Logistic Regression with Correlated Data
How to use SAS for Logistic Regression with Correlated Data Oliver Kuss Institute of Medical Epidemiology, Biostatistics, and Informatics Medical Faculty, University of Halle-Wittenberg, Halle/Saale, Germany
Multinomial and Ordinal Logistic Regression
Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,
Calculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: [email protected]) Varna University of Economics, Bulgaria ABSTRACT The
NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia. Produced by: National Cardiovascular Intelligence Network (NCVIN)
NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia Produced by: National Cardiovascular Intelligence Network (NCVIN) Date: August 2015 About Public Health England Public Health England
Poisson Models for Count Data
Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the
Comparison of Estimation Methods for Complex Survey Data Analysis
Comparison of Estimation Methods for Complex Survey Data Analysis Tihomir Asparouhov 1 Muthen & Muthen Bengt Muthen 2 UCLA 1 Tihomir Asparouhov, Muthen & Muthen, 3463 Stoner Ave. Los Angeles, CA 90066.
Logistic (RLOGIST) Example #3
Logistic (RLOGIST) Example #3 SUDAAN Statements and Results Illustrated PREDMARG (predicted marginal proportion) CONDMARG (conditional marginal proportion) PRED_EFF pairwise comparison COND_EFF pairwise
UNDERGRADUATE DEGREE DETAILS : BACHELOR OF SCIENCE WITH
QATAR UNIVERSITY COLLEGE OF ARTS & SCIENCES Department of Mathematics, Statistics, & Physics UNDERGRADUATE DEGREE DETAILS : Program Requirements and Descriptions BACHELOR OF SCIENCE WITH A MAJOR IN STATISTICS
How to set the main menu of STATA to default factory settings standards
University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be
The Research Data Centres Information and Technical Bulletin
Catalogue no. 12-002 X No. 2014001 ISSN 1710-2197 The Research Data Centres Information and Technical Bulletin Winter 2014, vol. 6 no. 1 How to obtain more information For information about this product
13. Poisson Regression Analysis
136 Poisson Regression Analysis 13. Poisson Regression Analysis We have so far considered situations where the outcome variable is numeric and Normally distributed, or binary. In clinical work one often
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
Guideline on missing data in confirmatory clinical trials
2 July 2010 EMA/CPMP/EWP/1776/99 Rev. 1 Committee for Medicinal Products for Human Use (CHMP) Guideline on missing data in confirmatory clinical trials Discussion in the Efficacy Working Party June 1999/
Randomization in Clinical Trials
in Clinical Trials Versio.0 May 2011 1. Simple 2. Block randomization 3. Minimization method Stratification RELATED ISSUES 1. Accidental Bias 2. Selection Bias 3. Prognostic Factors 4. Random selection
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables
Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables Introduction In the summer of 2002, a research study commissioned by the Center for Student
PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Elizabeth Comino Centre fo Primary Health Care and Equity 12-Aug-2015
PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (http://bmjopen.bmj.com/site/about/resources/checklist.pdf)
Missing Data. Paul D. Allison INTRODUCTION
4 Missing Data Paul D. Allison INTRODUCTION Missing data are ubiquitous in psychological research. By missing data, I mean data that are missing for some (but not all) variables and for some (but not all)
Big Data for Population Health and Personalised Medicine through EMR Linkages
Big Data for Population Health and Personalised Medicine through EMR Linkages Zheng-Ming CHEN Professor of Epidemiology Nuffield Dept. of Population Health, University of Oxford Big Data for Health Policy
CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA
Examples: Mixture Modeling With Longitudinal Data CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA Mixture modeling refers to modeling with categorical latent variables that represent subpopulations
