Bayesian modeling of inseparable space-time variation in disease risk



Similar documents
BayesX - Software for Bayesian Inference in Structured Additive Regression

11. Time series and dynamic linear models

Gaussian Processes to Speed up Hamiltonian Monte Carlo

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Bayesian inference for population prediction of individuals without health insurance in Florida

More details on the inputs, functionality, and output can be found below.

Parallelization Strategies for Multicore Data Analysis

Statistics Graduate Courses

STA 4273H: Statistical Machine Learning

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

An explicit link between Gaussian fields and Gaussian Markov random fields; the stochastic partial differential equation approach

HT2015: SC4 Statistical Data Mining and Machine Learning

Sampling for Bayesian computation with large datasets

Tutorial on Markov Chain Monte Carlo

Applications of R Software in Bayesian Data Analysis

Bayesian Statistics: Indian Buffet Process

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data

STAT3016 Introduction to Bayesian Data Analysis

Dirichlet Processes A gentle tutorial

Journal of Statistical Software

Probabilistic Methods for Time-Series Analysis

Financial TIme Series Analysis: Part II

How To Understand The Theory Of Probability

Bayesian Factorization Machines

Statistics & Probability PhD Research. 15th November 2014

Time Series Analysis

Imperfect Debugging in Software Reliability

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) quantile diff (error)

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA

The Chinese Restaurant Process

Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting

Spatial modelling of claim frequency and claim size in non-life insurance

An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment

EE 570: Location and Navigation

Bayesian prediction of disability insurance frequencies using economic indicators

Bayesian Statistics in One Hour. Patrick Lam

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides:

Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter

Handling attrition and non-response in longitudinal data

Introduction to Bayesian Analysis Using SAS R Software

Logistic Regression (a type of Generalized Linear Model)

240ST014 - Data Analysis of Transport and Logistics

Gaussian Processes in Machine Learning

NetSurv & Data Viewer

CURRICULUM VITAE. 1 Higher Education. 2 Employment DANI GAMERMAN

Multivariate Normal Distribution

Validation of Software for Bayesian Models Using Posterior Quantiles

Gaussian Conjugate Prior Cheat Sheet


Bayesian credibility methods for pricing non-life insurance on individual claims history

Estimation and comparison of multiple change-point models

Curriculum Vitae of Francesco Bartolucci

Imputing Values to Missing Data

Predicting the Present with Bayesian Structural Time Series

Master programme in Statistics

Measures of explained variance and pooling in multilevel models

Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires. Exact sampling of Gibbs distributions with submodular energies

A Basic Introduction to Missing Data

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

Incorporating cost in Bayesian Variable Selection, with application to cost-effective measurement of quality of health care.

Nonparametric Prior Elicitation using the Roulette Method

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT

Bayesian Variable Selection in Normal Regression Models

Analysis of Financial Time Series

Section 5. Stan for Big Data. Bob Carpenter. Columbia University

Bayesian Phylogeny and Measures of Branch Support

Monte Carlo Methods and Models in Finance and Insurance

MODELLING CRITICAL ILLNESS INSURANCE DATA

Inference on Phase-type Models via MCMC

Package CoImp. February 19, 2015

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle

Model Selection and Claim Frequency for Workers Compensation Insurance

Traffic Driven Analysis of Cellular Data Networks

Dirichlet Process Gaussian Mixture Models: Choice of the Base Distribution

Introduction to General and Generalized Linear Models

TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH

How To Solve A Save Problem With Time T

Monte Carlo-based statistical methods (MASM11/FMS091)

Psychology 209. Longitudinal Data Analysis and Bayesian Extensions Fall 2012

A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data

Sections 2.11 and 5.8

APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING

Transcription:

Bayesian modeling of inseparable space-time variation in disease risk Leonhard Knorr-Held Laina Mercer Department of Statistics UW May 23, 2013

Motivation Area and time-specific disease rates Area and time-specific disease rates are of great interest for health care and policy purposes Facilitate effective allocation of resources and targeted interventions Sample size often too small at granular space-time scale for reliable estimates Bayesian approach to borrow strength over space and time to improve reliability 2

Motivation Ohio Lung Cancer Example Lung Cancer Mortality Rates 1972 [0,1.5) [1.5,3.01) [3.01,4.51) [4.51,6.01] 3

Motivation Ohio Lung Cancer Example Ohio Lung cancer mortality by county 1968 1988 ages 55 64 Raw Lung Cancer Deaths per 1000 0 1 2 3 4 5 6 1968 1970 1972 1974 1976 1978 1980 1982 1984 1986 1988 Year 4

Background Previous approaches Previous works have used a Hierarchical Bayesian framework to expand purely spatial models by Besag, York, and Mollie (1991) to a space time framework. Bernardinelli et al. (1995) Area-specific intercept and temporal trends All temporal trends assumed linear Waller et al. (1997) Spatial model for each time point No spatial main effects Knorr-Held and Besag (1998) Included spatial and temporal main effects Does not allow for space time interactions 5

Motivation Ohio Lung Cancer Example Ohio lung cancer mortality by county Lung Cancer Deaths per 1,000 0 1 2 3 4 5 6 1970 1975 1980 1985 Year 6

Motivation Ohio Lung Cancer Example Ohio lung cancer mortality by county Lung Cancer Deaths per 1,000 1.5 2.0 2.5 1970 1975 1980 1985 Year 7

The Model The Set Up Address the situation where the disease variation cannot be separated into temporal and spatial main effects and spatio-temporal interactions become and important feature. n it - persons at risk in county i at time t. y it - cases or deaths in county i at time t y it π it Bin(n it, π it ) ( ) η it = log πit 1 π it 8

The Model Stage 1 η it = µ + α t + γ t + θ i + φ i + δ it µ - overall risk level α t - temporally structured effect of time t γ t - independent effect of time t θ i - spatially structured effect of county i φ i - independent effect of county i δ it - space time interaction 9

The Model Stage 2 - Exchangeable Effects The exchangeable effects γ (for time) and φ (for space) we are assigned multivariate Gaussian priors with mean zero and precision matrix λk: ( ) p(γ λ γ ) N 0, 1 λ γ Kγ 1 p(φ λ φ ) N where K γ = I TxT and K φ = I kxk. ( ) 0, 1 λ φ K 1 φ 10

The Model Stage 2 - First Order Random Walk Temporally structured effect of time α t is assigned a random walk. ( ) 1 α t α t, λ α N 2 (α t 1 + α t+1 ), 1/(2λ α ) Also represented as p(α λ α ) exp ( λα 2 αt K α α ) where 1 1 1 2 1 K α =.......... 1 2 1 1 1 11

The Model Stage 2 - Intrinsic Autoregressive (ICAR) Spatially structured effect of area θ i is assigned θ i θ i, λ θ N 1 1 θ j, m i m i λ θ j:j i where m i is the # of neighbors. ( The improper ) joint distribution can be written as p(θ λ θ ) exp λ θ 2 θ T K θ θ, where m i i = j K θ,ij = 1 i j 0 otherwise 12

The Model Stage 2 - iid prior for δ Type I Independent multivariate Gaussian prior the joint distribution is δ it δ it, λ δ N (0, 1/λ δ ) ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ where K δ = K φ K γ = I kt kt. 13

The Model Stage 2 - Random Walk prior for δ Type II Temporal trends differ by area - first order random walk prior ( ) 1 δ it δ it, λ δ N 2 (δ i,t 1 + δ i,t+1 ), 1/(2λ δ ) The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ where K δ = K φ K α (rank k(t 1)). 14

The Model Stage 2 - ICAR prior for δ Type III Spatial trends differ over time - Intrinsic autoregression prior δ it δ it, λ δ N 1 1 δ jt, m i m i λ δ j:j i The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ Where K δ = K θ K γ (rank (k 1)T ). 15

The Model Stage 2 - Space-Time prior for δ Type IV Spatio-temporal interaction - conditional depends on first and second order neighbors δ it δ ( it, λ δ 1 N 2 (δ i,t 1 + δ i,t+1 ) + 1 m i δ jt 1 m i j:j i j:j i (δ j,t 1 + δ j,t+1 ), 1 2m i λ δ ) The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ Where K δ = K θ K α (rank (k 1)(T 1)). 16

The Model Stage 3 - Hyperpriors Hyperparameters were all assigned λ Gamma(1, 0.01) resulting in a convenient posterior distribution. For example the full conditional for λ δ is: λ δ δ Gamma ( 1 + 0.5 rank(k δ ), 0.01 + 0.5 δ k δ δ ) where rank(k δ ) depends on the interaction type. 17

Model Evaluation Posterior Deviance To compare fit and complexity of each model the saturated deviance was calculated based on 2,500 samples from the posterior. D (s) = 2 where π (s) n i=1 t=1 ( T y itlog ( ) it = exp η (s) it ( 1+exp η (s) it ). y it n it π (s) it ) + (n it y it )log n it y ( it ) n it 1 π (s) it 18

The Model Implementation Knorr-Held (2000) employed Markov chain Monte Carlo to sample from the implied posterior distributions. Univariate Metropolis steps were applied for each parameter and hyperparameters were updated with samples from their full conditionals. MCMC in R: an update for every parameter in an interaction model takes 0.1-0.2s. Note: in INLA the main effects and interaction models type I-III all fit in less than 20min. 19

Ohio Lung Cancer Posterior Distribution of the Deviance Main Effects 0.000 0.010 0.020 1950 2000 2050 2100 2150 2200 2250 Posterior Deviance Type I 0.000 0.010 0.020 1950 2000 2050 2100 2150 2200 2250 Posterior Deviance Type II 0.000 0.010 0.020 1950 2000 2050 2100 2150 2200 2250 Posterior Deviance Type III 0.000 0.010 0.020 1950 2000 2050 2100 2150 2200 2250 Posterior Deviance Type IV 0.000 0.010 0.020 1950 2000 2050 2100 2150 2200 2250 Posterior Deviance 20

Ohio Lung Cancer Posterior Distribution of the Deviance Model Median Mean IQR SD Main Effects 2187 2187 18.6 13.7 2187 2187 18.5 13.9 Type I 2086 2084 48.4 34.9 2083 2082 48.6 35.9 Type II 2073 2073 35.3 25.5 2071 2071 36.6 27.0 Type III 2144 2143 32.7 24.1 2142 2141 32.8 24.8 Type IV 2096 2096 36.2 26.0 2106 2106 32.8 24.7 Table: Laina s values in black and Knorr-Held (2000) in gray. 21

Ohio Lung Cancer Type I interaction Ohio lung cancer mortality by county Type I Interactions Lung Cancer Deaths per 1,000 1.5 2.0 2.5 1970 1975 1980 1985 Year 22

Ohio Lung Cancer Type II interaction Ohio lung cancer mortality by county Type II Interactions Lung Cancer Deaths per 1,000 1.5 2.0 2.5 1970 1975 1980 1985 Year 23

Knorr-Held (2000) Conclusions & Critique Provided and motivated a flexible approach for modeling space-time data. Was thin on MCMC details and diagnostics. Did not motivate use of deviance over DIC or p D for model selection. Focussed exclusively on non-parametric smoothing approaches. No discussion of incorporating covariates. 24

Thank you! Thank you all for your feedback and support throughout the quarter. Specifically, I would like to thank: William for suggesting a hair cut and shaded plots, Bob for suggesting enthusiasm, and Jon for suggesting this paper! 25

References Bernardinelli, L., D. Clayton, C. Pascutto, C. Montomoli, M. Ghislandi, and M. Songini (1995). Bayesian analysis of space-time variation in disease risk. Statistics in Medicine 14(21-22), 2433 2443. Besag, J., J. York, and A. Mollie (1991). Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics 43, 1 59. Knorr-Held, L. (2000). Bayesian modelling of inseperable space-time variation in disease risk. Statistics in Medicine 19, 2555 2567. Knorr-Held, L. and J. Besag (1998). Modelling risk from a disease in time and space. Statistics in Medicine 17, 2045 2060. Knorr-Held, L. and H. Rue (2002). On block updating in markov random field models for disease mapping. Scandinavian Journal of Statistics 29, 597 614. Rue, H. and L. Held (2005). Gaussian Markov Random Fields Theory and Applications. Number 104 in Monographs on Statistics and Applied Probability. Chapman and Hall. Schrodle, B. and L. Held (2010). Spatio-temporal disease mapping using inla. Environmetrics. Waller, L. A., B. P. Carlin, H. Xia, and A. E. Gelfand (1997). Hierarchical spatio-temporal mapping of disease rates. Journal of the American Statistical Association 92(438), 607 617. 26

Ohio Lung Cancer Type III interaction Ohio lung cancer mortality by county Type III Interactions Lung Cancer Deaths per 1,000 1.5 2.0 2.5 1970 1975 1980 1985 Year 27

Ohio Lung Cancer Type IV interaction Ohio lung cancer mortality by county Type IV Interactions Lung Cancer Deaths per 1,000 1.5 2.0 2.5 1970 1975 1980 1985 Year 28

Knorr-Held (2000) Next Steps A logical next step was to make fitting these models much faster. Knorr-Held and Rue (2002) introduced block updating Rue and Held (2005) great overview of Gaussian Markov Random Fields and more details on block updating Schrodle and Held (2010) describes (poorly) how to fit models from Knorr-Held (2000) in INLA. 29