Bayesian modeling of inseparable space-time variation in disease risk

Size: px
Start display at page:

Download "Bayesian modeling of inseparable space-time variation in disease risk"

Transcription

1 Bayesian modeling of inseparable space-time variation in disease risk Leonhard Knorr-Held Laina Mercer Department of Statistics UW May 23, 2013

2 Motivation Area and time-specific disease rates Area and time-specific disease rates are of great interest for health care and policy purposes Facilitate effective allocation of resources and targeted interventions Sample size often too small at granular space-time scale for reliable estimates Bayesian approach to borrow strength over space and time to improve reliability 2

3 Motivation Ohio Lung Cancer Example Lung Cancer Mortality Rates 1972 [0,1.5) [1.5,3.01) [3.01,4.51) [4.51,6.01] 3

4 Motivation Ohio Lung Cancer Example Ohio Lung cancer mortality by county ages Raw Lung Cancer Deaths per Year 4

5 Background Previous approaches Previous works have used a Hierarchical Bayesian framework to expand purely spatial models by Besag, York, and Mollie (1991) to a space time framework. Bernardinelli et al. (1995) Area-specific intercept and temporal trends All temporal trends assumed linear Waller et al. (1997) Spatial model for each time point No spatial main effects Knorr-Held and Besag (1998) Included spatial and temporal main effects Does not allow for space time interactions 5

6 Motivation Ohio Lung Cancer Example Ohio lung cancer mortality by county Lung Cancer Deaths per 1, Year 6

7 Motivation Ohio Lung Cancer Example Ohio lung cancer mortality by county Lung Cancer Deaths per 1, Year 7

8 The Model The Set Up Address the situation where the disease variation cannot be separated into temporal and spatial main effects and spatio-temporal interactions become and important feature. n it - persons at risk in county i at time t. y it - cases or deaths in county i at time t y it π it Bin(n it, π it ) ( ) η it = log πit 1 π it 8

9 The Model Stage 1 η it = µ + α t + γ t + θ i + φ i + δ it µ - overall risk level α t - temporally structured effect of time t γ t - independent effect of time t θ i - spatially structured effect of county i φ i - independent effect of county i δ it - space time interaction 9

10 The Model Stage 2 - Exchangeable Effects The exchangeable effects γ (for time) and φ (for space) we are assigned multivariate Gaussian priors with mean zero and precision matrix λk: ( ) p(γ λ γ ) N 0, 1 λ γ Kγ 1 p(φ λ φ ) N where K γ = I TxT and K φ = I kxk. ( ) 0, 1 λ φ K 1 φ 10

11 The Model Stage 2 - First Order Random Walk Temporally structured effect of time α t is assigned a random walk. ( ) 1 α t α t, λ α N 2 (α t 1 + α t+1 ), 1/(2λ α ) Also represented as p(α λ α ) exp ( λα 2 αt K α α ) where K α =

12 The Model Stage 2 - Intrinsic Autoregressive (ICAR) Spatially structured effect of area θ i is assigned θ i θ i, λ θ N 1 1 θ j, m i m i λ θ j:j i where m i is the # of neighbors. ( The improper ) joint distribution can be written as p(θ λ θ ) exp λ θ 2 θ T K θ θ, where m i i = j K θ,ij = 1 i j 0 otherwise 12

13 The Model Stage 2 - iid prior for δ Type I Independent multivariate Gaussian prior the joint distribution is δ it δ it, λ δ N (0, 1/λ δ ) ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ where K δ = K φ K γ = I kt kt. 13

14 The Model Stage 2 - Random Walk prior for δ Type II Temporal trends differ by area - first order random walk prior ( ) 1 δ it δ it, λ δ N 2 (δ i,t 1 + δ i,t+1 ), 1/(2λ δ ) The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ where K δ = K φ K α (rank k(t 1)). 14

15 The Model Stage 2 - ICAR prior for δ Type III Spatial trends differ over time - Intrinsic autoregression prior δ it δ it, λ δ N 1 1 δ jt, m i m i λ δ j:j i The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ Where K δ = K θ K γ (rank (k 1)T ). 15

16 The Model Stage 2 - Space-Time prior for δ Type IV Spatio-temporal interaction - conditional depends on first and second order neighbors δ it δ ( it, λ δ 1 N 2 (δ i,t 1 + δ i,t+1 ) + 1 m i δ jt 1 m i j:j i j:j i (δ j,t 1 + δ j,t+1 ), 1 2m i λ δ ) The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ Where K δ = K θ K α (rank (k 1)(T 1)). 16

17 The Model Stage 3 - Hyperpriors Hyperparameters were all assigned λ Gamma(1, 0.01) resulting in a convenient posterior distribution. For example the full conditional for λ δ is: λ δ δ Gamma ( rank(k δ ), δ k δ δ ) where rank(k δ ) depends on the interaction type. 17

18 Model Evaluation Posterior Deviance To compare fit and complexity of each model the saturated deviance was calculated based on 2,500 samples from the posterior. D (s) = 2 where π (s) n i=1 t=1 ( T y itlog ( ) it = exp η (s) it ( 1+exp η (s) it ). y it n it π (s) it ) + (n it y it )log n it y ( it ) n it 1 π (s) it 18

19 The Model Implementation Knorr-Held (2000) employed Markov chain Monte Carlo to sample from the implied posterior distributions. Univariate Metropolis steps were applied for each parameter and hyperparameters were updated with samples from their full conditionals. MCMC in R: an update for every parameter in an interaction model takes s. Note: in INLA the main effects and interaction models type I-III all fit in less than 20min. 19

20 Ohio Lung Cancer Posterior Distribution of the Deviance Main Effects Posterior Deviance Type I Posterior Deviance Type II Posterior Deviance Type III Posterior Deviance Type IV Posterior Deviance 20

21 Ohio Lung Cancer Posterior Distribution of the Deviance Model Median Mean IQR SD Main Effects Type I Type II Type III Type IV Table: Laina s values in black and Knorr-Held (2000) in gray. 21

22 Ohio Lung Cancer Type I interaction Ohio lung cancer mortality by county Type I Interactions Lung Cancer Deaths per 1, Year 22

23 Ohio Lung Cancer Type II interaction Ohio lung cancer mortality by county Type II Interactions Lung Cancer Deaths per 1, Year 23

24 Knorr-Held (2000) Conclusions & Critique Provided and motivated a flexible approach for modeling space-time data. Was thin on MCMC details and diagnostics. Did not motivate use of deviance over DIC or p D for model selection. Focussed exclusively on non-parametric smoothing approaches. No discussion of incorporating covariates. 24

25 Thank you! Thank you all for your feedback and support throughout the quarter. Specifically, I would like to thank: William for suggesting a hair cut and shaded plots, Bob for suggesting enthusiasm, and Jon for suggesting this paper! 25

26 References Bernardinelli, L., D. Clayton, C. Pascutto, C. Montomoli, M. Ghislandi, and M. Songini (1995). Bayesian analysis of space-time variation in disease risk. Statistics in Medicine 14(21-22), Besag, J., J. York, and A. Mollie (1991). Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics 43, Knorr-Held, L. (2000). Bayesian modelling of inseperable space-time variation in disease risk. Statistics in Medicine 19, Knorr-Held, L. and J. Besag (1998). Modelling risk from a disease in time and space. Statistics in Medicine 17, Knorr-Held, L. and H. Rue (2002). On block updating in markov random field models for disease mapping. Scandinavian Journal of Statistics 29, Rue, H. and L. Held (2005). Gaussian Markov Random Fields Theory and Applications. Number 104 in Monographs on Statistics and Applied Probability. Chapman and Hall. Schrodle, B. and L. Held (2010). Spatio-temporal disease mapping using inla. Environmetrics. Waller, L. A., B. P. Carlin, H. Xia, and A. E. Gelfand (1997). Hierarchical spatio-temporal mapping of disease rates. Journal of the American Statistical Association 92(438),

27 Ohio Lung Cancer Type III interaction Ohio lung cancer mortality by county Type III Interactions Lung Cancer Deaths per 1, Year 27

28 Ohio Lung Cancer Type IV interaction Ohio lung cancer mortality by county Type IV Interactions Lung Cancer Deaths per 1, Year 28

29 Knorr-Held (2000) Next Steps A logical next step was to make fitting these models much faster. Knorr-Held and Rue (2002) introduced block updating Rue and Held (2005) great overview of Gaussian Markov Random Fields and more details on block updating Schrodle and Held (2010) describes (poorly) how to fit models from Knorr-Held (2000) in INLA. 29

BayesX - Software for Bayesian Inference in Structured Additive Regression

BayesX - Software for Bayesian Inference in Structured Additive Regression BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich

More information

Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data

Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data Brian J. Smith, Ph.D. The University of Iowa Joint Statistical Meetings August 10,

More information

MODELLING AND ANALYSIS OF

MODELLING AND ANALYSIS OF MODELLING AND ANALYSIS OF FOREST FIRE IN PORTUGAL - PART I Giovani L. Silva CEAUL & DMIST - Universidade Técnica de Lisboa gsilva@math.ist.utl.pt Maria Inês Dias & Manuela Oliveira CIMA & DM - Universidade

More information

11. Time series and dynamic linear models

11. Time series and dynamic linear models 11. Time series and dynamic linear models Objective To introduce the Bayesian approach to the modeling and forecasting of time series. Recommended reading West, M. and Harrison, J. (1997). models, (2 nd

More information

Disease Mapping. Chapter 14. 14.1 Background

Disease Mapping. Chapter 14. 14.1 Background Chapter 14 Disease Mapping 14.1 Background The mapping of disease incidence and prevalence has long been a part of public health, epidemiology, and the study of disease in human populations (Koch, 2005).

More information

Gaussian Processes to Speed up Hamiltonian Monte Carlo

Gaussian Processes to Speed up Hamiltonian Monte Carlo Gaussian Processes to Speed up Hamiltonian Monte Carlo Matthieu Lê Murray, Iain http://videolectures.net/mlss09uk_murray_mcmc/ Rasmussen, Carl Edward. "Gaussian processes to speed up hybrid Monte Carlo

More information

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data

More information

Bayesian inference for population prediction of individuals without health insurance in Florida

Bayesian inference for population prediction of individuals without health insurance in Florida Bayesian inference for population prediction of individuals without health insurance in Florida Neung Soo Ha 1 1 NISS 1 / 24 Outline Motivation Description of the Behavioral Risk Factor Surveillance System,

More information

Bayesian Inference on a Cox Process Associated with a Dirichlet Process

Bayesian Inference on a Cox Process Associated with a Dirichlet Process International Journal of Computer Applications 97 8887 Volume 9 - No. 8, June ayesian Inference on a Cox Process Associated with a Dirichlet Process Larissa Valmy LAMIA EA Université des Antilles-Guyane

More information

More details on the inputs, functionality, and output can be found below.

More details on the inputs, functionality, and output can be found below. Overview: The SMEEACT (Software for More Efficient, Ethical, and Affordable Clinical Trials) web interface (http://research.mdacc.tmc.edu/smeeactweb) implements a single analysis of a two-armed trial comparing

More information

Parallelization Strategies for Multicore Data Analysis

Parallelization Strategies for Multicore Data Analysis Parallelization Strategies for Multicore Data Analysis Wei-Chen Chen 1 Russell Zaretzki 2 1 University of Tennessee, Dept of EEB 2 University of Tennessee, Dept. Statistics, Operations, and Management

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct

More information

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

A Bayesian hierarchical surrogate outcome model for multiple sclerosis A Bayesian hierarchical surrogate outcome model for multiple sclerosis 3 rd Annual ASA New Jersey Chapter / Bayer Statistics Workshop David Ohlssen (Novartis), Luca Pozzi and Heinz Schmidli (Novartis)

More information

An explicit link between Gaussian fields and Gaussian Markov random fields; the stochastic partial differential equation approach

An explicit link between Gaussian fields and Gaussian Markov random fields; the stochastic partial differential equation approach Intro B, W, M, & R SPDE/GMRF Example End An explicit link between Gaussian fields and Gaussian Markov random fields; the stochastic partial differential equation approach Finn Lindgren 1 Håvard Rue 1 Johan

More information

HT2015: SC4 Statistical Data Mining and Machine Learning

HT2015: SC4 Statistical Data Mining and Machine Learning HT2015: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Bayesian Nonparametrics Parametric vs Nonparametric

More information

Sampling for Bayesian computation with large datasets

Sampling for Bayesian computation with large datasets Sampling for Bayesian computation with large datasets Zaiying Huang Andrew Gelman April 27, 2005 Abstract Multilevel models are extremely useful in handling large hierarchical datasets. However, computation

More information

Tutorial on Markov Chain Monte Carlo

Tutorial on Markov Chain Monte Carlo Tutorial on Markov Chain Monte Carlo Kenneth M. Hanson Los Alamos National Laboratory Presented at the 29 th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Technology,

More information

Applications of R Software in Bayesian Data Analysis

Applications of R Software in Bayesian Data Analysis Article International Journal of Information Science and System, 2012, 1(1): 7-23 International Journal of Information Science and System Journal homepage: www.modernscientificpress.com/journals/ijinfosci.aspx

More information

Bayesian Statistics: Indian Buffet Process

Bayesian Statistics: Indian Buffet Process Bayesian Statistics: Indian Buffet Process Ilker Yildirim Department of Brain and Cognitive Sciences University of Rochester Rochester, NY 14627 August 2012 Reference: Most of the material in this note

More information

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science

More information

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships

More information

Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data

Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data Thomas Hermann Mark H. Hansen Helge Ritter Faculty of Technology Bell Laboratories Faculty of Technology Bielefeld

More information

STAT3016 Introduction to Bayesian Data Analysis

STAT3016 Introduction to Bayesian Data Analysis STAT3016 Introduction to Bayesian Data Analysis Course Description The Bayesian approach to statistics assigns probability distributions to both the data and unknown parameters in the problem. This way,

More information

Dirichlet Processes A gentle tutorial

Dirichlet Processes A gentle tutorial Dirichlet Processes A gentle tutorial SELECT Lab Meeting October 14, 2008 Khalid El-Arini Motivation We are given a data set, and are told that it was generated from a mixture of Gaussian distributions.

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software MMMMMM YYYY, Volume VV, Issue II. http://www.jstatsoft.org/ sptimer: Spatio-Temporal Bayesian Modeling Using R K.S. Bakar Yale University, USA S.K. Sahu University of

More information

Probabilistic Methods for Time-Series Analysis

Probabilistic Methods for Time-Series Analysis Probabilistic Methods for Time-Series Analysis 2 Contents 1 Analysis of Changepoint Models 1 1.1 Introduction................................ 1 1.1.1 Model and Notation....................... 2 1.1.2 Example:

More information

Financial TIme Series Analysis: Part II

Financial TIme Series Analysis: Part II Department of Mathematics and Statistics, University of Vaasa, Finland January 29 February 13, 2015 Feb 14, 2015 1 Univariate linear stochastic models: further topics Unobserved component model Signal

More information

Health Insurance Estimates for States 1

Health Insurance Estimates for States 1 Health Insurance Estimates for States 1 Robin Fisher and Jennifer Campbell, U.S. Census Bureau, HHES-SAEB, FB 3 Room 1462 Washington, DC, 20233 Phone: (301) 763-5897, email: robin.c.fisher@census.gov Key

More information

How To Understand The Theory Of Probability

How To Understand The Theory Of Probability Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL

More information

Bayesian Factorization Machines

Bayesian Factorization Machines Bayesian Factorization Machines Christoph Freudenthaler, Lars Schmidt-Thieme Information Systems & Machine Learning Lab University of Hildesheim 31141 Hildesheim {freudenthaler, schmidt-thieme}@ismll.de

More information

Statistics & Probability PhD Research. 15th November 2014

Statistics & Probability PhD Research. 15th November 2014 Statistics & Probability PhD Research 15th November 2014 1 Statistics Statistical research is the development and application of methods to infer underlying structure from data. Broad areas of statistics

More information

Time Series Analysis

Time Series Analysis JUNE 2012 Time Series Analysis CONTENT A time series is a chronological sequence of observations on a particular variable. Usually the observations are taken at regular intervals (days, months, years),

More information

Imperfect Debugging in Software Reliability

Imperfect Debugging in Software Reliability Imperfect Debugging in Software Reliability Tevfik Aktekin and Toros Caglar University of New Hampshire Peter T. Paul College of Business and Economics Department of Decision Sciences and United Health

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) quantile diff (error)

> plot(exp.btgpllm, main = treed GP LLM,, proj = c(1)) > plot(exp.btgpllm, main = treed GP LLM,, proj = c(2)) quantile diff (error) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) 0.4 0.2 0.0 0.2 0.4 treed GP LLM, mean treed GP LLM, 0.00 0.05 0.10 0.15 0.20 x1 x1 0.4

More information

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x, Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus

More information

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way

More information

HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA

HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA By Kenneth E. Shirley, Dylan S. Small, Kevin G. Lynch, Stephen A. Maisto, and David W. Oslin Columbia University, University of Pennsylvania and

More information

The Chinese Restaurant Process

The Chinese Restaurant Process COS 597C: Bayesian nonparametrics Lecturer: David Blei Lecture # 1 Scribes: Peter Frazier, Indraneel Mukherjee September 21, 2007 In this first lecture, we begin by introducing the Chinese Restaurant Process.

More information

Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting

Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting Model with Open Source Software: and Friends Dr. Heiko Frings Mathematical isk Consulting Bern, 01.09.2011 Agenda in a Friends Model with & Friends o o o Overview First instance: An Extreme Value Example

More information

Spatial modelling of claim frequency and claim size in non-life insurance

Spatial modelling of claim frequency and claim size in non-life insurance Spatial modelling of claim frequency and claim size in non-life insurance Susanne Gschlößl Claudia Czado April 13, 27 Abstract In this paper models for claim frequency and average claim size in non-life

More information

An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment

An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment Hideki Asoh 1, Masanori Shiro 1 Shotaro Akaho 1, Toshihiro Kamishima 1, Koiti Hasida 1, Eiji Aramaki 2, and Takahide

More information

EE 570: Location and Navigation

EE 570: Location and Navigation EE 570: Location and Navigation On-Line Bayesian Tracking Aly El-Osery 1 Stephen Bruder 2 1 Electrical Engineering Department, New Mexico Tech Socorro, New Mexico, USA 2 Electrical and Computer Engineering

More information

Bayesian prediction of disability insurance frequencies using economic indicators

Bayesian prediction of disability insurance frequencies using economic indicators Bayesian prediction of disability insurance frequencies using economic indicators Catherine Donnelly Heriot-Watt University, Edinburgh, UK Mario V. Wüthrich ETH Zurich, RisLab, Department of Mathematics,

More information

Bayesian Statistics in One Hour. Patrick Lam

Bayesian Statistics in One Hour. Patrick Lam Bayesian Statistics in One Hour Patrick Lam Outline Introduction Bayesian Models Applications Missing Data Hierarchical Models Outline Introduction Bayesian Models Applications Missing Data Hierarchical

More information

Hierarchical Models in Environmental Science

Hierarchical Models in Environmental Science Hierarchical Models in Environmental Science Christopher K. Wikle July 2002 Accepted for: International Statistical Review Abstract Environmental systems are complicated. They include very intricate spatio-temporal

More information

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc. An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao

More information

Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter

Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter Loughborough University Institutional Repository Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter This item was submitted to Loughborough

More information

Handling attrition and non-response in longitudinal data

Handling attrition and non-response in longitudinal data Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein

More information

Introduction to Bayesian Analysis Using SAS R Software

Introduction to Bayesian Analysis Using SAS R Software Introduction to Bayesian Analysis Using SAS R Software Joseph G. Ibrahim Department of Biostatistics University of North Carolina Introduction to Bayesian statistics Outline 1 Introduction to Bayesian

More information

Logistic Regression (a type of Generalized Linear Model)

Logistic Regression (a type of Generalized Linear Model) Logistic Regression (a type of Generalized Linear Model) 1/36 Today Review of GLMs Logistic Regression 2/36 How do we find patterns in data? We begin with a model of how the world works We use our knowledge

More information

240ST014 - Data Analysis of Transport and Logistics

240ST014 - Data Analysis of Transport and Logistics Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 240 - ETSEIB - Barcelona School of Industrial Engineering 715 - EIO - Department of Statistics and Operations Research MASTER'S

More information

Gaussian Processes in Machine Learning

Gaussian Processes in Machine Learning Gaussian Processes in Machine Learning Carl Edward Rasmussen Max Planck Institute for Biological Cybernetics, 72076 Tübingen, Germany carl@tuebingen.mpg.de WWW home page: http://www.tuebingen.mpg.de/ carl

More information

NetSurv & Data Viewer

NetSurv & Data Viewer NetSurv & Data Viewer Prototype space-time analysis and visualization software from TerraSeer Dunrie Greiling, TerraSeer Inc. TerraSeer Software sales BoundarySeer for boundary detection and analysis ClusterSeer

More information

Bayesian Analysis of Comparative Survey Data

Bayesian Analysis of Comparative Survey Data Bayesian Analysis of Comparative Survey Data Bruce Western 1 Filiz Garip Princeton University April 2005 1 Department of Sociology, Princeton University, Princeton NJ 08544. We thank Sara Curran for making

More information

CURRICULUM VITAE. 1 Higher Education. 2 Employment DANI GAMERMAN

CURRICULUM VITAE. 1 Higher Education. 2 Employment DANI GAMERMAN CURRICULUM VITAE DANI GAMERMAN Date of birth: 30/10/1957 Nationality: Brazilian Postal address: Instituto de Matemática - UFRJ Caixa Postal 68530, 21945-970 Rio de Janeiro, RJ, Brazil email address: dani@im.ufrj.br

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

Validation of Software for Bayesian Models Using Posterior Quantiles

Validation of Software for Bayesian Models Using Posterior Quantiles Validation of Software for Bayesian Models Using Posterior Quantiles Samantha R. COOK, Andrew GELMAN, and Donald B. RUBIN This article presents a simulation-based method designed to establish the computational

More information

Bayesian Statistics for Small Area Estimation

Bayesian Statistics for Small Area Estimation Bayesian Statistics for Small Area Estimation V. Gómez-Rubio, N. Best, S. Richardson, G. Li Department of Epidemiology and Public Health Imperial College London St. Mary s Campus, Norfolk Place W2 1PG

More information

Gaussian Conjugate Prior Cheat Sheet

Gaussian Conjugate Prior Cheat Sheet Gaussian Conjugate Prior Cheat Sheet Tom SF Haines 1 Purpose This document contains notes on how to handle the multivariate Gaussian 1 in a Bayesian setting. It focuses on the conjugate prior, its Bayesian

More information

Applying MCMC Methods to Multi-level Models submitted by William J Browne for the degree of PhD of the University of Bath 1998 COPYRIGHT Attention is drawn tothefactthatcopyright of this thesis rests with

More information

Bayesian credibility methods for pricing non-life insurance on individual claims history

Bayesian credibility methods for pricing non-life insurance on individual claims history Bayesian credibility methods for pricing non-life insurance on individual claims history Daniel Eliasson Masteruppsats i matematisk statistik Master Thesis in Mathematical Statistics Masteruppsats 215:1

More information

Estimation and comparison of multiple change-point models

Estimation and comparison of multiple change-point models Journal of Econometrics 86 (1998) 221 241 Estimation and comparison of multiple change-point models Siddhartha Chib* John M. Olin School of Business, Washington University, 1 Brookings Drive, Campus Box

More information

Curriculum Vitae of Francesco Bartolucci

Curriculum Vitae of Francesco Bartolucci Curriculum Vitae of Francesco Bartolucci Department of Economics, Finance and Statistics University of Perugia Via A. Pascoli, 20 06123 Perugia (IT) email: bart@stat.unipg.it http://www.stat.unipg.it/bartolucci

More information

MAN-BITES-DOG BUSINESS CYCLES ONLINE APPENDIX

MAN-BITES-DOG BUSINESS CYCLES ONLINE APPENDIX MAN-BITES-DOG BUSINESS CYCLES ONLINE APPENDIX KRISTOFFER P. NIMARK The next section derives the equilibrium expressions for the beauty contest model from Section 3 of the main paper. This is followed by

More information

Imputing Values to Missing Data

Imputing Values to Missing Data Imputing Values to Missing Data In federated data, between 30%-70% of the data points will have at least one missing attribute - data wastage if we ignore all records with a missing value Remaining data

More information

Prediction of Speciated Particulate Matter and Bias Assessment of Numerical Output Data

Prediction of Speciated Particulate Matter and Bias Assessment of Numerical Output Data Prediction of Speciated Particulate Matter and Bias Assessment of Numerical Output Data L.B. Smith 1 *, M. F. Fuentes 1, B.J. Reich 1, B.K. Eder 2 1 Department of Statistics, North Carolina State University,

More information

Predicting the Present with Bayesian Structural Time Series

Predicting the Present with Bayesian Structural Time Series Predicting the Present with Bayesian Structural Time Series Steven L. Scott Hal Varian June 28, 2013 Abstract This article describes a system for short term forecasting based on an ensemble prediction

More information

Master programme in Statistics

Master programme in Statistics Master programme in Statistics Björn Holmquist 1 1 Department of Statistics Lund University Cramérsällskapets årskonferens, 2010-03-25 Master programme Vad är ett Master programme? Breddmaster vs Djupmaster

More information

Measures of explained variance and pooling in multilevel models

Measures of explained variance and pooling in multilevel models Measures of explained variance and pooling in multilevel models Andrew Gelman, Columbia University Iain Pardoe, University of Oregon Contact: Iain Pardoe, Charles H. Lundquist College of Business, University

More information

Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires. Exact sampling of Gibbs distributions with submodular energies

Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires. Exact sampling of Gibbs distributions with submodular energies Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires Exact sampling of Gibbs distributions with submodular energies Marc Sigelle 1 and Jérôme Darbon 2 1 Institut TELECOM TELECOM ParisTech

More information

A Basic Introduction to Missing Data

A Basic Introduction to Missing Data John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item

More information

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University caizhua@gmail.com

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University caizhua@gmail.com Bayesian Machine Learning (ML): Modeling And Inference in Big Data Zhuhua Cai Google Rice University caizhua@gmail.com 1 Syllabus Bayesian ML Concepts (Today) Bayesian ML on MapReduce (Next morning) Bayesian

More information

Incorporating cost in Bayesian Variable Selection, with application to cost-effective measurement of quality of health care.

Incorporating cost in Bayesian Variable Selection, with application to cost-effective measurement of quality of health care. Incorporating cost in Bayesian Variable Selection, with application to cost-effective measurement of quality of health care University of Florida 10th Annual Winter Workshop: Bayesian Model Selection and

More information

Nonparametric Prior Elicitation using the Roulette Method

Nonparametric Prior Elicitation using the Roulette Method Nonparametric Prior Elicitation using the Roulette Method Jeremy E. Oakley, Alireza Daneshkhah 2 and Anthony O Hagan School of Mathematics and Statistics, University of Sheffield, UK and 2 Department of

More information

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Validation of Software for Bayesian Models using Posterior Quantiles Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Abstract We present a simulation-based method designed to establish that software

More information

Bayesian Variable Selection in Normal Regression Models

Bayesian Variable Selection in Normal Regression Models Institut für Angewandte Statistik Johannes Kepler Universität Linz Bayesian Variable Selection in Normal Regression Models Masterarbeit zur Erlangung des akademischen Grades Master der Statistik im Masterstudium

More information

Analysis of Financial Time Series

Analysis of Financial Time Series Analysis of Financial Time Series Analysis of Financial Time Series Financial Econometrics RUEY S. TSAY University of Chicago A Wiley-Interscience Publication JOHN WILEY & SONS, INC. This book is printed

More information

Section 5. Stan for Big Data. Bob Carpenter. Columbia University

Section 5. Stan for Big Data. Bob Carpenter. Columbia University Section 5. Stan for Big Data Bob Carpenter Columbia University Part I Overview Scaling and Evaluation data size (bytes) 1e18 1e15 1e12 1e9 1e6 Big Model and Big Data approach state of the art big model

More information

Fahrmeir, Lang, Wolff, Bender: Semiparametric Bayesian Time-Space Analysis of Unemployment Duration

Fahrmeir, Lang, Wolff, Bender: Semiparametric Bayesian Time-Space Analysis of Unemployment Duration Fahrmeir, Lang, Wolff, Bender: Semiparametric Bayesian Time-Space Analysis of Unemployment Duration Sonderforschungsbereich 386, Paper 211 (2000) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner

More information

Bayesian Phylogeny and Measures of Branch Support

Bayesian Phylogeny and Measures of Branch Support Bayesian Phylogeny and Measures of Branch Support Bayesian Statistics Imagine we have a bag containing 100 dice of which we know that 90 are fair and 10 are biased. The

More information

Monte Carlo Methods and Models in Finance and Insurance

Monte Carlo Methods and Models in Finance and Insurance Chapman & Hall/CRC FINANCIAL MATHEMATICS SERIES Monte Carlo Methods and Models in Finance and Insurance Ralf Korn Elke Korn Gerald Kroisandt f r oc) CRC Press \ V^ J Taylor & Francis Croup ^^"^ Boca Raton

More information

MODELLING CRITICAL ILLNESS INSURANCE DATA

MODELLING CRITICAL ILLNESS INSURANCE DATA MODELLING CRITICAL ILLNESS INSURANCE DATA Howard Waters Joint work with: Erengul Dodd (Ozkok), George Streftaris, David Wilkie University of Piraeus, October 2014 1 Plan: 1. Critical Illness Insurance

More information

Inference on Phase-type Models via MCMC

Inference on Phase-type Models via MCMC Inference on Phase-type Models via MCMC with application to networks of repairable redundant systems Louis JM Aslett and Simon P Wilson Trinity College Dublin 28 th June 202 Toy Example : Redundant Repairable

More information

Package CoImp. February 19, 2015

Package CoImp. February 19, 2015 Title Copula based imputation method Date 2014-03-01 Version 0.2-3 Package CoImp February 19, 2015 Author Francesca Marta Lilja Di Lascio, Simone Giannerini Depends R (>= 2.15.2), methods, copula Imports

More information

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases Andreas Züfle Geo Spatial Data Huge flood of geo spatial data Modern technology New user mentality Great research potential

More information

Model Selection and Claim Frequency for Workers Compensation Insurance

Model Selection and Claim Frequency for Workers Compensation Insurance Model Selection and Claim Frequency for Workers Compensation Insurance Jisheng Cui, David Pitt and Guoqi Qian Abstract We consider a set of workers compensation insurance claim data where the aggregate

More information

Traffic Driven Analysis of Cellular Data Networks

Traffic Driven Analysis of Cellular Data Networks Traffic Driven Analysis of Cellular Data Networks Samir R. Das Computer Science Department Stony Brook University Joint work with Utpal Paul, Luis Ortiz (Stony Brook U), Milind Buddhikot, Anand Prabhu

More information

Dirichlet Process Gaussian Mixture Models: Choice of the Base Distribution

Dirichlet Process Gaussian Mixture Models: Choice of the Base Distribution Görür D, Rasmussen CE. Dirichlet process Gaussian mixture models: Choice of the base distribution. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 5(4): 615 66 July 010/DOI 10.1007/s11390-010-1051-1 Dirichlet

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH

TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH Joint Stascal Meengs - Bayesian Stascal Science TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH AMITABHA BHAUMIK, DIPAK K. DEY AND NALINI RAVISHANKER Department of Stascs,

More information

How To Solve A Save Problem With Time T

How To Solve A Save Problem With Time T Bayesian Functional Data Analysis for Computer Model Validation Fei Liu Institute of Statistics and Decision Sciences Duke University Transition Workshop on Development, Assessment and Utilization of Complex

More information

Monte Carlo-based statistical methods (MASM11/FMS091)

Monte Carlo-based statistical methods (MASM11/FMS091) Monte Carlo-based statistical methods (MASM11/FMS091) Jimmy Olsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February 5, 2013 J. Olsson Monte Carlo-based

More information

Psychology 209. Longitudinal Data Analysis and Bayesian Extensions Fall 2012

Psychology 209. Longitudinal Data Analysis and Bayesian Extensions Fall 2012 Instructor: Psychology 209 Longitudinal Data Analysis and Bayesian Extensions Fall 2012 Sarah Depaoli (sdepaoli@ucmerced.edu) Office Location: SSM 312A Office Phone: (209) 228-4549 (although email will

More information

A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data

A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data Faming Liang University of Florida August 9, 2015 Abstract MCMC methods have proven to be a very powerful tool for analyzing

More information

AALBORG UNIVERSITY. An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants

AALBORG UNIVERSITY. An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants AALBORG UNIVERSITY An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants by Jesper Møller, Anthony N. Pettitt, Kasper K. Berthelsen and Robert W. Reeves

More information

Sections 2.11 and 5.8

Sections 2.11 and 5.8 Sections 211 and 58 Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1/25 Gesell data Let X be the age in in months a child speaks his/her first word and

More information

APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING

APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING Sulaimon Mutiu O. Department of Statistics & Mathematics Moshood Abiola Polytechnic, Abeokuta, Ogun State, Nigeria. Abstract

More information