Bayesian modeling of inseparable space-time variation in disease risk
|
|
|
- Mariah Eustacia Small
- 9 years ago
- Views:
Transcription
1 Bayesian modeling of inseparable space-time variation in disease risk Leonhard Knorr-Held Laina Mercer Department of Statistics UW May 23, 2013
2 Motivation Area and time-specific disease rates Area and time-specific disease rates are of great interest for health care and policy purposes Facilitate effective allocation of resources and targeted interventions Sample size often too small at granular space-time scale for reliable estimates Bayesian approach to borrow strength over space and time to improve reliability 2
3 Motivation Ohio Lung Cancer Example Lung Cancer Mortality Rates 1972 [0,1.5) [1.5,3.01) [3.01,4.51) [4.51,6.01] 3
4 Motivation Ohio Lung Cancer Example Ohio Lung cancer mortality by county ages Raw Lung Cancer Deaths per Year 4
5 Background Previous approaches Previous works have used a Hierarchical Bayesian framework to expand purely spatial models by Besag, York, and Mollie (1991) to a space time framework. Bernardinelli et al. (1995) Area-specific intercept and temporal trends All temporal trends assumed linear Waller et al. (1997) Spatial model for each time point No spatial main effects Knorr-Held and Besag (1998) Included spatial and temporal main effects Does not allow for space time interactions 5
6 Motivation Ohio Lung Cancer Example Ohio lung cancer mortality by county Lung Cancer Deaths per 1, Year 6
7 Motivation Ohio Lung Cancer Example Ohio lung cancer mortality by county Lung Cancer Deaths per 1, Year 7
8 The Model The Set Up Address the situation where the disease variation cannot be separated into temporal and spatial main effects and spatio-temporal interactions become and important feature. n it - persons at risk in county i at time t. y it - cases or deaths in county i at time t y it π it Bin(n it, π it ) ( ) η it = log πit 1 π it 8
9 The Model Stage 1 η it = µ + α t + γ t + θ i + φ i + δ it µ - overall risk level α t - temporally structured effect of time t γ t - independent effect of time t θ i - spatially structured effect of county i φ i - independent effect of county i δ it - space time interaction 9
10 The Model Stage 2 - Exchangeable Effects The exchangeable effects γ (for time) and φ (for space) we are assigned multivariate Gaussian priors with mean zero and precision matrix λk: ( ) p(γ λ γ ) N 0, 1 λ γ Kγ 1 p(φ λ φ ) N where K γ = I TxT and K φ = I kxk. ( ) 0, 1 λ φ K 1 φ 10
11 The Model Stage 2 - First Order Random Walk Temporally structured effect of time α t is assigned a random walk. ( ) 1 α t α t, λ α N 2 (α t 1 + α t+1 ), 1/(2λ α ) Also represented as p(α λ α ) exp ( λα 2 αt K α α ) where K α =
12 The Model Stage 2 - Intrinsic Autoregressive (ICAR) Spatially structured effect of area θ i is assigned θ i θ i, λ θ N 1 1 θ j, m i m i λ θ j:j i where m i is the # of neighbors. ( The improper ) joint distribution can be written as p(θ λ θ ) exp λ θ 2 θ T K θ θ, where m i i = j K θ,ij = 1 i j 0 otherwise 12
13 The Model Stage 2 - iid prior for δ Type I Independent multivariate Gaussian prior the joint distribution is δ it δ it, λ δ N (0, 1/λ δ ) ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ where K δ = K φ K γ = I kt kt. 13
14 The Model Stage 2 - Random Walk prior for δ Type II Temporal trends differ by area - first order random walk prior ( ) 1 δ it δ it, λ δ N 2 (δ i,t 1 + δ i,t+1 ), 1/(2λ δ ) The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ where K δ = K φ K α (rank k(t 1)). 14
15 The Model Stage 2 - ICAR prior for δ Type III Spatial trends differ over time - Intrinsic autoregression prior δ it δ it, λ δ N 1 1 δ jt, m i m i λ δ j:j i The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ Where K δ = K θ K γ (rank (k 1)T ). 15
16 The Model Stage 2 - Space-Time prior for δ Type IV Spatio-temporal interaction - conditional depends on first and second order neighbors δ it δ ( it, λ δ 1 N 2 (δ i,t 1 + δ i,t+1 ) + 1 m i δ jt 1 m i j:j i j:j i (δ j,t 1 + δ j,t+1 ), 1 2m i λ δ ) The improper joint distribution can be expressed as ( ) p(δ λ δ ) exp λ δ 2 δ T K δ δ Where K δ = K θ K α (rank (k 1)(T 1)). 16
17 The Model Stage 3 - Hyperpriors Hyperparameters were all assigned λ Gamma(1, 0.01) resulting in a convenient posterior distribution. For example the full conditional for λ δ is: λ δ δ Gamma ( rank(k δ ), δ k δ δ ) where rank(k δ ) depends on the interaction type. 17
18 Model Evaluation Posterior Deviance To compare fit and complexity of each model the saturated deviance was calculated based on 2,500 samples from the posterior. D (s) = 2 where π (s) n i=1 t=1 ( T y itlog ( ) it = exp η (s) it ( 1+exp η (s) it ). y it n it π (s) it ) + (n it y it )log n it y ( it ) n it 1 π (s) it 18
19 The Model Implementation Knorr-Held (2000) employed Markov chain Monte Carlo to sample from the implied posterior distributions. Univariate Metropolis steps were applied for each parameter and hyperparameters were updated with samples from their full conditionals. MCMC in R: an update for every parameter in an interaction model takes s. Note: in INLA the main effects and interaction models type I-III all fit in less than 20min. 19
20 Ohio Lung Cancer Posterior Distribution of the Deviance Main Effects Posterior Deviance Type I Posterior Deviance Type II Posterior Deviance Type III Posterior Deviance Type IV Posterior Deviance 20
21 Ohio Lung Cancer Posterior Distribution of the Deviance Model Median Mean IQR SD Main Effects Type I Type II Type III Type IV Table: Laina s values in black and Knorr-Held (2000) in gray. 21
22 Ohio Lung Cancer Type I interaction Ohio lung cancer mortality by county Type I Interactions Lung Cancer Deaths per 1, Year 22
23 Ohio Lung Cancer Type II interaction Ohio lung cancer mortality by county Type II Interactions Lung Cancer Deaths per 1, Year 23
24 Knorr-Held (2000) Conclusions & Critique Provided and motivated a flexible approach for modeling space-time data. Was thin on MCMC details and diagnostics. Did not motivate use of deviance over DIC or p D for model selection. Focussed exclusively on non-parametric smoothing approaches. No discussion of incorporating covariates. 24
25 Thank you! Thank you all for your feedback and support throughout the quarter. Specifically, I would like to thank: William for suggesting a hair cut and shaded plots, Bob for suggesting enthusiasm, and Jon for suggesting this paper! 25
26 References Bernardinelli, L., D. Clayton, C. Pascutto, C. Montomoli, M. Ghislandi, and M. Songini (1995). Bayesian analysis of space-time variation in disease risk. Statistics in Medicine 14(21-22), Besag, J., J. York, and A. Mollie (1991). Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics 43, Knorr-Held, L. (2000). Bayesian modelling of inseperable space-time variation in disease risk. Statistics in Medicine 19, Knorr-Held, L. and J. Besag (1998). Modelling risk from a disease in time and space. Statistics in Medicine 17, Knorr-Held, L. and H. Rue (2002). On block updating in markov random field models for disease mapping. Scandinavian Journal of Statistics 29, Rue, H. and L. Held (2005). Gaussian Markov Random Fields Theory and Applications. Number 104 in Monographs on Statistics and Applied Probability. Chapman and Hall. Schrodle, B. and L. Held (2010). Spatio-temporal disease mapping using inla. Environmetrics. Waller, L. A., B. P. Carlin, H. Xia, and A. E. Gelfand (1997). Hierarchical spatio-temporal mapping of disease rates. Journal of the American Statistical Association 92(438),
27 Ohio Lung Cancer Type III interaction Ohio lung cancer mortality by county Type III Interactions Lung Cancer Deaths per 1, Year 27
28 Ohio Lung Cancer Type IV interaction Ohio lung cancer mortality by county Type IV Interactions Lung Cancer Deaths per 1, Year 28
29 Knorr-Held (2000) Next Steps A logical next step was to make fitting these models much faster. Knorr-Held and Rue (2002) introduced block updating Rue and Held (2005) great overview of Gaussian Markov Random Fields and more details on block updating Schrodle and Held (2010) describes (poorly) how to fit models from Knorr-Held (2000) in INLA. 29
BayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
11. Time series and dynamic linear models
11. Time series and dynamic linear models Objective To introduce the Bayesian approach to the modeling and forecasting of time series. Recommended reading West, M. and Harrison, J. (1997). models, (2 nd
Gaussian Processes to Speed up Hamiltonian Monte Carlo
Gaussian Processes to Speed up Hamiltonian Monte Carlo Matthieu Lê Murray, Iain http://videolectures.net/mlss09uk_murray_mcmc/ Rasmussen, Carl Edward. "Gaussian processes to speed up hybrid Monte Carlo
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
Bayesian inference for population prediction of individuals without health insurance in Florida
Bayesian inference for population prediction of individuals without health insurance in Florida Neung Soo Ha 1 1 NISS 1 / 24 Outline Motivation Description of the Behavioral Risk Factor Surveillance System,
More details on the inputs, functionality, and output can be found below.
Overview: The SMEEACT (Software for More Efficient, Ethical, and Affordable Clinical Trials) web interface (http://research.mdacc.tmc.edu/smeeactweb) implements a single analysis of a two-armed trial comparing
Parallelization Strategies for Multicore Data Analysis
Parallelization Strategies for Multicore Data Analysis Wei-Chen Chen 1 Russell Zaretzki 2 1 University of Tennessee, Dept of EEB 2 University of Tennessee, Dept. Statistics, Operations, and Management
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
A Bayesian hierarchical surrogate outcome model for multiple sclerosis
A Bayesian hierarchical surrogate outcome model for multiple sclerosis 3 rd Annual ASA New Jersey Chapter / Bayer Statistics Workshop David Ohlssen (Novartis), Luca Pozzi and Heinz Schmidli (Novartis)
An explicit link between Gaussian fields and Gaussian Markov random fields; the stochastic partial differential equation approach
Intro B, W, M, & R SPDE/GMRF Example End An explicit link between Gaussian fields and Gaussian Markov random fields; the stochastic partial differential equation approach Finn Lindgren 1 Håvard Rue 1 Johan
HT2015: SC4 Statistical Data Mining and Machine Learning
HT2015: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Bayesian Nonparametrics Parametric vs Nonparametric
Sampling for Bayesian computation with large datasets
Sampling for Bayesian computation with large datasets Zaiying Huang Andrew Gelman April 27, 2005 Abstract Multilevel models are extremely useful in handling large hierarchical datasets. However, computation
Tutorial on Markov Chain Monte Carlo
Tutorial on Markov Chain Monte Carlo Kenneth M. Hanson Los Alamos National Laboratory Presented at the 29 th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Technology,
Applications of R Software in Bayesian Data Analysis
Article International Journal of Information Science and System, 2012, 1(1): 7-23 International Journal of Information Science and System Journal homepage: www.modernscientificpress.com/journals/ijinfosci.aspx
Bayesian Statistics: Indian Buffet Process
Bayesian Statistics: Indian Buffet Process Ilker Yildirim Department of Brain and Cognitive Sciences University of Rochester Rochester, NY 14627 August 2012 Reference: Most of the material in this note
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science
CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data
Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data Thomas Hermann Mark H. Hansen Helge Ritter Faculty of Technology Bell Laboratories Faculty of Technology Bielefeld
STAT3016 Introduction to Bayesian Data Analysis
STAT3016 Introduction to Bayesian Data Analysis Course Description The Bayesian approach to statistics assigns probability distributions to both the data and unknown parameters in the problem. This way,
Dirichlet Processes A gentle tutorial
Dirichlet Processes A gentle tutorial SELECT Lab Meeting October 14, 2008 Khalid El-Arini Motivation We are given a data set, and are told that it was generated from a mixture of Gaussian distributions.
Journal of Statistical Software
JSS Journal of Statistical Software MMMMMM YYYY, Volume VV, Issue II. http://www.jstatsoft.org/ sptimer: Spatio-Temporal Bayesian Modeling Using R K.S. Bakar Yale University, USA S.K. Sahu University of
Probabilistic Methods for Time-Series Analysis
Probabilistic Methods for Time-Series Analysis 2 Contents 1 Analysis of Changepoint Models 1 1.1 Introduction................................ 1 1.1.1 Model and Notation....................... 2 1.1.2 Example:
Financial TIme Series Analysis: Part II
Department of Mathematics and Statistics, University of Vaasa, Finland January 29 February 13, 2015 Feb 14, 2015 1 Univariate linear stochastic models: further topics Unobserved component model Signal
How To Understand The Theory Of Probability
Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL
Bayesian Factorization Machines
Bayesian Factorization Machines Christoph Freudenthaler, Lars Schmidt-Thieme Information Systems & Machine Learning Lab University of Hildesheim 31141 Hildesheim {freudenthaler, schmidt-thieme}@ismll.de
Statistics & Probability PhD Research. 15th November 2014
Statistics & Probability PhD Research 15th November 2014 1 Statistics Statistical research is the development and application of methods to infer underlying structure from data. Broad areas of statistics
Time Series Analysis
JUNE 2012 Time Series Analysis CONTENT A time series is a chronological sequence of observations on a particular variable. Usually the observations are taken at regular intervals (days, months, years),
Imperfect Debugging in Software Reliability
Imperfect Debugging in Software Reliability Tevfik Aktekin and Toros Caglar University of New Hampshire Peter T. Paul College of Business and Economics Department of Decision Sciences and United Health
Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) quantile diff (error)
> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) 0.4 0.2 0.0 0.2 0.4 treed GP LLM, mean treed GP LLM, 0.00 0.05 0.10 0.15 0.20 x1 x1 0.4
GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,
Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing [email protected]
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing [email protected] IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way
HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA
HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA By Kenneth E. Shirley, Dylan S. Small, Kevin G. Lynch, Stephen A. Maisto, and David W. Oslin Columbia University, University of Pennsylvania and
The Chinese Restaurant Process
COS 597C: Bayesian nonparametrics Lecturer: David Blei Lecture # 1 Scribes: Peter Frazier, Indraneel Mukherjee September 21, 2007 In this first lecture, we begin by introducing the Chinese Restaurant Process.
Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting
Model with Open Source Software: and Friends Dr. Heiko Frings Mathematical isk Consulting Bern, 01.09.2011 Agenda in a Friends Model with & Friends o o o Overview First instance: An Extreme Value Example
Spatial modelling of claim frequency and claim size in non-life insurance
Spatial modelling of claim frequency and claim size in non-life insurance Susanne Gschlößl Claudia Czado April 13, 27 Abstract In this paper models for claim frequency and average claim size in non-life
An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment
An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment Hideki Asoh 1, Masanori Shiro 1 Shotaro Akaho 1, Toshihiro Kamishima 1, Koiti Hasida 1, Eiji Aramaki 2, and Takahide
EE 570: Location and Navigation
EE 570: Location and Navigation On-Line Bayesian Tracking Aly El-Osery 1 Stephen Bruder 2 1 Electrical Engineering Department, New Mexico Tech Socorro, New Mexico, USA 2 Electrical and Computer Engineering
Bayesian prediction of disability insurance frequencies using economic indicators
Bayesian prediction of disability insurance frequencies using economic indicators Catherine Donnelly Heriot-Watt University, Edinburgh, UK Mario V. Wüthrich ETH Zurich, RisLab, Department of Mathematics,
Bayesian Statistics in One Hour. Patrick Lam
Bayesian Statistics in One Hour Patrick Lam Outline Introduction Bayesian Models Applications Missing Data Hierarchical Models Outline Introduction Bayesian Models Applications Missing Data Hierarchical
An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.
An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao
Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter
Loughborough University Institutional Repository Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter This item was submitted to Loughborough
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
Introduction to Bayesian Analysis Using SAS R Software
Introduction to Bayesian Analysis Using SAS R Software Joseph G. Ibrahim Department of Biostatistics University of North Carolina Introduction to Bayesian statistics Outline 1 Introduction to Bayesian
Logistic Regression (a type of Generalized Linear Model)
Logistic Regression (a type of Generalized Linear Model) 1/36 Today Review of GLMs Logistic Regression 2/36 How do we find patterns in data? We begin with a model of how the world works We use our knowledge
240ST014 - Data Analysis of Transport and Logistics
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 240 - ETSEIB - Barcelona School of Industrial Engineering 715 - EIO - Department of Statistics and Operations Research MASTER'S
Gaussian Processes in Machine Learning
Gaussian Processes in Machine Learning Carl Edward Rasmussen Max Planck Institute for Biological Cybernetics, 72076 Tübingen, Germany [email protected] WWW home page: http://www.tuebingen.mpg.de/ carl
NetSurv & Data Viewer
NetSurv & Data Viewer Prototype space-time analysis and visualization software from TerraSeer Dunrie Greiling, TerraSeer Inc. TerraSeer Software sales BoundarySeer for boundary detection and analysis ClusterSeer
CURRICULUM VITAE. 1 Higher Education. 2 Employment DANI GAMERMAN
CURRICULUM VITAE DANI GAMERMAN Date of birth: 30/10/1957 Nationality: Brazilian Postal address: Instituto de Matemática - UFRJ Caixa Postal 68530, 21945-970 Rio de Janeiro, RJ, Brazil email address: [email protected]
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
Validation of Software for Bayesian Models Using Posterior Quantiles
Validation of Software for Bayesian Models Using Posterior Quantiles Samantha R. COOK, Andrew GELMAN, and Donald B. RUBIN This article presents a simulation-based method designed to establish the computational
Gaussian Conjugate Prior Cheat Sheet
Gaussian Conjugate Prior Cheat Sheet Tom SF Haines 1 Purpose This document contains notes on how to handle the multivariate Gaussian 1 in a Bayesian setting. It focuses on the conjugate prior, its Bayesian
Applying MCMC Methods to Multi-level Models submitted by William J Browne for the degree of PhD of the University of Bath 1998 COPYRIGHT Attention is drawn tothefactthatcopyright of this thesis rests with
Bayesian credibility methods for pricing non-life insurance on individual claims history
Bayesian credibility methods for pricing non-life insurance on individual claims history Daniel Eliasson Masteruppsats i matematisk statistik Master Thesis in Mathematical Statistics Masteruppsats 215:1
Estimation and comparison of multiple change-point models
Journal of Econometrics 86 (1998) 221 241 Estimation and comparison of multiple change-point models Siddhartha Chib* John M. Olin School of Business, Washington University, 1 Brookings Drive, Campus Box
Curriculum Vitae of Francesco Bartolucci
Curriculum Vitae of Francesco Bartolucci Department of Economics, Finance and Statistics University of Perugia Via A. Pascoli, 20 06123 Perugia (IT) email: [email protected] http://www.stat.unipg.it/bartolucci
Imputing Values to Missing Data
Imputing Values to Missing Data In federated data, between 30%-70% of the data points will have at least one missing attribute - data wastage if we ignore all records with a missing value Remaining data
Predicting the Present with Bayesian Structural Time Series
Predicting the Present with Bayesian Structural Time Series Steven L. Scott Hal Varian June 28, 2013 Abstract This article describes a system for short term forecasting based on an ensemble prediction
Master programme in Statistics
Master programme in Statistics Björn Holmquist 1 1 Department of Statistics Lund University Cramérsällskapets årskonferens, 2010-03-25 Master programme Vad är ett Master programme? Breddmaster vs Djupmaster
Measures of explained variance and pooling in multilevel models
Measures of explained variance and pooling in multilevel models Andrew Gelman, Columbia University Iain Pardoe, University of Oregon Contact: Iain Pardoe, Charles H. Lundquist College of Business, University
Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires. Exact sampling of Gibbs distributions with submodular energies
Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires Exact sampling of Gibbs distributions with submodular energies Marc Sigelle 1 and Jérôme Darbon 2 1 Institut TELECOM TELECOM ParisTech
A Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University [email protected]
Bayesian Machine Learning (ML): Modeling And Inference in Big Data Zhuhua Cai Google Rice University [email protected] 1 Syllabus Bayesian ML Concepts (Today) Bayesian ML on MapReduce (Next morning) Bayesian
Incorporating cost in Bayesian Variable Selection, with application to cost-effective measurement of quality of health care.
Incorporating cost in Bayesian Variable Selection, with application to cost-effective measurement of quality of health care University of Florida 10th Annual Winter Workshop: Bayesian Model Selection and
Nonparametric Prior Elicitation using the Roulette Method
Nonparametric Prior Elicitation using the Roulette Method Jeremy E. Oakley, Alireza Daneshkhah 2 and Anthony O Hagan School of Mathematics and Statistics, University of Sheffield, UK and 2 Department of
Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT
Validation of Software for Bayesian Models using Posterior Quantiles Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Abstract We present a simulation-based method designed to establish that software
Bayesian Variable Selection in Normal Regression Models
Institut für Angewandte Statistik Johannes Kepler Universität Linz Bayesian Variable Selection in Normal Regression Models Masterarbeit zur Erlangung des akademischen Grades Master der Statistik im Masterstudium
Analysis of Financial Time Series
Analysis of Financial Time Series Analysis of Financial Time Series Financial Econometrics RUEY S. TSAY University of Chicago A Wiley-Interscience Publication JOHN WILEY & SONS, INC. This book is printed
Section 5. Stan for Big Data. Bob Carpenter. Columbia University
Section 5. Stan for Big Data Bob Carpenter Columbia University Part I Overview Scaling and Evaluation data size (bytes) 1e18 1e15 1e12 1e9 1e6 Big Model and Big Data approach state of the art big model
Bayesian Phylogeny and Measures of Branch Support
Bayesian Phylogeny and Measures of Branch Support Bayesian Statistics Imagine we have a bag containing 100 dice of which we know that 90 are fair and 10 are biased. The
Monte Carlo Methods and Models in Finance and Insurance
Chapman & Hall/CRC FINANCIAL MATHEMATICS SERIES Monte Carlo Methods and Models in Finance and Insurance Ralf Korn Elke Korn Gerald Kroisandt f r oc) CRC Press \ V^ J Taylor & Francis Croup ^^"^ Boca Raton
MODELLING CRITICAL ILLNESS INSURANCE DATA
MODELLING CRITICAL ILLNESS INSURANCE DATA Howard Waters Joint work with: Erengul Dodd (Ozkok), George Streftaris, David Wilkie University of Piraeus, October 2014 1 Plan: 1. Critical Illness Insurance
Inference on Phase-type Models via MCMC
Inference on Phase-type Models via MCMC with application to networks of repairable redundant systems Louis JM Aslett and Simon P Wilson Trinity College Dublin 28 th June 202 Toy Example : Redundant Repairable
Package CoImp. February 19, 2015
Title Copula based imputation method Date 2014-03-01 Version 0.2-3 Package CoImp February 19, 2015 Author Francesca Marta Lilja Di Lascio, Simone Giannerini Depends R (>= 2.15.2), methods, copula Imports
Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle
Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases Andreas Züfle Geo Spatial Data Huge flood of geo spatial data Modern technology New user mentality Great research potential
Model Selection and Claim Frequency for Workers Compensation Insurance
Model Selection and Claim Frequency for Workers Compensation Insurance Jisheng Cui, David Pitt and Guoqi Qian Abstract We consider a set of workers compensation insurance claim data where the aggregate
Traffic Driven Analysis of Cellular Data Networks
Traffic Driven Analysis of Cellular Data Networks Samir R. Das Computer Science Department Stony Brook University Joint work with Utpal Paul, Luis Ortiz (Stony Brook U), Milind Buddhikot, Anand Prabhu
Dirichlet Process Gaussian Mixture Models: Choice of the Base Distribution
Görür D, Rasmussen CE. Dirichlet process Gaussian mixture models: Choice of the base distribution. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 5(4): 615 66 July 010/DOI 10.1007/s11390-010-1051-1 Dirichlet
Introduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH
Joint Stascal Meengs - Bayesian Stascal Science TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH AMITABHA BHAUMIK, DIPAK K. DEY AND NALINI RAVISHANKER Department of Stascs,
How To Solve A Save Problem With Time T
Bayesian Functional Data Analysis for Computer Model Validation Fei Liu Institute of Statistics and Decision Sciences Duke University Transition Workshop on Development, Assessment and Utilization of Complex
Monte Carlo-based statistical methods (MASM11/FMS091)
Monte Carlo-based statistical methods (MASM11/FMS091) Jimmy Olsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February 5, 2013 J. Olsson Monte Carlo-based
Psychology 209. Longitudinal Data Analysis and Bayesian Extensions Fall 2012
Instructor: Psychology 209 Longitudinal Data Analysis and Bayesian Extensions Fall 2012 Sarah Depaoli ([email protected]) Office Location: SSM 312A Office Phone: (209) 228-4549 (although email will
A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data
A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data Faming Liang University of Florida August 9, 2015 Abstract MCMC methods have proven to be a very powerful tool for analyzing
Sections 2.11 and 5.8
Sections 211 and 58 Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1/25 Gesell data Let X be the age in in months a child speaks his/her first word and
APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING
APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING Sulaimon Mutiu O. Department of Statistics & Mathematics Moshood Abiola Polytechnic, Abeokuta, Ogun State, Nigeria. Abstract
