Gaussian Processes to Speed up Hamiltonian Monte Carlo



Similar documents
Tutorial on Markov Chain Monte Carlo

Introduction to Markov Chain Monte Carlo

Computational Statistics for Big Data

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

MCMC Using Hamiltonian Dynamics

Inference on Phase-type Models via MCMC

11. Time series and dynamic linear models

Dirichlet Processes A gentle tutorial

Probabilistic Models for Big Data. Alex Davies and Roger Frigola University of Cambridge 13th February 2014

Bayesian Statistics: Indian Buffet Process

Introduction to Monte Carlo. Astro 542 Princeton University Shirley Ho

Bayesian Statistics in One Hour. Patrick Lam

Dealing with large datasets

A Bayesian Model to Enhance Domestic Energy Consumption Forecast

Parameter estimation for nonlinear models: Numerical approaches to solving the inverse problem. Lecture 12 04/08/2008. Sven Zenker

The Chinese Restaurant Process

STA 4273H: Statistical Machine Learning

Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering

Big Data need Big Model 1/44

HT2015: SC4 Statistical Data Mining and Machine Learning

MCMC-Based Assessment of the Error Characteristics of a Surface-Based Combined Radar - Passive Microwave Cloud Property Retrieval

Imperfect Debugging in Software Reliability

Sampling for Bayesian computation with large datasets

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT

Web-based Supplementary Materials for Bayesian Effect Estimation. Accounting for Adjustment Uncertainty by Chi Wang, Giovanni

Optimizing Prediction with Hierarchical Models: Bayesian Clustering

Probabilistic Fitting

Austerity in MCMC Land: Cutting the Metropolis-Hastings Budget

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

Model-based Synthesis. Tony O Hagan

A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data

THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Handling attrition and non-response in longitudinal data

Note on the EM Algorithm in Linear Regression Model

Applications of R Software in Bayesian Data Analysis

DURATION ANALYSIS OF FLEET DYNAMICS

BayesX - Software for Bayesian Inference in Structured Additive Regression

Basic Bayesian Methods

Parametric fractional imputation for missing data analysis

Dirichlet Process Gaussian Mixture Models: Choice of the Base Distribution

Statistical Machine Learning from Data

Inference of Probability Distributions for Trust and Security applications

How To Solve A Sequential Mca Problem

Statistics Graduate Courses

Markov Chain Monte Carlo Simulation Made Simple

Time Series Analysis

Least Squares Estimation

Monte Carlo-based statistical methods (MASM11/FMS091)

Section 5. Stan for Big Data. Bob Carpenter. Columbia University

Monte Carlo Simulation

Parallelization Strategies for Multicore Data Analysis

Pattern Analysis. Logistic Regression. 12. Mai Joachim Hornegger. Chair of Pattern Recognition Erlangen University

Package mcmcse. March 25, 2016

Package zic. September 4, 2015

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Monte Carlo-based statistical methods (MASM11/FMS091)

Model Calibration with Open Source Software: R and Friends. Dr. Heiko Frings Mathematical Risk Consulting

Bayesian inference for population prediction of individuals without health insurance in Florida

Bayesian Hidden Markov Models for Alcoholism Treatment Tria

Big Data, Statistics, and the Internet

Bayesian Estimation Supersedes the t-test

Markov Chain Monte Carlo and Numerical Differential Equations

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

STAT3016 Introduction to Bayesian Data Analysis

From CFD to computational finance (and back again?)

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle

Combining Visual and Auditory Data Exploration for finding structure in high-dimensional data

Basics of Statistical Machine Learning

Contrôle dynamique de méthodes d approximation

An Introduction to Using WinBUGS for Cost-Effectiveness Analyses in Health Economics

General Sampling Methods

The Kelly criterion for spread bets

An Empirical Analysis of Sponsored Search Performance in Search Engine Advertising. Anindya Ghose Sha Yang

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

Probability for Estimation (review)

Gaussian Conjugate Prior Cheat Sheet

Christfried Webers. Canberra February June 2015

Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression

Extreme Value Modeling for Detection and Attribution of Climate Extremes

Markov Chain Monte Carlo and Applied Bayesian Statistics: a short course Chris Holmes Professor of Biostatistics Oxford Centre for Gene Function

Using SAS PROC MCMC to Estimate and Evaluate Item Response Theory Models

Introduction to Bayesian Analysis Using SAS R Software

The Monte Carlo Framework, Examples from Finance and Generating Correlated Random Variables

Echantillonnage exact de distributions de Gibbs d énergies sous-modulaires. Exact sampling of Gibbs distributions with submodular energies

Monte Carlo Methods in Finance

Transcription:

Gaussian Processes to Speed up Hamiltonian Monte Carlo Matthieu Lê Murray, Iain http://videolectures.net/mlss09uk_murray_mcmc/ Rasmussen, Carl Edward. "Gaussian processes to speed up hybrid Monte Carlo for expensive Bayesian integrals." Bayesian Statistics 7: Proceedings of the 7th Valencia International Meeting. Oxford University Press, 2003. Neal, Radford M (2011). " MCMC Using Hamiltonian Dynamics. " In Steve Brooks, Andrew Gelman, Galin L. Jones, and Xiao-Li Meng. Handbook of Markov Chain Monte Carlo. Chapman and Hall/CRC. Journal Club 11/04/14 1

MCMC Monte Carlo : Rejection sampling, Importance sampling, MCMC : Markov Chain Monte Carlo Sampling technique to estimate a probability distribution Draw samples from complex probability distributions In general : f x P x dx 1 S S s=1 f x s, x (s) ~P(x) Journal Club 11/04/14 2

Objective Bayesian Inference : θ : Model parameter x i : Observations P θ : prior on the model parameters P x 1 x n θ : likelyhood, potentially expensive to evaluate P θ x 1 x n = P x 1 x n θ P θ P x 1 x n θ P θ dθ P θ x 1 x n = 1 Z P x 1 x n θ P θ MCMC : sample P θ x 1 x n without the knowledge of Z Journal Club 11/04/14 3

Metropolis-Hastings We want to draw samples from P α P 1. Propose new θ from the transition function Q(θ, θ) (e.g. N θ, σ 2 ). The transition from one parameter to another is a Markov Chain 2. Accept the new parameter θ with a probability min (1, P θ Q(θ,θ ) P θ Q(θ,θ) ) Q must be chosen to fulfill some technical requirements. Samples are not independent. Sample N 0,1 ) Journal Club 11/04/14 4

Metropolis-Hastings Metropolis-Hastings: 1000 samples, step 0.1 Rejection rate: 44% Problem : The proposed samples come from a Gaussian. There is possibly an important rejection rate. Journal Club 11/04/14 5

Hamiltonian Monte Carlo Same idea as Metropolis-Hastings BUT the proposed samples now come from the Hamiltonian dynamics : H = E pot + E kin E pot = log P θ, E kin = p2 2m θ t = E kin p = p m p t = E pot θ θ θ p p Journal Club 11/04/14 6

Hamiltonian Monte Carlo Algorithm : Sample p according to its known distribution Run the Hamiltonian dynamics during a time T Accept the new sample with probability : min (1, exp E pot + E pot exp ( E kin + E kin )) The Energy is conserved so the acceptance probability should theoretically be 1. Because of the numerical precision, we need the Metropolis-Hastings type decision in the end. Journal Club 11/04/14 7

Hamiltonian Monte Carlo Gibbs Sampling Metropolis-Hastings Hamiltonian Monte Carlo Journal Club 11/04/14

Hamiltonian Monte Carlo Advantage : The Hamiltonian stays (approximately) constant during the dynamic, hence lower rejection rate! 1000 samples, L = 200, ε =0,01 Rejection rate = 0% Problem : Computing the Hamiltonian dynamic requires computing the model partial derivatives, high number of simulation evaluation! Neal, Radford M (2011). " MCMC Using Hamiltonian Dynamics. " In Steve Brooks, Andrew Gelman, Galin L. Jones, and Xiao-Li Meng. Handbook of Markov Chain Monte Carlo. Chapman and Hall/CRC. Journal Club 11/04/14 9

Gaussian Process HMC Same algorithm as HMC BUT the Hamiltonian dynamic is computed using Gaussian process simulating E pot p t = E pot θ Gaussian process = distribution over smooth function to approximate E pot : P E pot θ ~N 0, Σ, Σ pq = ω 0 exp( 1 2 D d=1 x p q 2 2 d x d /ωd ) Journal Club 11/04/14 10

Gaussian Process HMC Once the Gaussian process is defined with a covariance matrix, we can predict new values : P E pot θ, E pot, θ ~N μ, σ 2, If the Gaussian process is good, μ(θ ) target density Algorithm : 1. Initialization : Evaluate the target density at D random points to define the Gaussian process. 2. Exploratory phase : HMC with E pot = μ σ : evaluation of points with high target value and high uncertainty. Evaluate the real target density at the end of each iteration. 3. Sampling phase : HMC with E pot = μ. Journal Club 11/04/14 11

Gaussian Process HMC Rasmussen, Carl Edward. "Gaussian processes to speed up hybrid Monte Carlo for expensive Bayesian integrals." Bayesian Statistics 7: Proceedings of the 7th Valencia International Meeting. Oxford University Press, 2003. Journal Club 11/04/14 12

Conclusion Metropolis-Hastings : few model evaluation per iteration but important rejection rate Hamiltonian Monte Carlo : a lot of model evaluation per iteration but low rejection rate GPHMC : few model evaluation per iteration and low rejection rate BUT : Initialization requires model evaluations to define a good Gaussian process BUT : Exploratory phase requires one model evaluation per iteration Journal Club 11/04/14 13