Applied Spatial Statistics: Lecture 6 Multivariate Normal

Similar documents

Student Academic Learning Services Page 1 of 7. Statistics: The Null and Alternate Hypotheses. A Student Academic Learning Services Guide

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

Chapter 6: Continuous Probability Distributions GBS221, Class March 25, 2013 Notes Compiled by Nicolas C. Rouse, Instructor, Phoenix College

Some Statistical Procedures and Functions with Excel

Gauss Law. AP Physics C

Guide to Stata Econ B003 Applied Economics

Data Analytics for Campaigns Assignment 1: Jan 6 th, 2015 Due: Jan 13 th, 2015

FINRA Regulation Filing Application Batch Submissions

Chapter 3: Cluster Analysis

Michigan Transfer Agreement (MTA) Frequently Asked Questions for College Personnel

Spatial basis risk and feasibility of index based crop insurance in Canada: A spatial panel data econometric approach

Operational Amplifier Circuits Comparators and Positive Feedback

TRAINING GUIDE. Crystal Reports for Work

10DS AND APPLICATIONS OF TIME SERIES ANALYSIS PART II: LINEAR STOCHASTIC MODELS TECHNICAL REPORT NO. 12 T. W. ANDERSON AND N. D.

Reading Pie Charts Introduction to Bar Graphs Reading Bar Graphs Introduction to Data in Tables Reading Data in Tables

STIOffice Integration Installation, FAQ and Troubleshooting

1.3. The Mean Temperature Difference

Support Vector and Kernel Machines

Times Table Activities: Multiplication

Sinusoidal Steady State Response of Linear Circuits. The circuit shown on Figure 1 is driven by a sinusoidal voltage source v s (t) of the form

COE: Hybrid Course Request for Proposals. The goals of the College of Education Hybrid Course Funding Program are:

PBS TeacherLine Course Syllabus

McAfee Enterprise Security Manager. Data Source Configuration Guide. Infoblox NIOS. Data Source: September 2, Infoblox NIOS Page 1 of 8

BRILL s Editorial Manager (EM) Manual for Authors Table of Contents

SUB CENTRAL - SEMS/SFE TO DO LIST

Access EEC s Web Applications... 2 View Messages from EEC... 3 Sign In as a Returning User... 3

Meeting Minutes for January 17, 2013

LogMeIn Rescue Web SSO via SAML 2.0 Configuration Guide

GED MATH STUDY GUIDE. Last revision July 15, 2011

Mobile Device Manager Admin Guide. Reports and Alerts

MATHEMATICS FOR ENGINEERING TRIGONOMETRY TUTORIAL 1 TRIGONOMETRIC RATIOS, TRIGONOMETRIC TECHNIQUES AND GRAPHICAL METHODS

learndirect Test Information Guide The National Test in Adult Numeracy

PENNSYLVANIA SURPLUS LINES ASSOCIATION Electronic Filing System (EFS) Frequently Asked Questions and Answers

Statistical Analysis (1-way ANOVA)

Trends and Considerations in Currency Recycle Devices. What is a Currency Recycle Device? November 2003

Electrochemical cells

FW Taylor: Management is the art of getting things done and then seeing that it is done in best possible way

SBA 504 Financing. Long Term Asset Financing Made Possible. Glen Heller, Relationship Manager Nancy Sheridan, Specialty Finance Officer

Connecticut State Department of Education School Health Services Information Survey

Disk Redundancy (RAID)

Configuring and Monitoring SysLog Servers

LeadStreet Broker Guide

Welcome to CNIPS Training: CACFP Claim Entry

SBClient and Microsoft Windows Terminal Server (Including Citrix Server)

User Guide Version 3.9

Responsive Design Fundamentals Chapter 1: Chapter 2: name content

How To Change The University'S Budget

VAPOUR PRESSURE. Ranges of Vapour Pressures for some Important Classes of Organic Compounds at 25 o C

A Walk on the Human Performance Side Part I

Nonparametric Estimation: Smoothing and Data Visualization

Readme File. Purpose. What is Translation Manager 9.3.1? Hyperion Translation Manager Release Readme

Security in Business and Applications. Madison Hajeb Stefan Hurst Benjamin Von Slade

CLIENT PORTAL GUIDE SUMMARY

The ad hoc reporting feature provides a user the ability to generate reports on many of the data items contained in the categories.

Experiment 1: Freezing Point Depression

In addition to assisting with the disaster planning process, it is hoped this document will also::

edoc Lite Recruitment Guidelines

WinFlex Web Single Sign-On (EbixLife XML Format) Version: 1.5

Firewall/Proxy Server Settings to Access Hosted Environment. For Access Control Method (also known as access lists and usually used on routers)

Exercise 5 Server Configuration, Web and FTP Instructions and preparatory questions Administration of Computer Systems, Fall 2008

Instructions for Certify

Chris Chiron, Interim Senior Director, Employee & Management Relations Jessica Moore, Senior Director, Classification & Compensation

Arjun V. Bala Page 25

CSE 231 Fall 2015 Computer Project #4

Regions File Transmission

How to deploy IVE Active-Active and Active-Passive clusters

1.571 Structural Analysis and Control Prof. Connor Section 1: Straight Members with Planar Loading. b Displacements (u, vβ),

Outlook Plug-In. Send Conference Invites from Outlook. Downloading Outlook Plug-In CONFERENCING & COLLABORATION RESERVATIONLESS-PLUS

Network Theorems - Alternating Current examples - J. R. Lucas

General. LC Portal Points of Contact. Payments and Financials: Patricia Stack LC Paperwork: Sarah Evans

Welcome to Microsoft Access Basics Tutorial

A Critical Analysis of Counterparty Credit Risk and CVA in a Basel III World. Jon Gregory, Partner

Research Findings from the West Virginia Virtual School Spanish Program

esupport Quick Start Guide

Computer Science Undergraduate Scholarship

Honors Geometry A. Semester Exam Review Answers

Level 2 Training Module Course Guide

Transcription:

Applied Spatial Statistics: Lecture 6 Multivariate Nrmal Duglas Nychka, Natinal Center fr Atmspheric Research Supprted by the Natinal Science Fundatin Bulder, Spring 2013

Outline additive mdel Multivariate nrmal by a linear transfrmatin Likelihd and lg Likelihd clsed frms fr maximizing ver ρ and d cnditinal density functins applicatin t the spatial statistics prblem. 2

Estimating a curve r surface. The additive statistical mdel: Given n pairs f bservatins (x i, y i ), i = 1,..., n y = f + ɛ ɛ i s are randm errrs E[ɛ] = 0 COV( ɛ) = σ 2 I and f is an unknwn smth functin: f(x) = p j=1 φ j (x)d j + g(x) {φ j } knwn basis functins based n spatial lcatins e.g. plynimials r tpgraphy. d fixed cefficients but nt knwn.. g(x) a Gaussian prcess expected value f 0 and a cvariance functin ρk θ (x 1, x 2 ) frm f k is knwn expect fr sme parameter values: ρ and θ that are unknwn. 3

A minimum set f parameters: fixed effect linear parameters d. nugget, errr variance σ 2 sill, prcess variance ρ ther cvariance parameters such as range, scale θ The gal is t identify the statistical likelihd s we can estimate these (and pssibly thers). Flklre: d is easy just regressin λ = σ 2 /ρ and σ 2 are mre difficult with n clsed frmulas ρ mre sensitive t data and cvariance shape. θ This is hard t estimate with small sample sizes. 4

Multivariate nrmal cnstructin u: a vectr f N independent N(0,1)s u = u 1, u 2,..., u N independent u k Nrmal (0, 1). jint prbability density functin: h(u) = 1 (2π) N/2e (1/2) Nk=1 u 2 k = 1 u 2 (2π) N/2e ut Def 1 v = Au + µ is multivariate nrmal with expectatin µ and cvariance Σ = AA T Def 2 prbability density functin fr v is h(v) = 1 (2π) N/2e (v µ) T Σ 1 (v µ) 2 Σ 1/2 BTW This result is if and nly if Def 1 Def 2 and Def 1 Def2 5

Why des this wrk? In general if T (v) = u is any transfrmatin f a randm vectr then the prbability density functin fr v is h(t (v))j(v) J is the Jacbian term, the determinant (indicated by. ) f the matrix f partial derivatives: J(v) = T (u 1 ) T (u 1 ) v 1 v... 2 T (u 2 ) T (u 2 ) v 1 v... 2. T (u N ) T (u N ) v 1 v... 2 T (u 1 ) v N T (u 2 ) v N T (u 2 ) v N Stats students suffer thrugh devilishly hard change f variables prblems fr their qualifiers where T is cmplex and nnlinear When T is linear: T (v) = A 1 (v µ) = u the Jacbian is filled with cnstant elements and the determinant is simplified by sme identities. (1) 6

Multivariate nrmal density in R A 2-d example but reviewing matrix/vectr techniques in R. mu<- matrix( c(.1, -.5), ncl=1, nrw=2) Sigma<- rbind( c(1,.8), c(.8,1) ) v.grid<- make.surface.grid( list( x= seq( -3,3,,100), y = seq( -3,3,,100) )) z<- rep( NA, 100*100) fr( k in 1:(100*100)){ v<- v.grid[k,] # a vectr with 2 rws z[k] <- ( 1/ (2*pi)^( 2/2) ) * exp( - t(v - mu) %*% slve( Sigma) %*% (v - mu)/2 ) * det( Sigma)^(-2/2) } surface( as.surface( v.grid, z)) 7

the density surface y 3 2 1 0 1 2 3 0.4 0.3 0.2 0.1 0.0 3 2 1 0 1 2 3 x 8

Likelihd fr cvariance parameters Cmbine the randm surface with the measurement errr cmpnent y MN(Xd, ρk θ + σ 2 ) X the fixed part f the mdel K cvariance matrix fr the bservatins based n cvariance functin : K i,j = k θ (x i, x j ) e.g. K i,j = ρe x i x j /θ fr the expnential prbability density functin fr y h(y) = 1 Xd)T exp (y (2π) N/2 ( ρkθ + σ 2 I ) 1 (y Xd) ρk θ + σ 2 I 1/2 2 9

lg Likelihd lgl(y, ρ, σ, θ, d) = (N/2) lg(2π) (y Xd) T ( ρk θ + σ 2 I ) 1 (y Xd) 2 (1/2) lg ρk θ + σ 2 I First maximize ver d clsed frm expressin: ˆd generalized least squares estimatr. Given ˆd maximize ver the ther parameters keeping in mind fr every new set f parameter values ˆd needs t be recmputed. 10

simplificatins First maximize ver d clsed frm expressin: ˆd generalized least squares estimatr. Given ˆd maximize ver the ther parameters keeping in mind fr every new set f parameter values ˆd needs t be recmputed. It is als pssible t maximize ver the ρ parameter in clsed frm alng with d. (1/ρ) lgl(y, ρ, λ, θ, d) = (N/2) lg(2π) [ (y Xd) T (K θ + λi) 1 (y Xd) (1/2) lg (K θ + λi) (N/2) lg(ρ) 2 ] Fr fixed λ and θ easy t shw ˆρ = (1/N) [ (y Xˆd) T (K θ + λi) 1 (y Xˆd) ] 11

The prfiled likelihd Substituting the prfile MLEs in fr d and ρ leaves lgl(y, ˆρ, λ, θ, ˆd) = (N/2) lg(2π) N/2 (1/2) lg K θ + λi (N/2) lg(ˆρ) Keep in mind that ˆρ and ˆd are implicitly functins f the remaining parameters. 12

Even when we knw the parameters we still have t d spatial predictin... Big Idea Find the cnditinal distributin f the spatial field given the bservatins. The best estimate is the cnditinal mean, uncertainty the cnditinal variance. We will get the same answer as Kriging! 13

Cnditinal distributins f(x, y) the jint pdf fr (X, Y )and suppse that g(x) is the pdf just fr X. f(x, y) f(y x) = g(x) The distributin f Y given the value fr X. Here X = x is bserved ( fixed) and we have a distributin fr Y. Nte that the cnditinal density is prprtinal t the jint but with the cnditining variable held fixed. A useful prperty f Multivariate nrmals is that the cnditinal distributins are als nrmal. 14

The tw variable nrmal Fr the bivariate nrmal v 1, v 2 with expected value µ 1, µ 2, variances σ 2 1, σ2 2 crrelatin α. f(v 2 v 1 ) is Nrmal mean : µ 2 + ασ 2 /σ 1 (v 1 µ 1 ), variance: σ 2 2 (1 α2 ) (with sme simplificatin) 15

Sme useful ntatin fr pdfs: [Y ] the pdf fr the randm variable Y [X, Y ] pdf fr jint distributin f X and Y [Y X] cnditinal pdf fr Y given X S the frmula fr the cnditinal is: [Y X] = [X, Y ]/[X] Als nte that [X, Y ] = [Y X][X] 16

Bayes Therem Bayes Therem gives a way f inverting the cnditinal infrmatin. In bracket ntatin it is just [Y X] = [X Y ][Y ] [X] The prf fllws by definitins: [Y X] = [X, Y ] [X] = [X Y ][Y ] [X] Nte that [Y X] is simply prprtinal t the jint density where the nrmalizatin depends n the values f X. (But in many cases the nrmalizatin is difficult t find.) Will cme back t this in sme later lectures 17

Bulder/Fraser data 18 63 64 65 66 67 68 69 57 58 59 60 61 62 63 Bulder Bulder Fraser Bulder Fraser Z

Fraser jint pdf Slicing the surface 57 58 59 60 61 62 63 Fraser Bulder 63 64 65 66 67 68 69 Z Bulder Fraser 19

Cnditinal densities fr Bulder/Fraser (Y is Fraser temps and X is Bulder) Frequency 0 2 4 6 8 10 [Y X=64.5] [Y X=67.5] [Y] 56 58 60 62 64 Degrees F 20

Cnditinal nrmal general case Divide v int tw parts (e.g. what we bserve and what we want t predict.) v = ( v1 v 2 ) µ = ( µ1 µ 2 ) Σ = ( Σ11 Σ 12 Σ 21 Σ 22 ) Σ is divided up s cvariance fr v 1 is Σ 11, cvariance fr v 2 is Σ 22 Cnditinal frmula: [v 2 v 1 ] = N(µ 2 + Σ 2,1 Σ 1 1,1 (v 1 µ 1 ), distributin f v 2 given v 1 Σ 2,2 Σ 2,1 Σ 1 1,1 Σ 1,2) cnditinal mean depends n v 1 cnditinal cvariance des nt depend n v 1 21

Our applicatin is v 1 = y (the Data), µ 1 = Xd and v 2 = {g(x 1 ),...g(x N )} (pints t predict) µ 2 = 0 spatial field values where we wuld like t predict. These can be either at the bservatin lcatins r at ther pints. Srting thrugh pieces f Σ Σ 1,1 = cv(y, y) = ρk + σ 2 I i, j element f Σ 2,1 = cv(y, g) is ρk(x i, x k ) j, k element f Σ 2,2 = cv(g, g) is ρk(x j, x k ) Substituting these int the cnditinal mean and cvariance frmulas gives us the Kriging estimatr 22

Summary Identify a fixed part f the spatial mdel and the cvariance. Estimate the parameters by maximum likelihd pssibly prfiling t make cmputatins easier Find cnditinal mean fr g(x ) using MLE fr parameters Find predictin fr fixed part f mdel e.g. j φ(x ) ˆd j Add tw tgether fr spatial predictin at x Use Kriging cvariance as the uncertainty includes uncertainty fr ˆd 23

Clrad minimum MAM temps. Expnential cvariance search ver λ and θ lg Prfile Likelihd with 95% cnfidence regin (grey) 355 lg theta 1.5 1.0 0.5 0.0 0.5 1.0 360 365 370 375 3 2 1 0 1 lg lambda MLEs: θ = 0.867, λ = 1.111... and frm prfiling ρ = 0.296, σ = 1.019 24

Spatial estimate fr Lyns, CO (See R script t find the d and c cefficients explicitly) Recall ˆd fund by GLS and ĉ fund by Kriging GLS residuals. x.star<- rbind( c(-105.2708,40.2247) ) # Lyns, CO data(rmelevatin) elev.star<- interp.surface(rmelevatin, x.star) X.star<- matrix(c( 1, x.star, elev.star), nrw=1) # fixed.part<- X.star%*%d.MLE ghat.x.star<- t( exp( -rdist( xhw1,x.star)/theta.mle))%*%c.mle #NOTE: lambda frmat s left ut rh in frmula Lyns.predictin <- fixed.part + ghat.x.star Estimate: 0.283 degrees C 25

Using fields as a black bx bj<- mkrig(x = xhw1, y = yhw1, lambda = lambda.mle, Z = elevhw1, theta = theta.mle) predict( bj, x= x.star, Z= elev.star) 26