Spatial Interpolation
|
|
|
- Melina Hill
- 10 years ago
- Views:
Transcription
1 Spatial Interpolation Methods for extending information on spatial data fields John Lewis
2
3 The type of Spatial Interpolation depends on: what type of surface model you want to produce what data are available point, line or area continuous or discrete
4 Interpolation techniques are classed as: Global - surface model considers all the points Local - uses only values in a prescribed nearby neighborhood Exact - retains the original values of the sample on the resulting surface Inexact - assigns new values to known data points Deteministic provides no indication of the extent of possible error Stochastic methods provide probabilistic estimate
5
6 General list of techniques (points) trend surface - a global interpolator which calculates a best overall surface fit IDW - inverse distance weighting loess local weighted smoothing regression (weighted quadratic least squares regression) spline - mathematical piecewise curve fitting Kriging - using geostatistical techniques
7 Other procedures Thiessen or Voronoi tessellation - area divided into regions around the sample data points where every pixel or area in the region is assigned to the data point which is the closest TIN - sample data points become vertices of a set of triangular facets covering the study area
8
9
10
11 Minimum angle triangles, bounded by data points, are used in the linear interpolation algorithm
12
13 (from David Rossiter)
14 Properties: stochastic, inexact, global (from David Rossiter)
15
16 (from David Rossiter)
17 Trend Surface Topo
18
19 Trend surface examples
20 The first-order polynomial of the gravity data set shows the strong east-to-west gradient. Once the background gradient is removed, the local anomalies are displayed more clearly.
21 Trend surface in gstat gstat treats this as a special case of universal kriging with no spatial dependence (i.e. NULL model) ## just to degree=2:this example data(topo) coordinates(topo) <- ~ x+y x=seq(0,6.3, by=0.1);y=seq(0,6.3, by=0.1) topo.grid <- expand.grid(x=x,y=y) gridded(topo.grid) <- ~x+y # 1st-order trend surface x <- gstat(formula=topo$z ~ 1, data=topo, degree=1) kt <- predict(x, newdata=topo.grid) spplot(kt[1], contour=true,main="topo trend surface (1)-gstat") # 2nd order trend surface x <- gstat(formula=topo$z ~ 1, data=topo, degree=2) kt2 <- predict(x, newdata=topo.grid) p1 <- spplot( kt2[1], contour=true, main="topo trend surface (2)gstat",sp.layout=list(list("sp.points", topo@coords, pch=19, col=2))) # prediction error from OLS regression p2 <- spplot(kt2[2], contour=true,sp.layout=list(list("sp.points", topo@coords, pch=19, col=2)),main="prediction error from OLS regression") print(p1, position = c(0,.5,1,1),more=t) print(p2, position = c(0,0,1,.5))
22
23
24 Some Problems with Trend Surfaces 1. Extreme simplicity in form of a polynomial surface as compared to most natural surfaces. 2. Unfortunate tendency to accelerate without limit to higher or lower values in areas where there are no control points, such as along the edges of maps. 3. Computational/statistical difficulties may be encountered if a very high degree polynomial trend surface is fitted. 4. Has problems with outliers
25
26
27
28
29 Inverse distance weighted interpolation (IDW) This is one of the simplest and most readily available methods. It is based on an assumption that the value at an unsampled point can be approximated as a weighted average of values at points within a certain cut-off distance, or from a given number m of the closest points (typically 10 to 30). Weights are usually inversely proportional to a power of distance The method often does not reproduce the local shape implied by data and produces local extrema at the data points
30
31 Inverse Distance Weighting Substitute (di) actual values Zp estimated n n ( Z / d ) / (1 / d ) i i 1 i i i 1 where di = {(xi xp)2 + (yi yp)2}0.5
32
33
34 A PROBLEM
35 #IDW kid <- krige(topo$z ~ 1, topo, topo.grid, NULL) spplot(kid["var1.pred"], main = "TopIDW",contour=TRUE, sp.layout=list(list("sp.points", topo@coords, pch=19, col=2)))
36 Local Polynomial Regression Fitting (LOESS) Fitting is done locally. The fit is made using points in a neighbourhood of x, weighted by their distance from x. The size of the neighbourhood is controlled by α (set by span parameter and controls the degree of smoothing). The neighbourhood points (α < 1) have tricubic weighting (proportional to (1 (dist/maxdist)^3)^3). For α > 1, all points are used, with the maximum distance assumed to be α^(1/p) times the actual maximum distance for p explanatory variables.
37
38
39 Spline Regression Spline regressions are regression line segments that are joined together at special join points known as spline knots. By definition, there are no abrupt breaks in a spline line. However, at their knots, splines may suddenly change direction (i.e., slope), or, for curved lines, there may be a more subtle, but just as sudden, change in the rate at which the slope is changing. Splines are used instead of quadratic, cubic, or higher-order polynomial regression curves because splines are more flexible and are less prone to multicollinearity, which is the bane of polynomial regression. You can add a spline to whatever model, so your dependent variable, can be continuous, bounded, discrete, categorical, or whatever else you can or cannot think of. A spline is added because you think that the effect of a variable is non-linear, so it makes only sense when the explanatory variable is continuous. (Lawrence C. Marsh) B-splines uses a piecewise polynomial to provide a series of patches resulting in a surface that has continuous first and second derivatives ensures continuity in: elevation (zero-order continuity) - surface has no cliffs slope (first-order continuity) - slopes do not change abruptly, there are no kinks in contours curvature (second order continuity) - minimum curvature is achieved produces a continuous surface with minimum curvature
40 SPLINE 15 points
41 Bicubic spline interpolation of a bivariate function, z(x,y), is made on a rectangular grid in the x-y plane. the interpolating function is a piecewise function composed of a set of bicubic (bivariate thirddegree) polynomials, each applicable to a rectangle of the input grid in the x-y plane. Each polynomial is determined locally - with the piecewise segments joined at knots and with various degrees of smoothness.
42 akima.spl <- with(topo, interp(x,y,z, xo=seq(0,6.3, length=100), yo=seq(0,6.3, length=100),linear=false)) image(akima.spl,asp=1,main = "topo smooth interp(* bicubic100x100)") contour(akima.spl,add=true) points(topo,pch=3)
43 Thin Plate Spline The thin plate spline is a two-dimensional variation of the cubic spline. It is the fundamental solution to the biharmonic equation, and has the kernel form U(r)= r^2 ln r. Given a set of data points, a weighted combination of thin plate splines centered about each data point gives the interpolation function that passes through the points exactly while minimizing the so-called "bending energy or tension. Bending energy is defined here as the integral over domain of the squares of the second derivatives: Regularization may be used to relax the requirement that the interpolated point pass through the data points exactly.
44 #Tps from "fields" for topo data(topo) setting up data for Tps x <- topo$x; y <- topo$y; z <- topo$z tp <- cbind (x,y); tpo <- list(x=tp,y=z) tout <- Tps(tpo$x,tpo$y) #thin plate spline function surface(tout) title("thin plate spline from fields(topo)")
45
46 Mult-level B-splines
47
48 Surface approximation from bivariate scattered data using multilevel B-splines require(mba) data(lidar)# points taken over forest area mba.int <- mba.surf(lidar, 300, 300, extend=true) $xyz.est ##Image plot image(mba.int, xaxs="r", yaxs="r",main="mba - LIDAR forest data") contour(mba.int,add=true) ##Perspective plot persp(mba.int, theta = 135, phi = 30, col = "green3", scale = FALSE, ltheta = -120, shade = 0.75, expand = 10, border = NA, box = FALSE)
49 Spline Interpolation Fits a minimum-curvature surface through input points Like a rubber sheet that is bent around the sample points Best for gently varying surfaces such as elevation Estimates can result in spurious ridges and valleys
50 Spatial Prediction-Kriging Model of Spatial Variability theory of regionalized variable Z(x) = μ(x) + k(x) + ε μ(x) is a deterministic function describing the structural component of Z k(x) denotes the stochastic, locally varying but spatially dependent residuals from μ(x)-the regionalized variable ε is a spatially independent Gaussian noise term with zero mean and variance σ2
51 Kriging Kriging is a collection of generalized linear regression techniques for minimizing an estimation variance defined from a prior model of covariance (Olea,1991) zˆ ( so ) n z(s ) i i 1 n i 1 i 1 i with
52 Kriging Principle obtain the best unbiased linear predictor takes into account the covariance structure as a function of distance Best Predictor unbiased: E[zp z]= 0 or no systematic error minimum variance of zp for any other combination of the observed values predict for a new locations (s) based on the covariance
53 Estimating the variogram as a measure of spatial covariance
54 Kriging TOPO
55 Is Kriging Better?? it provides the no. of points needed to compute the local average provides the size, orientation and shape of neighborhood from which the points are drawn indicates if there are better ways to estimate the interpolation weights its strengths are in the statistical quality of its predictions (e.g.unbiasedness) and in the ability to indicate some measure of the spatial distribution of uncertainty. It has been less successful for applications where local geometry and smoothness are the key issues and other methods prove to be competitive or even better
56 Spatial Simulation A stochastic technique of conditional simulation is used to generate alternative, equally probable realizations of a surface, reproducing both data and the estimated covariance. From such a set of statistical samples one can estimate a spatially-dependent picture of the uncertainty which is inherent in the data. Simulation is not an instant picture of reality but only a version of it among a myriad of other possibilities. There are two questions that are paramount when doing simulations: 1) How confident can I be of the results? 2) How many simulations should we generate?
57 (from David Rossiter)
58 (from David Rossiter)
59 Types of Simulation Procedures Simple Monte Carlo method simulate values for each point which has a unique and an equiprobable value. Since each cell is independent-the results are a stationary noise surface. z(x) = Pr(Z) where Z is a spatially independent, normally distributed probability distribution function with mean μ and variance σ2 Unconditional Simulation use the variogram to impart spatial information but the actual data points are not used. With the variogram parameters and Monte Carlo procedures the most likely outcome per cell are computed. z(x) = Pr(Z), γ(h)
60 (from David Rossiter)
61 (from Edzer Pebesma)
62 Steps in Simulation Modelling 1. Set up the usual equations for simple/ordinary kriging with an overall mean. 2. Select an unsampled data point at random. Compute kriging prediction and standard error using data from the data set in the locality of the point. 3. Draw a random value from the probability distribution defined by the prediction and standard error. Add this to the list of data points.
63 Steps in Simulation (continued) 4. Repeat steps 2-3 until all cells/points have been visited and the simulation of one realization is completed. 5. Repeat steps 1-4 until sufficient realizations have been created. 6. Compute surfaces for the mean values and standard deviation from all the realizations.
64
65
66
67
68 Conditional Simulation in gstat with use of raster require(raster) set.seed(234).. v = variogram(log(zinc)~1, meuse) v.m = fit.variogram(v, vgm(1, "Sph", 900, 1)) tsr < sim = krige(log(zinc)~1, meuse, meuse.grid, v.m, nsim = tsr, nmax = 20) s <- brick(sim) rs2 <- calc(s, fun=function(x){exp(x)}) #backtransform log data b1 <- stackapply(rs2, indices=c(rep(1,tsr)), fun=median, progress="window") plot(b1,col=rev(heat.colors(25)),main="simulation-median (Zn) n=1000") contour(b1, add=true) hist(b1,main=("simulation-median (Zn)")) b2 <- stackapply(rs2, indices=c(rep(1,tsr)), fun=mad, progress="window") plot(b2,col=rev(heat.colors(25)),main="simulation-mad (Zn) n=1000") contour(b2, add=true)
69
70
71
72 Model Verification Regression diagnostics Cross Validation AIC
73 AIC Akaike (1973, 1974) introduced a criterion for model adequacy, first for time-series models and then more generally. He relates how his secretary suggested he call it An Information Criterion, AIC.
74 AIC AIC has a very appealing simplicity: AIC = -2 log(maximized likelihood) + 2p where p is the number of estimated parameters. Choose the model with the smallest AIC (and perhaps retain all models within 2 of the minimum).
75 Cross-validation Leave-one-out CV The idea is that given a dataset of N points, we use our model-building procedure on each subset of size N-1, and predict the point we left out. Then the set of predictions can be summarized by some measure of prediction accuracy. Idea goes back at least as far as Mosteller & Wallace (1963).
76 Users/jtleek/Dropbox/Public/courseraPublic/week6/002crossValidation/index.html#1
77 V-fold cross-validation Divide the data into V sets, and amalgamate V - 1 of them, build a model and predict the result for the remaining set. Do this V times leaving a different set out each time. How big should V be? We want the model-building problem to be realistic, so want to leave out a small proportion. We don t want too much work. So usually V is 3 10.
78 file:///users/jtleek/dropbox/public/courserapublic/week6/002crossvalidation/index.html#1
79 (from Edzer Pebesma)
80
81
82
83
84
85 Cross Validation Statistics for TOPO # mean error, ideally 0: > (mpe <- mean(kcv$residual)) [1] > mpe/mean(topo$z)#standardize to the data mean [1] > # MSPE, ideally small > mean(kcv$residual^2) [1] > # Mean square normalized error, ideally close to 1 > mean(kcv$zscore^2) [1] > #RMSE > (rmse <- sqrt(mean(kcv$residual^2))) [1] > rmse/sd(topo$z) [1] > rmse/iqr(topo$z) [1] > # amount of variation (R^2) explained by the models: > 1-var(kcv$residual, na.rm=t)/var(topo$z) [1] > # correlation predicted and residual, ideally 0 > cor(kcv$observed - kcv$residual, kcv$residual) [1]
86 Comparison of TOPO model statistics loess Tps ME UK.cv nfold= RMSE RMSE/sd R^
87
88 ### 5-fold Cross Validation ################################################### v.fit <- vgm(.5, "Exp", 790, 1.7) coordinates(meuse) <- ~x+y cv155 <- krige.cv(log(cadmium)~1, meuse, v.fit, nfold=5) ################################################### bubble(cv155, "residual", main = "log(cadmium): 5-fold Residuals Cross Validation-Residuals",maxsize = 1.5) bubble(cv155, "zscore", main = "log(cadmium): 5-fold Residuals Cross Validation-Z-Scores",maxsize = 1.5)
89 file:///users/jtleek/dropbox/public/courserapublic/week6/002crossvalidation/index.html#1
90 # Validating Kriging Model by subsampling set.seed( ) ################################################### sel100 <- sample(1:155, 100) m.model <- meuse[sel100,] m.valid <- meuse[-sel100,] coordinates(m.model) <- ~ x+y coordinates(m.valid) <- ~ x+y v100.fit <- fit.variogram(variogram(log(cadmium)~1, m.model), vgm(.5, "Exp",800, 1.7)) m.valid.pr <- krige(log(cadmium)~1, m.model, m.valid, v100.fit) # predict at #m.valid sites resid.kr <- log(m.valid$cadmium)- m.valid.pr$var1.pred # residuals summary(resid.kr) resid.mean <- log(m.valid$cadmium) mean(log(m.valid$cadmium)) R2 <- 1 - sum(resid.kr^2)/sum(resid.mean^2) R^2 = 0.34
91
92 CROSS VALIDATION When in prediction mode (i.e., kriging), cross validating the model is one way to diagnose problems occurring with the model s fit. The purpose of checking the modeled variogram with cross validation is to expose obvious mistakes, and to view where the model has possible trouble predicting. It does not prove the model is right; it only shows where the model fails. Likewise, it is not recommended to confirm the correctness of different model types, e.g. spherical versus power models.
93
94 (from David Rossiter)
95 Voronoi Tesselation IDW Helena Mitasova:
96 Kriging Thin plate spline Helena Mitasova:
97 (from David Rossiter)
98 (from David Rossiter)
99 Choosing a Surface Interpolation Model Characteristics of the input sample data Precision of the sample point measurements Frequency and distribution of sample points relative to the scale of variation Surface shape and level of local variability The context in which the surface model will be used
100 Interpolate a surface. Different methods provided by R. #a) Trend surface fitting polynomial regression surface by least squares (surf.ls, trmat) {spatial package} & (krige) {gstat} #b) Trend surface fitting by generalized least squares (surf.gls, trmat) {spatial package} & {gstat} #c) Local polynomial regression fitting with error prediction (loess, predict.loess) {stats}) #d) Bilinear or Bicubic spline interpolation (interp) {akima package} #e) Multi-level B splines (mba.surf) {MBA package} #f) Thin plate spline (Tps) {fields package} #e) Kriging with ##variogram calculation (Variogram) {gstat} & (variog) {geor} ##kriging calculation (krige) {gstat} ; (krige.conv) {geor} ; (Krig) {fields package} ##Bayes kriging (krige.bayes) {geor} #f) other packages: ##variogram calculation (Variogram) {nlme package} ##covariance estimation with correlogram (correlogram, variogram) {MASS package} #g) and many others: ## {constrainedkriging} {GEOmap} {geospt} {intamap} {LatticeKrig} {ModelMap} {ramps} {RandomFields} {sgeostat} {spatstat} {spbayes} {SpatialExtremes} # Automatic interpolation package {automap package} # Areal interpolation for GIS data {AIGIS package} Which method to use, you ask? Why not just explore the gstat or geor package or for that matter any of the other packages doing spatial interpolation?
101 C:\geostat_quebec\spatInter_G13\Demoauto.mappackage&spatial interpolation.htm
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering Department of Industrial Engineering and Management Sciences Northwestern University September 15th, 2014
EXPLORING SPATIAL PATTERNS IN YOUR DATA
EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
5. Multiple regression
5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Introduction to Modeling Spatial Processes Using Geostatistical Analyst
Introduction to Modeling Spatial Processes Using Geostatistical Analyst Konstantin Krivoruchko, Ph.D. Software Development Lead, Geostatistics [email protected] Geostatistics is a set of models and
EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines
EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation
Finite Element Formulation for Plates - Handout 3 -
Finite Element Formulation for Plates - Handout 3 - Dr Fehmi Cirak (fc286@) Completed Version Definitions A plate is a three dimensional solid body with one of the plate dimensions much smaller than the
Package MBA. February 19, 2015. Index 7. Canopy LIDAR data
Version 0.0-8 Date 2014-4-28 Title Multilevel B-spline Approximation Package MBA February 19, 2015 Author Andrew O. Finley , Sudipto Banerjee Maintainer Andrew
Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection
Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
INTRODUCTION TO GEOSTATISTICS And VARIOGRAM ANALYSIS
INTRODUCTION TO GEOSTATISTICS And VARIOGRAM ANALYSIS C&PE 940, 17 October 2005 Geoff Bohling Assistant Scientist Kansas Geological Survey [email protected] 864-2093 Overheads and other resources available
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
A CRF-based approach to find stock price correlation with company-related Twitter sentiment
POLITECNICO DI MILANO Scuola di Ingegneria dell Informazione POLO TERRITORIALE DI COMO Master of Science in Computer Engineering A CRF-based approach to find stock price correlation with company-related
Annealing Techniques for Data Integration
Reservoir Modeling with GSLIB Annealing Techniques for Data Integration Discuss the Problem of Permeability Prediction Present Annealing Cosimulation More Details on Simulated Annealing Examples SASIM
Geography 4203 / 5203. GIS Modeling. Class (Block) 9: Variogram & Kriging
Geography 4203 / 5203 GIS Modeling Class (Block) 9: Variogram & Kriging Some Updates Today class + one proposal presentation Feb 22 Proposal Presentations Feb 25 Readings discussion (Interpolation) Last
AMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS
AMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS William V. Harper 1 and Isobel Clark 2 1 Otterbein College, United States of America 2 Alloa Business Centre, United Kingdom [email protected]
Chapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
Common Core Unit Summary Grades 6 to 8
Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations
POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.
Polynomial Regression POLYNOMIAL AND MULTIPLE REGRESSION Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. It is a form of linear regression
Machine Learning and Data Mining. Regression Problem. (adapted from) Prof. Alexander Ihler
Machine Learning and Data Mining Regression Problem (adapted from) Prof. Alexander Ihler Overview Regression Problem Definition and define parameters ϴ. Prediction using ϴ as parameters Measure the error
Finite Element Formulation for Beams - Handout 2 -
Finite Element Formulation for Beams - Handout 2 - Dr Fehmi Cirak (fc286@) Completed Version Review of Euler-Bernoulli Beam Physical beam model midline Beam domain in three-dimensions Midline, also called
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
CSE 167: Lecture 13: Bézier Curves. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2011
CSE 167: Introduction to Computer Graphics Lecture 13: Bézier Curves Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2011 Announcements Homework project #6 due Friday, Nov 18
Experiment #1, Analyze Data using Excel, Calculator and Graphs.
Physics 182 - Fall 2014 - Experiment #1 1 Experiment #1, Analyze Data using Excel, Calculator and Graphs. 1 Purpose (5 Points, Including Title. Points apply to your lab report.) Before we start measuring
Natural cubic splines
Natural cubic splines Arne Morten Kvarving Department of Mathematical Sciences Norwegian University of Science and Technology October 21 2008 Motivation We are given a large dataset, i.e. a function sampled
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R)
DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R) Ernst
Big Ideas in Mathematics
Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards
Spatial Data Analysis
14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines
Distributed computing of failure probabilities for structures in civil engineering
Distributed computing of failure probabilities for structures in civil engineering Andrés Wellmann Jelic, University of Bochum ([email protected]) Matthias Baitsch, University of Bochum ([email protected])
Section 1: Simple Linear Regression
Section 1: Simple Linear Regression Carlos M. Carvalho The University of Texas McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009
Content Area: Mathematics Grade Level Expectations: High School Standard: Number Sense, Properties, and Operations Understand the structure and properties of our number system. At their most basic level
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
Local classification and local likelihoods
Local classification and local likelihoods November 18 k-nearest neighbors The idea of local regression can be extended to classification as well The simplest way of doing so is called nearest neighbor
Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model
Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Xavier Conort [email protected] Motivation Location matters! Observed value at one location is
Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard
Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express
Penalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
GEOENGINE MSc in Geomatics Engineering (Master Thesis) Anamelechi, Falasy Ebere
Master s Thesis: ANAMELECHI, FALASY EBERE Analysis of a Raster DEM Creation for a Farm Management Information System based on GNSS and Total Station Coordinates Duration of the Thesis: 6 Months Completion
Week 5: Multiple Linear Regression
BUS41100 Applied Regression Analysis Week 5: Multiple Linear Regression Parameter estimation and inference, forecasting, diagnostics, dummy variables Robert B. Gramacy The University of Chicago Booth School
Mathematics Review for MS Finance Students
Mathematics Review for MS Finance Students Anthony M. Marino Department of Finance and Business Economics Marshall School of Business Lecture 1: Introductory Material Sets The Real Number System Functions,
Least-Squares Intersection of Lines
Least-Squares Intersection of Lines Johannes Traa - UIUC 2013 This write-up derives the least-squares solution for the intersection of lines. In the general case, a set of lines will not intersect at a
Integrated Resource Plan
Integrated Resource Plan March 19, 2004 PREPARED FOR KAUA I ISLAND UTILITY COOPERATIVE LCG Consulting 4962 El Camino Real, Suite 112 Los Altos, CA 94022 650-962-9670 1 IRP 1 ELECTRIC LOAD FORECASTING 1.1
Smoothing. Fitting without a parametrization
Smoothing or Fitting without a parametrization Volker Blobel University of Hamburg March 2005 1. Why smoothing 2. Running median smoother 3. Orthogonal polynomials 4. Transformations 5. Spline functions
Reflection and Refraction
Equipment Reflection and Refraction Acrylic block set, plane-concave-convex universal mirror, cork board, cork board stand, pins, flashlight, protractor, ruler, mirror worksheet, rectangular block worksheet,
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
An Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4
4. Simple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/4 Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression
Face detection is a process of localizing and extracting the face region from the
Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.
Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary
Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:
Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
PCHS ALGEBRA PLACEMENT TEST
MATHEMATICS Students must pass all math courses with a C or better to advance to the next math level. Only classes passed with a C or better will count towards meeting college entrance requirements. If
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Example: Boats and Manatees
Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant
Rob J Hyndman. Forecasting using. 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1
Rob J Hyndman Forecasting using 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1 Outline 1 Regression with ARIMA errors 2 Example: Japanese cars 3 Using Fourier terms for seasonality 4
APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING
APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING Sulaimon Mutiu O. Department of Statistics & Mathematics Moshood Abiola Polytechnic, Abeokuta, Ogun State, Nigeria. Abstract
College Readiness LINKING STUDY
College Readiness LINKING STUDY A Study of the Alignment of the RIT Scales of NWEA s MAP Assessments with the College Readiness Benchmarks of EXPLORE, PLAN, and ACT December 2011 (updated January 17, 2012)
Cross Validation. Dr. Thomas Jensen Expedia.com
Cross Validation Dr. Thomas Jensen Expedia.com About Me PhD from ETH Used to be a statistician at Link, now Senior Business Analyst at Expedia Manage a database with 720,000 Hotels that are not on contract
Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
Algebra 1 Course Information
Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
The Basic Two-Level Regression Model
2 The Basic Two-Level Regression Model The multilevel regression model has become known in the research literature under a variety of names, such as random coefficient model (de Leeuw & Kreft, 1986; Longford,
Fitting Subject-specific Curves to Grouped Longitudinal Data
Fitting Subject-specific Curves to Grouped Longitudinal Data Djeundje, Viani Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh, EH14 4AS, UK E-mail: [email protected] Currie,
17. SIMPLE LINEAR REGRESSION II
17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.
Section 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
Computer Graphics. Geometric Modeling. Page 1. Copyright Gotsman, Elber, Barequet, Karni, Sheffer Computer Science - Technion. An Example.
An Example 2 3 4 Outline Objective: Develop methods and algorithms to mathematically model shape of real world objects Categories: Wire-Frame Representation Object is represented as as a set of points
Measuring Line Edge Roughness: Fluctuations in Uncertainty
Tutor6.doc: Version 5/6/08 T h e L i t h o g r a p h y E x p e r t (August 008) Measuring Line Edge Roughness: Fluctuations in Uncertainty Line edge roughness () is the deviation of a feature edge (as
DATA QUALITY IN GIS TERMINOLGY GIS11
DATA QUALITY IN GIS When using a GIS to analyse spatial data, there is sometimes a tendency to assume that all data, both locational and attribute, are completely accurate. This of course is never the
Smoothing and Non-Parametric Regression
Smoothing and Non-Parametric Regression Germán Rodríguez [email protected] Spring, 2001 Objective: to estimate the effects of covariates X on a response y nonparametrically, letting the data suggest
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!
MATH BOOK OF PROBLEMS SERIES New from Pearson Custom Publishing! The Math Book of Problems Series is a database of math problems for the following courses: Pre-algebra Algebra Pre-calculus Calculus Statistics
Time Series Analysis
Time Series Analysis Identifying possible ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
Scatter Plot, Correlation, and Regression on the TI-83/84
Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page
A Study on the Comparison of Electricity Forecasting Models: Korea and China
Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 675 683 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.675 Print ISSN 2287-7843 / Online ISSN 2383-4757 A Study on the Comparison
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA
We Can Early Learning Curriculum PreK Grades 8 12 INSIDE ALGEBRA, GRADES 8 12 CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA April 2016 www.voyagersopris.com Mathematical
Time Series Analysis
Time Series Analysis [email protected] Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby 1 Outline of the lecture Identification of univariate time series models, cont.:
Sales forecasting # 2
Sales forecasting # 2 Arthur Charpentier [email protected] 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting
Algebra I Vocabulary Cards
Algebra I Vocabulary Cards Table of Contents Expressions and Operations Natural Numbers Whole Numbers Integers Rational Numbers Irrational Numbers Real Numbers Absolute Value Order of Operations Expression
NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS
NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document
Machine Learning and Pattern Recognition Logistic Regression
Machine Learning and Pattern Recognition Logistic Regression Course Lecturer:Amos J Storkey Institute for Adaptive and Neural Computation School of Informatics University of Edinburgh Crichton Street,
Univariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
Interaction between quantitative predictors
Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors
Derive 5: The Easiest... Just Got Better!
Liverpool John Moores University, 1-15 July 000 Derive 5: The Easiest... Just Got Better! Michel Beaudin École de Technologie Supérieure, Canada Email; [email protected] 1. Introduction Engineering
Jiří Matas. Hough Transform
Hough Transform Jiří Matas Center for Machine Perception Department of Cybernetics, Faculty of Electrical Engineering Czech Technical University, Prague Many slides thanks to Kristen Grauman and Bastian
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
We can display an object on a monitor screen in three different computer-model forms: Wireframe model Surface Model Solid model
CHAPTER 4 CURVES 4.1 Introduction In order to understand the significance of curves, we should look into the types of model representations that are used in geometric modeling. Curves play a very significant
3. Regression & Exponential Smoothing
3. Regression & Exponential Smoothing 3.1 Forecasting a Single Time Series Two main approaches are traditionally used to model a single time series z 1, z 2,..., z n 1. Models the observation z t as a
