Spatial autocorrelation analysis of residuals and geographically weighted regression

Size: px
Start display at page:

Download "Spatial autocorrelation analysis of residuals and geographically weighted regression"

Transcription

1 Spatial autocorrelation analysis of residuals and geographically weighted regression Materials: Use your project from the tutorial Temporally dynamic aspatial regression in SpaceStat Objective: You will undertake a LISA analysis to determine whether regression residuals are spatially autocorrelated. You will then conduct a geographically weighted regression (GWR) to: (1) Improve local predictive power of the regression; (2) Reduce autocorrelation in the residuals; (3) Relax the assumption of stationary regression coefficients. Why GWR? The aspatial regressions you ran in the tutorial Temporally dynamic aspatial regression in SpaceStat applied over the entire study area and assumed the regression coefficients were the same in all locations on the map (were stationary). GWR relaxes this assumption, and may be an appropriate method in those instances when you think the regression coefficients may indeed differ from one location to another. Alternative regression models such as spatial lag regression, spatial error regression, and spatial multi-level regression are also available in SpaceStat. You also may use geostatistical models for spatial prediction. Here we ll be using GWR. The analysis of residuals: We analyze residuals to determine whether the assumptions of regression have been met, primarily that the residuals are IID independent and identically distributed. This means: (1) The errors are independent of one another, and their values do not depend on the value of residuals at neighboring locations (i.e. no spatial autocorrelation in the residuals) (2) The errors have a constant variance (homoskedastic). We therefore inspect (i) timeplots of the residuals to see whether their dispersion changes through time; and (ii) scatterplots of the residuals vs. the predictors to see whether their dispersion depends on the value of the predictors. (3) The errors are normally distributed. We verify this using histograms of the residuals. The relative importance of these assumptions is (1) constant variance, (2) independence, and (3) normality (Griffith and Layne 2000). What happens if these assumptions aren t met? Heteroskedasticity in the residuals causes estimates of the regression coefficients to be less precise. Spatial autocorrelation in the residuals results in an underestimation of the standard error of the estimates of the regression coefficients and a bias towards rejecting the null hypothesis that the value of the coefficient is zero. Non-normality of the residuals compromises interpretability of significance tests of the regression coefficients. Finally, multicollinearity results in over-estimates of the variances of the regression coefficients. For the model to be correct we also assume (1) The relationships between dependent and independent variables are linear (since we used a linear model); and (2) all independent variables are included (the model is properly specified). There are statistical tests for evaluating these assumptions. The approach taken in SpaceStat is primarily through visualization.

2 In the tutorial Temporally dynamic aspatial regression in SpaceStat you checked the assumptions of homoskedasticity and normality. We now assess spatial independence of the regression residuals. Step 1: Spatial autocorrelation analysis of residuals Spatial autocorrelation in the residuals is often interpreted to mean that (1) an important independent variable (predictor) is missing from the regression, or (2) an underlying spatial process that induces spatial autocorrelation in some of the variables is missing from the model (e.g. groundwater flow inducing spatial autocorrelation in heavy metals). Load the project you created in Temporally dynamic aspatial regression in SpaceStat. Complete the analysis of residuals by determining whether they are spatially autocorrelated. Click on Methods.. Clustering.. Local Moran.. Univariate Local Moran. Change the dataset to be Residuals. Look at the advanced and output tabs, using help to explore them (you don t need to make any changes). Run the method. When the method finishes the local Moran map and Moran scatterplot will appear, and the global Moran s I will be written to the log view. If they are not already shown, turn on the graph statistics for the Moran scatterplot. The local Moran map and the Moran scatterplot are already time-synchronized. Animate the Moran scatterplot and local Moran map through time (you will only be able to do this when at least one of the variables in your regression changes through time, otherwise your residuals will be static). Recall the slope of the line on the Moran scatterplot is the global Moran s I coefficient. Is global spatial autocorrelation increasing, decreasing or not changing through time? Open the log view and inspect the table of local Moran coefficients. Is there significant global spatial autocorrelation in the residuals? Is the significance of the global Moran s I changing through time? Inspect the local Moran map, recalling that residuals are calculated as: (observed value of the dependent variable Estimated mean). Red clusters are areas of high regression residuals where the observed value is under predicted, and blue clusters are areas of low residuals where the observed value is over predicted. Does it make sense to interpret the local Moran map when global autocorrelation in the residuals is absent? Why or why not? Identification of localities where your aspatial model is over predicting or under predicting can lead to insights regarding mis-specification of your regression model. Further inspect the local Moran map. Where is your model over predicting? Where is it under predicting? Does this give you any additional insights into some other variable you might choose to incorporate into your model?

3 Step 2: Geographically Weighted Regression Geographically Weighted Regression (GWR) may be used when there is spatial autocorrelation in the residuals from the aspatial regression, or when you have reason to believe the regression coefficients might change from one location to another (e.g. the regression coefficients are not stationary). In SpaceStat GWR uses point geographies. If you are analyzing a polygon geography SpaceStat will automatically calculate polygon centroids, and then apply GWR to those centroids. Should you wish to calculate a centroid geography outside of GWR you would use Tools..create centroid geography, and click ok on the create centroid geography dialog. You can check the data view to verify the centroid geography was created. Now select Methods.. Regression.. Geographically weighted regression. The model you created for aspatial regression should automatically be available for GWR. If not, you will need to recreate your regression model, specifying the dependent and independent variables as you did earlier for aspatial regression. Be sure to create the same regression model you used in aspatial regression, as you will compare results from the two approaches later in this exercise. Click ok on the define regression tab. Open the other tabs ( regression settings, bandwidth settings, and more settings ), and use help to explore how they can be used. For now you don t need to change anything on these other tabs. Now run the GWR method. SpaceStat will then calculate a local regression for every local area in the dataset, using the data from 20 nearest neighbors as the input to each regression. Open the dataview if it isn t already open. Look under your regression s dependent variable for the folder GWR. New datasets created by GWR will be highlighted. Reported are the local weighted means of your dependent and independent variables, standard deviations, and local correlations between the dependent variable and each of your independent variables. You also will see residuals, std error of the mean and R- square. Map the predicted value: Create a map of the estimated mean of [name of your dependent variable] under the GWR model and color it in a fashion similar to what you used for the estimated mean from your aspatial model. How do these compare? Do either of the models appear to do a better job of estimating the observed values of the dependent variable? Map uncertainty in the predicted value: When creating models of spatial data it is essential to always display maps of the uncertainty in the predicted value. Do this by mapping the Std err of mean. This value is the standard error of the estimated mean of [name of your dependent variable], which of course is the value predicted by GWR. Now create a scatterplot with the estimated mean on the x-axis and std err of the mean on the y-axis. Ideally, uncertainty in the estimated mean would be independent of the value of the estimated mean. Is this true for your model? Why or why not?

4 Map the residuals from GWR: The GWR procedure should have created a map of the residuals, using a continuous color scheme with color breaks such that 0 is a neutral color, negative values are a cool color (green), and positive values are a hot color (Purple). Undertake a LISA analysis of the residuals from GWR, and inspect how the clusters (should there be any) vary through time. Inspect the table of global Moran values. Compare these to the global Moran values you obtained for your Moran analysis of the residuals from the aspatial regression. Are the residuals from GWR more autocorrelated or less autocorrelated than those from the aspatial regression? Why might this be? Evaluate local correlations: GWR also calculates local correlations between the dependent and independent variables e.g. correlation b/t ARSENIC and WPROSTATE. Map these correlations and inspect them for pattern (Use a color ramp centered on 0, with red indicating high positive values and blue indicating negative values). This tells you where the underlying correlations between the variables are the same, and where they are different. Strong underlying spatial variability in the correlations among the variables can support the use of a GWR model since the strength of the dependencies varies from one place to another. Should these correlations be stationary you may wish to employ a modeling approach other than GWR. Compare predicted to observed values: As for the aspatial model, one measure of the predictive ability of the model is how strongly the predicted value of the dependent variable corresponds to the observed value of the dependent variable. Assess this by creating a scatterplot of the estimated mean of the dependent variable from GWR to its observed value. Scale the x and y axes to the same range. If your GWR model was perfect the points would fall on the 45 degree line and the correlation shown in the graph statistics would be 1.0. How large is this correlation for your GWR model? Does it change through time? What does this tell you about the predictive ability of your GWR model? Step 3: Compare the aspatial and GWR models Compare the predictive powers of the aspatial and GWR models: Place the scatterplots of the predicted and observed values for your aspatial and GWR models side by side. If you have time-dynamic models, synchronize the two scatterplots and play them. How do the predictive abilities of the aspatial and GWR models compare? Evaluate goodness of fit using R-square: Now explore the local goodness of fit using the R-square from GWR. Right click on R-square in the GWR folder and create a new map. Also create a histogram of the GWR R-square and time synchronize the new map and histogram. Use these to answer the following questions. Do the locations with the largest R-square change through time? What is the maximum of the local R-square from GWR? Is the average R-square from GWR larger or smaller than the R-square from the aspatial linear regression?

5 Recall that GWR relaxes the assumption of stationary regression coefficients. Why might your regression using GWR fit better in some locations than others? Step 4: Think about your results Based on these analyses, how might you change your original question/hypothesis?

Introduction to Exploratory Data Analysis

Introduction to Exploratory Data Analysis Introduction to Exploratory Data Analysis A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Geographically Weighted Regression

Geographically Weighted Regression Geographically Weighted Regression CSDE Statistics Workshop Christopher S. Fowler PhD. February 1 st 2011 Significant portions of this workshop were culled from presentations prepared by Fotheringham,

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Module 5: Statistical Analysis

Module 5: Statistical Analysis Module 5: Statistical Analysis To answer more complex questions using your data, or in statistical terms, to test your hypothesis, you need to use more advanced statistical tests. This module reviews the

More information

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Formula for linear models. Prediction, extrapolation, significance test against zero slope. Formula for linear models. Prediction, extrapolation, significance test against zero slope. Last time, we looked the linear regression formula. It s the line that fits the data best. The Pearson correlation

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

EXPLORING SPATIAL PATTERNS IN YOUR DATA

EXPLORING SPATIAL PATTERNS IN YOUR DATA EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices: Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

More information

New Tools for Spatial Data Analysis in the Social Sciences

New Tools for Spatial Data Analysis in the Social Sciences New Tools for Spatial Data Analysis in the Social Sciences Luc Anselin University of Illinois, Urbana-Champaign anselin@uiuc.edu edu Outline! Background! Visualizing Spatial and Space-Time Association!

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS. SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Homework 11. Part 1. Name: Score: / null

Homework 11. Part 1. Name: Score: / null Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is

More information

Data analysis and regression in Stata

Data analysis and regression in Stata Data analysis and regression in Stata This handout shows how the weekly beer sales series might be analyzed with Stata (the software package now used for teaching stats at Kellogg), for purposes of comparing

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Descriptive statistics consist of methods for organizing and summarizing data. It includes the construction of graphs, charts and tables, as well various descriptive measures such

More information

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

Moderation. Moderation

Moderation. Moderation Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Appendix 1: Time series analysis of peak-rate years and synchrony testing.

Appendix 1: Time series analysis of peak-rate years and synchrony testing. Appendix 1: Time series analysis of peak-rate years and synchrony testing. Overview The raw data are accessible at Figshare ( Time series of global resources, DOI 10.6084/m9.figshare.929619), sources are

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Multiple Regression: What Is It?

Multiple Regression: What Is It? Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Nonlinear Regression Functions. SW Ch 8 1/54/

Nonlinear Regression Functions. SW Ch 8 1/54/ Nonlinear Regression Functions SW Ch 8 1/54/ The TestScore STR relation looks linear (maybe) SW Ch 8 2/54/ But the TestScore Income relation looks nonlinear... SW Ch 8 3/54/ Nonlinear Regression General

More information

Obesity in America: A Growing Trend

Obesity in America: A Growing Trend Obesity in America: A Growing Trend David Todd P e n n s y l v a n i a S t a t e U n i v e r s i t y Utilizing Geographic Information Systems (GIS) to explore obesity in America, this study aims to determine

More information

You have data! What s next?

You have data! What s next? You have data! What s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511 Spring 2014 Part 1:! Research Questions Part 1:! Research Questions Write down > 2 things you thought were

More information

Correlation and Regression Analysis: SPSS

Correlation and Regression Analysis: SPSS Correlation and Regression Analysis: SPSS Bivariate Analysis: Cyberloafing Predicted from Personality and Age These days many employees, during work hours, spend time on the Internet doing personal things,

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.

More information

Spatial Data Analysis Using GeoDa. Workshop Goals

Spatial Data Analysis Using GeoDa. Workshop Goals Spatial Data Analysis Using GeoDa 9 Jan 2014 Frank Witmer Computing and Research Services Institute of Behavioral Science Workshop Goals Enable participants to find and retrieve geographic data pertinent

More information

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5 Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online

More information

Ordinal Regression. Chapter

Ordinal Regression. Chapter Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions What will happen if we violate the assumption that the errors are not serially

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Exploring Changes in the Labor Market of Health Care Service Workers in Texas and the Rio Grande Valley I. Introduction

Exploring Changes in the Labor Market of Health Care Service Workers in Texas and the Rio Grande Valley I. Introduction Ina Ganguli ENG-SCI 103 Final Project May 16, 2007 Exploring Changes in the Labor Market of Health Care Service Workers in Texas and the Rio Grande Valley I. Introduction The shortage of healthcare workers

More information

Dealing with Data in Excel 2010

Dealing with Data in Excel 2010 Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing

More information

Spatial Analysis with GeoDa Spatial Autocorrelation

Spatial Analysis with GeoDa Spatial Autocorrelation Spatial Analysis with GeoDa Spatial Autocorrelation 1. Background GeoDa is a trademark of Luc Anselin. GeoDa is a collection of software tools designed for exploratory spatial data analysis (ESDA) based

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your

More information

Scatter Plots with Error Bars

Scatter Plots with Error Bars Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each

More information

Regression and Correlation

Regression and Correlation Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the

More information

An SPSS companion book. Basic Practice of Statistics

An SPSS companion book. Basic Practice of Statistics An SPSS companion book to Basic Practice of Statistics SPSS is owned by IBM. 6 th Edition. Basic Practice of Statistics 6 th Edition by David S. Moore, William I. Notz, Michael A. Flinger. Published by

More information

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.

More information

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics

More information

Modifying Colors and Symbols in ArcMap

Modifying Colors and Symbols in ArcMap Modifying Colors and Symbols in ArcMap Contents Introduction... 1 Displaying Categorical Data... 3 Creating New Categories... 5 Displaying Numeric Data... 6 Graduated Colors... 6 Graduated Symbols... 9

More information

An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth

An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth Proceedings of the National Conference On Undergraduate Research (NCUR) 2012 Weber State University March 29-31, 2012 An Analysis of the Undergraduate Tuition Increases at the University of Minnesota Duluth

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association

More information

Solución del Examen Tipo: 1

Solución del Examen Tipo: 1 Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical

More information

Curve Fitting in Microsoft Excel By William Lee

Curve Fitting in Microsoft Excel By William Lee Curve Fitting in Microsoft Excel By William Lee This document is here to guide you through the steps needed to do curve fitting in Microsoft Excel using the least-squares method. In mathematical equations

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Chapter 2: Descriptive Statistics

Chapter 2: Descriptive Statistics Chapter 2: Descriptive Statistics **This chapter corresponds to chapters 2 ( Means to an End ) and 3 ( Vive la Difference ) of your book. What it is: Descriptive statistics are values that describe the

More information

5 Correlation and Data Exploration

5 Correlation and Data Exploration 5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Geographically Weighted Regression

Geographically Weighted Regression Geographically Weighted Regression A Tutorial on using GWR in ArcGIS 9.3 Martin Charlton A Stewart Fotheringham National Centre for Geocomputation National University of Ireland Maynooth Maynooth, County

More information

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4 4. Simple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/4 Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard

More information

An Introduction to Point Pattern Analysis using CrimeStat

An Introduction to Point Pattern Analysis using CrimeStat Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

More information

Geostatistics Exploratory Analysis

Geostatistics Exploratory Analysis Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Time Series Laboratory

Time Series Laboratory Time Series Laboratory Computing in Weber Classrooms 205-206: To log in, make sure that the DOMAIN NAME is set to MATHSTAT. Use the workshop username: primesw The password will be distributed during the

More information

SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard

SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard p. 1 SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard The following tutorial is a guide to some basic procedures in SPSS that will be useful as you complete your data assignments for PPA 722. The purpose

More information

NetSurv & Data Viewer

NetSurv & Data Viewer NetSurv & Data Viewer Prototype space-time analysis and visualization software from TerraSeer Dunrie Greiling, TerraSeer Inc. TerraSeer Software sales BoundarySeer for boundary detection and analysis ClusterSeer

More information

Petrel TIPS&TRICKS from SCM

Petrel TIPS&TRICKS from SCM Petrel TIPS&TRICKS from SCM Knowledge Worth Sharing Histograms and SGS Modeling Histograms are used daily for interpretation, quality control, and modeling in Petrel. This TIPS&TRICKS document briefly

More information

(More Practice With Trend Forecasts)

(More Practice With Trend Forecasts) Stats for Strategy HOMEWORK 11 (Topic 11 Part 2) (revised Jan. 2016) DIRECTIONS/SUGGESTIONS You may conveniently write answers to Problems A and B within these directions. Some exercises include special

More information

Summary of important mathematical operations and formulas (from first tutorial):

Summary of important mathematical operations and formulas (from first tutorial): EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a

More information

Topic 13 Predictive Modeling. Topic 13. Predictive Modeling

Topic 13 Predictive Modeling. Topic 13. Predictive Modeling Topic 13 Predictive Modeling Topic 13 Predictive Modeling 13.1 Predicting Yield Maps Talk about the future of Precision Ag how about maps of things yet to come? Sounds a bit far fetched but Spatial Data

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

REGRESSION: FORECASTING USING EXPLANATORY FACTORS

REGRESSION: FORECASTING USING EXPLANATORY FACTORS REGRESSION: FORECASTING USING EXPLANATORY FACTORS Jerry Baugher, sales manager for Lawrence Construction, had just finished gathering data on the company s 30 most recent single-family homes (see Table

More information

A Workflow for Creating and Sharing Maps

A Workflow for Creating and Sharing Maps A Workflow for Creating and Sharing Maps By Keith Mann, Esri What You Will Need Item Source ArcGIS Online for Organizations subscription ArcGIS 10.1 for Desktop (Any license level) ArcGIS Spatial Analyst

More information

5. Multiple regression

5. Multiple regression 5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful

More information

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

More information

Causes of Inflation in the Iranian Economy

Causes of Inflation in the Iranian Economy Causes of Inflation in the Iranian Economy Hamed Armesh* and Abas Alavi Rad** It is clear that in the nearly last four decades inflation is one of the important problems of Iranian economy. In this study,

More information

EST.03. An Introduction to Parametric Estimating

EST.03. An Introduction to Parametric Estimating EST.03 An Introduction to Parametric Estimating Mr. Larry R. Dysert, CCC A ACE International describes cost estimating as the predictive process used to quantify, cost, and price the resources required

More information

Excel Tutorial. Bio 150B Excel Tutorial 1

Excel Tutorial. Bio 150B Excel Tutorial 1 Bio 15B Excel Tutorial 1 Excel Tutorial As part of your laboratory write-ups and reports during this semester you will be required to collect and present data in an appropriate format. To organize and

More information

Mapped Data Visualization and Summary

Mapped Data Visualization and Summary Topic 2 Mapped Data Visualization and Summary 2.1 Visceral Visions This and the following topics will apply map analysis techniques in a case study format. The map-ematical procedures used to translate

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY

Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY ABSTRACT: This project attempted to determine the relationship

More information