AMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS
|
|
|
- Marvin Webb
- 10 years ago
- Views:
Transcription
1 AMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS William V. Harper 1 and Isobel Clark 2 1 Otterbein College, United States of America 2 Alloa Business Centre, United Kingdom [email protected] Amarillo by morning, Amarillo s where I'll be comes from a country song released by George Strait in the 1980's. Around this time Bill moved to the Amarillo Texas area hoping to bury highlevel nuclear waste. Like a fine country song, this presentation paints a haunting visual image of the Wolfcamp aquifer underlying the waste site. If a breach occurred, how many mornings until the nuclear waste arrived at Amarillo? Using a variety of visualization tools blended with geostatistical methods, an overview of universal kriging is given at an introductory level. This project was the source of Bill's first geostatistical analysis and was sadly killed in 1988 by the US government. On the plus side, he met his lovely Texas bride Paula there. Meanwhile he and coauthor Isobel have continued their geostatistical journey teaching classes which range from 14 year-old Slovakian High School kids to PhD candidates and beyond. GEOSTATISTICAL DATA VISUALIZATION Geostatistics was not initially developed by the statistical community but instead has its roots in mining and geology. Unlike classical statistics in which observations are assumed to be independent, data in 2-D or 3-D are spatially correlated with values close to each other in distance frequently exhibiting similar values. Each data value has a location in space. The major goal is to estimate values at locations that have not been sampled. Geostatistics offers many helpful visualization tools to aid in the analysis of such spatial data. Due to the proceedings page limit this paper focuses on a few visualization tools. A more complete set of visualization tools are found in the associated PowerPoint file presented at the ICOTS meeting and stored on the web at The U.S. Department of Energy studied possible high-level nuclear waste sites in salt, basalt, and tuff. One of the salt sites studied was in the panhandle of northern Texas close to the New Mexico and Oklahoma borders. The Wolfcamp aquifer underlies the potential repository and provides possible travel paths for leached radionuclides to travel. Figure 1 shows the general setting for the 85 potentiometric (groundwater pressures) values from this aquifer (Harper & Furr, 1986). With higher values generally in the lower left (southwest) and lower values in the upper right (northwest) the groundwater gradient would cause water to flow in a northeasterly direction from the repository in Deaf Smith County toward Amarillo in lower Potter county. The geostatistical technique used is universal kriging (Cressie, 1993; Clark & Harper, 2009) that produces minimum variance linear unbiased estimates. Many iterative steps are involved including distribution analysis, possible data transformation, semi-variogram modeling of spatial variability including trend analysis, cross validation of the proposed model, and finally the kriging that produces grids for mapping of both the expected potentiometric values at un-sampled locations along with a corresponding standard error grid. A typical geologic set of data like the Wolfcamp data consists of irregularly spaced measurements collected over a given geologic region. For this analysis the Wolfcamp aquifer is assumed to be a two-dimensional plane with uniform thickness. As mentioned above geologic measurements generally have similar values when the distance between them is small. As this distance increases, the variability often increases at least up to a certain point known as the range. The longitude and latitude values given in Figure 1 are converted to miles for this analysis. Perhaps the most important aspect of a geostatistical analysis is the assessment of the spatial variability used in the kriging prediction that provides both expected values and standard errors at un-sampled locations so that travel paths and times can be computed in a subsequent study not covered here. For data not on a regular grid it is important to evaluate distances between nearest neighbors so that reasonable data collection distance bins may be established to perform semivariogram modeling of spatial variability. Figure 2 provides an interesting graphic depiction of In C. Reading (Ed.), Data and context in statistics education: Towards an evidence-based society. Proceedings of the Eighth International Conference on Teaching Statistics (ICOTS8, July, 2010), Ljubljana, Slovenia. Voorburg, The Netherlands: International Statistical Institute. publications.php [ 2010 ISI/IASE]
2 where the nearest located neighboring value is located for any individual data value. The distances between the points are summarized in a histogram to determine the semi-variogram data bin sizes. Figure 1. Wolfcamp Potentiometric data in Texas and New Mexico Figure 2. Nearest Neighbor identification for the Wolfcamp Potentiometric data The theoretical semi-variogram is estimated by the empirical semi-variogram defined as where is the number of data values roughly a distance of h miles apart perhaps in a specified direction. The process creates a cloud of individual terms of the semi-variogram (found in the PowerPoint presentation) summarized in Figure 3 into directional semi-variograms to provide a visual way to investigate anisotropy and/or trend. Anisotropy implies that the spatial variability changes based on direction.
3 Figure 3 shows the spatial variability on the vertical axis increases more rapidly in the southwest to northeast direction as the distance h between the data values increases on the horizontal axis. For this application each bin covers a distance of 5 miles, e.g., the first bin collects the data for the directional empirical semi-variograms for points that are 0 to 5 miles apart. Each of the four major directions of the compass are shown with arrows pointing in the relevant direction and are also color coded (shown in PowerPoint) to allow the user to more easily follow a given direction. The size of the arrow is indicative of how many data pairs were found for that direction at a given distance apart. Figure 4 shows the corresponding semi-variogram directional shading plot that aids the visual assessment. The parabolic upper right tail of the northeastern directional semi-variogram is indicative of possible trend that must be investigated before assessing anisotropy. Figure 3. Directional Semi-Variograms for the Wolfcamp data Figure 4. Directional Shaded Plot Semi-Variograms for the Wolfcamp data
4 A global trend assessment is performed using traditional least squares regression to assess what local level trend might be advisable in universal kriging. While a quadratic global trend is statistically significant at the 95% confidence level, it only increases the variability explained by the regression from 89% for a linear fit to 91% for the quadratic. Universal kriging combines a local trend fitting in a specified search radius with the kriging weighted individual values in that search area. Using the residuals from the global trend assessment the directional semi-variograms fit to the linear residual trend is shown in the shaded plot (PowerPoint also has the directional semivariogram similar to Figure 3 for the linear residuals) in Figure 5. This shaded semi-variogram plot shows similar variability in all four major directions when the trend has been accounted for. Thus there is no visual evidence of anisotropy and the data is then considered to be isotropic. Figure 5. Directional Shaded Plot Semi-Variograms on Linear Regression Residuals Since the data are felt to be isotropic an omni-directional semi-variogram is used to model theoretical semi-variograms as seen in Figure 6. There are various theoretical semi-variogram to try as may be seen near the top of Figure 6. The Wolfcamp linear residuals are well fit by a spherical semi-variogram with a nugget of 11,000 ft 2, a sill of 34,000 ft 2, and a range of influence of 60 miles. The nugget is an estimate of the spatial variance as the limit between adjacent data values approaches zero. As the distance between sample points increases along the horizontal axis, the spatial variance grows for a spherical semi-variogram until the observations appear to be independent from one another at the range of 60 miles. From 60 miles on the spatial variability levels off at 45,000 ft 2 which is the sum of the nugget and sill. Geostatistics involves iterative modeling to arrive at a model that fits the spatial data. In addition to visual assessments, cross-validation is used to partially verify that the model does not have obvious flaws in fitting the data. Each data value is removed one at a time and estimated by the surrounding values. The cross-validation residuals are normalized by their kriging standard error and these standardized residuals ideally have an average of 0 and a standard deviation of 1. For the above model assumptions using a 60 mile search radius that determines which values are used to estimate each removed value the average and standard deviation are 0.00 and 1.04, respectively. While this does not guarantee the above model choices are all ideal, it does provide a degree of comfort. Cross-validation graphics (shown in PowerPoint) visually identify where the larger deviations of predicted versus actual values occur providing another visualization tool for investigation of spatial data including the search for possibly anomalous data.
5 Figure 6. Omni-direction Semi-variogram on Wolfcamp Linear Residuals After the modeler is content with the cross-validation and the visual assessments, universal kriging is performed. This was done on a 10 mile grid and the results passed to a contour plotting package such as Surfer. Figures 7 and 8 are the resulting predicted potentiometric surface and the corresponding standard error map developed from the universal kriging grids. Thus one can assess the uncertainty of the prediction. Additionally the standard error map can be used to help decide where additional data might be collected to reduce travel time uncertainty among other things. Figure 7. Universal Kriging Potentiometric Surface Estimation for the Wolfcamp Aquifer
6 Figure 8. Universal Kriging Standard Error Map for the Wolfcamp Aquifer Far field flow simulation modeling in the unlikely case of radionuclides leaching into the Wolfcamp aquifer predicted groundwater travel times from 141,000 to roughly 800,000 years (Devary et al., 1984). The most likely flow paths headed toward Amarillo. With the recent change in U.S. policy in 2009, it now appears that the Yucca Mountain tuff site not killed by congress in 1988 may now be dropped and new alternatives considered including possibly re-opening of the old salt and basalt sites. Time will tell what is in store for some morning in Amarillo. CONCLUSION Key geostatistical visualization tools are illustrated with data for the Wolfcamp aquifer that underlies an area that once was considered for a potential high-level nuclear waste repository. Without such visual methods, geostatistical modeling would be a mathematical black box with many errors and misconceptions. As with most statistical applications, visualization tools aid both correct modeling and understanding of data. This is especially true when modeling two or three dimensional spatial data sets. REFERENCES Clark, I., & Harper, W. V. (2009). Practical Geostatistics Case Studies Columbus, Ohio, USA: Ecosse North America LLC. Cressie, N. A. C. (1993). Statistics for Spatial Data. New York: John Wiley. Devary, J. L., Harper, W. V., Sykes, J. F., & Wilson, J. L. (1984). Far-Field Flow Uncertainty Analysis for the Palo Duro Basin. Materials Research Society Symposia Proceedings: Scientific Basis for Nuclear Waste Management VII, 26 (pp ). Burlington, MA, USA: Elsevier Science Publishing. Harper, W. V., & Furr, J. M. (1986). Geostatistical Analysis of Potentiometric Data in the Wolfcamp Aquifer of the Palo Duro Basin, Texas, BMI/ONWI-587, Columbus, Ohio, USA: Battelle Memorial Institute.
INTRODUCTION TO GEOSTATISTICS And VARIOGRAM ANALYSIS
INTRODUCTION TO GEOSTATISTICS And VARIOGRAM ANALYSIS C&PE 940, 17 October 2005 Geoff Bohling Assistant Scientist Kansas Geological Survey [email protected] 864-2093 Overheads and other resources available
Geography 4203 / 5203. GIS Modeling. Class (Block) 9: Variogram & Kriging
Geography 4203 / 5203 GIS Modeling Class (Block) 9: Variogram & Kriging Some Updates Today class + one proposal presentation Feb 22 Proposal Presentations Feb 25 Readings discussion (Interpolation) Last
ATTACHMENT 8: Quality Assurance Hydrogeologic Characterization of the Eastern Turlock Subbasin
ATTACHMENT 8: Quality Assurance Hydrogeologic Characterization of the Eastern Turlock Subbasin Quality assurance and quality control (QA/QC) policies and procedures will ensure that the technical services
Introduction to Modeling Spatial Processes Using Geostatistical Analyst
Introduction to Modeling Spatial Processes Using Geostatistical Analyst Konstantin Krivoruchko, Ph.D. Software Development Lead, Geostatistics [email protected] Geostatistics is a set of models and
EXPLORING SPATIAL PATTERNS IN YOUR DATA
EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze
An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R)
DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R) Ernst
Annealing Techniques for Data Integration
Reservoir Modeling with GSLIB Annealing Techniques for Data Integration Discuss the Problem of Permeability Prediction Present Annealing Cosimulation More Details on Simulated Annealing Examples SASIM
An Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode
Iris Sample Data Set Basic Visualization Techniques: Charts, Graphs and Maps CS598 Information Visualization Spring 2010 Many of the exploratory data techniques are illustrated with the Iris Plant data
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
UNCERT: GEOSTATISTICAL, GROUND WATER MODELING, AND VISUALIZATION SOFTWARE
CHAPTER 6 UNCERT: GEOSTATISTICAL, GROUND WATER MODELING, AND VISUALIZATION SOFTWARE UNCERT is an uncertainty analysis, geostatistical, ground water modeling, and visualization software package. It was
Data Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Topics Exploratory Data Analysis Summary Statistics Visualization What is data exploration?
COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3
COMP 5318 Data Exploration and Analysis Chapter 3 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
Geostatistical Analyst Tutorial
Copyright 1995-2012 Esri All rights reserved. Table of Contents Introduction to the ArcGIS Geostatistical Analyst Tutorial................... 0 Exercise 1: Creating a surface using default parameters...................
Tutorial - PEST. Visual MODFLOW Flex. Integrated Conceptual & Numerical Groundwater Modeling
Tutorial - PEST Visual MODFLOW Flex Integrated Conceptual & Numerical Groundwater Modeling PEST with Pilot Points Tutorial This exercise demonstrates some of the advanced and exiting opportunities for
Spatial Data Analysis
14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
3D Visualization of Seismic Activity Associated with the Nazca and South American Plate Subduction Zone (Along Southwestern Chile) Using RockWorks
3D Visualization of Seismic Activity Associated with the Nazca and South American Plate Subduction Zone (Along Southwestern Chile) Using RockWorks Table of Contents Figure 1: Top of Nazca plate relative
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Developing sub-domain verification methods based on Geographic Information System (GIS) tools
APPROVED FOR PUBLIC RELEASE: DISTRIBUTION UNLIMITED U.S. Army Research, Development and Engineering Command Developing sub-domain verification methods based on Geographic Information System (GIS) tools
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
A Short Tour of the Predictive Modeling Process
Chapter 2 A Short Tour of the Predictive Modeling Process Before diving in to the formal components of model building, we present a simple example that illustrates the broad concepts of model building.
Section 1.1. Introduction to R n
The Calculus of Functions of Several Variables Section. Introduction to R n Calculus is the study of functional relationships and how related quantities change with each other. In your first exposure to
Module1. x 1000. y 800.
Module1 1 Welcome to the first module of the course. It is indeed an exciting event to share with you the subject that has lot to offer both from theoretical side and practical aspects. To begin with,
How to Discredit Most Real Estate Appraisals in One Minute By Eugene Pasymowski, MAI 2007 RealStat, Inc.
How to Discredit Most Real Estate Appraisals in One Minute By Eugene Pasymowski, MAI 2007 RealStat, Inc. Published in the TriState REALTORS Commercial Alliance Newsletter Spring 2007 http://www.tristaterca.com/tristaterca/
Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary
Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:
VARIANCE REDUCTION TECHNIQUES FOR IMPLICIT MONTE CARLO SIMULATIONS
VARIANCE REDUCTION TECHNIQUES FOR IMPLICIT MONTE CARLO SIMULATIONS An Undergraduate Research Scholars Thesis by JACOB TAYLOR LANDMAN Submitted to Honors and Undergraduate Research Texas A&M University
AMS 5 CHANCE VARIABILITY
AMS 5 CHANCE VARIABILITY The Law of Averages When tossing a fair coin the chances of tails and heads are the same: 50% and 50%. So if the coin is tossed a large number of times, the number of heads and
BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
NEW MEXICO Grade 6 MATHEMATICS STANDARDS
PROCESS STANDARDS To help New Mexico students achieve the Content Standards enumerated below, teachers are encouraged to base instruction on the following Process Standards: Problem Solving Build new mathematical
Using R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
Dimensionality Reduction: Principal Components Analysis
Dimensionality Reduction: Principal Components Analysis In data mining one often encounters situations where there are a large number of variables in the database. In such situations it is very likely
The process components and related data characteristics addressed in this document are:
TM Tech Notes Certainty 3D November 1, 2012 To: General Release From: Ted Knaak Certainty 3D, LLC Re: Structural Wall Monitoring (#1017) rev: A Introduction TopoDOT offers several tools designed specifically
ArcGIS Geostatistical Analyst: Statistical Tools for Data Exploration, Modeling, and Advanced Surface Generation
ArcGIS Geostatistical Analyst: Statistical Tools for Data Exploration, Modeling, and Advanced Surface Generation An ESRI White Paper August 2001 ESRI 380 New York St., Redlands, CA 92373-8100, USA TEL
Introduction to Geostatistics
Introduction to Geostatistics GEOL 5446 Dept. of Geology & Geophysics 3 Credits University of Wyoming Fall, 2013 Instructor: Ye Zhang Grading: A-F Location: ESB1006 Time: TTh (9:35 am~10:50 am), Office
Data Visualization Techniques and Practices Introduction to GIS Technology
Data Visualization Techniques and Practices Introduction to GIS Technology Michael Greene Advanced Analytics & Modeling, Deloitte Consulting LLP March 16 th, 2010 Antitrust Notice The Casualty Actuarial
Fast and Robust Normal Estimation for Point Clouds with Sharp Features
1/37 Fast and Robust Normal Estimation for Point Clouds with Sharp Features Alexandre Boulch & Renaud Marlet University Paris-Est, LIGM (UMR CNRS), Ecole des Ponts ParisTech Symposium on Geometry Processing
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
A Fuel Cost Comparison of Electric and Gas-Powered Vehicles
$ / gl $ / kwh A Fuel Cost Comparison of Electric and Gas-Powered Vehicles Lawrence V. Fulton, McCoy College of Business Administration, Texas State University, [email protected] Nathaniel D. Bastian, University
11.1. Objectives. Component Form of a Vector. Component Form of a Vector. Component Form of a Vector. Vectors and the Geometry of Space
11 Vectors and the Geometry of Space 11.1 Vectors in the Plane Copyright Cengage Learning. All rights reserved. Copyright Cengage Learning. All rights reserved. 2 Objectives! Write the component form of
Local outlier detection in data forensics: data mining approach to flag unusual schools
Local outlier detection in data forensics: data mining approach to flag unusual schools Mayuko Simon Data Recognition Corporation Paper presented at the 2012 Conference on Statistical Detection of Potential
720 Contour Grading. General. References. Resources. Definitions
720 Contour Grading General Contour grading directs water to a desired point, prevents erosion, provides noise deflection, provides visual fit of the facility into the landscape, and protects desirable
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
Economic Statistics (ECON2006), Statistics and Research Design in Psychology (PSYC2010), Survey Design and Analysis (SOCI2007)
COURSE DESCRIPTION Title Code Level Semester Credits 3 Prerequisites Post requisites Introduction to Statistics ECON1005 (EC160) I I None Economic Statistics (ECON2006), Statistics and Research Design
ArcGIS 9. Geostatistical Analyst
ArcGIS 9 Using ArcGIS Geostatistical Analyst Copyright 2001, 2003 ESRI All Rights Reserved. Printed in the United States of America. The information contained in this document is the exclusive property
How To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
Common Core Unit Summary Grades 6 to 8
Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations
Figure 1. An embedded chart on a worksheet.
8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
Simple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
The Effect of Environmental Factors on Real Estate Value
The Effect of Environmental Factors on Real Estate Value Radoslaw CELLMER, Adam SENETRA, Agnieszka SZCZEPANSKA, Poland Key words: environment, landscape, property value, geostatistics SUMMARY The objective
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS Kyoungjin Park Alper Yilmaz Photogrammetric and Computer Vision Lab Ohio State University [email protected] [email protected] ABSTRACT Depending
Part II Chapter 9 Chapter 10 Chapter 11 Chapter 12 Chapter 13 Chapter 14 Chapter 15 Part II
Part II covers diagnostic evaluations of historical facility data for checking key assumptions implicit in the recommended statistical tests and for making appropriate adjustments to the data (e.g., consideration
Hydrogeological Data Visualization
Conference of Junior Researchers in Civil Engineering 209 Hydrogeological Data Visualization Boglárka Sárközi BME Department of Photogrammetry and Geoinformatics, e-mail: [email protected] Abstract
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION ANALYSIS. email [email protected]
Eighth International IBPSA Conference Eindhoven, Netherlands August -4, 2003 APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION Christoph Morbitzer, Paul Strachan 2 and
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes
Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes Simcha Pollack, Ph.D. St. John s University Tobin College of Business Queens, NY, 11439 [email protected]
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer [email protected] Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
MARS STUDENT IMAGING PROJECT
MARS STUDENT IMAGING PROJECT Data Analysis Practice Guide Mars Education Program Arizona State University Data Analysis Practice Guide This set of activities is designed to help you organize data you collect
Chapter 6: Constructing and Interpreting Graphic Displays of Behavioral Data
Chapter 6: Constructing and Interpreting Graphic Displays of Behavioral Data Chapter Focus Questions What are the benefits of graphic display and visual analysis of behavioral data? What are the fundamental
CRLS Mathematics Department Algebra I Curriculum Map/Pacing Guide
Curriculum Map/Pacing Guide page 1 of 14 Quarter I start (CP & HN) 170 96 Unit 1: Number Sense and Operations 24 11 Totals Always Include 2 blocks for Review & Test Operating with Real Numbers: How are
Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard
Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express
Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares
Linear Regression Chapter 5 Regression Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). We can then predict the average response for all subjects
NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES
Vol. XX 2012 No. 4 28 34 J. ŠIMIČEK O. HUBOVÁ NUMERICAL ANALYSIS OF THE EFFECTS OF WIND ON BUILDING STRUCTURES Jozef ŠIMIČEK email: [email protected] Research field: Statics and Dynamics Fluids mechanics
Problems With Using Microsoft Excel for Statistics
Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 Problems With Using Microsoft Excel for Statistics Jonathan D. Cryer ([email protected]) Department of Statistics
Clustering & Visualization
Chapter 5 Clustering & Visualization Clustering in high-dimensional databases is an important problem and there are a number of different clustering paradigms which are applicable to high-dimensional data.
Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.
Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged
AN INVESTIGATION INTO THE USEFULNESS OF THE ISOCS MATHEMATICAL EFFICIENCY CALIBRATION FOR LARGE RECTANGULAR 3 x5 x16 NAI DETECTORS
AN INVESTIGATION INTO THE USEFULNESS OF THE ISOCS MATHEMATICAL EFFICIENCY CALIBRATION FOR LARGE RECTANGULAR 3 x5 x16 NAI DETECTORS Frazier L. Bronson CHP Canberra Industries, Inc. 800 Research Parkway,
How do you compare numbers? On a number line, larger numbers are to the right and smaller numbers are to the left.
The verbal answers to all of the following questions should be memorized before completion of pre-algebra. Answers that are not memorized will hinder your ability to succeed in algebra 1. Number Basics
HW6 Solutions Notice numbers may change randomly in your assignments and you may have to recalculate solutions for your specific case.
HW6 Solutions Notice numbers may change randomly in your assignments and you may have to recalculate solutions for your specific case. Tipler 22.P.053 The figure below shows a portion of an infinitely
How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free)
Statgraphics Centurion XVII (currently in beta test) is a major upgrade to Statpoint's flagship data analysis and visualization product. It contains 32 new statistical procedures and significant upgrades
An Analysis of the Rossby Wave Theory
An Analysis of the Rossby Wave Theory Morgan E. Brown, Elise V. Johnson, Stephen A. Kearney ABSTRACT Large-scale planetary waves are known as Rossby waves. The Rossby wave theory gives us an idealized
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
OPTIMAL GRONDWATER MONITORING NETWORK DESIGN FOR POLLUTION PLUME ESTIMATION WITH ACTIVE SOURCES
Int. J. of GEOMATE, Int. June, J. of 14, GEOMATE, Vol. 6, No. June, 2 (Sl. 14, No. Vol. 12), 6, pp. No. 864-869 2 (Sl. No. 12), pp. 864-869 Geotech., Const. Mat. & Env., ISSN:2186-2982(P), 2186-299(O),
Skewness and Kurtosis in Function of Selection of Network Traffic Distribution
Acta Polytechnica Hungarica Vol. 7, No., Skewness and Kurtosis in Function of Selection of Network Traffic Distribution Petar Čisar Telekom Srbija, Subotica, Serbia, [email protected] Sanja Maravić Čisar
The correlation coefficient
The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative
CHAPTER 4 LEGAL DESCRIPTION OF LAND DESCRIBING LAND METHODS OF DESCRIBING REAL ESTATE
r CHAPTER 4 LEGAL DESCRIPTION OF LAND DESCRIBING LAND A legal description is a detailed way of describing a parcel of land for documents such as deeds and mortgages that will be accepted in a court of
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon ABSTRACT Effective business development strategies often begin with market segmentation,
Data Mining and Exploratory Statistics to Visualize Fractures and Migration Paths in the WCBS*
Data Mining and Exploratory Statistics to Visualize Fractures and Migration Paths in the WCBS* Jean-Yves Chatellier 1 and Michael Chatellier 2 Search and Discovery Article #41582 (2015) Posted February
The Fourth International DERIVE-TI92/89 Conference Liverpool, U.K., 12-15 July 2000. Derive 5: The Easiest... Just Got Better!
The Fourth International DERIVE-TI9/89 Conference Liverpool, U.K., -5 July 000 Derive 5: The Easiest... Just Got Better! Michel Beaudin École de technologie supérieure 00, rue Notre-Dame Ouest Montréal
Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
USING DIRECTED ONLINE TUTORIALS FOR TEACHING ENGINEERING STATISTICS
USING DIRECTED ONLINE TUTORIALS FOR TEACHING ENGINEERING STATISTICS Richard J. School of Mathematics and Physics, The University of Queensland, Australia [email protected] Since 2006, an internet
Using Four-Quadrant Charts for Two Technology Forecasting Indicators: Technology Readiness Levels and R&D Momentum
Using Four-Quadrant Charts for Two Technology Forecasting Indicators: Technology Readiness Levels and R&D Momentum Tang, D. L., Wiseman, E., & Archambeault, J. Canadian Institute for Scientific and Technical
9. Sampling Distributions
9. Sampling Distributions Prerequisites none A. Introduction B. Sampling Distribution of the Mean C. Sampling Distribution of Difference Between Means D. Sampling Distribution of Pearson's r E. Sampling
Principles of groundwater flow
Principles of groundwater flow Hydraulic head is the elevation to which water will naturally rise in a well (a.k.a. static level). Any well that is not being pumped will do for this, but a well that is
Common Tools for Displaying and Communicating Data for Process Improvement
Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot
THE COST OF COLLEGE EDUCATION PROJECT PACKET
THE COST OF COLLEGE EDUCATION PROJECT PACKET Introduction We live in a society where a college education is considered the norm and not the exception. While everyone is expected to attend a college or
