Data Validation Online References



Similar documents
Topology. Shapefile versus Coverage Views

Working with Geodatabase Topology

GIS Data in ArcGIS. Pay Attention to Data!!!

Editing Common Polygon Boundary in ArcGIS Desktop 9.x

Using Map Topology Editing Tools

GIS Databases With focused on ArcSDE

Topology and Topological Rules Geometric properties that are maintained in spatial databases

Introduction to the ArcGIS Data Model and Application Structure

How To Improve Gis Data Quality

ArcGIS Data Models Practical Templates for Implementing GIS Projects

INTRODUCTION TO ARCGIS SOFTWARE

TELECOM FIBER EDITING TOOLS REFERENCE GUIDE Version 1.2

Database Production and Map Series Management. Using. The Production Line Tool Set

Geodatabase Programming with SQL

GIS Spatial Data Standards

The GeoMedia Fusion Validate Geometry command provides the GUI for detecting geometric anomalies on a single feature.


The Importance of Data Quality in your Enterprise

Spatial data models (types) Not taught yet

Geodatabase Tutorial. Copyright Esri All rights reserved.

City of Tigard. GIS Data Standards

Spatial Adjustment Tools: The Tutorial

An Esri White Paper July 2010 Highway Data Management in ArcGIS

Tutorial Creating a regular grid for point sampling

Challenges and Success of Migrating to an Enterprise Database in York County, PA

ArcGIS. Server. A Complete and Integrated Server GIS

in R Binbin Lu, Martin Charlton National Centre for Geocomputation National University of Ireland Maynooth The R User Conference 2011

ArcGIS Pro. James Tedrick, Esri

Lab 3. GIS Data Entry and Editing.

A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION

Enterprise GIS: Database Planning

Geodatabase Archiving: Introduction to Concepts and Capabilities

Guidelines for the use of the OGP P6/11 bin grid GIS data model

Web Editing Tutorial. Copyright Esri All rights reserved.

Bentley ArcGIS. Connector

Creating a File Geodatabase

INCOG Transportation Planning Division Spatial Data Management Workflow GIS-T 2008

About As. In a team with the best. ESRI Bulgaria is the exclusive distributor of Esri Inc. for Bulgaria. Esri Inc.

GEOGRAPHIC INFORMATION SYSTEMS

GIS and Mapping Solutions for Developers. ESRI Developer Network (EDN SM)

Validation and automatic repair of two- and three-dimensional GIS datasets

Working with ArcGIS Network Analyst

Spatial Database Support

Performance Tips and Tricks for ArcGIS Desktop 8.1

are aimed for the investigation, planning, implementation, and decision making divisions.

Personal Geodatabase 101

SCALABILITY OF CONTEXTUAL GENERALIZATION PROCESSING USING PARTITIONING AND PARALLELIZATION. Marc-Olivier Briat, Jean-Luc Monnot, Edith M.

Syllabus AGET 782. GIS for Agricultural and Natural Resources Management

ArcGIS Network Analyst: Networks and Network Models

What s new in TIBCO Spotfire 6.5

Data Interoperability Extension Tutorial

Data Visualization Techniques and Practices Introduction to GIS Technology

GIS Data Quality and Evaluation. Tomislav Sapic GIS Technologist Faculty of Natural Resources Management Lakehead University

Geodatabase Tuning and Performance. Gillian Silvertand Greg Cunningham

(Geo)database and Data Management

Working with Data from External Sources

What is GIS. What is GIS? University of Tsukuba. What do you image of GIS? Copyright(C) ESRI Japan Corporation. All rights reserved.

ESRI Mobile GIS Solutions Overview. Shane Clarke ESRI

Editing Strategies for Enterprise Geodatabase

ArcGIS Server and Geodatabase Administration for 10.2

Developing Microsoft SQL Server Databases (20464) H8N64S

The Courses. Covering complete breadth of GIS technology from ESRI including ArcGIS, ArcGIS Server and ArcGIS Engine.

Developing Microsoft SQL Server Databases 20464C; 5 Days

Microsoft SQL Database Administrator Certification

ArcGIS 10.1 Geodatabase Administration. Gordon Sumerling & Christopher Brown

Oracle8i Spatial: Experiences with Extensible Databases

Working with the Geodatabase Using SQL

Five Best Practices for Maintaining an Enterprise Geodatabase

NatureServe s Environmental Review Tool

Database Servers Tutorial

Oracle Database 10g: Building GIS Applications Using the Oracle Spatial Network Data Model. An Oracle Technical White Paper May 2005


20464C: Developing Microsoft SQL Server Databases

DATA QUALITY IN GIS TERMINOLGY GIS11

Government 1009: Advanced Geographical Information Systems Workshop. LAB EXERCISE 3b: Network

A Method Using ArcMap to Create a Hydrologically conditioned Digital Elevation Model

IBM InfoSphere MDM Server v9.0. Version: Demo. Page <<1/11>>

Workflow improvement with FME in Skedsmo municipality

ArcSDE Spatial Data Management Roles and Responsibilities

Data Validation and Quality Assurance with FME

GIS Architecture and Data Management Practices Boone County GIS Created and Maintained by the Boone County Planning Commission GIS Services Division

Exam Name: IBM InfoSphere MDM Server v9.0

A GIS helps you answer questions and solve problems by looking at your data in a way that is quickly understood and easily shared.

TOWARDS AN AUTOMATED HEALING OF 3D URBAN MODELS

Course 20464: Developing Microsoft SQL Server Databases

Developing Microsoft SQL Server Databases

HowTo Rhino & ICEM. 1) New file setup: choose Millimeter (automatically converts to Meters if imported to ICEM)

Integrating Quality Assurance into the GIS Project Life Cycle

Kingdom Of Bahrain Ministry of Works. Enterprise Asset Management System A Geocentric Approach. Presented By Hisham Y.

Prerequisites Attended the previous technical session: Understanding geodatabase editing workflows: Introduction

Soil Data Viewer 5.1 User Guide

Customizing ArcPad solutions

10. Creating and Maintaining Geographic Databases. Learning objectives. Keywords and concepts. Overview. Definitions

How To Build Gis Applications With An Arcgis Engine

Remote Sensing, GPS and GIS Technique to Produce a Bathymetric Map

Transcription:

Data Validation Online References Submitted To: Program Manager GeoConnections Victoria, BC, Canada Submitted By: Jody Garnett Brent Owens Refractions Research Inc. Suite 400, 1207 Douglas Street Victoria, BC, V8W-2E7 jgarnett@refractions.net Phone: (250) 885-0632 Fax: (250) 383-2140

TABLE OF CONTENTS DATA VALIDATION ONLINE REFERENCES...1 TABLE OF CONTENTS...2 1 INTRODUCTION...3 2 DATA VALIDATION TOOLS...3 2.1 AUTOMATED QA...3 2.2 DOG CREEK QC PRO...3 2.3 GEODATA SENTRY...4 2.4 ESRI S GIS DATA REVIEWER...4 2.5 ESRI S ARCGIS 8.3 GEODATABASE...5 2.6 ARCFM ENERGY...7 2.7 PRODUCTION LINE TOOL SET GIS DATA REVIEWER...7 2.8 ENC ANALYZER...7 2.9 GEOSTATISTICAL ANALYST (ARCGIS 8.1 EXTENSION)...8 3 REFERENCES...8-2 -

1 INTRODUCTION Spatial data validation and GIS Quality Assurance tools is an important and vibrant area of development. In this document we will examine online references for validating attribute data using existing tools. 2 DATA VALIDATION TOOLS 2.1 Automated QA This is a representative tool of the vast majority of the validation tools available online these tools perform a number of specific tests and have no facilities for extension. This tools checks: Structural integrity of the spatial data set Spatial features and attributes Database integrity Cartographic annotation Edge-matching across tiles 2.2 Dog Creek QC Pro QC Pro is targeted at validates coverages. Several consulting companies offer services based around its use. This tool offers several innovations worthy of note: Configuration is specified by prototype. The tool is loaded with a template coverage data set that is used to create a Design Registry. Tests are arranged into test suites, called exams, for management and reporting purposes. The results are generated as html web pages. The tool is also willing to use ARC/INFO tests in exam and report specifications. Capabilities: Cleanliness Cover spec: topology matching, table matching, projection matching, precision matching Existence Table Spec: attribute checks - 3 -

Consistency using Arc/Info CONSIST command Duplicates Physical Consistency: Intersect Errors, Label Errors, Node Errors, Regional Errors, Extents Referential Integrity Validity: valid values, null zeroes, range, equality, unquity Limitations: A bulk data tester, does not test per transaction 2.3 GeoData Sentry This tool is similar in design to QC Pro (it is produced by the same company) with a focus on geodatabases (rather than coverages). Tests are organized into test suites. Test can be generated based on geodatabase domain, or composite relationship information. Has a similar set of capabilities for checking attribute validity, referential integrity and spatial relationships: Single Column Attribute Tests: unique values, coded domains, coded ranges, single values, null values, non-standard values, column format, column length Multiple Column Attribute Tests: composite column values, custom SQL query Referential Integrity Tests: general table relationship Spatial Relationship Tests: distance Limitations: ArcGIS geodatabase backed by Oracle and SQLServer The tool is modular in approach allowing the release of Runtime, Professional and a Community Edition and a Test Inventory of testing plug-ins. 2.4 ESRI s GIS Data ReViewer The GIS Data ReViewer is a data quality control management application that simplifies many aspects of automated and visual spatial quality control tasks. Capabilities: Make and log corrections to data Verify corrections made to data Locate errors in data capture/attribution Log error information accurately Coordinate the data review effort - 4 -

Perform batch validation of a geodatabase Eliminate the paper trail associated with error files by storing data error information in a database. Liabilities: Limited for use with ArcInfo and requires ArcView and ArcEditor. 2.5 ESRI s ArcGIS 8.3 Geodatabase ESRI s ArcGIS 8.3 Geodatabase includes toponymy based validation checks. Capabilities: Declare and place limitations on how features to share geometry Arc-Node Topology Region Topology Polygon Topology Node Topology Route Topology Point Events Figure 1 - Sharing Geometry Create features from unstructured geometry using snapping and aggregation. Constraint Support: Relationships between features Validation rules Logical Networks All of these capabilities are expressed as an integrity rule set defining the topology. Topologies can extend across Feature Types with the same spatial reference system. Feature Types can only be included in one Topology or Geometric Network. - 5 -

Topology is defined in Geodatabase using: Cluster Tolerance: The behavior of snapping operations is controlled using Cluster Tolerance. As an example snapping points together that are 20 cm apart is acceptable if the data accuracy is 2 meters. Ranks: When merging shared points, edges or areas, a concept of Feature Type rank is used as a tiebreaker. Feature Types with the same rank are averaged. Rules: Rules define allowable relationships in a topology. Rules serve as the definition for topological integrity. Topology Rules Area boundary must be covered by boundary of Boundary must be covered by Contains Point Endpoint must be covered by Must be covered by Must be covered by boundary of Must be covered by endpoints of Must be covered by feature class of Must be properly inside polygons Must be single part Must cover each other Must not have dangles Must not have gaps Must not have Pseudo-nodes Must not intersect Must not intersect or touch interior Must not overlap Must not overlap with Must not self intersect Must not self overlap Point must be covered by Domain Polygon/Polygon Polygon/ Polygon/Point Polygon/Polygon /Polygon, Point/Polygon Point/ /, Polygon/Polygon Point/Polygon Polygon/Polygon Polygon, Polygon /, Polygon/Polygon Point/ Geodatabase takes a user centered approach to toponymy, focusing on the editing process rather than data consistency. Rule violations are flagged for later correction. - 6 -

2.6 ArcFM Energy A class of validation tools exists that are targeted at specific markets. A representative example of this is ArcFM Energy extension for ArcInfo that is customized for use by the Energy sector. Capabilities: Database schema definitions range and domain value for attributes Default values for attributes Connectivity checking Support for split and merge operations Checking of Specific relationship: high voltage line can only connect to a low voltage line through a transformer Network Completeness: for an electric network or gas distribution network Liabilities Not a general solution An to ArcFM / ArcCatalog Unclear if transaction based validity checks are supported Similar products exist for meeting defense GIS standards. 2.7 Production Tool Set GIS Data Reviewer This ESRI tool supports manual and batch data review of GIS data. The tool maintains a Valid Value Tables and Condition Tables in the database on a per feature class basis. This configuration information is then maintained under the same transaction controls as the data itself. The tool performs a number of topology checks (closed polygons, overlap, duplicate vertices) and spatial analysis checks (size, duplicate points w/ tolerance, overlaps between feature types). The focus is on flagging failed data for manual review, to this end the tool can be used interactively by AQ Staff and in batch mode. The calculation of summaries on a random sample of data is also supported. 2.8 ENC Analyzer www.sevencs.com ENC Analyzer takes a very specific domain, hydrographic charts, and is able to focus on providing a high value tool. Provides support for attribute validation. Generates warnings for suspect data - 7 -

This tool serves as finally tuned example of what is possible for a specific domain. 2.9 Geostatistical Analyst (ArcGIS 8.1 Extension) The Geostatistical Analyst extension offers the following capabilities: Spatial Regression establish relationships between layers Spatial Interpolation reconstructing data Smoothing - looking for patterns in complete data Grouping - classification Prediction Of specific interest to us is using these techniques for prediction we can compare incoming transaction to our predicted expectations and provide warnings (or even errors) when outliers are encountered. 3 REFERENCES Automated QA, http://www.farragut.com/automate.htm Dog Creek QC Pro, GeoData Sentry http://www.laurelhillgis.com/dogcreek.htm ESRI ArcNews Reprint, Vol. 24 No. 2, Summer 2002 http://www.esri.com/library/reprints/pdfs/arcnews_arcgis83-brings.pdf Working With Geodatabase Topology, http://www.esri.com/library/whitepapers/pdfs/geodatabase-topology.pdf ArcGIS 8.3 Geodatabase Topology Rules, http://www.islem.com.tr/haberpic/pdf/topoposter2_92363.pdf GIS Data ReViewer, http://www.esri.com/software/reviewer ArcFM Energy http://www.esri-portugal.pt/mercados/documents/arcfmenergy.pdf - 8 -