GeoMatchMaker: Automatic and Efficient Matching of Vector Data with Spatial Attributes in Unknown Geometry Systems
|
|
|
- Asher Bryant
- 10 years ago
- Views:
Transcription
1 GeoMatchMaker: Automatic and Efficient Matching of Vector Data with Spatial Attributes in Unknown Geometry Systems Mohammad Kolahdouzan, Ching-Chien Chen, Cyrus Shahabi, Craig A. Knoblock ζ Department of Computer Science, University of Southern California ζ Information Sciences Institute, University of Southern California Los Angeles, CA, Abstract Large amount of geospatial data are now available from public and private organizations in vector data formats. Users of these geospatial data usually require the data that are gathered from different sources to be integrated and fused for knowledge discovery. A vital step for fusion of the geospatial datasets is to identify the matched features among the datasets. There have been several efforts to automatically or semi-automatically detect matched features across different vector datasets. These solutions usually require the features to be in the same coordinate system so their spatial attributes can be compared. This renders these solutions impractical for the scenarios where the coordinate systems of the datasets are unknown. In this paper we propose several approaches that are based on utilizing the intersections of the lines as features, to efficiently and accurately detect the matched features across line vector datasets. We first discuss PPM, a brute-force approach to find the transformation of the intersections from one dataset to another. We then briefly discuss Geo-PPM, an improvement over PPM that utilizes some network properties to prune the search space. Finally, we discuss prioritized Geo-PPM where we can further improve the performance of Geo-PPM by first examining the features that have a higher possibility of locating the matching pattern. Our experiments show that prioritized Geo- PPM provides a substantial improvement over Geo-PPM and hence, renders Geo-PPM practical for networks with large sizes. 1 Introduction With the rapid improvement of geospatial data collection techniques, the growth of Internet and the implementation of Open GIS, a large amount of geospatial data are now readily available from public and private organizations as well as on the Internet. These datasets usually cover different areas, have different accuracy and levels of details, and are usually provided in the vector or raster data formats. Some examples of well-known vector datasets are US Census TIGER/Line files 1 (covering most roads in the United States), NAVSTREETS from NAVTEQ 2 and DLG data from USGS 3. The National Map, 4 ESRI s Geography Network, 5 Yahoo Map
2 Service, 6 and Microsoft TerraService 7 [1] are examples of repositories of raster data in form of map or satellite imagery. The users of these data products often need these geospatial data and other related data to be displayed in some integrated fashion for knowledge discovery. That is, the data must be fused to provide additional inferences that cannot be gathered from any single information source. Geospatial data fusion has been one of the central issues in GIS [3, 12]. Geospatial data fusion requires that the system integrates various datasets, and then creates a single composite dataset from the integrated elements. Towards geospatial data fusion, a vital step is identifying the matched features among multiple datasets. There have been a number of efforts to automatically or semi-automatically detect matched features across different road vector datasets [4, 7, 11, 13, 14, 15, 16]. Given a feature point from one dataset, these approaches utilize different matching strategies to discover the corresponding point within a predetermined distance (i.e., a localized area). This implies that these approaches can only handle the matching of vector datasets that are in the same coordinate system. There are also solutions that utilize non-spatial attributes of the points (e.g., point s name) to compare and match vector dataset. However, there are scenarios in which the spatial attributes of the objects are intentionally transformed to a different, and usually unknown, (alien) system, or the datasets do not contain any common non-spatial attributes. An example of this scenario is when the spatial attributes of one dataset are intentionally transformed to an unknown system for security reasons. This renders the available solutions impractical for these scenarios. In this paper, we propose several approaches to automatically and efficiently handle the matching of diverse vector datasets, independent of their coordinate system. The basic idea of our approach is to find the transformation T between the layout (with relative distances) of the feature point set of one vector data and the feature point set of the other dataset. This transformation achieves global alignment between two feature point sets by locating the common point pattern among them. In particular, we utilize road intersections as the feature points to be matched. Road intersections are good candidates for being matched since road intersections are salient points to capture the major feature of the road network. In addition, various GIS researchers have shown that the intersection points on the road networks are good candidates to be identified as an accurate set of matched points [4, 5, 8, 9]. We first discuss PPM, a brute-force method that can find the matching point (intersection) pattern by examining every pair of points in the first dataset S 1 against every pair of points in the second dataset S 2. Although accurate and resistance against noise, PPM is computationally complex and hence suffers from very poor performance. We then briefly discuss Geo-PPM that utilizes some network properties of the features to improve the performance of PPM. The intuition for Geo-PPM is to eliminate the examination of the point pairs that cannot possibly be a candidate for correctly locating the matching pattern. Geo-PPM cannot scale to very larger datasets, especially when the features of the datasets follow a pattern (e.g., a grid pattern in a Manhattan network). This is because in the presence of a pattern, there will be numerous features
3 in S 2 that have the same network properties as a feature in S 1. Hence, Geo-PPM cannot effectively prune the search space for a large number of features. Towards this end, we propose prioritized Geo-PPM, in which we first examine the features in S 1 that have a higher possibility of locating the matching pattern sooner. These features are usually the outliers in the network (e.g., an intersection with the maximum number of intersecting streets, or an intersection with an outlier angle) and their matching features in S 2 are also usually outliers. We then propose several outlier properties that can be utilized in prioritized Geo-PPM. Our experiments show that prioritized Geo-PPM can provide a substantial improvement over Geo-PPM for large networks when it utilizes the minimum number of candidates with the same point angle property. The remainder of this paper is organized as follows. We review the related work on conflation of vector data in Section 2. We then provide an overview of a brute-force algorithm to find the matching between two point patterns as well as Geo-PPM and prioritized Geo-PPM in Section 3. The results of our experiments that compare the performance of our proposed approaches are discussed in Section 4. Finally, we conclude the paper in Section 5. 2 Related Work The idea of vector to vector conflation was first proposed and implemented in 1988 by Saalfeld [11], and the initial focus of conflation was to eliminate the spatial inconsistency between two overlapping vector datasets in order to improve the spatial accuracy of vector datasets. Once the spatial discrepancy is eliminated, it is possible and easier to transfer attributes among datasets to achieve geospatial data fusion. Several important application domains that can benefit from such data fusion are the crisis management, city traffic planning, and military intelligence applications. Towards vector to vector conflation, a vital step is identifying the matched features among multiple datasets. There have been a number of efforts to automatically or semiautomatically detect matched features across different road vector datasets [7, 11, 13, 14]. Walter and Fritsch [13] proposed a relational matching approach to find matched spatial objects based on the similarity of spatial objects at the geometry level (e.g., node to node matching based on distance) and based on the relations between the elements in a data set. They investigated the similarity of spatial objects based on statistical information derived from a random sample of the vector datasets to be integrated. However, their approach requires human intervention to perform an initial affine transformation between datasets. In addition to performing feature matching at the geometry level, Cobb et al. in [7] proposed an approach to perform featurematching at the object level. For example, when comparing two road segments, their approach not only matches the road endpoints, but also matches the non-spatial properties such as street names and widths. Ware et al. [14] presented a technique for matching and aligning vector features in pairs of multi-date coverages. The main feature dealt with in their work is road segments. Their approach also supports the additions/deletions of road nodes in order to deform the matched road segments. There are also several commercial products such as MapMerger [15] and Conflex [16] that can support automatic vector to vector matching with limited human intervention to consolidate multiple vector datasets.
4 The existing algorithms and products can only handle the matching of vector datasets that are in the same geometry system (i.e., the same coordinate system), or require non-spatial attributes (e.g., objects names) to match objects of two datasets. To the best of our knowledge, there is no solution to address the same problem when the spatial attributes of the datasets are in unknown geometry systems and hence, their spatial relations with each other cannot be compared easily. 3 GeoMatchMaker: Finding the Matched Features In this section, we first describe the basic idea to identify two point patterns from two vector datasets and describe a brute-force algorithm, ppm, to match the point patterns and briefly discuss geo-ppm, which utilizes a number of auxiliary geospatial information to improve the performance of ppm. We then propose prioritized geo-ppm to further improve the performance of geo-ppm in order to address large size vector datasets. 3.1 PPM: Point Pattern Matching Let U= {u i u i = (x i, y i )} a set of coordinates, where (x i, y i ) are the locations of the intersections of the first vector dataset, and V= {v i v i = (m i, n i )}, where (m i, n i ) are the locations of the intersections of the second vector dataset. Our objective is to find the set: Rel Pat ={(u i,v i )} where u i is an intersection in the first vector dataset and v i is the corresponding intersection (if any) in the second vector dataset. That is, point u i and v i are formed by the same intersected vector segments. Consider identifying matched points between two road networks. If the system can recognize the names of road segments that meet at the intersections, it can use these road names to infer the set Rel Pat. However, road vector data may not comprise the non-spatial attribute, road name. Instead, our approach relies on some prominent geometric information of the road segments and their intersections to identify the matching point patterns. Hence, the problem of point pattern matching is to find the transformation T between the layout (with relative distances) of the point set U and V. The key computation of matching the two sets of points is calculating a proper transformation matrix T, which is a 2D rigid motion (rotation and translation) with scaling. Because the majority of vector datasets are oriented such that north is up, we only compute the translation transformation with scaling. Without loss of generality, we consider how to compute the transformation where we map from a fraction α of the points of U to the points of V. There are two reasons that only a fraction α of the points of U is considered: 1) One vector dataset may contain data that are not included in the second dataset, and 2) one vector dataset could be represented in detailed level while the other one is represented abstractly. The transformation T brings at least a fraction α of the points of U into a subset of V. This implies: T and U U, such that T(U ) V, where U α U and T(U ) denotes the set of the points that results from applying T to the points of U. Or equivalently, for a 2D point (x, y) in Sx 0 0 the point set U U, T in the matrix form 0 Sy 0, where S x and S y are scale factors along Tx Ty 1 x and y direction and T x and T y are translation factors along x and y directions, respectively, such that
5 Sx 0 0 [x, y, 1] * 0 Sy 0 = [m, n, 1], where U α U and the 2D point (m, n) belongs to the Tx Ty 1 intersection point set V on the second vector dataset. With this setting, we do not expect point coordinates to match exactly because of finite-precision computation or small errors in the datasets. Therefore, when checking whether a 2D point p belongs to the point set V, we declare that p V if there exists a point in V that is within the Euclidean distance δ of p for a small fixed positive threshold value δ, which controls the degree of inaccuracy. The minimum δ such that there is a match for U in V is called Hausdorff distance. Different computations of the minimum Hausdorff distance have been studied in great depth in the computational geometry literature [6]. We do not seek to minimize the value of δ, rather, we intend to adopt an acceptable threshold value for δ. The threshold is relatively small compared to the average inter-point distances in V. In fact, this sort of problem was categorized as Nearly Exact point matching problem in [2]. Given the parameters α and δ, to obtain a proper transformation T, we need to compute the values of the four unknown parameters S x, S y, T x and T y. This implies that at least four different equations are required. A straightforward (brute-force) method is first choosing a point pair (x 1, y 1 ) and (x 2, y 2 ) from U, then, for every pair of distinct points (m 1, n 1 ) and (m 2, n 2 ) in V, the transformation T that maps the point pair on U to the point pair on V is computed by solving the following four equations: S x * x 1 + T x = m 1 S y * y 1 + T y = n 1 S x * x 2 + T x = m 2 S y * y 2 + T y = n 2 Each generated transformation T is thus applied to the entire points in U to check whether there are more than α U points that can be aligned with some points on V within the threshold δ. This process is repeated for each possible point pair from U, which implies that it could require examining O( U 2 ) pairs in the worst case. On the other hand, for each such pair, the algorithm needs to try all possible point pairs on V (i.e., O( V 2 )) and the time required to examine each generated transformation T is O( U log V ). This implies that the worst case running time of this approach has complexity of O( U 3 V 2 log V ). The advantage of ppm is that we can find a mapping (if the mapping exists) with a proper threshold δ even in the presence of very noisy data. However, it suffers from high computation time. One way to improve the efficiency of the algorithm is to utilize randomization in choosing the pair of points from U as proposed in [10], thus achieving the running time of O( V 2 U log V ). However, this approach is not appropriate for all vector datasets since it is possible that one vector dataset is represented in detailed level while the other vector dataset is represented abstractly. Instead, we developed some efficient techniques discussed in the next sections to prune the search space of possible point pattern matches by reducing the numbers of potential matching point pairs needed to be examined. In more recent work, we extended the PPM algorithm by exploiting additional information about the geographic sources to improve the efficiency of the algorithm. The resulting algorithm is called Geo-PPM and the details of this algorithm are described in [17].
6 3.2 Prioritized Geo-PPM The intuition behind prioritized Geo-PPM is that in real world datasets, it is very likely that the matching feature for an outlier feature in one dataset is also an outlier in the other dataset. This means that the number of possible matching features for an outlier feature is most likely very small. Hence, if we can locate the outlier features in the first dataset and then start the Geo-PPM by first examining these features, we can most likely find the matching features faster as compared to when we execute Geo-PPM for a randomly selected feature. We further describe prioritized Geo-PPM using Figure 1. As shown in the figure, the majority of the lines in the blue dataset follow a grid pattern while lines L 1, L 2, L 3 and L 4 (shown with thick lines) are the outliers in this dataset. This means that the intersections of these lines are also outliers (i.e., have uncommon angles). As shown in the figure, the lines in the red dataset that have similar angles as the angles of (L 1, L 2, L 3, L 4 ) are also outliers in the red dataset. Hence, it is likely that we can find the matching line for L 1 (or in other words, the matching intersections for the intersections of L 1 ) faster than the matching line for L 5. This is because there are numerous lines in the red dataset that follow the grid pattern and have the same angles as L 5. Figure 1: Example of outliers in 2 road network datasets We can utilize the properties that are used in Geo-PPM to specify the outliers of a dataset. Some examples of the outlier features we can infer from these properties are: 1. Angles between the points: directly connected intersections that have outlier angles (e.g., intersections of the lines L 1 to L 4 in Figure 1). 2. Angles of the points: intersections that have the minimum number of candidates (i.e., number of candidate intersections in the second dataset that have the same angles as these intersections is minimum). 3. Point connectivity: intersections that have the maximum number of connectivity. 4. Distances between the points: directly connected intersections with the maximum distance (i.e., longest road in the network) Our experiments showed that intersections that have the minimum number of candidates (i.e., the second item above) can provide a better improvement in the performance of Geo-PPM as compared to the other outliers.
7 4 Experiments We performed several experiments with real world datasets to examine and compare the performance of Geo-PPM and prioritized Geo-PPM. For our experiments, we used three datasets obtained from USGS, NGA and US Census, covering the streets in the area of ( , 37.78) to ( , ). The 3 datasets are shown in Figure 2, where the green lines (i.e., most accurate data) represent the streets obtained from USGS, the red lines are from NGA, and the blue lines represent the streets obtained from US Census. As shown in the figure, while the data from USGS and NGA have almost similar granularity, the US Census data is represented in abstract level. Moreover, in order to examine the scalability of our approaches, we filtered the south west quarter of the datasets to generate 3 datasets with smaller sizes. Finally, we only used the combination of connectivity of the points and the angles of the points properties in Geo- PPM. Figure 2: Coverage area of the test data for the second set of the experiments Table 1 shows the results of the second set of our experiments. The first column of the table shows the combination of the datasets that were used and the second column indicates the number of intersections in each combination. As shown in the table, the average number of the candidates for each intersection (i.e., the number of intersections in the second dataset with the same connectivity and angles as compared to that of an intersection in the first dataset) varies between 371 and 637. This shows that the possibility of selecting 2 pairs from the candidate pool which are exactly matched to 2 intersections selected from the first dataset is very low, meaning that the random selection of points in Geo-PPM will result in a very large number of possibilities and hence, to a very large processing time. For example, for the USGS+NGA combination (with larger area), the possibility that randomly selected pair of points from the pool of candidates is exactly matched to the pair of points selected from the first dataset is =. That is, in worst case, to find the right matching pairs, up to 405,769 combinations (on average) must be examined. However, as shown in the table, with Geo-PPM, we can achieve an acceptable level of precision and recall by examining between 33 and 52 candidate pairs, an improvement of up to 4 orders of magnitude over Geo-PPM.
8 Datasets USGS+NGA (large area) USGS+US Census USGA+NGA (small area) Number of Intersections Average Number of Candidates for each point Point Pairs Examined by prioritized Geo-PPM Processing Time of prioritized Geo-PPM Precision Recall sec. 91% 92.5% sec. 82% 96.5% sec. 95.8% 95.8% Table 1: Performance of prioritized Geo-PPM 5 Conclusion In this paper, we discussed PPM, a brute-force approach to locate the matched points between two line vector datasets. We then discussed Geo-PPM in which some network information are utilized to reduce the computational complexity of PPM. In addition, Geo-PPM is independent of the spatial attributes of the datasets, rather, it only depends on the objects relative locations in the space. Hence, it can be applied to the vector datasets that are in unknown geometry systems. Moreover, we proposed prioritized Geo-PPM, in which we start by examining the pairs of points that have the minimum number of candidates. Our experiments showed that prioritized Geo- PPM can achieve an acceptable level of precision and recall by examining only a very small number of pairs of points and hence, render Geo-PPM applicable even for large datasets. Acknowledgements This research is based upon work supported in part by the National Science Foundation under Award No. IIS , IIS (PECASE), in part by the Defense Advanced Research Projects Agency (DARPA) and Air Force Research Laboratory, Air Force Materiel Command, USAF, under agreement number F , and in part by the Air Force Office of Scientific Research under grant number FA The U.S. Government is authorized to reproduce and distribute reports for Governmental purposes notwithstanding any copyright annotation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of any of the above organizations or any person connected with them. References [1] T. Barclay, J. Gray, and D. Stuz. "Microsoft TerraServer: A Spatial Data Warehouse", In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, TX, ACM Press, May 14-19, 2000, pp [2] D.E. Cardoze and L.J. Schulman. "Pattern Matching for Spatial Point Sets", In Proceedings of the IEEE Symposium on Foundations of Computer Science, 1998, pp
9 [3] C.-C. Chen. Automatically and Accurately Conflating Road Vector Data, Street Maps and Orthoimagery, Ph.D. Dissertation, Computer Science Department, University of Southern California, [4] C.-C. Chen, C. Shahabi, and C.A. Knoblock. "Utilizing Road Network Data for Automatic Identification of Road Intersections from High Resolution Color Orthoimagery", In Proceedings of the Second Workshop on Spatio-Temporal Database Management(STDBM'04), colocated with VLDB, Toronto, Canada, August 30, 2004, 2004, pp [5] C.-C. Chen, S. Thakkar, C.A. Knoblok, and C. Shahabi. "Automatically Annotating and Integrating Spatial Datasets", In Proceedings of the 8th International Symposium on Spatial and Temporal Databases (SSTD'03), Santorini Island, Greece, July 24-27, 2003, pp [6] L.P. Chew, M.T. Goodrich, D.P. Huttenlocher, K. Kedem, J.M. Kleinberg, and D. Kravets. "Geometric pattern matching under Euclidean motion", In Proceedings of the Fifth Canadian Conference on Computational Geometry, 1993, pp [7] M. Cobb, M.J. Chung, V. Miller, H.I. Foley, F.E. Petry, and K.B. Shaw. "A Rule-Based Approach for the Conflation of Attributed Vector Data", GeoInformatica, Vol. 2(1): 7-35, [8] M. Flavie, A. Fortier, D. Ziou, C. Armenakis, and S. Wang. "Automated Updating of Road Information from Aerial Images", In Proceedings of the American Society Photogrammetry and Remote Sensing Conference, Amsterdam, Holland, July 16-23, [9] A. Habib, Uebbing, R., Asmamaw, A. "Automatic Extraction of Primitives for Conflation of Raster Maps", Technical Report, The Center for Mapping, The Ohio State University, [10] S. Irani and P. Raghavan. "Combinatorial and experimental results for randomized point matching algorithms", Computational Geometry, Vol. 12(1-2): 17-31, [11] A. Saalfeld. "Conflation: Automated Map Compilation", International Journal of Geographic Information Sciences, Vol. 2(3): , [12] E.L. Usery, M.P. Finn, and M. Starbuck. "Data Integration of Layers and Features for The National Map", In Proceedings of American Congress on Surveying and Mapping, Phoenix, AZ, March 31- April 2, [13] V. Walter and D. Fritsch. "Matching Spatial Data Sets: a Statistical Approach", International Journal of Geographic Information Sciences, Vol. 13(5): , [14] J.M. Ware and C.B. Jones. "Matching and Aligning Features in Overlayed Coverages", In Proceedings of the 6th ACM International Symposium on Advances in Geographic Information Systems (ACM-GIS'98), Washington, D.C, ACM Press, November 6-7, 1998, pp [15]
10 [16] [17] Ching-Chien Chen, Automatically and Accurately Conflating Road Vector Data, Street Maps and Orthoimagery. Ph. D. Dissertation, University of Southern California, May 2005
Geospatial Data Integration
Geospatial Data Integration 1. Research Team Project Leader: Other Faculty: Post Doc(s): Graduate Students: Industrial Partner(s): Prof. Craig Knoblock, Computer Science Prof. Cyrus Shahabi, Computer Science
Integrating Data from Maps on the World-Wide Web
Integrating Data from Maps on the World-Wide Web Eliyahu Safra 1, Yaron Kanza 2, Yehoshua Sagiv 3, and Yerach Doytsher 1 1 Department of Transportation and Geo-Information, Technion, Haifa, Israel {safra,
Dynamically Composing Web Services from On-line Sources
Dynamically Composing Web Services from On-line Sources Snehal Thakkar, Craig A. Knoblock, Jose Luis Ambite, Cyrus Shahabi University of Southern California Information Sciences Institute 4676 Admiralty
Application of Adaptive Probing for Fault Diagnosis in Computer Networks 1
Application of Adaptive Probing for Fault Diagnosis in Computer Networks 1 Maitreya Natu Dept. of Computer and Information Sciences University of Delaware, Newark, DE, USA, 19716 Email: [email protected]
Institute of Natural Resources Departament of General Geology and Land use planning Work with a MAPS
Institute of Natural Resources Departament of General Geology and Land use planning Work with a MAPS Lecturers: Berchuk V.Y. Gutareva N.Y. Contents: 1. Qgis; 2. General information; 3. Qgis desktop; 4.
Introducing diversity among the models of multi-label classification ensemble
Introducing diversity among the models of multi-label classification ensemble Lena Chekina, Lior Rokach and Bracha Shapira Ben-Gurion University of the Negev Dept. of Information Systems Engineering and
Oracle8i Spatial: Experiences with Extensible Databases
Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Face detection is a process of localizing and extracting the face region from the
Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.
A View Integration Approach to Dynamic Composition of Web Services
A View Integration Approach to Dynamic Composition of Web Services Snehal Thakkar University of Southern California/ Information Sciences Institute 4676 Admiralty Way, Marina Del Rey, California 90292
APLS 2011. GIS Data: Classification, Potential Misuse, and Practical Limitations
APLS 2011 GIS Data: Classification, Potential Misuse, and Practical Limitations GIS Data: Classification, Potential Misuse, and Practical Limitations Goals & Objectives Develop an easy to use geospatial
Simultaneous Gamma Correction and Registration in the Frequency Domain
Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong [email protected] William Bishop [email protected] Department of Electrical and Computer Engineering University
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS Kyoungjin Park Alper Yilmaz Photogrammetric and Computer Vision Lab Ohio State University [email protected] [email protected] ABSTRACT Depending
ADVANTAGES AND DISADVANTAGES OF THE HOUGH TRANSFORMATION IN THE FRAME OF AUTOMATED BUILDING EXTRACTION
ADVANTAGES AND DISADVANTAGES OF THE HOUGH TRANSFORMATION IN THE FRAME OF AUTOMATED BUILDING EXTRACTION G. Vozikis a,*, J.Jansa b a GEOMET Ltd., Faneromenis 4 & Agamemnonos 11, GR - 15561 Holargos, GREECE
A User-Friendly Data Mining System. J. Raul Ramirez, Ph.D. The Ohio State University Center for Mapping [email protected]
A User-Friendly Data Mining System J. Raul Ramirez, Ph.D. The Ohio State University Center for Mapping [email protected] 1. Introduction Image acquisition of the Earth's surface has become a common
Visualize your World. Democratization i of Geographic Data
Visualize your World Democratization i of Geographic Data Session Agenda Google GEO Solutions - More than just a Map Enabling our Government Customers- Examples Summary & Invite to Learn More About Google
Mapping Linear Networks Based on Cellular Phone Tracking
Ronen RYBOWSKI, Aaron BELLER and Yerach DOYTSHER, Israel Key words: Cellular Phones, Cellular Network, Linear Networks, Mapping. ABSTRACT The paper investigates the ability of accurately mapping linear
Google Earth Digitale Wege in eine bekannte Welt Sep. 2006
Google Earth Digitale Wege in eine bekannte Welt Sep. 2006 Joachim Glaubrecht Google Enterprise [email protected] What is Google Enterprise? 2 1 Focus.de: "Der Papst-Besuch in Google Earth" 3 How to Geo
Geocoding in Law Enforcement Final Report
Geocoding in Law Enforcement Final Report Geocoding in Law Enforcement Final Report Prepared by: The Crime Mapping Laboratory Police Foundation August 2000 Report to the Office of Community Oriented Policing
CHAPTER-24 Mining Spatial Databases
CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES
ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES B. Sirmacek, R. Lindenbergh Delft University of Technology, Department of Geoscience and Remote Sensing, Stevinweg
Object Fusion in Geographic Information Systems
Object Fusion in Geographic Information Systems Catriel Beeri Yaron Kanza Eliyahu Safra Yehoshua Sagiv The Selim and Rachel Benin School of Engineering and Computer Science The Hebrew University Jerusalem,
ENVI THE PREMIER SOFTWARE FOR EXTRACTING INFORMATION FROM GEOSPATIAL IMAGERY.
ENVI THE PREMIER SOFTWARE FOR EXTRACTING INFORMATION FROM GEOSPATIAL IMAGERY. ENVI Imagery Becomes Knowledge ENVI software uses proven scientific methods and automated processes to help you turn geospatial
Introduction to Imagery and Raster Data in ArcGIS
Esri International User Conference San Diego, California Technical Workshops July 25, 2012 Introduction to Imagery and Raster Data in ArcGIS Simon Woo slides Cody Benkelman - demos Overview of Presentation
Categorical Data Visualization and Clustering Using Subjective Factors
Categorical Data Visualization and Clustering Using Subjective Factors Chia-Hui Chang and Zhi-Kai Ding Department of Computer Science and Information Engineering, National Central University, Chung-Li,
Implementing Geospatial Data in Parametric Environment Elçin ERTUĞRUL*
Proceedings of the International Association for Shell and Spatial Structures (IASS) Symposium 2015, Amsterdam 17-20 August 2015, Amsterdam, The Netherlands Implementing Geospatial Data in Parametric Environment
Classifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
DATA QUALITY IN GIS TERMINOLGY GIS11
DATA QUALITY IN GIS When using a GIS to analyse spatial data, there is sometimes a tendency to assume that all data, both locational and attribute, are completely accurate. This of course is never the
Neovision2 Performance Evaluation Protocol
Neovision2 Performance Evaluation Protocol Version 3.0 4/16/2012 Public Release Prepared by Rajmadhan Ekambaram [email protected] Dmitry Goldgof, Ph.D. [email protected] Rangachar Kasturi, Ph.D.
COMPUTING CLOUD MOTION USING A CORRELATION RELAXATION ALGORITHM Improving Estimation by Exploiting Problem Knowledge Q. X. WU
COMPUTING CLOUD MOTION USING A CORRELATION RELAXATION ALGORITHM Improving Estimation by Exploiting Problem Knowledge Q. X. WU Image Processing Group, Landcare Research New Zealand P.O. Box 38491, Wellington
Geospatial exploitation Products. GXP WebView. Powered by the GXP Platform
Geospatial exploitation Products GXP WebView Powered by the GXP Platform Exploitation Capabilities GXP WebView Powered by the GXP Platform Detailed annotation (text, points, lines, and polygons) Distance,
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
MAP GENERALIZATION FOR OSMASTERMAP DATA IN LOCATION BASED SERVICES & MOBILE GIS APPLICATIONS
Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 MAP GENERALIZATION FOR OSMASTERMAP
A Short Introduction to Computer Graphics
A Short Introduction to Computer Graphics Frédo Durand MIT Laboratory for Computer Science 1 Introduction Chapter I: Basics Although computer graphics is a vast field that encompasses almost any graphical
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
Data Validation Online References
Data Validation Online References Submitted To: Program Manager GeoConnections Victoria, BC, Canada Submitted By: Jody Garnett Brent Owens Refractions Research Inc. Suite 400, 1207 Douglas Street Victoria,
Files Used in this Tutorial
Generate Point Clouds Tutorial This tutorial shows how to generate point clouds from IKONOS satellite stereo imagery. You will view the point clouds in the ENVI LiDAR Viewer. The estimated time to complete
SPATIAL ANALYSIS IN GEOGRAPHICAL INFORMATION SYSTEMS. A DATA MODEL ORffiNTED APPROACH
POSTER SESSIONS 247 SPATIAL ANALYSIS IN GEOGRAPHICAL INFORMATION SYSTEMS. A DATA MODEL ORffiNTED APPROACH Kirsi Artimo Helsinki University of Technology Department of Surveying Otakaari 1.02150 Espoo,
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY
PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY V. Knyaz a, *, Yu. Visilter, S. Zheltov a State Research Institute for Aviation System (GosNIIAS), 7, Victorenko str., Moscow, Russia
PARTIAL FINGERPRINT REGISTRATION FOR FORENSICS USING MINUTIAE-GENERATED ORIENTATION FIELDS
PARTIAL FINGERPRINT REGISTRATION FOR FORENSICS USING MINUTIAE-GENERATED ORIENTATION FIELDS Ram P. Krish 1, Julian Fierrez 1, Daniel Ramos 1, Javier Ortega-Garcia 1, Josef Bigun 2 1 Biometric Recognition
Context-Aware Online Traffic Prediction
Context-Aware Online Traffic Prediction Jie Xu, Dingxiong Deng, Ugur Demiryurek, Cyrus Shahabi, Mihaela van der Schaar University of California, Los Angeles University of Southern California J. Xu, D.
WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS
WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS Biswajit Biswal Oracle Corporation [email protected] ABSTRACT With the World Wide Web (www) s ubiquity increase and the rapid development
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS Aswin C Sankaranayanan, Qinfen Zheng, Rama Chellappa University of Maryland College Park, MD - 277 {aswch, qinfen, rama}@cfar.umd.edu Volkan Cevher, James
Buffer Operations in GIS
Buffer Operations in GIS Nagapramod Mandagere, Graduate Student, University of Minnesota [email protected] SYNONYMS GIS Buffers, Buffering Operations DEFINITION A buffer is a region of memory used to
Reading Questions. Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented?
Reading Questions Week two Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented? 2. What sort of problems are GIS designed to solve?
A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1
A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 Yannis Stavrakas Vassilis Plachouras IMIS / RC ATHENA Athens, Greece {yannis, vplachouras}@imis.athena-innovation.gr Abstract.
ArcGIS Data Models Practical Templates for Implementing GIS Projects
ArcGIS Data Models Practical Templates for Implementing GIS Projects GIS Database Design According to C.J. Date (1995), database design deals with the logical representation of data in a database. The
WATER BODY EXTRACTION FROM MULTI SPECTRAL IMAGE BY SPECTRAL PATTERN ANALYSIS
WATER BODY EXTRACTION FROM MULTI SPECTRAL IMAGE BY SPECTRAL PATTERN ANALYSIS Nguyen Dinh Duong Department of Environmental Information Study and Analysis, Institute of Geography, 18 Hoang Quoc Viet Rd.,
Choices, choices, choices... Which sequence database? Which modifications? What mass tolerance?
Optimization 1 Choices, choices, choices... Which sequence database? Which modifications? What mass tolerance? Where to begin? 2 Sequence Databases Swiss-prot MSDB, NCBI nr dbest Species specific ORFS
GIS Data Conversion. GIS maps are digital not analog. Getting the Map into the Computer
Getting the Map into the Computer Getting the Map into the Computer Getting Started with Geographic Information Systems Chapter 4 4.1 Analog-to-Digital Maps 4.2 Finding Existing Map Data 4.3 Digitizing
High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound
High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound Ralf Bruder 1, Florian Griese 2, Floris Ernst 1, Achim Schweikard
ArcGIS. Server. A Complete and Integrated Server GIS
ArcGIS Server A Complete and Integrated Server GIS ArcGIS Server A Complete and Integrated Server GIS ArcGIS Server enables you to distribute maps, models, and tools to others within your organization
Big Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
Rule based Classification of BSE Stock Data with Data Mining
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 4, Number 1 (2012), pp. 1-9 International Research Publication House http://www.irphouse.com Rule based Classification
Removing Moving Objects from Point Cloud Scenes
1 Removing Moving Objects from Point Cloud Scenes Krystof Litomisky [email protected] Abstract. Three-dimensional simultaneous localization and mapping is a topic of significant interest in the research
The STC for Event Analysis: Scalability Issues
The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.
A View Integration Approach to Dynamic Composition of Web Services
A View Integration Approach to Dynamic Composition of Web Services Snehal Thakkar, Craig A. Knoblock, and José Luis Ambite University of Southern California/ Information Sciences Institute 4676 Admiralty
A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION
A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION Zeshen Wang ESRI 380 NewYork Street Redlands CA 92373 [email protected] ABSTRACT Automated area aggregation, which is widely needed for mapping both natural
Using PCI Geomatics Software with Oracle 10g Spatial and GeoRaster A PCI Geomatics Whitepaper
Using PCI Geomatics Software with Oracle 10g Spatial and GeoRaster A PCI Geomatics Whitepaper November 2005 Table of Contents Table of Contents...1 1. Introduction...2 2. How Geomatica adds value to Oracle
An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network
Proceedings of the 8th WSEAS Int. Conf. on ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING & DATA BASES (AIKED '9) ISSN: 179-519 435 ISBN: 978-96-474-51-2 An Energy-Based Vehicle Tracking System using Principal
QOS Based Web Service Ranking Using Fuzzy C-means Clusters
Research Journal of Applied Sciences, Engineering and Technology 10(9): 1045-1050, 2015 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2015 Submitted: March 19, 2015 Accepted: April
Texas Develops Online Geospatial Data Repository to Support Emergency Management
Bing Maps for Enterprise Microsoft Customer Solution Case Study Texas Develops Online Geospatial Data Repository to Support Emergency Management Overview Country or Region: United States Industry: Government
Blog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
GEOSPATIAL DIGITAL ASSET MANAGEMENT A SOLUTION INTEGRATING IMAGERY AND GIS WHERE WILL ALL THE PIXELS GO?(AND HOW WILL WE EVER FIND THEM?
GEOSPATIAL DIGITAL ASSET MANAGEMENT A SOLUTION INTEGRATING IMAGERY AND GIS WHERE WILL ALL THE PIXELS GO?(AND HOW WILL WE EVER FIND THEM?) Dr. Joan Lurie, GCC, Inc. 30 West 61 st Street, Apt 9A New York,
ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES
ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES ABSTRACT Chris Gordon 1, Burcu Akinci 2, Frank Boukamp 3, and
Topology and Topological Rules Geometric properties that are maintained in spatial databases
Topology and Topological Rules Geometric properties that are maintained in spatial databases The definition of topology Topology is a term used around GIS that is sometimes confused with the term topography.
Context-aware taxi demand hotspots prediction
Int. J. Business Intelligence and Data Mining, Vol. 5, No. 1, 2010 3 Context-aware taxi demand hotspots prediction Han-wen Chang, Yu-chin Tai and Jane Yung-jen Hsu* Department of Computer Science and Information
Assessing the implementation of Rawalpindi s Guided Development Plan through GIS and Remote Sensing Muhammad Adeel
Assessing the implementation of Rawalpindi s Guided Development Plan through GIS and Remote Sensing Muhammad Adeel (Lecturer Muhammad Adeel, Institute of Geographical Information Systems, NUST, H-12 Islamabad,
Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior
Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior Kenji Yamashiro, Daisuke Deguchi, Tomokazu Takahashi,2, Ichiro Ide, Hiroshi Murase, Kazunori Higuchi 3,
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,
A Review of Anomaly Detection Techniques in Network Intrusion Detection System
A Review of Anomaly Detection Techniques in Network Intrusion Detection System Dr.D.V.S.S.Subrahmanyam Professor, Dept. of CSE, Sreyas Institute of Engineering & Technology, Hyderabad, India ABSTRACT:In
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
Understanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
Efficiently Managing Firewall Conflicting Policies
Efficiently Managing Firewall Conflicting Policies 1 K.Raghavendra swamy, 2 B.Prashant 1 Final M Tech Student, 2 Associate professor, Dept of Computer Science and Engineering 12, Eluru College of Engineeering
Principles and Practices of Data Integration
Data Integration Data integration is the process of combining data of different themes, content, scale or spatial extent, projections, acquisition methods, formats, schema, or even levels of uncertainty,
CS 534: Computer Vision 3D Model-based recognition
CS 534: Computer Vision 3D Model-based recognition Ahmed Elgammal Dept of Computer Science CS 534 3D Model-based Vision - 1 High Level Vision Object Recognition: What it means? Two main recognition tasks:!
HydroDesktop Overview
HydroDesktop Overview 1. Initial Objectives HydroDesktop (formerly referred to as HIS Desktop) is a new component of the HIS project intended to address the problem of how to obtain, organize and manage
