Abstract. Figure 1: The Taxonomy of Spatial and Spatiotemporal data warehouse

Size: px
Start display at page:

Download "Abstract. Figure 1: The Taxonomy of Spatial and Spatiotemporal data warehouse"

Transcription

1 CSCI 8715 Spatial databases Spatial Data warehouses A Survey Group 4: Nipun Garg Surabhi Mithal

2 Abstract The field of Spatial Data warehouses has been emerging since the past decade due to the need to analyze large volumes of spatial data. The data once stored in a spatial data warehouse has to be queried using spatial online analytical processing (SOLAP) systems. The research in field of spatial data warehouses has been on conceptual models, materialization of spatial indexes, aggregation operations and SOLAP. In this paper we give an overview of the core concepts in a spatial data warehouse and recent advancements in the field. 1. Introduction Spatial data warehouses aim at effective and efficient querying of spatial data. Spatial databases are suited for answering regular transactional queries where there is not a lot of historical component or aggregation. The class of queries that are needed to support the decision making process are difficult on spatial data bases. This gave a rise to the field of spatial data warehouses which is idea of combining the traditional data warehouses with spatial databases. Spatial data warehouses are based on the concepts of Data warehouses and additionally provide support to store, index, and aggregate and analyze spatial data [33]. A data warehouse consists of facts and dimensions modeled in a star or snowflake schema. A data cube is a lattice of cuboids which represents hierarchies. The data cube may have cells which are pre computed for efficient query processing. Common OLAP operations include slicing, dicing, Roll up and Roll down. These concepts are extended to spatial data in a spatial data warehouse. Integration of spatial data and data cubes make many interesting spatial aggregation queries possible. Figure 1: The Taxonomy of Spatial and Spatiotemporal data warehouse Figure 1 shows the relationship between spatial, spatiotemporal and a data warehouse.

3 The major characteristics of a spatial data warehouse include: Conceptual model: Star and snowflake schemas for spatial attributes. Spatially enable components: Spatial measures, spatial dimensions and spatial hierarchies. Spatial OLAP operations: Operations like Roll up, drill down extended to spatial predicates [36]. Efficient query processing: o Indices and materialized views o Joins and aggregations queries. Spatial data warehouses have been an active topic of research in the past decade. This is because of the popularity of spatial information such as maps created from the images/information received from satellites has increased tremendously. These data sets are huge and have to be analyzed efficiently to make best use of the information gathered. The research in this field mostly focuses on: Spatial multidimensional models: Conceptual models for efficient representation of spatial data warehouses. Materialized spatial indexes: Extension of spatial Indexes to store aggregated spatial data. Aggregate operations: Various aggregation operations over hierarchies. SOLAP: Client applications on spatial data warehouses [19]. Some examples of spatial data warehouse are the US census dataset, EOS archives [54], Microsoft Terra server [17] and Spatial Eye [55]. Spatial Data Acquisition & Consolidation (ETL) Addition of Temporal features (Optional) Spatial Data warehouse (Spatial Data Marts and Metdata) Spatial Cubes Formation Presentation thorugh SOLAP tools Figure 2 shows the high level view of the various phases in implementation of a spatial data warehouse

4 Spatial data warehouses have a wide variety of application domains like Logistics, Forecasting, security, detection of environment changes and health monitoring. These involve spatial data which has to be analyzed over various dimensions on multiple resolutions Contributions and Related work Related work on spatial data warehouses is in the form of: Journal papers and conference proceedings [1-45][48-52]: These are the basic resources for research in the field of spatial data warehousing and current trends. Books [46] which explain in details the basic concepts of spatial and temporal data warehouses. Reference book [47] explains the details of terms and terminologies of spatial databases. Online Encyclopedias like Wikipedia [53] are good source for basic level information and concepts. Spatial data warehouses and SOLAP is widely gaining importance due to the capabilities it provides. To our knowledge, there is no recent survey paper in literature on spatial data warehouses. There exists some literature on overview of general concepts of spatial data warehouses [19] [20] [48] but we could not find a recent detailed survey which covers all the important aspects of a spatial data warehouse. In [48] though some aspects of spatial and spatio temporal data warehouses are covered some important concepts like benchmarks and spatial OLAP tools are not discussed at length. There are many open research issues in spatial data warehouses and we have classified them and presented them in a consolidated manner. We have also analysed trends like spatiotemporal data warehouses and why they are gaining importance.

5 The below table Figure 2 shows the classification in terms of topics and research literature which exists for spatial data warehouses. Topic Subcategory Subcategory Papers Conceptual Models Spatial Multidimensional model [1],[4],[ 5],[12],[23] Requirements of a conceptual model for SDW [21] Mapping of Conceptual model to Physical schema [3] Spatio Temporal model [6],[14] Storage and Querying Indexing Materialized [7],[8],[10] Spatiotemporal [9],[43],[44] GIST [27] Selective Materialization Object-Based [11] Aggregation General Concepts and Issues Pre Aggregation [12],[28] Geometric Aggregation Model [30] Aggregation Operations [15],[22],[47] Spatial OLAP ( SOLAP) General Concepts [20],[30],[32] Tools for SOLAP [32],[34],[35],[36] Benchmark [25],[37],[38],[39], [40],[41] Extension of OLAP cubes [18] Spatio-temporal DW General concepts and Issues [13],[16],[48] Trajectory Data Warehousing [13],[42],[44] Table 1: Classification of topics & research literature

6 The broad classification in terms of concepts and algorithms is presented below in Figure 3. Materialized R*a tree artree Index GIST based arb tree Spatiotemporal ahrb tree Spatial Data warehouses Conceptual Model MultiDimER model Other Multidiemnsional models 3 RDB tree Pre Aggregation Concepts Geometric Aggregation Aggregation Aggregation Operations Distribuive, Algebraic and Holistic Big Cube Aggrregation operations Figure 3: Broad classification presented as hierarchy tree 1.2. Scope The scope of this paper is to study the broad concepts in a spatial data warehouse and the research needs. The paper discusses conceptual models, Indices for spatial data warehouses, Materialization and Aggregation over hierarchies, SOLAP and benchmarks. It also outlines the broad research areas in these topics. In addition, the latest emerging trend in this field spatiotemporal data warehouse is explored.

7 1.3. Organization The paper is organized in the following order: First we give a brief overview of conceptual models for spatial warehouses. Focus will be on existing models. In section 3, the current storage and indexing techniques for SDW are presented as well as future research needs are analyzed. Section 4 describes the important concept of aggregation. Section 5 and 6 cover the SOLAP tools for efficiently querying spatial data warehouses and evaluation benchmarks in SDWs respectively. Finally, section 7 describes spatio-temporal data warehouses which are the latest trend. 2. Conceptual models for Spatial Data warehouses A conceptual model is a representation of the concepts and relationships between them [37]. It is primarily for capturing requirements of the decision making users without worrying about the implementation details. There are a conceptual models existing for relational and spatial data bases but they do not scale well to spatial data warehouses due to presence of hierarchies, aggregations, measures and dimensions Existing models. There have been various proposals for multidimensional models for spatial data warehouses. [1] proposes a multidimensional model where measures and dimension are modelled as complex objects. It provides concepts of entity schema and entity instances and uses these to define hierarchies, aggregations and data cube. MultiDimER model [4] [5] is a conceptual model for spatial data warehouse which introduces concepts like spatial level, spatial hierarchies, spatial measures and spatial fact relationships. It is quite flexible in the sense that it does not require spatial dimensions to present for a spatial fact to exist. It allows representation of real word hierarchies [5] in the model. The basic concepts in MultiDimER multidimensional model are: Spatial Level Spatial levels are levels where spatial characteristics are stored. A topological relationship exists between different spatial levels. Spatial Hierarchy A hierarchy which includes at least one spatial level. Spatial dimensions: Extending the concept of dimensions in a Data warehouse, spatial dimensions are dimensions that have at least one spatial hierarchy. In the model spatial dimensions are of 3 types with the following hierarchies: o Non-spatial. o Spatial-to-non-spatial.

8 o Fully spatial Spatial Fact Relationship: A Spatial fact relationship is fact relationship that requires a spatial join between two or more spatial dimensions. Spatial Measures: Spatial measures are measures are either numerical values calculated using topological operators (Length in the figure below) or Geometries which can be aggregated with the hierarchies. Figure 4: MultidimER model for Highway maintenance [4] The MultidimER model in figure 4 has the following attributes, Length is a spatial measure, city and Highway Segment are spatial dimensions as they have spatial hierarchies. An example for the spatial hierarchy here are City and State. [3] describes the mapping of the MultidimER conceptual model into a physical model. The physical model is implemented in Oracle 10g spatial. The paper discusses the implementation issues of schemas created using conceptual models. A Spatial level defined in MultidimER model corresponds to a table in a database. The relationships between levels are represented by many to one relationship between tables. The basic requirements for the design of an effective multidimensional model for spatial data warehouses are described in [21]. We have classified the requirements presented in [21] based on the area they belong to. Figure 5 summarizes our classification:

9 Simplicity Easy to understand but capturing all basic elements Independence in specification s and Implmentation Conceptual Model Implementation independent Flexible in terms of spatia and non spatial attributes Hierarchy Multiple as well as Explicit Handle data with different granularities Irregular spatial hierarchies supported Aggregation Support for thematic & geometric aggregation User defined aggregates Avoid incorrect aggregation Dimenensionless and measure less aggregation Goespatial aggregation Data Handles changes over time OLAP operations including drill through and drill across Handle Uncertainity Figure 5: Requirements of a spatial multidimensional model 3. Storage and Indexing 3.1. Indexing Indices form an important part of a data warehouse spatial or non spatial. If the right index structures are built on columns of dimensions and facts, the performance of queries, especially ad hoc queries is greatly enhanced. There have been some proposals [8] [11] [27] for extending the index structures which currently exist to suit the needs of a spatial data warehouse.

10 [27] describes the extension of the Generalized Index search tree [GIST] framework for efficient OLAP queries on a Spatial Data warehouse. GIST provides 2 interfaces to extend. These are the Predicate and gist interfaces. The search algorithm in GIST uses the predicate Consistent to find all the leaf nodes which are consistent with query predicate. A new state for this predicate is introduced called Partial true. Also, a new search algorithm is proposed for efficient results during a OLAP query Materialized Indexing R*a tree [10] extends the R*- tree for efficient OLAP operations using materialization of the index structure. The paper shows that storing aggregates in the inner nodes of the index tree will improve the response time of OLAP slice and dice queries as the number of accesses to the secondary memory will reduce. A modified recursive range query algorithm is presented in the paper which uses this pre computation and highlights this will be quite useful in range queries. The results also show that the extra space needed for storing the aggregated data is linear to the size of the structure. While R*a tree [10] highlights of concepts of storing aggregates in the index it does not consider spatial objects. The ar tree [8] extends along the same idea of materialization of the index by extending R tree for spatial data warehouses. OLAP operations may need a specific hierarchy which is not defined at design time for spatial data. ar tree stores the results of aggregation functions on all the objects stored by each MBR. The example shown in Figure 6 depicts an ar tree which shows 5 MBRs (a1, a2 a5) and the COUNT of spatial objects within them. ` The advantages of this approach are: Figure 6: The ar tree [8]

11 ar tree defines a hierarchy among MBRs that forms a data cube lattice model. This will give scope for selective materialization of the structure. This idea can be extended to storing results of window queries or all other types of aggregate operators. While the ar tree is considered quite effective for aggregation queries, the effectiveness it provides gets degraded when the number of dimensions is quite large [37]. The complexity is similar to the sequential scanning when the number of dimensions is significant. [7] presents an implementation and exploration of ar-trees [8] for spatial data warehouses Spatiotemporal Indices Most indexing approaches for spatial data warehouses focus on spatial [8] [10] [27] or temporal indexes [43] [44]. Spatio-temporal data warehouses need the integration of spatial and temporal structures for efficiency. The aggregate R- B-tree (arb-tree) [9] is an extended R tree which has a pointer to a B tree which stores historical aggregated data about the MBR. This has been proposed for static spatial dimensions. Figure 7 shows the structure for an arb tree. Figure 7: an arb tree [9] The Aggregate Historical R-B-tree (ahrb) [9] combines the concept of arb tree and Historical R tree [HR tree] for indexing of dynamic spatial dimensions. Each node stores the time span to indicate if it is valid or not and when was it valid in history. Other form of entries of a node is similar to the arb tree. Each time an update happens a new R tree is created at that timestamp. Figure 8 shows the structure of an ahrb tree.

12 Figure 8: an ahrb tree [9] Another proposal for dynamic spatial dimension indexing is the aggregate 3 dimensional RB tree (3RDB-tree) [9] which improves on the limitation of the size of the tree for the ahrb tree. It forms one large R tree for the whole history as opposed to the many small R trees created in ahrb tree. The large R tree stores different version of all the regions in the same tree Selective materialization The selective materialization of a data cube has been studied in detail and techniques have been proposed for effectively choosing the set of cuboids to materialize [40]. 100 A 50 B 30 C D E F 40 G 7 H 10 Figure 9: Example lattice with space cost for selective materialization [40] Figure 9 above shows the lattice model, which forms the key for selective materialization. The edge from node 1 to node 2 (from up to down) shows that the query for node 2 can be answered by the grouping done for node 1. The greedy algorithm proposed in [40] gives the output as the selected nodes to materialize based on space cost minimization.

13 Though the problem of selective materialization extends naturally to spatial data cubes, the difference in the spatial case is that the computation cost. If there is no materialization of the spatial data cubes the online computation becomes very time consuming. This is due to the computationally expensive joins and other operations for spatial data. In [11] a finer granularity approach is suggested for spatial data a cube which focuses on cell level materialization instead of cuboid level. The approach is called object based materialization and focuses on selecting a few spatial objects. The selective materialization is based on the relative access frequency of the sets of mergeable spatial regions. The pre computation occurs if they are expected to be accessed frequently. The algorithms they propose assume that pre computation cuboids are already identified by algorithms in [40] or by minor extension to them Research needs The index structures discussed above focus on materialization of the index structure storing aggregations of spatial measures. Most of the existing work is limited to numerical aggregations and other simple operations. There is a need to study the materialization of indexes for supporting spatio temporal measures like the direction in which a movement is happening. The index selection problem is widely known problem in the databases world. The problem extends naturally to spatial data warehouses where efficiency of retrieval is of prime importance. The methods proposed for selective materialization of spatial data cubes assume that there exists information about the access frequencies of a set of selected cuboids. Methods need to be proposed which are independent of this assumption. 4. Aggregation Aggregation in data warehouses refers to the summarizing of the properties of data over particular dimensions of interest. The most commonly used of these are time and geographic location and applying an aggregation operation of interest to the measure/fact data. Aggregation over spatial data warehouses refers to computing of the aggregated operations on measures on the union of the areas which are considered for aggregation. An example is computing the total size of a union of a number of areas Aggregation operations and techniques Aggregation operations for spatial and non spatial data

14 Aggregations functions for spatial data have been grouped into three categories Distributive, Algebraic and Holistic [22] [47]. Table 2 describes the grouping of the spatial aggregate operations based on three basic categories. Aggregate Operations Data Type Distributive Algebraic Holistic Set of numbers Count, Min, Max, Sum Average, Standard Deviation, MaxN() & Min N() Median, MostFrequent, Rank Set of Geometries Minimal, Orthogonal Bounding Box, Geometric union, Geometric Intersection Centroid, Center of Gravity and Center of Mass Table 2: Set of aggregate operations [47] Nearest Neighbor Index, Equi Partition. [15] describes the BigCube model for multidimensional spatial data. They define aggregation operations as additive, semi-additive and Non Additive and describe how these are incorporated in the multidimensional model. The operations defined are listed in Table 3. Type Big Cube Aggregate Operator Additive Semi Additive Non Additive Count, Min ( Base), Max, Sum (Apex), Concatenate, Convex Hull, Spatial Union, Spatial Intersection Average, Standard Deviation, Variance,MaxN()& Min N() Centroid, Center of Gravity, Center of Mass Median, MostFrequent, Rank, LastNonNullValue, FirstNonNullValue, Minimum Bounding Box, Nearest Neighbor, Equi Partition Agggregation concepts Table 3: Aggregate operations for BigCube [15] The operators presented in [15] [22] [47] work well with spatial objects but aggregation of spatial measures requires to consider the topological relationships existing between them. This is because of the problem of double counting while aggregation. A building listed as a bowling alley and discotheques would be counted twice under aggregation for entertainment [12].

15 [28] deals with this problem and describes the pre-aggregation of spatial measures. The pre-processing of facts is done for computing their disjoint parts. They propose the classification of topological relationships between spatial measures. The preaggregation works if the spatial properties of the objects are distributive over some aggregate function. The drawback of the approach in [28] is that they do not address forms other than polygons. [30] describes a formal model for geometric aggregation. They define three parts namely algebraic part, geometric part, and the Classical OLAP or Application part each of which maintain separate hierarchies and interact with each other to answer queries. Figure 10 shows an example of the three parts Research Needs Figure 10: Geometric, algebraic and application part [30] The Multiple representation problem is widely known problem in spatial databases [52]. The same spatial object may be considered as a point in one application as a polygon in the other. In some other scenario 3 dimensional representation may be followed which considers the object as a cuboid or polyhedron. Figure 11 depicts the three different representations possible of the same spatial object which may be a building in this case.

16 Point Polygon Cuboid Figure 11: Multiple representation of the same object The multiple representation problem is particularly problematic in case of spatial data warehouses because of 2 major reasons [12]. 1. Aggregation and Consolidation of data from different sources where a different representation is followed. 2. SOLAP operations: While doing an operation like roll up and drill down over hierarchies same level may have different representation for the same object making it difficult to choose one. Double Counting while Aggregation Double Counting means incorrect aggregation of measures due to some overlapping property. An example would be the same park being used for a concert and a fair may be counted twice while aggregating the objects classified as entertainment. The problem of double counting has been addressed in [11] considering topological relationships between spatial measures and only doing aggregations over objects which are disjoint thus avoiding the problems of incorrect aggregation. This is still an open problem due to concepts of multiple representation and topological relationships when the objects are represented in 3 dimensions. 5. Spatial Online Analytical Processing (SOLAP) OLAP is an approach to swiftly answer multi-dimensional analytical (MDA) queries [48]. It is a category of decision-support tools often used to provide access in an efficient and intuitive manner to a data warehouse. Some of the examples include Cognos Powerplay, Business Objects and Oracle Express. OLAP tools are not robust to analyze spatial and temporal data. GIS tools are also helpful in analyzing spatial data but still are not good enough to make full utilization of spatio temporal datasets [32]. Therefore, a new approach is to couple of OLAP and GIS functionalities. In this way it will be possible to have decision support tools that are

17 better adapted for spatio temporal exploration and analysis of data. These are called Spatial OLAP systems, or SOLAP Concepts OLAP supports spatial data but it treats a spatial dimension as any other dimension and it does not pay attention to the cartographic component of the data. Data visualization facilitates better understanding of the structure of the data and helps in better decision making capabilities [34]. Maps and graphics do more than make data visible, they can help in driving the historical data analysis. Without a cartographic display, OLAP tools lack an essential feature, which could help the completion of spatiotemporal exploration and analysis processes [30]. OLAP SOLAP GIS Figure 12: SOLAP is created by combining concepts/features of conventional OLAP &GIS This creates a need for SOLAP which has been defined in [24] as a visual platform built especially to support rapid and easy spatio temporal analysis and exploration of data following a multidimensional approach comprised of aggregation levels available in cartographic displays as well as in tabular and diagram Tools for SOLAP In this section, we have summarized the current available tools for SOLAP. SOLAP tools can be divided in three different categories. [32] [34] [35][36]. OLAP dominant (Business Objects, Cognos, Knosys) which provide means for aggregation of data. GIS dominant which focus on geometric operations. Visual data selections or Integrated OLAP and GIS solutions (Geo cube, Sovat).

18 Figure 13 shows the classification of SOLAP tools into various categories. SOLAP TOOLS OLAP based GIS based Integrated Business Objects Cognos Knosya LGS Group Inc. Geo cube SOVAT Figure 13: An overview of Spatial OLAP tools 6. Benchmarks for Spatial Data warehouses Benchmarking is to evaluate or check (something) by comparison with a standard. o How well is the performance of your spatial data warehouse? o Does it need improvements or is it really good? To answer such questions, it is critical to assess the warehouse's performance, relative to an achievable "standard" or "benchmark." Every benchmark should have well-defined success criteria. Before creating a detailed benchmark specification, it is important to decide about the most crucial technical requirements of a data warehouse. This helps in focusing the benchmark in those lines Types of benchmarks There are 2 types of benchmarks: o Functional benchmarks These are the standards to evaluate what functions a system can do. o Performance benchmarks These benchmarks helps to determine and compare how fast the system is. In past few years several concepts are implemented to improve query processing over spatial data warehouses. Few are indices creation and materialized views. In order to evaluate how efficient these techniques are, different datasets with different properties are used. The benchmarks used for spatial data warehouses query processing should fit spatial data warehouse evaluation needs. Also, the benchmark should be able to analyze the performance of operations such as spatial roll up and drill down.

19 6.2. Overview of existing benchmarks for spatial data The following benchmarks exist for spatial data: Benchmark VESPA [37] Measuring performance by considering spatial joins [38] TPC-D TPC-H [39] TPCDS - [40] Star Schema Benchmark (SSB)[41] Spadawan benchmark [25] Description/ Limitations Both of these benchmarks focus on the spatial predicate computation but not aimed at assessing the efficiency of SOLAP operations. Benchmark by Transaction Processing Performance Council for decision support systems. But it does not support indices nor materialized views [39]. This provides individual queries that are not known in advance. However, its schema differs from the traditional star schema. This benchmark is more realistic then the previous ones. It suppresses the schema issue with a snowflake schema, but is aimed at refreshing warehouse with new and changed data originating from the operational side of the business. It extends the TPC-H to enable the analysis of historical trends and provides a set of predefined queries to run over its star schema. The SSB s queries refer to descriptive locations of suppliers and customers. However, the SSB does not hold spatial attributes nor stores maps that would enable multidimensional queries with spatial predicates. Spatial data warehouse benchmark (Spadawan), focuses on this problem by using predefined spatial hierarchies. Helps to address the query processing performance on spatial rollup and drilldown operations. It is a performance benchmark. Table 3 : Various Benchmarks and their limitations Spadawn [25] is the considered very effective for spatial data warehouse benchmarking as it not only generates SDW datasets composed of points and polygons in spatial attributes but also supports evaluation of different types of spatial queries (SOLAP) that enable the performance evaluation of intersection range queries, containment range queries and enclosure range queries in the spatial predicate. It enables the evaluation of spatial roll-up and drill-down operations Research Needs There is a need to research on developing benchmarks for the evaluation of: Spatial data such as lines, polygons with holes and with islands. Spatial data generation and SOLAP query processing.

20 Additional SOLAP query types to analyze drill-across operations on extended SDW schemas. 7. Trends: Spatiotemporal data warehouses Data warehousing applications are based on high-performance databases. Many fields deal with the data that has spatial information as well, like address, location. If we integrate the spatial component of the data with the data warehouse, the decision making potential of such organizations grows manifold Introduction to Spatio- temporal data warehouses Consider the query, How many objects visit a given area during a given time period? This query includes both spatial component and time component. While spatial data warehouses look at many types and dimensions of data including the spatial context, there is a need to include the temporal aspects as well. This will allow applications to see hidden relationships and patterns in data Challenges Many applications refer to moving objects and require spatio-temporal modeling for specific analysis. This type of object motion defines a continue variation in space and time which makes it very difficult to handle such huge datasets Organization of Temporal data Two concepts of time are involved in temporal characteristic of geographic entities- World time and System time [13]. World time refers to the time when an entity change take place in reality whereas the system time means the time that records the entity change in database. Depending on the requirement, users might want to use only system time (eg GIS) or both (Data warehouses) which makes it even harder to model two types of time dimensions in the spatio temporal data warehouses Trajectory data warehousing Tools and techniques Trajectory data warehousing is a branch of spatiotemporal warehousing. Spatio temporal data cubes are essential to support trajectory data. It should allow analysis along temporal dimensions, spatial dimensions at different levels of granularity (point, cell, road) and thematic dimensions, containing, for instance, demographic data.

21 STAU: A spatio-temporal extension for the ORACLE DBMS. It provides data management infrastructure for historical moving objects. It is a system extension to Oracle 10g ORDBMS data management infrastructure for historical MODs. Hermes[13] :Hermes is a database engine for handling objects that change location, shape and size, either discretely or continuously in time.the prototype has been designed as an extension of STAU and it supports the demands of real time dynamic applications (e.g. Location-Based Services LBS). o It is a robust framework that provides functionality for handling spatio-temporal data. o It enables the modeling, construction and querying a database with dynamic objects that change location, shape and size. o Hermes provides spatio-temporal functionality to state-of-the-art Object-Relational DBMS (ORDBMS). The GeoPKDD trajectory data warehouse [42]- GeoPKSS is a project which aims at extracting user-consumable forms of knowledge from large amounts of raw spatio temporal geographic data. Figure 14 below illustrates the GeoPKDD architecture. Figure 14: The GeoPKDD architecture [44] Description of the architecture At the beginning, location data is captured, and is forwarded to a trajectory stream manager, which does some preprocessing operations such as, splitting the raw data according to some criteria, providing a trajectory identifier.

22 These trajectories are then loaded into a moving object database (MOD). MOD is managed by the Hermes system. Basically, the MOD includes a relation MOD Trajectories with schema (Oid, trajectoryid, trajectory), where trajectory is of type Moving Point. In MOD, appropriate querying and Extract-Transform-Load (ETL) processes are applied to update the TDW with trajectory information. The trajectory data warehouse model mentioned is based on the classic star schema. It has a standard temporal dimension, and two spatial dimensions Research needs The motivation of having spatio temporal data warehouses is to utilize valuable information that can be used for decision making purposes in applications, such as mobile marketing, location-based services and traffic control management. Trajectory warehousing is an important step in this. It is an invaluable field which has lot of scope. Owing to high scalability of this type of historical data, future research should focus on modeling, aggregation and indexing to improve efficiency in such warehouses. 8. Future Work Domain specific application of spatial data warehouses are much talked about the research literature [49][50][51]. Future work in this direction would be classification of literature that exists in specific domain and identification of common concepts in each domain. This not only will give present a broad example of use of spatial data warehouses in a domain but also would give an idea of the core concepts which are applied in each domain. Other research directions we would like to include going ahead would be 3 Dimensional spatial objects in terms of spatial data warehouses. 3 Dimensional queries on spatial data warehouses may be helpful in domains like urban planning and Disaster management [12]. The topological relationships for 3 dimensional objects would include relationships like INSIDE, ANYINTERACT [56]. 9. Summary Spatial data warehouses have been an active area for research over the last decade. Concepts like big data are evolving with a big chunk of spatial information to process, store and analyze. Given the latest trends, spatial data warehouses can be considered as a big part of the future research due to their capability to provide decision making users relevant and concise data. The survey we presented covers the broad topics of spatial data warehouses and overview of trends like spatio temporal data warehouses. The topics include conceptual models, storage and indexing, aggregations and spatial OLAP. For some of the topics we have provided the areas

23 where future research is needed. We have also summarized the benchmarks that currently exist and compared and contrasted them for spatial data warehouses. References [1] Towards a Spatial Multidimensional Model - S. Bimonte, A. Tchounikine and M. Miquel DOLAP 05, November 4 5, 2005, Bremen, Germany [2] Modelling multiple representations into spatial data warehouses: a UML-based approach Bédard Yvan, Ph.D, Marie-Josée Proulx, M.Sc, Suzie Larrivée B.Sc., Eveline Bernier, M.Sc. [3] IMPLEMENTING SPATIAL DATAWAREHOUSE HIERARCHIES IN OBJECT- RELATIONAL DBMSs Elzbieta Malinowski and Esteban Zim anyi [4] Representing Spatiality in a Conceptual Multidimensional model [5] Spatial Hierarchies and Topological Relationships in the Spatial MultiDimER model? E. Malinowski?? and E. Zimanyi [6] Multidimensional Model Representing Continuous Fields in Spatial Data Warehouses Alejandro Vaisman, Esteban Zimányi ACM GIS 09 November 4-6, Seattle, WA, USA Copyright 2009 [7] Materialized ar-tree in Distributed Spatial Data Warehouse Marcin Gorawski and Rafal Malczok [8] Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao [9] Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, and Jun Zhang [10] The R*a-tree: An improved R*-tree with Materialized Data for Supporting Range Queries on OLAP-Data - Marcus J urgens, Hans-J. Lenz [11] Object-Based Selective Materialization for Efficient Implementation of Spatial Data Cubes Nebojsa Stefanovic, Member, IEEE Computer Society, Jiawei Han, Member, IEEE Computer Society, and Krzysztof Koperski, Member, IEEE Computer Society [12] Spatial Data Warehouses: Some Solutions and Unresolved Problems Elzbieta Malinowski and Esteban Zim anyi [13] RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming [14] Spatio-Temporal Data Warehouse Design for Human Activity Pattern Analysis L. Savary, T. Wan, K. Zeitouni [15] Viswanathan, G., Schneider, M.: BigCube: A MetaModel for Managing Multidimensional Data. In: Proceedings of the 19th Int. Conf. on Software Engineering and Data Engineering (SEDE). (2010) [16] What is Spatio-Temporal Data Warehousing? Alejandro Vaisman and Esteban Zimanyi [17] Microsoft TerraServer: A Spatial Data Warehouse Tom Barclay, Jim Gray and Don Slutz [18] Map Cube: A Visualization Tool for Spatial Data Warehouses S. Shekhar, C.T. Lu, X. Tan, S. Chawla, and R. Vatsavai [19] Fundamentals of spatial data warehousing for geographic knowledge discovery Yvan Bédard, Tim Merrett and Jiawei Han [20] SOLAP: a new type of user interface to support spatio-temporal multidimensional data exploration and analysis S. Rivest, Y. Bédard, M.J. Proulx, M. Nadeau [21] On the Requirements for User-Centric Spatial Data Warehousing and SOLAP Ganesh Viswanathan & Markus Schneider [22] Gray J., Bosworth A., Layman A., Pirahesh H. Data Cube: a Relational Aggregation Operator

24 Generalizing Group-by, Cross-tabs and Subtotals. ICDE,1996 [23] Spatial OLAP Modelling: An Overview Base on Spatial Objects Changing over Time GabrieI Pestana Miguel Mira da Silva Yvan BCdard [24] Bédard, Y., S. Larrivée, M.-J. Proulx, P.-Y. Caron and F. Létourneau Geospatial Data Warehousing: Positionnement technologique et stratégique. Rapport pour le Centre de recherche pour la defense de Valcartier (CRDV) [25] Benchmarking Spatial Data Warehouses Thiago Luís Lopes Siqueira1,2, Ricardo Rodrigues Ciferri2, Valéria Cesário Times3, Cristina Dutra de Aguiar Ciferri [26] Efficient OLAP Operations for Spatial Data Using Peano Trees Baoying Wang Fei Pan Dongmei Ren Yue Cui Qiang Ding William Perrizo [27] Spatial Hierarchy and OLAP-Favored Search in Spatial Data Warehouse Fangyan Rao, Long Zhang, Xiu Lan Yu, Ying Li, Ying Chen [28] Pre Aggregation in Spatial Data Warehouses Torben Bach Pedersen and Nektaria Tryfona [29] Spatial Aggregation: Data Model and Implementation Sofie Haesevoets, Bart Kuijpers and Alejandro Vaisman [30] Hermes A Framework for Location-Based Data Management- Nikos Pelekis, Yannis Theodoridis, Spyros Vosinakis and Themis Panayiotopoulos. [31] Selective Materialization: An Efficient Method for Spatial Data Cube Construction Jiawei Han, Nebojsa Stefanovic, and Krzysztof Koperski [32] TOWARD BETTER SUPPORT FOR SPATIAL DECISION MAKING: DEFINING THE CHARACTERISTICS OF SPATIAL ON-LINE ANALYTICAL PROCESSING (SOLAP)GEOMATICA Vol. 55, No. 4,2001, pp. 539 to 555 [33] MacEachren, A. M. and M.-J. Kraak Research challenges in geovisualization. Cartography and Geographic Information Science. [34] Bimonte, S., Tchounikine, A., Miquel, M.: Geocube, a multidimensional model and navigation operators handling complex measures: Application in spatial olap. Advances in Information Systems (2006) [35] Scotch, M., Parmanto, B.: SOVAT: Spatial OLAP visualization and analysis tool. In: Proceedings of the 38th Annual Hawaii Int. Conf. on System Sciences (HICSS), IEEE (2005) 142b [36] Marchand, P., Brisebois, A., B edard, Y., Edwards, G.: Implementation and evaluation of a hypercube-based method for spatiotemporal exploration and analysis ISPRS journal of photogrammetric and remote sensing 59(1-2) (2004) 6 20 [37] Paton, N.W., Williams, M.H., Dietrich, K., Liew, O., Dinn, A., Patrick, A.: "VESPA: a benchmark for vector spatial databases", In BNCOD, pages , [38] Günther, O., Oria, V., Picouet, P., Saglio, J., Scholl, M.: "Benchmarking spatial joins à la carte", In SSDBM, pages 32-41, [39] Poess, M., Floyd, C.: "New TPC benchmarks for decision support and web commerce", SIGMOD Record, 29(4):64-71, [40] Poess, M., Smith, B., Kollar, L., Larson, P.: "TPC-DS, taking decision support benchmarking to the next level", In SIGMOD, pages , [41] O'Neil, P., O'Neil, E., Chen, X., Revilak, S.: "The star schema benchmark and augmented fact table indexing", In TPCTC, pages , [42] Geographic privacy aware Knowledge Discovery and Delivery, Damiani, Vangenot, Frentzos, Marketos, Theodoridis, Veryklos, and Raffaeta (2007) [43] Kim, J., Kang, S., Kim, M. Effective Temporal Aggregation using Point-based Trees. DEXA, [44] Yang, J., Widom, J. Incremental Computation of Temporal Aggregates. ICDE, [45] Harinarayan V., Rajaraman A., Ullman J. Implementing Data Cubes Efficiently.ACM SIGMOD, [46] Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications

25 (Data-Centric Systems and Applications) - Elzbieta Malinowski, Esteban Zimányi ; Springer; 1st ed Corr. 2nd printing edition (April 6, 2011) [47] Spatial Databases: A Tour - Shashi Shekhar, Sanjay Chawla ; Prentice Hall 1 edition (June 20, 2003) [48] Leticia I. Gómez, Bart Kuijpers, Bart Moelans, Alejandro A. Vaisman: A Survey of Spatio- Temporal Data Warehousing ; International Journal of Data Warehousing and Mining 2009 [49] Spatial Data Warehousing for Hospital Organizations : An ESRI whitepaper [50] Octavio Glorio, Jose-Norberto Mazón, Irene Garrigós, Juan Trujillo - Using Web-based Personalization on Spatial Data Warehouses [51] Michael McGuirea, Aryya Gangopadhyayb, Anita Komlodib, Christopher Swanc - A usercentered design for a spatial data warehouse for data exploration in environmental research [52] S. Zlatanova, J.E. Stoter and W.Quak : Management of multiple representations in spatial DBMSs [53] [54] [55] [56]

Benchmarking Spatial Data Warehouses

Benchmarking Spatial Data Warehouses Benchmarking Spatial Data Warehouses Thiago Luís Lopes Siqueira 1,2, Ricardo Rodrigues Ciferri 2, Valéria Cesário Times 3, Cristina Dutra de Aguiar Ciferri 4 1 São Paulo Federal Institute of Education,

More information

Spatial Data Warehouse and Mining. Rajiv Gandhi

Spatial Data Warehouse and Mining. Rajiv Gandhi Spatial Data Warehouse and Mining Rajiv Gandhi Roll Number 05331002 Centre of Studies in Resource Engineering Indian Institute of Technology Bombay Powai, Mumbai -400076 India. As part of the first stage

More information

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Thiago Luís Lopes Siqueira Ricardo Rodrigues Ciferri Valéria Cesário Times Cristina Dutra de

More information

IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs

IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs Elzbieta Malinowski and Esteban Zimányi Computer & Decision Engineering Department, Université Libre de Bruxelles 50 av.f.d.roosevelt,

More information

On the Requirements for User-Centric Spatial Data Warehousing and SOLAP

On the Requirements for User-Centric Spatial Data Warehousing and SOLAP On the Requirements for User-Centric Spatial Data Warehousing and SOLAP Ganesh Viswanathan & Markus Schneider Department of Computer & Information Science & Engineering University of Florida Gainesville,

More information

CONTINUOUS DATA WAREHOUSE: CONCEPTS, CHALLENGES AND POTENTIALS

CONTINUOUS DATA WAREHOUSE: CONCEPTS, CHALLENGES AND POTENTIALS Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 CONTINUOUS DATA WAREHOUSE: CONCEPTS,

More information

CHAPTER-24 Mining Spatial Databases

CHAPTER-24 Mining Spatial Databases CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification

More information

Indexing and Retrieval of Historical Aggregate Information about Moving Objects

Indexing and Retrieval of Historical Aggregate Information about Moving Objects Indexing and Retrieval of Historical Aggregate Information about Moving Objects Dimitris Papadias, Yufei Tao, Jun Zhang, Nikos Mamoulis, Qiongmao Shen, and Jimeng Sun Department of Computer Science Hong

More information

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses 1

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses 1 Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses 1 Thiago Luís Lopes Siqueira 1, Ricardo Rodrigues Ciferri 1, Valéria Cesário Times 2, Cristina

More information

Requirements engineering for a user centric spatial data warehouse

Requirements engineering for a user centric spatial data warehouse Int. J. Open Problems Compt. Math., Vol. 7, No. 3, September 2014 ISSN 1998-6262; Copyright ICSRS Publication, 2014 www.i-csrs.org Requirements engineering for a user centric spatial data warehouse Vinay

More information

Spatial and Spatio-Temporal Multidimensional Data Modelling: A Survey

Spatial and Spatio-Temporal Multidimensional Data Modelling: A Survey ISSN 2278 1021 Spatial and Spatio-Temporal Multidimensional Data Modelling: A Survey Saida Aissi 1, Mohamed Salah Gouider 2 Bestmod Laboratory, University of Tunis, High Institute of Management of Tunis,

More information

Continuous Spatial Data Warehousing

Continuous Spatial Data Warehousing Continuous Spatial Data Warehousing Taher Omran Ahmed Faculty of Science Aljabal Algharby University Azzentan - Libya Taher.ahmed@insa-lyon.fr Abstract Decision support systems are usually based on multidimensional

More information

A Survey on Spatio-Temporal Data Warehousing

A Survey on Spatio-Temporal Data Warehousing A Survey on Spatio-Temporal Data Warehousing Leticia Gómez Instituto Tecnológico de Buenos Aires Bart Kuijpers, Bart Moelans, Alejandro Vaisman University of Hasselt and Transnational University of Limburg,

More information

Fuzzy Spatial Data Warehouse: A Multidimensional Model

Fuzzy Spatial Data Warehouse: A Multidimensional Model 4 Fuzzy Spatial Data Warehouse: A Multidimensional Model Pérez David, Somodevilla María J. and Pineda Ivo H. Facultad de Ciencias de la Computación, BUAP, Mexico 1. Introduction A data warehouse is defined

More information

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial

More information

CubeView: A System for Traffic Data Visualization

CubeView: A System for Traffic Data Visualization CUBEVIEW: A SYSTEM FOR TRAFFIC DATA VISUALIZATION 1 CubeView: A System for Traffic Data Visualization S. Shekhar, C.T. Lu, R. Liu, C. Zhou Computer Science Department, University of Minnesota 200 Union

More information

Sprogo Research and a Case Study in field Data Management

Sprogo Research and a Case Study in field Data Management A New Relational Spatial OLAP Approach For Multi-resolution and Spatio-multidimensional Analysis of Incomplete Field Data Mehdi Zaamoune, Sandro Bimonte, François Pinet, Philippe Beaune To cite this version:

More information

User-centric spatial data warehousing: a survey of requirements and approaches. Ganesh Viswanathan and Markus Schneider*

User-centric spatial data warehousing: a survey of requirements and approaches. Ganesh Viswanathan and Markus Schneider* Int. J. Data Mining, Modelling and Management, Vol. x, No. x, xxxx 1 User-centric spatial data warehousing: a survey of requirements and approaches Ganesh Viswanathan and Markus Schneider* Department of

More information

HSTB-index: A Hierarchical Spatio-Temporal Bitmap Indexing Technique

HSTB-index: A Hierarchical Spatio-Temporal Bitmap Indexing Technique HSTB-index: A Hierarchical Spatio-Temporal Bitmap Indexing Technique Cesar Joaquim Neto 1, 2, Ricardo Rodrigues Ciferri 1, Marilde Terezinha Prado Santos 1 1 Department of Computer Science Federal University

More information

SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING

SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING 1 Leila Esheiba, Hoda M.O.Mokhtar 2, and Mohamed El-Sharkawi 3 1 Department of Information Systems, Faculty of Computers and Information, Cairo

More information

New Approach of Computing Data Cubes in Data Warehousing

New Approach of Computing Data Cubes in Data Warehousing International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 14 (2014), pp. 1411-1417 International Research Publications House http://www. irphouse.com New Approach of

More information

St-Toolkit: A Framework for Trajectory Data Warehousing

St-Toolkit: A Framework for Trajectory Data Warehousing St-Toolkit: A Framework for Trajectory Data Warehousing Simone Campora 1, Jose Antonio Fernandes de Macedo 2, Laura Spinsanti 3 1 LBD,EPFL Lausanne, CH 2 LIA, UFC Fortaleza, BR 3 LBD,EPFL Lausanne CH)

More information

Designing Data Warehouses for Geographic OLAP querying by using MDA

Designing Data Warehouses for Geographic OLAP querying by using MDA Designing Data Warehouses for Geographic OLAP querying by using MDA Octavio Glorio and Juan Trujillo University of Alicante, Spain, Department of Software and Computing Systems Lucentia Research Group

More information

Towards a Logical Multidimensional Model for Spatial Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista

Towards a Logical Multidimensional Model for Spatial Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista Towards a Logical Multidimensional Model for Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista Information System Laboratory - LSI, Federal University of Campina

More information

Oracle8i Spatial: Experiences with Extensible Databases

Oracle8i Spatial: Experiences with Extensible Databases Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction

More information

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key

More information

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,

More information

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

More information

DATA WAREHOUSING AND OLAP TECHNOLOGY

DATA WAREHOUSING AND OLAP TECHNOLOGY DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are

More information

Spatial Data Warehouse Modelling

Spatial Data Warehouse Modelling Spatial Data Warehouse Modelling 1 Chapter I Spatial Data Warehouse Modelling Maria Luisa Damiani, DICO - University of Milan, Italy Stefano Spaccapietra, Ecole Polytechnique Fédérale, Switzerland Abstract

More information

A Brief Tutorial on Database Queries, Data Mining, and OLAP

A Brief Tutorial on Database Queries, Data Mining, and OLAP A Brief Tutorial on Database Queries, Data Mining, and OLAP Lutz Hamel Department of Computer Science and Statistics University of Rhode Island Tyler Hall Kingston, RI 02881 Tel: (401) 480-9499 Fax: (401)

More information

Survey On: Nearest Neighbour Search With Keywords In Spatial Databases

Survey On: Nearest Neighbour Search With Keywords In Spatial Databases Survey On: Nearest Neighbour Search With Keywords In Spatial Databases SayaliBorse 1, Prof. P. M. Chawan 2, Prof. VishwanathChikaraddi 3, Prof. Manish Jansari 4 P.G. Student, Dept. of Computer Engineering&

More information

Fluency With Information Technology CSE100/IMT100

Fluency With Information Technology CSE100/IMT100 Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

More information

Analyzing Polls and News Headlines Using Business Intelligence Techniques

Analyzing Polls and News Headlines Using Business Intelligence Techniques Analyzing Polls and News Headlines Using Business Intelligence Techniques Eleni Fanara, Gerasimos Marketos, Nikos Pelekis and Yannis Theodoridis Department of Informatics, University of Piraeus, 80 Karaoli-Dimitriou

More information

Efficient OLAP Operations in Spatial Data Warehouses

Efficient OLAP Operations in Spatial Data Warehouses Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay,

More information

Tracking System for GPS Devices and Mining of Spatial Data

Tracking System for GPS Devices and Mining of Spatial Data Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja

More information

A Design and implementation of a data warehouse for research administration universities

A Design and implementation of a data warehouse for research administration universities A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1 Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics

More information

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1 Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for

More information

CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB

CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB Badal K. Kothari 1, Prof. Ashok R. Patel 2 1 Research Scholar, Mewar University, Chittorgadh, Rajasthan, India 2 Department of Computer

More information

Indexing Spatio-Temporal Data Warehouses

Indexing Spatio-Temporal Data Warehouses Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, and Jun Zhang Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Hong Kong

More information

Week 13: Data Warehousing. Warehousing

Week 13: Data Warehousing. Warehousing 1 Week 13: Data Warehousing Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots of buzzwords, hype slice & dice, rollup,

More information

Design and Implementation of Enterprise Spatial Data Warehouse

Design and Implementation of Enterprise Spatial Data Warehouse Design and Implementation of Enterprise Spatial Data Warehouse Yin Liang 1 2 and Hong Zhang J ISchool ofenvironment Science and Spatial Infonnatics, China University ofmining and Technology, XuZhou 221008,

More information

Analyzing Trajectory Data

Analyzing Trajectory Data Analyzing Trajectory Data Gerasimos Marketos Free University of Bolzano 20/3/2009 Information Systems Lab (InfoLab) University of Piraeus (UniPi), Greece http://infolab.cs.unipi.gr Outline Preliminaries

More information

Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making

Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making Professor Yvan Bedard, PhD, P.Eng. Centre for Research in Geomatics Laval Univ., Quebec, Canada National Technical University

More information

Basics of Dimensional Modeling

Basics of Dimensional Modeling Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimensional

More information

When to consider OLAP?

When to consider OLAP? When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP

More information

Indexing Techniques for Data Warehouses Queries. Abstract

Indexing Techniques for Data Warehouses Queries. Abstract Indexing Techniques for Data Warehouses Queries Sirirut Vanichayobon Le Gruenwald The University of Oklahoma School of Computer Science Norman, OK, 739 sirirut@cs.ou.edu gruenwal@cs.ou.edu Abstract Recently,

More information

Interactive Exploration of Multi granularity Spatial and Temporal Datacubes: Providing Computer Assisted Geovisualization Support

Interactive Exploration of Multi granularity Spatial and Temporal Datacubes: Providing Computer Assisted Geovisualization Support Interactive Exploration of Multi granularity Spatial and Temporal Datacubes: Providing Computer Assisted Geovisualization Support Véronique Beaulieu 1 & Yvan Bédard 2 Laval University Centre for Research

More information

Mario Guarracino. Data warehousing

Mario Guarracino. Data warehousing Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the

More information

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2 Class Announcements TIM 50 - Business Information Systems Lecture 15 Database Assignment 2 posted Due Tuesday 5/26 UC Santa Cruz May 19, 2015 Database: Collection of related files containing records on

More information

The DC-Tree: A Fully Dynamic Index Structure for Data Warehouses

The DC-Tree: A Fully Dynamic Index Structure for Data Warehouses Published in the Proceedings of 16th International Conference on Data Engineering (ICDE 2) The DC-Tree: A Fully Dynamic Index Structure for Data Warehouses Martin Ester, Jörn Kohlhammer, Hans-Peter Kriegel

More information

A Critical Review of Data Warehouse

A Critical Review of Data Warehouse Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin

More information

INTEROPERABILITY IN DATA WAREHOUSES

INTEROPERABILITY IN DATA WAREHOUSES INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content

More information

The DC-tree: A Fully Dynamic Index Structure for Data Warehouses

The DC-tree: A Fully Dynamic Index Structure for Data Warehouses The DC-tree: A Fully Dynamic Index Structure for Data Warehouses Martin Ester, Jörn Kohlhammer, Hans-Peter Kriegel Institute for Computer Science, University of Munich Oettingenstr. 67, D-80538 Munich,

More information

The impact of spatial data redundancy on SOLAP query performance

The impact of spatial data redundancy on SOLAP query performance Journal of the Brazilian Computer Society, 2009; 15(1):19-34. ISSN 0104-6500 The impact of spatial data redundancy on SOLAP query performance Thiago Luís Lopes Siqueira 1,2, Cristina Dutra de Aguiar Ciferri

More information

Turkish Journal of Engineering, Science and Technology

Turkish Journal of Engineering, Science and Technology Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server

More information

Towards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study

Towards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study www.ijcsi.org 561 Towards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study Saida Aissi 1, Mohamed Salah Gouider 2 Bestmod Laboratory, University of Tunis, High

More information

BUILDING OLAP TOOLS OVER LARGE DATABASES

BUILDING OLAP TOOLS OVER LARGE DATABASES BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,

More information

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive

More information

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner 24 Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner Rekha S. Nyaykhor M. Tech, Dept. Of CSE, Priyadarshini Bhagwati College of Engineering, Nagpur, India

More information

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina Data Warehousing Read chapter 13 of Riguzzi et al Sistemi Informativi Slides derived from those by Hector Garcia-Molina What is a Warehouse? Collection of diverse data subject oriented aimed at executive,

More information

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application

More information

Multi-dimensional index structures Part I: motivation

Multi-dimensional index structures Part I: motivation Multi-dimensional index structures Part I: motivation 144 Motivation: Data Warehouse A definition A data warehouse is a repository of integrated enterprise data. A data warehouse is used specifically for

More information

A Dynamic Load Balancing Strategy for Parallel Datacube Computation

A Dynamic Load Balancing Strategy for Parallel Datacube Computation A Dynamic Load Balancing Strategy for Parallel Datacube Computation Seigo Muto Institute of Industrial Science, University of Tokyo 7-22-1 Roppongi, Minato-ku, Tokyo, 106-8558 Japan +81-3-3402-6231 ext.

More information

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Systems: Foundations and Architectures Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository

More information

Data W a Ware r house house and and OLAP II Week 6 1

Data W a Ware r house house and and OLAP II Week 6 1 Data Warehouse and OLAP II Week 6 1 Team Homework Assignment #8 Using a data warehousing tool and a data set, play four OLAP operations (Roll up (drill up), Drill down (roll down), Slice and dice, Pivot

More information

Building Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu

Building Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu Building Data Cubes and Mining Them Jelena Jovanovic Email: jeljov@fon.bg.ac.yu KDD Process KDD is an overall process of discovering useful knowledge from data. Data mining is a particular step in the

More information

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools Paper by W. F. Cody J. T. Kreulen V. Krishna W. S. Spangler Presentation by Dylan Chi Discussion by Debojit Dhar THE INTEGRATION OF BUSINESS INTELLIGENCE AND KNOWLEDGE MANAGEMENT BUSINESS INTELLIGENCE

More information

Distance Learning and Examining Systems

Distance Learning and Examining Systems Lodz University of Technology Distance Learning and Examining Systems - Theory and Applications edited by Sławomir Wiak Konrad Szumigaj HUMAN CAPITAL - THE BEST INVESTMENT The project is part-financed

More information

MODELING, VISUALIZING, AND MINING HYDROLOGIC SPATIAL HIERARCHIES FOR WATER QUALITY MANAGEMENT INTRODUCTION

MODELING, VISUALIZING, AND MINING HYDROLOGIC SPATIAL HIERARCHIES FOR WATER QUALITY MANAGEMENT INTRODUCTION MODELING, VISUALIZING, AND MINING HYDROLOGIC SPATIAL HIERARCHIES FOR WATER QUALITY MANAGEMENT Michael P. McGuire Center for Urban Environmental Research and Education Aryya Gangopadhyay Department of Information

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

Review. Data Warehousing. Today. Star schema. Star join indexes. Dimension hierarchies

Review. Data Warehousing. Today. Star schema. Star join indexes. Dimension hierarchies Review Data Warehousing CPS 216 Advanced Database Systems Data warehousing: integrating data for OLAP OLAP versus OLTP Warehousing versus mediation Warehouse maintenance Warehouse data as materialized

More information

Lecture Data Warehouse Systems

Lecture Data Warehouse Systems Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in

More information

Data Mining and Database Systems: Where is the Intersection?

Data Mining and Database Systems: Where is the Intersection? Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: surajitc@microsoft.com 1 Introduction The promise of decision support systems is to exploit enterprise

More information

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Data Sheet IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Overview Multidimensional analysis is a powerful means of extracting maximum value from your corporate

More information

The use of UML to design agricultural data warehouses

The use of UML to design agricultural data warehouses Author-produced version of the paper presented10/4 at AGENG 2020 Conference, Septembre 6-8 2010, Clermont-Ferrand, France The use of UML to design agricultural data warehouses François Pinet 1, André Miralles

More information

Dimensional Modeling for Data Warehouse

Dimensional Modeling for Data Warehouse Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or

More information

SDWM: An Enhanced Spatial Data Warehouse Metamodel

SDWM: An Enhanced Spatial Data Warehouse Metamodel SDWM: An Enhanced Spatial Data Warehouse Metamodel Alfredo Cuzzocrea 1, Robson do Nascimento Fidalgo 2 1 ICAR-CNR & University of Calabria, 87036 Rende (CS), ITALY cuzzocrea@si.deis.unical.it. 2. CIN,

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing Class Projects Class projects are going very well! Project presentations: 15 minutes On Wednesday

More information

B.Sc (Computer Science) Database Management Systems UNIT-V

B.Sc (Computer Science) Database Management Systems UNIT-V 1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used

More information

Data Warehousing, OLAP, and Data Mining

Data Warehousing, OLAP, and Data Mining Data Warehousing, OLAP, and Marek Rychly mrychly@strathmore.edu Strathmore University, @ilabafrica & Brno University of Technology, Faculty of Information Technology Advanced Databases and Enterprise Systems

More information

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse? Outline Data Warehousing What is a data warehouse? Why a warehouse? Models & operations Implementing a warehouse 2 What is a Warehouse? Collection of diverse data subject oriented aimed at executive, decision

More information

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on

More information

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,

More information

Course Design Document. IS417: Data Warehousing and Business Analytics

Course Design Document. IS417: Data Warehousing and Business Analytics Course Design Document IS417: Data Warehousing and Business Analytics Version 2.1 20 June 2009 IS417 Data Warehousing and Business Analytics Page 1 Table of Contents 1. Versions History... 3 2. Overview

More information

University of Gaziantep, Department of Business Administration

University of Gaziantep, Department of Business Administration University of Gaziantep, Department of Business Administration The extensive use of information technology enables organizations to collect huge amounts of data about almost every aspect of their businesses.

More information

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker

More information

International Journal of Advanced Research in Computer Science and Software Engineering

International Journal of Advanced Research in Computer Science and Software Engineering Volume, Issue, March 201 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com An Efficient Approach

More information

GIS Databases With focused on ArcSDE

GIS Databases With focused on ArcSDE Linköpings universitet / IDA / Div. for human-centered systems GIS Databases With focused on ArcSDE Imad Abugessaisa g-imaab@ida.liu.se 20071004 1 GIS and SDBMS Geographical data is spatial data whose

More information

Designing a Dimensional Model

Designing a Dimensional Model Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and

More information

Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE?

Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE? Introduction Spatial Data Mining: Progress and Challenges Survey Paper Krzysztof Koperski, Junas Adhikary, and Jiawei Han (1996) Review by Brad Danielson CMPUT 695 01/11/2007 Authors objectives: Describe

More information

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing Anwendungssoftwares a Data-Warehouse-, Data-Mining- and OLAP-Technologies Online Analytic Processing Online Analytic Processing OLAP Online Analytic Processing Technologies and tools that support (ad-hoc)

More information

Advanced Data Warehouse Design

Advanced Data Warehouse Design Data-Centric Systems and Applications Advanced Data Warehouse Design From Conventional to Spatial and Temporal Applications Bearbeitet von Elzbieta Malinowski, Esteban Zimányi 1st ed. 2008. Corr. 2nd printing

More information

Web Log Data Sparsity Analysis and Performance Evaluation for OLAP

Web Log Data Sparsity Analysis and Performance Evaluation for OLAP Web Log Data Sparsity Analysis and Performance Evaluation for OLAP Ji-Hyun Kim, Hwan-Seung Yong Department of Computer Science and Engineering Ewha Womans University 11-1 Daehyun-dong, Seodaemun-gu, Seoul,

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,

More information

How To Model Data For Business Intelligence (Bi)

How To Model Data For Business Intelligence (Bi) WHITE PAPER: THE BENEFITS OF DATA MODELING IN BUSINESS INTELLIGENCE The Benefits of Data Modeling in Business Intelligence DECEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2

More information

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing Database Applications Advanced Querying Transaction processing Online setting Supports day-to-day operation of business OLAP Data Warehousing Decision support Offline setting Strategic planning (statistics)

More information