St-Toolkit: A Framework for Trajectory Data Warehousing
|
|
- Jacob Wilcox
- 8 years ago
- Views:
Transcription
1 St-Toolkit: A Framework for Trajectory Data Warehousing Simone Campora 1, Jose Antonio Fernandes de Macedo 2, Laura Spinsanti 3 1 LBD,EPFL Lausanne, CH 2 LIA, UFC Fortaleza, BR 3 LBD,EPFL Lausanne CH) simone.campora@epfl.ch, jose.macedo@lia.ufc.br, laura.spinsanti@epfl.ch ABSTRACT Turning a collection of simple time-geography data into mobility knowledgeis a key issue in many research domains, such as social analysis and mobility investigation. Although collecting mobility data has become technologically feasible, turning these huge sets of data into mobility knowledge is still an open issue. Data warehousing is a well-established technique used for analysis of summarized data, but is not yet adequate to support spatio-temporal feature-sets. This paper proposes a framework for improving the design and the implementation of data warehouses that support spatio-temporal concepts on top of relational DBMS. This framework has been used for designing and implementing a trajectory data warehouse (TDW) for analyzing traffic data. Besides, we show that this framework also supports OLAP, SOLAP and STOLAP queries. INTRODUCTION Turning a collection of simple time-geography data into mobility knowledge is an open challenge. For instance, let s consider a traffic management scenario where a manager is asked to take some decisions based on the visualization of vehicles trajectories as showed in Figure 1. In this Figure, lines represent trajectories while points represent points of interest (e.g. restaurants, supermarkets, etc). We can conclude that the manager faces a complex task in extracting knowledge from this dataset, which can support his decisions. The challenges are mainly due to: The excess of data that is being projected on the screen; The lack of semantics for a given trajectory in relation to points of interest;
2 AGILE 2011, April 18-22: Simone Campora, Jose Antonio Fernandes de Macedo, Laura Spinsanti Figure 1:Trajectory and Point of Interest data In addition to these problems, trajectory data is inherently spatiotemporal, requiring sophisticated methods to cope with spatial and temporal operations on large datasets. Therefore, analysis on this domain will require different analytical query capabilities, which can be classified as thematic, spatial, temporal and spatio-temporal. Here it follows some examples of such queries: Thematic: Retrieve the average travel time of December 2009 trajectory groups that includes more than 10% of all the trajectories. Spatial: Retrieve the stops that occur near events of type Restaurant and that are located within 3km of Duomo in Milan. Temporal: Retrieve the stops that occur near an area where more than X other trajectories are stopping at lunch time or dinner time and that are located within 3km of (45.46,9.18) coordinates. Spatio-Temporal: For all summer months, give the number of Congested Trajectories in Milan s dense population neighborhoods. Data Warehouse technology has become de facto standard for complex analytical and customized queries with a rapid execution time. However, there are several limitations in current data warehouse tools to cope with spatio-temporal data as found in the literature (Güting et al., 2004; Vaisman and Zimányi, 2009). From our perspective, a trajectory is not only a set of time-geography points but it is a semantic object that has an identity, sub-components (stops, moves and episodes) and several related thematic information. For this reason, in our work, trajectory concept must be treated as a first class object and the resultant TDW is more than a spatio-temporal data repository. 2
3 St-Toolkit: A Framework for Trajectory Data Warehousing Our contributions are twofold: a semantic model for trajectory data warehouse and a middleware for loading, designing and querying a spatiotemporal data warehouse. The model is based on the conceptual view on trajectories introduced by Spaccapietra et al A trajectory is a segment of the spatio-temporal path covered by a moving object. The data warehouse design we developed is tailored around Episodes from Zhixian et al., An episode is a sequence of location-based points that has the same moving dynamic. Episodes constitute a flexible segmentation of the dataset into semantic groups that can be used to model move and stop occurrences. We produced a generic cross-database framework for spatio-temporal data warehousing, basing the implementation on the approach proposed by Malinowski and Zimányi, This generic middleware has two main functionalities: on one hand it implements a common simple query interface to access the functionality of different proprietary implementation; on the other hand it builds a generic interface to support spatio-temporal concepts in data warehouses. Furthermore we developed a generic middleware capable of encapsulating the data warehouse product dependent technicalities and building a generic design interface that have been extended to support spatio-temporal concepts in data warehouses. TRAJECTORY DATA WAREHOUSE SCHEMA Figure 2: The Proposed Star Schema using MultiDimER notation Since our scope is to provide a generic solution for designing and implementing TDW, we need to develop a model composed by constructs that allow the specification of a trajectory data warehouse schema. Thus, we have devised a model illustrated in Figure 2. From our perspective and 3
4 AGILE 2011, April 18-22: Simone Campora, Jose Antonio Fernandes de Macedo, Laura Spinsanti unlike other solutions (e.g. Orlando 2006), it is significantly important that trajectory identity can be preserved. We adopt the semantic trajectory model, introduced by Spaccapietra et al., 2007, which considers a trajectory as the user defined record of the evolution of the position (perceived as a point) of an object that is moving in space during a given time interval in order to achieve a given goal. Each trajectory can be semantically segmented by defining a sequence of moves and stops. We define the movement primitives focusing the analysis tools on top of the concept of Episode introduced by Zhixian et al., 2010 that is a more general unified representation of a trajectory element. This brings flexibility to our model for further applications in which episodes can be calculated in different ways. An important aspect of data warehousing management architectures is the handling of aggregate operators. The most common cited operators that can be found in the literature have been implemented and fully integrated in our generic middleware, see Table 1. Operator Trajectory Length Number of Stops Number of Moves Average Speed Stops Moves Interpolated Shape Lifespan Average Travel Time Average stop Time Visited Areas Trajectory Clusters Most Frequent Set Type Numeric Numeric Numeric Numeric Spatial Spatial Spatial Temporal Temporal Temporal Spatio-Temporal Spatio-Temporal Spatio-Temporal Table 1: The Aggregate Operators ST-TOOLKIT Our framework proposal is called St-Toolkit 1 and is a generic middleware. It encapsulates the main database and data warehouse components, but also it hides the details of the single data warehouse implementation in order to abstract entities, such as cubes, dimensions, and measures. ST-Toolkit exploits proprietary drivers for OLAP solutions and, when it is possible, it uses vendor-provided APIs in order to boost data warehouse performances. Figure 3 illustrates the main components of ST-Toolkit framework. It is worth noticing that with our framework is possible to implement a data warehouse schema in OLAP servers (e.g. Oracle OLAP and Mondrian), or 1 Web reference: 4
5 St-Toolkit: A Framework for Trajectory Data Warehousing in relational databases.it is a generic interface from which any proprietary database management system drivers can be mapped in order to become a ST-Toolkit access point for hosting Spatio-Temporal Data Warehouses. From a broad point of view the architecture is taking care of the most common aspects in moving object data management from ETL procedures primitives to the core of the library: the Generic Object Interface for Data Warehousing. Taking a brief overview on similar architectures, the closest middleware solution that is capable of handling OLAP and SOLAP queries is GeoMondrian. It is an extension of the Mondrian OLAP Server that enables the use of spatial filtering. Mondrian has a dedicated query engine, which translates MDX OLAP queries into simple SQL relational queries, which is introducing further latencies to the query execution-plans. Figure 3: ST-Toolkit Components From a procedural point of view, Mondrian (and its extension GeoMondrian) needs to look up on an XML description of the data warehouse schema in order to initialize its data connectors. Our solution instead is embedding the schema definition within java objects, allowing the user to gain modularity and making the schema capable of evolutions during execution time. Instantiating ST DW schema ST-Toolkit provides an API for creating and querying a DW schema. The java code presented below shows an example of RelationalTablecreation. First, a database connection is established, then an object representing a relational table tbl_time is created, and then the data referring to this table is 5
6 AGILE 2011, April 18-22: Simone Campora, Jose Antonio Fernandes de Macedo, Laura Spinsanti loaded. A database object is defined as any entity that lives in a persistent state in the database. For instance a RelationalTable object is used to manipulate relational tables; therefore it is extensively used to interface with the data warehouse metadata. DatabaseConnection co = new DatabaseConnection (...); RelationalTabletimeTbl = new RelationalTable("tbl _time"); timetbl.loadobject (co ); In the Java code below, a dimension is specified with one level in its hierarchy, as follows: Level monthlevel = new Level("Month", timetbl, "mon"); Hierarchy timeh = new Hierarchy("Hier Name", Hierarchy.LEVEL_BASED, Hierarchy.NON_GEOMETRIC ); timeh.addlevel ( monthlevel ); Dimension dim = new Dimension("Time", timeh); Semantics The use of semantics to enrich STOLAP queries it is a fundamental aspect in any knowledge-discovery process. We propose a first step to integrate spatio-temporal semantics using an Event Dimension. It subdivides the data in relation to space-time proximity with application specific events. Those events are spatially and timely correlated with the fact data. Examples of events might be Traffic Jams, Festivals, or simple Points of Interest such as Restaurants or Malls. We want to keep this dimension as much independent from the trajectory dataset as possible. This decision has a twofold objective: it allows using the same trajectory dataset for different application purposes (e.g. traffic management and social analysis) and it avoids heavy computational costs while updating the Event dimension. Encapsulated semantics allows to an easiest and more human readable query definition, as is possible to see from the following examples: EventCrossingPatterns_1 and EventVisitingPatterns_1 queries use semantics whereas EventCrossingPatterns_2 and EventVisitingPatterns_2 do not use semantics. EventCrossingPatterns_1 (SOLAP Query): Retrieve all the stops that occur near events of type Restaurant and that are located within 3km of Duomo in Milan. EventCrossingPatterns_2 (SOLAP Query): Retrieve all the stops that occur near an area where more than X other trajectories are 6
7 St-Toolkit: A Framework for Trajectory Data Warehousing stopping at lunch time or dinner time and that are located within 3 km from (45.46,9.18) coordinates. EventVisitingPatterns_1 (STOLAP Query): Give the number of visits of a moving entity for events of type Food-Shop where its own trajectory started near a Residential Area. EventVisitingPatterns_2 (STOLAP Query):Give the number of trajectories that can be contained in spatio-temporal prism which trajectory density is more than X arbitrary value and where any trajectory started near an area where a lot of trajectories start in the morning. Our design module is highly enriched by the capability of handling context-aware, providing extensibility to application domain feature sets (e.g. POIs). CASE STUDY This experimental section is aimed to test and to provide a case study for the trajectory data warehouse architecture. We will briefly describe the possible queries that we can address, showing later with the results on a real dataset of transportation traffic data in Milan, Italy. The dataset we used to test our Trajectory Data Warehouse contains data collected from cars running in Milano provinces, Italy in 2007 (see Table 2 for a dataset description). Please note that tables from 2 and 3 are shown as an informative samples of experimental data that can be extracted using OLAP, SOLAP and STOLAP operators defined for the given data warehouse and they derived from a running instance of ST-Toolkit. Experiments Features Values Number of Records Number of Trajectories Number of Stops Number of Moves Number of POIs POIs Categories 447 Table 2:Dataset Statistical Description EventCrossingPattern_1 query have been explicitly formalized in the previous section, and sub-selecting a base of 1000 randomly chosen trajectory limit (simplified for visualization purpose), have produced as outcome a list of stop points located in Milan, at a distance of meters from a Restaurant location. The graphical geo-localized representation of this result can be seen in Figure 4. As it is possible to denote that the stops are properly located along the principal roads and randomly distributed along Milano (that is consequence of the high density of trajectories that are shown in 7
8 AGILE 2011, April 18-22: Simone Campora, Jose Antonio Fernandes de Macedo, Laura Spinsanti Figure 1). EventVisitingPatterns_1 query uses the whole trajectory dataset composed by trajectories: the top 10 results are shown in Table 3. Figure 4:Milan Stops near Restaurants The presence problem Number of Visits Moving Object Entity Table 3:Moving Objects Visits to Food Shops We run a last test on the spatial aggregation function used to calculate the Presence measure, given a subset of 260 episodes as shown in Figure 5. 8
9 St-Toolkit: A Framework for Trajectory Data Warehousing Figure 5:Presence Test Dataset around Milan municipality Most of the trajectories are traversing multiple areas. The issue encountered while calculating the Presence measure was the so-called double counting problem. It means that rolling-up on multiple areas where different trajectories were crossing those shapes could not have been done with a trivial counting without double counts (e.g. trajectories that were present on two areas were counted twice). Literature calls this the presence problem. It is one of the main drawbacks of the TDW model proposed by Orlando et al., We have developed an algorithm for solving the presence problem as it can be noticed from Figure 6. Figure 6:Presence Algorithm Counting This algorithm is quite straightforward: for each fact-data geometries, a basic spatial operator is checking if the object resides in how many regions in the lowest level space division: that becomes the crossed value. The inverse of crossed is a metadata attribute for each trajectory that can be used to easily weight the spatial count of all the trajectories of a given area without knowing other details on the other regions. The queries to extract the count are expressed as simple as that: 9
10 AGILE 2011, April 18-22: Simone Campora, Jose Antonio Fernandes de Macedo, Laura Spinsanti [ ] OlapQuery query = new OlapQuery(); query.addselection(presencemeasure, OlapQuery.COLUMNS); query.addselection (spacedimension.getlevel( Area ), OlapQuery.ROWS); query.setcube(stcube); query.execute(); This is the Olap Query to extract the count of each trajectory slicing on Municipalities, where presencemeasure is a virtual measure that is embedding our simple algorithm as an operator. Our model uses a fully-geometric spatial dimension that aggregates trajectories belonging to their spatial presence. We have made use of spatial functions to aggregate the OLAP measure that operates the count of super aggregates. In this way, despite of trajectories pass throughout more than one area, the roll up counting result of the SOLAP queries, sliced by the three spatial levels of our data warehouse schema,is correct and equal to the 260 trajectories chosen for the sub-set. RELATED WORK Among the approaches that have been proposed to model spatio-temporal data warehouses we made use of the one proposed by Malinowski and Zimanyi in Their mixed approach is coping with the lack of modeling robustness experienced in the literature. This approach added an object-oriented infrastructure layer to spatial data warehouses experimenting MultiDimER model. Another example of alternative models is the one presented by Stefanovic et al that adds spatial semantics to the whole multidimensional paradigm. The new dimension types that were defined are Non-geometric spatial dimensions (i.e. if it contains only non-geometric data), Geometric-to-non-geometric spatial dimension and Fully geometric spatial dimension. Among the first trajectory-enabled frameworks we can recall the data model implemented in Secondo (Güting et al., 2004) and Hermes (introduced by Pelekis et al., 2006). The first is an extensible database platform that allows implementation of open data models. Several papers are presenting solutions to moving objects problems (such as Düntgen et al., 2009 and Güting et al., 2009). Although this architecture has been enriched with an extensive variety of data models, it narrows data management applications to be used only on top of Secondo DBMS. On the other hand, Hermes is a trajectory data management framework that it is explicitly created for spatio-temporal data storage in order to provide modeling and 10
11 St-Toolkit: A Framework for Trajectory Data Warehousing querying facilities for trajectories of dynamic objects by using Oracle DBMS. Their approaches are both trying to cope with the limitations of data models that can be represented in traditional database systems. Our approach on the contrary is providing an intermediate layer that can be used to model database objects using the best of both worlds: It can be extended with user-defined data models and it can reuse the most deployed database management systems on the market. One of the major weaknesses in spatio-temporal data warehousing is the lack of taxonomies that defines which limitations and which expressive power are addressed in STOLAP applications. Vaisman and Zimanyi, 2009 defined a complete taxonomy for spatio-temporal OLAP applications, which is describing a perspective proposal of the possible features through spatiotemporal calculus algebra. Following this taxonomy we have subdivided the possible queries of our trajectory data warehouses (TDW) into three main categories: OLAP, Spatial OLAP, Spatio-Temporal OLAP. The only true implementation of TDW that can be recalled, is the solution proposed by Orlando et al. in Theirmodel is supporting unbounded and unpredictable stream rates of moving object data in order to feed a regular-grid space division. CONCLUSIONS In the last years we have seen a continuous evolution of tracking technologies in precision, costs and device support. Collecting mobility data is no more a problem, whereas handling and analyzing these spatio-temporal data is still an open challenge. In this context we have developed a generic cross-platform cross-database framework capable of implementing a common simple query interface to access different product-dependent data sources. Moreover, it contains a generic design interface supporting spatiotemporal concepts in data warehouse. We have provided an instantiation of a specific trajectory model to explain and support the system. As in many other fields, the main open issue for the future development and improvement still remain the integration of full semantic aspects into the traditional dimension of analysis. BIBLIOGRAPHY Düntgen C., T. Behr, and R. H. Güting. Berlinmod: a benchmark for moving object databases. VLDB,
12 AGILE 2011, April 18-22: Simone Campora, Jose Antonio Fernandes de Macedo, Laura Spinsanti Güting, R.H., T. Behr, V. Almeida, Z. Ding, F. Hoffmann, and M. Spiekermann. Secondo: An extensible dbms architecture and prototype. Technical report, 2004 Güting, R.H., M. H. Bohlen, M. Erwig, C. S. Jensen, N. A. Lorentzos, M. Schneider, F. Hagen, F. Hagen, and F. Hagen. A foundation for representing and querying moving objects. ACM Transactions on Database Systems, Güting, R.H., A. Braese, T. Behr, and J. Xu. Nearest neighbor search on moving object trajectories in secondo, Advances in Spatial and Temporal Databases 2009 Malinowski, E. and E. Zimanyi. Implementing spatial datawarehouse hierarchies in object-relational dbmss.iniceis 2007 Orlando, S., R. Orsini, A. Raffaeta, and A. Roncato. Trajectory data warehouses: Design and implementation issues. Journal of Computing Science and Engineering, 2007 Pelekis, N., Y. Theodoridis, S. Vosinakis, and T. Panayiotopoulos. Hermes - a framework for location-based data management. In EDBT 2006 Spaccapietra, S., C. Parent, M. L. Damiani, J. A. d. Macedo, F. Porto, and C. Vangenot. A conceptual view on trajectories. Data& Knowledge Engineering, 2007 Stefanovic, N., J. Han, and K. Koperski. Object-based selective materialization for efficient implementation of spatial data cubes. IEEE, 2000 Vaisman, A. and E. Zimányi. What is spatio-temporal data warehousing? In DAWAK, 2009 Zhixian, Y., C. Parent, S. Spaccapietra, and C. Dipanjan. A hybrid model and computing platform for spatio-semantic trajectories. ESWC
Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer
Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial
More informationCONTENTS. List of Contributors Preface Acknowledgments. mobility data modeling and representation
CONTENTS List of Contributors Preface Acknowledgments page xi xv xix part i mobility data modeling and representation 1 Trajectories and Their Representations 3 S. Spaccapietra, C. Parent, L. Spinsanti
More informationBreadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne
Breadboard BI Unlocking ERP Data Using Open Source Tools By Christopher Lavigne Introduction Organizations have made enormous investments in ERP applications like JD Edwards, PeopleSoft and SAP. These
More informationMOBILITY DATA MODELING AND REPRESENTATION
PART I MOBILITY DATA MODELING AND REPRESENTATION 1 Trajectories and Their Representations Stefano Spaccapietra, Christine Parent, and Laura Spinsanti 1.1 Introduction For a long time, applications have
More informationBUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
More informationPatient Trajectory Modeling and Analysis
Patient Trajectory Modeling and Analysis Jalel Akaichi and Marwa Manaa Higher Institute of Management of tunis, 41 Rue de la Liberté, Cité Bouchoucha, 2000 Bardo, Tunisia j.akaichi@gmail.com, manaamarwa@gmail.com
More informationIMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs
IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs Elzbieta Malinowski and Esteban Zimányi Computer & Decision Engineering Department, Université Libre de Bruxelles 50 av.f.d.roosevelt,
More informationData Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
More informationRESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,
More informationPaper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram
Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram Cognizant Technology Solutions, Newbury Park, CA Clinical Data Repository (CDR) Drug development lifecycle consumes a lot of time, money
More informationData Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
More informationCONTINUOUS DATA WAREHOUSE: CONCEPTS, CHALLENGES AND POTENTIALS
Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 CONTINUOUS DATA WAREHOUSE: CONCEPTS,
More informationFluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
More informationCOURSE SYLLABUS COURSE TITLE:
1 COURSE SYLLABUS COURSE TITLE: FORMAT: CERTIFICATION EXAMS: 55043AC Microsoft End to End Business Intelligence Boot Camp Instructor-led None This course syllabus should be used to determine whether the
More informationSPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING
SPATIO-TEMPORAL QUERIES FOR MOVING OBJECTS DATA WAREHOUSING 1 Leila Esheiba, Hoda M.O.Mokhtar 2, and Mohamed El-Sharkawi 3 1 Department of Information Systems, Faculty of Computers and Information, Cairo
More informationTracking System for GPS Devices and Mining of Spatial Data
Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja
More informationIntegrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making
Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making Professor Yvan Bedard, PhD, P.Eng. Centre for Research in Geomatics Laval Univ., Quebec, Canada National Technical University
More informationSQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
More informationA Seismic Data Management and Mining System
A Seismic Data Management and Mining System Sotiris Brakatsoulas and Yannis Theodoridis Computer Technology Institute, P.O. Box 1122, GR-26110 Patras, Greece http://www.cti.gr/rd3/dke Abstract. A Seismic
More informationWhen to consider OLAP?
When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP
More informationData Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
More informationThe STC for Event Analysis: Scalability Issues
The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.
More informationAN INTEGRATION APPROACH FOR THE STATISTICAL INFORMATION SYSTEM OF ISTAT USING SDMX STANDARDS
Distr. GENERAL Working Paper No.2 26 April 2007 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL
More informationLost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
More informationOLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
More information2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000
2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 Introduction This course provides students with the knowledge and skills necessary to design, implement, and deploy OLAP
More informationPrivacy-preserving data warehousing for spatiotemporal
Privacy-preserving data warehousing for spatiotemporal data Maria L. Damiani, Università Milano (I) GEOPKDD - Meeting Venezia 17 Oct 05 1 Report The report contains two contributions: M.L. Damiani, S.
More informationA Survey on Spatio-Temporal Data Warehousing
A Survey on Spatio-Temporal Data Warehousing Leticia Gómez Instituto Tecnológico de Buenos Aires Bart Kuijpers, Bart Moelans, Alejandro Vaisman University of Hasselt and Transnational University of Limburg,
More informationAdvanced Data Warehouse Design
Data-Centric Systems and Applications Advanced Data Warehouse Design From Conventional to Spatial and Temporal Applications Bearbeitet von Elzbieta Malinowski, Esteban Zimányi 1st ed. 2008. Corr. 2nd printing
More informationHow To Evaluate Web Applications
A Framework for Exploiting Conceptual Modeling in the Evaluation of Web Application Quality Pier Luca Lanzi, Maristella Matera, Andrea Maurino Dipartimento di Elettronica e Informazione, Politecnico di
More informationTowards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study
www.ijcsi.org 561 Towards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study Saida Aissi 1, Mohamed Salah Gouider 2 Bestmod Laboratory, University of Tunis, High
More informationIAF Business Intelligence Solutions Make the Most of Your Business Intelligence. White Paper November 2002
IAF Business Intelligence Solutions Make the Most of Your Business Intelligence White Paper INTRODUCTION In recent years, the amount of data in companies has increased dramatically as enterprise resource
More informationConstructing Semantic Interpretation of Routine and Anomalous Mobility Behaviors from Big Data
Constructing Semantic Interpretation of Routine and Anomalous Mobility Behaviors from Big Data Georg Fuchs 1,3, Hendrik Stange 1, Dirk Hecker 1, Natalia Andrienko 1,2,3, Gennady Andrienko 1,2,3 1 Fraunhofer
More informationRequirements engineering for a user centric spatial data warehouse
Int. J. Open Problems Compt. Math., Vol. 7, No. 3, September 2014 ISSN 1998-6262; Copyright ICSRS Publication, 2014 www.i-csrs.org Requirements engineering for a user centric spatial data warehouse Vinay
More informationDATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
More information4-06-35. John R. Vacca INSIDE
4-06-35 INFORMATION MANAGEMENT: STRATEGY, SYSTEMS, AND TECHNOLOGIES ONLINE DATA MINING John R. Vacca INSIDE Online Analytical Modeling (OLAM); OLAM Architecture and Features; Implementation Mechanisms;
More informationIntegrating SAP and non-sap data for comprehensive Business Intelligence
WHITE PAPER Integrating SAP and non-sap data for comprehensive Business Intelligence www.barc.de/en Business Application Research Center 2 Integrating SAP and non-sap data Authors Timm Grosser Senior Analyst
More informationTraining Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France
More informationImplementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778
Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778 Course Outline Module 1: Introduction to Business Intelligence and Data Modeling This module provides an introduction to Business
More informationMario Guarracino. Data warehousing
Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the
More informationChapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
More informationOracle Data Integrator: Administration and Development
Oracle Data Integrator: Administration and Development What you will learn: In this course you will get an overview of the Active Integration Platform Architecture, and a complete-walk through of the steps
More informationDatabases in Organizations
The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron
More informationInvestigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses
Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Thiago Luís Lopes Siqueira Ricardo Rodrigues Ciferri Valéria Cesário Times Cristina Dutra de
More informationGeoKettle: A powerful open source spatial ETL tool
GeoKettle: A powerful open source spatial ETL tool FOSS4G 2010 Dr. Thierry Badard, CTO Spatialytics inc. Quebec, Canada tbadard@spatialytics.com Barcelona, Spain Sept 9th, 2010 What is GeoKettle? It is
More informationAnalyzing Polls and News Headlines Using Business Intelligence Techniques
Analyzing Polls and News Headlines Using Business Intelligence Techniques Eleni Fanara, Gerasimos Marketos, Nikos Pelekis and Yannis Theodoridis Department of Informatics, University of Piraeus, 80 Karaoli-Dimitriou
More informationMobility data analysis to understand unknown diseases behavior The case of facial paralysis
Mobility data analysis to understand unknown diseases behavior The case of facial paralysis Jalel Akaichi PhD, University of Sciences and Technologies of Lille Associate Professor, University of Tunis
More informationA Service-oriented Architecture for Business Intelligence
A Service-oriented Architecture for Business Intelligence Liya Wu 1, Gilad Barash 1, Claudio Bartolini 2 1 HP Software 2 HP Laboratories {name.surname@hp.com} Abstract Business intelligence is a business
More informationChapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
More informationA Prototype System for Educational Data Warehousing and Mining 1
A Prototype System for Educational Data Warehousing and Mining 1 Nikolaos Dimokas, Nikolaos Mittas, Alexandros Nanopoulos, Lefteris Angelis Department of Informatics, Aristotle University of Thessaloniki
More informationORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process
ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced
More informationM2074 - Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 5 Day Course
Module 1: Introduction to Data Warehousing and OLAP Introducing Data Warehousing Defining OLAP Solutions Understanding Data Warehouse Design Understanding OLAP Models Applying OLAP Cubes At the end of
More informationFoundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Problem: HP s numerous systems unable to deliver the information needed for a complete picture of business operations, lack of
More informationCourse 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
More informationIndex Selection Techniques in Data Warehouse Systems
Index Selection Techniques in Data Warehouse Systems Aliaksei Holubeu as a part of a Seminar Databases and Data Warehouses. Implementation and usage. Konstanz, June 3, 2005 2 Contents 1 DATA WAREHOUSES
More informationA Novel Approach for a Collaborative Exploration of a Spatial Data Cube
International Journal of Computer and Communication Engineering, Vol. 3, No. 1, January 2014 A Novel Approach for a Collaborative Exploration of a Spatial Data Cube Olfa Layouni and Jalel Akaichi Geographic
More informationJava Metadata Interface and Data Warehousing
Java Metadata Interface and Data Warehousing A JMI white paper by John D. Poole November 2002 Abstract. This paper describes a model-driven approach to data warehouse administration by presenting a detailed
More informationSprogo Research and a Case Study in field Data Management
A New Relational Spatial OLAP Approach For Multi-resolution and Spatio-multidimensional Analysis of Incomplete Field Data Mehdi Zaamoune, Sandro Bimonte, François Pinet, Philippe Beaune To cite this version:
More informationTopBraid Insight for Life Sciences
TopBraid Insight for Life Sciences In the Life Sciences industries, making critical business decisions depends on having relevant information. However, queries often have to span multiple sources of information.
More informationA Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel
A Next-Generation Analytics Ecosystem for Big Data Colin White, BI Research September 2012 Sponsored by ParAccel BIG DATA IS BIG NEWS The value of big data lies in the business analytics that can be generated
More informationORACLE DATA INTEGRATOR ENTERPRISE EDITION
ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition 12c delivers high-performance data movement and transformation among enterprise platforms with its open and integrated
More informationTrajectory Data Warehouses: Design and Implementation Issues
Trajectory Data Warehouses: Design and Implementation Issues Salvatore Orlando, Renzo Orsini, Alessandra Raffaetà, Alessandro Roncato Dipartimento di Informatica - Università Ca Foscari Venezia {orlando,
More informationDesigning Data Warehouses for Geographic OLAP querying by using MDA
Designing Data Warehouses for Geographic OLAP querying by using MDA Octavio Glorio and Juan Trujillo University of Alicante, Spain, Department of Software and Computing Systems Lucentia Research Group
More informationOracle Warehouse Builder 10g
Oracle Warehouse Builder 10g Architectural White paper February 2004 Table of contents INTRODUCTION... 3 OVERVIEW... 4 THE DESIGN COMPONENT... 4 THE RUNTIME COMPONENT... 5 THE DESIGN ARCHITECTURE... 6
More informationLection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
More informationContinuous Spatial Data Warehousing
Continuous Spatial Data Warehousing Taher Omran Ahmed Faculty of Science Aljabal Algharby University Azzentan - Libya Taher.ahmed@insa-lyon.fr Abstract Decision support systems are usually based on multidimensional
More informationCONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive
More informationMeta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring
www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty
More informationOpen source geospatial Business Intelligence (BI) in action!
Open source geospatial Business Intelligence (BI) in action! OGRS 2009 Nantes, France Dr. Thierry Badard Etienne Dubé Belko Diallo Jean Mathieu Mamadou Ouattara GeoSOA research group Centre for Research
More informationSDWM: An Enhanced Spatial Data Warehouse Metamodel
SDWM: An Enhanced Spatial Data Warehouse Metamodel Alfredo Cuzzocrea 1, Robson do Nascimento Fidalgo 2 1 ICAR-CNR & University of Calabria, 87036 Rende (CS), ITALY cuzzocrea@si.deis.unical.it. 2. CIN,
More informationImplementing Data Models and Reports with Microsoft SQL Server
Course 20466C: Implementing Data Models and Reports with Microsoft SQL Server Course Details Course Outline Module 1: Introduction to Business Intelligence and Data Modeling As a SQL Server database professional,
More informationVISUALIZATION OF GEOMETRICAL AND NON-GEOMETRICAL DATA
VISUALIZATION OF GEOMETRICAL AND NON-GEOMETRICAL DATA Maria Beatriz Carmo 1, João Duarte Cunha 2, Ana Paula Cláudio 1 (*) 1 FCUL-DI, Bloco C5, Piso 1, Campo Grande 1700 Lisboa, Portugal e-mail: bc@di.fc.ul.pt,
More informationTopics in basic DBMS course
Topics in basic DBMS course Database design Transaction processing Relational query languages (SQL), calculus, and algebra DBMS APIs Database tuning (physical database design) Basic query processing (ch
More informationOpen Source Business Intelligence Intro
Open Source Business Intelligence Intro Stefano Scamuzzo Senior Technical Manager Architecture & Consulting Research & Innovation Division Engineering Ingegneria Informatica The Open Source Question In
More informationEuropean Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project Janet Delve, University of Portsmouth Kuldar Aas, National Archives of Estonia Rainer Schmidt, Austrian Institute
More informationDimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
More informationCopyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
More informationIntegration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606
Integration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606 J.A. Maldonado, M. Robles, P. Crespo Bioengineering, Electronics and Telemedicine
More informationDesigning an Object Relational Data Warehousing System: Project ORDAWA * (Extended Abstract)
Designing an Object Relational Data Warehousing System: Project ORDAWA * (Extended Abstract) Johann Eder 1, Heinz Frank 1, Tadeusz Morzy 2, Robert Wrembel 2, Maciej Zakrzewicz 2 1 Institut für Informatik
More information1 File Processing Systems
COMP 378 Database Systems Notes for Chapter 1 of Database System Concepts Introduction A database management system (DBMS) is a collection of data and an integrated set of programs that access that data.
More informationA Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
More informationBusiness Intelligence In SAP Environments
Business Intelligence In SAP Environments BARC Business Application Research Center 1 OUTLINE 1 Executive Summary... 3 2 Current developments with SAP customers... 3 2.1 SAP BI program evolution... 3 2.2
More informationK@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
More informationGEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington
GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise
More informationPowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.
PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions A Technical Whitepaper from Sybase, Inc. Table of Contents Section I: The Need for Data Warehouse Modeling.....................................4
More informationCOURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design
COURSE OUTLINE Track 1 Advanced Data Modeling, Analysis and Design TDWI Advanced Data Modeling Techniques Module One Data Modeling Concepts Data Models in Context Zachman Framework Overview Levels of Data
More informationEasyTracker: An Android application for capturing mobility behavior
EasyTracker: An Android application for capturing mobility behavior Alexandros Doulamis Dept. of Informatics University of Piraeus Piraeus, Greece alex.doulamis@gmail.com Nikos Pelekis Dept. of Statistics
More informationImplementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days
Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5
More informationPhoCA: An extensible service-oriented tool for Photo Clustering Analysis
paper:5 PhoCA: An extensible service-oriented tool for Photo Clustering Analysis Yuri A. Lacerda 1,2, Johny M. da Silva 2, Leandro B. Marinho 1, Cláudio de S. Baptista 1 1 Laboratório de Sistemas de Informação
More information1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application
More informationCHAPTER-24 Mining Spatial Databases
CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification
More informationData Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: surajitc@microsoft.com 1 Introduction The promise of decision support systems is to exploit enterprise
More informationSPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
More informationwww.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 1
Data Warehouse Security Akanksha 1, Akansha Rakheja 2, Ajay Singh 3 1,2,3 Information Technology (IT), Dronacharya College of Engineering, Gurgaon, Haryana, India Abstract Data Warehouses (DW) manage crucial
More informationA DATA WAREHOUSE SOLUTION FOR E-GOVERNMENT
A DATA WAREHOUSE SOLUTION FOR E-GOVERNMENT Xiufeng Liu 1 & Xiaofeng Luo 2 1 Department of Computer Science Aalborg University, Selma Lagerlofs Vej 300, DK-9220 Aalborg, Denmark 2 Telecommunication Engineering
More information5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2
Class Announcements TIM 50 - Business Information Systems Lecture 15 Database Assignment 2 posted Due Tuesday 5/26 UC Santa Cruz May 19, 2015 Database: Collection of related files containing records on
More informationIST722 Data Warehousing
IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF
More information3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools
Paper by W. F. Cody J. T. Kreulen V. Krishna W. S. Spangler Presentation by Dylan Chi Discussion by Debojit Dhar THE INTEGRATION OF BUSINESS INTELLIGENCE AND KNOWLEDGE MANAGEMENT BUSINESS INTELLIGENCE
More informationTowards a Logical Multidimensional Model for Spatial Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista
Towards a Logical Multidimensional Model for Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista Information System Laboratory - LSI, Federal University of Campina
More information