Continuous Spatial Data Warehousing
|
|
- Norman Hood
- 8 years ago
- Views:
Transcription
1 Continuous Spatial Data Warehousing Taher Omran Ahmed Faculty of Science Aljabal Algharby University Azzentan - Libya Taher.ahmed@insa-lyon.fr Abstract Decision support systems are usually based on multidimensional structures which use the concept of hypercube. Dimensions are the axes of analysis and form a space where a fact is located by a set of coordinates at the intersections of members of dimensions. Conventional multidimensional structures deal with discrete facts linked to discrete dimensions. However, when dealing with natural continuous phenomena the discrete representation is not adequate. There is a need to integrate spatiotemporal continuity within multidimensional structures to enable analysis and exploration of continuous data. There is a multitude of research issues that lead to the integration of spatiotemporal continuity in multidimensional structures. In this paper, we discuss some of these, present briefly a multidimensional model for continuous field data. We also define new aggregation operations. The model and the associated operations and measures are validated by a prototype. 1. Introduction The last few years have seen an explosive growth of the size of data produced by the different kinds of sensors dispersed all over the surface of the earth. These sensors measure different kinds of natural phenomena at different time intervals. Exploiting these huge volumes of data in a decisional context is the main function of spatial data warehouses (SDW). Data warehouses are used in analysis purposes that involve examining data and possibly identifying relationships that may exist between different elements. Since we deal with information on or above the surface of the earth, therefore spatial positions are involved i.e. we deal with spatial data. In an operational processing, geographic information systems (GIS) have shown their ability in managing geographic data. The effectiveness of GIS comes mainly from their capacity of linking different information in a spatial context and in drawing conclusions from the different relations that exist between different phenomena. However GIS are oriented towards spatial data management not towards effective analysis. In this paper we present our research towards integrating spatiotemporal continuity in decision support systems. In the second section we discuss the differences between GIS and decisions support systems. Section 3 presents the research objectives, issues and motivations. The fourth section contains a theoretical background. In section 5 we go briefly over a multidimensional model for field based data and present a prototype of a continuous spatial data warehouse. The conclusions and future work are presented in section GIS and Decision Support Systems GIS is defined as an information system that is used to input, store, retrieve, manipulate, analyze, and output geographically referenced data or geospatial data [6]. The main capabilities of GIS are to integrate large volumes of spatial and non-spatial data and enhance problem understanding through data visualization in map forms. Multidimensional modeling and On-Line Analytical Processing (OLAP) allow intuitive, easy and fast analysis of huge volumes of data [8]. In general, OLAP provides aggregated summary information, collected from multiple sources, and modeled in multidimensional structures. Dimensions usually have hierarchical structures and correspond to a business perspective where each cell at the intersection of members of dimensions may contain the value(s) of some aggregated measure. Data hypercube refer to the computation of an aggregation over all combinations of dimensions.
2 OLAP technology on which decision support systems are based, involves entirely different concepts from the transactional architecture of GIS. This architecture is insufficient for knowledge discovery and data mining [3]. The concepts of dimensions, measures, hierarchies etc are not supported by GIS. Conversely, DSS do not support cartographic display which can be helpful for decision making. Spatial decision support systems are designed to provide analysis tools and help decision makers solve complex spatial problems. Providing a dependable spatial decision support requires the coupling of GIS and OLAP so that the latter provides multidimensionality and the first manipulates spatial data [10]. 3. Motivations and Research Issues Based on the nature of data represented there two data conceptualization in GIS. The first is discrete which concerns spatial data that have clear boundaries and the second is continuous which deals with natural phenomena that exist continuously in time and in space. Despite the huge amounts of continuous data collected by all types of sensors there has not been much work done on data warehouse modeling for this type of data. Most of the work on SDW and SOLAP deal with discrete spatial data. To the best of our knowledge the work done on spatial decision support systems deals only with the discrete representation. Even when continuous data are dealt with in a decisional context [7][12][14] they are treated as discrete. In conventional multidimensional structures all data involved are discrete. Dimensions are organized in discrete hierarchical levels where each level has a finite set of discrete members. A discrete fact may be found at the intersection of members. However for continuous data the spatial and temporal dimensions are not discrete and must not be treated as such. Phenomena take place everywhere and constantly without disruption. Nevertheless, they cannot be measured continuously at all points in space nor can these measurements be stored in databases due to several factors like the discrete nature of computers. Only samples are measured and stored which yields a discrete representation of a continuous phenomenon. This representation is reflected in discrete spatial and temporal dimensions in SDW and SOLAP. Since a distinction is made in GIS on the detailed data level with respect to the continuous and discrete representation, the same distinction should be made on the aggregated level. Our main objective is integrating continuity within multidimensional structures to enable analysis and exploration on both continuous and discrete data. Integrating spatiotemporal continuity within multidimensional structures accomplishes the following objectives [1]: - Recover hidden information: A continuous representation can recover lost or hidden information that results from representing continuous data discretely. - Analysis at Detailed Levels of Hierarchy: There are numerous cases where the need of low granularity analysis arises as in the cases of disaster management. This type of analysis is can be made possible by continuous multidimensional structures. - Continuous analysis: Because of the discrete representation, natural phenomena are analyzed as discrete spatial objects and as snapshots of data values over different periods of time. It would be more realistic to analyze natural phenomena as they occur in real life where they evolve continuously in space and in time rather than as a collection of discrete pieces. There are several issues involved in integrating continuity in multidimensional structures [1]: - Continuous multidimensional model. All existing models deal with discrete dimensions related to discrete facts. To the best of our knowledge, no work has been done on modeling multidimensional structures for continuous data with the exception of [12] who focuses on using the known density of data to calculate queries without accessing the data. - Range of continuity. One of the important issues is where does the continuity begin and end hierarchically? Does continuity imply producing new finer levels or should it only produce data for any instant in time? And can higher levels be continuous? - Choice of Interpolation methods. Interpolation methods differ in their assumptions, methodologies, complexity, and deterministic or stochastic nature. In addition the performance of the method is an important factor for DSS since they require fast response. - Storage and optimization. To obtain fast response OLAP uses pre-aggregation to eliminate the overhead of calculating SQL aggregations during run time. For continuous data, the complexity of the problem is augmented because of interpolation. Interpolating means that pre aggregation can not be performed for several or all operators.
3 - Operations in continuous hypercubes. The classic OLAP operators need to be extended to be applicable in the new structures. The introduction of continuity could change the results of different operations. Also there will be need to formally describe new operators. As an example, the operation sum in the discrete structure will be integration in the continuous structure. - Result visualization. Results must be displayed in an intuitive and attractive manner to provide a user friendly environment for decision making. This includes cartographic display, grid, graphic representation etc. It is also crucial to give an indication of how good the estimation is through the use of quality indicators. 4. Theoretical Background Despite the multitude multidimensional models proposed in the literature, no consensus has been made on one formal model. In spatial data warehousing, most of the work focuses on case studies and prototypes with little concentration on sound formalism. In [9] a framework for multidimensional exploratory spatiotemporal analysis is proposed. It focuses mainly on the hierarchy theory. A dimension hierarchy is defined as a 4-tuple H = (V, F, G, ) where V denotes the node or the vertex of a hierarchy. Each member of V is associated with a domain of elements. G is the dimension path which is a totally ordered list of nodes. The symbol denotes that a dimension path is a linear totally ordered list of nodes. Every adjacent vertex pair in a path is associated with a partition mapping function called, categorization function, F = {f 0, f 1, f 2, } such that fi = f i : domain (v i ) domain (v i+1 ) i 1. The main drawback is the lack of formalism of the common operators. [11] extends a conceptual multidimensional model with spatial dimensions, spatial hierarchies and spatial measures. A multidimensional model is defined as a finite set of dimensions and fact relationships. A dimension is composed of hierarchies which are composed of one or several levels represented as entity types. Hierarchies that contain only one level are called basic hierarchies. Levels are related to each other by a partial order relationship. For any two consecutive related levels l i, l j, the level l j is called parent and l i is called child if l i l j. They also define a category attribute that shows how child members are grouped. In [4] a model that introduces the concept of entity schema and entity instances is presented. The entity schema S e is a tuple of attributes a i defined on a domain dom(a i ) where an attribute is an alphanumeric identifier. The entity instance t i over S e is a tuple linking values from dom(a i ) to each attribute a i. Entity schemas and instances are used to model real world objects of a multidimensional application model. Fact tables are represented by base cube schemas where all dimensions are at their lowest level except the measure which can be at any level. A base cube schema SBC bc is tuple SBC bc = S 1, S m, S f, δ where no 2 schemas are equivalent and δ is a Boolean function indicating whether a value exists at any combination of instances. Cubes are base cubes after aggregation. All of these models deal only with discrete spatial data. 5. A Continuous MD Model The main objective of our work is to define a new model or extend an existing one so that it takes into account spatiotemporal data characteristics. The model proposed in [15] is the best candidate for extension since it formally defines most of the OLAP operations and it is based on the idea of basic cubes that allows serial performing of operations. In this paper we present some of the necessary elements we added. More details are found in [1]. 5.1 Basic cubes We distinguish two types of basic cubes. The first type of basic cubes is the discrete basic cube discc b which is a 3-tuple <D b, L b, R b > where D b is a list of dimensions including a dimension measure M. L b is the list of the lowest levels of each dimension and R b is a set of cells data represented as a set of tuples containing both level members and measures in the form of S=[s 1, s 2, s 3, s n,m] where m is the dimension that represents the measure. To achieve a continuous representation of the basic cube, estimated values are derived from the discc b. Estimated measures related to the infinite members of a given continuous dimension are estimated using actual cell values from the discc b. This is achieved by applying interpolation functions to a sample of the discc b values which will give the second type of basic cubes "continuous basic cube" contc b. The number of tuples in the contc b is theoretically infinite since a dimension level in the class of continuous dimensions contains an infinite number of members. We define
4 contc b, as 4-tuple <D b,( D b, F), L b, R b >. Where D b, is a set of discrete dimensions and D b is a set of continuous dimensions. It is clear seen that discc b contc b as sample values are included within the contc b. 5.2 Cubes Cubes are built from basic cubes. A cube C is defined as 4-tuple <D, L, contc b, R> where D is a list of dimensions including M as defined above, L is the respective dimension level, R is cell data and contc b is the basic cube from which the cube C is built. Because of the nature of continuous field data, different aggregation functions are used to build the cube at higher dimension hierarchies. For example, the sum of the measure for a specific region will be represented as the integration of the function representing the phenomenon. Other aggregation functions like min, max or average will be performed on contc b and their results will be assigned to higher levels of the hierarchy. Based on data values used to obtain an aggregation, the aggregation on continuous multidimensional structures can be classified as either discrete or continuous. Discrete operations use only sample data values and their aggregations will correspond to the discrete higher levels. Continuous operations use all data values of the field (sample and estimated values). Their aggregations correspond to the higher levels resulting in aggregated values based on continuous representation. The same multidimensional schema is used for both classes with the only difference being the detailed data used in calculating the aggregations. One can imagine the existence of a parallel hierarchy that is used for the continuous representations (Figure 1). It should be noted that continuous aggregated values and discrete aggregated values are not necessarily equal. Figure 1. Continuous and discrete representation of dimensions Discrete aggregations : They include the conventional OLAP operations and are calculated based on the real observed values. We list here the most common operations: DiscMax = v i such that v i > v j V* DiscMin = v i such that v i < v j V* DiscSum = n v i i = 1 of observed values. n v i for v i V* where n is the number DiscAvg = i for v i V* where n is the Card (V *) number of observed values. Continuous aggregations The second category of operations concerns the operations that involve all domain values of the phenomenon i.e. all observed and estimated values of the phenomenon are used to produce a hypercube based on the continuous representation. In addition to aggregation functions a new calculated measure has been defined that apply only on continuous data. Continuity is considered on either an interval of time or on a specified region. The continuous aggregations on the measure of the continuous field are 1 : ContMax = v i such that v i > v j V ContMin = v i such that v i < v j V ContSptSum = f x, y, t ) dxdy 1 Definition of continuous field from [1]. ( where (x, y) D
5 ContTmpSum= f ( x, y, t) dxdydt where (x,y) D ContSptAvg= ) ContTemporalAvg = f ( x, y, t dxdy where (x, y) D area f ( x, y, t ) dxdydt where t t 2 1 [t 1 : t 2 ] is an interval of time and (x, y) D The calculated measure gradient, which is the change of the value of the field that results by a change of a unit of space, can be applied to most detailed level of data and its results can be aggregated to higher levels : Gradient = grad((x, y), t) = v= (x, y) D. f f f,, x y t where 5.3 Continuous SOLAP prototype One of the applications that can benefit from continuous multidimensional structures is air pollution analysis. We designed and implemented a prototype of an application that observes air pollution in order to validate the model defined above and to show the potentials of what we termed Continuous SOLAP or CSOLAP. In order to validate our model and test the performance of our data warehouse a considerable volume of data is required. We used data published by AIRPARIF [2]. However this set covers only the Parisian region so we had to simulate data values for the rest of France and we ended up with a large set of data that we stored in an SQL Server data warehouse. To create a continuous representation all dimensions must be at their lowest level then we choose an interpolation function from a list of different functions. This will create a thematic map on the fly. The thematic map contains all values of the field for the chosen dimensions (Figure 2). Figure 2. Interface of CSOLAP We can navigate on this map or from the menus and tools bars to obtain higher level aggregations. 6. Conclusions Spatial decision support systems have benefited from advances made in OLAP and data warehouses technologies. The emergence of SOLAP has made spatial decision support easy and flexible. However the perception of space and time is still limited to the discrete perception which does not represent natural phenomena correctly. Our work aims at the integration of continuity in multidimensional structures. Starting from a discrete hypercube, a continuous hypercube is created by applying interpolation functions over cell data. We have two representations: the first is an internal representation, the second is an external representation. A multidimensional model dedicated to continuous field data was defined along with a set of operations. The model and the prototype are validated by a prototype of a data warehouse of air pollution. However not all research issues listed in this paper have been looked into. Our main objective was defining a formal model. Defining additional operators, metadata management, dealing with different intervals of time, storing continuous hypercube and ad-hoc hierarchies are a list of the perspectives of this research. 7. References [1] T. O. Ahmed, and M. Miquel. Multidimensional Structures Dedicated to Continuous Spatiotemporal Phenomena. Proc 22 nd British National Conf. on Databases (BNCOD22), Sunderland. 2005, [2] AIRAPRIF, Monitoring the quality of air in Ile de France. [3] Y. Bédard, T., Merrett, and J. Han. Fundamentals of Spatial Data Warehousing for Geographic Knowledge Discovery in Geographic Data Mining and Knowledge Discovery. Research Monographs in GIS series edited by Peter Fisher and Jonathan Raper [4] Bimonte, S., Tchounikine, A. and Miquel M. Towards a Spatial Multidimensional Model. In the Proc. of the ACM 8 th Int. Workshop on Data Warehousing and OLAP, 31 Oct. - 5 Nov. 2005, Bremen, Germany. [5] Cowen, D. J. GIS versus CAD versus DBMS: What are the differences? Fotogrammetric Engineering and Remote Sensing, 54, 1988, [6] M. Goodchild. Geographical information science. International Journal of GIS, 2003, 6, [7] H. Hasenauer, I. Haslik, R. Rosenthaler, G. Pernul and D. Stangl. Conceptual framework of a data warehouse for the National park Hohe Tauern. Proc. 13 th Int. Symposium "Informatik für den Umweltschutz" der Gesellschaft für Informatik (GI), Magdeburg, 1999,
6 [8] W. H. Inmon. Building the Data Warehouse. John Wiley and sons [9] Kemp, Z. and Lee, H Multidimensional Model for Explatory Spatiotemporal Analysis. Proc. of the 5th International Conference on GeoComputation, University of Greenwich, UK. [10] Z. Kouba, K. Matousek and P. Milkovsky. On Data Warehouse and GIS integration. Proc. of the 11 th Int. Conf. and Workshop on Database and Expert Systems Applications, Greenwich, 2000, [11] E. Malinowski and E. Zimányi, Representing Spatiality in a Conceptual Multidimensional Model. GIS 04. Washington, DC, USA pp [12] D. G. Morgan, and T. Glover. Distributing Data Ownership: The Northwestern Geospatial Data Network. GIS Vancouver, B.C., February [13] J. Shanmugasundaram Fayyad, U. M. and Bradely, P. S. Compressed data cubes for OLAP Aggregate Query Approximation on Continuous Dimensions. Proc. of the 5 th ACM SIGKGG Int. Conf. on Discovery and Data Mining (KDD99), New York, 1999, [14] X. Tan. Data Warehousing and Its Potential Using in Weather Forecast. Proc. 22 nd Int. Conf. on Interactive Information Processing Systems for Meteorology, Oceanography, and Hydrology. Atlanta, GA [15] P. Vassiliadis. Modeling Multidimensional Databases, Cubes and Cube Operations. Proc. of the 10 th Int. Conf. on Scientific and Statistical Database Management (SSDBM), Capri, Italy, 1998.
Multidimensional Structures Dedicated to Continuous Spatiotemporal Phenomena
Multidimensional Structures Dedicated to Continuous Spatiotemporal Phenomena Taher Omran Ahmed and Maryvonne Miquel LIRIS INSA de Lyon, Bât. Blaise Pascal 501.302 7 Ave. Jean Capelle, 69621 Villeurbanne,
More informationCONTINUOUS DATA WAREHOUSE: CONCEPTS, CHALLENGES AND POTENTIALS
Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 CONTINUOUS DATA WAREHOUSE: CONCEPTS,
More informationRequirements engineering for a user centric spatial data warehouse
Int. J. Open Problems Compt. Math., Vol. 7, No. 3, September 2014 ISSN 1998-6262; Copyright ICSRS Publication, 2014 www.i-csrs.org Requirements engineering for a user centric spatial data warehouse Vinay
More informationa Geographic Data Warehouse for Water Resources Management
Towards a Geographic Data Warehouse for Water Resources Management Nazih Selmoune, Nadia Abdat, Zaia Alimazighi LSI - USTHB 2 Introduction A large part of the data in all decisional systems is geo-spatial
More informationIMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs
IMPLEMENTING SPATIAL DATA WAREHOUSE HIERARCHIES IN OBJECT-RELATIONAL DBMSs Elzbieta Malinowski and Esteban Zimányi Computer & Decision Engineering Department, Université Libre de Bruxelles 50 av.f.d.roosevelt,
More informationSprogo Research and a Case Study in field Data Management
A New Relational Spatial OLAP Approach For Multi-resolution and Spatio-multidimensional Analysis of Incomplete Field Data Mehdi Zaamoune, Sandro Bimonte, François Pinet, Philippe Beaune To cite this version:
More informationDIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach
DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach Cécile Favre, Fadila Bentayeb, Omar Boussaid ERIC Laboratory, University of Lyon, 5 av. Pierre Mendès-France, 69676 Bron Cedex, France
More informationOLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
More informationCopyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
More informationRESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,
More informationFuzzy Spatial Data Warehouse: A Multidimensional Model
4 Fuzzy Spatial Data Warehouse: A Multidimensional Model Pérez David, Somodevilla María J. and Pineda Ivo H. Facultad de Ciencias de la Computación, BUAP, Mexico 1. Introduction A data warehouse is defined
More informationIntegrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making
Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making Professor Yvan Bedard, PhD, P.Eng. Centre for Research in Geomatics Laval Univ., Quebec, Canada National Technical University
More informationCHAPTER-24 Mining Spatial Databases
CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification
More informationA Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
More informationSpatial Data Warehouse and Mining. Rajiv Gandhi
Spatial Data Warehouse and Mining Rajiv Gandhi Roll Number 05331002 Centre of Studies in Resource Engineering Indian Institute of Technology Bombay Powai, Mumbai -400076 India. As part of the first stage
More informationSpatial Data Warehouse Modelling
Spatial Data Warehouse Modelling 1 Chapter I Spatial Data Warehouse Modelling Maria Luisa Damiani, DICO - University of Milan, Italy Stefano Spaccapietra, Ecole Polytechnique Fédérale, Switzerland Abstract
More informationHANDLING IMPRECISION IN QUALITATIVE DATA WAREHOUSE: URBAN BUILDING SITES ANNOYANCE ANALYSIS USE CASE
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong HANDLING
More informationAlejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer
Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial
More informationInvestigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses
Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Thiago Luís Lopes Siqueira Ricardo Rodrigues Ciferri Valéria Cesário Times Cristina Dutra de
More informationTracking System for GPS Devices and Mining of Spatial Data
Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja
More informationDimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
More informationPART 1. Representations of atmospheric phenomena
PART 1 Representations of atmospheric phenomena Atmospheric data meet all of the criteria for big data : they are large (high volume), generated or captured frequently (high velocity), and represent a
More informationReading Questions. Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented?
Reading Questions Week two Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented? 2. What sort of problems are GIS designed to solve?
More informationDEVELOPMENT OF A SOLAP PATRIMONY MANAGEMENT APPLICATION SYSTEM: FEZ MEDINA AS A CASE STUDY
International Journal of Computer Science and Applications, 2008, Vol. 5, No. 3a, pp 57-66 Technomathematics Research Foundation, DEVELOPMENT OF A SOLAP PATRIMONY MANAGEMENT APPLICATION SYSTEM: FEZ MEDINA
More informationData Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
More informationA Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems
Proceedings of the Postgraduate Annual Research Seminar 2005 68 A Model-based Software Architecture for XML and Metadata Integration in Warehouse Systems Abstract Wan Mohd Haffiz Mohd Nasir, Shamsul Sahibuddin
More informationTechnologies & Applications
Chapter 10 Emerging Database Technologies & Applications Truong Quynh Chi tqchi@cse.hcmut.edu.vn Spring - 2013 Contents 1 Distributed Databases & Client-Server Architectures 2 Spatial and Temporal Database
More information2 Associating Facts with Time
TEMPORAL DATABASES Richard Thomas Snodgrass A temporal database (see Temporal Database) contains time-varying data. Time is an important aspect of all real-world phenomena. Events occur at specific points
More informationData warehouses. Data Mining. Abraham Otero. Data Mining. Agenda
Data warehouses 1/36 Agenda Why do I need a data warehouse? ETL systems Real-Time Data Warehousing Open problems 2/36 1 Why do I need a data warehouse? Why do I need a data warehouse? Maybe you do not
More informationData Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: surajitc@microsoft.com 1 Introduction The promise of decision support systems is to exploit enterprise
More informationHorizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
More informationSPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
More informationIntroduction to GIS. Dr F. Escobar, Assoc Prof G. Hunter, Assoc Prof I. Bishop, Dr A. Zerger Department of Geomatics, The University of Melbourne
Introduction to GIS 1 Introduction to GIS http://www.sli.unimelb.edu.au/gisweb/ Dr F. Escobar, Assoc Prof G. Hunter, Assoc Prof I. Bishop, Dr A. Zerger Department of Geomatics, The University of Melbourne
More informationData Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier
Data Mining: Concepts and Techniques Jiawei Han Micheline Kamber Simon Fräser University К MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF Elsevier Contents Foreword Preface xix vii Chapter I Introduction I I.
More informationChapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
More informationInformation Visualization WS 2013/14 11 Visual Analytics
1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and
More informationHybrid Support Systems: a Business Intelligence Approach
Journal of Applied Business Information Systems, 2(2), 2011 57 Journal of Applied Business Information Systems http://www.jabis.ro Hybrid Support Systems: a Business Intelligence Approach Claudiu Brandas
More informationInternational Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,
More informationMulti-dimensional index structures Part I: motivation
Multi-dimensional index structures Part I: motivation 144 Motivation: Data Warehouse A definition A data warehouse is a repository of integrated enterprise data. A data warehouse is used specifically for
More informationCubeView: A System for Traffic Data Visualization
CUBEVIEW: A SYSTEM FOR TRAFFIC DATA VISUALIZATION 1 CubeView: A System for Traffic Data Visualization S. Shekhar, C.T. Lu, R. Liu, C. Zhou Computer Science Department, University of Minnesota 200 Union
More informationCourse 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
More informationNew Approach of Computing Data Cubes in Data Warehousing
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 14 (2014), pp. 1411-1417 International Research Publications House http://www. irphouse.com New Approach of
More informationDesign and Implementation of Double Cube Data Model for Geographical Information System
The International Arab Journal of Information Technology, Vol. 1, No. 2, July 2004 215 Design and Implementation of Double Cube Data Model for Geographical Information System Mohd Shafry Mohd Rahim, Daut
More informationWhy Business Intelligence
Why Business Intelligence Ferruccio Ferrando z IT Specialist Techline Italy March 2011 page 1 di 11 1.1 The origins In the '50s economic boom, when demand and production were very high, the only concern
More informationUniversal. Event. Product. Computer. 1 warehouse.
Dynamic multi-dimensional models for text warehouses Maria Zamr Bleyberg, Karthik Ganesh Computing and Information Sciences Department Kansas State University, Manhattan, KS, 66506 Abstract In this paper,
More informationII. OLAP(ONLINE ANALYTICAL PROCESSING)
Association Rule Mining Method On OLAP Cube Jigna J. Jadav*, Mahesh Panchal** *( PG-CSE Student, Department of Computer Engineering, Kalol Institute of Technology & Research Centre, Gujarat, India) **
More informationIntroduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE?
Introduction Spatial Data Mining: Progress and Challenges Survey Paper Krzysztof Koperski, Junas Adhikary, and Jiawei Han (1996) Review by Brad Danielson CMPUT 695 01/11/2007 Authors objectives: Describe
More informationOLAP. Business Intelligence OLAP definition & application Multidimensional data representation
OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1 Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for
More informationOLAP Online Privacy Control
OLAP Online Privacy Control M. Ragul Vignesh and C. Senthil Kumar Abstract--- The major issue related to the protection of private information in online analytical processing system (OLAP), is the privacy
More informationChapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
More informationAlexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
More informationData Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
More informationBUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
More informationAvailable online at www.sciencedirect.com Available online at www.sciencedirect.com. Advanced in Control Engineering and Information Science
Available online at www.sciencedirect.com Available online at www.sciencedirect.com Procedia Procedia Engineering Engineering 00 (2011) 15 (2011) 000 000 1822 1826 Procedia Engineering www.elsevier.com/locate/procedia
More information1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application
More informationHow To Model Data For Business Intelligence (Bi)
WHITE PAPER: THE BENEFITS OF DATA MODELING IN BUSINESS INTELLIGENCE The Benefits of Data Modeling in Business Intelligence DECEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2
More informationTowards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study
www.ijcsi.org 561 Towards the Next Generation of Data Warehouse Personalization System A Survey and a Comparative Study Saida Aissi 1, Mohamed Salah Gouider 2 Bestmod Laboratory, University of Tunis, High
More informationFluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
More informationThe Benefits of Data Modeling in Business Intelligence
WHITE PAPER: THE BENEFITS OF DATA MODELING IN BUSINESS INTELLIGENCE The Benefits of Data Modeling in Business Intelligence DECEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2
More informationIMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS
IMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS Maria Dan Ştefan Academy of Economic Studies, Faculty of Accounting and Management Information Systems, Uverturii Street,
More informationLecture Data Warehouse Systems
Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in
More informationAn Approach for Facilating Knowledge Data Warehouse
International Journal of Soft Computing Applications ISSN: 1453-2277 Issue 4 (2009), pp.35-40 EuroJournals Publishing, Inc. 2009 http://www.eurojournals.com/ijsca.htm An Approach for Facilating Knowledge
More informationPart 22. Data Warehousing
Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem
More informationDATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
More informationBuilding Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu
Building Data Cubes and Mining Them Jelena Jovanovic Email: jeljov@fon.bg.ac.yu KDD Process KDD is an overall process of discovering useful knowledge from data. Data mining is a particular step in the
More informationINTEROPERABILITY IN DATA WAREHOUSES
INTEROPERABILITY IN DATA WAREHOUSES Riccardo Torlone Roma Tre University http://torlone.dia.uniroma3.it/ SYNONYMS Data warehouse integration DEFINITION The term refers to the ability of combining the content
More informationData Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
More informationChapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives
Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives Describe how the problems of managing data resources in a traditional file environment are solved
More informationMining various patterns in sequential data in an SQL-like manner *
Mining various patterns in sequential data in an SQL-like manner * Marek Wojciechowski Poznan University of Technology, Institute of Computing Science, ul. Piotrowo 3a, 60-965 Poznan, Poland Marek.Wojciechowski@cs.put.poznan.pl
More informationGraphical Web based Tool for Generating Query from Star Schema
Graphical Web based Tool for Generating Query from Star Schema Mohammed Anbar a, Ku Ruhana Ku-Mahamud b a College of Arts and Sciences Universiti Utara Malaysia, 0600 Sintok, Kedah, Malaysia Tel: 604-2449604
More informationDATA WAREHOUSING AND ITS POTENTIAL USING IN WEATHER FORECAST
2.2 DATA WAREHOUSING AND ITS POTENTIAL USING IN WEATHER FORECAST Xiaoguang Tan* Institute of Urban Meteorology, CMA, Beijing, China 1. INTRODUCTION The main function of most current forecaster s workbench
More informationB.Sc (Computer Science) Database Management Systems UNIT-V
1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used
More informationWeek 3 lecture slides
Week 3 lecture slides Topics Data Warehouses Online Analytical Processing Introduction to Data Cubes Textbook reference: Chapter 3 Data Warehouses A data warehouse is a collection of data specifically
More informationLost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
More informationUSING SELF-ORGANIZING MAPS FOR INFORMATION VISUALIZATION AND KNOWLEDGE DISCOVERY IN COMPLEX GEOSPATIAL DATASETS
USING SELF-ORGANIZING MAPS FOR INFORMATION VISUALIZATION AND KNOWLEDGE DISCOVERY IN COMPLEX GEOSPATIAL DATASETS Koua, E.L. International Institute for Geo-Information Science and Earth Observation (ITC).
More informationISSN: 2319-5967 ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 2, Issue 4, July 2013
An Architecture for Creation of Multimedia Data Warehouse 1 Meenakshi Srivastava, 2 Dr. S.K.Singh, 3 Dr. S.Q.Abbas 1 Assistant Professor, Amity University,Lucknow Campus, India, 2 Professor, Amity University
More informationPerformance of KDB-Trees with Query-Based Splitting*
Performance of KDB-Trees with Query-Based Splitting* Yves Lépouchard Ratko Orlandic John L. Pfaltz Dept. of Computer Science Dept. of Computer Science Dept. of Computer Science University of Virginia Illinois
More informationANALYTICS IN BIG DATA ERA
ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY, DISCOVER RELATIONSHIPS AND CLASSIFY HUGE AMOUNT OF DATA MAURIZIO SALUSTI SAS Copyr i g ht 2012, SAS Ins titut
More informationCONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive
More informationClustering through Decision Tree Construction in Geology
Nonlinear Analysis: Modelling and Control, 2001, v. 6, No. 2, 29-41 Clustering through Decision Tree Construction in Geology Received: 22.10.2001 Accepted: 31.10.2001 A. Juozapavičius, V. Rapševičius Faculty
More informationTOWARDS VIRTUAL WATERSHEDS: INTEGRATED DATA MINING, MANAGEMENT, MAPPING AND MODELING
AWRA 2010 SPRING SPECIALTY CONFERENCE Orlando, FL March 29-31, 2010 Copyright 2010 AWRA TOWARDS VIRTUAL WATERSHEDS: INTEGRATED DATA MINING, MANAGEMENT, MAPPING AND MODELING Yang Cao, Daniel P. Ames, and
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
More informationMobility Data Management
Mobility Data Management (Spatio-Temporal Data Warehouses) Esteban ZIMÁNYI Department of Computer & Decision Engineering (CoDe) Université Libre de Bruxelles ezimanyi@ulb.ac.be CIKM Tutorial on Mobility
More informationData Warehouse Snowflake Design and Performance Considerations in Business Analytics
Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker
More informationTHE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS
THE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS ADRIAN COJOCARIU, CRISTINA OFELIA STANCIU TIBISCUS UNIVERSITY OF TIMIŞOARA, FACULTY OF ECONOMIC SCIENCE, DALIEI STR, 1/A, TIMIŞOARA, 300558, ROMANIA ofelia.stanciu@gmail.com,
More informationPrinciples and Practices of Data Integration
Data Integration Data integration is the process of combining data of different themes, content, scale or spatial extent, projections, acquisition methods, formats, schema, or even levels of uncertainty,
More informationORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process
ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced
More informationDATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
More informationTowards a Logical Multidimensional Model for Spatial Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista
Towards a Logical Multidimensional Model for Data Warehousing and OLAP Marcus Costa Sampaio André Gomes de Sousa Cláudio de Souza Baptista Information System Laboratory - LSI, Federal University of Campina
More informationFoundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management 6.1 2010 by Prentice Hall LEARNING OBJECTIVES Describe how the problems of managing data resources in a traditional
More informationWeb Log Data Sparsity Analysis and Performance Evaluation for OLAP
Web Log Data Sparsity Analysis and Performance Evaluation for OLAP Ji-Hyun Kim, Hwan-Seung Yong Department of Computer Science and Engineering Ewha Womans University 11-1 Daehyun-dong, Seodaemun-gu, Seoul,
More information1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining
1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining techniques are most likely to be successful, and Identify
More informationBusiness Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers
60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative
More informationATLAS2000 Atlases of the Future in Internet
ATLAS2000 Atlases of the Future in Internet M. Friedrich Institute for Physical Geography, University of Freiburg i.br., Germany (mafri@ipg.uni-freiburg.de) M. Melle Institute for Computer Science, University
More informationPrivacy-preserving data warehousing for spatiotemporal
Privacy-preserving data warehousing for spatiotemporal data Maria L. Damiani, Università Milano (I) GEOPKDD - Meeting Venezia 17 Oct 05 1 Report The report contains two contributions: M.L. Damiani, S.
More informationIndex Selection Techniques in Data Warehouse Systems
Index Selection Techniques in Data Warehouse Systems Aliaksei Holubeu as a part of a Seminar Databases and Data Warehouses. Implementation and usage. Konstanz, June 3, 2005 2 Contents 1 DATA WAREHOUSES
More informationDatabases in Organizations
The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron
More informationImproving Data Mining of Multi-dimension Objects Using a Hybrid Database and Visualization System
Improving Data Mining of Multi-dimension Objects Using a Hybrid Database and Visualization System Yan Xia, Anthony Tung Shuen Ho School of Electrical and Electronic Engineering Nanyang Technological University,
More informationData Warehousing und Data Mining
Data Warehousing und Data Mining Multidimensionale Indexstrukturen Ulf Leser Wissensmanagement in der Bioinformatik Content of this Lecture Multidimensional Indexing Grid-Files Kd-trees Ulf Leser: Data
More informationDraft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001
A comparison of the OpenGIS TM Abstract Specification with the CIDOC CRM 3.2 Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 1 Introduction This Mapping has the purpose to identify, if the OpenGIS
More information