DESIGN OF A SPATIAL DATA WAREHOUSE BASED ON AN INTEGRATED NON- SPATIAL DATABASE AND GEO-SPATIAL INFORMATION

Size: px
Start display at page:

Download "DESIGN OF A SPATIAL DATA WAREHOUSE BASED ON AN INTEGRATED NON- SPATIAL DATABASE AND GEO-SPATIAL INFORMATION"

Transcription

1 DESIGN OF A SPATIAL DATA WAREHOUSE BASED ON AN INTEGRATED NON- SPATIAL DATABASE AND GEO-SPATIAL INFORMATION Abdulvahit Torun Harita Genel Komutanlığı (General Command of Mapping) (GCM), Kartografya Dairesi, Cebeci, Ankara, Türkiye atorun@hgk.mil.tr Accessing multiple, distributed, heterogeneous and autonomous information sources storing spatial or non-spatial data and integration of those data sources has been an important issue as distributed data processing and management became available with today s technology. Common solutions to data integration in database area are the data integration, schema integration and software developing approaches. The first approach is implemented as a data warehouse (DW) or an integration as a central database (DB). The data in DW has been cleansed, integrated, and preprocessed and infrastructures have been built surrounding DW for efficient data analysis. Main processes to construct a spatial data warehouse are; collecting data from information sources (data warehousing), organizing data in a geo-spatial data store (warehouse organization) and querying and visualizing data. This study covers implementation of first and design of the following two phases of developing spatial data warehouse. In data warehousing phase, required data about population information, information about administrative hierarchy, location information and administrative boundaries are extracted from relevant data sources. After introducing data warehousing step, model of latter two phases are given in this paper. 1. INTRODUCTION There are multiple, distributed, heterogeneous and autonomous information sources storing spatial data within a single company or property of various companies. The information sources vary from legacy systems to operational spatial databases. The need for gathering or defining associations among those data sources motivates to integrate them as a spatial data warehouse (DW) or as a database using integration techniques. Spatial data sets are integrated either through a part of data warehouse or distributed/federated database (by means of wrappers and mediators). DW is a subject-oriented, integrated, time-varying, non-volatile collection of data repository, which has been extracted and integrated from heterogeneous and autonomous distributed data sources. DW is used primarily in organizational decision-making. The data in DW has been cleansed, integrated, and pre-processed and infrastructures have been built surrounding DW for efficient data analysis. Main processes to construct a spatial data warehouse are; collecting data from information sources (data warehousing), organizing data in a geo-spatial data store (warehouse organization) and querying and visualizing data [Ezeife, 2001] [Voisard, et.al., 2002]. In warehousing phase, resolving formats, data integration, data cleansing, consistency-checking tasks are accomplished. Warehouse organization step comprises re-organizing and aggregating the data as base tables, summary tables and metadata. Querying and visualizing a data spatial warehouse considering spatial characteristics of data requires essential tools for analytical processes and visualization of the information both for organizational and ad hoc users.

2 Most of DWs store large amount of data, which is often used for summarization-based on-line analytical processing (OLAP). ROLAP (relational OLAP) and MOLAP (multidimensional OLAP) are two basic OLAP architectures. DWs are usually based on relational technologies but OLAP uses a multidimensional view of aggregate data to provide quick access to strategic information for further analysis. OLAP enables analysts, managers and executive to gain insight into data through fast, consistent, interactive access to a wide variety of possible views of information. However OLAP and DWs are complementary. OLAP transforms raw data so that it reflects the real dimensionality of the enterprise as understood by the user [Niemi, 2002]. Materialized views pre-compute and store (materialize) aggregates from the base data. The data is grouped using categories from the dimensions tables, which corresponds to the subjects of interest (dimensions) of the organization. Storing all possible aggregates poses storage space problems and increases maintenance cost, since all stored aggregates need to be refreshed as updates are being made to the source data sources.. 2. DATA SOURCES AND DATABASE INTEGRATION In most of the DB integration cases, the tendency is management of legacy systems and re-use of data in recent years. The DBs, which are distributed geographically, in different models, in different environment and with different semantics are needed to be integrated to use the available data. This problem is called as DB integration. DBMSs and applications are designed considering incorporate data and process sharing to ease autonomy. Main efforts are spend in three directions; schema integration, federated DB and multi-database language [Elmagarmid et.al., 1999] [Bobak, 1996]. 2.1 Data Warehousing In warehousing phase resolving formats, data integration, data cleansing, consistency-checking tasks are done. New data is added to DW due to format and transformation definitions in metadata. Before adding the data, consistency checks are performed in order to guarantee that data is loaded once and correctly. 2.2 Database Integration Strategy A hybrid integration technique is applied in order to develop Populated Places DB of Turkey () [Torun, 2000] [Torun, 2002_1]. Data integration method integrating several DBs into one unique DB- and schema integration method are used for integration. Data insertion from the component DBs is done by running the programs written in API of. Schema and Data integration is accomplished by using ladder technique in which first of all, schema of newly designed of General Command of Mapping (GCM) is integrated with the data collected for populated places which are stored in a plain table. Then, schema of is extended to integrate it with State Statistics Institute (SSI) DB schema (Figure 1.b). For integrating different data sets describing the same phenomena, firstly a correct understanding of the semantics of the existing data should be developed. Schemas are transformed into a common data model. For instance SSI DB schema into SQL and DB instance into dbf are converted. Then, an accurate correlation among structures (schemas) is established to avoid comparing different type of objects within the same category. The inter-schema correspondence at metalevel and inter-db correspondence at data level are identified and described. Finally, integration is described precisely to prevent merging irrelevant data together. The conflicts are solved semi-automatically. Integrated schema is generated on top of contributing data sources. Schema integration is done by using modified 5 level integration architecture for the purpose of single DB generation (Figure 1.a) [Ozsu, et.al., 1999].

3 Modified 5 Level Schema Integration Architecture 1. Local Schema: Local schema of plain table, SSI DB and DCT are made available by the Component DBs. 2. Component Schema: Representation of Local schema in canonical (standard) data Model is defined by using SQL. Canonical data Model is employed for unifying divergent local schemas in a single schema. 3. Export Schema: Export Schemas are defined from component schemas based on integrated global schema of. 4. Schema Integration: Export Schemas are integrated to form a single schema. Since the desired final DB is not a federated one, the integrated schema is not a Federated Schema, either. Firstly, two component schemas are integrated. Then, the third one is added to the integrated schema. This method is called as ladder technique. Data integration follows schema integration. 5. External Schema: Upon the integrated schema different conceptual/external schemas are defined for different purposes and usage [Torun, 2002_2]. 2.3 Developing Populated Places Database of Turkey as a base for Spatial Data warehouse Data warehousing phase is implemented as follows. Population information and information about administrative hierarchy are extracted from State Statistics Institute (SSI) DB and from Ministry of Interior Populated Places Book respectively. Location information and administrative boundaries are imported from topo-maps at scale 1:25000 and Digital Chart of Turkey (DCT) at scale 1: of General Command of Mapping respectively. Integration of those four data sources is accomplished by using ladder technique to construct Populated Places DB of Turkey () as a central DB at Cartography Department of GCM. Schema 1 Schema 2 Schema 3 I_SDB Declaration of Correspondence Semantic and Structural Conflicts Resolution of Conflicts Integration Rules Schema Fusion Plain Table SSI DB DCT (a) (b) Figure : 1.a Schema Integration, 1.b. DB integration using ladder technique Population information and relevant statistics are collected and stored by SSI. In order to maintain population information up-to-date for map production and for monitoring the changes of populated places, DB integration is necessary to share the legacy data by SSI instead of re-constructing the same content. Defining Associations Turkey has a hierarchical administrative system, which looks like a balanced tree structure. The sequence of residential entities (populated places) from top to bottom is province, district, sub-district, village and suburb respectively.

4 2.4 Database Integration Schema Integration Source DBs can be integrated by means of schema integration or using standardized data or schema models. Schema integration is a practical method such that the data is integrated as a logical DB with a global schema. With a given a set of local DB schemas belonging to an individual DBMS, an integrated schema which subsumes those local schemas is created by synthesizing the schemas. First of all, database schema of is designed by means of actual and future needs. DB schema of is designed considering the administrative binding of populated places. Then, relevant attributes of SSI DB are added to the schema in addition to a foreign key, which provides to connect SSI DB. The final step is designing an Integrated spatial DB (I_SDB) with the three component DBs;, DCT and spatial DB derived from a subset of. Data Integration There are three main data integration techniques. Firstly, a very basic approach is providing a global catalogue of accessible information sources to user to allow him doing integration by himself without attempting any integration. The user should search, find, decide and find tools to select, extract, integrate and query the data. Secondly, a step further is integrating the source DBs as one single DB by putting the data together. The data and applications should be converted into the integrated DB. Thirdly, subsets of source databases are extracted due to main application needs. Data integration process is done in four major steps by using the first two techniques. Data integration is accomplished by applying ladder technique. Firstly, DB schema of is designed considering the available non-spatial digital information sources and further needs. Secondly, is populated with the data from plain table. Therefore, the plain table is mapped into. Thirdly, schema is extended in order to import population information from exported data of SSI DB (Figure 2.a). Derived Spatial DB (d_sdb) is based on a common schema for both and spatial data processor for the time being ESRI-ArcInfo- that will import the data. Export schema is created by means of a non-spatial predicate that cuts the DB both vertically a subset of attributes- and horizontally a subset of tuples-. Primary key is repeated in every fragment in vertical fragmentation. Thus, disjointness is valid only on non-primary key attributes in vertical fragmentation (Devogele, et.al, 1998). The exported non-spatial data is mapped into spatial format to generate d_sdb. These are done by developing a tiny software which extracts data from into a common model, transforms the common model into a spatial DB and visa-versa. Different languages are employed for spatial and non-spatial definitions and manipulations. DCT contains a set of spatial data classes such as administrative boundaries, hydrology, transportation, elevation, populated places (only provinces and sub-provinces), physiography. The relationship among and d_sdb is preserved by keeping the same primary key Populated Place ID- in corresponding relations of both DBs. Problems of Integration Constructing an integrated DB from existing DBs yield some problems due lack of interoperability among DBs. GCM and SSI have different non-spatial data to an extent for the same context. The data differs because of partly semantic but mainly schematic and format (syntax) discrepancy. Updateness is the main reason for this kind of anomalies. If one of the DBs stores a different name than the current one for a populated place the corresponding populated places cannot be matched till the mistake is removed or association is built. This process prevents scalability of the resultant schema and DB

5 instance. Each time there happens a change, the integrated schema and integrated DB should be updated. 3. SPATIAL DATA WAREHOUSE MODEL Data analysis applications typically aggregate data across many dimensions looking for anomalies or unusual patterns. They categorize data values and trends, extract statistical information, and contrast one category with another. Data analysis is done four steps; A query that extracts relevant data from a large database is formulated, The aggregated data from the database into file or table is extracted The results are visualized by using graphic or mapping tools, The results are analyzed and new queries are formulated [Gray et al., 1997]. Data integration and consistency checks are accomplished. The coming stage will comprise organizing DW, defining and developing analytical operations and generating tools to extend integrated global schema of DW and DW instance. Query Interface Query Engine Plain Table for Populated Places of GCM imigration Schema SSI DB Schema Integrated (extended) Schema for SSI DB Modified SSI DB Data Warehouse DB Integration Metadata integration rules Data Integration Extracter (DB API) Source DB 1 Source DB 2 Source DB... Source Data warehouse (a) (b) Figure : 2.a DB intefration for, 2.b. Hybrid model for Spatial DW 3.1 Spatial Data Warehouse Organization Data is organized in a spatial data store in this stage. DW does not allow the user to delete or update the data in the store. However, the growth of DW is managed either transferring into cheaper media or generating summary tables of less used data. Row data is re-structured and aggregated in base tables. Aggregation rules are defined in metadata. Users are sometimes interested in aggregated values such as population of a region or province, which is calculated by summing the population of populated places at lower hierarchies. Getting the result of a query on the fly based on operational DB may not be efficient. On the other hand, the more data is aggregated, the faster queries are responded, but update and storage are increased. A balance among query response time and update time should be considered.

6 A hybrid approach is employed to provide efficient and rich analytical analysis to the user (Figure 2.b). A part of data is organized in a DW while other part of data is extracted from the integrated DB during query processing. In the metadata information about where and how to get the relevant data is stored. A warehouse is useful for data related to whole field. This is useful in geomarketing applications where data, being socio-cultural behaviors or statistical data. For the case of spatial data warehouse, resultant maps of some queries are stored in the DW for quick response and other queries are responded on the fly as a graphic result. Therefore dimension of data cube is extended by adding related maps of each dimension considering aggregate hierarchies. The dimension determined in the actual DB are geometry, time, type of populated places. The geometry dimension comprises geometric boundary of populated places from region, province, district, sub-district and village. Some other geometric hierarchies can be defined according to a property like height zones, earthquake danger zones, fruitful soil zones etc 3.2 Analytical Operations on Datawarehouse Analytical Operations applied to warehouses for are aggregation (consolidation, roll up), roll down (drill down, drill through), selection (screening, filtering, dicing), slicing, pivoting (rotation). Aggregation is summarization of data for the higher level of hierarchy. Roll down is navigation among levels of data ranging from higher-level summary (up) to lower level summary or detailed data (down). Selection is taking a subset of data by means of a criterion which is evaluated against the data or members of a dimension in order to restrict the set of retrieved data. Slicing is selection of all data satisfying a condition along a particular dimension. Pivoting is changing the dimensional orientation of cube [Vassiliadis, 2000] [Vassiliadis, 2002]. A cube is a group of data cells arranged by the dimensions of the data. A dimension is defined as a structural attribute of a cube that is a list of members, all of which are of a similar type in the users perception of the data. Each dimension has an associated hierarchy of levels of aggregated data. For instance time can be detailed as year, month, week, day, hour. Measures (variables, metrics, facts) represent the real world values. A single datapoint that occurs at the intersection is defined by selecting one member from each dimension in a multi-dimensional array (cube). Although relational technologies provide some non-procedural tools for analytical operations, routines should be provided for complex calculations. Operators defined in SQL in relational DBMS are group by (), count (), sum (), minimum (), maximum (), average (), median (), standard deviation (), variance () etc... However, an algebra is defined on cube having the operations group by, cube, roll up [Gray, et.al., 1997]. Visualizing the Results by Means of Graphic and cartographic Tools Visualization tools display trends, clusters and differences by using graphic tools to make the user to understand the metaphor easily. Visualization tools render the results as 2D or 3D graphs in addition to tools of cartography, which are used for mapping statistical distributions. In order to visualize dynamic characteristics dynamic visual variables are used in addition to visual variables. and Cartographic tools in GIS software provide most of the mapping tools. Results of queries against spatial data warehouses are visualized as a value, an array (table), graphic (2D and 3D), maps (2D, 3D, multimedia-animation, sound etc ). 3.3 Extending and Maintaining Spatial DB as a source of Spatial Data Warehouse After founding a spatial DW, the data is updated, changed or model is extended in two ways. Firstly, source sends data regularly or after balk changes. Secondly, warehouse asks for data at certain time periods [Voisard, et.al, 2002].

7 Since, mapping from data sources into is initially done automatically, the tools for mapping are available for further data injection into the DB. Moreover, there are tools to check consistency considering administrative hierarchy and to compare different versions of. The application has tools for entering, manipulating and updating data in addition to intelligent query generator based on administrative hierarchy and standard topographic map indexes. 4. CONCLUSIONS The concepts about database integration are briefed in the text. Data and schema integration techniques are given related with developing an integrated DB comprising populated places, administeral hierarchies, population information which is called as (Populated Places DB of Turkey). Finally, hybrid model for integration processes is defined within the application phase. Syntactic conflicts are removed by using common formats for all data sources. Schematic conflicts are resolved by defining an integrated global schema and removing redundant and repeated information. Since the semantic meanings of administrative units are unique among governmental bodies, almost no semantic conflicts are met. Naming differences are removed by defining unique names or synonyms. Organizing DW and defining analytical operations phases are designed. A hybrid method is going to be used to develop spatial DB. The spatial DW will comprise a set of maps, which are generated for the frequent user queries considering aggregation hierarchies. Acknowledgements. Populated Places Database of Turkey is designed and developed in Cartography Department of General Command of Mapping, Turkey. REFERENCES Bobak, A.R. Distributed and Multi-Database Systems, Artech House, Boston, pp (1996). Devogele, T., C.parent, S.Spaccapietra, On Spatial Database Integration, International Journal of Geographic Information Science, 12(4), pp (1998). Elmagarmid, A. et.al., Management of Heterogeneous And Autonomous Database Systems, Morgan Kaufmann Publishers, San Francisco, pp.2-32 (1999). Ezeife, C.I., Selecting and materializing horizontally partitioned warehouse views, Data & Knowledge Engineering 36, pp , Elsevier Science (2001). Gray, J. et.al., Data Cube: A Relational Aggregation Operator Generalizing Group By, Cross-tab, and Sub-Totals, data Mining and Knowledge Discovery 1, pp 29-53, Kluwer Academic Publishers (1997). Hepner, P., Integrating Heterogenous Databases : An Overview, (1995).(accessed Dec. 2001). Niemi, T., Constructing OLAP Cubes Based on Queries, (accessed Aug. 2002). Özsu, M.T., P. Valduriez, Principles of Distributed Database Systems, Prentice Hall, New Jersey, pp (1999). Torun, A., Populated Places DB Project, General Command of Mapping. Internal Report, Cartography Department, General Command of Mapping, Turkey (2000). Torun, A., Designing Populated Places Database of Turkey () by Using Relational Model, Harita Dergisi, 128 (2002_1).

8 Torun, A., Using Schema and Data Integration Techniques to Integrate Spatial and non-spatial Data: Developing Populated Places DB of Turkey (), ISPRS Comm. IV, Canada (2002_2). Vassiliadis, P., Data Warehouse Modeling and Quality Issues, Ph.D. Thesis, Nat. Tech. Univ. of Athens, Greece (2000). Vassiliadis, P., Modeling Multidimensional Databases, Cubes and Cube Operations, (accessed Aug. 2002). Voisard, A., M. Jürgens, Geospatial Information Extraction: Querying or Quarrying, (accessed Aug. 2002).

USING SCHEMA AND DATA INTEGRATION TECHNIQUE TO INTEGRATE SPATIAL AND NON-SPATIAL DATA : DEVELOPING POPULATED PLACES DB OF TURKEY (PPDB_T)

USING SCHEMA AND DATA INTEGRATION TECHNIQUE TO INTEGRATE SPATIAL AND NON-SPATIAL DATA : DEVELOPING POPULATED PLACES DB OF TURKEY (PPDB_T) USING SCHEMA AND DATA INTEGRATION TECHNIQUE TO INTEGRATE SPATIAL AND NON-SPATIAL DATA : DEVELOPING POPULATED PLACES DB OF TURKEY () Abdulvahit Torun General Command of Mapping (GCM), Cartography Department,

More information

USING SCHEMA AND DATA INTEGRATION TECHNIQUE TO INTEGRATE SPATIAL AND NON-SPATIAL DATA : DEVELOPING POPULATED PLACES DB OF TURKEY (PPDB_T)

USING SCHEMA AND DATA INTEGRATION TECHNIQUE TO INTEGRATE SPATIAL AND NON-SPATIAL DATA : DEVELOPING POPULATED PLACES DB OF TURKEY (PPDB_T) ISPRS SIPT IGU UCI CIG ACSG Table of contents Table des matières Authors index Index des auteurs Search Recherches Exit Sortir USING SCHEMA AND DATA INTEGRATION TECHNIQUE TO INTEGRATE SPATIAL AND NON-SPATIAL

More information

When to consider OLAP?

When to consider OLAP? When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

DATA WAREHOUSING AND OLAP TECHNOLOGY

DATA WAREHOUSING AND OLAP TECHNOLOGY DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are

More information

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

More information

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,

More information

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28 Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT

More information

CHAPTER 4 Data Warehouse Architecture

CHAPTER 4 Data Warehouse Architecture CHAPTER 4 Data Warehouse Architecture 4.1 Data Warehouse Architecture 4.2 Three-tier data warehouse architecture 4.3 Types of OLAP servers: ROLAP versus MOLAP versus HOLAP 4.4 Further development of Data

More information

Fluency With Information Technology CSE100/IMT100

Fluency With Information Technology CSE100/IMT100 Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

More information

DATA WAREHOUSING - OLAP

DATA WAREHOUSING - OLAP http://www.tutorialspoint.com/dwh/dwh_olap.htm DATA WAREHOUSING - OLAP Copyright tutorialspoint.com Online Analytical Processing Server OLAP is based on the multidimensional data model. It allows managers,

More information

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application

More information

Turkish Journal of Engineering, Science and Technology

Turkish Journal of Engineering, Science and Technology Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server

More information

Week 13: Data Warehousing. Warehousing

Week 13: Data Warehousing. Warehousing 1 Week 13: Data Warehousing Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots of buzzwords, hype slice & dice, rollup,

More information

Dimensional Modeling for Data Warehouse

Dimensional Modeling for Data Warehouse Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or

More information

CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved

CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved CHAPTER SIX DATA Business Intelligence 2011 The McGraw-Hill Companies, All Rights Reserved 2 CHAPTER OVERVIEW SECTION 6.1 Data, Information, Databases The Business Benefits of High-Quality Information

More information

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key

More information

Part 22. Data Warehousing

Part 22. Data Warehousing Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem

More information

CHAPTER-24 Mining Spatial Databases

CHAPTER-24 Mining Spatial Databases CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification

More information

Building Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu

Building Data Cubes and Mining Them. Jelena Jovanovic Email: jeljov@fon.bg.ac.yu Building Data Cubes and Mining Them Jelena Jovanovic Email: jeljov@fon.bg.ac.yu KDD Process KDD is an overall process of discovering useful knowledge from data. Data mining is a particular step in the

More information

Databases in Organizations

Databases in Organizations The following is an excerpt from a draft chapter of a new enterprise architecture text book that is currently under development entitled Enterprise Architecture: Principles and Practice by Brian Cameron

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 6 Foundations of Business Intelligence: Databases and Information Management 6.1 2010 by Prentice Hall LEARNING OBJECTIVES Describe how the problems of managing data resources in a traditional

More information

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker

More information

Week 3 lecture slides

Week 3 lecture slides Week 3 lecture slides Topics Data Warehouses Online Analytical Processing Introduction to Data Cubes Textbook reference: Chapter 3 Data Warehouses A data warehouse is a collection of data specifically

More information

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Systems: Foundations and Architectures Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository

More information

What is OLAP - On-line analytical processing

What is OLAP - On-line analytical processing What is OLAP - On-line analytical processing Vladimir Estivill-Castro School of Computing and Information Technology With contributions for J. Han 1 Introduction When a company has received/accumulated

More information

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS DATA WAREHOUSE CONCEPTS A fundamental concept of a data warehouse is the distinction between data and information. Data is composed of observable and recordable facts that are often found in operational

More information

Business Intelligence, Analytics & Reporting: Glossary of Terms

Business Intelligence, Analytics & Reporting: Glossary of Terms Business Intelligence, Analytics & Reporting: Glossary of Terms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Ad-hoc analytics Ad-hoc analytics is the process by which a user can create a new report

More information

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives Describe how the problems of managing data resources in a traditional file environment are solved

More information

Data W a Ware r house house and and OLAP II Week 6 1

Data W a Ware r house house and and OLAP II Week 6 1 Data Warehouse and OLAP II Week 6 1 Team Homework Assignment #8 Using a data warehousing tool and a data set, play four OLAP operations (Roll up (drill up), Drill down (roll down), Slice and dice, Pivot

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management Content Problems of managing data resources in a traditional file environment Capabilities and value of a database management

More information

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 Introduction This course provides students with the knowledge and skills necessary to design, implement, and deploy OLAP

More information

14. Data Warehousing & Data Mining

14. Data Warehousing & Data Mining 14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,

More information

Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making

Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making Integrating GIS and BI: a Powerful Way to Unlock Geospatial Data for Decision-Making Professor Yvan Bedard, PhD, P.Eng. Centre for Research in Geomatics Laval Univ., Quebec, Canada National Technical University

More information

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609.

Data Integration using Agent based Mediator-Wrapper Architecture. Tutorial Report For Agent Based Software Engineering (SENG 609. Data Integration using Agent based Mediator-Wrapper Architecture Tutorial Report For Agent Based Software Engineering (SENG 609.22) Presented by: George Shi Course Instructor: Dr. Behrouz H. Far December

More information

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers OLAP Learning Objectives Definition of OLAP Data cubes OLAP operations MDX OLAP servers 2 What is OLAP? OLAP has two immediate consequences: online part requires the answers of queries to be fast, the

More information

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006 Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006 What is a Data Warehouse? A data warehouse is a subject-oriented, integrated, time-varying, non-volatile

More information

Introduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in

Introduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Introduction to Data Warehousing Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Necessity is the mother of invention Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai,

More information

A Design and implementation of a data warehouse for research administration universities

A Design and implementation of a data warehouse for research administration universities A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon

More information

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration DW Source Integration, Tools, and Architecture Overview DW Front End Tools Source Integration DW architecture Original slides were written by Torben Bach Pedersen Aalborg University 2007 - DWML course

More information

UNIT-3 OLAP in Data Warehouse

UNIT-3 OLAP in Data Warehouse UNIT-3 OLAP in Data Warehouse Bharati Vidyapeeth s Institute of Computer Applications and Management, New Delhi-63, by Dr.Deepali Kamthania U2.1 OLAP Demand for Online analytical processing Major features

More information

Basics of Dimensional Modeling

Basics of Dimensional Modeling Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimensional

More information

B.Sc (Computer Science) Database Management Systems UNIT-V

B.Sc (Computer Science) Database Management Systems UNIT-V 1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used

More information

Course 103402 MIS. Foundations of Business Intelligence

Course 103402 MIS. Foundations of Business Intelligence Oman College of Management and Technology Course 103402 MIS Topic 5 Foundations of Business Intelligence CS/MIS Department Organizing Data in a Traditional File Environment File organization concepts Database:

More information

(Week 10) A04. Information System for CRM. Electronic Commerce Marketing

(Week 10) A04. Information System for CRM. Electronic Commerce Marketing (Week 10) A04. Information System for CRM Electronic Commerce Marketing Course Code: 166186-01 Course Name: Electronic Commerce Marketing Period: Autumn 2015 Lecturer: Prof. Dr. Sync Sangwon Lee Department:

More information

Unit -3. Learning Objective. Demand for Online analytical processing Major features and functions OLAP models and implementation considerations

Unit -3. Learning Objective. Demand for Online analytical processing Major features and functions OLAP models and implementation considerations Unit -3 Learning Objective Demand for Online analytical processing Major features and functions OLAP models and implementation considerations Demand of On Line Analytical Processing Need for multidimensional

More information

A Technical Review on On-Line Analytical Processing (OLAP)

A Technical Review on On-Line Analytical Processing (OLAP) A Technical Review on On-Line Analytical Processing (OLAP) K. Jayapriya 1., E. Girija 2,III-M.C.A., R.Uma. 3,M.C.A.,M.Phil., Department of computer applications, Assit.Prof,Dept of M.C.A, Dhanalakshmi

More information

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1 Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics

More information

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina Data Warehousing Read chapter 13 of Riguzzi et al Sistemi Informativi Slides derived from those by Hector Garcia-Molina What is a Warehouse? Collection of diverse data subject oriented aimed at executive,

More information

Tracking System for GPS Devices and Mining of Spatial Data

Tracking System for GPS Devices and Mining of Spatial Data Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja

More information

Lecture Data Warehouse Systems

Lecture Data Warehouse Systems Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in

More information

Overview of Data Warehousing and OLAP

Overview of Data Warehousing and OLAP Overview of Data Warehousing and OLAP Chapter 28 March 24, 2008 ADBS: DW 1 Chapter Outline What is a data warehouse (DW) Conceptual structure of DW Why separate DW Data modeling for DW Online Analytical

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,

More information

OLAP Theory-English version

OLAP Theory-English version OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction

More information

CHAPTER 5: BUSINESS ANALYTICS

CHAPTER 5: BUSINESS ANALYTICS Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse

More information

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing Data Warehousing Outline Overview of data warehousing Dimensional Modeling Online Analytical Processing From OLTP to the Data Warehouse Traditionally, database systems stored data relevant to current business

More information

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1 Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for

More information

M2074 - Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 5 Day Course

M2074 - Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000 5 Day Course Module 1: Introduction to Data Warehousing and OLAP Introducing Data Warehousing Defining OLAP Solutions Understanding Data Warehouse Design Understanding OLAP Models Applying OLAP Cubes At the end of

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing Class Projects Class projects are going very well! Project presentations: 15 minutes On Wednesday

More information

Data Warehouse design

Data Warehouse design Data Warehouse design Design of Enterprise Systems University of Pavia 21/11/2013-1- Data Warehouse design DATA PRESENTATION - 2- BI Reporting Success Factors BI platform success factors include: Performance

More information

Mario Guarracino. Data warehousing

Mario Guarracino. Data warehousing Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the

More information

TIM 50 - Business Information Systems

TIM 50 - Business Information Systems TIM 50 - Business Information Systems Lecture 15 UC Santa Cruz March 1, 2015 The Database Approach to Data Management Database: Collection of related files containing records on people, places, or things.

More information

Business Intelligence for SUPRA. WHITE PAPER Cincom In-depth Analysis and Review

Business Intelligence for SUPRA. WHITE PAPER Cincom In-depth Analysis and Review Business Intelligence for A Technical Overview WHITE PAPER Cincom In-depth Analysis and Review SIMPLIFICATION THROUGH INNOVATION Business Intelligence for A Technical Overview Table of Contents Complete

More information

Multi-dimensional index structures Part I: motivation

Multi-dimensional index structures Part I: motivation Multi-dimensional index structures Part I: motivation 144 Motivation: Data Warehouse A definition A data warehouse is a repository of integrated enterprise data. A data warehouse is used specifically for

More information

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing Anwendungssoftwares a Data-Warehouse-, Data-Mining- and OLAP-Technologies Online Analytic Processing Online Analytic Processing OLAP Online Analytic Processing Technologies and tools that support (ad-hoc)

More information

4-06-35. John R. Vacca INSIDE

4-06-35. John R. Vacca INSIDE 4-06-35 INFORMATION MANAGEMENT: STRATEGY, SYSTEMS, AND TECHNOLOGIES ONLINE DATA MINING John R. Vacca INSIDE Online Analytical Modeling (OLAM); OLAM Architecture and Features; Implementation Mechanisms;

More information

Adobe Insight, powered by Omniture

Adobe Insight, powered by Omniture Adobe Insight, powered by Omniture Accelerating government intelligence to the speed of thought 1 Challenges that analysts face 2 Analysis tools and functionality 3 Adobe Insight 4 Summary Never before

More information

Data Mining for Successful Healthcare Organizations

Data Mining for Successful Healthcare Organizations Data Mining for Successful Healthcare Organizations For successful healthcare organizations, it is important to empower the management and staff with data warehousing-based critical thinking and knowledge

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,

More information

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić Business Intelligence Solutions Cognos BI 8 by Adis Terzić Fairfax, Virginia August, 2008 Table of Content Table of Content... 2 Introduction... 3 Cognos BI 8 Solutions... 3 Cognos 8 Components... 3 Cognos

More information

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria

More information

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse? Outline Data Warehousing What is a data warehouse? Why a warehouse? Models & operations Implementing a warehouse 2 What is a Warehouse? Collection of diverse data subject oriented aimed at executive, decision

More information

1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining

1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining 1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining techniques are most likely to be successful, and Identify

More information

OLAP and Data Warehousing! Introduction!

OLAP and Data Warehousing! Introduction! The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still

More information

DATA CUBES E0 261. Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES

DATA CUBES E0 261. Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES E0 261 Jayant Haritsa Computer Science and Automation Indian Institute of Science JAN 2014 Slide 1 Introduction Increasingly, organizations are analyzing historical data to identify useful patterns and

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management Problem: HP s numerous systems unable to deliver the information needed for a complete picture of business operations, lack of

More information

University of Gaziantep, Department of Business Administration

University of Gaziantep, Department of Business Administration University of Gaziantep, Department of Business Administration The extensive use of information technology enables organizations to collect huge amounts of data about almost every aspect of their businesses.

More information

Lection 3-4 WAREHOUSING

Lection 3-4 WAREHOUSING Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing

More information

Data Warehousing. Paper 133-25

Data Warehousing. Paper 133-25 Paper 133-25 The Power of Hybrid OLAP in a Multidimensional World Ann Weinberger, SAS Institute Inc., Cary, NC Matthias Ender, SAS Institute Inc., Cary, NC ABSTRACT Version 8 of the SAS System brings powerful

More information

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on

More information

Database Marketing, Business Intelligence and Knowledge Discovery

Database Marketing, Business Intelligence and Knowledge Discovery Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski

More information

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary

More information

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc. PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions A Technical Whitepaper from Sybase, Inc. Table of Contents Section I: The Need for Data Warehouse Modeling.....................................4

More information

Data Warehousing and Data Mining in Business Applications

Data Warehousing and Data Mining in Business Applications 133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business

More information

BUILDING OLAP TOOLS OVER LARGE DATABASES

BUILDING OLAP TOOLS OVER LARGE DATABASES BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,

More information

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are

More information

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches Concepts of Database Management Seventh Edition Chapter 9 Database Management Approaches Objectives Describe distributed database management systems (DDBMSs) Discuss client/server systems Examine the ways

More information

CHAPTER 4: BUSINESS ANALYTICS

CHAPTER 4: BUSINESS ANALYTICS Chapter 4: Business Analytics CHAPTER 4: BUSINESS ANALYTICS Objectives Introduction The objectives are: Describe Business Analytics Explain the terminology associated with Business Analytics Describe the

More information

Data Warehousing and OLAP

Data Warehousing and OLAP 1 Data Warehousing and OLAP Hector Garcia-Molina Stanford University Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots

More information

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success Developing an MDM Strategy Key Components for Success WHITE PAPER Table of Contents Introduction... 2 Process Considerations... 3 Architecture Considerations... 5 Conclusion... 9 About Knowledgent... 10

More information

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES MUHAMMAD KHALEEL (0912125) SZABIST KARACHI CAMPUS Abstract. Data warehouse and online analytical processing (OLAP) both are core component for decision

More information

SQL Server 2012 Business Intelligence Boot Camp

SQL Server 2012 Business Intelligence Boot Camp SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations

More information

Why Business Intelligence

Why Business Intelligence Why Business Intelligence Ferruccio Ferrando z IT Specialist Techline Italy March 2011 page 1 di 11 1.1 The origins In the '50s economic boom, when demand and production were very high, the only concern

More information

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction

More information