Data Warehousing With Limited Database Operations

Size: px
Start display at page:

Download "Data Warehousing With Limited Database Operations"

Transcription

1 ThoughtPaper Data Warehousing With Limited Database Operations By Muhammad Ahmad Shahzad Managing Principal Consultant, PMP Collabera

2 Data warehousing is a very expensive solution and should be designed as per requirements. This paper focuses on the need to reduce database operations while implementing the data warehouse. A data warehouse can be divided into following phases: extraction, initialization, transformation and scrubbing, categorization, loading, summarization and reporting. Most of these phases are not directly based on database, and can be handled using flat file manipulation. This paper will outline different phases of data warehousing, and how to limit operations in these phases and after implementation. The Need for Limiting Database Operations Developing data warehouses is a lengthy and expensive process. One of the most important decisions a project manager faces is deciding to what extent database operations should be used for the implementation of data warehouse. The data warehouse project can be divided in phases: extraction, initialization, transformation and scrubbing, categorization, loading, summarization and reporting. The existing data servers have the capability to implement all of the above phases of data warehousing. Database servers like Oracle 8i and SQL Server 7.0 have special features tuned to implement data warehouses more efficiently. With the help of database design tools, fast loads and OLAP reporting front end, the process of data warehouse implementation has improved considerably. The most prominent issue with implementing all of the above-mentioned phases using database operations is that databases were initially designed for implementing transactional systems rather than reporting applications. Thus, features like data consistency, integrity, normalization, concurrency, and rollbacks, which are crucial for transaction systems, are not that relevant when we want to implement data warehouses. Most of the commercially available database servers, which have evolved in the direction of very large databases, still support online transactional systems, and therefore include the aforementioned features. These features are important in data warehouses, but are not of high importance. To understand why most of the features of databases do not directly apply on data warehouses, we must look into the basic features of data warehouses. A data warehouse is a subject-oriented, integrated, non-volatile and time-variant collection of data in support of management s decisions. It is about molding data into information, and storing this information based on the subject rather than application. Essentially, data warehousing is the warehousing of data outside operational systems and this has not significantly changed with the evolution of data warehousing systems. A prime reason of this separation is that the evaluation and analysis, done by analysts, require complex and analytic queries - the effect of which is the performance degradation of operational systems. Most of the data warehousing possess three-tier architecture first-tier being the

3 transfer of data from transactional systems to database server, second tier being the database server itself and the third tier is client-side analysis tools. If we closely analyze the aforementioned architecture, it is very clear that the database itself is just a part of overall picture. There exist a lot of operations in data warehousing which are not directly based on database. Though the database itself is very important and the database design takes a pivotal role in the overall implementation, most of the database operations can be undermined as a result of the associated overhead. Data being loaded into the warehouse is landing as big chunks consisting of hundreds of Megs, or even Gigs - and once they are loaded into the database, they will not change. Most of these applications are designed with some type of batch window for loading data, during which time some portion of the data warehouse will be down. This gives some slack for implementation, but the issue stills exists to attempt to minimize this window. Before discussing the phases of data warehousing in detail, it must be mentioned that the data entering the warehouse is hardly acceptable for loading into the database in its raw format. Data is coming from diverse sources with different hardware, software, and databases. Often, it is dirty data in terms of redundancy, key management, referential integrity and inconsistency in data elements. This is an issue that must be addressed. Now, this paper will discuss all the phases of data warehousing and how to implement those with limited database operations. Extraction and Initialization Data in the warehouse comes from either the transactional database for which the data warehouse is implemented, or the data warehouse itself. For unloading data from databases, a

4 fast unload utility can be used rather than using SQL operations. These fast-unload utilities unload the data into data files using control files to specify the format. Extracting data from transactional databases is understandable, but why extract data from existing data warehouse? The reason behind this will be clearer when we take a look at the other phases, but in short it s to minimize database operations for lookups like primary key management or foreign key management. The importance of Meta data is beyond doubt in the implementation of data warehouses. Meta data files are used to recognize and manage data files. In a wisely organized warehouse, there exist different sets of meta data files for different phases. As the files evolve from dirty to load-able, the meta data itself gets closer to the database design. As mentioned previously, typically data coming into the database from transactional systems needs a lot of grooming. The biggest issue is to make sense of dirty data. There exist various reasons for this problem, but almost all of warehouse designers have to face this. Also, there exist primary key and referential integrity management and the removal of inconsistencies from the data. Almost all of these problems are handled in the transformation and scrubbing phase, and the initial preparation is done in this phase, as is the first level of validation. The files coming into a data warehouse may or may not follow some kind of standard format. If they do not follow any format or there exists more than one format for different transactional systems, then these files have to be changed to a standard warehouse format: Unified Warehouse File Format or UWF format. Following is an overview of UWF format: Each object/table will have two files: one is the data file and the other one is definition or Meta file. There should exist only one set of files for one object. One file should have data for only one object. The definition file should have the type of file (fixed length or variable length), delimitation information, data types, default values and characteristics of fields which would include information like is nullable, primary key, reference key or partition key. Most of the time it s also useful to include objects type inside the definition file, showing if the object is a fact table, dimension table or a summary table. Once the files are reformatted into UWF format, the rest of the programs and phases will be handling using just one kind of file format. Often, it s useful to have the first level of validation in the Extraction phase. This validation includes checks like existence of Mandatory information,

5 Valid Number, Valid Date, and Valid Partition Key. This is a very primitive check, but is also very essential. After converting files into UWF format and passing through the first level validation, files for transactional data are ready for the transformation and scrubbing phase. The final step in the extraction phase is unloading from the warehouse. For the transformation phase, there is a need for primary key and foreign key information. For all of the objects, files will be maintained in some kind of Database Lookup area. Transformation will use these files to generate new primary keys and maintain referential integrity. If the domain information is implemented in the database, then foreign key management will handle this; otherwise, there should be domain file management in place. These domain files will have information like target value and a set of source values. The following figure shows a domain file for sex. Transformation and Scrubbing This phase is the backbone of data warehousing. If implemented properly, based on the overall requirements and design, the project is 70% completed. The transformation and scrubbing phase is about 65-80% of the overall development of data warehouse. This phase becomes even more important if we want to implement the data warehouse without the staging database. There is no need to have a physically separate staging database to store data before scrubbing and transforming it. The data coming from extraction phase are in the form of flat files, and should remain as flat files even after this phase. All of the processing should be done on flat files, as flat file manipulation is faster and can be handled in parallel.

6 The overall goal of the Transformation and Scrubbing phase is to present data in a load-able format. The load phase should not be performing any file manipulation. All of the manipulation should be part of this phase. As a reminder the data coming into this phase has already been reformatted to UWM format. Following are the main tasks for this phase. Remove redundant information; Resolve inconsistency from data; Primary key management, and Foreign key management These tasks may seem simple, but the solutions for these problems are very complex and vary from data warehouse to data warehouse. The complexity of the first two tasks is a function of dirty data. If the data extracted from a legacy system is very dirty, then accomplishing these tasks will need a lot of work, whereas if the legacy systems were designed properly and integrating them is not a problem, there is not much to resolve in transaction phase. The key management is directly related to the physical database design of data warehouse. Redundant Data Ideally, data coming into the data warehouse system should be clean. But the world is not ideal, and this is especially true for big systems. There can exist a multitude of reasons for this problem, but data to be loaded into the data warehouse often has redundant data. The following can be different reasons behind this problem: There exists redundant data in a transactional system or legacy system; This problem was raised during the extraction of data from legacy system; Overlap in the extraction window of data, and The same data file is sent twice. Whatever the reason may be, the solution for this is complex if one has to implement the solution in the transformation phase. Some of these issues should be resolved on the transactional system side or in the way data is commuting between the two systems. But this is not always a solution: most of the time, it s much more expensive to handle this before transformation phase. Thus, data warehouse developers have to resolve this issue.

7 The redundancy problem can be sub-divided into two classes; the solution for both is different. The first class is related to redundancy in the same batch load. Whereas the second is related to resending redundant data, the data itself may not be duplicate but has already been loaded into the data warehouse, so will become redundant if loaded. Both have different solutions: the first one is a self-contained problem, and the solution for the second one needs some kind of database feedback. There may exist scenarios in which both of these exist simultaneously in one batch load. In that case, the first class has more priority then the second class. First Class of Redundant Data As the problem is self-contained, the solution lies in file manipulation. Some dynamic utilities can be written to manipulate files that are already in UWM format. The utilities can open UWM file and find out about the primary key information, then scan data for duplicates and then remove the redundant data. A scheme for removing duplicates can be decided with the discussion between data architects, data warehouse designers and business users. Sometimes, it s a good practice to inform concerned parties about duplication so that they may modify their systems to minimize this problem. The database-oriented solution for this is to first load data into a staging database, and then manipulating this data to remove redundant data. Those of us who have manipulated huge amount of data using queries know that how expensive this solution will be. No matter how one wants to do it, there will exist at least a few full-table scans in order to remove duplicates. A full table scan is the most expensive type of scan in database operations. The following figure shows the solution of this problem graphically:

8 Second Class of Redundant Data This class approaches the redundancy problem in relation to duplication between data to load and data already loaded in database. The solution for this needs a feedback from database. For example: the Extraction phase is generating a file called Primary Key File (PKF) for each table. This file has candidate key information from the transaction system and the mapped primary key for data warehouse. An example of PKF for a patient table is given below. Where AHBC and AHBB are two hospital Ids: This PKF file will also be in UWM file format. A dynamic utility can be developed which will compare the Transactional System candidate keys between data file and PKF file and check if a particular key was already in the PKF file or not. The output of this utility will be a file with data that is not already loaded. An alternative approach is to load this data into the staging database and then compare this data at the time of loading this into data warehouse. The problem with this approach is that again this will require a lot of database operations to compare the data before loading. The most probable approach will be to have a cursor of the staging area data and for each record compare it with the data in the tables of data warehouse. However, a row-by-row traversal on cursor of millions of records and then comparing this with another table of millions of records is a very expensive database operation. The following figure shows the solution of this problem graphically

9 Third Class of Redundant Data If both of the above mentioned problems exist in data to be loaded (for example, there exists redundant data within a data file and part of this was already loaded.) The solution is quite simple: pass this file from the first phase for removing redundancy from a data file, and the second phase will be to create a file with only new data. Resolving Inconsistency in Data Inconsistency is a very common problem in data warehousing where data is coming from different legacy or transactional systems and each of those have there own methods of abbreviation or domain ids. A simple example can be of marital status. The following table shows different marital statuses and ids used by different systems The solution for this is very simple. The domain files for each system that will have mappings from respective system to the desired value must be maintained. The file will be matched against these domain files and the output file will have the required value for each item. The complexity is in creating these domain files. If all possible combinations of data items and their domain values are already known, then it s not a problem for making such a file. However, sometimes this is not known beforehand. For example, consider a case of implementing a health care data warehouse and making a domain file for allergies. Even if it is considered that there are no typos in transactional systems, there exist so many allergies that it s quite difficult to make a domain file in the beginning. In cases like this the domain file will evolve to have all the required values. The utility that is made to map domain values will have to be developed in such a way that whenever there is a new item value it should append that into the domain file. If the domain value for that item is simple enough to generate, then the utility should generate it; otherwise there has to be an involvement from business user. The aforementioned scenarios do not deal with the dirty data problem. What if there exist typos in the data coming to data warehouse? They become important when you are generating some kind of domain information based on descriptive fields. Taking the same example of allergies: what if the data coming to the system is not code based, but the descriptive field is the name of allergy?

10 Making the domain file for such a field is expensive. However, most of the time, it is required for mapping data from different systems. Business users must be involved for resolving discrepancies. The following figure shows the solution for resolving inconsistencies. Primary Key Management This phase will need the PKF file, which was mentioned previously. The keys of the transactional systems have to be mapped on the keys of the data warehouse. The PKF file is generated in the extraction phase, and this phase will use the values from that file for generating new primary keys. After passing through this phase the data files will have data warehouse based keys. If it is important to retain the transaction system s primary key information, then this system will not remove that. Some dimensional tables can be created to map keys so that querying for required data can be accomplished. Foreign Key Management This phase will also need the PKF file. In theory foreign keys should be the primary keys of referenced table. The logic is very much similar to primary key handling. The difference comes in the sense that, in foreign key management, the process is simple mapping from primary keys of the transactional system to primary keys of the data warehouse system. This phase will not generate any new keys. Problem comes when key information is not in PKF files. There can be two approaches to handle this problem. One will be to discard the record; the second approach can be to put some default value or dummy value. Whatever approach is opted for situations like this, data should not be loaded when there is no reference data available. Generally in data warehouses referential integrity is not implemented as part of database, as it will slow the

11 database operations, so handling foreign keys becomes part of the Foreign Key Management phase. Loading This obviously is a database-oriented phase. In this phase a data file is loaded into the table. By this time data has been cleaned and is in ready-to-load form. A simple fast loading utility can load data. As this paper is focused on non-database operations in data warehouses, we will not discuss this phase. Summarization of Data Most of the time, data viewed by business users is in a summarized level. The summarization can be done on different levels. For example, if we consider time as the base of summarization, then summarization can be done on week, month, quarter and year basis. There can also exist a multidimensional summarization in data warehouses. Depending on the amount of data it may be more efficient to unload the data first into the flat file, then summarize the data as per requirements and reload it into different summary tables. Most of the time, data warehouses are implemented in levels. At the base level there exists a centralized Enterprise Data Warehouse and above that there are data marts. OLAP tools and DSS communicate to the data marts rather than EDW. There will exist a layer of Extraction, Transformation and Load between EDW and data marts. This layer will handle the summarization of data. Another approach can be to summarize data at the transformation before even loading data into EDW, and loading this summarized data directly to the data marts. The following figures show both of the above-mentioned approaches. Updating Data Warehouses Ideally, once data is loaded into the data warehouse, there should not be any changes to it. In data warehouses, data should not be modified after it s loaded. As defined, a data warehouse is a

12 time variant system. However, in reality there comes a time when either data is updated in data warehouse or there is a need to remove some portion. There was a purpose in defining data warehouses a time variant system: data warehouses are so large that Deleting Data from Data Warehouses It is very expensive to use database operation to delete data from tables. Imagine removing a couple of thousand records from a table of million rows especially, if these records are random in nature. It s been our observation that if we want to remove more than one percentage of tables with more than a million records, it s cheaper to unload the table, remove the records and reload the table again. Thus, there has to be some kind of logic implemented in the process of deleting data. It is more of an observational decision and varies considerably from resources to resources. Updating Data in Data Warehouses Conceptually updating data is a combination of deletion and insertion. The solution is pretty clear: if it is required to update some data in a data warehouse, those records must be removed from the database using the above-mentioned approach. Then the data must be re-loaded as a new load. Summary Data warehousing is a very expensive solution and should be designed as per requirements. Though most of the commercial databases are tuned to implement very large databases like data warehouses, there exist some limitations that restrict efficient implementation of data warehouses using database operations. One approach is to limit the database operations and use flat file manipulations for data warehousing. Manipulating flat files are less expensive than doing the same thing in data warehouses. It also reduces the batch window in loading the data, which inturn reduces the down time for querying. This allows a data warehouse to remain an effective and time and cost-efficient tool for your organization. About Collabera Collabera is one of the fastest growing end-to-end information technology services and solutions firms worldwide. We work with leading Global 2000 firms from the Financial Services, Manufacturing & Retail, Technology, Communications & Media domains. Collabera delivers highly responsive and innovative solutions that bridge our client's Execution Gaps through our proprietary methodologies - Momentum workshops, ABS (Asset Based Services) and global delivery model - helping them experience accelerated value.

13 With revenues of over USD 300 million, Collabera employs over 4000 professionals across 22 locations and seven world-class delivery centers in the US, Europe, India and China regions. Our Global Delivery model leverages a best-in-class Quality system (including SEI CMMI Level 5 and ISO security certifications) to deliver a full portfolio of services that include Application Development and Management, Independent Testing, Outsourced Product Development, Enterprise Software Solutions, Business Intelligence & Datawarehousing.

14 References Inmon, W.H. What is Data Warehouse? Gupta, Vivek R., Senior Consultant, Services Corporation, Chicago, Illinois. An Introduction to Data Warehousing Heise, David, CIO Andrews University Data Warehousing at Avondale College Goodnight, James, CEO SAS Institute Inc. Data Warehousing: Understanding Its Role in Business Management Architecture Hill, Janelle, Analyst Challenges of Data Preparation for a Data Warehouse Hall, Curt Data Warehousing Issues and Trends McGuff, Frank Designing the Perfect Data Warehouse Gordon, K. I. The Why of Data Standards Do You Really Know Your Data? Kimball, Ralph and Reeves, Laura and Ross, Margy and Thornwaite, Warren The Chess Pieces Larissa Moss Data Cleansing: A Dichotomy of Data Warehousing? Orli, R. and Santos, F. Data Extraction, Transformation, and Migration Tools Orr, Ken

15 Data Warehouse Technology Firestone, Joseph M. Data Warehouses and Data Marts: A Dynamic View Symons, Van Three Tiered Data Warehousing Structure

16

University Data Warehouse Design Issues: A Case Study

University Data Warehouse Design Issues: A Case Study Session 2358 University Data Warehouse Design Issues: A Case Study Melissa C. Lin Chief Information Office, University of Florida Abstract A discussion of the design and modeling issues associated with

More information

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8 Enterprise Solutions Data Warehouse & Business Intelligence Chapter-8 Learning Objectives Concepts of Data Warehouse Business Intelligence, Analytics & Big Data Tools for DWH & BI Concepts of Data Warehouse

More information

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES MUHAMMAD KHALEEL (0912125) SZABIST KARACHI CAMPUS Abstract. Data warehouse and online analytical processing (OLAP) both are core component for decision

More information

Deriving Business Intelligence from Unstructured Data

Deriving Business Intelligence from Unstructured Data International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 9 (2013), pp. 971-976 International Research Publications House http://www. irphouse.com /ijict.htm Deriving

More information

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction

More information

OptiCare A Unique Shared-Services Model

OptiCare A Unique Shared-Services Model OptiCare A Unique Shared-Services Model Copyright Collabera 2008. All rights reserved. Introduction Applications drive a company s business processes and the task of supporting myriad applications is a

More information

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28

www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28 Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT

More information

When to consider OLAP?

When to consider OLAP? When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge

More information

ETL-EXTRACT, TRANSFORM & LOAD TESTING

ETL-EXTRACT, TRANSFORM & LOAD TESTING ETL-EXTRACT, TRANSFORM & LOAD TESTING Rajesh Popli Manager (Quality), Nagarro Software Pvt. Ltd., Gurgaon, INDIA rajesh.popli@nagarro.com ABSTRACT Data is most important part in any organization. Data

More information

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,

More information

Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract

Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract 224 Business Intelligence Journal July DATA WAREHOUSING Ofori Boateng, PhD Professor, University of Northern Virginia BMGT531 1900- SU 2011 Business Intelligence Project Jagir Singh, Greeshma, P Singh

More information

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take

More information

Lection 3-4 WAREHOUSING

Lection 3-4 WAREHOUSING Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing

More information

Speeding ETL Processing in Data Warehouses White Paper

Speeding ETL Processing in Data Warehouses White Paper Speeding ETL Processing in Data Warehouses White Paper 020607dmxwpADM High-Performance Aggregations and Joins for Faster Data Warehouse Processing Data Processing Challenges... 1 Joins and Aggregates are

More information

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,

More information

B.Sc (Computer Science) Database Management Systems UNIT-V

B.Sc (Computer Science) Database Management Systems UNIT-V 1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments.

Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments. Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments Anuraj Gupta Department of Electronics and Communication Oriental Institute

More information

POLAR IT SERVICES. Business Intelligence Project Methodology

POLAR IT SERVICES. Business Intelligence Project Methodology POLAR IT SERVICES Business Intelligence Project Methodology Table of Contents 1. Overview... 2 2. Visualize... 3 3. Planning and Architecture... 4 3.1 Define Requirements... 4 3.1.1 Define Attributes...

More information

Part 22. Data Warehousing

Part 22. Data Warehousing Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem

More information

A Critical Review of Data Warehouse

A Critical Review of Data Warehouse Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin

More information

Data Warehouse Architecture Overview

Data Warehouse Architecture Overview Data Warehousing 01 Data Warehouse Architecture Overview DW 2014/2015 Notice! Author " João Moura Pires (jmp@di.fct.unl.pt)! This material can be freely used for personal or academic purposes without any

More information

Data Management, Analysis Tools, and Analysis Mechanics

Data Management, Analysis Tools, and Analysis Mechanics Chapter 2 Data Management, Analysis Tools, and Analysis Mechanics This chapter explores different tools and techniques for handling data for research purposes. This chapter assumes that a research problem

More information

Turkish Journal of Engineering, Science and Technology

Turkish Journal of Engineering, Science and Technology Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server

More information

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

More information

The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making

The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 10, October 2014,

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1 Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics

More information

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many

More information

Introduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in

Introduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Introduction to Data Warehousing Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Necessity is the mother of invention Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai,

More information

High-Volume Data Warehousing in Centerprise. Product Datasheet

High-Volume Data Warehousing in Centerprise. Product Datasheet High-Volume Data Warehousing in Centerprise Product Datasheet Table of Contents Overview 3 Data Complexity 3 Data Quality 3 Speed and Scalability 3 Centerprise Data Warehouse Features 4 ETL in a Unified

More information

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc. PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions A Technical Whitepaper from Sybase, Inc. Table of Contents Section I: The Need for Data Warehouse Modeling.....................................4

More information

Data Integration and ETL Process

Data Integration and ETL Process Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second

More information

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives

More information

A Survey on Data Warehouse Architecture

A Survey on Data Warehouse Architecture A Survey on Data Warehouse Architecture Rajiv Senapati 1, D.Anil Kumar 2 1 Assistant Professor, Department of IT, G.I.E.T, Gunupur, India 2 Associate Professor, Department of CSE, G.I.E.T, Gunupur, India

More information

Master Data Management. Zahra Mansoori

Master Data Management. Zahra Mansoori Master Data Management Zahra Mansoori 1 1. Preference 2 A critical question arises How do you get from a thousand points of data entry to a single view of the business? We are going to answer this question

More information

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006 Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006 What is a Data Warehouse? A data warehouse is a subject-oriented, integrated, time-varying, non-volatile

More information

Dr. Osama E.Sheta Department of Mathematics (Computer Science) Faculty of Science, Zagazig University Zagazig, Elsharkia, Egypt oesheta75@gmail.

Dr. Osama E.Sheta Department of Mathematics (Computer Science) Faculty of Science, Zagazig University Zagazig, Elsharkia, Egypt oesheta75@gmail. Evaluating a Healthcare Data Warehouse For Cancer Diseases Dr. Osama E.Sheta Department of Mathematics (Computer Science) Faculty of Science, Zagazig University Zagazig, Elsharkia, Egypt oesheta75@gmail.com

More information

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria

More information

CHAPTER 4: BUSINESS ANALYTICS

CHAPTER 4: BUSINESS ANALYTICS Chapter 4: Business Analytics CHAPTER 4: BUSINESS ANALYTICS Objectives Introduction The objectives are: Describe Business Analytics Explain the terminology associated with Business Analytics Describe the

More information

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer

Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer Data Warehousing Overview, Terminology, and Research Issues 1 Heterogeneous Database Integration Integration System World Wide Web Digital Libraries Scientific Databases Personal Databases Collects and

More information

BENEFITS OF AUTOMATING DATA WAREHOUSING

BENEFITS OF AUTOMATING DATA WAREHOUSING BENEFITS OF AUTOMATING DATA WAREHOUSING Introduction...2 The Process...2 The Problem...2 The Solution...2 Benefits...2 Background...3 Automating the Data Warehouse with UC4 Workload Automation Suite...3

More information

IST722 Data Warehousing

IST722 Data Warehousing IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management Content Problems of managing data resources in a traditional file environment Capabilities and value of a database management

More information

Master Data Management and Data Warehousing. Zahra Mansoori

Master Data Management and Data Warehousing. Zahra Mansoori Master Data Management and Data Warehousing Zahra Mansoori 1 1. Preference 2 IT landscape growth IT landscapes have grown into complex arrays of different systems, applications, and technologies over the

More information

ETL Process in Data Warehouse. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

ETL Process in Data Warehouse. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT ETL Process in Data Warehouse G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT Outline ETL Extraction Transformation Loading ETL Overview Extraction Transformation Loading ETL To get data out of

More information

Business Intelligence, Data warehousing Concept and artifacts

Business Intelligence, Data warehousing Concept and artifacts Business Intelligence, Data warehousing Concept and artifacts Data Warehousing is the process of constructing and using the data warehouse. The data warehouse is constructed by integrating the data from

More information

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What

More information

CHAPTER 5: BUSINESS ANALYTICS

CHAPTER 5: BUSINESS ANALYTICS Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse

More information

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc. Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc. Introduction Abstract warehousing has been around for over a decade. Therefore, when you read the articles

More information

Is ETL Becoming Obsolete?

Is ETL Becoming Obsolete? Is ETL Becoming Obsolete? Why a Business-Rules-Driven E-LT Architecture is Better Sunopsis. All rights reserved. The information contained in this document does not constitute a contractual agreement with

More information

SAS BI Course Content; Introduction to DWH / BI Concepts

SAS BI Course Content; Introduction to DWH / BI Concepts SAS BI Course Content; Introduction to DWH / BI Concepts SAS Web Report Studio 4.2 SAS EG 4.2 SAS Information Delivery Portal 4.2 SAS Data Integration Studio 4.2 SAS BI Dashboard 4.2 SAS Management Console

More information

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker

More information

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University Bussiness Intelligence and Data Warehouse Schedule Bussiness Intelligence (BI) BI tools Oracle vs. Microsoft Data warehouse History Tools Oracle vs. Others Discussion Business Intelligence (BI) Products

More information

Topics. Database Essential Concepts. What s s a Good Database System? Using Database Software. Using Database Software. Types of Database Programs

Topics. Database Essential Concepts. What s s a Good Database System? Using Database Software. Using Database Software. Types of Database Programs Topics Software V:. Database concepts: records, fields, data types. Relational and objectoriented databases. Computer maintenance and operation: storage health and utilities; back-up strategies; keeping

More information

Data Warehouse Overview. Srini Rengarajan

Data Warehouse Overview. Srini Rengarajan Data Warehouse Overview Srini Rengarajan Please mute Your cell! Agenda Data Warehouse Architecture Approaches to build a Data Warehouse Top Down Approach Bottom Up Approach Best Practices Case Example

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Advanced Data Management Technologies

Advanced Data Management Technologies ADMT 2015/16 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and Data Warehousing J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:

More information

Traditional BI vs. Business Data Lake A comparison

Traditional BI vs. Business Data Lake A comparison Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses

More information

Original Research Articles

Original Research Articles Original Research Articles Researchers Sweety Patel Department of Computer Science, Fairleigh Dickinson University, USA Email- sweetu83patel@yahoo.com Different Data Warehouse Architecture Creation Criteria

More information

The Data Warehouse ETL Toolkit

The Data Warehouse ETL Toolkit 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. The Data Warehouse ETL Toolkit Practical Techniques for Extracting,

More information

www.dotnetsparkles.wordpress.com

www.dotnetsparkles.wordpress.com Database Design Considerations Designing a database requires an understanding of both the business functions you want to model and the database concepts and features used to represent those business functions.

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

SimCorp Solution Guide

SimCorp Solution Guide SimCorp Solution Guide Data Warehouse Manager For all your reporting and analytics tasks, you need a central data repository regardless of source. SimCorp s Data Warehouse Manager gives you a comprehensive,

More information

Course 103402 MIS. Foundations of Business Intelligence

Course 103402 MIS. Foundations of Business Intelligence Oman College of Management and Technology Course 103402 MIS Topic 5 Foundations of Business Intelligence CS/MIS Department Organizing Data in a Traditional File Environment File organization concepts Database:

More information

SENG 520, Experience with a high-level programming language. (304) 579-7726, Jeff.Edgell@comcast.net

SENG 520, Experience with a high-level programming language. (304) 579-7726, Jeff.Edgell@comcast.net Course : Semester : Course Format And Credit hours : Prerequisites : Data Warehousing and Business Intelligence Summer (Odd Years) online 3 hr Credit SENG 520, Experience with a high-level programming

More information

JOURNAL OF OBJECT TECHNOLOGY

JOURNAL OF OBJECT TECHNOLOGY JOURNAL OF OBJECT TECHNOLOGY Online at www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2008 Vol. 7, No. 8, November-December 2008 What s Your Information Agenda? Mahesh H. Dodani,

More information

Business Intelligence: Effective Decision Making

Business Intelligence: Effective Decision Making Business Intelligence: Effective Decision Making Bellevue College Linda Rumans IT Instructor, Business Division Bellevue College lrumans@bellevuecollege.edu Current Status What do I do??? How do I increase

More information

DATA MINING AND WAREHOUSING CONCEPTS

DATA MINING AND WAREHOUSING CONCEPTS CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation

More information

OLAP Theory-English version

OLAP Theory-English version OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction

More information

Bringing Big Data into the Enterprise

Bringing Big Data into the Enterprise Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?

More information

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days Three Days Prerequisites Students should have at least some experience with any relational database management system. Who Should Attend This course is targeted at technical staff, team leaders and project

More information

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Systems: Foundations and Architectures Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository

More information

Extraction Transformation Loading ETL Get data out of sources and load into the DW

Extraction Transformation Loading ETL Get data out of sources and load into the DW Lection 5 ETL Definition Extraction Transformation Loading ETL Get data out of sources and load into the DW Data is extracted from OLTP database, transformed to match the DW schema and loaded into the

More information

How to Enhance Traditional BI Architecture to Leverage Big Data

How to Enhance Traditional BI Architecture to Leverage Big Data B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...

More information

Database Management System Dr. S. Srinath Department of Computer Science & Engineering Indian Institute of Technology, Madras Lecture No.

Database Management System Dr. S. Srinath Department of Computer Science & Engineering Indian Institute of Technology, Madras Lecture No. Database Management System Dr. S. Srinath Department of Computer Science & Engineering Indian Institute of Technology, Madras Lecture No. # 31 Introduction to Data Warehousing and OLAP Part 2 Hello and

More information

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2 Class Announcements TIM 50 - Business Information Systems Lecture 15 Database Assignment 2 posted Due Tuesday 5/26 UC Santa Cruz May 19, 2015 Database: Collection of related files containing records on

More information

Data Vault and The Truth about the Enterprise Data Warehouse

Data Vault and The Truth about the Enterprise Data Warehouse Data Vault and The Truth about the Enterprise Data Warehouse Roelant Vos 04-05-2012 Brisbane, Australia Introduction More often than not, when discussion about data modeling and information architecture

More information

ETL as a Necessity for Business Architectures

ETL as a Necessity for Business Architectures Database Systems Journal vol. IV, no. 2/2013 3 ETL as a Necessity for Business Architectures Aurelian TITIRISCA University of Economic Studies, Bucharest, Romania aureliantitirisca@yahoo.com Today, the

More information

A Design Technique: Data Integration Modeling

A Design Technique: Data Integration Modeling C H A P T E R 3 A Design Technique: Integration ing This chapter focuses on a new design technique for the analysis and design of data integration processes. This technique uses a graphical process modeling

More information

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on

More information

Integrating SAP and non-sap data for comprehensive Business Intelligence

Integrating SAP and non-sap data for comprehensive Business Intelligence WHITE PAPER Integrating SAP and non-sap data for comprehensive Business Intelligence www.barc.de/en Business Application Research Center 2 Integrating SAP and non-sap data Authors Timm Grosser Senior Analyst

More information

Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring

Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty

More information

Indexing Techniques for Data Warehouses Queries. Abstract

Indexing Techniques for Data Warehouses Queries. Abstract Indexing Techniques for Data Warehouses Queries Sirirut Vanichayobon Le Gruenwald The University of Oklahoma School of Computer Science Norman, OK, 739 sirirut@cs.ou.edu gruenwal@cs.ou.edu Abstract Recently,

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

MS SQL Performance (Tuning) Best Practices:

MS SQL Performance (Tuning) Best Practices: MS SQL Performance (Tuning) Best Practices: 1. Don t share the SQL server hardware with other services If other workloads are running on the same server where SQL Server is running, memory and other hardware

More information

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS DATA WAREHOUSE CONCEPTS A fundamental concept of a data warehouse is the distinction between data and information. Data is composed of observable and recordable facts that are often found in operational

More information

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach 2006 ISMA Conference 1 Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach Priya Lobo CFPS Satyam Computer Services Ltd. 69, Railway Parallel Road, Kumarapark West, Bangalore 560020,

More information

Chapter 5. Learning Objectives. DW Development and ETL

Chapter 5. Learning Objectives. DW Development and ETL Chapter 5 DW Development and ETL Learning Objectives Explain data integration and the extraction, transformation, and load (ETL) processes Basic DW development methodologies Describe real-time (active)

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,

More information

Designing a Dimensional Model

Designing a Dimensional Model Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and

More information

CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS

CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS In today's scenario data warehouse plays a crucial role in order to perform important operations. Different indexing techniques has been used and analyzed using

More information

Fluency With Information Technology CSE100/IMT100

Fluency With Information Technology CSE100/IMT100 Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

More information

ETL PROCESS IN DATA WAREHOUSE

ETL PROCESS IN DATA WAREHOUSE ETL PROCESS IN DATA WAREHOUSE OUTLINE ETL : Extraction, Transformation, Loading Capture/Extract Scrub or data cleansing Transform Load and Index ETL OVERVIEW Extraction Transformation Loading ETL ETL is

More information

Data warehousing with PostgreSQL

Data warehousing with PostgreSQL Data warehousing with PostgreSQL Gabriele Bartolini http://www.2ndquadrant.it/ European PostgreSQL Day 2009 6 November, ParisTech Telecom, Paris, France Audience

More information

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc. Data Warehouses Data warehousing is the process of constructing and using a data warehouse. A data warehouse is constructed by integrating data from multiple heterogeneous sources that support analytical

More information

PartJoin: An Efficient Storage and Query Execution for Data Warehouses

PartJoin: An Efficient Storage and Query Execution for Data Warehouses PartJoin: An Efficient Storage and Query Execution for Data Warehouses Ladjel Bellatreche 1, Michel Schneider 2, Mukesh Mohania 3, and Bharat Bhargava 4 1 IMERIR, Perpignan, FRANCE ladjel@imerir.com 2

More information

Data Warehousing Concepts

Data Warehousing Concepts Data Warehousing Concepts JB Software and Consulting Inc 1333 McDermott Drive, Suite 200 Allen, TX 75013. [[[[[ DATA WAREHOUSING What is a Data Warehouse? Decision Support Systems (DSS), provides an analysis

More information