Data warehouse design

Size: px
Start display at page:

Download "Data warehouse design"

Transcription

1 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Data warehouse design DATA WAREHOUSE: DESIGN - 1 Risk factors Database and data mining group, High user expectation the data warehouse is the solution of the company s problems Data and OLTP process quality incomplete or unreliable data non integrated or non optimized business processes Political management of the project cooperation with information owners system acceptance by end users deployment appropriate training DATA WAREHOUSE: DESIGN - 2 Pag. 1

2 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Top-down approach the data warehouse provides a global and complete representation of business data significant cost and time consuming implementation complex analysis and design tasks Bottom-up approach incremental growth of the data warehouse, by adding data marts on specific business areas separately focused on specific business areas limited cost and delivery time easy to perform intermediate checks DATA WAREHOUSE: DESIGN - 3 Database and data mining group, Business Dimensional Lifecycle Planning (Kimball) Requirement definition Project management Architecture design Product selection and installation TECHNOLOGY Dimensional modeling Physiscal design Feeding design and implementation DATA User Application Analysis User Application Development APPLICATIONS Deployment Maintenance DATA WAREHOUSE: DESIGN - 4 warehouse, teoria e pratica della progettazione, McGraw Hill 2006 Pag. 2

3 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Data mart design user requirements reconciled schema Database and data mining group, CONCEPTUAL DESIGN workload data volume logical model operational source schemas RECONCILIATION reconciled schema fact schema LOGICAL DESIGN logical schema workload data volume DBMS FEEDING DESIGN PHYSICAL DESIGN feeding schema physical schema DATA WAREHOUSE: DESIGN - 5 Requirement analysis Database and data mining group, It collects data analysis requirements to be supported by the data mart implementation constraints due to existing information systems Requirement sources business users operational system administrators The first selected data mart is crucial for the company feeded by (few) reliable sources DATA WAREHOUSE: DESIGN - 6 Pag. 3

4 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Application requirements Description of relevant events (facts) each fact represents a of events which are relevant for the company examples: (in the CRM domain) complaints, services characterized by descriptive dimensions (setting the granularity), history span, relevant measures informations are gathered in a glossary Workload description periodical business reports queries expressed in natural language example: number of complaints for each product in the last month DATA WAREHOUSE: DESIGN - 7 Structural requirements Database and data mining group, Feeding periodicity Available space for data derived data (indices, materialized views) System architecture level number dependent or independent data marts Deployment planning start up training DATA WAREHOUSE: DESIGN - 8 Pag. 4

5 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Conceptual design DATA WAREHOUSE: DESIGN - 9 Conceptual design Database and data mining group, No currently adopted modeling formalism ER model not adequate Dimensional Fact Model (Golfarelli, Rizzi) graphical model supporting conceptual design for a given fact, it defines a fact schema modelling dimensions hierarchies measures it provides design documentation both for requirement review with users, and after deployment DATA WAREHOUSE: DESIGN - 10 Pag. 5

6 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Dimensional Fact Model Database and data mining group, Fact it models a set of relevant events (sales, shippings, complaints) it evolves with time Dimension it describes the analysis coordinates of a fact (e.g., each sale is described by the sale, the and the sold product) it is characterized by many, typically categorical, attributes Measure it describes a numerical property of a fact (e.g., each sale is characterized by a sold quantity) aggregates are frequently performed on measures product fact dimension DATA WAREHOUSE: DESIGN - 11 SALE sold quantity sale amount number of customers unit price measure DFM: Hierarchy Database and data mining group, Each dimension can have a set of associated attributes The attributes describe the dimension at different abstraction levels and can be structured as a hierarchy The hierarchy represents a generalization relationship among a subset of attributes in a dimension (e.g., geografic hierarchy for the dimension) The hierarchy represents a functional dependency (1:n relationship) quarter month day holiday week DATA WAREHOUSE: DESIGN - 12 department marketing group type brand city product SALE brand sold quantity sale amount number of customers unit price sale manager sale district city dimension attribute region country hierarchy Pag. 6

7 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Comparison with ER Database and data mining group, department DEPARTMENT quarter month day holiday week department marketing group type brand city product brand SALE sold quantity sale amount number of customers unit price sale manager sale district city region country country region city COUNTRY REGION CITY MARKETING marketing GROUP YEAR CATEGORY BRAND Brand city CITY QUARTER type TYPE BRAND brand MONTH product PRODUCT quarter month SHOP (0,n) (0,n) SALE (0,n) DATE SALES MGR. sales mgr DATA WAREHOUSE: DESIGN - 13 SALES DISTRICT sale district sold qty sale amount cust. num. unit price HOLIDAY holiday WEEK DAY day week Aggregation Database and data mining group, Events can be aggregated based on the values of the attributes along the hierarchies Measures for the aggregated event are obtained by aggregating the measures of the corresponding events in the original fact schema standard aggregate operators: SUM, MIN, MAX, AVG, COUNT Aggregation computes measures with a coarser granularity than those in the original fact schema detail reduction is usually obtained by climbing a hierarchy DATA WAREHOUSE: DESIGN - 14 Pag. 7

8 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Aggregation Database and data mining group, quart. I 99 II 99 III 99 IV 99 I 00 II 00III 00IV 00 type product Brillo Washing r home Sbianco powder cleaning Lucido soap Manipulite Scent Latte F Slurp milk Latte U Slurp food Yogurt Slurp soda Bevimi Colissima Measure: sold quantity quart. I 99 II 99 III 99 IV 99 I 00 II 00 III 00 IV 00 home clean food DATA WAREHOUSE: DESIGN - 15 warehouse, teoria e pratica della Aggregation Database and data mining group, quart. I 99 II 99 III 99 IV 99 I 00 II 00III 00IV 00 type product Brillo Washing r home Sbianco powder cleaning Lucido soap Manipulite Scent Latte F Slurp milk Latte U Slurp food Yogurt Slurp soda Bevimi Colissima Measure: sold quantity quart. I 99 II 99 III 99 IV 99 I 00 II 00 III 00 IV 00 home clean food type home p. washing cleaning soap food milk soda DATA WAREHOUSE: DESIGN - 16 warehouse, teoria e pratica della Pag. 8

9 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Aggregation Database and data mining group, quart. I 99 II 99 III 99 IV 99 I 00 II 00III 00IV 00 type product Brillo Washing r home Sbianco powder cleaning Lucido soap Manipulite Scent Latte F Slurp milk Latte U Slurp food Yogurt Slurp soda Bevimi Colissima Measure: sold quantity quart. I 99 II 99 III 99 IV 99 I 00 II 00 III 00 IV 00 home clean food type home p. washing cleaning soap food milk soda DATA WAREHOUSE: DESIGN - 17 home clean food warehouse, teoria e pratica della additive: Measure characteristics Database and data mining group, can be aggregated along a given hierarchy by means of the SUM operator not additive: cannot be aggregated along a given hierarchy by means of the SUM operator not aggregable: cannot be aggregated along any hierarchy by means of any aggregate operator DATA WAREHOUSE: DESIGN - 18 Pag. 9

10 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Aggregation Database and data mining group, non-additivity quarter month day holiday week department marketing group type brand city product brand SALE sold quantity sale amount number of customers unit price (AVG) sale manager sale district city region country DATA WAREHOUSE: DESIGN - 19 Measure classification Stream measures can be evaluated cumulatively at the end of a time period can be aggregated by means of all standard operators examples: sold quantity, sale amount Level measures evaluated at a given time (snapshot) not additive along the time dimension examples: inventory level, account balance Unit measures evaluated at a given time and expressed in relative terms not additive along any dimension examples: unit price of a product Database and data mining group, DATA WAREHOUSE: DESIGN - 20 Pag. 10

11 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Aggregate operators Database and data mining group, Distributive can always compute higher level aggregations from more detailed data examples: sum, min, max Algebraic can compute higher level aggregations from more detailed data only when supplementary support measures are available examples: avg (it requires count) Olistic can not compute higher level aggregations from more detailed data examples: mode, median DATA WAREHOUSE: DESIGN - 21 Database and data mining group, Distributive operator: SUM quart. I 99 II 99 III 99 IV 99 I 00 II 00III 00IV 00 type product Brillo washing r home Sbianco powder cleaning Lucido soap Manipulite Scent Measure: sold quantity Latte F Slurp milk Latte U Slurp food Yogurt Slurp soda Bevimi Colissima quart. I 99 II 99 III 99 IV 99 I 00 II 00 III 00 IV 00 home clean food home clean food DATA WAREHOUSE: DESIGN - 22 type home p. washing cleaning soap food milk soda warehouse, teoria e pratica della Pag. 11

12 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Algebraic operator: AVG Database and data mining group, 1999 quart. I 99 II 99 III 99 IV 99 type product Brillo 2 2 2,2 2,5 washing Sbianco 1,5 1,5 2 2,5 home powder Lucido cleaning Manipulite 1 1,2 1,5 1,5 soap Scent 1,5 1,5 2 Measure: unit price 1999 quart. I 99 II 99 III 99 IV 99 type home wash. p. 1,75 2,17 2,40 2,67 cleaning soap 1,25 1,35 1,75 1,50 avg: 1,50 1,76 2,08 2, quart. I 99 II 99 III 99 IV 99 home clean. 1,50 1,84 2,14 2,38 DATA WAREHOUSE: DESIGN - 23 warehouse, teoria e pratica della Advanced DFM Database and data mining group, descritptive attribute quarter month dept. manager manager marketing department group brand city weight type brand product holiday day diet sales manager sales district week SALE sold quantity sale amount number of customers unit price (AVG) region city address phone country descritptive attribute DATA WAREHOUSE: DESIGN - 24 Pag. 12

13 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Advanced DFM Database and data mining group, quarter month dept. manager manager marketing department group optional edge brand city weight type brand product holiday day diet sales manager sales district week SALE sold quantity sale amount number of customers unit price (AVG) region city address phone country start end cost promotion discount advertisement optional dimension DATA WAREHOUSE: DESIGN - 25 quarter month Advanced DFM dept. manager manager marketing department group brand city weight type brand product holiday day diet sales manager sales district week SALE sold quantity sale amount number of customers unit price (AVG) Database and data mining group, country region city address phone convergence start end cost promotion discount advertisement DATA WAREHOUSE: DESIGN - 26 Pag. 13

14 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Advanced DFM Database and data mining group, shared hierarchy caller usage. caller district called district called usage caller number called number PHONE CALL number length hour month district usage phone number caller called PHONE CALL number length hour month role product SHIPPING number cost warehouse city region country order customer ship month shared hierarchy shared hierarchy DATA WAREHOUSE: DESIGN - 27 Advanced DFM Database and data mining group, genre SALE author book number amount month multiple edge diagnosis ward ADMISSION cost patient name surname gender income level city birth diagnosis diagnosis group ward name surname ADMISSION gender cost patient income level city birth DATA WAREHOUSE: DESIGN - 28 Pag. 14

15 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Factless fact schema Database and data mining group, Some events are not characterized by measures empty (i.e., factless) fact schema it records occurrence of an event Used for counting occurred events (e.g., course attendance) representing a coverage set (e.g., promotions which took place) semester address name nationality age student ATTENDANCE (COUNT) course area school gender DATA WAREHOUSE: DESIGN - 29 Representing time DATA WAREHOUSE: DESIGN - 30 Database and data mining group, Data modification over time is explicitly represented by event occurrences time dimension events stored as facts Also dimensions may change over time modifications are typically slower slowly changing dimension [Kimball] examples: client demographic data, product description if required, dimension evolution should be explicitly modeled Pag. 15

16 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, How to represent time (type I) Snapshot of the current value data is overwritten with the current value it overrides the past with the current situation used when an explicit representation of the data change is not needed example customer Mario Rossi changes marital status after marriage all his purchases correspond to the married customer Also known as type I DATA WAREHOUSE: DESIGN - 31 Database and data mining group, How to represent time (type II) Events are related to the temporally corresponding dimension value after each state change in a dimension a new dimension instance is created new events are related to the new dimension instance events are partitioned after the changes in dimensional attributes example customer Mario Rossi changes marital status after marriage his purchases are partitioned in purchases performed by unmarried Mario Rossi and purchases performed by married Mario Rossi (a new instance of Mario Rossi) Also known as type II DATA WAREHOUSE: DESIGN - 32 Pag. 16

17 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, How to represent time (type III) All events are mapped to a dimension value sampled at a given time it requires the explicit management of dimension changes during time the dimension schema is modified by introducing two timestamps: validity start and validity end a new attribute which allows identifying the sequence of modifications on a given instance (e.g., a master attribute pointing to the root instance) each state change in the dimension requires the creation of a new instance example customer Mario Rossi changes marital status after marriage validity end timestamp of first Mario Rossi instance is given by the marriage validity start timestamp of the new instance is the same day purchases are partitioned as in type II a new attribute allows tracking all changes of Mario Rossi instance Also known as type III DATA WAREHOUSE: DESIGN - 33 Workload Database and data mining group, Workload defined by standard reports approximate estimates discussed with users Actual workload difficult to evaluate at design time if the data warehouse succeeds, user and query number may grow query type may vary over time Data warehouse tuning performed after system deployment requires monitoring the actual system workload DATA WAREHOUSE: DESIGN - 34 Pag. 17

18 DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Data volume Database and data mining group, Estimation of the space required by the data mart for data for derived data (indices, materialized views) To be considered event cardinality for each fact domain cardinality (number of distinct values) for hierarchy attributes attribute length It depends on the temporal span of data storage Sparsity occurred events are not all combinations of the dimension elements example: the percentage of products actually sold in each and day is roughly 10% of all combinations DATA WAREHOUSE: DESIGN - 35 Sparsity Database and data mining group, It decreases with increasing data aggregation level May significantly affect the accuracy in estimating aggregated data cardinality DATA WAREHOUSE: DESIGN - 36 Pag. 18

Data warehouse Architectures and processes

Data warehouse Architectures and processes Database and data mining group, Data warehouse Architectures and processes DATA WAREHOUSE: ARCHITECTURES AND PROCESSES - 1 Database and data mining group, Data warehouse architectures Separation between

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

Data Warehouse Logical Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato)

Data Warehouse Logical Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Warehouse Logical Design Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Mart logical models MOLAP (Multidimensional On-Line Analytical Processing) stores data

More information

Data Warehouse Design

Data Warehouse Design Data Warehouse Design Modern Principles and Methodologies Matteo Golfarelli Stefano Rizzi Translated by Claudio Pagliarani Mc Grauu Hill New York Chicago San Francisco Lisbon London Madrid Mexico City

More information

Multidimensional Modeling - Stocks

Multidimensional Modeling - Stocks Bases de Dados e Data Warehouse 06 BDDW 2006/2007 Notice! Author " João Moura Pires ([email protected])! This material can be freely used for personal or academic purposes without any previous authorization

More information

THE DIMENSIONAL FACT MODEL: A CONCEPTUAL MODEL FOR DATA WAREHOUSES 1

THE DIMENSIONAL FACT MODEL: A CONCEPTUAL MODEL FOR DATA WAREHOUSES 1 THE DIMENSIONAL FACT MODEL: A CONCEPTUAL MODEL FOR DATA WAREHOUSES 1 MATTEO GOLFARELLI, DARIO MAIO and STEFANO RIZZI DEIS - Università di Bologna, Viale Risorgimento 2, 40136 Bologna, Italy {mgolfarelli,dmaio,srizzi}@deis.unibo.it

More information

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design COURSE OUTLINE Track 1 Advanced Data Modeling, Analysis and Design TDWI Advanced Data Modeling Techniques Module One Data Modeling Concepts Data Models in Context Zachman Framework Overview Levels of Data

More information

Designing a Dimensional Model

Designing a Dimensional Model Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and

More information

Data warehouse life-cycle and design

Data warehouse life-cycle and design SYNONYMS Data Warehouse design methodology Data warehouse life-cycle and design Matteo Golfarelli DEIS University of Bologna Via Sacchi, 3 Cesena Italy [email protected] DEFINITION The term data

More information

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on

More information

DATA WAREHOUSING AND OLAP TECHNOLOGY

DATA WAREHOUSING AND OLAP TECHNOLOGY DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are

More information

Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Physical Design. Phases of database design. Physical design: Inputs.

Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Physical Design. Phases of database design. Physical design: Inputs. Phases of database design Application requirements Conceptual design Database Management Systems Conceptual schema Logical design ER or UML Physical Design Relational tables Logical schema Physical design

More information

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 [email protected]

More information

Dimensional Data Modeling for the Data Warehouse

Dimensional Data Modeling for the Data Warehouse Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Dimensional Data Modeling for the Data Warehouse Prerequisites Students should

More information

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina Data Warehousing Read chapter 13 of Riguzzi et al Sistemi Informativi Slides derived from those by Hector Garcia-Molina What is a Warehouse? Collection of diverse data subject oriented aimed at executive,

More information

1. Dimensional Data Design - Data Mart Life Cycle

1. Dimensional Data Design - Data Mart Life Cycle 1. Dimensional Data Design - Data Mart Life Cycle 1.1. Introduction A data mart is a persistent physical store of operational and aggregated data statistically processed data that supports businesspeople

More information

Fluency With Information Technology CSE100/IMT100

Fluency With Information Technology CSE100/IMT100 Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

More information

Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance

Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance Brochure More information from http://www.researchandmarkets.com/reports/2248199/ Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance Description: - This is the first book to provide

More information

Advanced Data Management Technologies

Advanced Data Management Technologies ADMT 2015/16 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and Data Warehousing J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:

More information

Report on the Dagstuhl Seminar Data Quality on the Web

Report on the Dagstuhl Seminar Data Quality on the Web Report on the Dagstuhl Seminar Data Quality on the Web Michael Gertz M. Tamer Özsu Gunter Saake Kai-Uwe Sattler U of California at Davis, U.S.A. U of Waterloo, Canada U of Magdeburg, Germany TU Ilmenau,

More information

The Data Warehouse ETL Toolkit

The Data Warehouse ETL Toolkit 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. The Data Warehouse ETL Toolkit Practical Techniques for Extracting,

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1 Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics

More information

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration DW Source Integration, Tools, and Architecture Overview DW Front End Tools Source Integration DW architecture Original slides were written by Torben Bach Pedersen Aalborg University 2007 - DWML course

More information

DATA WAREHOUSE DESIGN

DATA WAREHOUSE DESIGN DATA WAREHOUSE DESIGN ICDE 2001 Tutorial Stefano Rizzi, Matteo Golfarelli DEIS - University of Bologna, Italy 1 Motivation Building a data warehouse for an enterprise is a huge and complex task, which

More information

Dimodelo Solutions Data Warehousing and Business Intelligence Concepts

Dimodelo Solutions Data Warehousing and Business Intelligence Concepts Dimodelo Solutions Data Warehousing and Business Intelligence Concepts Copyright Dimodelo Solutions 2010. All Rights Reserved. No part of this document may be reproduced without written consent from the

More information

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key

More information

PeopleSoft EPM 9.1 Supply Chain Management Warehouse PeopleBook

PeopleSoft EPM 9.1 Supply Chain Management Warehouse PeopleBook PeopleSoft EPM 9.1 Supply Chain Management Warehouse PeopleBook April 2012 PeopleSoft EPM 9.1 Supply Chain Management Warehouse PeopleBook SKU epm91pscwb0412 Copyright 1988, 2012, Oracle and/or its affiliates.

More information

Basics of Dimensional Modeling

Basics of Dimensional Modeling Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimensional

More information

Lection 3-4 WAREHOUSING

Lection 3-4 WAREHOUSING Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing

More information

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application

More information

Customer-Centric Data Warehouse, design issues. Announcements

Customer-Centric Data Warehouse, design issues. Announcements CRM Data Warehouse Announcements Assignment 2 is on the subject web site Students must form assignment groups ASAP: refer to the assignment for details 2 -Centric Data Warehouse, design issues Data modelling

More information

The Data Quality Continuum*

The Data Quality Continuum* The Data Quality Continuum* * Adapted from the KDD04 tutorial by Theodore Johnson e Tamraparni Dasu, AT&T Labs Research The Data Quality Continuum Data and information is not static, it flows in a data

More information

Data Warehouse (DW) Maturity Assessment Questionnaire

Data Warehouse (DW) Maturity Assessment Questionnaire Data Warehouse (DW) Maturity Assessment Questionnaire Catalina Sacu - [email protected] Marco Spruit [email protected] Frank Habers [email protected] September, 2010 Technical Report UU-CS-2010-021

More information

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days Three Days Prerequisites Students should have at least some experience with any relational database management system. Who Should Attend This course is targeted at technical staff, team leaders and project

More information

Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification

Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Outline More Complex SQL Retrieval Queries

More information

Sterling Business Intelligence

Sterling Business Intelligence Sterling Business Intelligence Concepts Guide Release 9.0 March 2010 Copyright 2009 Sterling Commerce, Inc. All rights reserved. Additional copyright information is located on the documentation library:

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management Content Problems of managing data resources in a traditional file environment Capabilities and value of a database management

More information

SAP InfiniteInsight Explorer Analytical Data Management v7.0

SAP InfiniteInsight Explorer Analytical Data Management v7.0 End User Documentation Document Version: 1.0-2014-11 SAP InfiniteInsight Explorer Analytical Data Management v7.0 User Guide CUSTOMER Table of Contents 1 Welcome to this Guide... 3 1.1 What this Document

More information

Data Warehousing. Jens Teubner, TU Dortmund [email protected]. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1 Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund [email protected] Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview

More information

How To Write A Diagram

How To Write A Diagram Data Model ing Essentials Third Edition Graeme C. Simsion and Graham C. Witt MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF ELSEVIER AMSTERDAM BOSTON LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE

More information

DATA WAREHOUSE E KNOWLEDGE DISCOVERY

DATA WAREHOUSE E KNOWLEDGE DISCOVERY DATA WAREHOUSE E KNOWLEDGE DISCOVERY Prof. Fabio A. Schreiber Dipartimento di Elettronica e Informazione Politecnico di Milano DATA WAREHOUSE (DW) A TECHNIQUE FOR CORRECTLY ASSEMBLING AND MANAGING DATA

More information

2 Associating Facts with Time

2 Associating Facts with Time TEMPORAL DATABASES Richard Thomas Snodgrass A temporal database (see Temporal Database) contains time-varying data. Time is an important aspect of all real-world phenomena. Events occur at specific points

More information

CHAPTER 4 Data Warehouse Architecture

CHAPTER 4 Data Warehouse Architecture CHAPTER 4 Data Warehouse Architecture 4.1 Data Warehouse Architecture 4.2 Three-tier data warehouse architecture 4.3 Types of OLAP servers: ROLAP versus MOLAP versus HOLAP 4.4 Further development of Data

More information

POLAR IT SERVICES. Business Intelligence Project Methodology

POLAR IT SERVICES. Business Intelligence Project Methodology POLAR IT SERVICES Business Intelligence Project Methodology Table of Contents 1. Overview... 2 2. Visualize... 3 3. Planning and Architecture... 4 3.1 Define Requirements... 4 3.1.1 Define Attributes...

More information

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse? Outline Data Warehousing What is a data warehouse? Why a warehouse? Models & operations Implementing a warehouse 2 What is a Warehouse? Collection of diverse data subject oriented aimed at executive, decision

More information

Data W a Ware r house house and and OLAP Week 5 1

Data W a Ware r house house and and OLAP Week 5 1 Data Warehouse and OLAP Week 5 1 Midterm I Friday, March 4 Scope Homework assignments 1 4 Open book Team Homework Assignment #7 Read pp. 121 139, 146 150 of the text book. Do Examples 3.8, 3.10 and Exercise

More information

Optimizing Your Data Warehouse Design for Superior Performance

Optimizing Your Data Warehouse Design for Superior Performance Optimizing Your Data Warehouse Design for Superior Performance Lester Knutsen, President and Principal Database Consultant Advanced DataTools Corporation Session 2100A The Problem The database is too complex

More information

Data Warehousing with Oracle

Data Warehousing with Oracle Data Warehousing with Oracle Comprehensive Concepts Overview, Insight, Recommendations, Best Practices and a whole lot more. By Tariq Farooq A BrainSurface Presentation What is a Data Warehouse? Designed

More information

Data Warehousing and OLAP

Data Warehousing and OLAP 1 Data Warehousing and OLAP Hector Garcia-Molina Stanford University Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots

More information

OLAP Theory-English version

OLAP Theory-English version OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction

More information

Data Warehouse Management Using SAP BW Alexander Prosser

Data Warehouse Management Using SAP BW Alexander Prosser Data Warehouse Management Using SAP BW Alexander Prosser Overview What problems does a data warehouse answer? SEITE 2 Overview CEO: We may have a problem in procurement. I have a feeling that we employ

More information

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing Class Projects Class projects are going very well! Project presentations: 15 minutes On Wednesday

More information

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing Database Applications Advanced Querying Transaction processing Online setting Supports day-to-day operation of business OLAP Data Warehousing Decision support Offline setting Strategic planning (statistics)

More information

Data Warehouse design

Data Warehouse design Data Warehouse design Design of Enterprise Systems University of Pavia 11/11/2013-1- Data Warehouse design DATA MODELLING - 2- Data Modelling Important premise Data warehouses typically reside on a RDBMS

More information

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc. Data Warehouses Data warehousing is the process of constructing and using a data warehouse. A data warehouse is constructed by integrating data from multiple heterogeneous sources that support analytical

More information

Establish and maintain Center of Excellence (CoE) around Data Architecture

Establish and maintain Center of Excellence (CoE) around Data Architecture Senior BI Data Architect - Bensenville, IL The Company s Information Management Team is comprised of highly technical resources with diverse backgrounds in data warehouse development & support, business

More information

Database Design Patterns. Winter 2006-2007 Lecture 24

Database Design Patterns. Winter 2006-2007 Lecture 24 Database Design Patterns Winter 2006-2007 Lecture 24 Trees and Hierarchies Many schemas need to represent trees or hierarchies of some sort Common way of representing trees: An adjacency list model Each

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes Final Exam Overview Open books and open notes No laptops and no other mobile devices

More information

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann Trivadis White Paper Comparison of Data Modeling Methods for a Core Data Warehouse Dani Schnider Adriano Martino Maren Eschermann June 2014 Table of Contents 1. Introduction... 3 2. Aspects of Data Warehouse

More information

Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University

Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University Data Warehousing and Decision Support Torben Bach Pedersen Department of Computer Science Aalborg University Talk Overview Data warehousing and decision support basics Definition Applications Multidimensional

More information

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate

More information

Week 13: Data Warehousing. Warehousing

Week 13: Data Warehousing. Warehousing 1 Week 13: Data Warehousing Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots of buzzwords, hype slice & dice, rollup,

More information

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Systems: Foundations and Architectures Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository

More information

Building an Effective Data Warehouse Architecture James Serra

Building an Effective Data Warehouse Architecture James Serra Building an Effective Data Warehouse Architecture James Serra Global Sponsors: About Me Business Intelligence Consultant, in IT for 28 years Owner of Serra Consulting Services, specializing in end-to-end

More information

BENEFITS OF AUTOMATING DATA WAREHOUSING

BENEFITS OF AUTOMATING DATA WAREHOUSING BENEFITS OF AUTOMATING DATA WAREHOUSING Introduction...2 The Process...2 The Problem...2 The Solution...2 Benefits...2 Background...3 Automating the Data Warehouse with UC4 Workload Automation Suite...3

More information

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

Data Warehouse Logical Modeling and Design (6)

Data Warehouse Logical Modeling and Design (6) Data Warehouse Logical Modeling and Design (6) Bernard ESPINASSE Professeur à Aix-Marseille Université (AMU) Ecole Polytechnique Universitaire de Marseille October 2013 Methodological framework Logical

More information

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals 1 Properties of a Database 1 The Database Management System (DBMS) 2 Layers of Data Abstraction 3 Physical Data Independence 5 Logical

More information

Design of a Multi Dimensional Database for the Archimed DataWarehouse

Design of a Multi Dimensional Database for the Archimed DataWarehouse 169 Design of a Multi Dimensional Database for the Archimed DataWarehouse Claudine Bréant, Gérald Thurler, François Borst, Antoine Geissbuhler Service of Medical Informatics University Hospital of Geneva,

More information

MS-50401 - Designing and Optimizing Database Solutions with Microsoft SQL Server 2008

MS-50401 - Designing and Optimizing Database Solutions with Microsoft SQL Server 2008 MS-50401 - Designing and Optimizing Database Solutions with Microsoft SQL Server 2008 Table of Contents Introduction Audience At Completion Prerequisites Microsoft Certified Professional Exams Student

More information

Data warehousing. Han, J. and M. Kamber. Data Mining: Concepts and Techniques. 2001. Morgan Kaufmann.

Data warehousing. Han, J. and M. Kamber. Data Mining: Concepts and Techniques. 2001. Morgan Kaufmann. Data warehousing Han, J. and M. Kamber. Data Mining: Concepts and Techniques. 2001. Morgan Kaufmann. KDD process Application Pattern Evaluation Data Mining Task-relevant Data Data Warehouse Selection Data

More information

Practical Considerations for Real-Time Business Intelligence. Donovan Schneider Yahoo! September 11, 2006

Practical Considerations for Real-Time Business Intelligence. Donovan Schneider Yahoo! September 11, 2006 Practical Considerations for Real-Time Business Intelligence Donovan Schneider Yahoo! September 11, 2006 Outline Business Intelligence (BI) Background Real-Time Business Intelligence Examples Two Requirements

More information

OLAP Systems and Multidimensional Expressions I

OLAP Systems and Multidimensional Expressions I OLAP Systems and Multidimensional Expressions I Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master

More information

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,

More information

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT

THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT THE OPEN UNIVERSITY OF TANZANIA FACULTY OF SCIENCE TECHNOLOGY AND ENVIRONMENTAL STUDIES BACHELOR OF SIENCE IN DATA MANAGEMENT ODM 106.DATABASE CONCEPTS COURSE OUTLINE 1.0 Introduction This introductory

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

Patterns of Information Management

Patterns of Information Management PATTERNS OF MANAGEMENT Patterns of Information Management Making the right choices for your organization s information Summary of Patterns Mandy Chessell and Harald Smith Copyright 2011, 2012 by Mandy

More information

PeopleSoft EPM 9.1 Customer Relationship Management Warehouse PeopleBook

PeopleSoft EPM 9.1 Customer Relationship Management Warehouse PeopleBook PeopleSoft EPM 9.1 Customer Relationship Management Warehouse PeopleBook April 2012 PeopleSoft EPM 9.1 Customer Relationship Management Warehouse PeopleBook SKU epm91pcrwb0412 Copyright 1988, 2012, Oracle

More information

Presented by: Jose Chinchilla, MCITP

Presented by: Jose Chinchilla, MCITP Presented by: Jose Chinchilla, MCITP Jose Chinchilla MCITP: Database Administrator, SQL Server 2008 MCITP: Business Intelligence SQL Server 2008 Customers & Partners Current Positions: President, Agile

More information

DIMENSIONAL MODELLING

DIMENSIONAL MODELLING ASSIGNMENT 1 TO BE COMPLETED INDIVIDUALLY DIMENSIONAL MODELLING Describe and analyse the dimensional modelling (DM) design feature allocated to you. (The allocation of a design feature to a student will

More information

Business Intelligence : a primer

Business Intelligence : a primer Business Intelligence : a primer Rev April 2012 - Gianmario Motta [email protected] Introduction & overview The paradigm of BI systems Platforms Appendix Review questions Introduction & overview Business

More information

14. Data Warehousing & Data Mining

14. Data Warehousing & Data Mining 14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,

More information

6NF CONCEPTUAL MODELS AND DATA WAREHOUSING 2.0

6NF CONCEPTUAL MODELS AND DATA WAREHOUSING 2.0 ABSTRACT 6NF CONCEPTUAL MODELS AND DATA WAREHOUSING 2.0 Curtis Knowles Georgia Southern University [email protected] Sixth Normal Form (6NF) is a term used in relational database theory by Christopher

More information

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi [email protected]

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi kishorejaladi@yahoo.com Data Warehousing: Data Models and OLAP operations By Kishore Jaladi [email protected] Topics Covered 1. Understanding the term Data Warehousing 2. Three-tier Decision Support Systems 3. Approaches

More information

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach 2006 ISMA Conference 1 Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach Priya Lobo CFPS Satyam Computer Services Ltd. 69, Railway Parallel Road, Kumarapark West, Bangalore 560020,

More information

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,

More information

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES MUHAMMAD KHALEEL (0912125) SZABIST KARACHI CAMPUS Abstract. Data warehouse and online analytical processing (OLAP) both are core component for decision

More information