Chapter 7 Multidimensional Data Modeling (MDDM)



Similar documents
Data W a Ware r house house and and OLAP Week 5 1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Module 1: Introduction to Data Warehousing and OLAP

Part 22. Data Warehousing

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

14. Data Warehousing & Data Mining

Designing a Dimensional Model

SAS BI Course Content; Introduction to DWH / BI Concepts

IST722 Data Warehousing

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing

Unlock your data for fast insights: dimensionless modeling with in-memory column store. By Vadim Orlov

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi

Sterling Business Intelligence

Chapter 3, Data Warehouse and OLAP Operations

Course Code CE609. Lecture : 03. Practical : 01. Course Credit. Tutorial : 00. Total : 04. Course Learning Outcomes

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Dimensional Data Modeling for the Data Warehouse

OLAP Theory-English version

Presented by: Jose Chinchilla, MCITP

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

Design tip #156 containing the drill- across macro:

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

LEARNING SOLUTIONS website milner.com/learning phone

DATA WAREHOUSING - OLAP

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

Overview of Data Warehousing and OLAP

Understanding Data Warehousing. [by Alex Kriegel]

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

CHAPTER : 5. MODELS & HYBRID METHODS

CHAPTER 3. Data Warehouses and OLAP

Overview. Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Data Warehousing. An Example: The Store (e.g.

Data Warehousing Concepts

Dimensional Modeling for Data Warehouse

Basics of Dimensional Modeling

Designing Business Intelligence Solutions with Microsoft SQL Server 2012

CHAPTER 5: BUSINESS ANALYTICS

Turkish Journal of Engineering, Science and Technology

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

Microsoft Data Warehouse in Depth

Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition

DIMENSIONAL MODELLING

Designing Self-Service Business Intelligence and Big Data Solutions

Data Warehousing. SQL Server 2008 R2, Denali

MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Dimodelo Solutions Data Warehousing and Business Intelligence Concepts

A Multidimensional Design for the Genesis Data Set

When to consider OLAP?

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

SENG 520, Experience with a high-level programming language. (304) , Jeff.Edgell@comcast.net

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option

8902 How to Generate Universes from SAP Sybase PowerDesigner. Revision:

CHAPTER 4: BUSINESS ANALYTICS

DATA WAREHOUSING AND OLAP TECHNOLOGY

Data Warehousing and Data Mining

The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija,

Business Intelligence. 1. Introduction September, 2013.

Data Warehouse - Basic Concepts

Optimizing Your Data Warehouse Design for Superior Performance

Building Scalable & High Performance Datamarts with MySQL

8. Business Intelligence Reference Architectures and Patterns

TRANSFORMING YOUR BUSINESS

Data warehousing. Han, J. and M. Kamber. Data Mining: Concepts and Techniques Morgan Kaufmann.

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

SAS Business Intelligence Online Training

DATA CUBES E Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

Business Intelligence, Data warehousing Concept and artifacts

University of Gaziantep, Department of Business Administration

Course Design Document. IS417: Data Warehousing and Business Analytics

OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

A Design and implementation of a data warehouse for research administration universities

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

Data Warehousing Systems: Foundations and Architectures

Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Kimball Dimensional Modeling Techniques

Oracle OLAP 11g and Oracle Essbase

SAP BUSINESS OBJECTS BO BI 4.1 amron

Data Warehousing, OLAP, and Data Mining

Fluency With Information Technology CSE100/IMT100

Analysis Services Step by Step

Dimensional Modeling and E-R Modeling In. Joseph M. Firestone, Ph.D. White Paper No. Eight. June 22, 1998

SQL Server 2012 Business Intelligence Boot Camp

Data Warehousing and OLAP Technology

Dimensional Modeling 101. Presented by: Michael Davis CEO OmegaSoft,LLC

PowerDesigner WarehouseArchitect The Model for Data Warehousing Solutions. A Technical Whitepaper from Sybase, Inc.

SAP BO Course Details

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

Data Warehouse. MIT-652 Data Mining Applications. Thimaporn Phetkaew. School of Informatics, Walailak University. MIT-652: DM 2: Data Warehouse 1

Transcription:

Chapter 7 Multidimensional Data Modeling (MDDM) Fundamentals of Business Analytics

Learning Objectives and Learning Outcomes Learning Objectives 1. To assess the capabilities of OLTP and OLAP systems 2. To understand Dimensional Modeling (DM) 3. To learn data warehousing in further detail through a case-study Learning Outcomes (a) Appreciate the differences between OLTP and OLAP systems (b) Understanding of DM, basics of data warehousing and related terminology; also an overview of what facts and dimensions are (c) Understand how to convert an OLTP schema into a dimensional schema model through various techniques of data warehousing

Session Plan Lecture time : 90 minutes approx. Q/A : 15 minutes

Agenda Database Concepts Recap Introduction to On-line Analytical Processing (OLAP) Multidimensional Data Modeling (MDDM) To Answer Why? Where? When? and How?

Recap Databases and Tables Normalization and Keys ACID Properties Primary, Foreign and Surrogate Keys Cardinality Transactions On-Line Transaction Processing On-Line Analytical Processing

Recap (contd.) Difference between OLTP and OLAP

Comparison of OLTP and DSS OLTP Capability Examples DSS Capability Examples Search & locate student(s) Print student scores Filter students above 90% marks Update student Grade Group by Batch and compute average score Find top 10 high performance students Which courses have productivity impact on-the-job? Which colleges need to be rewarded for supplying students with consistent high on-the-job performance? What is the customer satisfaction improvement due to extended training? How project level profitability is influenced by certification? How much training is needed on future technologies for non-linear growth in BI? Fundamentals of Business Analytics

Still a little fuzzy about what OLAP can do? Fundamentals of Business Analytics

Introduction to On-line Analytical Processing (OLAP) Scenario: Internal systems department at Infosys maintains all relevant data in a database. Conceptual schema is as shown below.

OLAP Contd. CEO of the company wants the following information from the IS department. Number of employees added in the role of the company during the last quarter/6 months/1 year Q1. How many table(s) is/are required?

OLAP Contd. Q2. How many employees are currently in the projects? Q3. How many are on bench?

OLAP Contd. Q4. Which customer from a region has given maximum business during the previous quarter on a domain under specific technology and who are the PMs of the project and assets owned by them.

SOLUTION? Fundamentals of Business Analytics

Answer a Quick Question So if there were very few updates and more of data-retrieval queries being made on your database, what do you think would be a better schema to adopt? OLTP or OLAP

Introduction to Dimensional Modeling (DM) DM is a logical design technique used in Data Warehouses (DW). It is quite directly related to OLAP systems DM is a design technique for databases intended to support end-user queries in a DW It is oriented around understandability, as opposed to database administration

However, before we actually jump into MDDM let s first understand the language of Dimensional Modeling

MDDM Terminology Grain Fact Dimension Cube Star Snowflake

Of hierarchies and levels Fundamentals of Business Analytics

What Is a Grain? Identifying the grain also means deciding the level of detail that will be made available in the dimensional model Granularity is defined as the detailed level of information stored in a table The more the detail, the lower is the level of granularity The lesser the detail, higher is the level of granularity

What can be measured, can be controlled and do you know how such measurements are stored in a data warehouse?

Facts and Fact Tables Consists of at least two or more foreign keys Generally has huge numbers of records Useful facts tend to be numeric and additive Fundamentals of Business Analytics

Fact Types Additive Facts Semi Additive Non Additive Factless Fact Fundamentals of Business Analytics

And what about descriptive data? Fundamentals of Business Analytics

What Are Dimensions/Dimension Tables? The dimensional tables contain attributes (descriptive) which are typically static values containing textual data or discrete numbers which behave as text values. Main functionalities : Query filtering\constraining Query result set labeling Fundamentals of Business Analytics

Types of Dimensions Rapidly Changing Dimension Degenerate Dimension Junk (garbage) Dimension Slowly Changing Dimension Dimension Type Role-playing Dimension

Understanding Dimension Cube An extension to the two-dimensional Table. For example in the previous scenario CEO wants a report on revenue generated by different services across regions during each quarter Fundamentals of Business Analytics

Understanding Dimension Cube (contd.) Dimension Hierarchy Grain Fact N.America Europe Testing Consulting Production Support Asia Pacific Q1 Q2 Q3 Q4 Fundamentals of Business Analytics

Answer a Quick Question Can a data warehouse schema take different forms with respect to normalization?

Star Schema The basic star schema contains four components. These are: Fact table, Dimension tables, Attributes and Dimension hierarchies Fundamentals of Business Analytics

Snow Flake Schema Normalization and expansion of the dimension tables in a star schema result in the implementation of a snowflake design. A dimension table is said to be snow flaked when the low-cardinality attributes in the dimension have been removed to separate normalized tables and these normalized tables are then joined back into the original dimension table.

Snow Flaking Example Consider the Normalized form of Region dimension RegionID Region Country Code State Code City Code Country Country Code Country Name State Code City Code State Decreases performance because more tables State code will need to be joined to satisfy queries State Name City Code City City Code City Name ZIP

Armed with these weapons that we call Concepts, let s step into the battlefield! Fundamentals of Business Analytics

Case Study Conversion of a ER Model to a Dimensional Model Fundamentals of Business Analytics

CUSTOMER customer_id (PK) customer_name credit_profile purchase_profile address STORE store_id (PK) store_name floor_type address district CLERK clerk_id (PK) clerk_name clerk_grade ER Diagram ORDER order_num (PK) customer_id (FK) store_id (FK) clerk_id (FK) date PRODUCT SKU (PK) Description category brand ORDER-LINE order_num (PK) (FK) SKU (PK) (FK) promotion_key (FK) dollars_cost dollars_sold units_sold PROMOTION promotion_num (PK) promotion_name ad_type price_type

TIME time_key (PK) SQL_date day_of_week month STORE store_key (PK) store_id store_name floor_type address district CLERK clerk_key (PK) clerk_id clerk_name clerk_grade DIMENSONAL MODEL FACT time_key (FK) customer_key (FK) store_key (FK) clerk_key (FK) product_key (FK) promotion_key (FK) dollars_cost dollars_sold units_sold PRODUCT product_key (PK) SKU category description brand CUSTOMER customer_key (PK) customer_name credit_profile purchase_profile address PROMOTION promotion_key (PK) promotion_name ad_type price_type

Summary Basics of Database OLTP MDDM Cube Star Schema Snowflake schema

Food for Thought! 1. Who according to you would be the user of OLAP? 2. Who would need the Multidimensional perspective of data?