Dimensional Data Modeling for the Data Warehouse



Similar documents
COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days

The Data Warehouse ETL Toolkit

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Microsoft Data Warehouse in Depth

Kimball Dimensional Modeling Techniques

Designing a Dimensional Model

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance

MICROSOFT DATA WAREHOUSE IN DEPTH

DATA WAREHOUSE / BUSINESS

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

Multidimensional Modeling - Stocks

DATA WAREHOUSING AND OLAP TECHNOLOGY

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

Lection 3-4 WAREHOUSING

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

BI4Dynamics provides rich business intelligence capabilities to companies of all sizes and industries. From the first day on you can analyse your

Building a Data Warehouse

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components

Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University

MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus

Presented by: Jose Chinchilla, MCITP

The Data Webhouse. Toolkit. Building the Web-Enabled Data Warehouse WILEY COMPUTER PUBLISHING

Analysis Services Step by Step

BUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT

PeopleSoft EPM 9.1 Customer Relationship Management Warehouse PeopleBook

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

DATA WAREHOUSE BUSINESS INTELLIGENCE FOR MICROSOFT DYNAMICS NAV

Data Warehouse: Introduction

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

PeopleSoft EPM 9.1 Supply Chain Management Warehouse PeopleBook

How To Write A Diagram

Dimensional modeling for CRM Applications

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

The Data Warehouse Toolkit

Data warehouse design

Business Intelligence, Data warehousing Concept and artifacts

SAP BusinessObjects Accounts Receivable Rapid Mart XI 3.2, version for SAP solutions - User Guide

Analytics: Pharma Analytics (Siebel 7.8) Student Guide

Data Warehousing and Data Mining

Data Warehouse Overview. Srini Rengarajan

Oracle Database 11g: Data Warehousing Fundamentals

Fluency With Information Technology CSE100/IMT100

IBML. Data Modeling Techniques for Data Warehousing. Chuck Ballard, Dirk Herreman, Don Schau, Rhonda Bell, Eunsaeng Kim, Ann Valencic

When to consider OLAP?

SQL Server Analysis Services Complete Practical & Real-time Training

The Data Model Resource Book Revised Edition Volume 2

DIMENSIONAL MODELLING

1. Dimensional Data Design - Data Mart Life Cycle

Establish and maintain Center of Excellence (CoE) around Data Architecture

Sterling Business Intelligence

Data Warehouse (DW) Maturity Assessment Questionnaire

Upon successful completion of this course, a student will meet the following outcomes:

LEARNING SOLUTIONS website milner.com/learning phone

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Data Warehousing. Outline. From OLTP to the Data Warehouse. Overview of data warehousing Dimensional Modeling Online Analytical Processing

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

Part 22. Data Warehousing

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Why Business Intelligence

Design and Implementation

Understanding Data Warehousing. [by Alex Kriegel]

Multidimensional Modeling - Order Management

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

A Multidimensional Design for the Genesis Data Set

Business Intelligence: Effective Decision Making

Customer-Centric Data Warehouse, design issues. Announcements

Data Warehousing, OLAP, and Data Mining

SQL Server 2012 End-to-End Business Intelligence Workshop

Brief Contents. Part Three: Decisions and Strategies. Part One: Information Technology Infrastructure. Part Four: Organizing Businesses and Systems

Data Warehouse Design

IST722 Data Warehousing

Chapter 7 Multidimensional Data Modeling (MDDM)

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Structure of the presentation

Microsoft Implementing Data Models and Reports with Microsoft SQL Server

Oracle Data Integrator: Administration and Development

Data Modeling Master Class Steve Hoberman s Best Practices Approach to Developing a Competency in Data Modeling

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

BUSINESS ANALYSIS FDR INTELLIGENCE

SAS BI Course Content; Introduction to DWH / BI Concepts

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

Dimensional Modeling 101. Presented by: Michael Davis CEO OmegaSoft,LLC

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Class News. Basic Elements of the Data Warehouse" 1/22/13. CSPP 53017: Data Warehousing Winter 2013" Lecture 2" Svetlozar Nestorov" "

Transcription:

Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Dimensional Data Modeling for the Data Warehouse Prerequisites Students should have at least some experience with any relational database management system. Who Should Attend This course is targeted at technical staff, team leaders, project managers, and users who need to understand how to design a data warehouse using multi-dimensional data modeling techniques Course Description This course provides students with the skills necessary to design a successful data warehouse using multi-dimensional data modeling techniques. The course also discusses additional industry-wide best practices concerning Dimensional Modeling and Star Schemas. The class uses the free CA Erwin Community Edition data modeling software. Course Topics Dimensional Modeling Primer Retail Sales Inventory Procurement Order Management Customer Relationship Management Accounting Human Resources Management Financial Services Telecommunications And Utilities Transportation Education Health Care Electronic Commerce Insurance Building The Data Warehouse

I. DIMENSIONAL MODELING PRIMER A. Operational Systems (OLTP) B. Analytical Processing (OLAP) C. Data Warehousing Requirements D. Data Warehousing Team Responsibilities E. Data Warehousing Components 1. Operational Source Systems 2. Data Staging Area a. Extraction b. Transformation i. Cleaning ii. Conforming c. Loading 3. Data Presentation Area 4. Data Access Tooling F. Data Warehouse Terminology 1. Star Schema 2. On-Line Analytical Processing (OLAP) 3. Cubes G. What Is Metadata? 1. Staging Meta Data 2. DBMS Meta Data 3. Data Access Tooling Meta Data H. What Is a Fact? I. What Is a Dimension? J. Dimensional Modeling Myths K. Avoiding Common Pitfalls II. RETAIL SALES A. Retail Sales Case Study B. The Kimball 4 Step Design Process 1. Select the Business Process 2. Declare the Granularity of the Fact Table 3. Choose the Dimensions 4. Identify the Facts C. Non-Additive Facts D. Date Dimension E. Product Dimension F. Sales Amount and Quantity By Department Report G. What Is a Drill Down? H. What Is a Roll Up? I. Location (Geographical or Store) Dimension J. Promotion Dimension K. What Is a Factless Fact Table? L. What Is a Degenerate Dimension?

M. Star Schema Extensibility N. What Is Snow Flaking? O. Too Few or Too Many Dimensions P. Surrogate Keys Versus Natural Keys III. IV. INVENTORY A. Inventory Periodic Snapshot Fact Table B. Semi-Additive Facts C. Enhanced Inventory Facts D. Inventory Transaction Fact Table E. Inventory Accumulating Snapshot Fact Table F. Shared Common Dimensions G. The Data Warehouse Bus Matrix H. What Are Conformed Dimensions? I. What Are Conforming Roll-Up Dimensions? J. What Are Conforming Dimension Subsets? K. What Are Conformed Facts? PROCUREMENT A. Procurement Case Study B. Procurement Transaction-Grained Star Schema C. Multiple Versus Single Procurement Transaction-Grained Fact Tables D. One Fact Table Per Major Feeder System E. What Are Slowly Changing Dimensions? F. Type 1 Slowly Changing Dimension Strategy G. Type 2 Slowly Changing Dimension Strategy H. Type 3 Slowly Changing Dimension Strategy I. Type 4 Slowly Changing Dimension Strategy J. Predictable Changes with Multiple Version Overlay Strategy K. Unpredictable Changes with Single Version Overlay Strategy V. ORDER MANAGEMENT A. Order Management Bus Matrix B. Order Product Transaction Fact C. Role-Playing Dimensions (Using Views on Dimensions) D. Common Product Dimensions E. Product Master to Dimension Mapping F. Customer Ship-To and Bill-To Dimension G. An Outrigger Address Dimension H. Deal Dimension I. Order Number Degenerate Dimension J. Junk Dimension K. Designing for Multiple Currencies L. Allocating Facts to Lower Granularities M. Shipment Invoice Line Item Fact N. Order Fulfillment Pipeline

O. Order Fulfillment Accumulating Snapshot Fact P. Designing for Multiple Units of Measure Q. Three Types of Fact Table Comparisons R. Introduction to Real-Time Data Warehouse Design VI. VII. CUSTOMER RELATIONSHIP MANAGEMENT A. Customer Relationship Management Overview B. Operational and Analytic CRM C. Buy or Build Your CRM System D. Customer Dimension E. Name and Address Quality Issues F. Customer Dimension Columns G. Date Dimension Outrigger H. Customer Segmentation Columns I. Adding a County Outrigger Dimension J. Designing Dimension Outriggers K. Large and Rapidly Changing Customer Dimension L. Tuning Rapidly Changing Dimensions M. Mini-Dimension Characteristics N. Customer Mini-Dimension ERD O. Variable-Width Dimension Columns P. Fixed-Depth Customer Hierarchies Q. Variable-Depth Customer Hierarchies R. Parent Subsidiary Company Hierarchies S. Parent Subsidiary Bridge Table T. Parent Subsidiary ERD U. Parent Subsidiary Revenue Select Statements ACCOUNTING A. Finance and Accounting Data Warehouses B. General Ledger Periodic Snapshot C. General Ledger Periodic Snapshot Schema D. General Ledger Journal Entry Transaction Fact E. Annual Budget Schema F. Budget Variance Schema G. OLAP Analytic Solutions

VIII. IX. HUMAN RESOURCES MANAGEMENT A. Employee Transaction Schema B. Employee Transaction Dimension C. Employee Periodic Snapshot Schema D. Audit Dimension E. Employee Skill Outrigger Dimension F. Employee Skill Outrigger Select Statement G. Another Employee Skill Outrigger Solution H. Another Employee Skill Outrigger Select Statement I. A Third Employee Skill Outrigger Solution J. Analyzing Survey Data with Star Schemas FINANCIAL SERVICES A. Banking Data Warehouse Requirements B. Monthly Account Balance Periodic Snapshot C. Dimension Table Check List D. Account Household Periodic Snapshot E. The Fact Table as an Associative Entity F. Customer Account Associative Bridge G. More On Mini-Dimension Outriggers H. Arbitrary Banding Range Technique I. How to Design a Band Definition Table J. How to Track Point-In-Time Balances K. Account Balance as of a Given Date Select Statement L. Context-Dependent Outriggers or Dimension Super-Type and Sub-Types X. TELECOMMUNICATIONS AND UTILITIES A. Telecommunications Data Warehouse Bus Matrix B. How to Perform a Star Schema Design Review C. Version 1 Customer Billing Star Schema D. Problems with the Version 1 Customer Billing Star Schema E. Version 2 Customer Billing Star Schema F. Problems with the Version 2 Customer Billing Star Schema G. Version 3 Customer Billing Star Schema H. Geographic Location Dimension I. Geographic Information Systems J. Analyzing Geo-Spatial Data

XI. XII. XIII. TRANSPORTATION A. Airline Frequent-Flyer Case Study B. Multiple Fact Granularities C. Segment-Level Flight Activity D. Segment and Trip-Level Granularity in One Fact Table E. Cargo Shipper Schema F. Travel Services Hotel Schema G. Combining Small Dimensions Into One H. Country-Specific Calendar Outriggers I. Time of Day as a Dimension or a Fact? J. Designing for Multiple Time Zones EDUCATION A. Admissions Tracking Accumulating Snapshot B. Accumulating Snapshot Design for Moving Forward Through a Series of Events C. Accumulating Snapshot Design for Events that Happen Out of Sequence D. Accumulating Snapshot Design for Tracking Events that Back Up E. Horizontal Versus Vertical Event Tracking F. Factless Fact Table for Student Registration G. Factless Fact Table for Facilities Utilization H. Factless Fact Table for Tracking Events that May Not Happen I. Factless Fact Table for Tracking Weighted Events J. Factless Promotion Coverage Fact Table K. Promoted Products that Did Not Sell Select Statement HEALTH CARE A. What Is a Value Circle and Why Do Designers Need to Know About It? B. Discovering Dimensions in the Value Circle C. Health Care Billing Accumulating Snapshot D. Designing the Date Dimension to Track Events that Have Not Happened Yet E. Designing the Date Dimension for Events that Have Been Skipped F. Dropping the Granularity of Accumulating Snapshots to Avoid the Multi- Valued Problem G. Three Techniques for Eliminating Multiple Values within a Fact Table Column H. Turning a Billing Fact into a Profitability Fact I. Handling Complex Events with an Accumulating Snapshot J. Mining Free-Form Comments into Dimension Constraints and Facts K. Handling Facts with Sparse Data Values L. Handling Late Arriving Historical Dimension Data

XIV. XV. ELECTRONIC COMMERCE A. What Is a Uniform Resource Locator (URL)? B. Client Server Interactions On the Internet C. Click Stream Characteristics D. Identifying Web-Site Visitor Origin E. Identifying the Session F. Identifying the Visitor G. Proxy Web Server Challenges H. Click Stream Dimensions I. Click Stream Session Tracking J. Click Stream Web Page Tracking K. Aggregated Click Stream Facts L. Tracking E-Commerce Profitability INSURANCE A. Insurance Value Chain B. Insurance Bus Matrix C. Insurance Policy Transaction Schema D. Automobile Transaction Schema E. Policy Premium Periodic Snapshot Schema F. Multi-Valued Policy Holder Insured Driver Bridge G. Keeping Your Bus Matrix Up To Date H. Tracking Policy Claims I. Policy Claims Transactions J. Policy Claims Transaction Schema K. Policy Claims Periodic Snapshot L. Policy Claims Accumulating Snapshot M. Consolidating Policy Premium and Claim Amounts N. Factless Fact Table Accident Events O. Dimensional Modeling Design Mistakes

XVI. BUILDING THE DATA WAREHOUSE A. Data Warehouse Life Cycle Road Map B. DW Project Planning and Management C. DW Project Scoping D. DW Project Justification E. DW Project Business Staffing F. DW Project Business or IT Staffing G. DW Project IT Staffing H. DW Requirements Planning I. Collecting DW Requirements J. DW Effective Interviewing Techniques K. DW Interview Wrap-up L. DW Post Interview Documentation M. DW Requirements Prioritization and Consensus N. DW Business Impact and Feasibility Prioritization of Requirements in Quadrants O. DW Technical Architecture in 8 Steps P. DW Tool Suite Selection and Installation Q. DW Star Schema Physical Design Considerations R. DW Star Schema Aggregation Strategies S. DW Star Schema Indexing Strategies T. DW Data Staging Physical Design Considerations U. DW Dimension Table Staging Considerations V. DW Master Dimension Cross-Referencing Strategies W. DW Fact Table Staging X. DW Analytics Specification Y. DW Deployment Z. DW Maintenance and Growth AA. Ten Common DW Design Mistakes to Avoid

XVII. WORKSHOPS [The students do various workshops depending on how many days the course is taught.] A. Use the instructor supplied order and shipment star schema tables to do various drill downs and roll ups. [We use MS Access since it is readily available in most IT environments, not that you would ever implement a star schema with Access.] B. Use Ralph Kimball s 4 step methodology to begin to develop a star schema for a business process identified at each student team. Each team identifies a fact table, defines the granularity of the fact table, chooses dimensions, and defines additive, semi-additive, or non-additive numeric facts. Each team may do the same or different star schemas as other teams. Each team writes up their star schema requirements with MS Word and implements it with the data modeling software chosen for the class. Each team presents their star schema to the class. C. Each team modifies their star schemas based upon the comments that the class has given them. D. The class is shown how to do a drill across using the instructor supplied order and shipment star schemas developed in Microsoft Access. The students do their own drill across in Access. E. If the students bring real star schemas to class, the class may do a design review on the selected star schemas.