COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design



Similar documents
Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance

Dimensional Data Modeling for the Data Warehouse

Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling

Designing a Dimensional Model

Kimball Dimensional Modeling Techniques

Data Warehouse Overview. Srini Rengarajan

B. 3 essay questions. Samples of potential questions are available in part IV. This list is not exhaustive it is just a sample.

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

SAS BI Course Content; Introduction to DWH / BI Concepts

The Data Warehouse ETL Toolkit

When to consider OLAP?

Business Intelligence, Data warehousing Concept and artifacts

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server

Data Modeling Master Class Steve Hoberman s Best Practices Approach to Developing a Competency in Data Modeling

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

Data Warehouse Design

East Asia Network Sdn Bhd

DIMENSIONAL MODELLING

Implementing a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

How To Write A Diagram

Data Warehousing Systems: Foundations and Architectures

Building an Effective Data Warehouse Architecture James Serra

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Data Warehouse: Introduction

Chapter 5. Learning Objectives. DW Development and ETL

Implementing a Data Warehouse with Microsoft SQL Server 2012

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Data Warehousing, OLAP, and Data Mining

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

DATA WAREHOUSE / BUSINESS

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

LEARNING SOLUTIONS website milner.com/learning phone

SAS Business Intelligence Online Training

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Reflections on Agile DW by a Business Analytics Practitioner. Werner Engelen Principal Business Analytics Architect

OLAP Theory-English version

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

DATA WAREHOUSING AND OLAP TECHNOLOGY

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

70-467: Designing Business Intelligence Solutions with Microsoft SQL Server

Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University

Course Outline. Module 1: Introduction to Data Warehousing

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Implementing a Data Warehouse with Microsoft SQL Server 2012

Modeling: Operational, Data Warehousing & Data Marts

Unlock your data for fast insights: dimensionless modeling with in-memory column store. By Vadim Orlov

Understanding Data Warehousing. [by Alex Kriegel]

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Data Warehouse (DW) Maturity Assessment Questionnaire

Fluency With Information Technology CSE100/IMT100

Data Integration and ETL with Oracle Warehouse Builder NEW

Data Testing on Business Intelligence & Data Warehouse Projects

MDM and Data Warehousing Complement Each Other

MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server

Data Vault and The Truth about the Enterprise Data Warehouse

Implementing a Data Warehouse with Microsoft SQL Server

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

SQL Server 2012 Business Intelligence Boot Camp

Data Warehousing. SQL Server 2008 R2, Denali

Data Integration and ETL with Oracle Warehouse Builder: Part 1

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

Improving your Data Warehouse s IQ

Analysis Services Step by Step

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Class News. Basic Elements of the Data Warehouse" 1/22/13. CSPP 53017: Data Warehousing Winter 2013" Lecture 2" Svetlozar Nestorov" "

Requirements are elicited from users and represented either informally by means of proper glossaries or formally (e.g., by means of goal-oriented

Data warehouse design

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

The IBM Cognos Platform

Deductive Data Warehouses and Aggregate (Derived) Tables

Key organizational factors in data warehouse architecture selection

Establish and maintain Center of Excellence (CoE) around Data Architecture

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

M6P. TDWI Data Warehouse Automation: Better, Faster, Cheaper You Can Have It All. Mark Peco

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

IBML. Data Modeling Techniques for Data Warehousing. Chuck Ballard, Dirk Herreman, Don Schau, Rhonda Bell, Eunsaeng Kim, Ann Valencic

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

Data Warehouse and Business Intelligence Testing: Challenges, Best Practices & the Solution

A Survey on Data Warehouse Architecture

AV-005: Administering and Implementing a Data Warehouse with SQL Server 2014

Multidimensional Modeling - Stocks

Microsoft Data Warehouse in Depth

SQL Server Administrator Introduction - 3 Days Objectives

A Service-oriented Architecture for Business Intelligence

Dimensional Modeling for Data Warehouse

Oracle BI 11g R1: Build Repositories

Optimizing Your Data Warehouse Design for Superior Performance

Transcription:

COURSE OUTLINE Track 1 Advanced Data Modeling, Analysis and Design TDWI Advanced Data Modeling Techniques Module One Data Modeling Concepts Data Models in Context Zachman Framework Overview Levels of Data Models Enterprise Perspective Levels of Data Models Project Perspective Entity Relationship Diagram Overview Entity Relationship Diagram and Its Components Normalized and Dimensional Models Standards Normalization Normalization to Third Normal Form Higher Normal Form Module Two Business Data Model Development Model Components Entities Entities (Continued) Supertype and Subtype Relationships Attributes Business Data Model Development Approaches Top-Down Bottom-Up Generic Models Limited Depth Model Business Stakeholder Data Modeling Roles Business Stakeholder Data Steward Subject Matter Expert Business Analyst Data Analyst / Data Modeler Business Data Model Application Basis for System Data Model Transformation and Integration Foundation Data Profiling Package Selection Business Communications Special Considerations Recursive Relationships State and Status in ERD Diagramming Options Metadata Tool Exploitation

Module Three System and Physical Data Model Development Data Modeling Roles Data Analyst / Data Modeler Database Administrator Business Analyst Developer Application Implications Model Differences Denormalization Overview Time Dependencies History Optimization Indexing Horizontal Partitioning Vertical Partitioning Special Considerations Surrogate Keys Columnar Databases Point-in-Time vs. Over-Time Models Data Warehouse Load Implications Metadata Module Four Additional Concepts Complementary Models State Transition Model Function Models Process Models Use Case Model Model Management Model Validation and Testing Model Synchronization Tools Data Modeling Tools Repositories Module Five Summary and Conclusions Critical Success Factors Common Mistakes Appendix A Bibliography and References Appendix B s 1: Standards 2: Normalization to Third Normal Form 3: Normalization to Higher Normal Forms 4: Subject Areas 5: Entity-Level Business Data Model 6: Attribute-Level Business Data Model 7: Model Application for Data Profiling 8: Application System Model Development 9: History Implications 10: Indexing 11: Model Synchronization

Advanced Dimensional Modeling Techniques for Practitioners Module 1 Review and Architecture Guiding principles Review dimensional modeling basics Star schema tables and characteristics Surrogate keys vs. natural keys Additivity and non-additivity Slowly changing dimensions type 1, type 2 Three architectures that include dimensional models Inmon / Imhoff Corporate Information Factory Kimball Dimensional Data Warehouse Stand-alone data marts Common components of all solutions Incorporating aggregates and OLAP Quiz: Test your design skills Critique a star schema design Module 2 Multiple Fact Tables Beyond a single fact table design Challenge: facts with differing grain or periodicity Pitfalls of single fact table design Designing for multiple fact tables Querying multiple fact tables Why a single query returns wrong results The concept of drilling across Conformance Ensuring subject areas work together Enabling incremental implementation Schema design exercise: Design a multiple fact table solution based on provided scenario Module 3 Advanced Fact Table Design Snapshots Challenge: determining status from transaction history The snapshot solution Semi-additive facts Use of snapshot vs. transaction tables Implications on density and slowly changing dimensions Accumulating snapshots

Challenge: studying fixed process milestones The accumulating snapshot Performing lag analysis Implications for ETL process (fact table is repeatedly touched ) Heterogeneous Dimensions Challenge: heterogeneous attributes within a dimension Implications for single fact table Building custom fact tables for homogenous subsets The view alternative Factless fact tables Challenge: nothing to measure but dimensions The factless fact table Using the factless fact table (counts) Coverage tables for tracking things that did not happen Schema design exercise: Employ advanced fact table techniques to design a solution supporting provided business scenario Module 4 ETL Processing Processing sequence Initial load Useful for prototyping Dimension processing Fact processing and lookups Incremental load Real world loads Processing slow changes Fact processing Designing for ETL Design elements that support ETL processing Changed data identification Hashing strategies and other optimizations Dealing with Bad data Cleaning up data What not to clean up When automated cleansing fails Paper exercise: filling in warehouse tables

Given a schema design and source data dumps, what should end result of load process look like? Module 5 Advanced Dimension Table Design Dimension reuse Roles Querying with roles Outriggers Understanding basic dimensional hierarchies Drilling without hierarchies Hierarchies Multiple hierarchies in a single dimension table Why to model hierarchies Snowflakes Mini-dimensions Challenge: large and expanding dimensions The use of a mini-dimension Loading the mini-dimension Transaction dimensions Challenge: point in time analysis of dimensional data Where the type 2 change breaks down A transaction dimension Identifying status at a specific point in time Using in relation to a fact table More Slow Change Techniques Type 3 Slow changes Hybrid Slow Changes : Schema design exercise: Employ advanced dimension techniques in support of the supplied scenario Module 6 Many to Many Relationships and Bridge Tables Dimension Bridge Tables Challenge: a multi-valued dimension Why flattening does not solve the problem The dimension bridge table Using the bridge: avoid double counting Variations: allocation Hiding the bridge Attribute Bridge Tables Challenge: a multi-valued attribute The attribute bridge table Use of the bridge Hierarchy bridge Table

Challenge: the organizational hierarchy dilemma Capturing all parent/subsidiary relationships: the bridge table Using the bridge to analyze data under an organization Using the bridge to analyze data above an organization Working without the bridge Design exercise: Modify a supplied schema design to support a business case that involves many-to-many relationships Module 7 Designing for Scalability and Performance Dispelling common myths Exploration warehouse or mart in Corporate Information Factory Standalone data mart Dimensional data warehouse Scaling in Scope Conformance across subject areas Conformed dimensions that are not identical Tools for planning conformance Scaling for Performance: Aggregate schemas Principles of aggregation Single table approach and drawbacks Aggregate fact tables Invisible aggregates Scaling for Performance: Derived Schemas Merged fact tables Pivoted fact table Set operations on fact tables Sliced fact tables Conformance quiz Develop a conformance matrix based on supplied scenario Module 8 The Project Designing and building a dimensional schema Early data load Value of iterative Design Activities and deliverables for typical project stages Project initiation

Project definition & control of scope Architectural issues Roles Strategy and Initial Design Requirements analysis Source system analysis Dimensional design Project strategy Documenting requirements and design Design and Build Initial load and feedback Design review and modification Controlling scope Developing the load Developing information products Test and deploy Testing procedures Finalization and documentation Transitioning to production Training Maintenance Load maintenance Maintaining a BI Competency center