Building a Data Warehouse



Similar documents
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

SQL Server 2012 Business Intelligence Boot Camp

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

SQL Server 2012 End-to-End Business Intelligence Workshop

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

East Asia Network Sdn Bhd

SAS BI Course Content; Introduction to DWH / BI Concepts

LEARNING SOLUTIONS website milner.com/learning phone

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Microsoft Data Warehouse in Depth

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

Implementing a Data Warehouse with Microsoft SQL Server 2014

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Course Outline. Module 1: Introduction to Data Warehousing

Implementing a SQL Data Warehouse 2016

AV-005: Administering and Implementing a Data Warehouse with SQL Server 2014

Data Warehouse Overview. Srini Rengarajan

Implementing Data Models and Reports with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server 2012

MDM and Data Warehousing Complement Each Other

Implementing a Data Warehouse with Microsoft SQL Server

Course: SAS BI(business intelligence) and DI(Data integration)training - Training Duration: 30 + Days. Take Away:

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server 2012

MS 50511A The Microsoft Business Intelligence 2010 Stack

Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778

Implementing a Data Warehouse with Microsoft SQL Server 2012

Online Courses. Version 9 Comprehensive Series. What's New Series

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Microsoft SQL Business Intelligence Boot Camp

Microsoft Implementing Data Models and Reports with Microsoft SQL Server

Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services

COURSE SYLLABUS COURSE TITLE:

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Designing Business Intelligence Solutions with Microsoft SQL Server 2012

SQL Server Administrator Introduction - 3 Days Objectives

End to End Microsoft BI with SQL 2008 R2 and SharePoint 2010

James Serra Data Warehouse/BI/MDM Architect JamesSerra.com

IBM Cognos Training: Course Brochure. Simpson Associates: SERVICE associates.co.uk

Data Integration Checklist

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

Structure of the presentation

Data Warehousing Fundamentals for IT Professionals. 2nd Edition

Business Intelligence (BI) Data Store Project Discussion / Draft Outline for Requirements Document

Data Integration and ETL with Oracle Warehouse Builder NEW

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

Data Warehouse (DW) Maturity Assessment Questionnaire

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Designing Self-Service Business Intelligence and Big Data Solutions

OLAP Theory-English version

Analysis Services Step by Step

For Sales Kathy Hall

Reduce and manage operating costs and improve efficiency. Support better business decisions based on availability of real-time information

SAS Business Intelligence Online Training

Implementing Data Models and Reports with Microsoft SQL Server

POLAR IT SERVICES. Business Intelligence Project Methodology

Course Syllabus. Maintaining a Microsoft SQL Server 2005 Database. At Course Completion

70-467: Designing Business Intelligence Solutions with Microsoft SQL Server

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

ASYST Intelligence South Africa A Decision Inc. Company

ETL-EXTRACT, TRANSFORM & LOAD TESTING

Breadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne

Microsoft Business Intelligence

Building Views and Charts in Requests Introduction to Answers views and charts Creating and editing charts Performing common view tasks

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days

Fluency With Information Technology CSE100/IMT100

The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led

Analytics: Pharma Analytics (Siebel 7.8) Student Guide

SAP BODS - BUSINESS OBJECTS DATA SERVICES 4.0 amron

Master Data Management and Data Warehousing. Zahra Mansoori

Customer Insight Appliance. Enabling retailers to understand and serve their customer

IST722 Data Warehousing

Building Cubes and Analyzing Data using Oracle OLAP 11g

The Data Warehouse ETL Toolkit

Management Accountants and IT Professionals providing Better Information = BI = Business Intelligence. Peter Simons peter.simons@cimaglobal.

How to Enhance Traditional BI Architecture to Leverage Big Data

Foundations of Business Intelligence: Databases and Information Management

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Business Intelligence: Effective Decision Making

Retail POS Data Analytics Using MS Bi Tools. Business Intelligence White Paper

The IBM Cognos Platform

Would-be system and database administrators. PREREQUISITES: At least 6 months experience with a Windows operating system.

SQL SERVER TRAINING CURRICULUM

Managing Data in Motion

MICROSOFT DATA WAREHOUSE IN DEPTH

Exploring Oracle BI Apps: How it Works and What I Get NZOUG. March 2013

SQL Server Integration Services Design Patterns

BENEFITS OF AUTOMATING DATA WAREHOUSING

Business Intelligence Tutorial

Transcription:

Building a Data Warehouse With Examples in SQL Server EiD Vincent Rainardi BROCHSCHULE LIECHTENSTEIN Bibliothek Apress

Contents About the Author. ; xiij Preface xv ^CHAPTER 1 Introduction to Data Warehousing 1 What Is a Data Warehouse?....^ 1 Retrieves Data 4 Consolidates Data 5 Periodically 6 Dimensional Data Store 7 Normalized Data Store 8 History 10 Query n Business Intelligence 12 Other Analytical Activities 14 Updated in Batches ; 15 Other Definitions 16 Data Warehousing Today 17 Business Intelligence 17 Customer Relationship Management 18 Data Mining 19 Master Data Management (MDM) 20 Customer Data Integration, 23 Future Trends in Data Warehousing 24 Unstructured Data 24 Search 25 Service-Oriented Architecture (SOA) 26 Real-Time Data Warehouse 27 Summary 27 vii

VIII 1C0NTENTS 1CHAPTER 2 Data Warehouse Architecture 29 Data Flow Architecture 29 Single DDS : 33 NDS + DDS 35 ^ODS + DDS 38 Federated Data Warehouse 39 System Architecture 42 Case Study 44 Summary, ; 47 CHAPTER 3 Data Warehouse Development Methodology 49 Waterfall Methodology 49 Iterative Methodology 54 Summary 59 CHAPTER 4 Functional and Nonfunctional Requirements 61 Identifying Business Areas 61 Understanding Business Operations 62 Defining Functional Requirements 63 Defining Nonfunctional Requirements 65 Conducting a Data Feasibility Study 67 Summary 70 KHAPTER 5 Data Modeling 71 Designing the Dimensional Data Store 71 Dimension Tables 76 Date Dimension 77 Slowly Changing Dimension 80 Product, Customer, and Store Dimensions 83 Subscription Sales Data Mart 89 Supplier Performance Data Mart 94 CRM Data Marts^. 96 Data Hierarchy 101 Source System Mapping 102 Designing the Normalized Data Store 106 Summary 111

ICONTENTS...CHAPTER 6 Physical Database Design 113 Hardware Platform.., 113 Storage Considerations 120 Configuring Databases 123 Creating DDS Database Structure 128 Creating the Normalized Data Store 139 Using Views 157 Summary Tables 161 Partitioning 162 Indexes 166 Summary ^ 171 SCHAPTER 7 Data Extraction 173 Introduction to ETL 173 ETL Approaches and Architecture 174 General Considerations 177 Extracting Relational Databases 180 Whole Table Every Time 180 Incremental Extract 181 Fixed Range 185 Related Tables 186 Testing Data Leaks 187 Extracting File Systems 187 Extracting Other Source Types 190 Extracting Data Using SSIS 191 Memorizing the Last Extraction Timestamp 200 Extracting from Files 208 Summary 214 CHAPTER 8 Populating the Data Warehouse 215 Stage Loading 216 Data Firewall 218 Populating NDS 219 Using SSIS to Populate NDS '. 228 Upsert Using SQL and Lookup 235 Normalization ; 242 Practical Tips on SSIS 249

ICONTENTS Populating DDS Dimension Tables 250 Populating DDS Fact Tables 266 Batches, Mini-batches, and Near Real-Time ETL 269 Pushing the Data In 270 Summary 271 ICHAPTER 9 Assuring Data Quality 273 Data Quality Process 274 Data Cleansing and Matching 277 Cross-checking with External Sources 290 Data Quality Rules!"' 291 Action: Reject, Allow, Fix 293 Logging and Auditing 296. Data Quality Reports and Notifications 298 Summary 300 ICHAPTER 10 Metadata 301 Metadata in Data Warehousing 301 Data Definition and Mapping Metadata 303 Data Structure Metadata 308 Source System Metadata 313 ETL Process Metadata 318 Data Quality Metadata 320 Audit Metadata 323 Usage Metadata... 324 Maintaining Metadata 325 Summary 327 1CHAPTER 11 Building Reports 329 Data Warehouse Reports 329 When to Use Reports and When Not to Use Them 332 Report Wizard 334 Report Layout. 340 Report Parameters 342 Grouping, Sorting, and Filtering 351 Simplicity ; 356 Spreadsheets. : 357 Multidimensional Database Reports, 362. Deploying Reports 366

ICONTENTS Managing Reports,,'..-.- 370 Managing Report Security 370 Managing Report Subscriptions 372 Managing Report Execution 374 Summary 375 [CHAPTER 12 Multidimensional Database 377 What a Multidimensional Database Is 377 Online Analytical Processing 380 Creating a Multidimensional Database 381 Processing a Multidimensional Database.388 Querying a Multidimensional Database... t 394 Administering a Multidimensional Database 396 Multidimensional Database Security, 397 Processing Cubes, 399 Backup and Restore 405 Summary 409 ICHAPTER 13 Using Data Warehouse for Business Intelligence 411 Business Intelligence Reports 412 Business Intelligence Analytics 413 Business Intelligence Data Mining 416 Business Intelligence Dashboards,'. 432 Business Intelligence Alerts 437 Business Intelligence Portal 438 Summary 439 CHAPTER 14 Using Data Warehouse for Customer Relationship Management.441 Single Customer View 442 Campaign Segmentation 447 Permission Management 450 Delivery and Response Data 454 Customer Analysis.: 460 Customer Support 463 Personalization 464 Customer Loyalty Scheme 465 Summary " 466

xii CONTENTS ICHAPTER 15 Other Data Warehouse Usage 467 Customer Data Integration 467 Unstructured Data 470 Search in Data Warehousing 474 Summary 476 ICHAPTER 16 Testing Your Data Warehouse 477 Data Warehouse ETL Testing, 478 Functional Testing 480 Performance Testing * 482 Security Testing 485 User Acceptance Testing 486 End-to-End Testing 487 Migrating to Production 487 Summary 489 ICHAPTER 17 Data Warehouse Administration 491 Monitoring Data Warehouse ETL 492 Monitoring Data Quality 495 Managing Security 498 Managing Databases 499 Making Schema Changes 501 Updating Applications 503 Summary 503 APPENDIX Normalization Rules 505 IINDEX 509