Data Vault at work. Does Data Vault fulfill its promise? GDF SUEZ Energie Nederland

Similar documents
Reflections on Agile DW by a Business Analytics Practitioner. Werner Engelen Principal Business Analytics Architect

POLAR IT SERVICES. Business Intelligence Project Methodology

Data Vault and The Truth about the Enterprise Data Warehouse

Data Warehouse Overview. Srini Rengarajan

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling

Modeling: Operational, Data Warehousing & Data Marts

IST722 Data Warehousing

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Implementing a SQL Data Warehouse 2016

Course Outline. Module 1: Introduction to Data Warehousing

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Agile Business Intelligence Data Lake Architecture

Business Analytics For All

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE

Chapter 5. Learning Objectives. DW Development and ETL

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server 2012

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Implementing a Data Warehouse with Microsoft SQL Server 2012

Agile Testing of Business Intelligence. Cinderella 2.0

Implementing a Data Warehouse with Microsoft SQL Server 2012

Business Intelligence for Financial Services: A Case Study

For Sales Kathy Hall

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

Implementing a Data Warehouse with Microsoft SQL Server 2012

BIG DATA & the Data Warehouse

SAS BI Course Content; Introduction to DWH / BI Concepts

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

BUSINESSOBJECTS DATA INTEGRATOR

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

Building an Effective Data Warehouse Architecture James Serra

D83167 Oracle Data Integrator 12c: Integration and Administration

Master Data Management and Data Warehousing. Zahra Mansoori

Intelligent BI Testing. Key to Reliable Information. Data to Impact.

Establish and maintain Center of Excellence (CoE) around Data Architecture

Lection 3-4 WAREHOUSING

Whitepaper. Data Warehouse/BI Testing Offering. Published on: January 2010 Author: Sena Periasamy

BUSINESSOBJECTS DATA INTEGRATOR

IBM InfoSphere Discovery: The Power of Smarter Data Discovery

East Asia Network Sdn Bhd

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Table of Contents Chapter 1 - Getting Started with Oracle Data Relationship Management (DRM) 1

Oracle Data Integrator 12c: Integration and Administration

SimCorp Solution Guide

Service Oriented Data Management

Traditional BI vs. Business Data Lake A comparison

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server

Welcome to online seminar on. Oracle Agile PLM BI. Presented by: Rapidflow Apps Inc. January, 2011

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Improving your Data Warehouse s IQ

USG DATA WAREHOUSE REDESIGN

High-Volume Data Warehousing in Centerprise. Product Datasheet

70-467: Designing Business Intelligence Solutions with Microsoft SQL Server

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Data Warehousing & Business Intelligence

IBM WebSphere DataStage Online training from Yes-M Systems

Implementing a Data Warehouse with Microsoft SQL Server

SQL Server 2012 Business Intelligence Boot Camp

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Welcome To Today s Webinar: Dynamics Insights SM for Microsoft Dynamics AX

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

<Insert Picture Here> Evaluating the Risk & Rewards of Effective Integration of Systems & Technology within a Financial Institution

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

90% of your Big Data problem isn t Big Data.

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

INFORMATION TECHNOLOGY STANDARD

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Introduction to Data Vault Modeling

How to Enhance Traditional BI Architecture to Leverage Big Data

Understanding Data Warehousing. [by Alex Kriegel]

Datalynx Project Delivery Methodology and PCTM Methodology For Legacy Data Cleansing & Migration

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Data Vault + Data Virtualization = Double Flexibility

Unified Data Integration Across Big Data Platforms

Original Research Articles

White Paper. Unified Data Integration Across Big Data Platforms

Agile Enterprise Data Warehousing Radical idea or practical concept?

Extraction Transformation Loading ETL Get data out of sources and load into the DW

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

CMG s approach to application managed services on SAS data warehouses. Jan Willem Sijthoff

Automated Business Intelligence

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Reverse Engineering in Data Integration Software

Implementing a Data Warehouse with Microsoft SQL Server 2014

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

DMM301 Benefits and Patterns of a Logical Data Warehouse with SAP BW on SAP HANA

GoldenGate and ODI - A Perfect Match for Real-Time Data Warehousing

TRANSFORMING YOUR BUSINESS

Oracle Data Integrator Overview and Demo. Maria Mundrova ETL Developer/DW Consultant

Mission-Driven Big Data

Nothing in this job description restricts management's right to assign or reassign duties and responsibilities to this job at any time.

Business Intelligence at the University of Minnesota

Transcription:

Data Vault at work Does Data Vault fulfill its promise?

Leading player on Dutch energy market Approximately 1,000 employees Production capacity: 3,813 MW 20% of the total Dutch electricity production capacity 07-06-2013 2

GDF SUEZ Portfolio Management 07-06-2013 3

GDF SUEZ Portfolio Management 07-06-2013 4

Direct Cause Before: BO Universe directly on source systems Error in the conversion of gas to electricity detected in one of the reports Further investigation revealed that this error occurred in every report Fixing this error took several months -> unacceptable to the business 07-06-2013 5

Requirements Validation of data quality Single point of definition for Business Rules Insight in calculations and origin of information Less dependency on IT Frequently changing source systems Data Vault Short load times, quick responses Robust, documented and well-managed 07-06-2013 6

DWH4GSPM Architecture Metadata Source Systems Staging Area Data Layer Information Layer User Access Layer 07-06-2013 7

DWH4GSPM Tools & Techniques Data Vault for Data Layer Star Schemas for Data Marts Project approach: SCRUM Reporting tool: Business Objects BOXI ETL Tool: Business Objects Data Services (BODS) Metadata: Business Objects Metadata Manager Database: Oracle 10g 07-06-2013 8

The promise of Data Vault The Data Vault is the optimal choice for modeling the EDW in the DW 2.0 framework" Bill Inmon Source data is stored unaltered (1:1 with source system) No integration No cleansing Changes in source systems do not lead to changes in existing Data Vault tables Only new tables or new rows When should Data Vault be used? When accountability and traceability are important (legal obligations, audit by accountant) When source systems change often When Business Logic (formulas, enrichment, integration, etc.) changes often 07-06-2013 9

Data Vault: From ETL to ELT Extract Transform Load Staging Area EDW 3NF Data Mart Data Mart Extract Staging Area Load Data Vault Transform Data Mart Data Mart 07-06-2013 10

Does Data Vault live up to its promises? Source data is stored unaltered Content Structure Changes in source systems do not lead to changes in existing Data Vault tables True, but it does lead to a more complex transformation from Data Vault to Data Mart No change in total design and development effort, from Source System to Reporting tool. When should Data Vault be used? When accountability and traceability are important (legal obligations, audit by accountant) True, if change in structure is acceptable When source systems change often Data Vault only brings small advantages When Business Logic (formulas, enrichment, integration, etc.) changes often True, no data has been lost due to former Business Logic Not true, more complex transformations from Data Vault tor Data Mart 07-06-2013 11

Issues during project Refactoring Incremental approach for Star Schemas does not work well Leads to complex transformations from Data Vault to Data Mart And frequently changing BO Universe Load times Solution: Data Vault encourages parallel loads Low understanding of SCRUM by business and project team members No out-of-the-box support for automated testing Test scripts could not keep up with frequently changing ETL Data Vault is not yet supported by automated test tooling 07-06-2013 12

Business Evaluation Response times are good Load takes too long Design of Star Schemas optimized for reporting Do not support analysis well Change of source takes too much time 07-06-2013 13

Is the traditional DWH architecture getting obsolete? Metadata Near-Realtime Zone Source Systems Staging Area Data Layer Intermediate Layer Information Layer User Access Layer Agile Development Zone Data gathering Information delivery 07-06-2013 14

What will be the impact of recent developments? 07-06-2013 15