Reflections on Agile DW by a Business Analytics Practitioner. Werner Engelen Principal Business Analytics Architect

Similar documents
Data Vault at work. Does Data Vault fulfill its promise? GDF SUEZ Energie Nederland

The Role of the BI Competency Center in Maximizing Organizational Performance

OLAP Theory-English version

Establish and maintain Center of Excellence (CoE) around Data Architecture

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

Data Vault and The Truth about the Enterprise Data Warehouse

Agile BI With SQL Server 2012

ACCESS INTELLIGENCE. an intelligent step beyond Access Management. White Paper

IST722 Data Warehousing

Data Warehouse Overview. Srini Rengarajan

Data Warehouse (DW) Maturity Assessment Questionnaire

POLAR IT SERVICES. Business Intelligence Project Methodology

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Understanding Data Warehousing. [by Alex Kriegel]

Who Doesn t Want to be Agile? By: Steve Dine President, Datasource Consulting, LLC 7/10/2008

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Building an Effective Data Warehouse Architecture James Serra

A Service-oriented Architecture for Business Intelligence

Agile Testing of Business Intelligence. Cinderella 2.0

Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007

White Paper February IBM Cognos Supply Chain Analytics

Welcome to online seminar on. Oracle Agile PLM BI. Presented by: Rapidflow Apps Inc. January, 2011

Microsoft Data Warehouse in Depth

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Lection 3-4 WAREHOUSING

Improving your Data Warehouse s IQ

White Paper

Business Intelligence

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

The Benefits of Data Modeling in Data Warehousing

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Master Data Management and Data Warehousing. Zahra Mansoori

MDM and Data Warehousing Complement Each Other

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE

Data warehouse and Business Intelligence Collateral

Ten Cornerstones of a Modern Data Warehouse Environment

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Building a Custom Data Warehouse

From Agile by Design. Full book available for purchase here.

Tiber Solutions. The DNA of a Successful Business Intelligence Effort. Jim Hadley

Data Management Roadmap

Making Business Intelligence Easy. White Paper Agile Business Intelligence

Budgeting and Planning with Microsoft Excel and Oracle OLAP

Oracle Daily Business Intelligence. PDF created with pdffactory trial version

Implementation of Big Data and Analytics Projects with Big Data Discovery and BICS March 2015

SAS Business Intelligence Online Training

Deploying Governed Data Discovery to Centralized and Decentralized Teams. Why Tableau and QlikView fall short

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

SCM & Agile Business Intelligence. Anja Cielen

Presented By: Leah R. Smith, PMP. Ju ly, 2 011

Integrating Netezza into your existing IT landscape

LEARNING SOLUTIONS website milner.com/learning phone

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Business Intelligence and Healthcare

Extensibility of Oracle BI Applications

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Data Warehousing Systems: Foundations and Architectures

Agile Business Intelligence Data Lake Architecture

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

BI Dashboards the Agile Way

Request for Information Page 1 of 9 Data Management Applications & Services

Business Intelligence

Creating a Business Intelligence Competency Center to Accelerate Healthcare Performance Improvement

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

QlikView Business Discovery Platform. Algol Consulting Srl

Atomate Development Process. Quick Guide

From Business Models to BI Models. Lawrence Corr

Agile BI The Future of BI

Best Practices for Deploying Managed Self-Service Analytics and Why Tableau and QlikView Fall Short

DMM301 Benefits and Patterns of a Logical Data Warehouse with SAP BW on SAP HANA

How To Be Successful At Business Intelligence

A Knowledge Management Framework Using Business Intelligence Solutions

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

Dimodelo Solutions Data Warehousing and Business Intelligence Concepts

Semantic Data Modeling: The Key to Re-usable Data

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

Business Intelligence Project Management 101

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE

SAS BI Course Content; Introduction to DWH / BI Concepts

Melissa Coates. Tools & Techniques for Implementing Corporate and Self-Service BI. Triad SQL BI User Group 6/25/2013. BI Architect, Intellinet

If you re serious about Business Intelligence, you need a BI Competency Centre

Sterling Business Intelligence

Data Warehouse: Introduction

Agile Software Development

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

Implementing Oracle BI Applications during an ERP Upgrade

Transcription:

Reflections on Agile DW by a Business Analytics Practitioner Werner Engelen Principal Business Analytics Architect

Introduction Werner Engelen Active in BI & DW since 1998 + 6 years at element61 Previously: Oracle, PwC Consulting & IBM Global Business Services Proven track record in dimensional modeling, data quality, setup BICCs, project methodologies, project management, quality assurance, business analysis & ETL design But you can also talk to me about photography, urban exploration & landscape design

Just jump & swim?

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Agile BI? Waterfall BI Requirements Design Code Test Rather than doing all of one thing at a time... agile BI teams do a little of everything all the time Agile BI

Agile in a nutshell Sprint retrospective

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

All we want is... a dimensional model SALES DATE CUSTOMER REVENUE STORE PRODUCT PRO - MOTION

How do we ask questions? WHAT? WHEN? WHAT? WHO? HOW MANY? How do this month s sales by sales rep of nonfood products which we promoted to consumers in Japan compare with previous years? WHERE? WHY? WHEN? WHO?

BI model canvas Need for a common questions framework WHEN When does it happen? date, time period, timeline... WHERE Where does it happen? Where does it refer to? location, store, facility... HOW How does it happen? How do we know it happened? How do we uniquely define an event? transaction type, transaction identifier... HOW MANY How many/much is involved? How long does it take? revenues, costs, quanities, durations... WHY Why does it happen? cause, reason, promotion... WHO Who does what? Who else is involved? Who is organizated how? customer, employee, supplier, sales rep... WHAT What is involved? What is the value proposition? product, service, resource...

Link business questions to design Product backlog Data model & Source to Target design

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Governance? The scope of a BICC is # 100% of all BI related applications But: still a minimal insight & governance is required Each BI application can be defined within a certain category Define degree of governance by BICC for each category Mandatory deliverables? (at a certain point in time a departmental BI application might be promoted to a corporate BI application) How to a approach a BI project (requirements...) Framework, standards, guidelines Naming conventions Tools set... 0% 100% Special-purpose BI applications Departmental BI applications Cross-functional / cross-departmental BI applications Corporate BI applications

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Possible architectural issues? Modify fact to lower granularity Modify leading sources Modify definitions Add dimensions Add fields Add history Modify functionality (transactional accumulated snapshot)... inflexible architecture & data model Costs & time go up

Data model / architecture anticipation 3-tier architecture Get the data (extract) source, landing zone, staging area... Store the data (register) data warehouse EDW, Data Vault, ODS, Kimball 1st level, Kimball granular, 3NF... Present the data data mart Kimball (combination 1st & 2nd level), cubes... IN KEEP ALL RELEVANT OUT SOURCE ORIENTED TARGET ORIENTED

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

ETL & modeling de-composition Breadth or depth? Split-up ETL & modeling in smaller pieces Minimize ETL and modeling in early iterations De-composition helps in planning activities De-composition supports early feedback Breadth Simplified load of the most important dimension models Early feedback, earlier build of dependent systems Depth Complete load of one dimension at a time Early deployment of complete usable sub systems

Start with a small thingy? FROM TO Divide dimension tables no history (current view only) include history Divide rows Group records by type Divide rows Subset of data (e.g. Customer: consumer, business) all types 10 % of data n % 100% Divide by columns columns from source 1 columns from source 2 all columns Data quality include only non-outliers include outliers ETL complexity simpler / earlier tasks complex tasks ETL refresh frequency one time load incremental load (monthly daily) ETL transformations (raw) data directly aggregations and/or business rules applied ETL target layer ETL degree of automation Subject area completeness source directly Manual most important star, dimension, attributes in a dimension staging presentation BI tools (semantic layer) fully automated all data model elements

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Agile BI = data intensive Traditional BI Proven answers to known questions High-value reporting specifies drives need for. new. adjusted BI content for. better Data discovery Functional data connection Early access to data Fast answers to new questions Short-term reporting Source of requirements Helps in prioritizing Data profiling Data quality insight Identify & confront with issues asap Source of requirements

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Keep your consumers close by Keep your data providers even closer

When is BI impacted? releases abstraction layers screens retention... direct & indirect DML Table 1 Column A - PK Column B relation - ship Table 4 Column I - PK Column J relationships current & historical Table 3 Column E - PK Column F - UK Table 2 Column C - PK Column D - FK interface Table 4 Column G - PK Column H - FK data source database 1 source database2... processes quality

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Automation Get your act together before things start of Don t try certain things for the first time Have a near perfect way & means of working Keep it simple Automate the simple / repetitive things Metadata driven generation Focus on time consuming: e.g. source analysis, ETL & testing (unit, regression ) Re-use Develop best practices & reuse (think big, start small) Focus on the more difficult processes E.g. gathering good requirements, complex dimensional models, business rules Welcome change, but Is your architecture fit enough? (which layers) Are your tools fit enough?

Offer some (re-usable) food for thought Data discovery Governance Sources ETL Architecture Model Automate

Divide & conquer: De-composition is the key

Thank you Werner Engelen Principal Business Analytics Architect