Data Vault Modeling The Next Generation DW Approach

Size: px
Start display at page:

Download "Data Vault Modeling The Next Generation DW Approach"

Transcription

1 Data Vault Modeling The Next Generation DW Approach Presented by: Siva Govindarajan 1

2 Data Vault Modeling Agenda Introduction Data Vault Place in Evolution Data Warehousing Architecture Data Vault Components Case Study Typical 3NF / Star Schema Data Vault Approach Temporal Data + Auditability Implementation of Data Vault Conclusion 2

3 Introduction Data Vault concept originally conceived by Dan Linstedt Enterprise Data Warehouse Modeling approach. Hybrid approach - 3NF & Star Schema Adaptable to changes Auditable Timed right with Technology 3

4 Data Vault Place in Evolution 1960s - Codd, Date et. al Normal Forms 1980s - Normal Forms adapted to DWs Star Schema for OLAP 1990s - Data Vault concept developed Dan Linstedt Data Vault concept published by Dan Linstedt 4

5 Data Warehousing Architecture Bill Inmon s Data Warehouse Architecture OLTP Systems Staging & ODS Enterprise DW Staging (optional) Data Mart Data Mart Data Marts Data Marts Adaptive Marts Kimball s Data Warehouse Bus Architecture Data Mart Data Mart OLTP Systems Staging Data Mart Data Mart 5

6 Data Vault Components Hub Entities Candidate Keys + Load Time + Source Link Entities FKs from Hub + Load Time + Source Satellite Entities Descriptive Data + Load Time + Source + End Time Dimension = Hub+ Satellite Fact = Satellite + Link [+ Hub ] 6

7 Case Study Modeling Approaches Typical 3NF / Star Schema Data Vault approach Scenario Typical Fact and Dimension New Dimension data Reference Data change - 1:M to M:M Additional attributes to Dimension 7

8 Typical 3 F / Star Schema 1. Typical Fact and Dimension TR ORDER tr_order_id dim_product_id system order id system root order id global root order id order quantity total executed qty event timestamp DIM PRODUCT dim_product_id product_id product_code product desc product_name le ngth 8

9 Typical 3 F / Star Schema 2. ew Dimension Data TR ORDER tr_order_id dim_product_id system order id system root order id global root order id order quantity total executed qty event timestamp DIM PRODUCT dim_product_id product_id product_code product desc product_name product_category_id product_category_name product_category_code product_category_desc is_strategy REF PRODUCT CAT EGORY product_category_id product_category_name product_category_code product_category_desc is_strategy 9

10 Typical 3 F / Star Schema 2. ew Dimension Data TR ORDER tr_order_id dim_product_id system order id system root order id global root order id order quantity total executed qty event timestamp d im_p ro d uct_ca te g o ry_id DIM PRODUCT dim_product_id product_id product_code product desc product_name le ng th DIM PRODUCT CAT EGORY d im_p ro d uct_ca te g o ry_id product_category_code product_category_name product_category_desc is_strategy namespace REF PRODUCT CAT EGORY product_category_id product_category_name product_category_code product_category_desc is_strategy 10

11 Typical 3 F / Star Schema 3. Reference 1:M to M:M Change TR ORDER tr_order_id dim_product_id system order id system root order id global root order id order quantity total executed qty event timestamp d im_p ro d uct_ca te g o ry_id order owner firm id? DIM PRODUCT dim_product_id product_id product_code product desc product_name le ng th DIM PRODUCT CAT EGORY d im_p ro d uct_ca te g o ry_id product_category_code product_category_name product_category_desc is_strategy namespace? REF PRODUCT CAT EGORY product_category_id product_category_name product_category_code product_category_d esc is_strategy 11

12 Typical 3 F / Star Schema 4. Additional attributes to Dimension TR ORDER tr_order_id dim_product_id system order id system root order id global root order id order quantity total executed qty event timestamp dim_p rod uct_ca teg ory_id order owner firm id DIM PRODUCT dim_product_id product_id product_code product desc product_name le ng th he ig ht we ig ht p a cka ging co lo r size 12

13 Data Vault Approach 1. Typical Hub, Satellite and Link (Replacing Fact/Dimension) LNK_ORDER lnk_o rd e r_id lnk_lo a d _d a te lnk_re co rd _so urce hub _p ro d uct_id hub _o rd e r_id HUB_PRODUCT hub_p ro duct_id hub_re co rd _so urce hub_lo a d _d a te product_id product_code HUB_ORDER hub _o rd e r_id hub _reco rd_so urce hub _lo a d _da te tr_order_id system order id system root order id global root order id SAT _PRODUCT 01 sa t_pro d uct01_id hub _p ro d uct_id sa t_lo a d _d a te sa t_end _da te sa t_re co rd _so urce product desc product_name symbol SAT ORDER01 sa t_o rd er01_id hub _o rd e r_id sa t_lo a d _d a te sa t_e nd_d a te sa t_re co rd _so urce order quantity total executed qty event timestamp 13

14 Data Vault Approach 2. ew Dimension Data LNK_ORDER lnk_order_id lnk_load_d ate lnk_reco rd_source hub_pro duct_id hub_ord er_id LNK_PRODUCT _X_CAT EGORY lnk_p rod uct_x_ca te gory_id lnk_loa d_date lnk_record _source hub_product_id hub_ca te gory_id HUB_PRODUCT hub_product_id hub_reco rd_source hub_load_d ate product_id product_code HUB_ORDER hub _o rde r_id hub _record_source hub _loa d_date tr_order_id system order id system root order id global root order id HUB_CAT EGORY hub_categ ory_id hub_record_source hub_lo ad_date product_category_id product_category_code SAT_PRODUCT01 sat_product01_id hub_pro duct_id sat_load_da te sat_end_date sat_re cord_so urce product desc product_name symbol SAT ORDER01 sat_o rd er01_id hub _o rde r_id sat_lo ad_date sat_e nd _d ate sat_record_source order quantity total executed qty event timestamp SAT _CAT EGORY01 sa t_category01_id hub_ca te gory_id sa t_load_da te sa t_end_date sa t_re co rd_source product_category_name product_category_desc is_strategy 14

15 LNK_ORDER lnk_order_id lnk_load_d ate lnk_reco rd_source hub_pro duct_id hub_ord er_id Data Vault Approach 3. Reference 1:M to M:M Change HUB_PRODUCT hub_product_id hub_reco rd_source hub_load_d ate product_id product_code HUB_ORDER hub _o rde r_id hub _record_source hub _loa d_date tr_order_id system order id system root order id global root order id SAT_PRODUCT01 sat_product01_id hub_pro duct_id sat_load_da te sat_end_date sat_re cord_so urce product desc product_name symbol SAT ORDER01 sat_o rd er01_id hub _o rde r_id sat_lo ad_date sat_e nd _d ate sat_record_source order quantity total executed qty event timestamp LNK_PRODUCT _X_CAT EGORY lnk_p rod uct_x_ca te gory_id lnk_loa d_date lnk_record _source hub_product_id hub_ca te gory_id HUB_CAT EGORY hub_categ ory_id hub_record_source hub_lo ad_date product_category_id product_category_code SAT _CAT EGORY01 sa t_category01_id hub_ca te gory_id sa t_load_da te sa t_end_date sa t_re co rd_source product_category_name product_category_desc is_strategy 15

16 Data Vault Approach 4. Additional attributes to Dimension SAT_PRODUCT01 sat_product01_id HUB_PRODUCT hub _p roduct_id hub _re co rd _source hub _lo ad_date product_id product_code hub _product_id sat_load_date sat_end _d ate sat_re cord_source product desc product_name SAT _PRODUCT 02 sat_pro duct02_id hub_product_id sat_load_d ate sat_end _date sat_record_so urce length height weight packaging color 16

17 Temporal Data + Auditability SAT _PRODUCT01 sa t_p ro duct01_id HUB_PRODUCT hub_prod uct_id sa t_loa d_da te sa t_e nd_d ate sa t_record_source product desc product_name hub_p ro duct_id hub_record_source hub_loa d_da te product_id product_code SAT _PRODUCT 02 sa t_p ro duct02_id hub_prod uct_id sa t_loa d_da te sa t_e nd_d ate sa t_record_source length height weight packaging color LNK_PRODUCT_X_CATEGORY lnk_prod uct_x_catego ry_id lnk_load_date lnk_record_source hub _product_id hub _catego ry_id LNK_ORDER lnk_o rd er_id lnk_load _date lnk_reco rd _source hub_orde r_id hub_prod uct_id hub_categ ory_id HUB_ORDER hub_orde r_id hub_reco rd _source hub_load _date tr_order_id system order id system root order id global root order id SAT ORDER01 sa t_o rd er01_id hub_orde r_id sa t_loa d_da te sa t_e nd_d ate sa t_record_source order quantity total executed qty event timestamp source_system SAT _CAT EGORY01 HUB_CATEGORY sat_catego ry01_id hub_cate gory_id hub_record_source hub_loa d_da te product_category_id product_category_code hub_cate gory_id sat_lo ad_d ate sat_end _date sat_record_source product_category_name product_category_desc is_strategy 17

18 Implementation of Data Vault Make it Simple View for Dimension View For Fact Data Loading Take advantage of Technology DW Appliances High-throughput storage devices RDBMS Features 18

19 Make it Simple Create views for Dimension From our Demo model, Join Product related Hubs and Satellites to build View for Product Dimension as shown below : HUB_PRODUCT SAT_PRODUCT01 SAT_PRODUCT02 19

20 Make it Simple Create Views For Fact Join Order related Hubs, Satellites and Links to build View for ORDER Fact using ORDER related Hubs/ Satellites and Links. HUB_ORDER SAT_ORDER01 LNK_ORDER 20

21 Data Loading Typically data loading for a Data Vault is in the following sequence. Hubs for Dimensions Links for Dimensions Satellites for Dimensions Hubs for Fact (if any ) Links for Fact Satellites for Fact Refer to Data Vault Series article 5 of Linstedt for further details 21

22 DW Appliances Next wave of IT Data Warehouse hardware solutions are the Data Warehouse Appliances. Take advantage of the technology where possible. Few DW Appliances in the market : Oracle Exadata Netezza Teradata RedBrick GreenPlum 22

23 High-throughput storage devices Utilize Storage devices designed for DW / OLAP applications. Following are some examples only and would change over time. Please do your research based on your sizing requirements. EMC CLARiiON Storage family Texas Memory systems RamSan Solid State devices Hitachi USP / VSP Storage family Other Hybrid solutions with High performance storage and Solid State devices 23

24 RDBMS Features Some Oracle Features catering DW / OLAP applications: Exadata Smart Scans OLAP Based Materialized views Partitioning Reference partition Advanced data compression Automatic degree of parallelism (ADOP). Star query optimization Oracle OLAP/DWH Features. 24

25 Conclusion In this presentation we have addressed high level overview of Data Vault Architecture. Topics discussed are Data Vault concept and architecture Data Vault Components such as Hubs, Satellites and Link tables Typical modeling challenges with traditional modeling approaches How those challenges could be handled using Data Vault Modeling Approach. Auditing and Temporal data capture using DV Approach. And finally, some implementation details If interested in learning more, try Dan Linstedt's special coaching area at: 25

26 Questions?? 26

27 References Data Vault Series articles by Dan E. Linstedt -- Referred few articles from Genesee Academy -- Articles and books by Ralph Kimball and Bill Inmon from multiple sources. Technical Documents from 27

28 Special Thanks to Dan Linstedt for review and corrections to the article. My colleague Mark Bruscke for his assistance in preparing the article. 28

29 The End Contact siva02@globytes.com 29

Business Analytics For All

Business Analytics For All Business Analytics For All Unlocking the Value within the Data Vault BA4All Insight Session April 29 th 2014 Guy Van der Sande Vincent Greslebin Fifthplay : Architecture Smart Homes Platform Data Warehouse

More information

Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling

Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling Trends in Data Warehouse Data Modeling: Data Vault and Anchor Modeling Thanks for Attending! Roland Bouman, Leiden the Netherlands MySQL AB, Sun, Strukton, Pentaho (1 nov) Web- and Business Intelligence

More information

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1 Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview

More information

Modeling: Operational, Data Warehousing & Data Marts

Modeling: Operational, Data Warehousing & Data Marts Course Description Modeling: Operational, Data Warehousing & Data Marts Operational DW DMs GENESEE ACADEMY, LLC 2013 Course Developed by: Hans Hultgren DATA MODELING IMMERSION Modeling: Operational, Data

More information

Main Memory Data Warehouses

Main Memory Data Warehouses Main Memory Data Warehouses Robert Wrembel Poznan University of Technology Institute of Computing Science Robert.Wrembel@cs.put.poznan.pl www.cs.put.poznan.pl/rwrembel Lecture outline Teradata Data Warehouse

More information

Understanding Data Warehousing. [by Alex Kriegel]

Understanding Data Warehousing. [by Alex Kriegel] Understanding Data Warehousing 2008 [by Alex Kriegel] Things to Discuss Who Needs a Data Warehouse? OLTP vs. Data Warehouse Business Intelligence Industrial Landscape Which Data Warehouse: Bill Inmon vs.

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

Data Vault and The Truth about the Enterprise Data Warehouse

Data Vault and The Truth about the Enterprise Data Warehouse Data Vault and The Truth about the Enterprise Data Warehouse Roelant Vos 04-05-2012 Brisbane, Australia Introduction More often than not, when discussion about data modeling and information architecture

More information

Data Warehouse Overview. Srini Rengarajan

Data Warehouse Overview. Srini Rengarajan Data Warehouse Overview Srini Rengarajan Please mute Your cell! Agenda Data Warehouse Architecture Approaches to build a Data Warehouse Top Down Approach Bottom Up Approach Best Practices Case Example

More information

Unlock your data for fast insights: dimensionless modeling with in-memory column store. By Vadim Orlov

Unlock your data for fast insights: dimensionless modeling with in-memory column store. By Vadim Orlov Unlock your data for fast insights: dimensionless modeling with in-memory column store By Vadim Orlov I. DIMENSIONAL MODEL Dimensional modeling (also known as star or snowflake schema) was pioneered by

More information

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann

Trivadis White Paper. Comparison of Data Modeling Methods for a Core Data Warehouse. Dani Schnider Adriano Martino Maren Eschermann Trivadis White Paper Comparison of Data Modeling Methods for a Core Data Warehouse Dani Schnider Adriano Martino Maren Eschermann June 2014 Table of Contents 1. Introduction... 3 2. Aspects of Data Warehouse

More information

Lection 3-4 WAREHOUSING

Lection 3-4 WAREHOUSING Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing

More information

Tiber Solutions. Understanding the Current & Future Landscape of BI and Data Storage. Jim Hadley

Tiber Solutions. Understanding the Current & Future Landscape of BI and Data Storage. Jim Hadley Tiber Solutions Understanding the Current & Future Landscape of BI and Data Storage Jim Hadley Tiber Solutions Founded in 2005 to provide Business Intelligence / Data Warehousing / Big Data thought leadership

More information

IST722 Data Warehousing

IST722 Data Warehousing IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF

More information

EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved.

EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved. EMC CUSTOMER UPDATE 31 mei 2011 Fort Voordorp Bart Sjerps Greenplum Data Warehouse 1 Introduction & Agenda What is Data warehousing? And what s Business Intelligence? Evolution in the Data Warehouse Business

More information

2009 Oracle Corporation 1

2009 Oracle Corporation 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

SQL Server PDW. Artur Vieira Premier Field Engineer

SQL Server PDW. Artur Vieira Premier Field Engineer SQL Server PDW Artur Vieira Premier Field Engineer Agenda 1 Introduction to MPP and PDW 2 PDW Architecture and Components 3 Data Structures 4 PDW Tools Data Load / Data Output / Administrative Console

More information

Introduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in

Introduction to Data Warehousing. Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Introduction to Data Warehousing Ms Swapnil Shrivastava swapnil@konark.ncst.ernet.in Necessity is the mother of invention Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai,

More information

Introduction to Data Vault Modeling

Introduction to Data Vault Modeling Introduction to Data Vault Modeling Compiled and Edited by Kent Graziano, Senior BI/DW Consultant Note: This article is substantially excerpted (with permission) from the book Super Charge Your Data Warehouse:

More information

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business

More information

Indexing Techniques for Data Warehouses Queries. Abstract

Indexing Techniques for Data Warehouses Queries. Abstract Indexing Techniques for Data Warehouses Queries Sirirut Vanichayobon Le Gruenwald The University of Oklahoma School of Computer Science Norman, OK, 739 sirirut@cs.ou.edu gruenwal@cs.ou.edu Abstract Recently,

More information

Big Data Technologies Compared June 2014

Big Data Technologies Compared June 2014 Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development

More information

Data Vault at work. Does Data Vault fulfill its promise? GDF SUEZ Energie Nederland

Data Vault at work. Does Data Vault fulfill its promise? GDF SUEZ Energie Nederland Data Vault at work Does Data Vault fulfill its promise? Leading player on Dutch energy market Approximately 1,000 employees Production capacity: 3,813 MW 20% of the total Dutch electricity production capacity

More information

Dimensional Modeling 101. Presented by: Michael Davis CEO OmegaSoft,LLC

Dimensional Modeling 101. Presented by: Michael Davis CEO OmegaSoft,LLC Dimensional Modeling 101 Presented by: Michael Davis CEO OmegaSoft,LLC Agenda Brief history of Database Design Dimension Modeling Terminology Case study overview 4 step Dimensional Modeling Process Additional

More information

Getting Value from Big Data with Analytics

Getting Value from Big Data with Analytics Getting Value from Big Data with Analytics Edward Roske, CEO Oracle ACE Director info@interrel.com BLOG: LookSmarter.blogspot.com WEBSITE: www.interrel.com TWITTER: Eroske About interrel Reigning Oracle

More information

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Systems: Foundations and Architectures Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository

More information

Dimensional Modeling and E-R Modeling In. Joseph M. Firestone, Ph.D. White Paper No. Eight. June 22, 1998

Dimensional Modeling and E-R Modeling In. Joseph M. Firestone, Ph.D. White Paper No. Eight. June 22, 1998 1 of 9 5/24/02 3:47 PM Dimensional Modeling and E-R Modeling In The Data Warehouse By Joseph M. Firestone, Ph.D. White Paper No. Eight June 22, 1998 Introduction Dimensional Modeling (DM) is a favorite

More information

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

Data Search. Searching and Finding information in Unstructured and Structured Data Sources 1 Data Search Searching and Finding information in Unstructured and Structured Data Sources Erik Fransen Senior Business Consultant 11.00-12.00 P.M. November, 3 IRM UK, DW/BI 2009, London Centennium BI

More information

Data Vault Modeling in a Day

Data Vault Modeling in a Day Course Description Data Vault Modeling in a Day GENESEE ACADEMY, LLC 2013 Course Developed by: Hans Hultgren DATA VAULT DAY Data Vault Modeling in a Day Overview Data Vault modeling is quickly becoming

More information

Master Data Management and Data Warehousing. Zahra Mansoori

Master Data Management and Data Warehousing. Zahra Mansoori Master Data Management and Data Warehousing Zahra Mansoori 1 1. Preference 2 IT landscape growth IT landscapes have grown into complex arrays of different systems, applications, and technologies over the

More information

SAP Real-time Data Platform. April 2013

SAP Real-time Data Platform. April 2013 SAP Real-time Data Platform April 2013 Agenda Introduction SAP Real Time Data Platform Overview SAP Sybase ASE SAP Sybase IQ SAP EIM Questions and Answers 2012 SAP AG. All rights reserved. 2 Introduction

More information

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach 2006 ISMA Conference 1 Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach Priya Lobo CFPS Satyam Computer Services Ltd. 69, Railway Parallel Road, Kumarapark West, Bangalore 560020,

More information

IBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop

IBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop IBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop Frank C. Fillmore, Jr. The Fillmore Group, Inc. Session Code: E13 Wed, May 06, 2015 (02:15 PM - 03:15 PM) Platform: Cross-platform Objectives

More information

<Insert Picture Here> Oracle Exadata Database Machine Overview

<Insert Picture Here> Oracle Exadata Database Machine Overview Oracle Exadata Database Machine Overview Exadata Database Machine Best Platform to Run the Oracle Database Best Machine for Data Warehousing Best Machine for OLTP Best Machine for

More information

How To Build An Exadata Database Machine X2-8 Full Rack For A Large Database Server

How To Build An Exadata Database Machine X2-8 Full Rack For A Large Database Server Oracle Exadata Database Machine Overview Exadata Database Machine Best Platform to Run the Oracle Database Best Machine for Data Warehousing Best Machine for OLTP Best Machine for

More information

Building an Effective Data Warehouse Architecture James Serra

Building an Effective Data Warehouse Architecture James Serra Building an Effective Data Warehouse Architecture James Serra Global Sponsors: About Me Business Intelligence Consultant, in IT for 28 years Owner of Serra Consulting Services, specializing in end-to-end

More information

Concept and Applications of Data Mining. Week 1

Concept and Applications of Data Mining. Week 1 Concept and Applications of Data Mining Week 1 Topics Introduction Syllabus Data Mining Concepts Team Organization Introduction Session Your name and major The dfiiti definition of dt data mining i Your

More information

INFORMATICA WORLD TOUR August 2012

INFORMATICA WORLD TOUR August 2012 INFORMATICA WORLD TOUR August 2012 1 Maximise Your Return on Big Data Girish Pancha Chief Product Officer Informatica 2 IT leaders must think about big data and all the dimensions it implies in order to

More information

Big Data and Its Impact on the Data Warehousing Architecture

Big Data and Its Impact on the Data Warehousing Architecture Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research

More information

Focus on the business, not the business of data warehousing!

Focus on the business, not the business of data warehousing! Focus on the business, not the business of data warehousing! Adam M. Ronthal Technical Product Marketing and Strategy Big Data, Cloud, and Appliances @ARonthal 1 Disclaimer Copyright IBM Corporation 2014.

More information

Is Business Intelligence an Oxymoron?

Is Business Intelligence an Oxymoron? Is Business Intelligence an Oxymoron? Presentation by Agenda A Quiz! BI Definition and Concepts Components of a BI Solution Project Methodology Business Analysis BI Products BI Roadmap (time permitting)

More information

Presented by: Jose Chinchilla, MCITP

Presented by: Jose Chinchilla, MCITP Presented by: Jose Chinchilla, MCITP Jose Chinchilla MCITP: Database Administrator, SQL Server 2008 MCITP: Business Intelligence SQL Server 2008 Customers & Partners Current Positions: President, Agile

More information

Inge Os Sales Consulting Manager Oracle Norway

Inge Os Sales Consulting Manager Oracle Norway Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database

More information

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our

More information

Data Warehouse (DW) Maturity Assessment Questionnaire

Data Warehouse (DW) Maturity Assessment Questionnaire Data Warehouse (DW) Maturity Assessment Questionnaire Catalina Sacu - csacu@students.cs.uu.nl Marco Spruit m.r.spruit@cs.uu.nl Frank Habers fhabers@inergy.nl September, 2010 Technical Report UU-CS-2010-021

More information

Endeca Introduction to Big Data Analytics

Endeca Introduction to Big Data Analytics Endeca Introduction to Big Data Analytics Overview May 8, 2013 1 Agenda Introduction Overview Analytics for Big Data Overview Endeca Information Discovery Q & A 2 Introduction Business vs. IT Big Data

More information

Data warehousing with PostgreSQL

Data warehousing with PostgreSQL Data warehousing with PostgreSQL Gabriele Bartolini http://www.2ndquadrant.it/ European PostgreSQL Day 2009 6 November, ParisTech Telecom, Paris, France Audience

More information

Data Warehousing Fundamentals for IT Professionals. 2nd Edition

Data Warehousing Fundamentals for IT Professionals. 2nd Edition Brochure More information from http://www.researchandmarkets.com/reports/2171973/ Data Warehousing Fundamentals for IT Professionals. 2nd Edition Description: Cutting-edge content and guidance from a data

More information

Why Business Intelligence

Why Business Intelligence Why Business Intelligence Ferruccio Ferrando z IT Specialist Techline Italy March 2011 page 1 di 11 1.1 The origins In the '50s economic boom, when demand and production were very high, the only concern

More information

Oracle Database 12c Plug In. Switch On. Get SMART.

Oracle Database 12c Plug In. Switch On. Get SMART. Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.

More information

Implementing a SQL Data Warehouse 2016

Implementing a SQL Data Warehouse 2016 Implementing a SQL Data Warehouse 2016 http://www.homnick.com marketing@homnick.com +1.561.988.0567 Boca Raton, Fl USA About this course This 4-day instructor led course describes how to implement a data

More information

Business Intelligence: How to Unleash the Real Power of Microsoft s Fast Track and PDW Appliances

Business Intelligence: How to Unleash the Real Power of Microsoft s Fast Track and PDW Appliances Business Intelligence: How to Unleash the Real Power of Microsoft s Fast Track and PDW Appliances September, 2013 Author Andy Isley Scalability Experts, BI Practice Leader Executive Summary If you are

More information

Designing a Dimensional Model

Designing a Dimensional Model Designing a Dimensional Model Erik Veerman Atlanta MDF member SQL Server MVP, Microsoft MCT Mentor, Solid Quality Learning Definitions Data Warehousing A subject-oriented, integrated, time-variant, and

More information

<Insert Picture Here> Real-Time Data Integration for BI and Data Warehousing

<Insert Picture Here> Real-Time Data Integration for BI and Data Warehousing Real-Time Data Integration for BI and Data Warehousing Agenda Why Real-Time Data for BI? Architectures for Real-Time BI Oracle GoldenGate for Real-Time Data Integration Customer Examples

More information

Introducing Oracle Exalytics In-Memory Machine

Introducing Oracle Exalytics In-Memory Machine Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle

More information

Business Intelligence. 1. Introduction September, 2013.

Business Intelligence. 1. Introduction September, 2013. Business Intelligence 1. Introduction September, 2013. The content of the first lecture Introduction to data warehousing and business intelligence Star join 2 Data hierarchy Strategical data Operational

More information

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses

Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Investigating the Effects of Spatial Data Redundancy in Query Performance over Geographical Data Warehouses Thiago Luís Lopes Siqueira Ricardo Rodrigues Ciferri Valéria Cesário Times Cristina Dutra de

More information

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction

More information

Introduction to Database as a Service

Introduction to Database as a Service Introduction to Database as a Service Exadata Platform Revised 8/1/13 Database as a Service (DBaaS) Starts With The Business Needs Establish an IT delivery model that reduces costs, meets demand, and fulfills

More information

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi kishorejaladi@yahoo.com

Data Warehousing: Data Models and OLAP operations. By Kishore Jaladi kishorejaladi@yahoo.com Data Warehousing: Data Models and OLAP operations By Kishore Jaladi kishorejaladi@yahoo.com Topics Covered 1. Understanding the term Data Warehousing 2. Three-tier Decision Support Systems 3. Approaches

More information

Data Warehousing with Oracle

Data Warehousing with Oracle Data Warehousing with Oracle Comprehensive Concepts Overview, Insight, Recommendations, Best Practices and a whole lot more. By Tariq Farooq A BrainSurface Presentation What is a Data Warehouse? Designed

More information

Building a Cube in Analysis Services Step by Step and Best Practices

Building a Cube in Analysis Services Step by Step and Best Practices Building a Cube in Analysis Services Step by Step and Best Practices Rhode Island Business Intelligence Group November 15, 2012 Sunil Kadimdiwan InfoTrove Business Intelligence Agenda Data Warehousing

More information

MICHAEL SCHMITZ NOVEMBER 20-22, 2006 NOVEMBER 23-24, 2006 RESIDENZA DI RIPETTA - VIA DI RIPETTA, 231 ROME (ITALY)

MICHAEL SCHMITZ NOVEMBER 20-22, 2006 NOVEMBER 23-24, 2006 RESIDENZA DI RIPETTA - VIA DI RIPETTA, 231 ROME (ITALY) TECHNOLOGY TRANSFER PRESENTS MICHAEL SCHMITZ DATA WAREHOUSING Advanced Design and Implementation Issues ETL FOR THE DATA WAREHOUSE A Template-Driven Approach NOVEMBER 20-22, 2006 NOVEMBER 23-24, 2006 RESIDENZA

More information

FIFTH EDITION. Oracle Essentials. Rick Greenwald, Robert Stackowiak, and. Jonathan Stern O'REILLY" Tokyo. Koln Sebastopol. Cambridge Farnham.

FIFTH EDITION. Oracle Essentials. Rick Greenwald, Robert Stackowiak, and. Jonathan Stern O'REILLY Tokyo. Koln Sebastopol. Cambridge Farnham. FIFTH EDITION Oracle Essentials Rick Greenwald, Robert Stackowiak, and Jonathan Stern O'REILLY" Beijing Cambridge Farnham Koln Sebastopol Tokyo _ Table of Contents Preface xiii 1. Introducing Oracle 1

More information

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1 Big data are datasets that grow so large that they become awkward to work with

More information

BUILDING OLAP TOOLS OVER LARGE DATABASES

BUILDING OLAP TOOLS OVER LARGE DATABASES BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,

More information

Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance

Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance Brochure More information from http://www.researchandmarkets.com/reports/2248199/ Mastering Data Warehouse Aggregates. Solutions for Star Schema Performance Description: - This is the first book to provide

More information

Master Data Management. Zahra Mansoori

Master Data Management. Zahra Mansoori Master Data Management Zahra Mansoori 1 1. Preference 2 A critical question arises How do you get from a thousand points of data entry to a single view of the business? We are going to answer this question

More information

Real Life Performance of In-Memory Database Systems for BI

Real Life Performance of In-Memory Database Systems for BI D1 Solutions AG a Netcetera Company Real Life Performance of In-Memory Database Systems for BI 10th European TDWI Conference Munich, June 2010 10th European TDWI Conference Munich, June 2010 Authors: Dr.

More information

SUN ORACLE DATABASE MACHINE

SUN ORACLE DATABASE MACHINE SUN ORACLE DATABASE MACHINE FEATURES AND FACTS FEATURES From 1 to 8 database servers From 1 to 14 Sun Oracle Exadata Storage Servers Each Exadata Storage Server includes 384 GB of Exadata Smart Flash Cache

More information

BIPM H6001: Bus Intel & Process Modelling

BIPM H6001: Bus Intel & Process Modelling Short Title: Full Title: Bus Intel & APPROVED Bus Intel & Module Code: BIPM H6001 Credits: 7.5 NFQ Level: 9 Field of Study: Management and administration Module Delivered in no programmes Reviewed By:

More information

資 料 倉 儲 (Data Warehousing)

資 料 倉 儲 (Data Warehousing) 商 業 智 慧 實 務 Prac&ces of Business Intelligence Tamkang University 資 料 倉 儲 (Data Warehousing) 1022BI04 MI4 Wed, 9,10 (16:10-18:00) (B113) Min-Yuh Day 戴 敏 育 Assistant Professor 專 任 助 理 教 授 Dept. of Information

More information

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden

More information

Real-Time Data Warehousing & Fraud Detection with Oracle 11gR2. Dr.-Ing. Holger Friedrich

Real-Time Data Warehousing & Fraud Detection with Oracle 11gR2. Dr.-Ing. Holger Friedrich Real-Time Data Warehousing & Fraud Detection with Oracle 11gR2 Dr.-Ing. Holger Friedrich Agenda Introduction Scope & Challenges Tools & Infrastructure Architecture & Implementation Prospects Conclusions

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

BIG DATA & the Data Warehouse

BIG DATA & the Data Warehouse 25568 Genesee Trail Rd Golden, Colorado 80401 (303) 526-0340 Data Vault Modeling and Approach DW2.0 and Unstructured Data Master Data Management and Metadata BIG DATA & the Data Warehouse 2012 Genesee

More information

IBM WebSphere DataStage Online training from Yes-M Systems

IBM WebSphere DataStage Online training from Yes-M Systems Yes-M Systems offers the unique opportunity to aspiring fresher s and experienced professionals to get real time experience in ETL Data warehouse tool IBM DataStage. Course Description With this training

More information

Data warehouse Architectures and processes

Data warehouse Architectures and processes Database and data mining group, Data warehouse Architectures and processes DATA WAREHOUSE: ARCHITECTURES AND PROCESSES - 1 Database and data mining group, Data warehouse architectures Separation between

More information

Data Warehousing and Data Mining

Data Warehousing and Data Mining Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong gaocong@cs.aau.dk Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge

More information

The Technology Evaluator s Cheat Sheets. Business Intelligence & Analy:cs

The Technology Evaluator s Cheat Sheets. Business Intelligence & Analy:cs The Technology Evaluator s Cheat Sheets Business Intelligence & Analy:cs Summary So1ware Stacks Full Stacks (DB + ETL Tools + Front- End So1ware) Back- End Stacks (DB and/or ETL Tools Only) Front- End

More information

Driving Peak Performance. 2013 IBM Corporation

Driving Peak Performance. 2013 IBM Corporation Driving Peak Performance 1 Session 2: Driving Peak Performance Abstract We know you want the fastest performance possible for your deployments, and yet that relies on many choices across data storage,

More information

Exadata in the Retail Sector

Exadata in the Retail Sector Exadata in the Retail Sector Jon Mead Managing Director - Rittman Mead Consulting Agenda Introduction Business Problem Approach Design Considerations Observations Wins Summary Q&A What it is not... Introductions

More information

Data Warehouse Modeling Industry Models

Data Warehouse Modeling Industry Models Data Warehouse Modeling Industry Models Modeling Techniques come from Mars and Industry Models come from Venus? Maarten Ketelaars Agenda Introduction High level architecture Technical Aspects Functional

More information

Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture

Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture Ron Weiss, Exadata Product Management Exadata Database Machine Best Platform to Run the

More information

Dimensional Modeling for Data Warehouse

Dimensional Modeling for Data Warehouse Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or

More information

Business Intelligence, Data warehousing Concept and artifacts

Business Intelligence, Data warehousing Concept and artifacts Business Intelligence, Data warehousing Concept and artifacts Data Warehousing is the process of constructing and using the data warehouse. The data warehouse is constructed by integrating the data from

More information

Oracle Architecture, Concepts & Facilities

Oracle Architecture, Concepts & Facilities COURSE CODE: COURSE TITLE: CURRENCY: AUDIENCE: ORAACF Oracle Architecture, Concepts & Facilities 10g & 11g Database administrators, system administrators and developers PREREQUISITES: At least 1 year of

More information

Life Cycle of a Data Warehousing Project in Healthcare

Life Cycle of a Data Warehousing Project in Healthcare Life Cycle of a Data Warehousing Project in Healthcare Ravi Verma, Jeannette Harper ABSTRACT Hill Physicians Medical Group (and its medical management firm, PriMed Management) early on recognized the need

More information

Lecture Data Warehouse Systems

Lecture Data Warehouse Systems Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART A: Architecture Chapter 1: Motivation and Definitions Motivation Goal: to build an operational general view on a company to support decisions in

More information

The Data Webhouse. Toolkit. Building the Web-Enabled Data Warehouse WILEY COMPUTER PUBLISHING

The Data Webhouse. Toolkit. Building the Web-Enabled Data Warehouse WILEY COMPUTER PUBLISHING The Data Webhouse Toolkit Building the Web-Enabled Data Warehouse Ralph Kimball Richard Merz WILEY COMPUTER PUBLISHING John Wiley & Sons, Inc. New York Chichester Weinheim Brisbane Singapore Toronto Contents

More information

Data Warehousing. SQL Server 2008 R2, Denali

Data Warehousing. SQL Server 2008 R2, Denali Data Warehousing SQL Server 2008 R2, Denali Delta Sport Delta Sport ekskluzivni je distributer kompanije Nike i vodeći je sportski maloprodajni lanac u regionu. Franšizni je partner holandskog modnog brenda

More information

Aaron Werman. aaron.werman@gmail.com

Aaron Werman. aaron.werman@gmail.com Aaron Werman aaron.werman@gmail.com Complex integration of capital markets trading data Hundreds of ETLs, Thousands of tables 10K+ ETL executions per day, many highly complex Near real time SLAs ODS with

More information

Establish and maintain Center of Excellence (CoE) around Data Architecture

Establish and maintain Center of Excellence (CoE) around Data Architecture Senior BI Data Architect - Bensenville, IL The Company s Information Management Team is comprised of highly technical resources with diverse backgrounds in data warehouse development & support, business

More information

Data warehouse design

Data warehouse design DataBase and Data Mining Group of DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, Data warehouse design DATA WAREHOUSE: DESIGN - 1 Risk factors Database

More information

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database 1 Best Practices for Extreme Performance with Data Warehousing on Oracle Database Rekha Balwada Principal Product Manager Agenda Parallel Execution Workload Management on Data Warehouse

More information

Introducing Oracle Data Integrator and Oracle GoldenGate Marco Ragogna

Introducing Oracle Data Integrator and Oracle GoldenGate Marco Ragogna Introducing Oracle Data Integrator and Oracle GoldenGate Marco Ragogna EMEA Principal Sales Consultant Data integration Solutions IT Obstacles to Unifying Information What is it costing you to unify your

More information

2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation

2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation Agenda Need for Fast Data Analysis & The Data Explosion Challenge Approaches Used Till

More information

Data Warehouse Architecture Overview

Data Warehouse Architecture Overview Data Warehousing 01 Data Warehouse Architecture Overview DW 2014/2015 Notice! Author " João Moura Pires (jmp@di.fct.unl.pt)! This material can be freely used for personal or academic purposes without any

More information

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design COURSE OUTLINE Track 1 Advanced Data Modeling, Analysis and Design TDWI Advanced Data Modeling Techniques Module One Data Modeling Concepts Data Models in Context Zachman Framework Overview Levels of Data

More information