So Many Tools, So Much Data, and So Much Meta Data



Similar documents
Data Vault + Data Virtualization = Double Flexibility

What is Data Virtualization? Rick F. van der Lans, R20/Consultancy

Data Services: The Marriage of Data Integration and Application Integration

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Data Warehouse Optimization

Data Virtualization for Business Intelligence Agility

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

Evolving Data Warehouse Architectures

Luncheon Webinar Series May 13, 2013

Big Data and Your Data Warehouse Philip Russom

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Dell Information Management solutions

Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

Agile BI The Future of BI

Data Virtualization. Paul Moxon Denodo Technologies. Alberta Data Architecture Community January 22 nd, Denodo Technologies

Getting Started Practical Input For Your Roadmap

Data Virtualization Usage Patterns for Business Intelligence/ Data Warehouse Architectures

Business Intelligence with SharePoint 2010

Business Intelligence: Effective Decision Making

Integrating Netezza into your existing IT landscape

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

Introducing Oracle Exalytics In-Memory Machine

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Independent process platform

Management Accountants and IT Professionals providing Better Information = BI = Business Intelligence. Peter Simons peter.simons@cimaglobal.

IT FUSION CONFERENCE. Build a Better Foundation for Business

Il mondo dei DB Cambia : Tecnologie e opportunita`

Achieving Business Value through Big Data Analytics Philip Russom

Common Situations. Departments choosing best in class solutions for their specific needs. Lack of coordinated BI strategy across the enterprise

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

Building Dashboards for Real Business Results. Cindi Howson BIScorecard December 11, 2012

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Data warehouse and Business Intelligence Collateral

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

Oracle Business Intelligence 11g Business Dashboard Management

SQL Server 2012 Business Intelligence Boot Camp

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Design of Electricity & Energy Review Dashboard Using Business Intelligence and Data Warehouse

The Lab and The Factory

Parallel Data Warehouse

The IBM Cognos Platform

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

What is Data Virtualization?

An Enterprise Framework for Business Intelligence

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

QlikView Business Discovery Platform. Algol Consulting Srl

Microsoft Analytics Platform System. Solution Brief

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Big Analytics: A Next Generation Roadmap

Key Data Replication Criteria for Enabling Operational Reporting and Analytics

Armanino McKenna LLP Welcomes You To Today s Webinar:

Data Virtualization A Potential Antidote for Big Data Growing Pains

Data Integration Alternatives & Best Practices

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

MDM and Data Warehousing Complement Each Other

DATA MINING AND WAREHOUSING CONCEPTS

The 3 questions to ask yourself about BIG DATA

BI FUTURES: BI Like You ve not Seen Before! Babar Jan-Haleem APAC Director Specialist Architecture Team

Ganzheitliches Datenmanagement

Oracle Business Intelligence Suite Enterprise Edition

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

The Ultimate Guide to Buying Business Analytics

The IBM Cognos Platform for Enterprise Business Intelligence

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

Salesforce.com and MicroStrategy. A functional overview and recommendation for analysis and application development

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

ETL-EXTRACT, TRANSFORM & LOAD TESTING

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

Title Business Intelligence: A Discussion on Platforms, Technologies, and solutions

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Breadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne

The Ultimate Guide to Buying Business Analytics

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Open Source Business Intelligence Intro

Information Architecture

Providing real-time, built-in analytics with S/4HANA. Jürgen Thielemans, SAP Enterprise Architect SAP Belgium&Luxembourg

ENABLING OPERATIONAL BI

W H I T E P A P E R B u s i n e s s I n t e l l i g e n c e S o lutions from the Microsoft and Teradata Partnership

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014

Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE

Investor Presentation. Second Quarter 2015

The big data business model: opportunity and key success factors

Transcription:

So Many Tools, So Much Data, and So Much Meta Data Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands. All rights reserved. No part of this material may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photographic, or otherwise, without the explicit written permission of the copyright owners. by Rick F. van der Lans R20/Consultancy BV Twitter: rick_vanderlans www.r20.nl Rick F. van der Lans Rick F. van der Lans is an independent consultant, lecturer, and author. He specializes in data warehousing, business intelligence, service oriented architectures, and database technology. He is managing director of R20/Consultancy B.V.. Rick has been involved in various projects in which SOA, data warehousing, and integration technology was applied. Rick van der Lans is an internationally acclaimed lecturer. He has lectured professionally for the last twenty years in many of the European and Middle East countries, the USA, South America, and in Australia. He has been invited by several major software vendors to present keynote speeches. He is the author of several books on computing, including Myths on Computing. Some of these books are available in different languages. Books such as the popular Introduction to SQL and SQL for MySQL Developers, are available in English, Dutch, Italian, Chinese, and German and are sold world wide. This year he released The SQL Guide to Ingres. As author for BeyeNetwork.com, writer of whitepapers, as chairman for the annual European Data Warehouse and Business Intelligence Conference, and as columnist for a few IT magazines, he has close contacts with many vendors. R20/Consultancy B.V. is located in The Hague, The Netherlands, www.r20.nl. You can get in touch with Rick via: Email: rick@r20.nl Twitter: http://twitter.com/rick_vanderlans LinkedIn: http://www.linkedin.com/pub/rick-van-der-lans/9/207/223 Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 2 1

What is Business Intelligence? The success of most organizations is highly dependent on the quality of their decision making The field of business intelligence focuses on supporting and possibly improving the decision making process of an organization Definition by Boris Evelson of Forrester Research: Business Intelligence is a set of methodologies, processes, architectures, and technologies that transform raw data into meaningful and useful information used to enable more effective strategic, tactical, and operational insights and decisionmaking. Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 3 Too Much Data Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 4 2

Chris J. Date - 1977 An enterprise should store its operational data in an integrated database to provide the enterprise with centralized control of its operational data This is in sharp contrast to the situation that prevails in most enterprises today [1977], where typically each application has its own private files so that the data is widely dispersed. Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 5 Utopia: One Large Database Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 6 3

Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 7 Chain of Databases production databases staging area data warehouse datamarts personal data stores Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 8 4

I BM I BM I BM The Chain is a Complex Network personal data stores production databases data marts data staging area operational data store data warehouse Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 9 Real Life Architecture VSAM VSAM DB2 UDB SQL Server 2000 Operational Systems Facets Core Proxy Extension Pseudo CHF, Diabetes Women's Health others ValuTech ITS PeopleSoft SFA (Onyx) Pegasystems External Feeds ASK / Dental NASCO FEP PBM - Wellpoint Vendors - Lab [2] - Vision [2] - Chiropractic Drug Claims Intelligence & SQR SQR Reports ExStream Intelligence & SQR SAR SAR RTF Excel Spreadsheets CCMS (McKesson) SQL Server 2000 SQR SEGRA CRMS (McKesson) Query Builder/ OLAP HEDIS Baseline Assessment Tool PB App RPA Mainframe Files Medical Claims Membership Drug Claims Wellpoint Premium Capitation Provider Lookup/Dimension Files [28] RPA Database Medical Claims Membership Drug Claims - Wellpoint Claims Repository VB App Outsourced Intelligence EDW (Ingenix) Analyzers Config & Surveys Adjustments SAR Postscript Extream Extream PIP SAR SAR Analysis Services SQR Extream Postscript FAMS? (IBM) SQL Server 2000 Intelligence PBViews? ~50 MS Access DB Applications Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 10 5

Disadvantages of Data Duplication and Distribution Data latency increases Costs of storing and managing duplicate data increases Flexibility decreases Data quality decreases (potentially) Costs of data integration increases Data security more complex And many more Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 11 2011 TDWI BI Benchmark Report The average time needed to add a new data source was 8.4 weeks in 2009, 7.4 weeks in 2010, and 7.8 weeks in 2011. 33% needed more than 3 months. Developing a complex report or dashboard with about 20 dimensions, 12 measures, and 6 user access rules, took on average 6.3 weeks in 2009, 6.6 weeks in 2010, and 7 weeks in 2011. 30% of the respondents indicated they needed at least 3 months or more for such a development exercise. Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 12 6

Problems with Current DW Platforms Poor query response Can t support advanced analytics Inadequate data load speed Can t scale to large data volumes Cost of scaling up is too expensive Poorly suited to real-time or on demand workloads Current platform is a legacy we must phase out Can t support data modeling we need We need platform that supports mixed workloads Can t support large concurrent user count Inadequate high availability Inadequate support for in-memory processing Inadequate support for web services and SOA Current platform is 32-bit, and we need 64-bit Current platform is SMP, and we need MPP We need platform better suited to cloud or virtualization Can t secure the data properly Other No problems 45% 40% 39% 37% 33% 29% 23% 23% 21% 20% 19% 16% 16% 15% 14% 13% 11% 4% 3% Source: P. Russom, Next Generation Data Warehouse Platforms, TDWI Best Practices Report, fourth quarter 2009. Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 13 New Forms of The BI New BI Agile BI 360 reporting Exploratory analysis Operational BI Deep analytics Big data analytics Self-service BI Semi-structured and unstructured data analytics Disposable reports And many more Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 14 7

Operational BI Supporting decision making on the operational management level Different forms Operational reporting Operational analytics Embedded analytics Exception reporting All forms need access to operational data Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 15 Big Data (More Databases?) Size Does Matter Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 16 8

Need for more Database Power Financial Organization: 1000 users concurrently 50-80 queries concurrently 100 Terabytes data warehouse Data latency 30 minutes Trend towards data normalization (each fact only once) Technical/engineering company: 40 Terabytes data warehouse 4 Petabytes of I/O per day Peek: more than 100 queries concurrently Business critical data warehouse Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 17 Data Virtualization to the Rescue production application reporting & analytics SOA SQL statement SQL statement SQL statement Data Virtualization SQL statement Server Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 18 9

Too Many Tools Reporting Tools Executive Reporting Tools OLAP Tools BAM/KPI/Dashboarding Tools Spreadsheets Data Visualization Tools Geo Visualization Tools Analytical Tools Predictive Modeling Tools Forecasting Tools Optimization Tools (Operations Research) Statistical Analysis Tools Data Mining Tools Text Mining Tools Data Discovery/Exploitation Tools Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 19 And Even More Tools Data Integration Tools ETL ELT Replication Data Virtualization Data Services (ESB) Data StorageTools SQL Databases NoSQL Databases Cubes multi-dimensional Data Warehouse Appliances In-memory Databases Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 20 10

Gartner Magic Quadrant 2012 for BI Platforms Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 21 Is IT (BICC) losing control? Gartner: By 2012, business units will control Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 22 at least 40% of the total budget for BI 11

Quote Gartner on Self-Service BI Vocal, demanding and influential business users are increasingly driving BI purchasing decisions. They re choosing easier to use data discovery tools over traditional BI platforms with or without IT's consent. Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 23 Self-Service BI Self-Service Reporting Self-Service Analytics Self-Service ETL Self-Service Cleansing Self-Service Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 24 12

BI in the Cloud production databases ODS data warehouse datamarts personal data stores Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 25 Example: BlinkLogic Headquartered in San Rafael, CA Formerly known as DataJungle Software BI solution includes dashboards, analytics, collaboration, annotation, key performance indicators (KPI) monitoring, notifications to smart phones, location intelligence, Web reports, export to Excel, and portable document format (PDF) They aim at midsize customers Most customers are not IT professionals Runs on Oracle with OLAP cubes Outsourced the client databases to UpSource Do they still exist? Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 26 13

Can Copyright be downloaded 1991-2012 R20/Consultancy from www.r20.nl B.V., The Hague, The Netherlands 27 Too Much and Too Many Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 28 14

Let s Wrap it Up! Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 29 No/Minimal Integration of Meta Data Each tool stores its own meta data Each tool creates its own meta data Effect Proliferation of meta data Duplication and no sharing of meta data No one wants a meta data warehouse project Integration of meta data is a must! Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 30 15

Current (Sad) Situation It s reducing time to market It s reducing flexibility (now when it is needed) Bad for data quality Trust in data factory diminishes Too much time is spent on technical aspects and not user requests Unneccessary costs Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 31 Recommendations Select BI platforms where meta data is integrated Select database servers that support mixed workload Super database power Simplify BI architecture Try to avoid introducing extra databases because of performance reasons Use data virtualization to introduce flexibility Copyright 1991-2012 R20/Consultancy B.V., The Hague, The Netherlands 32 16