Modern Data Warehouse



Similar documents
Parallel Data Warehouse

Global outlook on the perspectives of technologies like Power Hub

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Architecting for the Internet of Things & Big Data

Whitepaper: Solution Overview - Breakthrough Insight. Published: March 7, Applies to: Microsoft SQL Server Summary:

A New Era Of Analytic

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Microsoft technológie pre BigData. Ľubomír Goryl Solution Professional

Better Business Analytics with Powerful Business Intelligence Tools

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Armanino McKenna LLP Welcomes You To Today s Webinar:

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Modernizing Your Data Warehouse for Hadoop

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

SAP Big Data and Cloud Application Development. Mark Mumy Director, Enterprise Architecture and Big Data

HDP Hadoop From concept to deployment.

Big Data Executive Survey

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

Agile BI With SQL Server 2012

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

HDP Enabling the Modern Data Architecture

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

Dell Information Management solutions

Microsoft BI Platform Overview

Getting Started Practical Input For Your Roadmap

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

Please give me your feedback

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

The Lab and The Factory

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Big Data and Trusted Information

Priyo Lahiri Partner Technical Consultant Microsoft Corporation

Microsoft Big Data 解決方案與案例分享

Business Intelligence with SharePoint 2010

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Business Intelligence and Healthcare

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

The Future of Data Management

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Disrupt or be disrupted IT Driving Business Transformation

CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE

Extending The Value of SAP with the SAP BusinessObjects Business Intelligence Platform Product Integration Roadmap

ENTERPRISE BI AND DATA DISCOVERY, FINALLY

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

Blueprints for Big Data Success

The BIg Picture. Dinsdag 17 september 2013

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Analance Data Integration Technical Whitepaper

IBM Big Data in Government

Intelligent BI Testing. Key to Reliable Information. Data to Impact.

Safe Harbor Statement

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

ENABLING OPERATIONAL BI

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

Big Data Analytics. Lucas Rego Drumond

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Big Analytics: A Next Generation Roadmap

Data Virtualization. Paul Moxon Denodo Technologies. Alberta Data Architecture Community January 22 nd, Denodo Technologies

How the oil and gas industry can gain value from Big Data?

Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015

Understanding the Value of In-Memory in the IT Landscape

Providing Answers for Business through Big Data

CONNECTING DATA WITH BUSINESS

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Analance Data Integration Technical Whitepaper

Integrating a Big Data Platform into Government:

Common Situations. Departments choosing best in class solutions for their specific needs. Lack of coordinated BI strategy across the enterprise

How To Create A Business Intelligence (Bi)

Outline. BI and Enterprise-wide decisions BI in different Business Areas BI Strategy, Architecture, and Perspectives

A Knowledge Management Framework Using Business Intelligence Solutions

UNIFY YOUR (BIG) DATA

Microsoft Analytics Platform System. Solution Brief

Ganzheitliches Datenmanagement

The difference between. BI and CPM. A white paper prepared by Prophix Software

The Enterprise Data Hub and The Modern Information Architecture

The Principles of the Business Data Lake

August 2014 San Antonio Texas The Power of Embedded Analytics with SAP BusinessObjects

Course 40009A: Updating your Business Intelligence Skills to Microsoft SQL Server 2012

Industrial Dr. Stefan Bungart

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

Extend your analytic capabilities with SAP Predictive Analysis

Transcription:

1

Modern Data Warehouse Are you ready for Big Data? Does your DWH / BI roadmap contain all the necessary components? IDG: Big data technologies describe a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis. it represents a unique, perhaps once-in-a-career opportunity to drive growth for their enterprises. They will need to lead the enterprise in the adoption of new information-taming technologies, best practices for leveraging and extracting value from data, and the creation of new roles and organizational design. Each step will require organizational change, not just a few new computers or more software. The success of many enterprises in the coming years will be determined by how successful CIOs are in driving the required enterprisewide adjustment to the new realities of the digital universe. 2

Big Data everyone talks about it The McKinsey Global Institute estimates that data volume is growing 40% per year, and will grow 44x between 2009 and 2020. Gartner: By 2015, organizations that build a modern information management system will outperform their peers financially by 20 percent. Harward Business Review: Data driven decisions are better decisions its as simple as that. Using big data enables managers to decide on the basis of evidence rather than intuition. For that reason it has the potential to revolutionize management. Larry Feinsmith (Managing Director at JPMorgan Chase): Integrating Hadoop with existing IT investments is vitally important. Picture created 08/2014 using: http://www.google.cz/trends Big Data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it "data warehouse" + "datawarehouse" "business intelligence" "big data" "hadoop" 3

Data Volume [ZB] Data explosion 40,00 35,00 30,00 25,00 20,00 15,00 10,00 5,00 Data Explosion Researchers at IDC estimate: 2010: more than 1 ZB (zettabyte, 1 ZB = 10 21 B) 2015: 6,8 ZB 2020: more than 36 ZB 90% of data on the planet was created in the past 2 years Most of the data is unstructured or semi-structured 90% 10% Structured Unstructured 0,00 Transportation One airplane generates 10 TB of data every 30 minutes. The total amount of data generated daily climbs into the petabyte scale. Utility One gas turbine blade supervision: 588 GB per day (Source: GE) How much data do we create? We call and use smart phones We use banking services, internet and mobile banking, payment cards We shop in stores We shop in e-shops We travel by plane, by car, public transport, etc. We go to the doctor We communicate and have fun 4

Big Data overview Data scientists, Big Data jsou jakákoliv data, která je obtížné efektivně Business analysts zpracovat (uložit, analyzovat) ve stávajících uložištích a nástrojích, Advanced zpravidla & Big Data analytics, Hadoop relačních databázích. Data Lakes Architecture and Data governance, Evangelization 5

Current DWH/BI (reality) Because of rigid DWH/BI platform business users developed their own, not managed, solution(s). They are used to utilize MS Excel and deliver reports as files (XLS, PDF, PPT). Data stored in many storages Many DWHs and Data Marts Excel, Access, etc. used to process and consolidate data Majority of reports & analysis created in MS Excel Data preparation using SQL Analysis performed via contingency tables and graphs Majority of reports & analysis delivered as files Mostly XLS, PDF, PPT format is used Gap between IT and business Users spend 90+% effort manually creating reports Rigid, expensive, slow changes / development Missing or dysfunctional BICC Missing tools / knowledge / trust related to data exploration 6

DWH / BI Study The figures below are based on real DWH/BI study from one client. But such situation is more or less common for many companies 90 80 70 60 50 40 30 20 10 0 Reporting data source Operational Systems DWH Sandbox / SQL 100 80 60 40 20 0 Report creation & delivery Enterprise BI tool MS Excel File Online Not all data available in DWH Data preparation using SQL Sandbox used as a permanent storage Majority of reports & analysis created in MS Excel by different business departments Analysis performed via contingency tables and graphs Missing code / knowledge sharing between misc. users Confusing reports (the same number calculated by different algorithms) Majority of reports & analysis delivered as files Mostly XLS, PDF, PPT format is used Gap between IT and business Users spend 90+% effort manually creating reports Missing or dysfunctional BICC Missing tools / knowledge / trust related to data exploration 7

Current DWH/BI (challenges) Gartner, The State of Data Warehousing in 2012: data warehousing has reached the most significant tipping point since its inception. The biggest, possibly most elaborate data management system in IT is changing. 4 3 1 5 6 New data sources & types, increasing data volumes More advanced data processing (including unstructured data) Text mining, semantic technologies, NPL Natural language processing, Speech recognition Near real-time data processing BI tools to support flexible data analysis Self-Service BI In-memory analytics, Visual Data Discovery, Data Visualization, Data mining, Predictive analytics, Graph / Network analytics, Path analytics, Sentiment analysis New roles and knowledge BICC Architecture & Data Governance Security Data Quality TCO (affordability) 2 7 8

Comprehensive DWH/BI architecture The traditional DWH ecosystem extended to include new big data sources and technologies. Utilize relevant, affordable platforms to efficiently store, process and analyze data (do more with less). Heterogeneous (but integrated!) big data management platform consist of: Operational Data Store Enterprise Data Warehouse Hadoop Streaming (CEP) Business Intelligence tools should cover Corporate BI, Self-Service BI, Advanced Analytics Efficient BICC is crucial 9

Architecture overview (Microsoft DWH / BI Stack) Big Data Sources (Raw, Unstructured) Streaming Data & Compute Intensive Application, Data Discovery SQL Server StreamInsight Alerts, Notifications Operational Dashboards Business Insights Interactive Reports Performance Scorecards Corporate BI Sensors Devices Web Social Networks Fast Load Hadoop on Windows Azure Hadoop on Windows Server Historical Data (Beyond Active Window) Summarize & Load Analytics Platform System (PDW & HDI) Integrate/ Enrich SQL Server Data Marts SQL Server Analysis Server SQL Server Reporting Services Excel SharePoint Collaboration Self-service Office Integration Data Visualization Data Mining Data Discovery Self-service BI Advanced Analytics Enterprise ETL with SSIS, DQS, MDS BI Governance ERP CRM LOB APPS Source Systems Azure Power BI Administration & Development BICC End-to-End DW & Big Data Platform, Driving Analytics on any Data 10