IOT & Big Data: The Future Information Processing Architecture



Similar documents
Building Scalable Big Data Pipelines

Evolution of Information Management Architecture and Development

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

#TalendSandbox for Big Data

Data Integration Checklist

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

Ganzheitliches Datenmanagement

Big Data Analytics Nokia

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Talend Big Data. Delivering instant value from all your data. Talend

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Big Data and Fast Data combined is it possible?

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

How To Use Big Data For Business

Big Data. Marriage of RDBMS-DWH and Hadoop & Co. Author: Jan Ott Trivadis AG Trivadis. Big Data - Marriage of RDBMS-DWH and Hadoop & Co.

How To Extend An Enterprise Bio Solution

Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015

How to Enhance Traditional BI Architecture to Leverage Big Data

Dominik Wagenknecht Accenture

HDP Hadoop From concept to deployment.

Service Oriented Data Management

Roadmap Talend : découvrez les futures fonctionnalités de Talend

The Role of the BI Competency Center in Maximizing Organizational Performance

Information Architecture

Designing Self-Service Business Intelligence and Big Data Solutions

CAPTURING & PROCESSING REAL-TIME DATA ON AWS

The Internet of Things and Big Data: Intro

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Fast Innovation requires Fast IT

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Bringing Big Data to People

Migrating Discoverer to OBIEE Lessons Learned. Presented By Presented By Naren Thota Infosemantics, Inc.

CUIT with Visual Studio and TFS

Information Builders Mission & Value Proposition

Managing Data in Motion

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

secure intelligence collection and assessment system Your business technologists. Powering progress

Big Data and Industrial Internet

Native Connectivity to Big Data Sources in MSTR 10

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

Azure Data Lake Analytics

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Global outlook on the perspectives of technologies like Power Hub

Making Sense of Big Data in Insurance

BIG DATA ARCHITECTURE AND ANALYTICS BIG DATA STRATEGIES FOR BUSINESS GROWTH

... Foreword Preface... 19

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Big Data on Microsoft Platform

Microsoft Business Intelligence

Artur Borycki. Director International Solutions Marketing

Introduction to Apache Kafka And Real-Time ETL. for Oracle DBAs and Data Analysts

The Lab and The Factory

Big Data Architect Certification Self-Study Kit Bundle

Luncheon Webinar Series May 13, 2013

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Data Virtualization and ETL. Denodo Technologies Architecture Brief

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

IBM Big Data Platform

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

INTRODUCTION TO APACHE HADOOP MATTHIAS BRÄGER CERN GS-ASE

Big Data & Analytics Reference Architecture

This Symposium brought to you by

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

The Future of Data Management

Big Data at Cloud Scale

SAP HANA SPS 09 - What s New? HANA IM Services: SDI and SDQ

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Secure Test Data Management with ORACLE Data Masking

Informatica and the Vibe Virtual Data Machine

Big Data: Making Sense of it all!

Practical Hadoop by Example

Architecting Open source solutions on Azure. Nicholas Dritsas Senior Director, Microsoft Singapore

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Information Management and Big Data

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

Building Blocks of Cortana Intelligence Suite

Challenges of Analytics

Analytics: Pharma Analytics (Siebel 7.8) Student Guide

What's New in SAS Data Management

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

A Scalable Data Transformation Framework using the Hadoop Ecosystem

SQL Server 2012 Business Intelligence Boot Camp

IT FUSION CONFERENCE. Build a Better Foundation for Business

SAS BI Course Content; Introduction to DWH / BI Concepts

Microsoft Azure. IaaS Networking Storage. Stefan Geiger Gerry

HDP Enabling the Modern Data Architecture

Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p.

Agile Business Intelligence Data Lake Architecture

Integrating Netezza into your existing IT landscape

The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn

Oracle Big Data Handbook

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

Transcription:

IOT & Big Data: The Future Information Processing Architecture Dr. Michael Faden Dirk Weise BASEL BERN BRUGG GENF LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN 1

Oder: Was mach ich nur mit den ganzen Daten? 2

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 3

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 4

Discussions about Big Data Is our use case Big Data or not? What shall we do with all that Sensor Data Do we have to use Hadoop or NoSQL for Big Data? Do we miss out if we don t do Big Data? 5

Understand Drivers for Big Data Undertakings Opportunity Business Scenario Business Demand Capability A business scenario bridges three drivers end to end: Opportunity There is something one could do. Capability There is something we can do. Business demand There is something we need to do. 6

BIG Respelled Business Value: Investment in Big Data, be it in terms of personnel or technology, has to be rectified by convincing business demand. Integration: Big Data sources have to be integrated with the enterprise architecture semantically, technologically and procedurally. Governance: Big Data must not dilute the enterpise information model. Business processes have to be aligned to data source properties (e.g. topicality, reliability). 7

BIG Respelled Business Value: Investment in Big Data, be it in terms of personnel or technology, has to be rectified by convincing business demand. Integration: Big Data sources have to be integrated with the enterprise architecture semantically, technologically and procedurally. Governance: Big Data must not dilute the enterpise information model. Business processes have to be aligned to data source properties (e.g. topicality, reliability). 8

..how to avoid this? 9

Separate Concepts from Implementation Required Capability What? Provided Capability How? Required capability Conceptual requirements capture and high-level solution Independent from technology, organisation and processes Provided capability Concrete detailed solution and implementation Specific to technology, organisation and processes 10

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 11

Terminology Architecture Fundamentals of a system Context, structure, abstraction, process Architecture View System seen from the perspective of specific concerns Comprehensible, comprehensive Building Block Potentially reusable component of an architecture Relevant, interrelated, fractal ISO/IEC 42010 Systems and software engineering Architecture description TOGAF Version 9.1 Definitions 12

Reference Architecture Architecture Building Blocks Solution Building Blocks Our Approach Method & Patterns Requirements view Implementation view Operation view Reference architecture. The canvas for our architecture ABBs. Commonly found requirements SBBs. Commonly applied solutions Views. Tailored to stakeholders Patterns. Commonly found requirement/solution profiles Method. Guideline to populate the canvas including consistency checks 13

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 14

Reference Architecture Governance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Discovery Lab Organisation 15

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 16

Architecture Building Blocks Governance Metadata Management Master Data Management Information Quality & Accountability Security Legal Compliance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Data Engines & Poly-structured Sources Structured Data Sources Data Source Connectivity & Capturing Raw Data in Motion Managed Information Layer Access & Performance Layer Virtualisation & Query Federation Export (push) Query (pull) Prebuilt & Adhoc BI Assets Information Services Master & Reference Data Sources Data Factory Layer Discovery Lab Content Raw Data at Rest Discovery Lab Sandboxes Advanced Analysis & Data Science Tools Organisation BI Competence Centre IT Operations Business Stakeholders 17

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 18

Architecture Patterns Schema on Write Governance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Data Source Connectivity & Capturing Managed Information Raw Data in Motion Factory Discovery Lab Raw Data at Rest Organisation 19

Architecture Patterns Classic BI Governance Metadata Management Master Data Management Information Quality & Accountability Security Legal Compliance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Structured Data Sources Data Source Connectivity & Capturing Managed Information Access & Performance Layer Query (pull) Prebuilt & Adhoc BI Assets Master & Reference Data Sources Factory Discovery Lab Organisation BI Competence Centre IT Operations Business Stakeholders 20

Architecture Patterns Classic BI (in its own words) Governance Metadata Management Master Data Management Information Quality & Accountability Security Legal Compliance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Structured Data Sources Data Source Connectivity & Capturing Core DWH Data Marts Query (pull) Reporting & Ad-hoc Queries BI Portal etc. Master & Reference Data Sources ETL incl. Staging & Cleansing Advanced Analytics ODS incl. Historisation Data Mining Organisation BI Competence Centre IT Operations Business Stakeholders 21

Architecture Patterns Schema on Read Governance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Data Source Connectivity & Capturing Raw Data in Motion Discovery Lab Raw Data at Rest Organisation 22

Architecture Patterns Lambda Architecture Governance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Data Source Connectivity & Capturing Raw Data in Motion Factory Managed Information Access & Performance Layer Managed Informations Discovery Lab Raw Data at Rest Factory Organisation 23

Architecture Patterns Discovery Lab Governance Data Sources Data Ingestion Data Factory & Managed Enterprise Information Information Provisioning Information Query & Visualisation Data Source Connectivity & Capturing Raw Data in Motion Managed Information Layer Discovery Lab Raw Data at Rest Discovery Lab Sandboxes Advanced Analysis & Data Science Tools Organisation BI Competence Centre IT Operations Business Stakeholders 24

Agenda 1. Introduction 2. Terms and Definitions 3. Reference Architecture 4. Architecture Building Blocks 5. Architecture Patterns 6. Solution Building Blocks 25

StreamInsight Solutions Data Ingestion Architecture Solutions An RDBMS Way (synchronous) An MS Way (asynchronous) Data Source Connectivity & Capturing DB Upsert EventHub Raw Data in Motion Data Factory DB Triggers Raw Data at Rest DB Staging Tables SQLServer Similar: CDC Similar: ESB Kafka & Storm 27

Solutions Data Factory (Integration) Architecture Solutions To RDBMS To Hadoop To NoSQL Data Store (RDBMS) DB Link JDBC SQOOP, JDBC SPARQL Data Store (Hadoop) Data Store SQOOP, JDBC Hive, Pig, Drill Data Store (RDBMS) Hive, Pig, Drill Data Store (Hadoop) SPARQL Data Store (NoSQL) Data Store (NoSQL) SPARQL SPARQL SPARQL 28

Solutions Data Factory (Interpretation) Architecture Solutions Schema Application Identity Resolution Data Cleansing Data Capture Raw Data Raw Data Raw Data Raw Data Raw Data in Raw Motion Data MEI Parsing NLP OCR Normalisation MEI Matching Algorithms MEI Cleansing Algorithms MEI Raw Data at Raw Rest Data Reference Data Operational Data Reference Data 29

Solution Building Block Governance Data Sources Data Ingestion Azure HDInsight Data Factory & Managed Enterprise Information Power BI Information Provisioning Information Query & Visualisation ML Discovery Studio Lab Organisation 30

Questions & Answers Dr. Michael Faden Senior Consultant Tel.: +49 162 291 47 74 michael.faden@trivadis.com Dirk Weise Senior Consultant Tel.: +41 79 909 72 48 dirk.weise@trivadis.com BASEL BERN BRUGG GENF LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN