IBM Smarter Analytics für Big Data



Similar documents
IBM Big Data in Government

HDP Hadoop From concept to deployment.

IBM Big Data Platform

IBM Big Data. Hadoop-tietoisku kumppaneille Pekka Leppänen, IBM Analytics Platform Leader Finland IBM Corporation

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

The Future of Data Management

Exploiting Data at Rest and Data in Motion with a Big Data Platform

The Internet of Things

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

Comprehensive Analytics on the Hortonworks Data Platform

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Big Data & Analytics. The. Deal. About. Jacob Büchler jbuechler@dk.ibm.com Cand. Polit. IBM Denmark, Solution Exec IBM Corporation

HDP Enabling the Modern Data Architecture

Bringing Big Data to People

The Future of Data Management with Hadoop and the Enterprise Data Hub

Luncheon Webinar Series May 13, 2013

How the oil and gas industry can gain value from Big Data?

How Companies are! Using Spark

Upcoming Announcements

Talend Big Data. Delivering instant value from all your data. Talend

BIG Data Analytics Move to Competitive Advantage

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Extend your analytic capabilities with SAP Predictive Analysis

Big Data Management and Security

Delivering secure, real-time business insights for the Industrial world

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Safe Harbor Statement

Native Connectivity to Big Data Sources in MSTR 10

Big Data Realities Hadoop in the Enterprise Architecture

The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer,

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Dell In-Memory Appliance for Cloudera Enterprise

Dominik Wagenknecht Accenture

Making big data simple with Databricks

Bringing the Power of SAS to Hadoop. White Paper

Advanced Big Data Analytics with R and Hadoop

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Big Data: Making Sense of it all!

Informix Product Strategy and Roadmap Data, Cloud, Analytics, Internet of Things

What s next for the Berkeley Data Analytics Stack?

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Why Spark on Hadoop Matters

Roadmap Talend : découvrez les futures fonctionnalités de Talend

From Spark to Ignition:

Are You Ready for Big Data?

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

#TalendSandbox for Big Data

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Modern Data Architecture for Predictive Analytics

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

Are You Big Data Ready?

Data Security in Hadoop

Big Data Strategies with IMS

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

BIG DATA TRENDS AND TECHNOLOGIES

Information Builders Mission & Value Proposition

How To Write A Trusted Analytics Platform (Tap)

Integrating a Big Data Platform into Government:

SAP and Hortonworks Reference Architecture

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Self-service BI for big data applications using Apache Drill

How To Create A Data Visualization With Apache Spark And Zeppelin

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

IBM InfoSphere BigInsights Enterprise Edition

The Digital Enterprise Demands a Modern Integration Approach. Nada daveiga, Sr. Dir. of Technical Sales Tony LaVasseur, Territory Leader

Harnessing big data with Hortonworks Data Platform and Red Hat JBoss Data Virtualization

Modernizing Your Data Warehouse for Hadoop

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

Databricks. A Primer

Self-service BI for big data applications using Apache Drill

ANALYTICS CENTER LEARNING PROGRAM

The Internet of Things and Big Data: Intro

Big Data and the new trends for BI and Analytics Juha Teljo Business Intelligence and Predictive Solutions Executive IBM Europe

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Big Data and New Paradigms in Information Management. Vladimir Videnovic Institute for Information Management

Ali Ghodsi Head of PM and Engineering Databricks

Microsoft Big Data. Solution Brief

Gain Contextual Awareness for a Smarter Digital Enterprise with SAP HANA Vora

and Hadoop Technology

Big Data and Industrial Internet

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

IBM BigInsights for Apache Hadoop

P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland

White Paper: Hadoop for Intelligence Analysis

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

Are You Ready for Big Data?

Transcription:

FZI Forschungszentrum Informatik am Karlsruher Institut für Technologie Technologie Workshop Big Data 22. Juni 2015 Axel J. Schwarz Mobil: +49 171 5619419 E-Mail: axel.j.schwarz@de.ibm.com IBM Smarter Analytics für Big Data v1.3 1

Paradigm shift enabled by Big Data Analytics is expanding from enterprise data to big data, creating new opportunities for competitive advantage Traditional Approach Structured, analytical, logical New Approach Creative, holistic thought, intuition Transaction Data Data Warehouse Hadoop & Streaming Data Web Logs, URLs Contact Center notes Customer Data Core System Data Structured Repeatabl e Linear Enterprise Integration Unstructured Exploratory Iterative Text Data: emails, chats Social Data Log data 3 rd Party Data Traditional Sources New Sources Geolocation data 2

Paradigm shift enabled by Big Data Leverage more of the data being captured TRADITIONAL APPROACH BIG DATA APPROACH All available information Analyzed information All available information analyzed Analyze small subsets of information Analyze all information 3

Paradigm shift enabled by Big Data Reduce effort required to leverage data TRADITIONAL APPROACH BIG DATA APPROACH Small amount of carefully organized information Large amount of messy information Carefully cleanse information before any analysis Analyze information as is, cleanse as needed 4

Paradigm shift enabled by Big Data Data leads the way and sometimes correlations are good enough TRADITIONAL APPROACH BIG DATA APPROACH Hypothesis Question Data Exploration Answer Data Insight Correlation Start with hypothesis and test against selected data Explore all data and identify correlations 5

Paradigm shift enabled by Big Data Leverage data as it is captured TRADITIONAL APPROACH BIG DATA APPROACH Data Analysis Data Repository Analysis Insight Insight Analyze data after it s been processed and landed in a warehouse or mart Analyze data in motion as it s generated, in real-time 6

TRADITIONAL APPROACH IBM Smarter Analytics für Big Data Paradigm shift enabled by Big Data - Summary Analyze small subsets of information Analyze all information All available information Analyzed information All available information analyzed BIG DATA APPROACH Hypothesis Question Data Exploration Answer Data Insight Correlation Start with hypothesis and test against selected data Explore all data and identify correlations 7

TRADITIONAL APPROACH IBM Smarter Analytics für Big Data Paradigm shift enabled by Big Data - Summary Analyze small subsets of information Analyze all information EVERY All available information EXACT SLOW Hypothesis Question Analyzed information THING LACK OF Data Exploration All available information analyzed BIG DATA APPROACH Answer Data Insight CONTROL Correlation Start with hypothesis and test against selected data Explore all data and identify correlations 8

IBM Big Data & Analytics Referenz-Architektur Komponenten - Überblick 9

IBM Big Data & Analytics Referenz-Architektur Komponenten - Detaillierung 10

IBM Big Data & Analytics Referenz-Architektur Komponenten Fokus 11

In-Memory Big Data Processing - Apache Spark IBM Investing in Four Catalysts for Big Data Adoption Open Source Innovation Technical Standards Familiar Interfaces & Integration with Established Tools New Analytics Capabilities 12

In-Memory Big Data Processing - Apache potentially the most significant open source project of the next decade. To further accelerate open source innovation for the Spark ecosystem, IBM is taking the following actions: 13 IBM will build Spark into the core of the company s analytics and commerce platforms. IBM's Watson Health Cloud will leverage Spark as a key underpinning for its insight platform, helping to deliver faster time to value for medical providers and researchers as they access new analytics around population health data. IBM will open source its breakthrough IBM SystemML machine learning technology and collaborate with Databricks to advance Spark s machine learning capabilities. IBM will offer Spark as a Cloud service on IBM Bluemix to make it possible for app developers to quickly load data, model it, and derive the predictive artifact to use in their app. IBM will commit more than 3,500 researchers and developers to work on Spark-related projects [ ], and open a Spark Technology Center in San Francisco [ ]. IBM will educate more than 1 million data scientists and data engineers on Spark through extensive partnerships with AMPLab, DataCamp, MetiStream, Galvanize and BigData University MOOC.

In-Memory Big Data Processing - Apache Spark The Combination: The Flexibility of Spark on a Stable Hadoop Platform Unlimited Scale Ease of Development In-Memory Performance Enterprise Platform Wide Range of Data Formats Combine Workflows 14

In-Memory Big Data Processing - Apache Spark Die wichtigsten Bewohner des Hadoop-Zoos Quelle: Uwe Seiler: Zoo voller Gehege, in: ix Developer Big Data, 2/2015, S.37 15

In-Memory Big Data Processing - Apache Spark What s the scoop with Hadoop? Der Link zum Video findet sich im Anhang dieser Präsentation. 16

In-Memory Big Data Processing - Apache Spark Spark Libraries Spark SQL Spark Streaming GraphX MLlib SparkR Apache Spark 17

In-Memory Big Data Processing - Apache Spark Spark on Hadoop Spark SQL Spark Streaming GraphX MLlib SparkR Apache Spark Apache Hadoop-YARN Apache Hadoop-HDFS Slave node 1 Slave node 2 Slave node n Resource management Storage management Compute layer 18

In-Memory Big Data Processing - Apache Spark IBM Open Platform with Apache Hadoop 100% open source code Commitment to currency: days, not months Includes Spark Free for production use Decoupled Apache Hadoop from IBM analytics and data science technologies Production support offering available IBM Open Platform with Apache Hadoop HDFS MapReduce Spark Hive HCatalog Pig YARN Ambari HBase Flume Sqoop Solr/Lucene Apache Open Source Components 19

In-Memory Big Data Processing - Apache Spark IBM is Committed to Open Source Open source technologies are the base for IBM software and solutions IBM s long history of deep open source commitment Apache Software Foundation: Founding member in 1999 Cloud Foundry: #1 contributor; Basis for Bluemix OpenStack: #4 contributor; Basis for IBM s IaaS Linux: #3 contributor; IBM first enterprise backer of Linux Hadoop/Spark: Extensive investment in open source contribution; Integration with Analytics software Application Systems 20 Infrastructure

In-Memory Big Data Processing - Apache Spark IBM BigInsights for Apache Hadoop for Apache Hadoop IBM BigInsights Analyst Big SQL BigSheets IBM BigInsights Data Scientist Text Analytics Machine Learning with Big R Big R Big SQL BigSheets IBM BigInsights Enterprise Management POSIX Distributed File System Multi-workload, Multi-tenant scheduling IBM Open Platform with Apache Hadoop 21

Link: http://www.sidap.de 22

Exkurs: SIDAP Echtzeit-Betriebsanalyse Betrieb auf Basis globaler Erfahrung Otto von Bismarck, Winston Churchill: Der Weise lernt von den Fehlern der Anderen, der Dumme aus den Eigenen 23 Device: Gerät, Apparat, Einrichtung, Verfahren, Einheit Quelle: Dr. T. Pötter, Dr. M. Steffen (beide Bayer Technology Services)

Exkurs: SIDAP Echtzeit-Betriebsanalyse Betrieb auf Basis globaler Erfahrung 24 Quelle: Dr. T. Pötter, Dr. M. Steffen (beide Bayer Technology Services)

Exkurs: SIDAP Echtzeit-Betriebsanalyse Betrieb auf Basis globaler Erfahrung 25 Quelle: Dr. T. Pötter, Dr. M. Steffen (beide Bayer Technology Services)

Exkurs: SIDAP Echtzeit-Betriebsanalyse Betrieb auf Basis globaler Erfahrung Die Qualität der Vorhersagen von Fehlern wächst mit der Anzahl der Einträge in die Datenbasis. 26 Quelle: Dr. T. Pötter, Dr. M. Steffen (beide Bayer Technology Services)

Exkurs: SIDAP Datenaggregation im Rahmen von SIDAP Quelle: Dr. T. Pötter, Dr. M. Steffen (beide Bayer Technology Services) 27

Exkurs: SIDAP Use-Cases 28

Exkurs: SIDAP Use-Cases und Aufgabenbereiche der Partner 29

Raw Logs and Machine Data IBM Smarter Analytics für Big Data Exkurs: SIDAP Echtzeit-Betriebsanalyse (Operations Analysis) Real-time Monitoring InfoSphere Streams Capture Data Stream SPSS Modeler Identify Anomaly Decision Management Historical Reporting and Analysis InfoSphere BigInsights Raw Data SPSS Modeler Predict and Classify Aggregate Results Dashboard/BI SPSS Modeler Predict and Score Data Warehouse Store Results Federated Navigation and Discovery 30

Exkurs: Internet of Things IBM s Internet of Things Foundation Secure Device Registration Scalable Device Connectivity Historian Visual wiring PAYG SaaS pricing Powered by IBM MessageSight technology Manage Assemble Collect Connect 31

Exkurs: Internet of Things IoT becomes a Composable Business IoT end-end solutions Connected appliance solutions, Smarter home solutions App tips open community IoT-related Bluemix services Rules, Push, Geo location, Analytics, Asset management, Predictive Maintenance IoT Foundation Secure Device Registration, Scalable Device Connectivity, Historian, Visual wiring Device recipe open community 32 Devices & Gateways

Exkurs: Internet of Things IBM Bluemix How to build a smarter App Der Link zum Video findet sich im Anhang dieser Präsentation. 33

Exkurs: Internet of Things IBM Internet of Things Foundation 34

Exkurs: Internet of Things IBM Internet of Things Foundation Status Quo 35

Exkurs: Internet of Things IBM Internet of Things Foundation Status Quo 36

Exkurs: Internet of Things IBM Internet of Things Foundation Status Quo 37

Exkurs: Internet of Things IBM Internet of Things Foundation Status Quo 38

Exkurs: Internet of Things IBM Internet of Things Foundation Status Quo 39

Exkurs: Internet of Things Device Recipes make it faster Connect 40

Exkurs: Internet of Things Application Connection Connect Publish the same data to many applications with MQTT Visualisation on app or Brower based interface App on Bluemix Access control via Application Registration & Secure Token Compose with other IoT Services in Bluemix e.g. HD Insights, Notification services, Twitter analytics 41

IBM Smarter Analytics für Big Data Exkurs: Internet of Things Beispiel 42

IBM Smarter Analytics für Big Data Exkurs: Internet of Things Beispiel 43

Empfehlung Ia IBM Bluemix Link: http://www.ibm.com/bluemix 44

Empfehlung Ib IBM Internet of Things Zone auf Bluemix IBM Internet of Things Foundation IoT Zone in Bluemix ibm.biz/try_iot oder https://bluemix.net/solutions/iot 45

Empfehlung II IBM Watson Analytics Link: http://www.ibm.co m/analytics/watson -analytics/ 46

Empfehlung III Big Data University Link: http://bigdatauniversity.co m/ 47

Dipl. Wirtsch.-Ing. Axel J. Schwarz Mobik: +49 171 5619419 E-Mail: axel.j.schwarz@de.ibm.com Software Client Architect Vielen Dank! 48

Quellen und Referenzen Literatur/Blog-Beiträge Videos Links Ryan Baxter: Bluemix and the Internet of Things http://ryanjbaxter.com/2014/07/16/bluemix-and-the-internet-of-things/ Bernard Marr: Spark Or Hadoop -- Which Is The Best Big Data Framework? http://www.forbes.com/sites/bernardmarr/2015/06/22/spark-or-hadoop-which-is-the-best-big-data-framework/2/ Paul Miller - IBM Backs Apache Spark For Big Data Analytics http://www.forbes.com/sites/paulmiller/2015/06/15/ibm-backs-apache-spark-for-big-data-analytics/ IBM IBM Big Data: How it works - https://www.youtube.com/watch?v=u5jwc89xbzi What s the Scoop with Hadoop? - https://www.youtube.com/watch?v=bcgqijxcdo8 How to build a smarter app - https://www.youtube.com/watch?v=e5tknbwl2co Learn how to create a mobile app quickly using IBM BlueMix! - https://www.youtube.com/watch?v=dbxn2mz90b0 IBM Emerging Technology - Node-RED: http://nodered.org IBM developerworks - IoT: https://www.ibmdw.net/iot/ IBM developerworks - Build a cloud-ready temperature sensor with the Arduino Uno and the IBM IoT Foundation: http://www.ibm.com/developerworks/cloud/library/cl-bluemix-arduino-iot1/index.html Bluemi Bildquellen IBM sowie lizenzfreie digitale Bilder der Medien-Bibliotheken: 123RF (http://www.123rf.com) und Stock.XCHNG (http:///www.sxc.hu). 49