Big Data and Trusted Information

Transcription:

Dr. Oliver Adamczak
Big Data and Trusted Information
CAS Single Point of Truth, May 7, 2012

The Hype
Big Data: The next frontier for innovation, competition and productivity (McKinsey Global Institute)
2012 will be the year of 'big data' (BBC, Nov 30, 2011)
Searches for "big data" on Gartner's website increased 981% between March 2011 and October 2011. Most enterprise data warehouse (EDW) and BI teams currently lack a clear understanding of big data technologies. They are increasingly asking the question, "How can we use big data to deliver new insights?" (Gartner, 2012)
"Big Data - We are at a huge inflection point and this opportunity comes only once. We are declaring that IBM is the #1 leader in providing a Big Data platform." (Alyse Passarelli, WW VP IM Sales, Jan 10, 2012)

V3 Big Data Platform
Variety: Analyze telemetry, fuel consumption, schedule and weather patterns to optimize shipping logistics.
Velocity: Analyze 100k records/second to address customer satisfaction in real time.
Volume: Optimize capital investments based on 6 petabytes of information.

IBM's Big Data Platform Vision: Bringing Big Data to the Enterprise
Solutions: IBM Big Data Solutions; Client and Partner Solutions
Big Data User Environments: Developers; End Users; Administrators
Big Data Enterprise Engines: Streaming Analytics; Internet Scale Analytics
Open Source Foundational Components: Hadoop, HBase, Pig, Lucene, Jaql, Linux, Eclipse, UIMA, OpenCV
Agents
Integration: Information Server
Connected enterprise products:
Data Warehouse: InfoSphere Warehouse
Warehouse Appliances: Netezza
Master Data Mgmt: InfoSphere MDM
Database: DB2
Content Analytics: ECM
Business Analytics: Cognos & SPSS
Marketing: Unica
Data Growth Management: InfoSphere Optim

Forrester Research Study 2012
Requirements for Big Data:
Data volume: 75%
Analysis-driven requirements: 58%
Data diversity: 52%
Data sources for Big Data:
Existing transactional data: 75%
Sensor / device data: 58%
Social media: 52%

Big Data is a key growth adjacency for the data warehouse
Data Warehouse CGR 2010-15: 8.5%
Big Data CGR 2010-15: 13.8%
DW Appliance CGR 2010-15: 13.7%
Source: GMV 1H2012 2H2011 and IBM MI estimates
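As a rough illustration of what these growth rates imply, the sketch below compounds each rate over the five years from 2010 to 2015. It assumes that "CGR" denotes a compound annual growth rate applied once per year; the function name `growth_multiplier` and the five-step horizon are illustrative choices, not taken from the slides.

```python
def growth_multiplier(annual_rate: float, years: int) -> float:
    """Return the factor by which a market grows at a constant annual rate."""
    return (1.0 + annual_rate) ** years


# Rates as stated on the slide for 2010-15.
# Assumption: each rate compounds annually over five years.
rates = {
    "Data Warehouse": 0.085,
    "Big Data": 0.138,
    "DW Appliance": 0.137,
}

for segment, rate in rates.items():
    multiplier = growth_multiplier(rate, 5)
    print(f"{segment}: x{multiplier:.2f} over five years")
```

Under that reading, the Big Data and DW Appliance segments come close to doubling over the period, while the overall data warehouse segment grows by roughly half.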

Merging the Traditional and Big Data Approaches
Traditional Approach: Structured & Repeatable Analysis
Business users determine what question to ask. IT structures the data to answer that question.
Examples: Monthly sales reports; Profitability analysis; Customer surveys
Big Data Approach: Iterative & Exploratory Analysis
IT delivers a platform to enable creative discovery. Business explores what questions could be asked.
Examples: Brand sentiment; Product strategy; Maximum asset utilization

Vestas optimizes capital investments based on 2.5 petabytes of information.
Model the weather to optimize placement of turbines, maximizing power generation and longevity.
Reduce time required to identify placement of a turbine from weeks to hours.
Incorporate 2.5 PB of structured and semi-structured information flows.
Data volume expected to grow to 6 PB.

InfoSphere Streams Delivers Real Time Analytic Processing
A Platform to Run In-Motion Analytics on Big Data
Volume: Terabytes per second; petabytes per day
Variety: All kinds of data; all kinds of analytics
Velocity: Insights in microseconds
Characteristics: Real time delivery; powerful analytics; millions of events per second; microsecond latency; traditional / non-traditional data sources
Use cases: ICU monitoring; environment monitoring; algo trading; telco churn prediction; cyber security; government / law enforcement; smart grid
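To make "in-motion analytics" concrete, here is a minimal sketch of the general idea: each event is analyzed as it arrives, using only a small bounded window of recent values rather than a stored data set. This is plain Python and does not use or represent the InfoSphere Streams API; the function name `rolling_alerts`, the window size, the threshold, and the sample readings are all hypothetical.

```python
from collections import deque
from typing import Iterable, Iterator, Tuple


def rolling_alerts(
    readings: Iterable[float], window: int, threshold: float
) -> Iterator[Tuple[int, float]]:
    """Yield (index, rolling_mean) whenever the mean of the last
    `window` readings exceeds `threshold`."""
    recent = deque(maxlen=window)  # keeps only the newest `window` values
    total = 0.0
    for index, value in enumerate(readings):
        if len(recent) == window:
            total -= recent[0]  # subtract the value about to be evicted
        recent.append(value)
        total += value
        if len(recent) == window:
            mean = total / window
            if mean > threshold:
                yield index, mean


# Hypothetical monitoring feed, e.g. a vital sign sampled at regular intervals.
feed = [70, 72, 71, 120, 130, 125]
for index, mean in rolling_alerts(feed, window=3, threshold=100.0):
    print(f"alert at reading {index}: rolling mean {mean:.1f}")
```

Because the generator holds at most `window` values in memory, the same logic applies whether the feed is a short list or an unbounded stream.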

Enterprise Integration
Trusted Information & Governance: Companies need to govern what comes in, and the insights that come out.
Data Management: Insights from Big Data must be incorporated into the warehouse.
Diagram labels: Data Warehouse; Big Data Platform; Enterprise Integration; Traditional Sources; New Sources

One Example: The 360 Multi-Channel Customer Sentiment Analysis
Diagram labels:
Business Processes; Events and Alerts; Master Data Management; Campaign Management; Cognos Consumer Insight
Big Data Platform: Internet Scale Analytics; Streaming Analytics
Web Traffic and Social Media Insight: Website Logs; Social Media
Call Behavior and Experience Insight: Call Detail Reports (CDRs)
Information Integration; Data Warehouse

Big Data is an integral part of the Enterprise Data Platform
Control point for data starting from the instant it enters the enterprise.
High fidelity for all data without changing its original format.
Source data available for new uses, analyses, and integrations.
Diagram labels:
Cognos Applications; Big Data Applications; Operational Data Store; InfoSphere Warehouse; Cubing Services
IBM Big Data Solutions; Client and Partner Solutions; InfoSphere Information Server
Big Data Platform
Big Data User Environment: Developers; End Users; Administrators
Big Data Enterprise Engine: Operators; Applications; Languages; Orchestration; Prioritization; Quality of Service; Optimizations; Storage and Indexing
Traditional data sources (ERP, CRM, databases, etc.)
Source data from every source (Web, sensor, data, network, social, RFID, media)

Trusted Information Delivery Architecture
Diagram labels: Source Systems; Transformation & Harmonisation; Target Systems; Reports; Staging & Error Tables; Information Analyzer; Common Metadata Repository; Business Terms; Specifications; Development Infrastructure; Reports; DQ Dashboard

Information Server Hadoop Integration
Exchange of information with big data sources:
Move enterprise information into big data sources so it can be included in analytics.
Take analytical results of Hadoop and apply them in other IT solutions.
Parallelism and scale:
Support for HDFS provides massive scalability via the Information Server parallel engine.
Lineage of jobs with BigInsights source/target steps:
Using the extensibility feature in Information Server.
Business Value: Fueling and helping organizations leverage big data analysis across the enterprise.

Information Server - Netezza Integration
Netezza Next Generation Connector (with migration tool to replace the current Netezza Enterprise stage):
Scalable, high-performance data exchange for DataStage, QualityStage and Info Analyzer.
Shared metadata across Information Server.
Enhanced lookups, statistics, other functions.
Balanced Optimization for Netezza:
Execute either traditional ETL on the Information Server engine or push parts or all of the processing into the Netezza appliance.
Maximizes performance where data is already in Netezza.
CDC and CDD for Netezza:
Enable captured changes to be applied directly to Netezza (available today via User Exit from services; productization planned for the next major release).
Target: Netezza Data Warehouse Appliance
Business Value: Improves performance and accelerates time to value for organizations using InfoSphere Information Server with an IBM Netezza appliance.

Conclusions
Big Data enhances the BI portfolio:
Larger data volumes (petabytes compared to terabytes).
Access to new sources (Internet, unstructured, sensor data).
Real time analysis of data streams.
Explorative analytics.
Businesses already get competitive advantages out of Big Data.
However, BI maturity in most companies is low to medium:
Cross domain analysis; predictive analysis; real-time DWH; analytical process support.
DWH with Trusted Information remains the base for enterprise analytics.
Integration tools and DWH have adapted to the new technologies.
(Inset figures repeat the comparison of the traditional and Big Data approaches and the multi-channel customer sentiment architecture.)