BIG DATA : PAST, PRESENT AND FUTURE - AN ANALYST S PERSPECTIVE

Similar documents
Big Data Strategies with IMS

Get Ready for Big Data with IBM System z

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

IBM Big Data Platform

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

Industry Impact of Big Data in the Cloud: An IBM Perspective

How the oil and gas industry can gain value from Big Data?

Deploying Big Data to the Cloud: Roadmap for Success

Poslovni slučajevi upotrebe IBM Netezze

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

How to Leverage Big Data in the Cloud to Gain Competitive Advantage

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Luncheon Webinar Series May 13, 2013

Johan Hallberg Research Manager / Industry Analyst IDC Nordic Services & Sourcing Digital Transformation Global CIO Agenda

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Why Big Data in the Cloud?

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Building Confidence in Big Data Innovations in Information Integration & Governance for Big Data

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

HDP Hadoop From concept to deployment.

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

IDC MaturityScape Benchmark: Big Data and Analytics in Government

IBM Data Warehousing and Analytics Portfolio Summary

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

The Future of Data Management

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

In-memory computing with SAP HANA

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

An Oracle White Paper October Oracle: Big Data for the Enterprise

III JORNADAS DE DATA MINING

Big Data and Your Data Warehouse Philip Russom

Why DBMSs Matter More than Ever in the Big Data Era

Welcome to The Future of Analytics In Action

IBM Software Information Management Creating an Integrated, Optimized, and Secure Enterprise Data Platform:

IDC MaturityScape Benchmark: Big Data and Analytics in Government. Adelaide O Brien Research Director IDC Government Insights June 20, 2014

A Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Microsoft Big Data. Solution Brief

Focus on the business, not the business of data warehousing!

Real World Use of BIG DATA. Tim Brown Information Management Technical Pre-Sales Aruna Kolluru Information Management Technical Pre-Sales 04/2013

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

Introducing Oracle Exalytics In-Memory Machine

How To Make Data Streaming A Real Time Intelligence

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

Big Data and Trusted Information

An Oracle White Paper June Oracle: Big Data for the Enterprise

Safe Harbor Statement

Next-Generation Cloud Analytics with Amazon Redshift

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Your Data, Any Place, Any Time.

Making Sense of Big Data in Insurance

Oracle Big Data Management System

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

Getting Started Practical Input For Your Roadmap

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look

Big Data and the new trends for BI and Analytics Juha Teljo Business Intelligence and Predictive Solutions Executive IBM Europe

IBM Software Hadoop in the cloud

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Session 1: IT Infrastructure Security Vertica / Hadoop Integration and Analytic Capabilities for Federal Big Data Challenges

Emerging Requirements and DBMS Technologies:

Hadoop in the Hybrid Cloud

Self-Service Big Data Analytics for Line of Business

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

IBM BigInsights for Apache Hadoop

Extend your analytic capabilities with SAP Predictive Analysis

BIG DATA TRENDS AND TECHNOLOGIES

Big Data Analytics Nokia

Emerging Geospatial Trends The Convergence of Technologies. Jim Steiner Vice President, Product Management

Welcome to The Future of Analytics In Action IBM Corporation

Evolving Solutions Disruptive Technology Series Modern Data Warehouse

Using OBIEE for Location-Aware Predictive Analytics

Il mondo dei DB Cambia : Tecnologie e opportunita`

Actian SQL in Hadoop Buyer s Guide

Tap into Hadoop and Other No SQL Sources

The Enterprise Data Hub and The Modern Information Architecture

HDP Enabling the Modern Data Architecture

The Evolution of Business Analytics and Big Data Integration on zenterprise

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Big Data overview. Livio Ventura. SICS Software week, Sept Cloud and Big Data Day

An Oracle White Paper September Oracle: Big Data for the Enterprise

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

NoSQL for SQL Professionals William McKnight

Smarter Analytics Leadership Summit Big Data. Real Solutions. Big Results.

Leveraging Machine Data to Deliver New Insights for Business Analytics

Oracle Big Data Strategy Simplified Infrastrcuture

Big Data Use Case Deep Dive 5 Game Changing Use Cases for Big Data

The 4 Pillars of Technosoft s Big Data Practice

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

Transcription:

BIG DATA : PAST, PRESENT AND FUTURE - AN ANALYST S PERSPECTIVE Carl Olofson : Research Vice President, IDC Mark Simmonds, IBM Enterprise Architect and Senior Product Marketing Manager, IBM Software Group January 21 2014 1

Webcast Agenda Big data in the enterprise Carl Olofson, IDC Making big data a reality - IBM System z Information Management solutions Summary and Call to Action Q&A 2

Big Data in the Enterprise Carl Olofson Research Vice President IDC

Agenda Market Demand for Big Data The Maturing of Big Data in the Enterprise Big Data Technologies and Their Uses Future Directions Conclusions and Recommendations IDC Visit us at IDC.com and follow us on Twitter: @IDC 4

Market Forces: The 3rd Platform IDC Visit us at IDC.com and follow us on Twitter: @IDC 5

The Demand for Broad-Based Analytics Q. Are the following types of data analyzed in your organization? Structured transactional data Customer interaction related text from non-social network sources Geographic or spatial data Text f rom content repositories Mobile device data Web clickstream Sensor data or machine-generated data Rich media Chatter or text f rom social networks 0 10 20 30 40 50 60 70 80 90 (% of respondents) n = 330 Source: IDC and Computerworld Business Analytics and Big Data Survey, June 2013 IDC Visit us at IDC.com and follow us on Twitter: @IDC 6

What Does Big Data Mean To You? CPG company is conducting experiments in digital marketing with online retail and social media partners. Telecommunications company is improving predictability of customer churn Manufacturer of wind turbines uses weather data and big data technology to identify optimal locations Mean response Retailer is scoring customers based on their on- and off-line behavior Revenue growth Cost control or reduction 3.81 3.80 Credit card company is improving fraud detection capabilities Risk reduction Governance or regulatory requirements 3.50 3.67 How important are each of the following outcomes as drivers of business analytics initiatives at your organization? IDC and Computerworld 2013 Business Analytics Survey IDC Visit us at IDC.com and follow us on Twitter: @IDC 7

IDC Big Data & Analytics Maturity Model Business vision, strategic intent, operational metrics, sponsorship, project and program justification Optimized Managed Repeatable Quality, relevance, optimized availability, trustworthiness, governance, security, and accessibility Technology and analytic skills, intra- and intergroup collaboration, organizatio nal structures and cultural readiness, talent sourcing Opportunistic Ad Hoc Monitoring, collection, consol idation, integration, analysis, i nformation dissemination & consumption, and decision making, visualization, reporti ng Appropriateness, applicability, and performance of technology and IT architecture to the relevant workloads, integration strategy IDC Visit us at IDC.com and follow us on Twitter: @IDC 8

Big Data: The Second Wave Hadoop will share the Big Data space with other technologies. Some conventional scalable data warehouse (RDBMS) platforms may also have to yield to these newer technologies. A whole new workload, based on pattern and relationship analysis will be served Hadoop is especially useful for large scale ingestion of unorganized data, and for bulk aggregation and other deep analytics. Other technologies will address very demanding analytic workloads not well served by Hadoop or RDBMS. NewSQL for Complex Transactions (traffic management example) Hadoop for Data Ingestion and Bulk Processing Graph Database IDC Visit us at IDC.com and follow us on Twitter: @IDC 9

Use Case Classes Transactional Operations: (RDBMS, Document DBMS, Simple K-V DBMS) Structured Analytics: (RDBMS, NewSQL RDBMS, Graph DBMS, Hadoop) Mixed Analytics: (Document DBMS, NewSQL DBMS, Hadoop) Text Analytics (Document DBMS, Content DBMS) IDC Visit us at IDC.com and follow us on Twitter: @IDC 10

Extreme Transaction Processing in the Real Time Enterprise Hadoop Real Time BI Streaming Data CEP Engine Immediate Query IMDB Dynamic, S calable OLTP DB Business Action Enterprise Analytics Data Warehouse Dynamic Data Movement Extreme Transaction Processing IDC Visit us at IDC.com and follow us on Twitter: @IDC 11

Important Technology Advances RDBMS, Including NewSQL Memory-optimized technology Columnar compression that actually improves performance Vector processing Document-Based DBMS Improved compression and IMDB support Improved consistency and recoverability Schema elements that blur the lines with NewSQL All DBMS technology (hardware) Advances in memory addressability Non-volatile memory (NVM) IDC Visit us at IDC.com and follow us on Twitter: @IDC 12

Key Challenges System Level Management Existing technologies are disjoint. No overall control is possible. Granular Data Security Sensitive data can be accessed easily. Protection of data at the field level is difficult. Integrated Systems and Functions Various Hadoop, NoSQL, Graph DB, etc., technologies are not designed to work together or with conventional IT resources. Holistic, Cloud-Based System Architecture An economical, manageable approach to Big Data requires virtualization, flexibility, and elastic scalability. The ideal is a core platform that exudes these qualities. IDC Visit us at IDC.com and follow us on Twitter: @IDC 13

Conclusions / Recommendations Conclusions We are still in the early days of Big Data deployment. Technologies continue to evolve. Applications and tools will make these technologies more nearly address real-world workloads with manageability, in time. Recommendations Learn about all the emerging technologies and determine their applicability to various problem spaces. Assess your organization s needs for both deep and real-time analytics, and for large scale data management for extreme transaction processing. Develop an architectural vision, and a strategy for getting from where you are to where you want to be. Identify a key vendor as your partner in this journey who has the right vision, and can support your objectives. IDC Visit us at IDC.com and follow us on Twitter: @IDC 14

Webcast Agenda Big data in the enterprise Carl Olofson, IDC Making big data a reality - IBM System z Information Management solutions Summary and Call to Action Q&A 15

Big Data Helping to Create a Smarter Planet ZBs of transaction data Business models under constant pressure Cloud Customers are more demanding and connected Mobile Social Great relationships trump great products Internet of Things 16 16

Majority of today s analytics based on relational / structured data Analytics and decision engines reside where the DWH / transaction data is Noise Noise (veracity) surrounds the core business data Data Warehouse Integration - Social Media, emails, docs, telemetry, voice, video, content What data are you prepared to TRUST? Business Analytics DB2 for z/os IMS Information Governance Where do you put your trusted Data? Circle of trust 17

Demand for differently structured data to be seamlessly integrated, to augment analytics / decisions Analytics and decision engines reside where the DWH / transaction data is Noise (veractity) surrounds the core business data Social Media, emails, docs, telemetr y, voice, video, content Expanding our insights getting closer to the truth Data Warehouse Integration Business Analytics DB2 for z/os IMS Information Governance Lower risk and cost Increased profitability Circle of trust widens 18

zenterprise and Big Data A significant data source for today s business critical analytics Data that originates and/or resides on zenterprise 2/3 of business transactions for U.S. retail banks 80% of world s corporate data Businesses that run on zenterprise 66 of the top 66 worldwide banks 24 of the top 25 U.S. retailers 10 of the top 10 global life/ health insurance providers The downtime of an application running on zenterprise = apprx 5 minutes per yr 1,300+ ISVs run zenterprise today More than 275 of these selling over 800 applications on Linux 19

Forward Thinking Organizations are creating value from Big Data The power of Data coming together to deliver Improved Business Outcomes 1. Enrich your information base VolumeVelocity VarietyVeracity with Big Data Exploration 2. Improve customer interaction with Enhanced 360º View of the Customer with the power of Technology 3. Optimize operations with Operations Analysis 4. Gain IT efficiency and scale with Data Warehouse Augmentation 5. Prevent crime with Security and Intelligence Extension 20

Big Data & Analytics on IBM zenterprise Bringing it all together > Driving greater insight IBM Big Data Platform Stream * Computing Hadoop Data Systems Warehouse Business Intelligence, Predictiv e Analytics Other Big Data Sources Data Warehousing A Single Solution to: Enterprise Data Sources * Business System / OLTP 21 ACCESS, COMBINE & MANAGE a relevant mix of information TIMELY ACCESS for more accurate answers

The Hybrid Vision Delivering business critical analytics Combined Workloads Transactional Processing, Traditional Analytics & Business Critical Analytics Hybrid DB Reduced Latency. Greater Security. Improved Data Governance. Reduced Complexity. High volume business transactions and batch reporting running concurrently with complex queries 22

Core data management solutions for the 21st Century DB2 11 for z/os IMS 13 Delivering the highest levels of performance, availability, security, and scalability in the industry Unmatched availability, reliability, and security for business critical information Up to 40% CPU reductions and performance improvements for (OLTP), batch, and business analytics Breaking through 100k TPS 800% greater than IMS 12 CPU reductions up to 62% for Java Apps Improved data sharing performance and efficiency Integration with InfoSphere BigInsights / Hadoop and nosql support SQL access to IMS data from both.net and COBOL applications Greater flexibility and faster deployment for new applications with database versioning Improved utility performance and additional ziip eligible workload 23

Enhancing Big Data Analytics with IMS and DB2 for z/os Much of the world s operational data resides on z/os Unstructured data sources are growing fast 1. Merge this data with trusted OLTP data from zenterprise data sources 2. Integrate this data so that insights from Big Data sources can drive business actions IMS & DB2 - connectors & DB capability to allow BigInsights to easily & efficiently access each data source DB2 - connectors & DB capability to allow DB2 apps to easily and efficiently access Hadoop data sources Integrate the big data nuggets Integrate IBM BigInsights Merge Relational projection of IMS model 24 Invoke a query against Hadoop data

DB2 11 Support for Big Data Goal: integrate DB2 for z/os with IBM Hadoop based BigInsights Bigdata platform Enabling traditional applications on DB2 z/os to access Big Data analytics. Analytic jobs can be specified using JSON Query Language (Jaql) Submitted to BigInsights Results stored in Hadoop Distributed File System (HDFS). A table UDF (HDFS_READ) reads the Bigdata analytic result from HDFS, for subsequent use in an SQL query. Must have a variable shape of HDFS_READ output table DB2 11 supports generic table UDF, enabling this function 25

IBM PureData System for Hadoop Accelerate Hadoop analytics with appliance simplicity Accelerate Big Data projects with built-in expertise Explore new ways to use all data Unlock new insights from unstructured data Establish a cost efficient on-line data archive Simplify with integrated system management InfoSphere BigInsights software Compute and Storage hardware Ensure production grade security and governance Easily integrate with other systems in the IBM big data platform CIO:Insight Apr 29 2013 Issues surrounding how long it takes to get a Hadoop application into production coupled with a lack of real-time capabilities are proving to be important barriers to deployment. As a result, the respondents are reporting that both the number of Hadoop applications and the size of the overall Hadoop environment remain relatively small. 26

Approaches 1 2 Processing done outside z (Extract and move data) Processing in Appliance (z remains the master) Move data over network Log files in z/os Additional infrastructure. Challenges with scale, governance, inge stion. Processing done on System z (MR cluster on zlinux) DB2 MR jobs, result Log files in z/os $$$$ 27 3 Log files in z/os z/os z/vm $$ $$$ Appliance approach with PureData System for Hadoop. High speed load. z is the control point. Provision new node quickly Near linear scale. High speed load. z is the control point.

TECHNOLOGY Deliver dramatically faster analysis IBM DB2 Analytics Accelerator 4.1 A high performance appliance that integrates Netezza technology with zenterprise technology, to deliver dramatically faster business analysis Speeds up complex queries up to 2000x Lowers the cost of long term storage Minimizes data latency Improves security and reduces risk Complements existing investments 28

Big Data Innovation with IBM zenteprise Banco do Brasil purchases the largest ever DB2 Analytics Accelerator solution to drive customer insight from operational data. The 120-way system can hold 1.28 Petabytes of data. Queries that previously took 11 hours to run now complete in 26 seconds, over 1500 times faster! Banca Carige chooses System z to provide real time analytics as part of their Big Data client solution GPS & sensor information volumes exceeded the capabilities of the existing system. It was redesigned as an enterprise mission critical application using DB2 for z/os and System z data sharing to now provide the availability and scalability to meet the current and future requirements for this solution. 29

Webcast Agenda Big data in the enterprise Carl Olofson, IDC Making big data a reality - IBM System z Information Management solutions Summary and Call to Action Q&A 30

IBM understands your big data requirements IBM understands all kinds of data Game-Changing Innovation such as Watson, IBM DB2 Accelerator for z/os, streaming analytics, Hadoop implementations, expert integrated systems; 20 years of patent leadership, best of breed databases / data warehouses DB2 for z/os, IMS. Business-Ready Capabilities big data and analytics capabilities, integrated and hardened for serious use, with mission and business critical qualities of service unmatched in the industry IBM knows how to turn data into value Client Expertise deep industry know-how and solutions with global reach Strong Ecosystem growing investment with 360+ business partners & 100+ universities Build on Current Investments enhance existing analytics and information infrastructure with unparalleled breadth and depth of new capabilities IBM continued investments in big data and analytics processing $16B+ in Acquisitions coupled with game-changing innovations IBM zenterprise - The platform with the highest levels of integrity, qualities of service and governance for your business critical applications and processes. Analytics Solution Centers visited by 4000+ organizations accessing global expertise 31

Take Action Now! Next steps: For additional information including whitepapers and demos, please visit: IBM big data and business analytics www.ibm.com/bigdata/z Education Free online education at bigdatauniversity.com 70,000+ registered students Community Join Big data and World of DB2 for z/os on LinkedIn Further developments: Watch for future webcast and announcements Wanting to experiment on a big data integration project but not sure where to start? Partner with IBM Silicon Valley and other laboratories nearest you.. 32

33