3 Predictive Analytics and Big Data
SAPSA Vårimpuls 2014
Niklas Packendorff, Solution Advisor, SAP HANA & Analytics
4 Disclaimer
This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and SAP's strategy and possible future developments are subject to change and may be changed by SAP at any time for any reason without notice. This document is provided without a warranty of any kind, either express or implied, including but not limited to the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or through gross negligence.
SAP AG. All rights reserved.
5 What is Big Data?
6 What is Big Data? The 3 + 1 Vs
Volume: explosion in the amount of data
Velocity: fast collection, processing and consumption in real time
Variety: multiple data formats; boom in non-structured data
Value: use the data to gain insight and business value
[Figure: word cloud of Big Data sources, including mobile, CRM data, opportunities, transactions, customer, voice and video, instant messages, Internet of Things, inventory, demand, location, planning and sales orders]
7 BIG DATA DIVERTS DRIVERS BEFORE FATAL ACCIDENTS HAPPEN
8 Big Data Challenges
Staffing and skills
Data governance
Unsure of the technology requirements
Cost
Unsure of the value of Big Data
Connect people to information in the moment they need it and in the right experience (bridging the gap)
9 Big Data success demands full coverage
Deep: answer complex questions on granular data; predict the best next action
Broad: massive data scale; many data types
Accessible: on any device or to any user; self-service and intuitive interactions
Real Time: real-time streams of data; ask a question, get an immediate answer
Simple: no data preparation; no pre-aggregates; no tuning
10 SAP Big Data Platform
[Architecture diagram; stage labels: Acquire, Analyze, Act, Visualize and Act]
SAP Big Data Apps & Analytics: Industry/LOB Applications, Analytic Applications, Custom Applications
Analysis capabilities: SQL, R, Predictive, Text Analysis, Spatial, Application server
Data stores: Sybase IQ, SAP HANA, Hadoop
Acquisition: Information Management, Stream Processing, Real-time Replication
Data types: Unstructured Data, Semi-structured Data, Structured Data
Services: Big Data Science Services
11 Big Data provisioning with SAP
[Diagram: provisioning paths into SAP HANA]
Sources: SAP Business Suite, SAP BW, SAP Sybase IQ/ASE, non-SAP data sources, SAP IQ data sources, trading and order management systems, wired/wireless network devices, your own applications
Trigger-based, real time: SAP LT Replication Server (DB connection)
Data virtualization: SAP HANA smart data access
ETL, batch: SAP Data Services
Log-based: SAP Sybase Replication Server
Event streams: SAP Sybase Event Stream Processor
Data synchronization: SAP Sybase SQL Anywhere
Connectivity labels: ODBC, ECH, ODBC/JDBC/OData
13 SAP Big Data Platform
Direct SAP HANA-Hadoop connectivity: virtual table (SAP HANA smart data access); a virtual HANA table federates a Hive table at query time
Query federation with SAP Sybase IQ
Integration at the ETL layer: Data Services provides bi-directional SAP-Hadoop connectivity (Hive, HDFS), pushes entity extraction down to Hadoop as MapReduce jobs, and loads data into SAP Sybase IQ via ETL
[Diagram labels: SAP Sybase IQ query federation; load results into SAP HANA; SAP Sybase IQ (Data Services); SAP HANA smart data access (data virtualization); loading data for pre-processing; log files; unstructured data]
© 2013 SAP AG. All rights reserved.
16 Reinventing the (Big) Data platform with SAP In-Memory Data Fabric
17 SAP HANA: Text Analysis for Big Data
File Filtering: unlock text from binary documents
  Extract and process unstructured text data from various file formats (txt, html, xml, pdf, doc, ppt, xls, rtf, msg)
  Load binary, flat, and other documents directly into HANA for native text search and analysis
Native Text Analysis: give structure to unstructured textual content
  Expose linguistic markup for text mining uses
  Classify entities (people, companies, things, etc.)
  Identify domain facts (sentiments, topics, requests, etc.)
  Supports up to 31 languages for linguistic markup and the extraction dictionary, and 11 languages for predefined core extractions
[Diagram labels: SAP HANA Text & Sentiment Analysis; Analyze, Search, Predict]
19 What if you could turn new signals into business value?
Brand Sentiment
Predictive Maintenance
Network Optimization
Insider Threats
Asset Tracking
Personalized Care
Product Recommendation
Risk Mitigation, Real-time
Propensity to Churn
Real-time Demand/Supply Forecast
360° Customer View
Fraud Detection
20 What if you could fight fraudulent activities?
21 What if you could predict your customers' behaviour?
22 What if you could identify completely new business opportunities in your own organization?
23 Big Data Applications
Make Big Data insights actionable via industry-specific, business-focused applications from SAP and companies in the SAP Startup Focus program.
Audience Discovery (CEI)
Customer Value Intelligence (CEI)
Account Intelligence (CEI)
Fraud Management
Sentiment Intelligence (RDS)
Manufacturing (Operational Intelligence)
Social Contact Intelligence (CEI)
Demand Signal Management
Manufacturing (Responsive Manufacturing)
24 Agile Visualization & Analysis
Intuitively explore and present data to reveal new insights at a glance
VISUALIZE
Discover hidden patterns in information
Increase understanding and use of data for better decisions
Analytics reach the fringes of the organization
25 Agile Visualization & Analysis
Intuitively explore and present data to reveal new insights at a glance
1. Quickly source, cleanse, transform and join data from corporate and personal sources. Merge data from different data sets based on common values.
2. Explore and analyze your data using beautiful, interactive visualizations and advanced analytical tools.
3. Share curated data and insight with the whole team, also directly to mobile devices.
DEMO
26 SAP Predictive Analytics Vision
Reduce decision latency with advanced analytics
Operationalize predictive models across the enterprise
Bringing predictive analytics to a broad spectrum of users
Integrated platform, open and flexible
In-database, in-memory performance with SAP and 3rd-party databases
Harnessing the power of big data with real-time analysis
Helping customers leverage existing investment in data infrastructure
Embedding predictive into business processes
Reducing time to deploy predictive models
Services bring advanced analytics to business, obviating the need for customers to have in-house data scientists
Common tools and experience for all users
User-friendly UI and advanced visualizations
Self-service, enterprise-ready
Seamless integration with 3rd-party and open-source systems (Teradata, SAS, Netezza, Oracle, DB2, Hadoop, SQL Server, HANA, R)
Cloud or on-premise
27 SAP Predictive Analytics Platform
Transforming the Future with Insight Today
Data sources: Hadoop/Sybase IQ, Sybase ASE, Teradata; spatial, machine and real-time data; unstructured data
SAP HANA: main memory, virtual tables, text analysis, spatial data, SQLScript, optimized query plan, PAL, R scripts, R engine
Algorithms shown: KNN classification, K-means, ABC classification, weighted score tables, regression, C4.5 decision tree, association analysis (market basket)
Front ends: HANA Studio/AFM, apps and tools
Accelerate predictive analysis and scoring with in-database algorithms delivered out of the box. Adapt the models frequently.
Execute R commands as part of the overall query plan by transferring intermediate DB tables directly to R as vector-oriented data structures.
Predictive analytics across multiple data types and sources (e.g. unstructured text, geospatial, Hadoop).
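To make one of the listed algorithms concrete, here is a minimal sketch of ABC classification in plain Python. It is an illustration only, not PAL's implementation or API: the function name `abc_classify`, the 80%/15% cut-offs and the rule of classifying by the cumulative share reached before each item are all assumptions chosen for this example.

```python
from typing import Dict, List, Tuple


def abc_classify(
    values: Dict[str, float],
    a_share: float = 0.80,  # assumed cut-off for class A (not taken from the slides)
    b_share: float = 0.15,  # assumed additional share for class B
) -> Dict[str, str]:
    """Assign each item to class A, B or C by cumulative share of total value."""
    total = sum(values.values())
    if total <= 0:
        raise ValueError("total value must be positive")

    # Rank items from largest to smallest contribution.
    ranked: List[Tuple[str, float]] = sorted(
        values.items(), key=lambda kv: kv[1], reverse=True
    )

    classes: Dict[str, str] = {}
    cumulative = 0.0
    for item, value in ranked:
        # Classify by the cumulative share reached before adding this item,
        # so the largest item always lands in class A.
        share_before = cumulative / total
        if share_before < a_share:
            classes[item] = "A"
        elif share_before < a_share + b_share:
            classes[item] = "B"
        else:
            classes[item] = "C"
        cumulative += value
    return classes


if __name__ == "__main__":
    # Hypothetical revenue per product.
    revenue = {"P1": 500.0, "P2": 250.0, "P3": 150.0, "P4": 70.0, "P5": 30.0}
    print(abc_classify(revenue))
```

With the hypothetical revenue figures above, the three largest products fall into class A, the fourth into B and the smallest into C. An in-database version would run the same ranking and cumulative sum next to the data instead of moving it to the client.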
28 Traditional Approaches vs. SAP Predictive / InfiniteInsight
Productive
  Traditional: Manual data mining and predictive analytics process. Weeks or months spent in data prep, modeling and deployment tasks.
  SAP Predictive / InfiniteInsight: Predictive analytics process made efficient. Automated data prep, modeling and deployment tasks. Models in minutes or hours.
Big Data Made Easy
  Traditional: IT experts create large datasets. Statisticians pre-select variables, hence excluding potentially predictive information.
  SAP Predictive / InfiniteInsight: Scales for terabytes and petabytes of data. Rapid insight from 1,000s and 10,000s of variables with no expert intervention.
Fast & Accurate
  Traditional: Time-intensive, manual processes. Linear relationship between model accuracy and analyst time.
  SAP Predictive / InfiniteInsight: Automation cuts human time. Increased accuracy by including all potentially predictive variables and eliminating manual errors.
No PhD Required
  Traditional: Statistical library where the right algorithm must be selected, tested and fine-tuned.
  SAP Predictive / InfiniteInsight: Easy yet sophisticated. Model building and deployment in clicks.
Quick Win
  Traditional: Complex installation. Long and specialized training programs. Slow learning curve.
  SAP Predictive / InfiniteInsight: Quick installation. Short training. Leapfrog to best-in-class analytics.
Lower TCO
  Traditional: Expensive infrastructure. Typically requires teams of statisticians and data scientists. Costly and slow implementation.
  SAP Predictive / InfiniteInsight: Leverage existing infrastructure. No need for additional resources. Payback in weeks.
Corporate Knowledge
  Traditional: Hard to maintain, share and port model code or workflow diagrams. Knowledge limited to individuals.
  SAP Predictive / InfiniteInsight: Models incorporated into the business process. Knowledge shared and retained across the organization.
30 Big Data Services: Rapid Implementation & Deployment
Shortening the time to start, deploy and run through better re-use of best practices
Traditional project approach: Prep, Business blueprint, Realization, Testing, Go-live
RDS methodology: Start, Deploy, Run
At least 40% time and effort reduction compared to a traditional project of similar scope
31 SAP Data Scientists
Top (PhD) level global team with experience from working with 100+ customers and creating some of the most sophisticated use cases
Pioneers in demand science, with significant reusable IP and deep analytic competencies
Insights and compliance
Mathematical modeling, forecasting, simulation, and optimization
Experts in relevant SAP technology: HANA, SAP BusinessObjects, SAP Predictive Analysis, visualization
Flexible delivery
PAL and R integration
SAP HANA Platform and beyond
[Graphic: decorative formula labelled Data Science; product labels SAP Predictive Analysis and SAP BusinessObjects Dashboards]
32 SAP HANA delivers, in real time, the optimum predicted route to Tokyo's taxi drivers, based on real-time, geo-tagged mass traffic data from commodity smartphones.
"We are currently performing tests on large volumes of traffic data using SAP HANA. The product has enabled us to search through 360 million data records in just a little over one second. We had heard that SAP HANA enabled extremely strong performance, but to actually see and experience the speed and high performance firsthand is simply amazing. Now and in the future, speed is the key to adapting to an ever-changing business environment. The speed SAP HANA enables is sudden and significant, and has the potential to transform entire business models."
Akihiko Nakamura, corporate senior vice president, Services & Industrial Solution Division
© 2011 SAP AG. All rights reserved.
33 SAP transforms both Businesses and IT
Reduce waste & fraud in government funds: <2 min for detecting 100,000 names over 90M records
Sharpen marketing effectiveness: 56x faster reporting; micro-targeted customer offers
Identify cancer DNA variants for treatment: 216x faster results (3 days to 20 minutes)
Improve diagnostics through pattern detection: 300M records; analysis in 2-10 seconds
Predict customer purchase sentiment: seasonality analysis in 5 seconds
Improve labor utilization: 1131x faster reporting time
Perfect order experience: 60x faster real-time insights
Accelerate monthly close & spending insight: 75% reduction in CRM query time (~23 to 6 seconds)
Launch new products or markets: 400x faster report execution; forecast sales trends in real time
Remote roadside diagnostics in real time: analyze 15 years (1 TB) of data in seconds
Deeper customer relationships: 360° customer view and comprehensive experience
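Two of the figures on this slide can be checked with simple arithmetic: the 216x speed-up for 3 days versus 20 minutes, and the roughly 75% reduction when a query goes from about 23 to 6 seconds. The sketch below is an illustration only; the helper names `speedup` and `reduction_percent` are hypothetical, and the inputs are the rounded values as they appear on the slide.

```python
DAY = 24 * 60 * 60  # seconds in a day
MINUTE = 60         # seconds in a minute


def speedup(before_seconds: float, after_seconds: float) -> float:
    """How many times faster the new run is than the old one."""
    return before_seconds / after_seconds


def reduction_percent(before_seconds: float, after_seconds: float) -> float:
    """Relative reduction in run time, as a percentage of the old run time."""
    return 100.0 * (before_seconds - after_seconds) / before_seconds


if __name__ == "__main__":
    # 3 days down to 20 minutes.
    print(speedup(3 * DAY, 20 * MINUTE))         # 216.0
    # About 23 seconds down to 6 seconds.
    print(round(reduction_percent(23, 6), 1))    # 73.9
```

The first result matches the 216x stated on the slide exactly. The second comes out at about 74%, which is consistent with the slide's rounded 75% given that the 23-second starting point is itself approximate.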
34 What is Big Data? The Vs
Volume: explosion in the amount of data
Velocity: fast collection, processing and consumption in real time
Variety: multiple data formats; boom in non-structured data
Value: use the data to gain insight and business value
35 SAP makes Big Data Actionable
Big Data Platform: Real Time
Big Data Analytics & Apps: Real Value
Big Data Science: Real Results
36 For more information about SAP's Big Data solutions, please visit the Big Data landing page. There you will find a wide range of materials including customer stories, information about our platform, solutions and services, as well as articles and whitepapers on the topic of Big Data.
37 40+ years of innovation, and we're just getting started. Join us in our world-changing journey!
© 2014 SAP AG or an SAP affiliate company. All rights reserved.