Endeca Introduction to Big Data Analytics Overview May 8, 2013 1
Agenda Introduction Overview Analytics for Big Data Overview Endeca Information Discovery Q & A 2
Introduction Business vs. IT Big Data Initiatives What Technologies are being used? Projects underway? Familiarity with Big Data - What is Big Data? What technologies support it? Terms like Hadoop, HDFS Familiarity with Endeca Your Goals for this Session 3
4 V s of Big Data 4
Sample Big Data flow Customer Records from Various Silos Customer Monitoring Data Transformation Track Life Events Explore current Customer Information For Identified customers, track bank web pages visited Collect Real Time Web Log Entries Batch Archiving of Web Log Entries Summarize Customer related Information for Analytics Identify Statistical Outliers Products of Interest Historical metrics Monitor and track Identified metrics of interest Alert Anomalies Source Recommend Action Acquire Organize Analyze Decide 5
Big Data Software Components Customer Records from Various Silos For Identified customers, track bank web pages visited Social Media Monitoring NoSQL Database Hadoop Connectivity Data Warehouse Advanced Analytics Oracle Endeca Information Discovery Business Intelligence Decisioning Source Acquire Organize Analyze Decide 6
Big Data Software Components Customer Records from Various Silos Connectivity For Identified customers, track bank web pages visited Source NoSQL Database Hadoop Data Warehouse Advanced Analytics Business Intelligence Big Data Appliance Exadata Acquire Organize Analyze Decide 7
Endeca Information Discovery 8
Data Volume, Variety Growth Presents New Challenges More and Different Data Rapid growth in structured and unstructured, internal and external data More Change and Uncertainty Unanticipated business needs can t be addressed by predefined models, dashboards, and reports More Unanticipated Questions Self-service ability to explore data, add new data, and construct analysis as required 9
Dialogues Contain Critical, Untapped Insights Why should I pay a checking fee if other banks don t charge one? She saved the project with strong leadership & building trust with the customer.. Competitive pricing is 15% lower than yours and they offer discounted shipping. RT @finwiz. Checkout video http://bit.ly/wle6y2 Sweet! Customers Workforce 3 rd Parties The Public 10
New Insights Drive New Opportunities Customers Improved Targeting Increased Customer Satisfaction Brand Awareness & Perception Impact Workforce Employee Satisfaction Improved Performance Hiring & Retention of Top Talent 3 rd Parties Revenue Improved & margin Operational Operational Efficiency efficiency. Better Better partnerships Partnerships Product Product positioning. Awareness The Public Improved Brand Perception Insight into Topics and Trends Public Awareness 11
Volume of Data Generated Challenge Difficult to Harness Unstructured Data 80% of the world s information is unstructured 80% Diverse sources, spread out everywhere Unpredictable structure, impossible to model Growing exponentially, constantly changing Full of noise, poor data quality Social Media Websites Big Data Existing solutions are not enough Content Systems, Files, Email Text in Enterprise Applications 20% 2005 2010 2015 Structured Data Unstructured Data 12
Unlocking the Value of Unstructured Data Structured & Unstructured Analysis Business Intelligence Who, What, When, Where? Information Discovery Why, How? Answer All the Questions Using All the Relevant Data 13
Extend Business Analytics with Unstructured Data Introducing Oracle Endeca Information Discovery Oracle Business Intelligence Best platform for integrated ROLAP and MOLAP BI Server + Essbase Common Enterprise Information Model Oracle Endeca Information Discovery Best platform for Unstructured Analytics Endeca Server Hybrid Search/Analytical Database Flexible Data Model Structured Data Unstructured Data OLTP & ODS Systems Enterprise Applications (Oracle, SAP, Others) Data Warehouse & Data Marts Websites Content Systems, Files, Email Social Media Big Data 15
Oracle Endeca Information Discovery Understand the complete picture with context from any source. BI Star Schemas Metrics: Margin = Sales - Cost, Dimensions: Customer, Product, Transactions TxnID ProductI Amoun Category D t 12324 506 Mountain Bike 499 12325 507 Road Bike 1399 Enterprise Content System MS Office Documents, Internal Notes Websites Product Reviews Marketing Strategy Document Chapter 1. Enterprise Applications Unstructured Text Fields Customer reported receiving wheel with bent spokes Why did sales fall? Was it bad reviews? An unsuccessful marketing campaign? Quality issues? Social Media Customer Comments The new 506 from Acme is a solid road bike. 3.5 stars. Thinking about buying a new road bike. Anyone know anything about the Acme 506? 17
Oracle Endeca Information Discovery Rapid, intuitive exploration and analysis of data from any combination of structured and unstructured sources Unstructured Analytics Benefits Unprecedented Information Visibility Leverage Existing BI Investments Self-Service Data Discovery Reduced IT Costs, Better Business Decisions Unique Features Contextual Search, Navigation, Analytics Dynamic Data and Metadata Content Acquisition and Text Enrichment In-Memory Performance 19
Questions 20