Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Gokula Mishra Premjith Balakrishnan Business Analytics Product Group September 29, 2014 Copyright 2014, Oracle and/or its affiliates. All rights reserved.
Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle s products remains at the sole discretion of Oracle. Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 3
Agenda 1 2 Introduction to Big Data Discovery Q&A Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 4
Agenda 1 2 Introduction to Big Data Discovery Q&A Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 5
Hadoop Data Reservoir Concept Gaining Momentum Data Warehouse Data Reservoir Existing Sources Emerging Sources Source: 451 Research Total Data Warehousing: 2013-2018 Source: wikibon.org/wiki/v/big_data_vendor_revenue_and_market_forecast_2013-2017 Source: The Forrester WaveTM: Big Data Hadoop Solutions, Q1 2014 Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 6
Not Easy to Get Analytic Value from Hadoop Data Reservoir Volume, Variety, Velocity = Complexity Data not organized Complex, non-integrated tools Specialized skills required? Path to Production Unclear Difficult to share with masses Hard to secure Lack of governance Impact: Lack of Analytic Agility 80% effort spent on data preparation vs. analytics Impact: Poor Enterprise Adoption Insights not widely leveraged across the organization Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 7
The Big Data Opportunity What if we could reverse: 80% - Data Preparation 20% - Analysis Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 8
Requires a Fundamentally New Approach An intuitive, interactive and visual user interface for anyone to quickly find, explore, transform and analyze data in Hadoop Data Scientist then share results for enterprise leverage Advanced Analytics Find Explore Business Intelligence Discover Transform Other Hadoop Tools Business User Business Analyst Data Warehouse Increase Analytic Agility Maximize Enterprise Adoption Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 9
Oracle Big Data Discovery. The Visual Face of Hadoop Find Explore Discover Transform Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 10
Easily Find Relevant Data Sets Navigate a rich catalog of all data in the Hadoop cluster Familiar search and guided navigation for ease of use Access data set summaries, annotation and recommendations Provision your own data through self-service upload Browse personal big data projects and those shared by the community Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 11
Explore the Data and Understand Potential Understand shape of the data. Visualize attributes by type Entropy based sorting by information potential View attribute statistics, data quality and outliers Use scratch pad to see statistical correlations between attribute combinations Evaluate whether a data set is worthy of further investment Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 12
Transform and Enrich Data to Make it Ready Intuitive user driven data wrangling Library of data transformations to replace values, convert types, collapse, reshape, pivot, group, custom tag, merge and much more Data enrichments for inferring location and language. Theme, entity and sentiment enrichments for text Preview results, undo, commit and replay transforms Run on sample data in memory or full data set in Hadoop Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 13
Analyze the Data to Discover New Insights Mash up different data sets for deeper perspectives Drag and drop from a rich library of interactive visualizations to compose discovery dashboards Filter through data with powerful search and intuitive guided navigation Publish blended data sets back to Hadoop Share projects, bookmarks and snapshots with team members for collaboration Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 14
Share Results and Publish for Enterprise Leverage Find Discover Explore Transform Share & Collaborate raw data transformed data Publish data reservoir (HDFS) Leverage advanced analytics Oracle Big Data Discovery plays well with the big data ecosystem business intelligence other hadoop tools data warehouse Share and collaborate with the team Share projects, bookmarks and snapshots then collaborate and iterate Publish back to Hadoop Transforms and enrichments may be applied to original data sets in Hadoop Publish blended data sets back to HDFS Leverage results in other tools Publish data to Hadoop in format optimized for advanced analytic tools (e.g. ORAAH) Hadoop compliant BI tools (e.g. OBIFS) can burst out to the masses Leverage any native Hadoop tooling (e.g. Pig, Hive, Impala, Python, etc) Integrate BDD data sets with DWH to secure, govern and optimize for query performance (e.g. Oracle Big Data SQL) Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 15
Unstructured Data Structured Data Oracle s Unified Big Data Management and Analytics Strategy Exalytics Oracle BI Foundation Suite In-Memory Appliance Oracle SQL Queries Experiment, Prototype, Collaborate Quickly find, explore, transform, discover and share in BDD Productize, Secure & Govern Exadata Oracle Advanced Analytics Oracle Database Oracle Big Data SQL Tables in DB Publish results to HDFS Use to build predictive models with Oracle R for Hadoop Productize, Secure, Govern Experiment, Prototype & Collaborate BDA Data Warehouse Oracle Big Data Discovery Hadoop (HDFS) Data Reservoir Oracle R for Hadoop SQL join Tables in Hadoop Connect published HDFS files to secure Oracle DB using Oracle Big Data SQL No data movement required Seamlessly extends existing DWH and BI investments with non-traditional data in Hadoop Available as Engineered Systems Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal
Oracle Big Data Discovery. A Game Changing Platform Benefits to the Business Get Value Faster. Rapidly turn raw data into actionable insights that can be leveraged across the enterprise Democratize Value from Big Data. Increase the size, diversify the skills, and improve the efficiency of Big Data project teams Benefits to IT Destroy Existing Technical Barriers. Run natively on Hadoop cluster for maximum scalability and performance Share, Publish, Secure and Leverage. Integrate with Hadoop open standards and leverage the Oracle big data ecosystem Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 17
Agenda 1 2 Introduction to Big Data Discovery Q&A Copyright 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential Internal 18
Copyright 2014 Oracle and/or its affiliates. All rights reserved.