Value 10/06/2015 2015 MapR Technologies 2015 MapR Technologies 1 Information Builders Mission & Value Proposition Economies of Scale & Increasing Returns (Note: Not to be confused with diminishing returns on scale! ) Data Integration, Management, & Analytics 1
Information Builders and MapR Data Sources Clickstream Sensor Data Billing Data Product Catalog CRM / ERP Social Media Server Logs Merchant Listings iway NFS Drill Sqoop HDFS Processing and Analytics HBase Pig MapReduce v1 & v2 MapR DB Spark Storm Oozie Hive MLLib Solr YARN Mahout Access Drill MapReduce Hive Impala NFS Online Chat Call Detail Records MapR Data Platform Integration Data Quality Master Data Management Real Time Data Real Time Applications Real Time Analytics Internet of Things (IOT) Decision Support Information Distribution Analytics CIO Must Have Strategies Over Time Distributed Computing? Open Source? Web? Ecommerce? Cloud? Social? Mobile? Big Data? Back Office Automation Cost reduction Retrospective Front Office Customer intimacy Revenue creation Just-in-time 2015 MapR Technologies 4 2
Advantage From Speeding Data-to-Action Cycle What happened historically? What s happening now? What can we do to affect change? The As it happens Business 2015 MapR Technologies 5 Great. So why this Hadoop thing? 2015 2015 MapR MapR Technologies Technologies 6 3
2015 MapR Technologies 7 volume velocity variety 2015 MapR Technologies 8 4
2015 MapR Technologies 9 How can I build a how can I reduce graph of the Internet? customer unlimited churn? How can I index which performance are the most trillions of emails? profitable products? How can I make how mountains fraction can I prevent of of cash problems before they selling ads? start? the cost 2015 MapR Technologies 10 5
MapR is the technology leader in Hadoop Premiere Investors Best Product Apache Open Source + Innovation Hadoop NoSQL 700+ Customers 2015 MapR Technologies 11 Empowering the As-it-happens business by speeding up the data-to-action cycle 2015 MapR Technologies 12 6
Mobile Data Messages Social Media Email Today s Data Comes in Different Shapes Sensors Clickstream Audio 2015 MapR Technologies 13 Unstructured data will account for more than 80% of the data collected by organizations STRUCTURED DATA SEMI-STRUCTURED DATA Total Data Stored 1980 1990 2000 2010 2020 Source: Human-Computer Interaction & Knowledge Discovery in Complex Unstructured, Big Data 2015 MapR Technologies 14 7
1 2 Scale of analytics Speed of operations Source: TDWI, April 2014 2015 MapR Technologies 15 Agility by Reducing Distance to Data Short analytic life cycles with no upfront schema creation and management FROM: Total Time to Value: Weeks to Months Traditional Approach Hadoop Data Schema Design Transforma tion Data Movement Users Data Preparation New Business Questions TO: Total Time to Value: Minutes New Approach Hadoop Data Users Drill enables the As-It-Happens business with instant SQL analytics on complex data New Business Questions 2015 MapR Technologies 16 8
Drill is the Top-Ranked SQL-on-Hadoop 2015 MapR Technologies 17 Hadoop Architecture Apache Hadoop is an open source software project that enables distributed processing of large data sets across clusters of commodity servers. Drill Spark SQL Impala Hive 18 9
HDFS Batch Bottleneck MapR Real-Time Distribution Operational apps on HBase/Accumulo must be run in in a separate cluster from the analytics cluster. 1 MapR-DB runs in the same cluster as the analytics cluster (Hadoop), to avoid batch data copies across clusters. HBase/Accumulo suffer from service disruptions due due to to compactions, garbage collection, and and region splits. All All data movement into into HDFS forces batch processing. 2 MapR-DB architecture ensures consistently high responsiveness (low latency). MapR ingests data in real-time via MapR-DB, HDFS API, and NFS. 1 Analytics Analytics with with 11 st st generation generation SQL-on- SQL-on- Hadoop Hadoop requires requires ETL and ETL schema and schema creation. creation. 3 Apache Drill provides immediate self-service data exploration with no waiting on IT. 2 2 1 3 3 2015 MapR Technologies 19 Production Success with Hadoop 2015 MapR Technologies 20 10
2000+ Nodes Fortune 100 Retailer 2015 MapR Technologies 21 RETAILER Targeted Marketing: In-store Geo-located Offers 2000+ MapR Hadoop nodes Largest deployment in retail 200 DATA SCIENTISTS 245M CUSTOMERS per week +2% CONVERSION RATE IMPROVEMENT 40TB per NODE 7PB per CLUSTER +50 PRODUCTION APPLICATIONS 2015 MapR Technologies 22 11
2015 MapR Technologies 23 2015 MapR Technologies 24 12
Manage and Adapt to Climate Change 10T DATA POINTS from 2.5M SENSORS < 100TB DATA 60Yrs CROP-YIELD statistics 85% OF FARMER RISK IS WEATHER RELATED 2M LOCATIONS Natl. Weather Service Doppler Scans 10K Corn, weat growing OUTCOMES per location 2015 MapR Technologies 25 Customer Testimonials on MapR 2015 MapR Technologies 26 13
Recap: Analytics at the Speed of Thought 80% of cost of analytics projects is in data management and integration 80/20 Rule 80% of time is spent is on processing queries and not on analysis 80% of users do not get analytics in consumable format, i.e., easy to use applications A partnership that turns those into 20%!!!! 2015 MapR Technologies 27 Q & A Engage with us! @mapr mapr-technologies maprtech MapR twhite@mapr.com maprtech 2015 MapR Technologies 28 14