Information Builders Mission & Value Proposition

Similar documents
Self-service BI for big data applications using Apache Drill

Self-service BI for big data applications using Apache Drill

Time-Series Databases and Machine Learning

The Future of Data Management

The Future of Data Management with Hadoop and the Enterprise Data Hub

Why Spark on Hadoop Matters

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

MapR: Best Solution for Customer Success

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

HDP Hadoop From concept to deployment.

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

SQL on NoSQL (and all of the data) With Apache Drill

Talend Big Data. Delivering instant value from all your data. Talend

Upcoming Announcements

The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer,

The Internet of Things and Big Data: Intro

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Comprehensive Analytics on the Hortonworks Data Platform

#TalendSandbox for Big Data

HDP Enabling the Modern Data Architecture

Big Data and Industrial Internet

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

The Top 10 7 Hadoop Patterns and Anti-patterns. Alex

Real Time Big Data Processing

Saving Millions through Data Warehouse Offloading to Hadoop. Jack Norris, CMO MapR Technologies. MapR Technologies. All rights reserved.

Moving From Hadoop to Spark

Ali Ghodsi Head of PM and Engineering Databricks

Dominik Wagenknecht Accenture

Cloudera Enterprise Data Hub in Telecom:

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

A Modern Data Architecture with Apache Hadoop

Survival Tips for Big Data Impact on Performance Share Pittsburgh Session 15404

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Apache Hadoop: Past, Present, and Future

Driving Growth in Insurance With a Big Data Architecture

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

Modernizing Your Data Warehouse for Hadoop

Oracle Database 12c Plug In. Switch On. Get SMART.

Hadoop Trends and Practical Use Cases. April 2014

Integrating a Big Data Platform into Government:

The Digital Enterprise Demands a Modern Integration Approach. Nada daveiga, Sr. Dir. of Technical Sales Tony LaVasseur, Territory Leader

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

HADOOP. Revised 10/19/2015

So What s the Big Deal?

Big Data Analytics Nokia

SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Data Analyst Program- 0 to 100

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

Native Connectivity to Big Data Sources in MSTR 10

Hortonworks Data Platform for Hadoop and SAP HANA

Big Data Storage Challenges for the Industrial Internet of Things

Dell In-Memory Appliance for Cloudera Enterprise

Tap into Hadoop and Other No SQL Sources

How to Hadoop Without the Worry: Protecting Big Data at Scale

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Investor Presentation. Second Quarter 2015

Hadoop Ecosystem B Y R A H I M A.

Unified Big Data Processing with Apache Spark. Matei

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Large scale processing using Hadoop. Ján Vaňo

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

The Enterprise Data Hub and The Modern Information Architecture

Agenda. Big Data & Hadoop ViPR HDFS Pivotal Big Data Suite & ViPR HDFS ViON Customer Feedback #EMCVIPR

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

How To Handle Big Data With A Data Scientist

Case Study : 3 different hadoop cluster deployments

How Companies are! Using Spark

Apache Hadoop: The Pla/orm for Big Data. Amr Awadallah CTO, Founder, Cloudera, Inc.

Data Services Advisory

We are building the next generation of Big Data and Analytics solutions!

Hadoop implementation of MapReduce computational model. Ján Vaňo

SAP and Hortonworks Reference Architecture

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

BIG DATA What it is and how to use?

Workshop on Hadoop with Big Data

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Certified Big Data and Apache Hadoop Developer VS-1221

Ganzheitliches Datenmanagement

MySQL and Hadoop Big Data Integration

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Big Data Zurich, November 23. September 2011

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Bringing Big Data to People

brief contents PART 1 BACKGROUND AND FUNDAMENTALS...1 PART 2 PART 3 BIG DATA PATTERNS PART 4 BEYOND MAPREDUCE...385

Oracle Big Data SQL Technical Update

Hadoop. for Oracle database professionals. Alex Gorbachev Calgary, AB September 2013

Transcription:

Value 10/06/2015 2015 MapR Technologies 2015 MapR Technologies 1 Information Builders Mission & Value Proposition Economies of Scale & Increasing Returns (Note: Not to be confused with diminishing returns on scale! ) Data Integration, Management, & Analytics 1

Information Builders and MapR Data Sources Clickstream Sensor Data Billing Data Product Catalog CRM / ERP Social Media Server Logs Merchant Listings iway NFS Drill Sqoop HDFS Processing and Analytics HBase Pig MapReduce v1 & v2 MapR DB Spark Storm Oozie Hive MLLib Solr YARN Mahout Access Drill MapReduce Hive Impala NFS Online Chat Call Detail Records MapR Data Platform Integration Data Quality Master Data Management Real Time Data Real Time Applications Real Time Analytics Internet of Things (IOT) Decision Support Information Distribution Analytics CIO Must Have Strategies Over Time Distributed Computing? Open Source? Web? Ecommerce? Cloud? Social? Mobile? Big Data? Back Office Automation Cost reduction Retrospective Front Office Customer intimacy Revenue creation Just-in-time 2015 MapR Technologies 4 2

Advantage From Speeding Data-to-Action Cycle What happened historically? What s happening now? What can we do to affect change? The As it happens Business 2015 MapR Technologies 5 Great. So why this Hadoop thing? 2015 2015 MapR MapR Technologies Technologies 6 3

2015 MapR Technologies 7 volume velocity variety 2015 MapR Technologies 8 4

2015 MapR Technologies 9 How can I build a how can I reduce graph of the Internet? customer unlimited churn? How can I index which performance are the most trillions of emails? profitable products? How can I make how mountains fraction can I prevent of of cash problems before they selling ads? start? the cost 2015 MapR Technologies 10 5

MapR is the technology leader in Hadoop Premiere Investors Best Product Apache Open Source + Innovation Hadoop NoSQL 700+ Customers 2015 MapR Technologies 11 Empowering the As-it-happens business by speeding up the data-to-action cycle 2015 MapR Technologies 12 6

Mobile Data Messages Social Media Email Today s Data Comes in Different Shapes Sensors Clickstream Audio 2015 MapR Technologies 13 Unstructured data will account for more than 80% of the data collected by organizations STRUCTURED DATA SEMI-STRUCTURED DATA Total Data Stored 1980 1990 2000 2010 2020 Source: Human-Computer Interaction & Knowledge Discovery in Complex Unstructured, Big Data 2015 MapR Technologies 14 7

1 2 Scale of analytics Speed of operations Source: TDWI, April 2014 2015 MapR Technologies 15 Agility by Reducing Distance to Data Short analytic life cycles with no upfront schema creation and management FROM: Total Time to Value: Weeks to Months Traditional Approach Hadoop Data Schema Design Transforma tion Data Movement Users Data Preparation New Business Questions TO: Total Time to Value: Minutes New Approach Hadoop Data Users Drill enables the As-It-Happens business with instant SQL analytics on complex data New Business Questions 2015 MapR Technologies 16 8

Drill is the Top-Ranked SQL-on-Hadoop 2015 MapR Technologies 17 Hadoop Architecture Apache Hadoop is an open source software project that enables distributed processing of large data sets across clusters of commodity servers. Drill Spark SQL Impala Hive 18 9

HDFS Batch Bottleneck MapR Real-Time Distribution Operational apps on HBase/Accumulo must be run in in a separate cluster from the analytics cluster. 1 MapR-DB runs in the same cluster as the analytics cluster (Hadoop), to avoid batch data copies across clusters. HBase/Accumulo suffer from service disruptions due due to to compactions, garbage collection, and and region splits. All All data movement into into HDFS forces batch processing. 2 MapR-DB architecture ensures consistently high responsiveness (low latency). MapR ingests data in real-time via MapR-DB, HDFS API, and NFS. 1 Analytics Analytics with with 11 st st generation generation SQL-on- SQL-on- Hadoop Hadoop requires requires ETL and ETL schema and schema creation. creation. 3 Apache Drill provides immediate self-service data exploration with no waiting on IT. 2 2 1 3 3 2015 MapR Technologies 19 Production Success with Hadoop 2015 MapR Technologies 20 10

2000+ Nodes Fortune 100 Retailer 2015 MapR Technologies 21 RETAILER Targeted Marketing: In-store Geo-located Offers 2000+ MapR Hadoop nodes Largest deployment in retail 200 DATA SCIENTISTS 245M CUSTOMERS per week +2% CONVERSION RATE IMPROVEMENT 40TB per NODE 7PB per CLUSTER +50 PRODUCTION APPLICATIONS 2015 MapR Technologies 22 11

2015 MapR Technologies 23 2015 MapR Technologies 24 12

Manage and Adapt to Climate Change 10T DATA POINTS from 2.5M SENSORS < 100TB DATA 60Yrs CROP-YIELD statistics 85% OF FARMER RISK IS WEATHER RELATED 2M LOCATIONS Natl. Weather Service Doppler Scans 10K Corn, weat growing OUTCOMES per location 2015 MapR Technologies 25 Customer Testimonials on MapR 2015 MapR Technologies 26 13

Recap: Analytics at the Speed of Thought 80% of cost of analytics projects is in data management and integration 80/20 Rule 80% of time is spent is on processing queries and not on analysis 80% of users do not get analytics in consumable format, i.e., easy to use applications A partnership that turns those into 20%!!!! 2015 MapR Technologies 27 Q & A Engage with us! @mapr mapr-technologies maprtech MapR twhite@mapr.com maprtech 2015 MapR Technologies 28 14