Laurence Liew, General Manager, APAC
Economics Is Driving Big Data Analytics to the Cloud





Big Data 101
The Analytics Stack
Economics of Big Data
Convergence of the 3 forces
Big Data Analytics in the Cloud with CloudR

Who we are
Leading provider of a commercial analytics platform based on the open source R statistical computing language.

Our Software Delivers
Power: Distributed, scalable, high-performance advanced analytics
Productivity: Easier to build and deploy analytic applications
Enterprise Readiness: Multi-platform

Our Services Deliver
Knowledge: Our experts enable you to be experts
Time-to-Value: Our QuickStart projects give you a jumpstart
Guidance: Our customer support team is here to help you

Our Philosophy
Customer-centric innovation
Easy to do business with

Customers: 200+, Global 2000
Global Presence: North America / EMEA / APAC
Global Industries Served: Financial Services, Digital Media, Government, Health & Life Sciences, High Tech, Manufacturing, Retail, Telco

200 Corporate Customers and Growing
Finance & Insurance, Healthcare & Life Sciences, Academic & Gov't, Consumer & Info Svcs, Manuf & Tech

Centre of Excellence (CoE)
Partner with iles to create new IPs in big data analytics in Singapore.
Conduct and run big data analytics training and workshops to promote the use and adoption of big data technologies and analytics.
Our data scientists and developers will work alongside our collaboration partners.

Centre of Attachment
To accelerate the formation of a data science team within an organization:
Analytics/statistics skills
Big data infrastructure skills such as Hadoop and HPC clusters
3-month program consisting of:
1. Classroom training spread over 2 months
2. Interspersed with practical hands-on work, guidance, and 1-on-1 consultations with Revo's data science team
3. 1 month of project work to deploy a model into the organization's infrastructure

Backdrop - Massive Data Volumes
[Figure: data sources plotted by scale, showing increasing volume, variety and velocity. Source: Decision Management Solutions, 2013]
Scale labels: Gigabytes, Terabytes, Petabytes, Exabytes
Data sources shown: 3D/4D Seismic, Realtime Telemetry, Machine Sensors, Communication Logs, Systems Logs, Vehicle Monitoring, Geospatial ESRI, Video and Imagery, Cost Records, Volumes, ERP, Logistics, Summary Operating Statistics, Incidents, Alarms, Daily Activity Reports, Text Instructions, Workorders, Reports

What's big data? Volume, Variety, Velocity

Volume: Big Data is big. A data set so large that it cannot be managed in a conventional database with acceptable performance and at acceptable cost.

Variety: Big Data is messy. 70-90% of all data generated lacks predefined structure or is difficult to map into a conventional data model.

Velocity: Big Data moves.
ICU: predict patient events
FICO: flag suspect transactions
Oreo: Superbowl ad from Tweets
Retail: push in-store offers


Next Generation Big Data Analytics Players???
Analytics
HDD -> SSD -> In-Memory
Infrastructure and Databases

Hadoop
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers.
1 node = 12 TB
10 nodes = 120 TB
100 nodes = 1.2 PB
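The node counts above scale linearly, which a short Python sketch makes explicit. The 12 TB per node figure comes from the slide. The function names and the `replication` parameter are illustrative assumptions: the slide quotes raw capacity (one copy of the data), while HDFS keeps three copies of each block by default, so usable capacity on a real cluster would be lower.

```python
def cluster_capacity_tb(nodes: int, tb_per_node: float = 12.0, replication: int = 1) -> float:
    """Storage in TB across `nodes`, divided by the number of copies kept per block.

    replication=1 reproduces the raw figures on the slide (an assumption of this
    sketch); replication=3 matches the HDFS default replication factor.
    """
    if nodes < 0 or replication < 1:
        raise ValueError("nodes must be >= 0 and replication must be >= 1")
    return nodes * tb_per_node / replication


def format_capacity(tb: float) -> str:
    """Render a TB value as TB or PB, using 1 PB = 1000 TB as the slide does."""
    if tb >= 1000:
        return f"{tb / 1000:g} PB"
    return f"{tb:g} TB"


if __name__ == "__main__":
    for n in (1, 10, 100):
        raw = format_capacity(cluster_capacity_tb(n))
        usable = format_capacity(cluster_capacity_tb(n, replication=3))
        print(f"{n:>3} nodes: {raw} raw, {usable} with 3x replication")
```

With the default arguments, 100 nodes give 1.2 PB, matching the slide; with three copies of every block the same cluster holds 400 TB of distinct data.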

Hadoop: Dell PowerEdge Servers


R = Language + Analytics
Statistical data analysis programming language
Huge library of algorithms for data access, manipulation, analysis & graphics

Data Analytics Workflow: Ingest -> Distill & Analyze -> Consume

Write Once. Deploy Anywhere.
Hadoop: Hortonworks, Cloudera, Intel
EDW: Teradata
Clustered Systems: Linux HPC, Windows HPC
Workstations & Servers: Desktop, Server, Linux
In the Cloud: Microsoft Azure, Amazon AWS (CloudR)
Components: ConnectR, DeployR, ScaleR, DistributedR
Designed for scale, portability & performance


Why Now? THE PERFECT STORM CONVERGENCE OF

Data Science - The Tool

Computer Science - The Infrastructure
Disruptive technology:
1. Commodity hardware
2. Open source: Linux, Hadoop, R

Computer Science - Attack of the Exponentials
1TB: $14M in 1980. ~ $4.70 $9 99GFlops
Cloud is the launching pad for data startups.

Management Science - The Data Scientist
20%, 20%, 60%: Magic, Statistics, Communications

Management Science - The Team
[Figure: team skills diagram. Labels: Data Integration, Mashups, Applications, Models, Visualization, Predictions, Uncertainty, Problems, Data Sources, Credibility, Effective Data Applications]
Source: Drew Conway, http://www.dataists.com/2010/09/the-data-science-venn-diagram/


The Cloud
More than buying VMs: PaaS/APIs, SaaS
Per-hour pricing
Infrastructure in 10 minutes upon sign-up
Cheap
Enabling innovation and letting you focus on your core IP
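The per-hour pricing argument comes down to simple arithmetic, sketched here in Python. Every number in the example is hypothetical and chosen only to make the calculation easy to follow; the slides give no prices, and the function names are illustrative.

```python
def on_demand_cost(nodes: int, hours: float, rate_per_node_hour: float) -> float:
    """Total cost of renting `nodes` machines for `hours` at an hourly rate."""
    return nodes * hours * rate_per_node_hour


def break_even_hours(upfront_cost: float, nodes: int, rate_per_node_hour: float) -> float:
    """Hours of cluster use at which renting costs as much as buying outright.

    Ignores power, staffing, and depreciation, so it understates the true
    cost of owning hardware.
    """
    return upfront_cost / (nodes * rate_per_node_hour)


if __name__ == "__main__":
    # Hypothetical figures, not taken from the presentation.
    nodes = 10
    rate = 1.00          # dollars per node-hour (assumed)
    purchase = 50_000.0  # dollars to buy an equivalent cluster (assumed)

    print(f"One 8-hour analysis run: ${on_demand_cost(nodes, 8, rate):,.2f}")
    print(f"Break-even usage: {break_even_hours(purchase, nodes, rate):,.0f} hours")
```

Under these assumed figures an occasional analysis run costs tens of dollars, and renting only overtakes the purchase price after thousands of hours of cluster use.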

Analytics Platform as a Service
Hi-Mem instances
HPC Clusters
Hadoop Clusters
Databases

HPA Benchmarking Comparison*: Logistic Regression

                 Leading legacy
                 analytics software
Rows of data     1 billion            1 billion     1 billion
Parameters       "just a few"         7             7
Time             80 seconds           44 seconds    95 seconds
Data location    In memory            On disk       On disk
Nodes            32                   5             8
Cores            384                  20            24
RAM              1,536 GB             80 GB         120 GB

Revolution R is faster on the same amount of data, despite using approximately a 20th as many cores, a 20th as much RAM, a 6th as many nodes, and not preloading data into RAM.

Months Weeks 10 minutes
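The ratios quoted in the benchmark summary can be checked with a few lines of Python. This sketch assumes the 80-second column is the legacy software and the 44-second column is Revolution R; that pairing is inferred from the summary sentence, not labeled in the transcript, and the variable names are illustrative.

```python
# Figures copied from the benchmark slide.
legacy = {"nodes": 32, "cores": 384, "ram_gb": 1536, "seconds": 80}
# Assumed to be the Revolution R run (the 44-second column).
revo = {"nodes": 5, "cores": 20, "ram_gb": 80, "seconds": 44}

# How many times more of each resource the legacy run used.
ratios = {key: legacy[key] / revo[key] for key in legacy}

if __name__ == "__main__":
    for key, value in ratios.items():
        print(f"{key:>8}: legacy used {value:.1f}x as much")
```

The core and RAM ratios both come to 19.2 and the node ratio to 6.4, consistent with "approximately a 20th" and "a 6th" in the summary; the run time ratio is about 1.8.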

CloudR

Thank you
Revolution Analytics is the leading commercial provider of software and support for the popular open source R statistics language.
E: Laurence.liew@revolutionanalytics.com
W: www.revolutionanalytics.com