Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Similar documents
Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

The Lab and The Factory

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

This Symposium brought to you by

Bringing Big Data to People

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

HDP Hadoop From concept to deployment.

The Future of Data Management with Hadoop and the Enterprise Data Hub

HDP Enabling the Modern Data Architecture

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

IBM Big Data Platform

UNIFY YOUR (BIG) DATA

The Future of Data Management

#TalendSandbox for Big Data

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Big Data Analytics Nokia

Getting Value from Big Data with Analytics

Big Data Use Cases. To Start Today. Paul Scholey Sales Director, EMEA. 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866)

Modernizing Your Data Warehouse for Hadoop

Data Integration Checklist

Integrating a Big Data Platform into Government:

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

IBM Data Warehousing and Analytics Portfolio Summary

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

ANALYTICS CENTER LEARNING PROGRAM

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Luncheon Webinar Series May 13, 2013

Are You Big Data Ready?

BIG DATA TRENDS AND TECHNOLOGIES

How to Enhance Traditional BI Architecture to Leverage Big Data

Big Data and Trusted Information

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Big Data and Data Science. The globally recognised training program

Using Tableau Software with Hortonworks Data Platform

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

Native Connectivity to Big Data Sources in MSTR 10

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Three Open Blueprints For Big Data Success

Loss Prevention Data Mining Using big data, predictive and prescriptive analytics to enpower loss prevention

The 4 Pillars of Technosoft s Big Data Practice

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Big Data Big Data/Data Analytics & Software Development

How To Make Data Streaming A Real Time Intelligence

Making Sense of Big Data in Insurance

Architecting for the Internet of Things & Big Data

Transforming the Telecoms Business using Big Data and Analytics

Big Data Integration: A Buyer's Guide

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

Modern Data Warehouse

Increase Revenue THE JOURNEY TO BIG DATA. Gary Evans. CTO EMC Ireland. Twitter.com/Gary3vans. Copyright 2013 EMC Corporation. All rights reserved.

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

How To Turn Big Data Into An Insight

W H I T E P A P E R. Building your Big Data analytics strategy: Block-by-Block! Abstract

Building Scalable Big Data Pipelines

BIG DATA TECHNOLOGY. Hadoop Ecosystem

SAP and Hortonworks Reference Architecture

ENTERPRISE BI AND DATA DISCOVERY, FINALLY

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist

Cloudera Enterprise Data Hub in Telecom:

Information Builders Mission & Value Proposition

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

IBM Big Data in Government

Big Data 101 Webinar

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Safe Harbor Statement

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

Big Data Executive Survey

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

Comprehensive Analytics on the Hortonworks Data Platform

Investor Presentation. Second Quarter 2015

Oracle Big Data SQL Technical Update

EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT

The Potential of Big Data in the Cloud. Juan Madera Technology Consultant

Talend Big Data. Delivering instant value from all your data. Talend

Navigating Big Data business analytics

How To Handle Big Data With A Data Scientist

INVESTOR PRESENTATION. Third Quarter 2014

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?

The Technology of the Business Data Lake

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

BIG DATA - HADOOP PROFESSIONAL amron

The Digital Enterprise Demands a Modern Integration Approach. Nada daveiga, Sr. Dir. of Technical Sales Tony LaVasseur, Territory Leader

Transcription:

Big Data Analytics 1

Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their charter? What are the different product, technology and architectural components that need to be considered? What process challenges for data collection, data cleansing and data quality concern you most? 2

It s a Whole New Big Data World 3

More than just data volume, big data analytics must also consider data velocity, variety, and complexity New insights on customers, products, and operations Velocity Volume Contextual and location-aware delivery to any device Variety Complexity Documents Transactional Data Smart Grid Images Audio Text Video Volume: data volumes approaching multiple petabytes Velocity: data being generated and ingested for analysis in real-time Variety: tabular, documents, e-mail, metering, network, video, image, audio Complexity: different standards, domain rules, and storage formats per data type Source: Gartner, March 2011 4

Big data analytics provides potential for more timely, complete, actionable business insights Over the last 25 years, companies have been focused on leveraging maybe 5% of the information available to them In order to compete well, companies are looking to dip into the rest of the 95% that can make them better than anyone else. Today s Situation Less than 10% of available enterprise data Rearview mirror reports, dashboards, and analysis Big Data Analytics Ramifications Vast majority of available data, including external sources Forward looking predictions with recommendations Weeks, months, or even quarters old Real-time or near real-time Incomplete, inaccurate, and disjointed data Architectures and methods that take 6 to 18 months to exploit Correlated, high confidence, governed data Vastly accelerated time to market Source: Forrester Research Inc. 5

What are the most compelling business drivers behind big data analytics (i.e., what gets your business stakeholders excited)? 6

Do you have or expect to have data scientists on your staff? Will they be in the business or in IT? What will be their charter? How will you measure their effectiveness? 7

Successful organizations continuously uncover and publish new insights about the business Data scientist (GigaOM) Obtain, scrub, explore, model,and interpret data, blending hacking, statistics, and machine learning, with good understanding of the business processes and goals 5) Business Consumes insights and measures effectiveness 4) IT Publishes new insights 1) Business Defines mandate and requirements 5 4 1 Strategic Business Initiative 2 3 2) IT Acquires and integrates data 3) Data Scientists Builds and refines analytic models 8

What are the different product, technology, and architectural components that need to be considered in a big data analytics project? 9

Data Marts Map- Reduce EMC Big Data Analytics Reference Architecture Data Sources Hadoop Alerts Documents Mobile Machine Multimedia Web/Social LOB data ERP CRM POS Data Quality MDM ETL Ecosystem* Map- Reduce Key Values Documents Other NoSql Enterprise Data Warehouse Federated Data Warehouse HDFS NoSQL Stores SQL Stores BU 1 BU 2 BU 3 BI as a Service Genetic Algorithms OLAP Neural Nets Statistics Data Mining Operations Research Dashboards Reports Spreadsheets Mobile Data Visualization Data Input Integration Data Stores and Access Data Analysis Presentation & Delivery Structured data sources Traditional data Integration Traditional data warehousing Big data analytics ramifications *Hadoop Ecosystem includes: Hive, Pig, Mahout, HBase, ZooKeeper, Oozie, Sqoop, Avro 10

What process challenges for data collection, data cleansing, and data quality concern you most with respect to big data and advanced analytics? 11

EMC IT use case of performance and security event management Data Volume, Velocity, Variety AND Complexity Challenges High volume of event data Numerous data types across thousands of collection points 12 MB/collection point per hour Information silo ed and difficult to aggregate and correlate Manually-intensive ad-hoc analytics Approach Created fast aggregation capabilities with Hadoop and a single data framework with the Greenplum database Mapped GRC model to control management layer Leveraged modern, integrated and interrelated analytic tools for correlation of events Implemented real-time data loading and analysis at high frequency Benefits Framework for single management of controls Faster investigation of incidents Automated and aggregated analysis Security embedded in virtual infrastructure 12

THANK YOU 13