Big Data Success Step 1: Get the Technology Right



Similar documents
Using Data Mining and Machine Learning in Retail

Big Data Analytics - Accelerated. stream-horizon.com

Presenters: Luke Dougherty & Steve Crabb

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Bringing Big Data into the Enterprise

Tap into Hadoop and Other No SQL Sources

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Big Data Technologies Compared June 2014

Performance and Scalability Overview

The Future of Data Management

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Big Data and Hadoop for the Executive A Reference Guide

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

How To Handle Big Data With A Data Scientist

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Oracle Big Data SQL Technical Update

In-Memory Analytics for Big Data

Native Connectivity to Big Data Sources in MSTR 10

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Advanced In-Database Analytics

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

[Hadoop, Storm and Couchbase: Faster Big Data]

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Comprehensive Analytics on the Hortonworks Data Platform

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Talend Big Data. Delivering instant value from all your data. Talend

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Saving Millions through Data Warehouse Offloading to Hadoop. Jack Norris, CMO MapR Technologies. MapR Technologies. All rights reserved.

Data Challenges in Telecommunications Networks and a Big Data Solution

Dell Cloudera Syncsort Data Warehouse Optimization ETL Offload

Whitepaper. An Rx for Enterprise Big Data Success. by Paul Barth and Prithwi Thakuria NVP. NewVantage Partners

Integrate Big Data into Business Processes and Enterprise Systems. solution white paper

The Internet of Things and Big Data: Intro

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

Driving Growth in Insurance With a Big Data Architecture

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES

Deploying an Operational Data Store Designed for Big Data

Big Data and Data Science: Behind the Buzz Words

Dominik Wagenknecht Accenture

Big Data and Its Impact on the Data Warehousing Architecture

EMC BACKUP MEETS BIG DATA

Big Data for Investment Research Management

Virtualizing Apache Hadoop. June, 2012

Why All Enterprise Data Integration Products Are Not Equal

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

White. Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014

Information Builders Mission & Value Proposition

Cloudera Enterprise Data Hub in Telecom:

BIG DATA TRENDS AND TECHNOLOGIES

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

Cost-Effective Business Intelligence with Red Hat and Open Source

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

HDP Enabling the Modern Data Architecture

Building a real-time, self-service data analytics ecosystem Greg Arnold, Sr. Director Engineering

Understanding How Sensage Compares/Contrasts with Hadoop

An Oracle White Paper October Oracle: Big Data for the Enterprise

Has been into training Big Data Hadoop and MongoDB from more than a year now

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

INDUS / AXIOMINE. Adopting Hadoop In the Enterprise Typical Enterprise Use Cases

IBM BigInsights for Apache Hadoop

Advanced Big Data Analytics with R and Hadoop

There s no way around it: learning about Big Data means

Big Data and Trusted Information

The Future of Data Management with Hadoop and the Enterprise Data Hub

So What s the Big Deal?

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

Performance and Scalability Overview

Data Analytics Infrastructure

HDP Hadoop From concept to deployment.

Luncheon Webinar Series May 13, 2013

Using Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM

Please give me your feedback

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Enterprise Operational SQL on Hadoop Trafodion Overview

Peninsula Strategy. Creating Strategy and Implementing Change

Transcription:

Big Data Success Step 1: Get the Technology Right TOM MATIJEVIC Director, Business Development ANDY MCNALIS Director, Data Management & Integration MetaScale is a subsidiary of Sears Holdings Corporation

Speaker Overview TOM MATIJEVIC Director, Business Development Responsible for business development, client acquisition, and long-term customer profitability Collaborates with organizations across diverse industries to help deliver value from big data giving business users throughout the enterprise access to more data faster than ever Over 25 years experience with a diverse background that includes data center infrastructure, entrepreneurship, marketing, and business development ANDY MCNALIS Director, Data Management & Integration Leading member of the team that builds, deploys and manages an enterprise-scale Hadoop platform at Sears Member of the Sears / MetaScale Big Data Center of Excellence Involved in the development of design best practices for Hadoop and production-ready big data environments Spent over seven years as a Data Warehouse Manager at Sears, managing Teradata, Netezza, Greenplum and other data warehouse environments 2 2 2

MetaScale A Company of Sears Holdings MetaScale grew out of a Fortune 100 enterprise that leverages the Hadoop ecosystem to manage several petabytes of data and new business intelligence capabilities We manage reliable production environments for our parent company and other enterprise customers: Over 3 petabytes of effective storage Over 500 nodes in multiple data centers Hadoop, Storm, Kafka, Cassandra in production Our Big Data Center of Excellence is comprised of 200+ practitioners with deep practical experience in designing, developing, and integrating production Hadoop and NoSQL solutions. Tap into the best enterprise Hadoop talent available in the marketplace! 3 3 3

Over a Century of Innovation A Fortune 100 company, nearly $40 billion in annual revenue The nation s fourth largest broad line retailer with almost 2,500 full-line and specialty retail stores in the US and Canada A front runner in big data efforts including driving personalized marketing and generating savings from legacy migration Running one of the biggest rewards programs that captures and analyzes large volume of customer transactions quickly 4 4 4

Sears A Technology Perspective Online Mobile In-home Membership In-store 5 5

Everything is Great What Happens at the Backend? Hundreds of Nodes of Hadoop clusters Massive Virtual Private Cloud computing Hadoop Highly optimized shared Databases Over 5 Petabytes of Data Billions of transactions and Price changes NoSQL databases with Hadoop 6 6

The Classic Enterprise Challenge Constant pressure to lower costs, deliver faster, migrate to faster processing cycles or real time and answer more difficult questions Tight IT budgets Growing data volumes Shortened processing windows Latency in data The Challenge Escalating costs ETL complexity Demanding business requirements Hitting scalability ceilings 7 7

It All Started With a Pricing Problem The Challenge Intensive computational and large storage requirements Needed to calculate item price elasticity based on 8 billion rows of sales data Could only be run quarterly and on subset of data Needed more often Business need - React to market conditions and new product launches 1.4B SKUs across 3,400 Sites Technology Stack Legacy Systems - Mainframe Teradata / Exadata All others The Solution Implemented a Hadoop and Cassandra based infrastructure Reduced Price Elasticity Calculations to weekly Increased data set volumes and granularity Meet Business Requirement SLAs 8 8

Data Warehousing Tools and Products DATA WAREHOUSE Traditional Proprietary High costs licensing, support, etc. Teradata Market leader Solution includes Software and Hardware Works as advertised Costly to scale Greenplum (EMC/Pivotal) Software only option (but they do have an appliance) Offered MapReduce Columner projects Netezza (IBM) 9 9

Big Data Tools and Products BIG DATA Open source Many contributors Less expensive to deploy as compared to traditional Data Warehouse vendors Hadoop HDFS Hadoop Distributed File System Map/Reduce processing framework Batch centric Becoming more database like NoSQL HBase Cassandra MongoDB Real-time analytics Scales easily and inexpensively 10 10

Right Tool for the Right Job 11 11

Enterprise Data Hub 12 12

Modernizing Legacy Systems Mainframe MIPS Optimization Eliminate ETL Bottlenecks Mainframe batch business process would not scale needed to process 100 times more detail to handle rollout of high value business critical functionality. We migrated sections of batch process from mainframe to Hadoop and back, eliminating MIPS and improving overall cycle time without disruption to business users. ELT X -- Extract Load Transform, Transform, Transform ETL processing with DataStage platform was taking over ten hours to complete. With Enterprise Data Hub model on Hadoop, were are able to use Pig to complete the transformations in less than an hour. 13 13

Browsing History Hadoop and HBase 14 14

Customer Analytics with Social Media Data Brand Perception and Sentiment Analysis Brand A Brand A Brand B Brand C Brand A Brand B Mixed Brand C Brand C Brand Perception Comparison Brand B Analyze conversation sentiment about multiple brands 15 15

Key Takeaways Enterprise Data Hub and single version of truth for all data Hadoop can help you answer questions that were difficult or cost prohibitive to answer before Hadoop can transform your organization s approach to how you use data and ask questions you never even thought of Must have a clear strategy and long-term plan Leverage the right partnerships to achieve your goals 16 16

How We Help Achieve Long Term Success Start Small >> Define Success >> Strategic Partnership >> MetaScale takes a holistic approach to advising organizations in the development of use cases with clearly defined success criteria. We design and build relevant business cases, custom big data programs and long term strategy to meet current and future analytics requirements. Our proven methodologies, best practices, patent pending tools and experienced resources make us your best strategic partner for accelerating your big data initiatives. 17 17

Your One-Stop Big Data Helpline For further information phone: email: visit: 1-800-234-8769 contact@metascale.com www.metascale.com 18 18