Cisco IT Hadoop Journey



Similar documents
Cisco IT Hadoop Journey

HDP Hadoop From concept to deployment.

Ganzheitliches Datenmanagement

The Future of Data Management

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Native Connectivity to Big Data Sources in MSTR 10

Production ready hadoop. By Deepak Rao Na,onal Head Datawarehousing Bajaj Finserv

HDP Enabling the Modern Data Architecture

IBM Solution Framework for Lifecycle Management of Research Data IBM Corporation

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Comprehensive Analytics on the Hortonworks Data Platform

Big Data Analytics Nokia

How Cisco IT Built Big Data Platform to Transform Data Management

Data Virtualization A Potential Antidote for Big Data Growing Pains

SAP BusinessObjects Business Intelligence 4.1 One Strategy for Enterprise BI. May 2013

SAP and Hortonworks Reference Architecture

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

The Impact of PaaS on Business Transformation

Big Data and the Data Lake. February 2015

Migrating Discoverer to OBIEE Lessons Learned. Presented By Presented By Naren Thota Infosemantics, Inc.

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Descriptive to Predictive to Prescriptive Analytics: Move Up the Value Chain. Suren Nathan CTO

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

Cisco IT Automates Workloads for Big Data Analytics Environments

Integrating a Big Data Platform into Government:

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

Decision Ready Data: Power Your Analytics with Great Data. Murthy Mathiprakasam

Successfully Deploying Alternative Storage Architectures for Hadoop Gus Horn Iyer Venkatesan NetApp

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Winning Against All Odds: Big Data for the Budget Travel Industry. Silviu Preoteasa Head of Marketing Technology

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

How To Use A Big Data Platform For A Business

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

The Digital Enterprise Demands a Modern Integration Approach. Nada daveiga, Sr. Dir. of Technical Sales Tony LaVasseur, Territory Leader

Providing real-time, built-in analytics with S/4HANA. Jürgen Thielemans, SAP Enterprise Architect SAP Belgium&Luxembourg

The Future of Data Management with Hadoop and the Enterprise Data Hub

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

ANALYTICS CENTER LEARNING PROGRAM

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Service Oriented Data Management

MDM and Data Warehousing Complement Each Other

The BIg Picture. Dinsdag 17 september 2013

Data Governance in the Hadoop Data Lake. Kiran Kamreddy May 2015

More Data in Less Time

Accelerating the path to SAP BW powered by SAP HANA

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Luncheon Webinar Series May 13, 2013

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

FUJITSU Software Interstage Business Operations Platform: A Foundation for Smart Process Applications

Build Your Competitive Edge in Big Data with Cisco. Rick Speyer Senior Global Marketing Manager Big Data Cisco Systems 6/25/2015

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Are You Big Data Ready?

Cisco Solutions for Big Data and Analytics

The Enterprise Data Hub and The Modern Information Architecture

QlikView Business Discovery Platform. Algol Consulting Srl

SAP Database Strategy Overview. Uwe Grigoleit September 2013

Big Data Can Drive the Business and IT to Evolve and Adapt

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

Oracle Big Data SQL Technical Update

Hadoop & Spark Using Amazon EMR

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

Cisco Data Preparation

Predictive Analytics. Noam Zeigerson, CTO

Bringing Big Data to People

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Exploring the Synergistic Relationships Between BPC, BW and HANA

STREAM ANALYTIX. Industry s only Multi-Engine Streaming Analytics Platform

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

Beyond High Availability Replication s Changing Role

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

The Internet of Things and Big Data: Intro

Deploying an Operational Data Store Designed for Big Data

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

UNIFY YOUR (BIG) DATA

Data Warehouse Overview. Srini Rengarajan

Data Integration Checklist

Hadoop Trends and Practical Use Cases. April 2014

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Big Data for Investment Research Management

Transcription:

Cisco IT Hadoop Journey Srini Desikan, Program Manager IT 2015 MapR Technologies 1

Agenda Hadoop Platform Timeline Key Decisions / Lessons Learnt Data Lake Hadoop s place in IT Data Platforms Use Cases 2013 Cisco and/or its affiliates. All rights reserved. 2

Bringing Hadoop into Cisco IT in 2011-2012 Paradigm shift from database based application development of last 2 decades at Cisco IT - Cost Structure - Development Methodology & Project lifecycle - Programming Model - Maturity curve of the technology is different FUD Fear, Uncertainty and Doubt Availability of skilled workforce Rapid pace of innovation and constantly changing industry dynamics 2013 Cisco and/or its affiliates. All rights reserved. 3

Hadoop Journey in Cisco IT Use Cases Deployment Enterprise Data Lake 2014 Growth & Expanding Ecosytem POCs 2011 Multi-tenant Shared Platform July 2012 Starting 2013. 2013 Cisco and/or its affiliates. All rights reserved. 4

Key Decisions Rationale Open Source vs Distribution Architecture Operational Excellence, Availability, Performance, Skill set UCS Common Platform Architecture Support Growth & Leverage Ecosystem Hive (SQL), Mahout, Hbase, Cost & Ecosystem Environment Lifecycle Data Lake Production, Stage, Development & Technical POC (Isolate usage by Risk & Development lifecycle) Data Governance, Reduce cost, Eliminate duplication 2013 Cisco and/or its affiliates. All rights reserved. 5

Lessons from Technology Journey Architecture Choice (s) Multi-tenant Mission critical features Start Small & Grow Support: Open Source or Distribution Leverage Skills. Use components that help users leverage the existing skills like Informatica and SQL Tiered Integrated Architecture to manage data across multiple platforms 2013 Cisco and/or its affiliates. All rights reserved. 6

Lessons from Technology Journey Hive doesn t support ANSI SQL Reusable UDFs for Hive were created Tidal Enterprise Scheduler allowed for easy workload management and error handling Hadoop scales linearly and our platform grew 100% in the first year. Invest in architecture that allows you to grow. 2013 Cisco and/or its affiliates. All rights reserved. 7

Data Platform Reference Architecture v3 Data Sources Data Storage and Processing Data Consumption (Mobile / Browser / Data Service) Databases ALL other Sources Cisco Data Virtualization (Composite) Logical Data Abstraction Layer across transactional, SaaS, Big Data & DW Experience Toolkit Rapid Prototyping / Data Integration / Data Services Databases Agile Analytics Self Service Dashboard Rapid Business Intell. Customer Registry ERP SFDC Docs, Cases, Content, Social Media, Clicksteam Customer Network, Product Usage Internet of Everything (IoE) Big Data Platform Hadoop & Spark on UCS Machine Learning Data Archiving Data Science Network of Truth SAP HANA on UCS Prrediictive Engine Real time BI Mission Critical Reporting Legacy EDW Financial SSOTs Stable core Controlled Change Cisco Data Virtualization (Composite) Analytics & Modeling HANA Hadoop & Spark SAS Data Exploration Real time Predictive Data Analysis, Analytics Mission Critical Operational Reports Text Machine Learning,, Statistical Analysis (R) Machine Data Insights (e.g. In supply chain) Financial Reporting & Extract Operational Intelligence IT App & System Logs & Config. Index & Search Operational Intelligence(Splunk UI) 2013 Cisco and/or its affiliates. All rights reserved. 8

Shared Data Rich Analytics Engineering Advanced Services Cisco Services Marketing Enterprise Platform(s) IT Sales Security Finance Supply Chain 2013 Cisco and/or its affiliates. All rights reserved. 9

Enterprise Data Lake Metadata driven utilities to automate ingestion of Data Access Management Driven by Metadata Scalable Cost Effective 2013 Cisco and/or its affiliates. All rights reserved. 10

Hadoop Use Cases EDS CSTG Organization (vs) Adoption Level -icam -Party Ranking Service -Teradata ETL Offload -Data Lake Production -Connected Analytics Network Deployment (CAND) -Smart Call Home -Cloud Consumption (Sentinel) -NOS Online -Network SSOT Pipeline Marketing -Multi-Channel Scoring -Automatic Qualified Leads CWCS Metadata -Content Auto-Tagging CITS -Cisco Partner Annuity Initiative -Social Media Services GIS -Collaboration Dashboard -Item, BOM & Compliance Data Analytics Legal Supply Chain -Data Warehouse Expansion -Measurement -ACTS -TST 2013 Cisco and/or its affiliates. All rights reserved. 11

Cisco IT Use Cases for Hadoop in Production Data Platform Option to Reduce Cost Marketing & Content Management Services Risk & Compliance Migrate ETL Processing from EDW (Teradata) Data Lake & Adhoc Data Analysis Data Archiving Customer Segmentation Multi-Channel Scoring Content Autotagging Smart Analytics Offerings Service Opportunity Identification Organization Network Analytics Engineering Source Code Monitoring 2013 Cisco and/or its affiliates. All rights reserved. 12

Hadoop Distribution: MapR Advantage(s) for Cisco IT High Availability Distributed Name Node Snapshots Volume Based Disaster Recovery Performance Higher performance and fewer nodes ($) Operational Cost / Productivity HBase (MapR DB) and Hadoop on the same cluster NFS (Fully Read & Write) Multiple simultaneous versions on same cluster 2013 Cisco and/or its affiliates. All rights reserved. 13

Thank You 2013 Cisco and/or its affiliates. All rights reserved. 14