A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities



Similar documents
Enabling Big Data with Cloud. Go faster Reduce risk Scale as you grow Avoid mistakes

Big Data Tools: Game Changer for Mainstream Enterprises

Traditional BI vs. Business Data Lake A comparison

UNIFY YOUR (BIG) DATA

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere

Integrating Netezza into your existing IT landscape

Big Data Success Step 1: Get the Technology Right

Service Oriented Data Management

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

Oracle Big Data Strategy Simplified Infrastrcuture

More Data in Less Time

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

Big Data Integration: A Buyer's Guide

The Future of Data Management

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

IBM Data Warehousing and Analytics Portfolio Summary

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

Ten Cornerstones of a Modern Data Warehouse Environment

VIEWPOINT. High Performance Analytics. Industry Context and Trends

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

The Enterprise Data Hub and The Modern Information Architecture

How to Enhance Traditional BI Architecture to Leverage Big Data

A Tipping Point for Automation in the Data Warehouse.

Establish and maintain Center of Excellence (CoE) around Data Architecture

Accelerate your Big Data Strategy. Execute faster with Capgemini and Cloudera s Enterprise Data Hub Accelerator

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

The Principles of the Business Data Lake

The Lab and The Factory

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Bangkok, Thailand 22 May 2008, Thursday

Advanced Analytic Dashboards at Lands End. Brenda Olson and John Kruk April 2004

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Three Open Blueprints For Big Data Success

Deploying Governed Data Discovery to Centralized and Decentralized Teams. Why Tableau and QlikView fall short

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Cisco IT Hadoop Journey

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Exploring the Synergistic Relationships Between BPC, BW and HANA

Parallel Data Warehouse

Data Virtualization. Paul Moxon Denodo Technologies. Alberta Data Architecture Community January 22 nd, Denodo Technologies

Native Connectivity to Big Data Sources in MSTR 10

Welcome to The Future of Analytics In Action IBM Corporation

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

Next Generation Data Warehousing Appliances

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Data Integration Checklist

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

Ganzheitliches Datenmanagement

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting

NAVIGATING THE BIG DATA JOURNEY

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

A HIGH-PERFORMANCE, SCALABLE BIG DATA APPLIANCE LAURA CHU-VIAL, SENIOR PRODUCT MARKETING MANAGER JOACHIM RAHMFELD, VP FIELD ALLIANCES OF SAP

Presenters: Luke Dougherty & Steve Crabb

Making Sense of the Madness

Accelerate Data Loading for Big Data Analytics Attunity Click-2-Load for HP Vertica

MDM and Data Warehousing Complement Each Other

An Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP Oracle ESG Data Systems Architecture

Bringing Big Data into the Enterprise

Understanding the Value of In-Memory in the IT Landscape

Big Data Technologies Compared June 2014

Making Sense of Big Data in Insurance

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Endeca Introduction to Big Data Analytics

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Driving Peak Performance IBM Corporation

Big Data and Analytics in Government

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Cost-Effective Business Intelligence with Red Hat and Open Source

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Architecting your Business for Big Data Your Bridge to a Modern Information Architecture

ENTERPRISE BI AND DATA DISCOVERY, FINALLY

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

Composite Software Data Virtualization Five Steps to More Effective Data Governance

Big Data Defined Introducing DataStack 3.0

Escape from Data Jail: Getting business value out of your data warehouse

Evolution to Revolution: Big Data 2.0

Business Usage Monitoring for Teradata

Integrating SAP and non-sap data for comprehensive Business Intelligence

RapidDecision EDW: THE BETTER WAY TO DATA WAREHOUSE

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

HP Business Intelligence Solutions. Connected intelligence. Outcomes that matter.

How To Handle Big Data With A Data Scientist

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Getting Started Practical Input For Your Roadmap

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

Achieving Business Value through Big Data Analytics Philip Russom

The Future of Data Management with Hadoop and the Enterprise Data Hub

Production ready hadoop. By Deepak Rao Na,onal Head Datawarehousing Bajaj Finserv

Empowering Users with Self-Service Analytics and Agile BI. MicroStrategy World 2014 Cynthia Wagner BI Solution Architect, BMC Software

Business Intelligence. Advanced visualization. Reporting & dashboards. Mobile BI. Packaged BI

The ESB and Microsoft BI

Business Intelligence In SAP Environments

Detecting Anomalous Behavior with the Business Data Lake. Reference Architecture and Enterprise Approaches.

Three Reasons Why Visual Data Discovery Falls Short

Building Your Big Data Team

Transcription:

A Whole New World Big Data Technologies Big Discovery Big Insights Endless Possibilities Dr. Phil Shelley

Query Execution Time Why Big Data Technology? Days EDW Hours Hadoop Minutes Presto Seconds Milliseconds In-Memory Gbytes Tbytes 100 s Tbytes Pbytes Data Size

ETL is obsolete ETL is a half century old Transaction ETL BI Data Hub Legacy BI Transaction Real-Time Analytics

Data Silos to Data Insight Single Copy Single Point of Truth Structured for Analysis Loaded in near-real-time Analytics and Discovery impossible or very difficult and slow Analytics and Discovery easier and fast with nosql

Why Big Data From an IT Perspective Responsiveness Business user no-longer have to wait for an ETL setup or change Data is available without IT involvement No concept of building a schema or cube before the business can use the data ETL complexity is needed no-longer DATA HUB Source Once Re-Use many times ETL changes to ELTTTTTT Latency in data is a thing of the past Analysis is routinely possible within minutes of data creation Long-running workload Can be eliminated and executed at any time Run times are a fraction of the original clock-time Batch processing on mainframes or Data Warehouse workload Moved to Hadoop Run 10, 50, even 100 times faster Intelligent Archive Put your archives/tape data on Hadoop and make it Intelligent Archive with the ability to run analytics or join it with other data Modernize Legacy Mainframe MIPs reduction has very attractive ROI Move Data Warehouse workload Reduce Cost Go Faster Avoid traditional warehouses The perpetual MDM challenge can take a completely different path

Why Big Data From a Business Perspective Focus on questions that bring value from the data... Test hypotheses without data or system restrictions Capture and link data from many and disparate sources Focus on discovery Clustering Heat mapping Linkages Text Analytics Drive the business based on those discoveries Now the inquisitive mind is needed (data scientist) I wonder if, then test it and measure the outcome Ask the previously impossible questions Almost unlimited compute and storage Capture and discover value from the full history & granularity The trend is Real-Time Automated discovery and response

DATA IN DATA OUT EDW or In-Memory High-Speed Marts Making Hadoop a Data Hub is increasingly accepted as a game-changing trend NoSQL and in-memory are driving massive change in the Data Warehousing space, shrinking the role of traditional Data Warehouses Batch Processing Area ESB Temporary Marts GOLD Data Hub Persisted Marts User Access 100% Source Data Leveraging the Source-Once and Re-Use approach, drives efficiency, reduces data silos reduces latency and time to value, massively improves analytics and discovery, greatly reduces costs Real-Time Analytics Real-Time Databases Transactional System Sources DATA IN DATA OUT

Layered Data Architecture Transactional Systems HADOOP Hbase PORTALS ESB Staging Incoming Persisted Full History Full Detail Concatenated De-Normalized Data Qualty GOLD Single Point of Truth De-Normalized Concatenated Segmented Use-Defined Marts Marts Persisted Marts Temporary Marts Export Marts Security-Driven Marts Batch working area EDW Appliance In-Memory USERS High-Speed SQL Drill-Down and Interaction

Use-Case Examples Performance Modernization Batch processing ETL replacement Cost reductions

Use-Case Examples Performance Long running batch jobs Slow ETL Slow queries due to system load Analytics conflicting with transactional systems Data latency System unable to meet production schedules Job dependencies cause schedule unreliability

Use-Case Examples Modernization Retire obsolete systems Language of code is hard to support Skill sets are becoming harder to find Applications hard to maintain Older systems will not scale with the business

Use-Case Examples Batch processing Faster Easier to maintain Cheaper Mainframe COBOL HP Unix AIX Sun DB2 Oracle

Use-Case Examples ETL replacement A sweet spot for Hadoop Massive performance improvement Shrink ETL footprint Reduce licensing costs Reduce the number of data copies Reduce latency from data creation to data use Build a data integration hub Move from ETL to ELttttt...

Use-Case Examples Cost reductions Hadoop can be 100% open-source Very large cost reduction opportunities Mainframe MIPS DB2 Oracle ETL tools EDW Teradata, Netezza

Business Value/Cost Savings Generate and Prioritize Use-Cases Complexity/Cost/Time to Implement

Consider Your Options

Program Macro View POC to discover Data Sourcing, Integration and Use-Case Challenges Establish Landscape Source Data for Initial Use-Cases Establish Governance and Security Implement Initial Use-Cases

Tracks and Milestones Establish Landscape Build dev. Build Prod. Apply Security, Access Methods & Controls Determine Access and Visualization Tools Implement Initial Visualization Tools Source Data for Initial Use-Cases Identify Data One-Time Source Dev Data -- Establish Data Architecture -- Build Sourcing and QA Scripts Initial Load -- Schedule Incremental Validate Sourcing Establish Governance and Security Design Data Source Standards Establish Governance Roles Design Security and Access Controls Validate Governance and Security Implement Initial Use-Cases POC of 1 st Use-Case -- Design Use-Cases Develop Use-cases Validate Use-Cases

Acceleration Establish Landscape Build dev. Build Prod. Apply Security, Access Methods & Controls Determine Access and Visualization Tools Implement Initial Visualization Tools Source Data for Initial Use-Cases Identify Data One-Time Source Dev Data -- Establish Data Architecture -- Build Sourcing and QA Scripts Initial Load -- Schedule Incremental Validate Sourcing Establish Governance and Security Design Data Source Standards Establish Governance Roles Design Security and Access Controls Validate Governance and Security Implement Initial Use-Cases POC of 1 st Use-Case -- Design Use-Cases Develop Use-cases Validate Use-Cases Start ASAP

Big Data - Fast Start Know where to start Know how to Start Know how to avoid traps How to integrate Big Data tools into your Enterprise Vendor agnostic Understand how to think Big Data, but start small

Starting to think about Big Data? Wondering how to put all the pieces together?

Many companies are considering how to begin their Big Data journey Over 80% are uncertain about how to start What are the best use-cases Fear of making strategic and costly mistakes stall most companies Uncertainty prevails about options, products and services

Which Hadoop Distribution Do I need other products or tools What vendors should I go with Will I end-up locked-in in the fast moving space What about the skills of my team and how to close gaps What options do I have to outsource or supplement my internal team

Newton Park Partners gives you the start you need An interactive model to bring you to the point of starting your Big Data Journey Workshops to mature ideas and use-cases to the point of POC - Fast Led by Dr. Phil Shelley, NPP brings years of Big Data planning and operational expertise What works and what does not As a former CIO/CTO/CEO leading your kick-start, you are sure to be talking the same language Vendor agnostic, just simple IT management experience in the Big Data space

NPP will walk you through the steps to your Big Data ventures... Background on Big Data tools and trends Discovery workshop with your team Human Talent review Governance and Security guidance Enterprise Data Architecture Vendors, partners and tool selection Use-case generation for your 1st POC

In one month from start to POC-ready Executive alignment Are you ready to start? Discovery Use-case generation Talent assessment Enterprise Data Architecture alignment Governance and Security alignment POC options identified

Workshop Agenda (Example) Duration Focus Attendees 3 Hours Big Data overview Use-case examples How Big Data tools can be used Company objectives Unmet business needs CIO CIO direct reports Business Unit Heads 3 Hours Use-case generation Use-case prioritization CIO CIO direct reports Multiple 1-hour Sessions Hadoop overview Why Hadoop and nosql Building Hadoop and nosql environments Data integration Governance and data quality Infrastructure requirements Hadoop operations HR and talent requirements Vendor and commercial product options BI tools IT functional leaders As applicable to each topic Project manager(s)

Phil Shelley phil@consultnpp.com