Teradata s Big Data Technology Strategy & Roadmap



Similar documents
BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Artur Borycki. Director International Solutions Marketing

GO BIG WITH DATA PLATFORMS: HADOOP AND TERADATA 1700

TERADATA QUERY GRID. Teradata User Group September 2014

Teradata Unified Big Data Architecture

Welcome. Host: Eric Kavanagh. The Briefing Room. Twitter Tag: #briefr

Investor Presentation. Second Quarter 2015

ADVANCED ANALYTICS AND FRAUD DETECTION THE RIGHT TECHNOLOGY FOR NOW AND THE FUTURE

SAS and Teradata Partnership

UNIFY YOUR (BIG) DATA

INVESTOR PRESENTATION. First Quarter 2014

Bringing Intergalactic Data Speak (a.k.a.: SQL) to Hadoop Martin Willcox Director Big Data Centre of Excellence (Teradata

INVESTOR PRESENTATION. Third Quarter 2014

Oracle Big Data SQL Technical Update

Ganzheitliches Datenmanagement

Integrated Big Data: Hadoop + DBMS + Discovery for SAS High Performance Analytics

Luncheon Webinar Series May 13, 2013

Big Data and Your Data Warehouse Philip Russom

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Extend your analytic capabilities with SAP Predictive Analysis

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

VIEWPOINT. High Performance Analytics. Industry Context and Trends

The Future of Data Management

BIG Data Analytics Move to Competitive Advantage

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX

Architecting for the Internet of Things & Big Data

SAP and Hortonworks Reference Architecture

Harnessing the Value of Big Data Analytics

Introducing Oracle Exalytics In-Memory Machine

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

Ramesh Bhashyam Teradata Fellow Teradata Corporation

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

Data Warehouse Hadoop. Shimpei Kodama 2015/9/29

So What s the Big Deal?

CERULIUM TERADATA COURSE CATALOG

Harnessing the Value of Big Data Analytics

Big Data Technologies Compared June 2014

The Internet of Things and Big Data: Intro

Energy Insight from OMNETRIC Group. Achieving quality and speed in analytics with data discovery

HDP Hadoop From concept to deployment.

HIGH PERFORMANCE ANALYTICS FOR TERADATA

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com

Parallel Data Warehouse

Analance Data Integration Technical Whitepaper

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations

Getting Started Practical Input For Your Roadmap

How To Analyze Data In A Database In A Microsoft Microsoft Computer System

Oracle Big Data Strategy Simplified Infrastrcuture

How To Turn Big Data Into An Insight

How To Use Big Data For Business

2015 Ironside Group, Inc. 2

How To Learn To Use Big Data

Oracle Big Data Handbook

Navigating the Big Data infrastructure layer Helena Schwenk

Oracle Database 12c Plug In. Switch On. Get SMART.

IBM BigInsights for Apache Hadoop

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Advanced In-Database Analytics

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

How To Handle Big Data With A Data Scientist

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

InfiniteGraph: The Distributed Graph Database

How To Use Big Data For Telco (For A Telco)

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Efficient Big Data Analytics using SQL and Map-Reduce

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Microsoft Big Data. Solution Brief

Analance Data Integration Technical Whitepaper

The Future of Data Management with Hadoop and the Enterprise Data Hub

Big Data, Start Small! Dr. Frank Säuberlich, Director Advanced Analytics (Teradata International) 26 th May 2015

Data Warehouse as a Service. Lot 2 - Platform as a Service. Version: 1.1, Issue Date: 05/02/2014. Classification: Open

OWB Users, Enter The New ODI World

Best Practices for Hadoop Data Analysis with Tableau

Oracle Big Data Essentials

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Exploring the Synergistic Relationships Between BPC, BW and HANA

MapR: Best Solution for Customer Success

Big Data Can Drive the Business and IT to Evolve and Adapt

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Constructing a Data Lake: Hadoop and Oracle Database United!

Big Data and Your Data Warehouse Philip Russom

Please give me your feedback

Safe Harbor Statement

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

Integrating a Big Data Platform into Government:

Big Data Integration: A Buyer's Guide

Cost-Effective Business Intelligence with Red Hat and Open Source

REAL-TIME BIG DATA ANALYTICS

Achieving Business Value through Big Data Analytics Philip Russom

Transcription:

Teradata s Big Data Technology Strategy & Roadmap Artur Borycki, Director International Solutions Marketing 18 March 2014

Agenda > Introduction and level-set > Enabling the Logical Data Warehouse > Any Data > Any Analytic > Virtual Compute > Summary & conclusions

Big Data Introduction and level-set Big Data

WHAT IS BIG DATA?

BIG DATA IS NOT A TECHNOLOGY

BIG DATA IS NOT THE THREE V S

BIG DATA IS NOT A USE CASE

BIG DATA IS NOT AN ARCHITECTURE

BIG DATA IS A MOVEMENT DEMANDING MORE ANALYTICS ON ALL DATA

CREATE A DATA CULTURE

Data-Driven Business SUCCESS STRATEGIC OPERATIONAL CULTURAL View Develop Focus Accelerate Integrate Measure Empower Build Take Value Foster Leverage Possess

Enhanced customer experience 55 9 Process efficiency 49 15 New products/new business model 42 17 More targeted marketing 41 12 Cost reduction 37 13 Improved risk management 32 9 Monetize information directly 23 9 Regulatory compliance Enhanced security capabilities 17 16 10 13 others 5 3 Big Data Adoption in 2013 Shows Substance Behind the Hype Gartner N = 465; multiple responses allowed 0 10 20 30 40 50 60 70 Percentage of Respondents Business issues now addressing Likely to address (12-14 months)

DATA INSIGHT ACTION Why - Companies who exploit ALL their data achieve competitive advantage How Implement an enterprise data architecture that includes three components: staging, discovery, and DW But you don t throw away what you ve already done and start over...

The four forces are leading to the rise of the Logical Data Warehouse Unified Data Architecture ERP MOVE MANAGE ACCESS Marketing Marketing Executives SCM CRM Images DATA PLATFORM DATA WAREHOUSE Applications Business Intelligence Operational Systems Customers Partners Audio and Video Data Mining Frontline Workers Machine Logs DISCOVERY PLATFORM Math and Stats Business Analysts Text Data Scientists Languages Web and Social Engineers SOURCES ANALYTIC TOOLS & APPS USERS

Teradata Unified Data Difference

Big Data Big Data Enabling the Logical Data Warehouse

Teradata s technology strategy: enable the Logical Data Warehouse, a.k.a.: Unified Data Architecture Any Data Structured, schemaless or name-value pair Any Analytic Path, graph, affinity, time-series, text, etc., etc. Virtual Compute Transparent Orchestration of Analytic Services throughout the Unified Data Architecture Seamless data synchronisation Simplified Systems Management & Administration 1-click data movement and management throughout the Unified Data Architecture Single pain of glass admin; multiple moving parts that look like one system (and manage themselves wherever possible); proactive monitoring & alerting

Big Data Any Data Big Data

The Internet of Things and the evolution of Information Management Increased ceremony (integrity, query performance) Increased flexibility and load performance Schema on load Key-Value Pair Schema on read

Teradata s Integrated Big Data Appliance is optimised for set-based Analytics on structured data Contextual Analytics Resource Flexibility Always On Corporate memory Deep analytics Data Labs Data refinery Hadoop integration Ad hoc projects Peak workload assist Disaster recovery High availability Archive reporting & retrieval Audit and compliance

can support management and Analytics of name-value pair data today BI tools Source data Schema Weblogs ETL Data Warehouse CLOB SQL + parse/extract functions Load time Runtime Early binding Late binding

with native JSON support coming in Teradata 15.0 SELECT box.mfg_line.product.color box.mfg_line.product.size box.mfg_line.product.prod_id box.mfg_line.product.create_time AS "Color", AS "Size", AS "Prod_ID", AS "Create_Time" FROM mfgtable WHERE CAST(box.MFG_Line.Product.Create_Time AS TIMESTAMP) >= TIMESTAMP'2013-06-16 00:00:00' AND box.mfg_line.product.prod_id = 96; Color Size Prod_ID Create_Time ----- ----- ------- ------------------- Blue Small 96 2013-06-17 20:07:27

Teradata Vital Infrastructure Need to manage and process large volumes of filebased data? We have you covered Aster and Teradata SQL-H Teradata Studio with Smart Loader Value Added So ware from Partners Teradata Viewpoint Teradata Connector for Hadoop (TDCH) Intelligent Start and Stop NameNode Failover Teradata Distribu on for Hadoop (Based on Hortonworks HDP) Op mized hardware for Hadoop BYNET V5 40GB/s InfiniBand interconnect

One solution, Many uses Contextual Analytics Resource Flexibility Always On Corporate memory Unrefined Multi-structured data Current data Archival data Raw data IDW data years 1-5 IDW data years 5-10 Unrefined structured data

Big Data Any Analytic Big Data

Need to move subsets of that data into the Exploration & Discovery environment, without transformation? SQL has been described as Intergalactic Data Speak. It is the lingua franca of relational database technology. But relational theory assumes that ordering doesn t matter - and support for iteration and relationship Analytics is correspondingly weak in SQL. What if we could elegantly extend SQL to include iterative styles of Analytics?

Teradata-Aster: runs MapReduce, Speaks SQL MapReduce-based path Analytics SELECT * FROM npath ( ON ( ) PARTITION BY sba_id ORDER BY datestamp MODE (NONOVERLAPPING) PATTERN ('(OTHER_EVENT FEE_EVENT)+') SYMBOLS ( event LIKE '%REVERSE FEE%' AS FEE_EVENT, event NOT LIKE '%REVERSE FEE%' AS OTHER_EVENT) RESULT ( ) ) n;

Graph Basics Graphs model relationships between objects like people, products, processes, bank accounts Graphs are made up of vertices or nodes (entities) and lines called edges (relationships) that connect them Two Major Categories of Graph Technologies Navigational Graph databases (Neo4J), RDF/SPARQL (IBM, Oracle) Analytical Graph engines (Aster, Google, Hadoop Giraph)

Aster SQL-GR Engine Built on a scalable BSP framework to enable Big Graph Feature Native graph processing Massively scalable, not bound by memory limits Pre-built graph functions Integrated with SQL Designed for Analytics GRAPH Benefits Richer insights with powerful Graph processing Large scale graph processing with best price performance Brings Graph processing to SQL audience

Teradata-Aster s SNAP Framework will soon enable more Analytic engines, more native data stores TEXT T STATS PATH SQL MAP REDUCE GRAPH SNAP FRAMEWORK INTEGRATED OPTIMIZER INTEGRATED EXECUTER UNIFIED SQL INTERFACE STORAGE SYSTEM AND SERVICES ROW STORE COLUMN STORE FILE STORE

Big Data Virtual Compute Big Data

Virtual Compute Capability Enabling the UDA Vision TERADATA DATABASE HADOOP TERADATA ASTER DATABASE TERADATA DATABASE GRID ASTER Remote, push-down processing in Hadoop Bi-directional data movement Leverage Hive query language (push foreign grammar) Results returned to Teradata for additional processing Leverage SQL-MR functions in Aster Pass SQL-MR syntax/grammar to Aster Push local TD table for remote processing SQL-MR (e.g. npath, Sessionize) functions executed in Aster Teradata to Teradata SQL sub-query sent to Teradata Database appliance Additional processing using data from appliance in Teradata IDW Leverage GRID compute (SAS, Perl, Python, Ruby, R) Data streamed from TD to GRID nodes for processing Isolates compute resource use and potential faults from database

Remote Processing On Hadoop Leverage data platform resources, reduce data movement Query through Teradata Sent to Hadoop through Hive MapReduce processing on Hadoop Results returned to Teradata Additional processing joins data in Teradata Final results sent back to application/user Available in Teradata 15.0!

Execute SQL-MR Functions In Aster Leverage pre-packaged functions in Aster Query through Teradata SQL-MR request sent to Aster Sessionize function performed in Aster Results returned to Teradata Additional processing using session results in Teradata Final results sent back to application/user Available in a future release

Big Data Summary and conclusions Big Data

Teradata s technology strategy: enable the Logical Data Warehouse, a.k.a.: Unified Data Architecture Any Data Any Analytic Virtual Compute Seamless data synchronisation Simplified Systems Management & Administration Name-value pair operators (available now) JSON (Teradata 15.0) Aster File System (Aster 6.0) BSP-based Graph Engine (Aster 6.0) More Analytic engines coming to the Aster SNAP framework soon Fabric-Based Computing (available now with further enhancements & extensions planned) Transparent Orchestration (starting in Teradata 15.0) Unity Data Mover & Unity Ecosystem Manager (available now for multi-teradata system environments, support for Aster, Hadoop coming soon) Viewpoint provides Single pain of glass management and administration (available now with further enhancements & extensions planned)

The UDA provides cost-effective storage for any data

Why UDA Architecture Framework is important Hadoop JSON Store NoSQL Store

BEST TECH TO ENABLING A DATA CULTURE IS UNIFIED DATA ARCHITECTURE

THE BIG ALL DATA IS DATA A MOVEMENT

UNIFIED DATA ARCHITECTURE MOVEMENT