Enterprise Operational SQL on Hadoop Trafodion Overview

Similar documents
Trafodion Operational SQL-on-Hadoop

Cloud Scale Distributed Data Storage. Jürmo Mehine

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

How To Handle Big Data With A Data Scientist

Cloud Big Data Architectures

So What s the Big Deal?

REAL-TIME BIG DATA ANALYTICS

Open Source Technologies on Microsoft Azure

Big Data and Data Science: Behind the Buzz Words

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

How Companies are! Using Spark

Luncheon Webinar Series May 13, 2013

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

The Internet of Things and Big Data: Intro

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Dominik Wagenknecht Accenture

Information Builders Mission & Value Proposition

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

Big Data Management and Security

INTRODUCTION TO CASSANDRA

Can the Elephants Handle the NoSQL Onslaught?

Evaluator s Guide. McKnight. Consulting Group. McKnight Consulting Group

Oracle Big Data SQL Technical Update

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Big Data Management in the Clouds. Alexandru Costan IRISA / INSA Rennes (KerData team)

BIG DATA: STORAGE, ANALYSIS AND IMPACT GEDIMINAS ŽYLIUS

NoSQL Databases. Nikos Parlavantzas

Thomas Baumann Swiss Mobiliar Bern, Switzerland

Applications for Big Data Analytics

Big Data Big Data/Data Analytics & Software Development

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Peninsula Strategy. Creating Strategy and Implementing Change

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Big Data Technologies Compared June 2014

NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

MongoDB in the NoSQL and SQL world. Horst Rechner Berlin,

Lecture Data Warehouse Systems

Hadoop in the Enterprise

How To Scale Out Of A Nosql Database

Big Data and Industrial Internet

Oracle Database 12c Plug In. Switch On. Get SMART.

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

A very short talk about Apache Kylin Business Intelligence meets Big Data. Fabian Wilckens EMEA Solutions Architect

Big Data and Hadoop for the Executive A Reference Guide

[Hadoop, Storm and Couchbase: Faster Big Data]

Next-Gen Big Data Analytics using the Spark stack

Big Data. Facebook Wall Data using Graph API. Presented by: Prashant Patel Jaykrushna Patel

Challenges for Data Driven Systems

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Choosing The Right Big Data Tools For The Job A Polyglot Approach

TRAINING PROGRAM ON BIGDATA/HADOOP

Introduction to Apache Cassandra

Splice Machine: SQL-on-Hadoop Evaluation Guide

Comparing SQL and NOSQL databases

Sentimental Analysis using Hadoop Phase 2: Week 2

NoSQL for SQL Professionals William McKnight

Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May Santa Clara, CA

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Apache Trafodion. Table of contents ENTERPRISE CLASS OPERATIONAL SQL-ON-HADOOP

ITG Software Engineering

Self-service BI for big data applications using Apache Drill

BIG DATA What it is and how to use?

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

A modern, flexible approach to Hadoop implementation incorporating innovations from HP Vertica & IDOL

Il mondo dei DB Cambia : Tecnologie e opportunita`

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

NOSQL DATABASES AND CASSANDRA

Upcoming Announcements

Moving From Hadoop to Spark

Self-service BI for big data applications using Apache Drill

BIG DATA TOOLS. Top 10 open source technologies for Big Data

Navigating the Big Data infrastructure layer Helena Schwenk

Getting Started Practical Input For Your Roadmap

Actian SQL in Hadoop Buyer s Guide

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

BIG DATA TRENDS AND TECHNOLOGIES

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

Hadoop2, Spark Big Data, real time, machine learning & use cases. Cédric Carbone Twitter

Introduction to Polyglot Persistence. Antonios Giannopoulos Database Administrator at ObjectRocket by Rackspace

Ubuntu and Hadoop: the perfect match

Transcription:

Enterprise Operational SQL on Hadoop Trafodion Overview Rohit Jain Distinguished & Chief Technologist Strategic & Emerging Technologies Enterprise Database Solutions Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Agenda Directions in Hadoop Directions in NoSQL The case for Enterprise Operational SQL on Hadoop HP s Enterprise Operational SQL on Hadoop Solution Trafodion 2 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Directions in Hadoop Characteristics Lower cost BI & Analytics Open Source & proprietary Eco-system Big Data Volume Elastic Scalability Unstructured Lower cost Software Server & storage MapReduce Machine Learning Stinger Text / Document Images Social media Video email Audio Mobile Enterprise Integrated Hadoop Data Lake Real-time Analytics Internet of things Storm Enterprise Readiness & Manageability Apache Yarn HP Data Services Manager Now available on the Vertica Marketplace Kiji 3 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

and what s missing What s missing Integration of structured, semistructured, and unstructured support Operational transactional workloads Using Hadoop for all operational SQL needs Not Only Big Data Free at last! Capture data directly into open file structures Open distributed HDFS structures HBase & Hive Accessible for reporting & analytics with no latency Structured Semi- structured Unstructured Transaction Item id Description Cost Price TV Book Type Display Size Resolution Brand Model 3D ISBN Author Publish Date Format Dept Image Review Add item BEGIN WORK INSERT item into Trafodion table ITEM (item_id, desc, cost, price, ) INSERT item attributes for TV or book into HBase table ITEM_ATTR as col-value pairs using item_id END WORK Queries SELECT all TVs WHERE Price > 2000 and Type = Plasma and Display Size > 50 and customer sentiment is very positive Orders needing transactional support across Order and Order Detail Backend operational workloads Order tracking, supply chain, inventory control, 4 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Directions in NoSQL and what s missing Characteristics Big Data Volume Elastic Scalability Lower cost Lower cost Software Server & storage Variety: Semi-structured & Unstructured Distributed across data centers Eventual consistency What s missing Full ACID transactions SQL querying capability Joins Data in open HDFS except for HBase Velocity Key-value (Riak, Redis) Document JSON / BSON (MongoDB, Couchbase) Column families (Cassandra, HBase) Graph (Neo4j, Giraph, Titan) Low Latency High Availability Schema Flexibility In-memory 5 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

The Case for Enterprise Operational SQL-on-Hadoop Sector Road Map: SQL-on-Hadoop platforms in 2013 Joseph Turian, March 20, 2013 If a strong player or two emerges in the category, it will completely shake up the big data and database landscape. If Hadoop were operational, it could be used to power websites and store transactions. Traditional SQL databases would no longer be necessary. The data stack would be significantly simplified. An operational database offers write access, not just read access, to data. However, there are other key features for an operational database: concurrency, interactive write speed, and distributed transactional support (guarantees about data consistency). Currently no existing SQL-on-Hadoop solution satisfies these requirements. 5 Reasons Hadoop is Kicking Can and Taking Names Mike Gualtieri, October 22, 2013 #5 The future of Hadoop is real-time and transactional. The key commercial vendors are focusing on fast SQL access, real-time streaming, and manageability features that enterprises demand. The groundwork is being laid for an eruption in data management technologies as Hadoop sneaks its way into the transactional database market. The Future of Hadoop: What Happened & What's Possible? Doug Cutting, Oct 30 2013 So I think the prediction we can make here is that it is inevitable that we will see just about every kind of workload be moved to this platform even Online Transaction Processing. 6 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Trafodion Enterprise Operational SQL on Hadoop NonStop SQL/MX Neoview SeaQuest Open distributed HDFS Semi-structured & unstructured support Schema flexibility Elastic scalability Automatic rebalancing Replication for High Availability (k-safety) Disaster Recovery (via MapR) Column level access control Column level encryption Lower cost cheap storage & servers Space quotas (via MapR) Huge open source & proprietary eco-system Versioning snapshot support & incremental data replication Cloud enabled HP Cloud Services OpenStack Industry trend towards Enterprise Hadoop Lake OLTP K/V & document stores OLTP and ODS on Hadoop Unstructured analytics Trafodion Can join Trafodion, HBase, Hive tables in a single statement Structured OLTP through EDW One of the most powerful database engines in the industry for OLTP and EDW Full ANSI SQL support Full ACID transactional support for multirow, multi- table, & multi-region updates Support for nested loop, merge, hash joins Structured tables, indexes, views Incremental equal height histograms for better execution plans Efficient data flow architecture Referential Integrity, Triggers, Grant/Revoke Security support UDFs for Complex Event processing Workload Management Enterprise class Monitoring & Manageability Compound primary keys Encoding column names for compaction Salting to spread updates 7 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Private Beta program launched Seeking early adopters for POCs Available for internal evaluation: Send email to Project.Trafodion@HP.COM for download details www.hp.com/go/trafodion 8 HP PRIVATE Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Thank You Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.