Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing



Similar documents
So What s the Big Deal?

How To Handle Big Data With A Data Scientist

Structured Data Storage

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

Lecture Data Warehouse Systems

Cloud Scale Distributed Data Storage. Jürmo Mehine

Preparing Your Data For Cloud

How To Use Big Data For Telco (For A Telco)

Introduction to NOSQL

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Big Data and Data Science: Behind the Buzz Words

Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island


Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Challenges for Data Driven Systems

Big Data Technologies. Prof. Dr. Uta Störl Hochschule Darmstadt Fachbereich Informatik Sommersemester 2015

This Symposium brought to you by

NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015

INTRODUCTION TO CASSANDRA

REAL-TIME BIG DATA ANALYTICS

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

Oracle Big Data SQL Technical Update

Big Data Analytics - Accelerated. stream-horizon.com

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

NoSQL Database Options

Enterprise Operational SQL on Hadoop Trafodion Overview

NoSQL for SQL Professionals William McKnight

NoSQL Systems for Big Data Management

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

CREATING HIGH PERFORMANCE BIG DATA APPLICATIONS WITH THE JAVA PERSISTENCE API

Can the Elephants Handle the NoSQL Onslaught?

SAP and Hortonworks Reference Architecture

Benchmarking and Analysis of NoSQL Technologies

Applications for Big Data Analytics

Navigating the Big Data infrastructure layer Helena Schwenk

Big Data Technologies Compared June 2014

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Cost-Effective Business Intelligence with Red Hat and Open Source

Big Data. Facebook Wall Data using Graph API. Presented by: Prashant Patel Jaykrushna Patel

An Open Source NoSQL solution for Internet Access Logs Analysis

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Architectures for Big Data Analytics A database perspective

Putting Apache Kafka to Use!

WA2192 Introduction to Big Data and NoSQL EVALUATION ONLY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist

nosql and Non Relational Databases

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

Database Scalability and Oracle 12c

NoSQL Database Systems and their Security Challenges

bigdata Managing Scale in Ontological Systems

Scalable Architecture on Amazon AWS Cloud

Bringing Big Data to People

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

Вовченко Алексей, к.т.н., с.н.с. ВМК МГУ ИПИ РАН

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

GigaSpaces Real-Time Analytics for Big Data

Hacettepe University Department Of Computer Engineering BBM 471 Database Management Systems Experiment

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

An Approach to Implement Map Reduce with NoSQL Databases

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

How To Scale Out Of A Nosql Database

Choosing The Right Big Data Tools For The Job A Polyglot Approach

3 Case Studies of NoSQL and Java Apps in the Real World

Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May Santa Clara, CA

NoSQL replacement for SQLite (for Beatstream) Antti-Jussi Kovalainen Seminar OHJ-1860: NoSQL databases

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

How To Write A Database Program

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

Big data blue print for cloud architecture

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS

Cloud Big Data Architectures

Performance Management in Big Data Applica6ons. Michael Kopp, Technology

Big Data: Are You Ready? Kevin Lancaster

NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre

MongoDB in the NoSQL and SQL world. Horst Rechner Berlin,

A Distributed Storage Schema for Cloud Computing based Raster GIS Systems. Presented by Cao Kang, Ph.D. Geography Department, Clark University

Database Performance with In-Memory Solutions

TRAINING PROGRAM ON BIGDATA/HADOOP

Teradata s Big Data Technology Strategy & Roadmap

Daniel J. Adabi. Workshop presentation by Lukas Probst

The Future of Data Management

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

Business Intelligence for Big Data

Performance Testing of Big Data Applications

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

Data Warehouse: Introduction

Parallel Data Warehouse

Transcription:

Evaluating NoSQL for Enterprise Applications Dirk Bartels VP Strategy & Marketing

Agenda The Real Time Enterprise The Data Gold Rush Managing The Data Tsunami Analytics and Data Case Studies Where to go from here

The Real Time Enterprise It is about data management

The Intelligent Economy is driving the Database Market Mobility, Apps & Devices Big Data Cloud Services Pervasive Analytics

Big Data or Bust the collection of technologies that handle data volumes well beyond the inflection point when traditional systems fail because requirements for processors, memory and disks escalate as data volume grows. data volume and performance requirements become significant design and decision factors for implementing a data management and analysis system volume, complexity, speed, changing Analysis Model, are increasingly becoming important design factors in enterprise architectures (Gartner)

The Data Gold Rush The Hidden Value in Big Data

You RFID Cards Fitness E Commerce Mobile Communication Telematic Health Monitors

Smart Networks

The Social Web

Managing the Data Tsunami An ever changing landscape

Database Landscape according to 451 Group

Traditional Data Management A Look into the rear view Mirror Transactional Application Transactional Application RDBMS (OLTP) Business Intelligence Application Data Warehouse (OLAP) OLTP ACID transactional applications Relational schema, SQL OLAP BI read only applications Proprietary schemas, SQL Some ETL

Key Web Application Requirements for Data Management Always available Inexpensive Content rich Elastic Infinite Scale Partition Tolerance

Hitting the Wall with RDBMS Complex Availability Prohibitive Costs Rigid data model Inefficient Information Retrieval Limited Scale Not Partition Tolerant

NoSQL Core Value Proposition Web NoSQL Horizontal Scale Out Massive Elastic Yes Dev Ops Public Cloud Commodity server Critical Schemaless Sufficient Always available Critical Partition Tolerant Critical Eventually Consistent Acceptable

NoSQL = Web Scale Databases Type Use Case Limitations Key Products Key Value Tabular / Column Document Store In memory cache, web site analytics, log file analytics Date mining, analytics Document management, CRM Simple, small set of data types, limited transaction support Limited transaction support Limited transaction support, small set of data types Redis, Scalaris, Tokyo Cabinet Google Big Table, Hbase, Cassandra CouchDB, MongoDB, Riak Social Web 2.0 content Scale Out for commodity hardware Simple (one dimensional vectors) The dumbing down of the database No transactions, no complex queries, no schema Table Source: GigaOM Pro 2010

NoSQL & CAP Theorem Consistency Availability Partition Tolerance Sacrifice Consistency for Scalability and Availability Enterprise applications need transactions

NoSQL & Schema schemaless partition key:value. find by key( ) + boat load of application code n disconnected Soft schema.find by key( ) or select from where { operators } + JPA partition key:graph connected n

NoSQL Core Value Proposition Web NoSQL NoSQL for Enterprise Apps Horizontal Scale Out Massive How much scale? Elastic Yes Probably Dev Ops Public Cloud Not. IT runs it, possibly Private Cloud Runs on inexpensive Critical May be commodity server Schemaless Sufficient Problematic Always available Critical Critical Partition Tolerant Critical Critical Eventually Consistent Acceptable Probably Not Acceptable

Analytical Model Domain Model Apples vs. Oranges RT Enterprise Web / NoSQL SQL Linked (soft schema), Graph Embedded (schemaless) Programming Complex events (CEP) Custom code SQL APIs Operational Model Objects, MapReduce, R, Graphs, JPA MapReduce, Custom, special skill sets Schema Transactions ACID & CAP CAP ACID Tooling JDBC, JPA, Web Service Scripted queries JDBC Administration 24/7 File dumps 24/7 Monitoring SNMP cron jobs SNMP JDBC, ORM Data Streaming/ETL, trans. Web content Transactional

Analytics and Data Tools of the trade

in-database Analytics Data Stream 1 Data Stream n ETL Application transactional Application NoSQL NoSQL Operational Data Store Data Store Data Store ETL Application Business Intelligence Application Data Warehouse (OLAP) ACID & CAP Soft Schema Real time Scale out In-database Analytics Hadoop / MapReduce

Understanding the Past - Traditional business intelligence Quantitative Statistical analysis SAS, BOBJ, etc. Relational database schema New Requirements More data More complexity Now in real time and the Presence

Predicting the Future Demand Forecast, Network Provisioning, Qualitative Predictive analysis tools, e.g., R / Revolution Analytics Soft schema New Requirements Massive data / ETL Meta models Near real time

Finding Gems & Needles in the Hay Stack Fraud detection, security breach investigations, Intelligent Rules engines, Petri Nets / Hyper graphs, More special purpose algorithms/customization Soft schema Data scientists New requirements Definitely meta models Massive data / ETL Near real time

Looking for Something Hadoop / Map Reduce Algorithmic, Free programmable Great for non schematic data, e.g., log files Brute force method New requirements Custom algorithms Massive data / ETL Massive scale out

Case Studies

Verite Group Network Intelligence

Verizon Fraud Detection System 3 tier Analysis Model Witness COPS Detectives 1.80 billion inserts / 10h 5 million cases built

Where to go from here

Understand your Design Challenges How much Web scale is required? Data volume, concurrency, scale out architecture, How Big is your data? Complexity, data ingestion rate, compression, Evolving analytics What is the question again? Analysis type, meta data,

About Versant Helping innovators to build high performance database applications NASDAQ:VSNT since 1996 Telecommunications, Defense, Financial Services, Transportation, Manufacturing, other. Redwood City, CA Bangalore Hamburg Shanghai Dirk Bartels dbartels@versant.com