Oracle NoSQL Database Product Overview. Robert Greene Product Manager / Strategist Nov 19, 2014



Similar documents
Oracle NoSQL Database Overview & Use Cases

Oracle Database 12c Plug In. Switch On. Get SMART.

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

Programming Hadoop 5-day, instructor-led BD-106. MapReduce Overview. Hadoop Overview

Constructing a Data Lake: Hadoop and Oracle Database United!

How To Scale Out Of A Nosql Database

Open Source Technologies on Microsoft Azure

X4-2 Exadata announced (well actually around Jan 1) OEM/Grid control 12c R4 just released

Benchmarking Couchbase Server for Interactive Applications. By Alexey Diomin and Kirill Grigorchuk

NoSQL for SQL Professionals William McKnight

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

Comparing SQL and NOSQL databases

NoSQL and Hadoop Technologies On Oracle Cloud

Integrating Big Data into the Computing Curricula

How To Use A Data Center With A Data Farm On A Microsoft Server On A Linux Server On An Ipad Or Ipad (Ortero) On A Cheap Computer (Orropera) On An Uniden (Orran)

Can the Elephants Handle the NoSQL Onslaught?

Oracle Big Data Fundamentals Ed 1 NEW

Big Data Analytics in LinkedIn. Danielle Aring & William Merritt

Вовченко Алексей, к.т.н., с.н.с. ВМК МГУ ИПИ РАН

How To Store Data On An Ocora Nosql Database On A Flash Memory Device On A Microsoft Flash Memory 2 (Iomemory)

Hadoop & Spark Using Amazon EMR

Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Amazon Redshift & Amazon DynamoDB Michael Hanisch, Amazon Web Services Erez Hadas-Sonnenschein, clipkit GmbH Witali Stohler, clipkit GmbH

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

Apache HBase. Crazy dances on the elephant back

Practical Cassandra. Vitalii

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Big Data Course Highlights

Architecting for the Internet of Things & Big Data

Safe Harbor Statement

Oracle Big Data Building A Big Data Management System

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Microsoft Azure Data Technologies: An Overview

Oracle Big Data SQL Technical Update

Azure Data Lake Analytics

Big Data Management and Security

Modernizing Your Data Warehouse for Hadoop

Introduction to Big Data Training

NoSQL Databases. Institute of Computer Science Databases and Information Systems (DBIS) DB 2, WS 2014/2015

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Connecting Hadoop with Oracle Database

Introduction to Apache Cassandra


BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

MySQL and Hadoop: Big Data Integration. Shubhangi Garg & Neha Kumari MySQL Engineering

MongoDB Developer and Administrator Certification Course Agenda

Evaluator s Guide. McKnight. Consulting Group. McKnight Consulting Group

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

In-memory computing with SAP HANA

Safe Harbor Statement

CitusDB Architecture for Real-Time Big Data

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Oracle Big Data Strategy Simplified Infrastrcuture

Tungsten Replicator, more open than ever!

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

How to Choose Between Hadoop, NoSQL and RDBMS

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

Domain driven design, NoSQL and multi-model databases

Big Data and Hadoop with Components like Flume, Pig, Hive and Jaql

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB, Cassandra, and MongoDB

INTRODUCTION TO CASSANDRA

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, UC Berkeley, Nov 2012

Oracle Big Data Handbook

Big Data for Banking. Kaleem Chaudhry Senior Director, Sales Consulting, ASEAN. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

TRAINING PROGRAM ON BIGDATA/HADOOP

Big Data Operations Guide for Cloudera Manager v5.x Hadoop

Enterprise Operational SQL on Hadoop Trafodion Overview

Structured Data Storage

Apache Hadoop: The Pla/orm for Big Data. Amr Awadallah CTO, Founder, Cloudera, Inc.

Comparison of the Frontier Distributed Database Caching System with NoSQL Databases

Unified Big Data Processing with Apache Spark. Matei

Big Data Analytics - Accelerated. stream-horizon.com

Evaluation of NoSQL databases for large-scale decentralized microblogging

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

GigaSpaces Real-Time Analytics for Big Data

Highly available, scalable and secure data with Cassandra and DataStax Enterprise. GOTO Berlin 27 th February 2014

Search Big Data with MySQL and Sphinx. Mindaugas Žukas

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Real-time Big Data Analytics with Storm

Hypertable Goes Realtime at Baidu. Yang Dong Sherlock Yang(

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

Transcription:

Oracle NoSQL Database Product Overview Robert Greene Product Manager / Strategist Nov 19, 2014 Oracle Confidential Internal/Restricted/Highly Restricted

Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle s products remains at the sole discretion of Oracle. 3

Agenda Oracle NoSQL Database Overview Use cases and customer highlight Product features, Performance, API walk thru Costs and Roadmap 4

What is Oracle NoSQL Database less is more 101100101001001 001101010101011 100101010100100 101 Simple Fast Flexible Reliable advanced Key-Value database designed as cost effective, high performance solution for simple operations on collections of data with built in high availability and elastic scale-out. 5

Oracle NoSQL Database Flexible Data Model Hierarchical primary & shard key Automatic data sharding & local indexing BASE & ACID Transactions advanced key-value database 1. KV Application API, Application specific specific opaque opaque values values 2. JSON API, JSON Structures 3. Table API, Tables 6

Oracle NoSQL Database NoSQL for Developers and IT Simple: Fast: Flexible: Setup, Admin, API & Integration Parallel Access & Scale-out Flexible schema & Agile development Reliable: Built-in HA, Predictable Performance 7

Logical Architecture Linear scaling and replicated Application Elastic Auto Sharding (split, add, contract) NoSQL Driver Writes to elected node Shard M Shard R Shard R Shard M Reads from any node in system R M R R Auto re-balance of data on expansion R R M Expand Store and Rebalance R 8

Application Physical Architecture Smart Topology D NoSQL Driver D Agents Machine1 D M 1 R 2 M 3 R 4 A Machine2 R 1 R 2 R 3 M 4 D A Machine3 M 2 R 1 R 4 R 3 A Replica Group 1 Replica Group 2 Replica Group 3 Replica Group 4 9

Oracle NoSQL Database Releases 2011 2012 2013 2014 Release 1.x Major/Minor Key-Value storage Java API ACID & BASE Transactions Replication Distributed Data & Queries Web Admin Console Admin CLI Release 2.0 JSON schemas & AVRO API Streaming Large Object API C API NoSQL Cluster Elasticity Smart Topology Release 2.1 & 2.2 Data Centers Online Rolling Upgrades Index Views Parallel Scans Data CLI BDA as NoSQL appliance Community Edition Support Release 3.0 & 3.1 Unified CLI Secondary Data Centers Table Data Model Secondary Indices Encryption Authentication Authorization Client-side DDL API Thrift-based C API REST API Integration Hadoop KVInputFormat Oracle Loader for Hadoop Integration Oracle Database Oracle Event Processing JMX & SNMP Integration Oracle Coherence Oracle RDF Graph Integration Oracle Enterprise Manager Oracle Big Data SQL 10

Agenda Oracle NoSQL Database Overview Use cases and customer highlight Product features, Performance, API walk thru Costs and Roadmap 11

Oracle NoSQL Database Use Case Summary Web-Scale Personalization Flexibility & Transaction Processing Real-Time Event Processing Point Time Series & Sensor Data Management Point Guaranteed low latency lookups and data capture Real-time events trigger rules that perform low latency lookups for context Capture and management of large volumes of regular and irregular time-based information Advertising, Product Recommendations, Online Catalogs, Social Media, Profile Management, Personalization Web browsing, Shopping Carts Fraud Detection, Risk Analysis, Financial Trading, Medical Monitoring, Factory Automation, Utilities Mgmt, Geo-location based content Stock Quotes, Manufacturing Senor Data Mgmt (QC), Testing Data, Utilities, HW/SW Performance 12

New Recommendation Platform Recommendation Rule Engine Recommend Rules Admin Customers Login User ID Request Elastic Data Store Respond User Info, Billing Info, etc Customer Info Source Systems Corporate Web Page Recommendation Platform 13

Technical Challenges and requirements Scalability Flexibility Performance Support Scale-out capability by adding new nodes, when new recommendation services are added. No additional coding is required, when new user information attributes are added. Recommendation for 20 million users using 350 user information attributes. Performance capability to handle sharply rising requests. Enterprise-grade Support Clear Product Roadmap Key Value Store + Oracle NoSQL Database 14

# of servers Performance Comparison 1. Oracle NoSQL Database with SSDs achieves 26000 tps with 3 nodes. 2. Compared to HBase, Oracle NoSQL Database achieves high performance goals with much smaller number of nodes. 3. Considering high performance goals combined with additional future requirements, the total cost of Oracle NoSQL Database is estimated to be much lower than HBase. 70 60 50 NoSQL HDD NoSQL SSD HBASE HDD 40 30 20 10 Over 20,000 tps can be handled with just 3 nodes 0 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 Transactions per second 15

Agenda Oracle NoSQL Database Overview Use cases and customer highlight Product features, Performance, API walk thru Costs and Roadmap 16

Enterprise Ready Features Elastic BASE Operations Tables / JSON / Binary Online management Data Center Support Secondary Indexes Secure Access Flexible schema Application Application NoSQL DB Driver Application Application NoSQL DB Driver Differentiators ACID transactions Online rolling upgrades Streaming large object support Oracle technology integrated Engineered Systems and Commodity HW Storage Nodes Datacenter A Storage Nodes Datacenter B 17

Enterprise ready -- Integrated out of the box Query NoSQL data from Oracle Database Access NoSQL data from Hadoop for DW and analytics Share data with Oracle Coherence for extensible in-memory cache grid Persist history & event streams for processing with Oracle Event Processing Store & query RDF data using Oracle RDF for NoSQL Replicate changes in Oracle Database to NoSQL DB using Oracle Golden Gate Monitor your NoSQL cluster using Oracle Enterprise Manager 18

Oracle Big Data Management System Data Reservoir + Data Warehouse Cloudera Hadoop Oracle NoSQL Database Oracle R Distribution Oracle Big Data Connectors Oracle Big Data SQL Oracle Database Oracle Database Oracle Industry Models Oracle Industry Models Oracle Advanced Analytics Oracle Advanced Analytics Oracle Spatial & Graph Oracle Spatial & Graph Oracle Event Processing Apache Flume Oracle GoldenGate Oracle Data Integrator Oracle GoldenGate Oracle Event Processing 19

Developer and Admin Tools Standards based tooling SNMP / JMX metrics Oracle Enterprise Manager Cloud ready HTML5 browser admin Command line interface Scripting Query prototyping Data load Easy to use developer API Java, C, REST (Javascript / Python FY15) R, JRuby, Jython community drivers Oracle Confidential Internal/Restricted/Highly Restricted 20

Predictability, Reliability & Support Global, mission-critical application deployment experience Decades of enterprise-grade non-relational database technology Oracle Support available for both Enterprise and Community Edition Designed for Predictability and Manageability Bulk Insert Test Cluster Expansion Test Rolling Upgrade Test 21

YCSB on commodity servers What s the big deal 1.6 billion records 94K insert/sec 25K read/update/sec Low latency Linear scalability 22

Throughput (ops/sec) Average Latency (ms) YCSB on SSD-backed commodity servers What s the big deal Twitter sees ~500M tweets/day This is 75M a minute Capture all tweets with 3 commodity servers 1.25M ops/sec 2 billion records 2 TB of data 95% read, 5% update Low latency, High Scalability 1,400,000 1,200,000 1,000,000 800,000 600,000 400,000 200,000 0 Mixed Throughput 6 (2x3) 12 (4x3) 24 (8x3) 30 (10x3) Cluster Size 4 3 2 1 0 Throughput (ops/sec) Write Latency (ms) Read Latency (ms) 23

Customer benchmarking on SSD Oracle NoSQL beats in-memory grid 10 s of thousands of txn / sec Lower latency than grid cache Persistent storage Predictable SLA s Oracle NoSQL DB Read Latency Terracotta Read latency 24

Developer Centric APIs Language specific client drivers Java, C, *javascript, python, C# REST API available through Oracle Rest Data Services Deployments using Oracle Web Logic Server, Glassfish, Tomcat Directs Web Service calls to NoSQL DB Marshals data returned into JSON format Full CRUD operations across NoSQL Cluster Client-side DDL API Create and Alter tables and schemas inside an application 25

Simple Data Modeling DDL kv-> table create -name User -desc "A sample user table" User-> add-field -type Integer -name userid User-> add-field -type String -name firstame User-> add-field -type String -name lastname User-> primary-key -name userid User-> exit Table User built. kv-> plan add-table -name User wait kv-> plan add-index -name UserSecondary -table User -field firstname field lastname Create Table with id, firstname, lastname columns. API DDL CREATE TABLE complex ( COMMENT "this comment goes into the table metadata" id INTEGER, PRIMARY KEY (id), # id is the pk. this comment is just syntax nested MAP(RECORD( m MAP(FLOAT), a ARRAY(RECORD(age INTEGER)))), address RECORD (street INTEGER, streetname STRING, city STRING, zip INTEGER), friends MAP (STRING), anarray ARRAY (FLOAT), fixedbinary BINARY(5), days ENUM(monday, tuesday, wednesday, thursday, friday, saturday, sunday) NOT NULL DEFAULT tuesday ) Add index on firstname, lastname columns. 26

API CRUD - Write /* Connect to the NoSQL store */ store = KVStoreFactory.getStore (new KVStoreConfig(storeName, hostname + ":" + hostport)); tableapi = store.gettableapi(); /* Insert row, if it is already in the table */ Table table = tableapi.gettable("mytable"); Row row = table.createrow(); row.put("userid", 2); row.put("firstname", "John"); row.put("lastname", "Jameson"); tableapi.put(row, null (a return row), null(a write option));..put, putifpresent, putifabsent, putifversion /* Catch possible Exceptions*/ Durability, Timeout, IllegalArgument, Fault Exceptions 27

API CRUD Keyspace Read /* Read row, Create a primary key and assign the field value */ PrimaryKey key = table.createprimarykey(); key.put("userid", 1); Row row = tableapi.get(key, new ReadOptions(null, 0, null)); List<Row> myrows = tableapi.multiget(key, null, null); or use multiget, /* Read row(s), in a shard or in parallel across all shards*/ TableIterator<Row> iter = tableapi.tableiterator(key, null ( a multirow options), null (iterator options); FieldRange fh = table.createfieldrange("familiarname"); fh.setstart("bob", true); fh.setend("patricia", true); MultiRowOptions mro = fh.createmultirowoptions(); /* Catch possible Exceptions*/ Consistency, Timeout, IllegalArgument, Fault Exceptions 28

API CRUD Index Read Search secondary index where firstname = Jane 1. Create instance of table object we wish to read from 2. Create instance of index object that we will search 3. Set search key 4. Call iterator to scan index 5. Convert results to JSON object, take JSON object and convert back to Row Table usertable = tableapi.gettable("user", null); Index index = usertable.getindex("usersecondary"); IndexKey indexkey = index.createindexkey().put("firstname", "Jane"); TableIterator<Row> results = tableapi.tableiterator(indexkey); while (results.hasnext()) { Row row = results.next(); /* Convert the row to a JSON object */ String jsonstring = row.tojsonstring(); } /* Convert the JSON object back to a row */ Row myrow = usertable.createrowfromjson(jsonstring); 29

Agenda Oracle NoSQL Database Overview Use cases and customer highlight Product features, Performance, API walk thru Costs and Roadmap 30

Oracle NoSQL Database Subscription Model Business-friendly support service Oracle NoSQL Database Community Edition Open Source AGPL Edition NoSQL Database Client Drivers Open Source, Apache licensed Subscription support for Community Edition Price is $2,000/year per server No upfront license fee Provides for full Oracle support policy response Purchase online via the Oracle Store Enterprise Edition Closed Source. Standard Oracle License https://shop.oracle.com/ 31

Roadmap plan Dec 14 2015 Release 3.1 & 3.2 Unified CLI Authorization Client-side DDL API Thrift-based C API Javascript, Python REST API Integration Oracle Enterprise Manager Oracle Big Data SQL APIs C#, Ruby SQL query language Server & Performance Full Text Indexing Admin Repair Diagnostic enhancements Security User defined Roles Authorization Kerberos Integration On-disk Encryption Administration Replication Factor of 2 Improve admin Scriptability Batch Imp/Exp utility Supportability & Debug enhancements Integration More OEM Integration Oracle Distributed Query Integration Oracle Public Cloud KVOutput format Yarn Integration Spark Integration Docker Integration The current plan is for quarterly releases

Join NoSQL Database Community Oracle.com/BigData Twitter https://twitter.com/#!/oraclenosql LinkedIn http://www.linkedin.com/groups?gid=4147754 Oracle s NoSQL DB blog https://blogs.oracle.com/nosql Oracle Technology Network http://bit.ly/1f0d8wu Developer Webcast Series http://bit.ly/1dov2jl 33

Q&A 34

35

Appendix

Data Center Support Availability Zones Flexible configuration Primary Zones Durability guarantees Low latency writes, HA 2 nd ary Read-Only Zones Asynchronous replication Analytic workloads Report generation Topology Aware Client Driver Provides business continuity and distributed workload management PrimaryZones DC1 DC2 DC3 Reports Batch Analytics 38