www.objectivity.com Ibrahim Sallam Director of Development



Similar documents
Graph Databases What makes them Different?

InfiniteGraph: The Distributed Graph Database

Graph Database Proof of Concept Report

Cloud Computing and Advanced Relationship Analytics

bigdata Managing Scale in Ontological Systems

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

An Introduction To Presented by Leon Guzenda, Founder, Objectivity

NoSQL for SQL Professionals William McKnight

The Synergy Between the Object Database, Graph Database, Cloud Computing and NoSQL Paradigms

Choosing The Right Big Data Tools For The Job A Polyglot Approach

Fast Innovation requires Fast IT

Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Database Scalability {Patterns} / Robert Treat

NoSQL in der Cloud Why? Andreas Hartmann

ROME, BIG DATA ANALYTICS

How graph databases started the multi-model revolution

Apigee Insights Increase marketing effectiveness and customer satisfaction with API-driven adaptive apps

Oracle Spatial and Graph. Jayant Sharma Director, Product Management

Practical Cassandra. Vitalii

Software Life-Cycle Management

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

Lecture Data Warehouse Systems

Distributed Data Management

NoSQL - What we ve learned with mongodb. Paul Pedersen, Deputy CTO paul@10gen.com DAMA SF December 15, 2011

Raising Abstractions for the Software Defined Business

The MDM (Measurement Data Management) system environment

F1: A Distributed SQL Database That Scales. Presentation by: Alex Degtiar (adegtiar@cmu.edu) /21/2013

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

X4-2 Exadata announced (well actually around Jan 1) OEM/Grid control 12c R4 just released

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Hybrid Solutions Combining In-Memory & SSD

Objectivity positions graph database as relational complement to InfiniteGraph 3.0

MongoDB Developer and Administrator Certification Course Agenda

Domain driven design, NoSQL and multi-model databases

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

Ad Hoc Analysis of Big Data Visualization

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

HDFS Federation. Sanjay Radia Founder and Hortonworks. Page 1

In Memory Accelerator for MongoDB

Embedded Analytics & Big Data Visualization in Any App

The Cloud to the rescue!

Adding scalability to legacy PHP web applications. Overview. Mario Valdez-Ramirez

The Sierra Clustered Database Engine, the technology at the heart of

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

How To Improve Performance In A Database

Big Data Solutions. Portal Development with MongoDB and Liferay. Solutions

Big Data Visualization with JReport

Digital Transformation

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

Big Data Visualization and Dashboards

Cloud3DView: Gamifying Data Center Management

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Using RDBMS, NoSQL or Hadoop?

Big Data Analytics Nokia

Enterprise Application Monitoring with

Virtual CPE and Software Defined Networking

Transactions and ACID in MongoDB

Building a SaaS Application. ReddyRaja Annareddy CTO and Founder

GigaSpaces Real-Time Analytics for Big Data

Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB

EDG Project: Database Management Services

<Insert Picture Here> Achieving Business & Government Interoperability through PaaS & SaaS

Internet Scale Storage Microsoft Storage Community

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

INTRODUCTION TO CASSANDRA

ScaleArc idb Solution for SQL Server Deployments

ENOVIA V6 Architecture Performance Capability Scalability

CitusDB Architecture for Real-Time Big Data

iservdb The database closest to you IDEAS Institute

Table Of Contents. 1. GridGain In-Memory Database

EOFS Workshop Paris Sept, Lustre at exascale. Eric Barton. CTO Whamcloud, Inc Whamcloud, Inc.

Lag Sucks! R.J. Lorimer Director of Platform Engineering Electrotank, Inc. Robert Greene Vice President of Technology Versant Corporation

Вовченко Алексей, к.т.н., с.н.с. ВМК МГУ ИПИ РАН

Traditional BI vs. Business Data Lake A comparison

Big Data Challenges in Bioinformatics

Configuration Management of Massively Scalable Systems

Outdated Architectures Are Holding Back the Cloud

Increasing Business Productivity and Value in Financial Services with Secure Big Data Architecture

SwiftScale: Technical Approach Document

Big data platform for IoT Cloud Analytics. Chen Admati, Advanced Analytics, Intel

NoSQL storage and management of geospatial data with emphasis on serving geospatial data using standard geospatial web services

The deployment of OHMS TM. in private cloud

ADOBE EXPERIENCE MANAGER MOBILE. for Healthcare

Issues in Big-Data Database Systems

ScaleArc for SQL Server

"LET S MIGRATE OUR ENTERPRISE APPLICATION TO BIG DATA TECHNOLOGY IN THE CLOUD" - WHAT DOES THAT MEAN IN PRACTICE?

Unified Batch & Stream Processing Platform

Multi-Domain Master Data Management. Subhash Ramachandran VP, Product Management

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

This paper defines as "Classical"

NOSQL, BIG DATA AND GRAPHS. Technology Choices for Today s Mission- Critical Applications

Using Object Database db4o as Storage Provider in Voldemort

Processing and Analyzing Streams. CDRs in Real Time

Innovation: Add Predictability to an Unpredictable World

How to Build an E-Commerce Application using J2EE. Carol McDonald Code Camp Engineer

Introduction to NOSQL

MongoDB in the NoSQL and SQL world. Horst Rechner Berlin,

Big Data Course Highlights

Transcription:

www.objectivity.com Ibrahim Sallam Director of Development

Graphs, what are they and why? Graph Data Management. Why do we need it? Problems in Distributed Graph How we solved the problems

Simple Graph Node <- > Node h"p://opte.org

[finding] Japanese restaurants in New York City liked by people from Japan a challenging Facebook query. "The value of any piece of information is only known when you can connect it with something else that arrives at a future point in time, CIA s CTO Ira "Gus" Hunt

Person? Building Work Live RR Visit Eat Shop

PLM (Product Lifecycle Mgmt) CRM, Sales & Marke.ng Network Mgmt, Telecom Intelligence (Government & Business) IG Finance Social Networks Research: Genomics Healthcare

Data structure for representing complicated relationships between entities. Wide applications in Bioinformatics. Social networks. Network management etc.

Everyone specializes Doctors, Lawyers, Bankers, Developers Why was data so normalized for so long! NoSQL is all about the data specialist Specializing in Distribution / deployment Physical data storage Logical data model Query mechanism

Navigate Ingest Update Massive graph data require efficient and intelligent tools to analyze and understand it.

Ingest Big/Fast data demands write performance Most NoSQL solutions allow you to scale writes by Partitioning the data Understanding your consistency requirements Allowing you to defer conflicts

Ingest App- 1 (E (Ingest 1 2 { V 1 V 21 }) ) App- 2 (E (Ingest 23 { V 2 V 32 }) ) App- 3 (Ingest V 3 ) InfiniteGraph ObjecSvity/DB Persistence Layer V 1 E 12 V 2 E 23 V 3

Ingest IG Core/API E 12 E 23 Target Containers E(1- >2) E(3- >1) E(2- >3) E(1- >2) E(3- >2) E(2- >3) E(2- >1) E(3- >1) E(2- >3) E(3- >1) E(1- >2) E(1- >2) E(2- >1) E(2- >3) E(3- >1) E(3- >2) Pipeline Containers Pipeline Agent C 1 C 2 C 3

Excellent for efficient use of page cache Able to maintain full database consistency Achieves highest ingest rate in distributed environments Almost always has highest perceived rate Trading Off : Eventual consistency in graph (connections) Updates are still atomic, isolated and durable but phased External agent performs graph building Ingest

Ingest 500000 450000 400000 Nodes and Edges per second 350000 300000 250000 200000 150000 100000 4 clients 8 clients 1 client 2 clients 4 clients 8 clients 50000 0 1 2 1 client 2 clients # of Processes 4

Partitioning and Read Replicas easy right! ApplicaSon(s) Distributed API Processor Processor Processor Processor ParSSon 1 ParSSon 2 ParSSon 3 ParSSon...n

Navigate ApplicaSon(s) Distributed API Processor Processor Processor Processor ParSSon 1 ParSSon 2 ParSSon 3 ParSSon...n

Navigate Detect local hops and perform in memory traversal Send the partial path to the distributed processing to continue the navigation. Intelligently cache remote data when accessed frequently Route tasks to other hosts when it is optimal

Navigate ApplicaSon Distributed API Processor Processor P(A,B,C,D) ParSSon 1 ParSSon 2

Schema vs Schema-less Database religion InfiniteGraph supports schema, but does not restrict connection types between vertices We take advantage of the schema when pruning. Working on a hybrid support.

Navigate PaSent PrescripSon Drug Visit Physician Outcome Ingredient Complaint Allergy

Navigate GraphViews are extremely powerful Allow Big Data to appear small! Connection inference can lead to exponential gains in query performance Views are reusable between queries Built into the native kernel

Objectivity/DB is a proven foundation Building distributed databases since 1993 A complete database management system Concurrency, transactions, cache, schema, query, indexing It s a Graph Specialist! Simple but powerful API tailored for data navigation. Easy to configure distribution model

Physically co-locate closely related data Driven through a declarative placement model Dramatically speeds local reads Dr Blake Dr Quinn Mr CiSzen Primary Physician Dr Jones Dr Smith With Located Located Has Has With Located Located Sunny- vale At Visit Visit At San Jose Facility Facility Data Page(s) PaSent Data Page(s) Facility Data Page(s)

AddVertex() IG Core/API Customizable Placement Distributed Object and RelaSonship Persistence Layer HostA HostB HostC HostX Zone 1 Zone 2

Users ParSSoned Distributed DB (ohen Document / KV) External / Legacy Data Distributed Data Processing Plagorm TransformaSon \ MDM RDBMS Document Graph Database ApplicaSons Business

Distributed update. Update we are working on it.

Have we solved all the distributed graph problems? Not quite, but We Learned to Stop Worrying.