Oracle Spatial and Graph



Similar documents
Graph Database Performance: An Oracle Perspective

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Mining Big Data with RDF Graph Technology:

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies

How To Use An Orgode Database With A Graph Graph (Robert Kramer)

Introduction to Ontologies

Oracle Spatial and Graph. Jayant Sharma Director, Product Management

Geospatial Platforms For Enabling Workflows

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Geospatial Technology Innovations and Convergence

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Oracle Business Intelligence EE. Prab h akar A lu ri

Oracle Database 12c Plug In. Switch On. Get SMART.

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Geospatial Platforms For Enabling Workflows

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Semantic Stored Procedures Programming Environment and performance analysis

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

Using OBIEE for Location-Aware Predictive Analytics

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Introducing Oracle Exalytics In-Memory Machine

Oracle Graph: Graph Features of Oracle Database

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Safe Harbor Statement

TUT NoSQL Seminar (Oracle) Big Data

An Oracle White Paper October Oracle: Big Data for the Enterprise

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

National Security and Cyber Defense with Big Data

Oracle Database Public Cloud Services

An Oracle White Paper June Oracle: Big Data for the Enterprise

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

Oracle Database 11g Comparison Chart

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

LDIF - Linked Data Integration Framework

Ganzheitliches Datenmanagement

An Oracle White Paper September Oracle: Big Data for the Enterprise

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

BIG DATA TECHNOLOGY. Hadoop Ecosystem

TopBraid Insight for Life Sciences

A collaborative platform for knowledge management

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

<Insert Picture Here> Oracle Retail Data Model Overview

Oracle Big Data Strategy Simplified Infrastrcuture

Actian SQL in Hadoop Buyer s Guide

Semantic and Data Mining Technologies. Simon See, Ph.D.,

How To Create A Business Intelligence (Bi)

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Zynga Analytics Leveraging Big Data to Make Games More Fun and Social

Oracle Big Data Building A Big Data Management System

Oracle Big Data SQL Technical Update

Intelligent Government From Data to Decision. Robert Lindsley Oracle, Public Sector Technology Group

Big Data and Analytics in Government

SQL Server 2005 Features Comparison

Cray: Enabling Real-Time Discovery in Big Data

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Big Data Are You Ready? Thomas Kyte

Big Data for Official Statistics Processing Big and Fast Data Optimizing Results with a Multi-Model Database

Advanced In-Database Analytics

High Performance Data Management Use of Standards in Commercial Product Development

Data Analysis with Various Oracle Business Intelligence and Analytic Tools

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

ORACLE DATABASE 10G ENTERPRISE EDITION

HIGH PERFORMANCE ANALYTICS FOR TERADATA

Oracle Architecture, Concepts & Facilities

Ramesh Bhashyam Teradata Fellow Teradata Corporation

Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

bigdata Managing Scale in Ontological Systems

UNIFY YOUR (BIG) DATA

Data Integration and ETL with Oracle Warehouse Builder: Part 1

The Semantic Web for Application Developers. Oracle New England Development Center Zhe Wu, Ph.D. 1

Big Data Analytics Nokia

Comparison of Triple Stores

Oracle Database: SQL and PL/SQL Fundamentals NEW

Unified Batch & Stream Processing Platform

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

AllegroGraph. a graph database. Gary King gwking@franz.com

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

An Oracle White Paper May Oracle Database Cloud Service

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

[Hadoop, Storm and Couchbase: Faster Big Data]

Transcription:

Oracle Spatial and Graph Overview of New Graph Features

"THE FOLLOWING IS INTENDED TO OUTLINE OUR GENERAL PRODUCT DIRECTION. IT IS INTENDED FOR INFORMATION PURPOSES ONLY, AND MAY NOT BE INCORPORATED INTO ANY CONTRACT. IT IS NOT A COMMITMENT TO DELIVER ANY MATERIAL, CODE, OR FUNCTIONALITY, AND SHOULD NOT BE RELIED UPON IN MAKING PURCHASING DECISION. THE DEVELOPMENT, RELEASE, AND TIMING OF ANY FEATURES OR FUNCTIONALITY DESCRIBED FOR ORACLE'S PRODUCTS REMAINS AT THE SOLE DISCRETION OF ORACLE." 2

Program Agenda Background: Graph Concepts Use Cases Newest Graph Features in Oracle Database 12c Performance, Scalability, Manageability 3

Oracle Spatial and Graph Mature, Proven Graph Database Capabilities Graph Features Network Data Model graph W3C RDF Semantic graph 4

Network Data Model: Graph model to represent physical and logical networks Key Features Explicitly stores and maintains connectivity Attributes at link and node level Java API to perform Analysis in memory Multiple Cost Support in Path Analysis Traveling salesman, spanning tree, shortest path, sub-path, within cost, nearest neighbors 5

GRAPH CONCEPTS APPLIED TO THE ENTERPRISE 6

RDF Graph Use Cases Semantic Metadata Layer Text Mining & Entity Analytics Social Media Analysis Unified content metadata for federated resources Validate semantic and structural consistency Find related content & relations by navigating connected entities Reason across entities Analyze social relations using curated metadata - Blogs, wikis, tweets, video - Calendars, IM, voice 7

Modeling Entity Relationships as graph The basic unit of information (fact) is represented as <subject, predicate, object> triple Originally created to encode metadata such as author, date, etc. for web resources. Recently, it has become popular to relate things in the real-world such as people, places, concepts etc. Triples together form a graph, connecting pieces of data subject predicate object :friendof :Mary :John :age 32 ^^xsd:decimal 8

Relational to Graph Modeling Neuron has is_located_in Compartment involves Neuronal Property Receptor Pathological Change involves Pathological Agent inhibits inhibits Agent inhibits Drug is_located_in Channel 9 Courtesy: SenseLab, Yale University

Enterprise Metadata Modeling Graph Metadata Mapping Mid-Tier Server Application 1 Application 2 Application 3 SPARQL Benefits Existing relational data stays in place and corresponding applications do not need to change Use of virtual mapping eliminates synchronization issues Common vocabulary helps with data integration issues SQL Database Server RDF Graph Shared Ontologies Inventory Graph Sales Graph HR Schema Inventory Schema Sales Schema HR Database Inventory Database Sales Database 10

Linked Data in Enterprise Access & Presentation Layer Semantic Graph model (W3C RDF Metadata Model) Index Data Servers Event Server Hadoop Appliance Content Mgmt BI Server Data Warehouse Data Sources / Types Machine Generated Data Social Media Human Sourced Information Subscription Services Transaction Systems 11

Benefits of RDF Graph Metadata Management Open, flexible, unifying enterprise metadata model Incorporates ALL business content (RDBMS, ECM, Big Data, events) Highly expressive rules for managing data policies and governance Enables users to link, analyze, and discover information horizontally, as shared services, across a variety of enterprise and Web sources 12

USE CASES 13

Industries Have Already Adopted the Concept Industries Life Sciences Finance Media Networks & Communications Defense & Intelligence Public Sector Hutchinson 3G Austria Thomson Reuters 14

Eli Lilly and Company Oracle Spatial and Graph: RDF Graph Metadata Repository Objectives Unified vocabulary for scientific investigation Easier, more complete investigations Solution Integrate patient records, chemical structures, biological sequences & pathways, images, scientific papers View related data as a graph Traverse graphs to discover relationships, search for a term, or browse ontologies [This technology ] provides improved insight into our business by bringing together related information from diverse data sources, J. Phil Brooks Information Consultant, Eli Lilly and Company 15

Allied Nation Intelligence Service Oracle Spatial and Graph: Social Analysis Objectives Profile suspects through telephone, email and social network communications Produce data products for analysts Solution RDF Graph modeling of the social network: people, groups and places of interest Inferencing & graph analytics discover relationships among individuals & meaning of pseudonyms, aliases, codes, terminology Benefits Standards-based tools: W3C RDF & SPARQL Semantic tagging for 600 TB / 10b triples graph Top-secret, compartmented security for data New discovery on ~100 million triples / month Find & label same-as relationships 16

Cisco WebEx Social Graph for Enterprise Collaboration Objectives Social connectivity and collaboration through semantic enablement Connect knowledge silos Solution Persistent unified graph metadata model Concepts tagged with unique meaning Find related content & groups by navigating connected entities, recommendations Benefits Unifies metadata model - forum, blog, wiki, etc. Tagging media documents, pictures, blogs, etc. to user-defined and/or enterprise vocabularies. Validates tag semantic/structural consistency 17

Novartis Institutes for BioMedical Research (NIBR) Project Metastore Insert your company logo here BRIEF ORGANIZATIONAL OVERVIEW NIBR is the global pharmaceutical organization for Novartis committed to discovering innovative medicines to treat diseases with high unmet medical need 6000+ scientists, physicians, business professionals worldwide BUSINESS CHALLENGES / OPPORTUNITIES Scientific knowledge portal does not provide ability to: Link database information on genes, proteins, metabolic pathways, compounds, ligands, etc. to original sources. Increase productivity for accessing, sharing, searching, navigating, cross-linking, analyzing internal /external data SOLUTION Provide a semantic integration layer on existing relational tables: Rich domain-specific terminology (biology, chemistry and medicine) containing 1.6 M terms Terminology Hub: 8 GB of referential data that crossreferences between data repositories. BUSINESS BENEFITS REALIZED BY ORACLE SOLUTION Performance: met design goals for comparable performance to legacy application with enhanced usability Better Analysis: enables discovery of unknown relationships based on the meaning (the semantics) of the data Flexible Data Modeling: supports discovery and allows easy changes to incorporate new kinds of data and relationships Manageability: RDF triple store benefits from Oracle Database functionality: backup, security, replication,. 18

RDF GRAPH DATABASE FEATURES 19

Oracle Database 12c Spatial and Graph Tooling Transaction Systems Transform & Modeling Tools Load, Query & Inference Applications & Analysis Tools Unstructured Content RSS, email Other Data Formats R2RML Protégé for Oracle Jena for Oracle Sesame for Oracle NLP Extraction (partners) RDF/OWL Data Management SQL & SPARQL Query OWL Inferencing Semantic Rules Scalability & Security Semantic Indexing Java, HTTP access JSON output Graph visualization (Cytoscape) Map (GIS) Visualization Oracle Business Intelligence (OBIEE) Data Sources Oracle Database 12c Oracle BPM 20

Oracle Database 12c RDF Triple Store Leverages Oracle Manageability: RAC & Exadata scalability Compression & partitioning SQL*Loader direct path load Parallel load, inference, query High Availability Triple-level label security Ladder based inference Choice of SPARQL, SQL, or Java Native inference engine Enterprise Manager Load / Storage Query Reasoning Analytics Native RDF graph data store Manages tens of billions of triples Relational to RDF mapping SPARQL-Jena/Joseki, Sesame SQL/graph query, B-tree indexing Ontology assisted SQL query RDFS, OWL2 RL, EL+, SKOS User-defined rules Incremental, parallel reasoning User-defined inferencing Plug-in architecture Semantic indexing framework Integration with OBIEE, Oracle R Enterprise Oracle Data Mining 21

SPARQL and SPARQL in SQL Architecture HTTP Standard SPARQL Endpoint Enhanced with query management control Java Jena API Jena Adapter Sesame API Sesame Adapter SPARQL-to-SQL Translation Logic SQL SEM_MATCH rewritable table function 22

NEWEST GRAPH FEATURES IN ORACLE SPATIAL AND GRAPH FOR ORACLE DATABASE 12C 23

SPARQL 1.1 Query Support 40+ new query functions/operators: IF, COALESCE, STRBEFORE, REPLACE, ABS, Aggregates: COUNT, SUM, MIN, MAX, AVG, GROUP_CONCAT, SAMPLE Subqueries Value Assignment: BIND, GROUP BY Expressions, SELECT Expressions Negation: NOT EXISTS, MINUS 24

RDB2RDF: Modeling Relational Data as a Graph Relational to RDF Modeling W3C Standard Specification Oracle Spatial and Graph 12c can represent relational schema as graph view Integrate content from distributed sources Federate distributed databases Apply SPARQL queries on tables, views, SQL query results No duplication of data and storage 25

SPARQL 1.1 Update Capabilities of SPARQL Update Insert triples into an RDF Graph Delete triples from an RDF Graph Load an RDF Graph Clear an RDF Graph Create a new RDF Graph Drop an RDF Graph Copy, move or add the content of one RDF Graph to another Perform a group of update operations as a single action 26

SPARQL 1.1 Property Paths A property path is a possible route through a graph between two graph nodes W3C Property Paths Answers question, Yes or No: does a path exist from Tim to Sam? Extends matching of a triple pattern to any length path A more succinct way to write parts of basic graph patterns Works with Tim Manages Ray Bob Manages John Works with Mary Works with Works with Sam Works with Alice 27

Oracle Spatial and Graph Inference Native OWL 2 EL inference support User defined inferencing Allows generation of new RDF resources Temporal reasoning, Spatial reasoning Web service callouts Ladder Based Inference Fine grained security for inference graph Performance optimization for user defined rules Integration with TrOWL*, an external OWL 2 reasoner TrOWL is a transformation based, tractable reasoner for OWL 2 28 * http://trowl.eu/

Jena and Sesame Adapters Preconfigured, low query cost, ease of install & management Easy-to-use Java APIs to access Oracle database A standard-compliant SPARQL web service endpoint Data loading (RDF/XML, N-TRIPLES, N-QUADS, TriG,Turtle) w/ long literals JSON output Oracle-specific extensions for query execution control and management Integration with OBIEE, RDF browser 29

Graph Support on Oracle NoSQL Available on Oracle NoSQL Database (Enterprise Edition) Graph Feature for NoSQL RDF Graph support in Oracle NoSQL Database Enterprise Edition High performance Key Value store Standard access to graph data: SPARQL 1.1 Jena & Joseki SPARQL endpoint Web Services Massive horizontal scalability petabytes of triples Support for World Wide Web Consortium (W3C) Semantic Web standards 30

In-Database Graph and Predictive Analytics Oracle Advanced Analytics Problem Classification Anomaly Detection Association Rules Sample Problem Given demographic data about a set of customers, identify customer purchasing behavior that is significantly different from the norm Find the items that tend to be purchased together and specify their relationship market basket analysis Clustering Segment demographic data into clusters and rank the probability that an individual will belong to a given cluster Feature Extraction Given demographic data about a set of customers, group the attributes into general characteristics of the customers F1 F2 F3 F4 31

Reporting RDF Data with Oracle BI EE Powerful BI dashboards Visually appealing 100% thin client Across all styles of analysis R-OLAP, M-OLAP, Scorecards, Reporting, Collaboration, Actions Across all data sources Federated data access Share, collaborate, & publish 32

Performing Statistical Graph Analytics Oracle R Enterprise Open source language Statistical computing and chart for graph data Produces publication quality plots Highly extensible with open source R packages 33

PERFORMANCE, SCALABILITY, MANAGEABILITY 34

Performance and Scalability Scales to 100s of billions of triples (petabytes) and more - Scales linearly with Oracle database and hardware - No limitations as with other in-memory approaches Fast loading of triples Incremental and bulk loading Parallelism is exploited Load, Query, Inference Comparable to or faster than competing RDF graph databases 35

Manageability of RDF Semantic Graph Integration with Oracle Database 11g/12c utilities and tools Ingest / Replicate / Recover Bulk load: Apache Jena bulk loader Oracle external tables & SQL*Loader (Direct Path) w/ PL/SQL Bulk Load API Replicate & recover: Data Guard: physical standby Data Pump: staging tables Recovery Manager: RMAN Tune / Analyze Tune load/ query/ inference: Parallelism Btree indexing triple/quad Typed literals indexing SPARQL query hints Statistics gathering Dynamic Sampling Analyze performance: Enterprise Manager: view optimizer plans, monitor execution / resource usage Manage Control query execution: in database & Jena client Create & monitor graph w/ SQL Developer: Semantic Network Models, virtual models Btree indexes Rule bases Entailments Security data labels Semantic index policies 36

Summary Oracle brings enterprise-class RDF semantic graph data management Scalable, Secure, and High Performance: load, query, inference features Supports W3C Semantic Standards Works with structured and unstructured data Enterprise-class Oracle tools can now mine insight from semantic data OBIEE Oracle Data Mining Oracle R Enterprise Graph DB now available on Oracle NoSQL Database EE 37

For More Information Oracle RDF Xavier.Lopez@oracle.com oracle.com 38

39

40