Experiences from a Large Scale Ontology-Based Application Development



Similar documents
An industry perspective on deployed semantic interoperability solutions

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

OWL: Path to Massive Deployment. Dean Allemang Chief Scien0st, TopQuadrant Inc.

Open Data Integration Using SPARQL and SPIN

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies

12 The Semantic Web and RDF

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

Introduction to SKOS. Bob DuCharme October 6, 2011

Open Source egovernment Reference Architecture Osera.modeldriven.org. Copyright 2006 Data Access Technologies, Inc. Slide 1

Network Graph Databases, RDF, SPARQL, and SNA

Performance Testing Web 2.0

bigdata Managing Scale in Ontological Systems

Department of Defense. Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD

Semantic Stored Procedures Programming Environment and performance analysis

Common IT solutions for The Norwegian Continental Shelf (NCS)

Module I: Overview of Semantic Technologies and the Semantic Web

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Graph Database Performance: An Oracle Perspective

Data Consumer's Guide. Aggregating and consuming data from profiles March, 2015

How To Use An Orgode Database With A Graph Graph (Robert Kramer)

SMART Apps. Rob Tweed M/Gateway Developments

We have big data, but we need big knowledge

Lift your data hands on session

Cloud-based Linked Data Geoprocessing: Implementing Kriging as WPS on the Cloud

Towards a Sales Assistant using a Product Knowledge Graph

MarkLogic Server. Reference Application Architecture Guide. MarkLogic 8 February, Copyright 2015 MarkLogic Corporation. All rights reserved.

An Ontology-based e-learning System for Network Security

EPIM LogisticsHub (ELH)

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

Semantic Web Success Story

Using SPARQL/OWL for Validation of Smart Grid Standards

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

A smart app integrated with a Webbased advisory system for designing and managing grain drying and storage

STAR Semantic Technologies for Archaeological Resources.

The Ontology and Architecture for an Academic Social Network

Short Paper: Enabling Lightweight Semantic Sensor Networks on Android Devices

Scope. Cognescent SBI Semantic Business Intelligence

Cognitive Interaction Toolkit

Semantic Interoperability

Distribution and Integration Technologies

Service Oriented Architecture

DISCOVERING RESUME INFORMATION USING LINKED DATA

Creating a Semantic Web Service in 5 Easy Steps. Using SPARQLMotion in TopBraid Composer Maestro Edition

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

Annotea and Semantic Web Supported Collaboration

DataOps: Seamless End-to-end Anything-to-RDF Data Integration

Microsoft Azure Data Technologies: An Overview

Annotation: An Approach for Building Semantic Web Library

ABSTRACT 1. INTRODUCTION. Kamil Bajda-Pawlikowski

Semantic Web Tool Landscape

RDF Dataset Management Framework for Data.go.th

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

Basic Scheduling in Grid environment &Grid Scheduling Ontology

Getting Started with GRUFF

Introducing. automated functional testing of mobile apps. Karl Krukow, CTO, LessPainful GotoAMS, May, 2012

How To Write A Drupal Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

Linked Statistical Data Analysis

GetLOD - Linked Open Data and Spatial Data Infrastructures

Managing enterprise applications as dynamic resources in corporate semantic webs an application scenario for semantic web services.

XML for Manufacturing Systems Integration

DLDB: Extending Relational Databases to Support Semantic Web Queries

Clinical Knowledge Manager. Product Description 2012 MAKING HEALTH COMPUTE

TopBraid Life Sciences Insight

The use of Semantic Web Technologies in Spatial Decision Support Systems

NoSQL and Graph Database

JOURNAL OF COMPUTER SCIENCE AND ENGINEERING

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability

Automating Rich Internet Application Development for Enterprise Web 2.0 and SOA

Ampersand and the Semantic Web

A Model of the Operation of The Model-View- Controller Pattern in a Rails-Based Web Server

Introducing DocumentDB

TopBraid Insight for Life Sciences

Industry 4.0 and Big Data

Transcription:

Experiences from a Large Scale Ontology-Based Application Development Ontology Summit 2012 David Price, TopQuadrant Copyright 2012 TopQuadrant Inc 1

Agenda Customer slides explaining EPIM ReportingHub TopQuadrantslides explaining ontology-related topics Copyright 2012 TopQuadrant Inc 2

Copyright 2012 TopQuadrant Inc 3

Copyright 2012 TopQuadrant Inc 4

Copyright 2012 TopQuadrant Inc 5

Copyright 2012 TopQuadrant Inc 6

Copyright 2012 TopQuadrant Inc 7

Copyright 2012 TopQuadrant Inc 8

Copyright 2012 TopQuadrant Inc 9

Scale of EPIM ReportingHub 300 million triples in the next 4 years Potential for many more if data sources added or historical data imported 40+ concurrent users REST/SOAP services too Source XSDs have 2000-ish elements -> resulting ontologies have 900-ish classes and 900-ish properties Delivered on an SaaSbasis with high availability Service Level Agreement RDF database replication and warm fallover Secondary app server running at all times Copyright 2012 TopQuadrant Inc 10

TopQuadrantViewpoint Ontologiesare software artifacts -part of an application that has a purpose Purpose is not always to represent the real world, not always to enable reasoners Creating and using ontologieshas challenges, but it s not a lot harder than other software engineering tasks Applications that use ontologiesoften have competing and conflicting requirements and require trade-offs E.g. How can my ontology 1) support change and aggregation using 4-dimensionalism, 2) support DL reasoning, 3) be understandable by software engineers and 4) have good query performance? Answer it cannot Copyright 2012 TopQuadrant Inc 11

Use of Ontologies We use ontologies for everything, literally, and we use RDF/OWL E&P ontologiescover the domain and are based in ISO 15926 Proxy ontologiesof XML Schemas for Daily Drilling, Daily Production, and Monthly Production allow XML-as-OWL data Ontology of NPD CSV/spreadsheet allows CSV-as-OWL data Ontology of SPARQL is part of SPIN W3C submission Ontology of functions similar to software library of functions allows packaged SPARQL, Java or JavaScript code Ontology of stored SPARQL as REST-like service returning JSON, XML, CSV, RDF Copyright 2012 TopQuadrant Inc 12

Use of SPIN/SPARQL These are critical ontology-related technologies ontologieson their own do very little for an application SPIN is SPARQL Inference Notation, W3C member submission Makes using SPARQL for constraints, transforms, etc. simpler We use SPIN/SPARQL for transformation and data validation Norwegian Petroleum Directorate Facts as CSV -> SPIN/SPARQL Construct E&P data Platform Operator Report XML -> SPIN/SPARQL Construct E&P data Validation of input data against NPD Facts -> SPIN Constraint as SPARQL We use SPARQL Web Pages for report generation E&P data -> SPARQL embedded in HTML Report for Partners in License Copyright 2012 TopQuadrant Inc 13

Challenge : The Ontology Team Disparate view, approach, background of ontologists working together E.g. zealots with respect to their flavor of ontology can always do it better and cannot believe how little understanding followers of other ontology religions have results often vary and have to be integrated by single ontology lead at the end E.g. engineers turned ontologistsoften don't have the necessary background/skills Early mistakes become legacy that has to be worked around These were more of a problem wrtbackground requirements on the project, not as much within the team, but even then they impacted the project Copyright 2012 TopQuadrant Inc 14

Challenge : Ontology Cost Large scale ontology development is expensive Is also engineering so suffers from At some point you just stop syndrome Reuse is often hard because organizations don't have the money to complete the ontology work things left in not-quite-good-enough state for future use when the money runs out and the money always runs out yet organizations often insist on reuse of previous work to recoup investment which costs them more Copyright 2012 TopQuadrant Inc 15

Challenge : Ontology as Software Ontology team meets Software team can be a clash of cultures E.g. Software developers can prove their code works, ontologists cannot It s hard to test an ontology (e.g. test data?) Ontological purity not high priority for Java or XML Schema developers In the end, the software development team owns the application, not the ontologist Copyright 2012 TopQuadrant Inc 16

Challenge : Practicalities Infrastructure supports software development far better than large-scale ontology development, yet ontology is just another software artifact in large scale apps E.g. diff utility helps software developers track change, but doesn t work on ontology files E.g. Cannot SPARQL with label rather than URI so human readable URIs are an issue that tools are not going to handle Documentation of ontologiesis often vague/ambiguous It means whatever the author says it meant when you manage to find them and ask Copyright 2012 TopQuadrant Inc 17

Summary wrtepim ReportingHub Semantic analysis of XML schema to E&P/ISO 15926 ontologies is not a quick process We used Franz s AllegroGraphfor this project there are scalable, enterprise triplestores now Supports ACID commit/rollback, warm standby, replication, etc. Transformations using SPIN/SPARQL make mappings and semantics visible to non-programmers SPIN/SPARQL is also how we will project from E&P ontology into DL-safe ontologies, full ISO 15926, etc. Copyright 2012 TopQuadrant Inc 18