Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management



Similar documents
Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

Open Source egovernment Reference Architecture Osera.modeldriven.org. Copyright 2006 Data Access Technologies, Inc. Slide 1

Industry 4.0 and Big Data

Graph Database Performance: An Oracle Perspective

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

How To Build A Cloud Based Intelligence System

HOW TO DO A SMART DATA PROJECT

A collaborative platform for knowledge management

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Semantic Stored Procedures Programming Environment and performance analysis

Publishing Linked Data Requires More than Just Using a Tool

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

Semantics and Ontology of Logistic Cloud Services*

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

Cray: Enabling Real-Time Discovery in Big Data

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Web Success Story

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

A Semantic web approach for e-learning platforms

An Ontological Approach to Oracle BPM

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Building Applications with Protégé: An Overview. Protégé Conference July 23, 2006

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

Analance Data Integration Technical Whitepaper

Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology

The Role of the Software Architect

A generic approach for data integration using RDF, OWL and XML

Amit Sheth & Ajith Ranabahu, Presented by Mohammad Hossein Danesh

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

Analance Data Integration Technical Whitepaper

Cisco Process Orchestrator Adapter for Cisco UCS Manager: Automate Enterprise IT Workflows

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

APPLYING CASE BASED REASONING IN AGILE SOFTWARE DEVELOPMENT

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce

MANDARAX + ORYX An Open-Source Rule Platform

Service Oriented Architecture

Lightweight Data Integration using the WebComposition Data Grid Service

How To Write A Drupal Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post

bigdata Managing Scale in Ontological Systems

TopBraid Insight for Life Sciences

Model Driven Interoperability through Semantic Annotations using SoaML and ODM

Creating visualizations through ontology mapping

Model-Driven Data Warehousing

How To Make Sense Of Data With Altilia

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution

Model-Driven Software Produces Truly Agile Solutions

Ampersand and the Semantic Web

Data Validation with OWL Integrity Constraints

Application of ontologies for the integration of network monitoring platforms

Perspectives of Semantic Web in E- Commerce

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

CMDB Federation. DMTF Standards for Federating CMDBs and other Management Data Repositories

Secure Semantic Web Service Using SAML

Simplifying e Business Collaboration by providing a Semantic Mapping Platform

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Microsoft Technology Practice Capability document. MOSS / WSS Building Portal based Information Worker Solutions. Overview

CiteSeer x in the Cloud

f...-. I enterprise Amazon SimpIeDB Developer Guide Scale your application's database on the cloud using Amazon SimpIeDB Prabhakar Chaganti Rich Helms

Big Data Analytics Nokia

Software Engineering/Courses Description Introduction to Software Engineering Credit Hours: 3 Prerequisite: (Computer Programming 2).

NS DISCOVER 4.0 ADMINISTRATOR S GUIDE. July, Version 4.0

Creating an Enterprise Reporting Bus with SAP BusinessObjects

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Test Data Management Concepts

Keywords document, agile documentation, documentation, Techno functional expert, Team Collaboration, document selection;

Semantic Workflows and the Wings Workflow System

Introduction to Agile and Scrum

Information Technology for KM

A Survey Study on Monitoring Service for Grid

Standards for Big Data in the Cloud

An industry perspective on deployed semantic interoperability solutions

Designing a Semantic Repository

Implementing a Data Governance Initiative

Considering Third Generation ediscovery? Two Approaches for Evaluating ediscovery Offerings

GetLOD - Linked Open Data and Spatial Data Infrastructures

Oracle Real Time Decisions

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

P ERFORMANCE M ONITORING AND A NALYSIS S ERVICES - S TABLE S OFTWARE

Understanding and Addressing Architectural Challenges of Cloud- Based Systems

Big Data 101: Harvest Real Value & Avoid Hollow Hype

Transcription:

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Ram Soma 2, Amol Bakshi 1, Kanwal Gupta 3, Will Da Sie 2, Viktor Prasanna 1 1 University of Southern California, Los Angeles 2 Chevron Corp. 3 Avanade Inc.

About CiSoft USC-Chevron Center of Excellence for Research and Academic Training on Interactive Smart Oilfield Technologies Established: December 2003 Disciplines: Petroleum Engineering, Chemical Engineering, Material Science, Physics, Computer Science, Electrical Engineering, Industrial Engineering MS Degree in Petroleum Engineering with emphasis on Smart Oilfield Technologies (SOFT) Integrated Asset Management Well Productivity Improvement RESEARCH AREAS Robotics and Artificial Intelligence Embedded and Networked Systems http://cisoft.usc.edu Reservoir Management Data Management Tools Immersive Visualization

Outline Integrated Asset Management Objectives Role of semantic web Software development methodology IAM Ontology Ontology design Change management and dirty queries Remarks Lessons learnt Areas of interest

Integrated Asset Management (IAM) What is IAM? A comprehensive transformation approach to integrated oilfield operations A software application that can help asset team members simulate decisions before making them Objectives Increase integration between different functions Enable asset level what if scenarios Create a knowledge base of activities and decisions Reduce risk and uncertainty in decision making Challenges Data silos are not interoperable Data is semi-structured Multiple organizations and classes of users 4

What IAM provides to users Efficient access to data and information Reduces time spent looking for data Answers complex queries across semi-structured data sets Consistent view of information Reconciles different views of the same information Creates shared situational awareness of the asset Context of information creation and usage Leads to more meaningful interpretation of data Acts as organizational memory for the workflow Non-functional: Non-disruptive, extensible, scalable, usable, etc. 5

The IAM Metacatalog Problem Simulation models embody different realizations of uncertainty and development strategies for an asset Models are created by different user groups at different times; it is difficult to maintain consistency of assumptions No intuitive search functionality available to domain experts ( Show me most recently history matched model ) Solution: The IAM Metacatalog Metadata repository at the core of the IAM application Focus on answering What does the data mean? (vs. How do I access the data ) Key parameters and assumptions from various models are extracted and stored in the metacatalog Also stores relationships between data objects and their provenance

Why Semantic Web Technologies Expressivity and richness of data model Organic growth capability for domain models/knowledge Inferencing and Rule Based Reasoning Flexibility of querying Ease of domain expert to understand and contribute to domain models Standards based (No vendor lock-in) Promoted by W3C 7

IAM R&D Timeline

Populating the Metacatalog Most of the metadata is captured offline Metadata extraction by custom built parsers Simulation cases IAM UI/ Agent Extract Information OWL Inferencing Upload to database Simulation model repositories IAM Application IAM KB

Example 1: Browse And Search Search based on metadata Provenance and metadata info for IAM data objects Data objects

Example 2: Comparing Assumptions Comparing a key assumption made in two simulation cases OOIP Region Names

Software Development Methodology Agile development using Scrum Iterative software development in Sprints Close collaboration with customer Reviews/demos after each sprint Flexible prioritization at sprint boundaries Product Owner role represents the stakeholders Less focus on formal documentation

Phases Development in sprints Requirements Planning Ontology Spec/ Refactoring Review Application Dev Observation: Ontology frequently modified Techniques for change management make methodology more successful

Miscellaneous Addressed key risks of an OWL-based solution Performance - Benchmarking Limited tool support Web service interfaces for KB Ongoing evaluation of alternatives Tech transfer to software developers Code and documentation Demos and training Development Ontology design was done with the assistance of domain experts and end users CiSoft researcher acting as Product Owner for Scrum team moved research into deployment

Outline Integrated Asset Management Objectives Role of semantic web Software development methodology IAM Ontology Ontology design Change management and dirty queries Remarks Lessons learnt Areas of interest

Ontology Design Ontology design divided into three levels to improve modularity Domain independent/upper ontologies Concepts common to all ontologies like time, units etc. Domain ontology Model of the elements in the asset Uses elements from upper ontologies Application specific ontologies: Elements specific to a given application or workflow Uses elements from upper and domain ontologies Domain Ontology MDC DSE Events Application Specific Ontologies Tool specific Ontologies

IAM Ontologies: Domain Ontology

IAM Ontologies: Metadata ontology Entities from domain ontology Metadata for data objects

Implementation OWL data store + SPARQL querying Current implementation uses Jena OWL API Two reasoners Rule based reasoner (Jena) Tableaux reasoner (Pellet) OWL data stored in Jena RDBMS, file system Web service API to abstract data store (Apache Axis2) Various applications that use MDC

Supporting Iterative Development Requirements Planning Ontology Spec Review Application Dev Requirements Planning Ontology Spec Review Application Dev Demo/ Feedback Demo/ Feedback Demo/ Feedback Ontologies are modified in every sprint

Change Management Problem Knowledge Base KB Ontology O SPARQL QUERIES Q Δ K Code C Δ O Message schemas X Δ Q Δ C Ontology O Knowledge Base KB SPARQL QUERIES Q Δ M Code C Message schemas X Detect dirty queries that are invalidated when an ontology is modified

Dirty Queries OWL graphsspace of RDF graphs Q2Dirty WF T,OWL TOWL Q1 EXT T (Q) (WF T,OWL \ WF T,OWL )!= Φ V EXT T (Q) (WF T,OWL \ WF T,OWL )!= Φ

Change handling Detect ontology changes Evaluate Query, EXT(Q) Compute the impact/semantics of changes, WF T,OWL \ WF T,OWL Match query and changes Capture Change Query Evaluation Semantics of Change Detect Dirty Queries

Implementation Protégé plugin Jena, Pellet, SPARQL parser

Outline Integrated Asset Management Objectives Role of semantic web Software development methodology IAM Ontology Ontology design Change management and dirty queries Remarks Lessons learnt Areas of interest

Lessons learnt Ontology design Plan schema changes carefully and do not change schema often Keep OWL ontology small and modular; use OWL imports Performance Track performance through product development cycle Consider performance enhancing components (caching) in architecture Be cognizant of OWL features your tool supports Very few are fully compliant with standards Design for change Use SPARQL querying Separate KB querying components from business logic and UI Active area of work- expect big improvements soon

Features we missed SPARQL Rollup/aggregation queries. E.g. get the aggregate of OOIP for region as sum of OOIPs of contained regions Results as triples XPath like expressions. E.g. get sub-tree under X Updating materialized OWL knowledge bases Solved problem in research Better XML-OWL/RDF interoperability SPARQL-XML (?) OWL/RDF- XML (Gloze)

Areas of interest Ontology extension Modeling events Capturing data provenance Performance improvements Developing representative benchmarks Evaluating various RDF triple stores Algorithms for parallel OWL inferencing Change management Managing evolution of schema and instance data Efficient techniques to track changes to OWL KBs

Some of our publications R. Soma, Viktor Prasanna, Detecting dirty queries during iterative development of OWLbased applications, 7th International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE 2008), Monterrey, Mexico, Nov 11-13, 2008. R. Soma, Viktor Prasanna, Parallel Inferencing for OWL Knowledge Bases, International Conference on Parallel Processing (ICPP-2008), September 2008. R. Soma, Viktor Prasanna, A Data Partitioning Approach for Parallelizing Rule Based Inferencing for Materialized OWL Knowledge Bases, International Conference on Parallel and Distributed Computing and Communication Systems (PDCCS), September 2008. R. Soma, Amol Bakshi, Viktor Prasanna, W. DaSie and B. Bourgeois, Semantic-web technologies for Oil-field Management, SPE Intelligent Energy Conference and Exhibition, April 2008. R. Soma, Amol Bakshi, Viktor Prasanna, A Semantic Framework for Integrated Asset Management, Proceedings of The Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid), 2007 R. Soma, A. Bakshi, V. K. Prasanna, and W. Da Sie, A Model-Based Framework for Developing and Deploying Data Aggregation Workflows, 4th International Conference on Service Oriented Computing (ICSOC), December 2006.

Backup

Change capture Well studied problem All changes to OWL, representation, capture.. Use Protégé plugin

Query evaluation Evaluate triple patterns (TP) Projecting TP to WF T,OWL Observations: All OWL statements are either type, property or identity assertions Triple pattern can have variable or constant in each of its 3 places: 2*2*2= 8 types of triple patterns Evaluate graph pattern Based on semantics of connectors

Semantics of change Not all changes modify WF Lexical Changes: Names of entities, properties, easy to handle Extensional: Modifies WF Assertional: Does not change WF but adds rules Cardinality: Does not change WF but adds/removes constraints Determine WF OWL \ WF OWL from changes About 50 kinds of changes to OWL ontology WF T,OWL

Matching Aggregate changes Handle Lexical change: String search/replace Compare extension of query with semantics of change If they have some element in common dirty E.g. EXT(Q) = P (ALL_Persons X rdf:type X Person) U P(ALL_Persons X IOP X I U L) Sem(ch) = {ALL_Persons!= ALL_Persons}