Customer experiences in implemen0ng SKOS- based vocabulary management systems, Ralph Hodgson, TopQuadrant. CWI, Amsterdam, April 3, 2014

Similar documents
An industry perspective on deployed semantic interoperability solutions

How semantic technology can help you do more with production data. Doing more with production data

Experiences from a Large Scale Ontology-Based Application Development

TopBraid Insight for Life Sciences

Logical Semantic Warehouse - Developing Your Own Semantic Ecosystem Peter Lawrence, TopQuadrant

Seman&c Web: Benefits For Clinical Decision Support At The Bedside. Emory Fry, MD SemTechBiz 2013

TopBraid Life Sciences Insight

SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES

Introduction to SKOS. Bob DuCharme October 6, 2011

TopQuadrant-Syngenta Webcast July 10, 2014 Semantic Data Virtualization: Extracting More Value from Data Silos

Common IT solutions for The Norwegian Continental Shelf (NCS)

OWL: Path to Massive Deployment. Dean Allemang Chief Scien0st, TopQuadrant Inc.

Graph Database Performance: An Oracle Perspective

W3C PROV FAMILY OF SPECIFICATIONS: AN UPDATE

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe

The Foundations of Successful Reference Data Management

Get More Value from Your Reference Data Make it Meaningful with TopBraid RDM

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

CMMI for High-Performance with TSP/PSP

Industry 4.0 and Big Data

STAR Semantic Technologies for Archaeological Resources.

Getting Real with Policies for Software Defined Infrastructure. Manish Dave Principal Engineer, Intel IT

Semantic Interoperability

Department of Defense. Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Solving today's challenges with Oracle SOA Suite, and Oracle Coherence

Bank of America Security by Design. Derrick Barksdale Jason Gillam

A Seman(c Approach to Cloud Infrastructure Management at Freudenberg IT

Business Analysis Standardization A Strategic Mandate. John E. Parker CVO, Enfocus Solu7ons Inc.

EPIM LogisticsHub (ELH)

Using Open Source software and Open data to support Clinical Trial Protocol design

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

B2B Offerings. Helping businesses op2mize. Infolob s amazing b2b offerings helps your company achieve maximum produc2vity

A R o a d t o y o u r C l o u d. Professional Service. C R M a n d C l o u d C o n s u l t i n g

The Ontological Approach for SIEM Data Repository

A Sematic Web-Based Framework for Quality Assurance of Electronic Medical Records Data for Secondary Use

THE STATE OF THE DATA WAREHOUSE

Return on Experience on Cloud Compu2ng Issues a stairway to clouds. Experts Workshop Nov. 21st, 2013

Amit Sheth & Ajith Ranabahu, Presented by Mohammad Hossein Danesh

The Data Reservoir. 10 th September Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons

Screenshot Tour. Copyright 2013 TopQuadrant Inc. TopBraid Suite Supporting the Complete Semantic Application Lifecycle

Open Data Integration Using SPARQL and SPIN

Open Source egovernment Reference Architecture Osera.modeldriven.org. Copyright 2006 Data Access Technologies, Inc. Slide 1

LDIF - Linked Data Integration Framework

/Endpoint Security and More Rondi Jamison

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

Filtering the Web to Feed Data Warehouses

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

Natural Language Processing in the EHR Lifecycle

ON DEMAND ACCESS TO BIG DATA. Peter Haase fluid Operations AG

Publishing Linked Data Requires More than Just Using a Tool

NoSQL and Agility. Why Document and Graph Stores Rock November 12th, 2015 COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED

Exchange of experience from a SuccessFactors LMS Implementa9on

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

MAXIMIZING THE SUCCESS OF YOUR E-PROCUREMENT TECHNOLOGY INVESTMENT. How to Drive Adop.on, Efficiency, and ROI for the Long Term

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Using SPARQL/OWL for Validation of Smart Grid Standards

Everything You Need to Know about Cloud BI. Freek Kamst

Project Por)olio Management

Effec%ve AX 2012 Upgrade Project Planning and Microso< Sure Step. Arbela Technologies

Fixed Scope Offering (FSO) for Oracle SRM

Drupal.

How to write a Bachelor s Thesis in Cogni4ve and Decision Sciences? Gilles Du4lh

E6895 Advanced Big Data Analytics Lecture 4:! Data Store

Building your cloud porbolio APS Connect

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG

bigdata Managing Scale in Ontological Systems

Sustainable Development with Geospatial Information Leveraging the Data and Technology Revolution

VoIP Security How to prevent eavesdropping on VoIP conversa8ons. Dmitry Dessiatnikov

Telephone Related Queries (TeRQ) IETF 85 (Atlanta)

So#ware- based CyberSecurity. Michael Butler Gennaro Parlato Electronic and So.ware Systems (ESS)

Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS

How To Make Sense Of Data With Altilia

Anzo Smart Data Integra/on

Application of ontologies for the integration of network monitoring platforms

Drupal and the Media Industry. Stéphane Corlosquet EMWRT IX, Sept 2013, Amsterdam

Web services in corporate semantic Webs. On intranets and extranets too, a little semantics goes a long way. Fabien.Gandon@sophia.inria.

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

How To Use An Orgode Database With A Graph Graph (Robert Kramer)

IBM Rational DOORS Next Generation

Semantic Web Success Story

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

BIG DATA & the Data Warehouse

Model Driven Interoperability through Semantic Annotations using SoaML and ODM

A collaborative platform for knowledge management

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Modeling and mining large scale biological seman0c networks using NEO4J

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Big Data in Subsea Solutions

Presente e futuro del Web Semantico

Case Studies in Solving Testing Constraints using Service Virtualization

Geospatial Technology Innovations and Convergence

Semantic Web Tool Landscape

An Ontology Model for Organizing Information Resources Sharing on Personal Web

Transcription:

LDBC Consor*um Fourth Technical User Community (TUC) mee*ng Customer experiences in implemen0ng SKOS- based vocabulary management systems, and other Seman0c- Technology- Driven Systems. Ralph Hodgson, TopQuadrant CWI, Amsterdam, April 3, 2014 v5 http://www.ldbc.eu:8090/display/tuc/fourth+tuc+meeting,+april+2014 Copyright 2014 TopQuadrant Inc. Slide 1

Content Introduc*ons Overview of TopBraid Case Study Overviews Deeper Dive on One Case Study: EPIM Concluding Remarks Copyright 2014 TopQuadrant Inc. Slide 2

Introduc*ons Me - Ralph Hodgson v co- founder and CTO of TopQuadrant, Inc., a US- headquartered company that specializes in seman0c solu0ons, consul0ng, training, and plahorms; v NASA QUDT Ontologies and Handbook Lead v EPIM Lead Seman0c Applica0ons Architect and Ontology Modeler TopQuadrant v Seman0c Technology Products, Solu0ons and Tools, training and consul0ng, headquartered in the US with offices in Europe v Innovator of SPARQL- based technologies: SPIN, SPARQLMo0on, SWP, SWA Copyright 2014 TopQuadrant Inc. Slide 3

The TopBraid Technology Stack Copyright 2014 TopQuadrant Inc. Slide 4

TopBraid Services Copyright 2014 TopQuadrant Inc. Slide 5

Illustrative Enterprise Customers Digital Media Life Sciences Other Copyright 2014 TopQuadrant Inc. Slide 6

These customers use TopBraid to: Mayo Clinic: re- integrate and enhance access to knowledge across research, educa*on and clinical prac*ce Syngenta: help scien*sts to develop insights into research data using databases and informa*on sources both internal and external EPIM: establish a standards- based knowledge plarorm for data exchange receiving, valida*ng, storing, analyzing and transmitng reports OTPP: enable data to be searched without a PhD in SQL Copyright 2014 TopQuadrant Inc. Slide 7

Case Study: Mayo Clinic Enhance Value of Mayo s Knowledge Ini*a*ve: Knowledge Content Management System (KCMS) Went Live 5 January 2014 with over 5.6 million page views per day! Enhanced Search Taxonomy management Run- *me terminology services Copyright 2014 TopQuadrant Inc. Slide 8

KCMS Conceptual Architecture TopBraid EVN (single server) Terminology governance and content tagging (mulnple servers) Serving producnon terminologies in real Nme for enhancing search and content enrichment. Copyright 2014 TopQuadrant Inc. Slide 9

Typical EVN SKOS Customers Don t necessarily or always use SKOS meta and master data management is an important area for our customers and SKOS doesn t fit well Use between 5 and 100 vocabularies Have vocabularies of varying sizes, on average, less than 10 M triples overall Has complex requirements for management of changes, promo*ons of changes, comparison, user permissions, rollback, data quality rules, constraints, valida*ons As a result, edit *me user interfaces can rarely be delivered by a single query O_en uses a combina*on of EVN (for edi*ng) and TopBraid Live (for read only web services access) Copyright 2014 TopQuadrant Inc. Slide 10

EVN is more than SKOS: an example configured for an ontology of oil and gas facility asset management Copyright 2014 TopQuadrant Inc. Slide 11

EVN is an extensible model- driven applica*on: Example read service (SKOS) TopBraid EVN ships with > 30 template services, e.g.: Creating a new service can take as little as an hour (often less) customers implement their own custom services based on SPARQL templates. Copyright 2014 TopQuadrant Inc. Slide 12

Case Study: Syngenta Many disconnected and diverse data sources Need to solve problems by using insights built from the components of meaning - questions, concepts, perspectives Topbraid Insight offers a layer of connec*on and meaning to the user and protects them from the distrac*ng mechanics of data access. Copyright 2014 TopQuadrant Inc. Slide 13

Documents/IP Model- driven Data Virtualiza*on: TopBraid Insight provides a unified query interface over mul*ple data sources Real time, on-demand datamarts Physical Data Warehouse Sample Test Results Any source SIDER Chembl Chebi DrugBank Copyright 2014 TopQuadrant Inc. Slide 14

Queries may need to be staged and/or nested improvedby hastarget haspathway includes codedby Disease Drug Target Pathway Protein Gene Multiple queries may be involved by exploring links Federated data sources are often relational some may be SPARQL endpoints and other sources Copyright 2014 TopQuadrant Inc. Slide 15

Case Study: EPIM Copyright 2014 TopQuadrant Inc. Slide 16

Suppor*ng EPIM s Vision for Oil & Gas Solu*ons Build a shared suite of knowledge based- applica*ons using Seman*c technology and industry- standard domain concepts i.e. a SemanNc Ecosystem for the Oil and Gas Industry on the Norwegian Con*nental Shelf EnvironmentHub ReportingHub LogisticsHub Copyright 2013 TopQuadrant Inc. Slide 17

EPIM ReportingHub (ERH) Architecture in production for nearly two years now Operators on the NCS Semantic Processing DDR Authorities MPR DDR MPR License Partners DPR ISO 15926 Inside Semantic 15926 Reporting Analytics Copyright 2014 TopQuadrant Inc. Slide 18

Scale of ERH Solution 300 million triples in the next 4 years Poten*al for many more if data sources added or historical data imported 40+ concurrent users User interfaces and Service interfaces required Poten*al for many more if data sources added Phase 1 XML Schemas have 2000- ish elements resul*ng ontologies have 900- ish classes and 900- ish proper*es Delivered on an SaaS basis with high availability Service Level Agreement Secondary app server running at all *mes RDF database replica*on and warm fallover Copyright 2014 TopQuadrant Inc. Slide 19

Co- exis*ng in the XML World Copyright 2014 TopQuadrant Inc. Slide 20

EPIM EnvironmentHub (EEH) Example User Interface Copyright 2014 TopQuadrant Inc. Slide 21

EEH Example Query Genera*ng a Report Table (KLIF 3.1) 1 of 3 Copyright 2014 TopQuadrant Inc. Slide 22

EEH Example Query Genera*ng a Report Table (KLIF 3.1) 2 of 3 SELECT?Vanntype?Totalt_vannvolum_UNITm3_SUM ((IF((?Vann_til_sj_UNITm3_SUM!= 0.0), ((?Olje_til_sj_UNITtonn_SUM /?Vann_til_sj_UNITm3_SUM) * 1e6), "")) AS?Midlere_oljeinnhold_UNITmg_per_l) ((IF((1.0!= 0.0), (?Midlere_oljevedheng_p_sand_UNITg_per_kg_SUM /?counter_sum), 0.0)) AS?Midlere_oljevedheng_p_sand_UNITg_per_kg)?Olje_til_sj_UNITtonn_SUM?Injisert_vann_UNITm3_SUM?Vann_til_sj_UNITm3_SUM?Eksportert_prod_vann_UNITm3_SUM?Importert_prod_vann_UNITm3_SUM FROM <http://www.environmenthub.no/env/schema/1.0/interface-model> FROM <http://www.environmenthub.no/data/npd/facts-interface> FROM <http://eeh.testing/ttl/3.1-oseberg> WHERE { { BIND ("Produsert,Fortregning,Drenasje,Jetting,Annet" AS?Vanntype_LIST).?Vanntype spif:split (?Vanntype_LIST "," ). { SELECT?Vanntype ((SUM(?Totalt_vannvolum_UNITm3_SUM_TMP)) AS?Totalt_vannvolum_UNITm3_SUM) ((SUM(?Midlere_oljevedheng_p_sand_UNITg_per_kg_SUM_TMP)) AS?Midlere_oljevedheng_p_sand_UNITg_per_kg_SUM) ((SUM(?counter)) AS?counter_SUM) ((SUM(?Olje_til_sj_UNITtonn_SUM_TMP)) AS?Olje_til_sj_UNITtonn_SUM) ((SUM(?Injisert_vann_UNITm3_SUM_TMP)) AS?Injisert_vann_UNITm3_SUM) ((SUM(?Vann_til_sj_UNITm3_SUM_TMP)) AS?Vann_til_sj_UNITm3_SUM) ((SUM(?Eksportert_prod_vann_UNITm3_SUM_TMP)) AS?Eksportert_prod_vann_UNITm3_SUM) ((SUM(?Importert_prod_vann_UNITm3_SUM_TMP)) AS?Importert_prod_vann_UNITm3_SUM) WHERE { { BIND ("Produsert,Fortregning,Drenasje,Jetting,Annet" AS?Vanntype_LIST).?Vanntype spif:split (?Vanntype_LIST "," ). { { FILTER (?Vanntype IN ("Produsert", "Fortregning", "Drenasje", "Annet")).. Copyright 2014 TopQuadrant Inc. Slide 23

EEH Example Query Genera*ng a Report Table (KLIF 3.1) 3 of 3 } UNION {?oilywaterdischarge a eeh-ui:oilywaterdischargeproduceddrainagedisplacement.?oilywaterdischarge eeh-ui:oilywaterdischargeproduceddrainagedisplacement-watertype?watertype_constraint.?watertype_constraint rdfs:label?watertype_value. FILTER eeh-lib:matchstrings(?watertype_value,?vanntype). FILTER fn:contains("jetting",?vanntype).?oilywaterdischarge a eeh-ui:oilywaterdischargejetting. EXISTS {?oilywaterdischarge eeh-ui:oilywaterdischargejetting-oilonsolidparticles-g_per_kg eeh-ui:oilywaterdischargejetting- OilToSeaISOmethod-tonnes?testForValue. }. OPTIONAL {?oilywaterdischarge eeh-ui:oilywaterdischargejetting-oilonsolidparticles-g_per_kg?midlere_oljevedheng_p_sand_unitg_per_kg_sum_tmp_maybe. }. BIND (COALESCE(?Midlere_oljevedheng_p_sand_UNITg_per_kg_SUM_TMP_MAYBE, 0.0) AS?Midlere_oljevedheng_p_sand_UNITg_per_kg_SUM_TMP). BIND (IF(bound(?Midlere_oljevedheng_p_sand_UNITg_per_kg_SUM_TMP_MAYBE), 1, 0) AS?counter). OPTIONAL {?oilywaterdischarge eeh-ui:oilywaterdischargejetting-oiltoseaisomethod-tonnes?olje_til_sj_unittonn_tmp_maybe. }. BIND (COALESCE(?Olje_til_sj_UNITtonn_TMP_MAYBE, 0.0) AS?Olje_til_sj_UNITtonn_SUM_TMP). }. }. } GROUP BY?Vanntype }. }. } Copyright 2014 TopQuadrant Inc. Slide 24

Example of using SPIN Func*ons SELECT?Avfallstype?Beskrivelse?Avfallsnummer?Nuklider?Sendt_til_land_UNITtonn_SUM?Total_aktivitet_UNITGBq_SUM WHERE { { SELECT?Avfallstype?Beskrivelse?Avfallsnummer ((eeh-lib:sortstringlistremovingduplicates(group_concat(?nuklides; SEPARATOR=','))) AS?Nuklider) ((SUM(?Sendt_til_land_UNITtonn_SUM_TMP)) AS?Sendt_til_land_UNITtonn_SUM_TMP2) ((SUM(?Total_aktivitet_UNITGBq_SUM_TMP)) AS?Total_aktivitet_UNITGBq_SUM_TMP2) WHERE {?hazardouswasteother a eeh-ui:hazardouswasteother.?yeardata eeh-ui:facilityyearhazardouswasteother?hazardouswasteother.?hazardouswasteother eeh-ui:hazardouswasteother-category?avfallstypeu.?avfallstypeu rdfs:label?avfallstype. OPTIONAL {?hazardouswasteother eeh-ui:hazardouswasteother-description?beskrivelse_maybe. }. BIND (COALESCE(?Beskrivelse_MAYBE, "") AS?Beskrivelse).?hazardousWasteOther eeh-ui:hazardouswasteother-wastecode?avfallsnummer. FILTER fn:starts-with(?avfallsnummer, "3"). OPTIONAL {?hazardouswasteother eeh-ui:hazardouswasteother-nuclidename?nuklides_maybe. }. BIND (COALESCE(?Nuklides_MAYBE, "NULL") AS?Nuklides). BIND (eeh-lib:sumstructuretypevaluesyeardatawithoneliteralconstraint(eeh-ui:hazardouswasteother-wastecode,?avfallsnummer, eeh-ui:hazardouswasteother-sentonshore-tonnes, eeh-ui:hazardouswasteother,?yeardata) AS?Sendt_til_land_UNITtonn_SUM_TMP1). BIND (COALESCE(?Sendt_til_land_UNITtonn_SUM_TMP1, 0) AS?Sendt_til_land_UNITtonn_SUM_TMP). } GROUP BY?Avfallstype?Beskrivelse?Avfallsnummer ORDER BY (?Avfallsnummer) (?Beskrivelse) }. BIND (IF((datatype(?Total_aktivitet_UNITGBq_SUM_TMP2)!= xsd:double), "",?Total_aktivitet_UNITGBq_SUM_TMP2) AS?Total_aktivitet_UNITGBq_SUM). BIND (IF((datatype(?Sendt_til_land_UNITtonn_SUM_TMP2)!= xsd:double), "",?Sendt_til_land_UNITtonn_SUM_TMP2) AS?Sendt_til_land_UNITtonn_SUM). } Copyright 2014 TopQuadrant Inc. Slide 25

Ontology Architecture for Composite Graphs Simple View Ontologies for UI, reporting and analytics Interface Model CG_1 CG_2 CG_3 SPARQL Query-View- Transformation SPIN- Based Transformer Uses Magic Graphs <urn:x-magic: TheSchemaMapGraph: TheFieldYearSubGraph: TheTargetSchema> ISO-15926 4D Ontologies FYG_1 FYG_2 FYG_n IntegraNon Model Copyright 2014 TopQuadrant Inc. Slide 26

Transforming is controlled by a SPARQLMo*on Web Service (1 of 2) SPARQL Views: Lifting the ISO 15926 Data Graphs into the Interface Model Representation. Iterate over all official field- year graphs Use of W3C Provenance Ontology for traceability Li_ the Integra*on Model using SPIN transforms Bulk Load the transformed triples Copyright 2014 TopQuadrant Inc. Slide 27

Transforming is controlled by a SPARQLMo*on Web Service (2 of 2) SPARQL Views: Lifting the ISO 15926 Data Graphs into the Interface Model Representation. q Total triples in the Integration Model (ISO-15926 Ontologies): ~ 4,000,000 q Total triples in the Interface Model: ~400,000 Copyright 2014 TopQuadrant Inc. Slide 28

EPIM LogisticsHub (ELH) Copyright 2014 TopQuadrant Inc. Slide 29

Concluding Remarks SPARQL is more than a query language Model- driven applica*ons exploit the real power of seman*c web technologies Different kinds of solu*on types can be dis*nguished We need more kinds of benchmarks Copyright 2014 TopQuadrant Inc. Slide 30

Thank you! v rhodgson AT topquadrant.com v Twioer @ralphtq, @topquadrant v www.scribd.com/ralphtq v www.linkedmodel.org Copyright 2014 TopQuadrant Inc. Slide 31