Disributed Query Processing KGRAM - Search Engine TOP 10

Size: px
Start display at page:

Download "Disributed Query Processing KGRAM - Search Engine TOP 10"

Transcription

1 fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries Johan Montagnat CNRS, I3S lab, Modalis team on behalf of the CrEDIBLE consortium CNRS/UNS, laboratoire I3S (UMR7271), équipe MODALIS INRIA/CNRS/UNS, laboratoire I3S (UMR 7271), équipe Wimmics INSERM U1099, laboratoire LTSI, équipe MediCIS U. Picardie, laboratoire MIS, équipe Connaissances CNRS/INSERM/INSA/U. Lyon 1, laboratoire CREATIS (UMR 5220 / U1044) MI CNRS bilan 2013 Paris, 23/1/2014 1

2 Motivations Biomedical data High heterogeneity: images, clinical data, biomarkers, biology... Increasing amount / number of (open) sources Big Data Large-scale medical studies (statistical medical studies, epidemiology...) Need for cross-factors analysis Linked Data Data (re)analysis opportunities Translational research Centralized approaches encounter limitations Multiple data source kinds Large data volumes to transfer / archive / search Sensitive patient data / complex access control policies Need to adopt uniform data model & format Data is de facto distributed over acquisition centers MI CNRS bilan 2013 Paris, 23/1/2014 2

3 Biomedical data mediation & federation Data federation through distributed querying and query rewriting Client Federator (query decomposition, planning & results federation) Remote sub-queries Query-based access to data Site 1 Mediator Site 2 (ETL or query rewriting) Mediator (ETL or query rewriting) Heterogeneous databases schema mediation Medical data & metadata: raw data + models + processing results + models + provenance... MI CNRS bilan 2013 Paris, 23/1/2014 3

4 Domain ontology-based federation Medical domain ontology (reference model) Data querying Client Query-based access to data Federator Remote sub-queries Data alignment Mediator Mediator Site 1 Site 2 MI CNRS bilan 2013 Paris, 23/1/2014 4

5 Challenges and expertise Challenges Representation of data semantics for heterogeneous data sources biomedical ontology building Data federation distributed query engine Data mediation RDB2RDF, ontology alignment Partnership I3S/Modalis Distributed query engine Semantic representation Semantic reference INRIA/I3S/Wimmics LTSI/MediCIS MIS/Connaissances CREATIS Medical applications MI CNRS bilan 2013 Paris, 23/1/2014 5

6 Scientific networking annual workshop Objectives Multidisciplinary workshop gathering experts in biomedical data representation, semantic, distribution, federation, integration... October 2012: data integration, ontologies, data models, reasoning Semantic models are now widely accepted although ontologies resources are not sufficient Existing systems are mostly centralized but strigent need to support multi-centric studies October 2013: data reuse, ontologies, mediation, federation, graphs Both horizontal and vertical data partition schemes need to be addressed Data models in use are often constructed bottom-up Expressivity of the query language is important MI CNRS bilan 2013 Paris, 23/1/2014 6

7 Bibliography study Distributed (semantic) Query Processing Report CrEDIBLE-12-2, november 2012 A. Gaignard PhD thesis, march 2013, U. Nice Sophia Relational data mediation Report I3S FR, november 2013 Submitted to Journal of Web Semantics Ontology of scientific measures (data acquisition, metrology) Report CrEDIBLE-13-1, december MI CNRS bilan 2013 Paris, 23/1/2014 7

8 Reference ontology 3-levels structure: foundational (DOLCE), core, domain Domain-specific rules Particular Inference abilities DataTop ontology Current focus on measurements Derived relational schema Dataset acquisition equipment Endurant Perdurant Action Centres Dataset Investigators Expression processings Conceptualization Inscription Studies Medical image files Subjects Medical image expressions Languages Medical Image formats Datasets Dataset acquisitions Examinations MI CNRS bilan 2013 Paris, 23/1/2014 8

9 Ontology modules Modularized ontology to improve reuse and lightweightness ONL-MR-DA: MR Dataset Acquisition ONL-DP: Data Processing ONL-MSA: Mental State Assessment OntoVIP: Medical Image Simulation Wide diffusion MI CNRS bilan 2013 Paris, 23/1/2014 9

10 Data query and federation engine KGRAM (Knowledge Graph Abstract Machine) Semantic query engine: Full support of SPARQL1.1 Generic interface for heterogeneous backends Flexible architecture facilitating different deployment scenarios Mediation interface to access relational data Federated relational schema derived from the ontology MI CNRS bilan 2013 Paris, 23/1/

11 Distributed Query Processing Query federator decoupled from data sources Asynchronous querying of multiple data sources Query planning and parallel querying MI CNRS bilan 2013 Paris, 23/1/

12 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. FILTER (CONTAINS (?name, 'Bob')) } MI CNRS bilan 2013 Paris, 23/1/

13 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 MI CNRS bilan 2013 Paris, 23/1/

14 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q #1 #2 MI CNRS bilan 2013 Paris, 23/1/

15 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q #1 #2 MI CNRS bilan 2013 Paris, 23/1/

16 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core #1 Q Q1 #2 MI CNRS bilan 2013 Paris, 23/1/

17 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core #1 Q Q1 #2 MI CNRS bilan 2013 Paris, 23/1/

18 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' #1 #2 MI CNRS bilan 2013 Paris, 23/1/

19 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' #1 #2 MI CNRS bilan 2013 Paris, 23/1/

20 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core #1 Q Q1 Q2' Q2'' #2 MI CNRS bilan 2013 Paris, 23/1/

21 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core #1 Q Q1 Q2' Q2'' #2 MI CNRS bilan 2013 Paris, 23/1/

22 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core #1 Q Q1 Q2' Q2'' #2 MI CNRS bilan 2013 Paris, 23/1/

23 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core #1 Q Q1 Q2' Q2'' #2 MI CNRS bilan 2013 Paris, 23/1/

24 Performance results Mixed (relational / semantic) stores querying FedBench standard benchmark MI CNRS bilan 2013 Paris, 23/1/

25 Exploitation in ANR GINSENG (epidemiology) Partnership with GINSENG consortium and Mnemotix SME Federation of heterogeneous epidemiology repositories Multiple epidemiology data acquisition networks Cross-correlation with external data (e.g. demographic: IGN) Mediation of the EHR (Electronic Health Record) data schema MI CNRS bilan 2013 Paris, 23/1/

26 Collaboration with Pitié Salpétrière CAC database Neurodegenerative diseases (esp. Alzheimer's) Mediation of relational CAC schema Comparative study of different relational-to-rdf mediation techniques R2RML transformation language Test case for various RDB2RDF conversion tools MI CNRS bilan 2013 Paris, 23/1/

27 Perspectives DataTop ontology development Data distribution (spatial and temporal) and data collections structure Alzheimer disease Query engine architecture Code modularity for alternative optimization strategy implementation Query performance optimization Query plan improvement (especially when dealing both with horizontal and vertical partitioning) Mediation Dynamic mediation of data sources Integration of new medical data sources MI CNRS bilan 2013 Paris, 23/1/

28 Perspectives: collaborations Université de Laval au Québec Ontology design: Alzheimer's disease data ANR CONTINT 2013 BIOMIST Industrial system for large databases management (Product Lifecycle Management - PLM) Application to medical databases in the area of brain functions analysis France Life Imaging excellence infrastructure National-level infrastructure for medical data sharing and analysis MI CNRS bilan 2013 Paris, 23/1/

29 Publications Peer-reviewed Medical domain: idash image informatics workshop, sept. 2012; DCICTIA-MICCAI workshop, oct Distributed Query Processing: Web Intelligence, dec. 2012; Journal of Web Semantics (submitted) Mediation: Journal of Web Semantics (submitted) PhD thesis A. Gaignard (march 2013, U. Nice Sophia Antipolis) N. Cerezo (december 2013, U. Nice Sophia Antipolis) Research reports Workshop summaries: #12-1, #14-1 Annual reports: #12-3, #13-3 Bibliography studies: #12-2, I3S/ FR MI CNRS bilan 2013 Paris, 23/1/

30 Conclusions Query-based data federation grounded on semantic web standards (SPARQL, RDF, RDFS) Emphasis on query language expressivity Support for both horizontal and vertical data partitioning Broad applicability (given that ontologies are available) Ontology-based Reference model for data alignment and query terms Currently using Extract-Transform-Load Static conversion of data sources in RDF format Work on dynamic data mediation on-going Optimization work on-going Flexible software architecture Distributed query processing optimization MI CNRS bilan 2013 Paris, 23/1/

fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries

fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries Johan Montagnat CNRS, I3S lab, Modalis team on behalf of the CrEDIBLE

More information

fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Interrogation d'entrepôts distribués et hétérogènes

fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Interrogation d'entrepôts distribués et hétérogènes fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Interrogation d'entrepôts distribués et hétérogènes Johan Montagnat Alban Gaignard http://credible.i3s.unice.fr MI CNRS appel

More information

Fédération et analyse de données distribuées en imagerie biomédicale

Fédération et analyse de données distribuées en imagerie biomédicale Software technologies for integration of processes and data in neurosciences ConnaissancEs Distribuées en Imagerie BiomédicaLE Fédération et analyse de données distribuées en imagerie biomédicale Johan

More information

Publishing Linked Data Requires More than Just Using a Tool

Publishing Linked Data Requires More than Just Using a Tool Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,

More information

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many

More information

Additional mechanisms for rewriting on-the-fly SPARQL queries proxy

Additional mechanisms for rewriting on-the-fly SPARQL queries proxy Additional mechanisms for rewriting on-the-fly SPARQL queries proxy Arthur Vaisse-Lesteven, Bruno Grilhères To cite this version: Arthur Vaisse-Lesteven, Bruno Grilhères. Additional mechanisms for rewriting

More information

QASM: a Q&A Social Media System Based on Social Semantics

QASM: a Q&A Social Media System Based on Social Semantics QASM: a Q&A Social Media System Based on Social Semantics Zide Meng, Fabien Gandon, Catherine Faron-Zucker To cite this version: Zide Meng, Fabien Gandon, Catherine Faron-Zucker. QASM: a Q&A Social Media

More information

Graph Database Performance: An Oracle Perspective

Graph Database Performance: An Oracle Perspective Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective

More information

Leveraging ambient applications interactions with their environment to improve services selection relevancy

Leveraging ambient applications interactions with their environment to improve services selection relevancy Leveraging ambient applications interactions with their environment to improve services selection relevancy Gérald Rocher, Jean-Yves Tigli, Stéphane Lavirotte, Rahma Daikhi To cite this version: Gérald

More information

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial

More information

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany joerg.brunsmann@fernuni-hagen.de

More information

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Hong-Linh Truong Institute for Software Science, University of Vienna, Austria truong@par.univie.ac.at Thomas Fahringer

More information

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity

More information

A generic approach for data integration using RDF, OWL and XML

A generic approach for data integration using RDF, OWL and XML A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6

More information

Enable Location-based Services with a Tracking Framework

Enable Location-based Services with a Tracking Framework Enable Location-based Services with a Tracking Framework Mareike Kritzler University of Muenster, Institute for Geoinformatics, Weseler Str. 253, 48151 Münster, Germany kritzler@uni-muenster.de Abstract.

More information

Supporting Change-Aware Semantic Web Services

Supporting Change-Aware Semantic Web Services Supporting Change-Aware Semantic Web Services Annika Hinze Department of Computer Science, University of Waikato, New Zealand a.hinze@cs.waikato.ac.nz Abstract. The Semantic Web is not only evolving into

More information

DataBridges: data integration for digital cities

DataBridges: data integration for digital cities DataBridges: data integration for digital cities Thematic action line «Digital Cities» Ioana Manolescu Oak team INRIA Saclay and Univ. Paris Sud-XI Plan 1. DataBridges short history and overview 2. RDF

More information

bigdata Managing Scale in Ontological Systems

bigdata Managing Scale in Ontological Systems Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural

More information

OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT

OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT INRIA Saclay Île-de-France Université Paris Sud LRI UMR CNRS 8623 Plan 1. The team 2. Oak research at a glance

More information

OntoDBench: Ontology-based Database Benchmark

OntoDBench: Ontology-based Database Benchmark OntoDBench: Ontology-based Database Benchmark Stéphane Jean, Ladjel Bellatreche, Géraud Fokou, Mickaël Baron, and Selma Khouri LIAS/ISAE-ENSMA and University of Poitiers BP 40109, 86961 Futuroscope Cedex,

More information

Industry 4.0 and Big Data

Industry 4.0 and Big Data Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and

More information

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France

More information

Enabling End User Access to Big Data in the O&G Industry

Enabling End User Access to Big Data in the O&G Industry Enabling End User Access to Big Data in the O&G Industry Johan W. Klüwer (DNV) and Michael Schmidt (fluidops) 1 / 28 HELLENIC REPUBLIC National and Kapodistrian University of Athens 2 / 28 . Paradigm Shift

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

Some Research Challenges for Big Data Analytics of Intelligent Security

Some Research Challenges for Big Data Analytics of Intelligent Security Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute

More information

Linked Science as a producer and consumer of big data in the Earth Sciences

Linked Science as a producer and consumer of big data in the Earth Sciences Linked Science as a producer and consumer of big data in the Earth Sciences Line C. Pouchard,* Robert B. Cook,* Jim Green,* Natasha Noy,** Giri Palanisamy* Oak Ridge National Laboratory* Stanford Center

More information

Geospatial Platforms For Enabling Workflows

Geospatial Platforms For Enabling Workflows Geospatial Platforms For Enabling Workflows Steven Hagan Vice President Oracle Database Server Technologies November, 2015 Evolution of Enabling Workflows HENRY FORD 100 YEARS AGO Industrialized the Manufacturing

More information

Automatic Timeline Construction For Computer Forensics Purposes

Automatic Timeline Construction For Computer Forensics Purposes Automatic Timeline Construction For Computer Forensics Purposes Yoan Chabot, Aurélie Bertaux, Christophe Nicolle and Tahar Kechadi CheckSem Team, Laboratoire Le2i, UMR CNRS 6306 Faculté des sciences Mirande,

More information

«Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge

«Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge «Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge Shanoir: a solu-on for neuro- imaging data management Jus/ne Guillaumont, Isabelle

More information

Linked Statistical Data Analysis

Linked Statistical Data Analysis Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,

More information

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet PONTE Presentation CETIC Philippe Massonet EU Open Day, Cambridge, 31/01/2012 PONTE Description Efficient Patient Recruitment for Innovative Clinical Trials of Existing Drugs to other Indications Start

More information

Big Data Architect Certification Self-Study Kit Bundle

Big Data Architect Certification Self-Study Kit Bundle Big Data Architect Certification Bundle This certification bundle provides you with the self-study materials you need to prepare for the exams required to complete the Big Data Architect Certification.

More information

BYODs & FAIR Data Stewardship

BYODs & FAIR Data Stewardship BYODs & FAIR Data Stewardship Luiz Olavo Bonino luiz.bonino@dtls.nl www.elixir-europe.org Summary FAIR Data stewardship Approach in NL BYOD FAIR Data tooling ecosystem Way of working (FAIR) Data Stewardship

More information

UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications

UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications Gaël de Chalendar CEA LIST F-92265 Fontenay aux Roses Gael.de-Chalendar@cea.fr 1 Introduction The main data sources

More information

The various steps in the solution approach are presented below.

The various steps in the solution approach are presented below. From Web 1.0 3.0: Is RDF access to RDB enough? Vipul Kashyap, Senior Medical Informatician, Partners Healthcare System, vkashyap1@partners.org Martin Flanagan, CTO, InSilico Discovery, mflanagan@insilicodiscovery.com

More information

Ontology-based Archetype Interoperability and Management

Ontology-based Archetype Interoperability and Management Ontology-based Archetype Interoperability and Management Catalina Martínez-Costa, Marcos Menárguez-Tortosa, J. T. Fernández-Breis Departamento de Informática y Sistemas, Facultad de Informática Universidad

More information

Big Data Governance Certification Self-Study Kit Bundle

Big Data Governance Certification Self-Study Kit Bundle Big Data Governance Certification Bundle This certification bundle provides you with the self-study materials you need to prepare for the exams required to complete the Big Data Governance Certification.

More information

Open Ontology Repository Initiative

Open Ontology Repository Initiative Open Ontology Repository Initiative Frank Olken Lawrence Berkeley National Laboratory National Science Foundation folken@nsf.gov presented to CENDI/NKOS Workshop World Bank Sept. 11, 2008 Version 6.0 DISCLAIMER

More information

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany. www.fokus.fraunhofer.

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany. www.fokus.fraunhofer. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany www.fokus.fraunhofer.de 1 Identification and Utilization of Components for a linked Open Data Platform

More information

UniGR Workshop: Big Data «The challenge of visualizing big data»

UniGR Workshop: Big Data «The challenge of visualizing big data» Dept. ISC Informatics, Systems & Collaboration UniGR Workshop: Big Data «The challenge of visualizing big data» Dr Ir Benoît Otjacques Deputy Scientific Director ISC The Future is Data-based Can we help?

More information

Models and Architecture for Smart Data Management

Models and Architecture for Smart Data Management 1 Models and Architecture for Smart Data Management Pierre De Vettor, Michaël Mrissa and Djamal Benslimane Université de Lyon, CNRS LIRIS, UMR5205, F-69622, France E-mail: firstname.surname@liris.cnrs.fr

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

Data Services @neurist and beyond

Data Services @neurist and beyond s @neurist and beyond Siegfried Benkner Department of Scientific Computing Faculty of Computer Science University of Vienna http://www.par.univie.ac.at Department of Scientific Computing Parallel Computing

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Semantic Information on Electronic Medical Records (EMRs) through Ontologies

Semantic Information on Electronic Medical Records (EMRs) through Ontologies Semantic Information on Electronic Medical Records (EMRs) through Ontologies Suarez Barón M. J. Researcher, Research Center at Colombian School of Industrial Careers marcojaviersuarezbaron@gmail.com Bogotá,

More information

Databases & Data Infrastructure. Kerstin Lehnert

Databases & Data Infrastructure. Kerstin Lehnert + Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,

More information

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors... Cloud-based Spatial Data Infrastructures for Smart Cities Geospatial World Forum 2015 Hans Viehmann Product Manager EMEA ORACLE Corporation Smart Cities require Geospatial Data Providing services to citizens,

More information

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES Peter Haase fluid Operations AG fluid Operations(fluidOps) Linked Data& Semantic Technologies Enterprise Cloud Computing Software company founded

More information

Fabien.Gandon@sophia.inria.fr. Semantic Web and Multi-Agent Approach to Corporate Memory Management

Fabien.Gandon@sophia.inria.fr. Semantic Web and Multi-Agent Approach to Corporate Memory Management Fabien Gandon, Rose Dieng-Kuntz, Olivier Corby, Alain Giboin Semantic Web and Multi- Approach to Corporate Memory Management Fabien Gandon, Rose Dieng-Kuntz, Olivier Corby, Alain Giboin Semantic Web and

More information

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan

More information

The Ontological Approach for SIEM Data Repository

The Ontological Approach for SIEM Data Repository The Ontological Approach for SIEM Data Repository Igor Kotenko, Olga Polubelova, and Igor Saenko Laboratory of Computer Science Problems, Saint-Petersburg Institute for Information and Automation of Russian

More information

FIPA agent based network distributed control system

FIPA agent based network distributed control system FIPA agent based network distributed control system V.Gyurjyan, D. Abbott, G. Heyes, E. Jastrzembski, C. Timmer, E. Wolin TJNAF, Newport News, VA 23606, USA A control system with the capabilities to combine

More information

Principles of Distributed Database Systems

Principles of Distributed Database Systems M. Tamer Özsu Patrick Valduriez Principles of Distributed Database Systems Third Edition

More information

Susanna-Assunta Sansone, PhD. Metadata WG3 chair. 3-workgroup@biocaddie.org

Susanna-Assunta Sansone, PhD. Metadata WG3 chair. 3-workgroup@biocaddie.org Susanna-Assunta Sansone, PhD Metadata WG3 chair 3-workgroup@biocaddie.org http://dx.doi.org/10.6084/m9.figshare.1362572 WG3 Metadata - Goals Define a set of metadata specifications that support intended

More information

Big Data Governance Certification Self-Study Kit Bundle

Big Data Governance Certification Self-Study Kit Bundle Big Data Governance Certification Bundle This certification bundle provides you with the self-study materials you need to prepare for the exams required to complete the Big Data Governance Certification.

More information

DISCOVERING RESUME INFORMATION USING LINKED DATA

DISCOVERING RESUME INFORMATION USING LINKED DATA DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India sic@klyuniv.ac.in 2 Department of Computer

More information

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF

More information

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data Ravi Shankar Open access to clinical trials data advances open science Broad open access to entire clinical

More information

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

BUSINESS VALUE OF SEMANTIC TECHNOLOGY BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director

More information

Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN):

Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN): Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN): Karl Helmer Ph.D. Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital June 4, 2010 BIRN

More information

In 2014, the Research Data group @ Purdue University

In 2014, the Research Data group @ Purdue University EDITOR S SUMMARY At the 2015 ASIS&T Research Data Access and Preservation (RDAP) Summit, panelists from Research Data @ Purdue University Libraries discussed the organizational structure intended to promote

More information

SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY

SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY M. EHRIG, C. TEMPICH AND S. STAAB Institute AIFB University of Karlsruhe 76128 Karlsruhe, Germany E-mail: {meh,cte,sst}@aifb.uni-karlsruhe.de

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management

MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management Zhan Liu, Fabian Cretton, Anne Le Calvé, Nicole Glassey, Alexandre Cotting, Fabrice Chapuis

More information

Presente e futuro del Web Semantico

Presente e futuro del Web Semantico Sistemi di Elaborazione dell informazione II Corso di Laurea Specialistica in Ingegneria Telematica II anno 4 CFU Università Kore Enna A.A. 2009-2010 Alessandro Longheu http://www.diit.unict.it/users/alongheu

More information

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design

COURSE OUTLINE. Track 1 Advanced Data Modeling, Analysis and Design COURSE OUTLINE Track 1 Advanced Data Modeling, Analysis and Design TDWI Advanced Data Modeling Techniques Module One Data Modeling Concepts Data Models in Context Zachman Framework Overview Levels of Data

More information

Semantic Information Retrieval from Distributed Heterogeneous Data Sources

Semantic Information Retrieval from Distributed Heterogeneous Data Sources Semantic Information Retrieval from Distributed Heterogeneous Sources K. Munir, M. Odeh, R. McClatchey, S. Khan, I. Habib CCS Research Centre, University of West of England, Frenchay, Bristol, UK Email

More information

Semantically Steered Clinical Decision Support Systems

Semantically Steered Clinical Decision Support Systems Semantically Steered Clinical Decision Support Systems By Eider Sanchez Herrero Department of Computer Science and Artificial Intelligence University of the Basque Country Advisors Prof. Manuel Graña Romay

More information

CRM dig : A generic digital provenance model for scientific observation

CRM dig : A generic digital provenance model for scientific observation CRM dig : A generic digital provenance model for scientific observation Martin Doerr, Maria Theodoridou Institute of Computer Science, FORTH-ICS, Crete, Greece Abstract The systematic large-scale production

More information

Ontology and automatic code generation on modeling and simulation

Ontology and automatic code generation on modeling and simulation Ontology and automatic code generation on modeling and simulation Youcef Gheraibia Computing Department University Md Messadia Souk Ahras, 41000, Algeria youcef.gheraibia@gmail.com Abdelhabib Bourouis

More information

EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations

EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations EIT ICT Labs MASTER SCHOOL DSS Programme Specialisations DSS EIT ICT Labs Master Programme Distributed System and Services (Cloud Computing) The programme in Distributed Systems and Services focuses on

More information

How semantic technology can help you do more with production data. Doing more with production data

How semantic technology can help you do more with production data. Doing more with production data How semantic technology can help you do more with production data Doing more with production data EPIM and Digital Energy Journal 2013-04-18 David Price, TopQuadrant London, UK dprice at topquadrant dot

More information

A RDF Vocabulary for Spatiotemporal Observation Data Sources

A RDF Vocabulary for Spatiotemporal Observation Data Sources A RDF Vocabulary for Spatiotemporal Observation Data Sources Karine Reis Ferreira 1, Diego Benincasa F. C. Almeida 1, Antônio Miguel Vieira Monteiro 1 1 DPI Instituto Nacional de Pesquisas Espaciais (INPE)

More information

Knowledge Management

Knowledge Management Knowledge Management INF5100 Autumn 2006 Outline Background Knowledge Management (KM) What is knowledge KM Processes Knowledge Management Systems and Knowledge Bases Ontologies What is an ontology Types

More information

TopQuadrant-Syngenta Webcast July 10, 2014 Semantic Data Virtualization: Extracting More Value from Data Silos

TopQuadrant-Syngenta Webcast July 10, 2014 Semantic Data Virtualization: Extracting More Value from Data Silos TopQuadrant-Syngenta Webcast July 10, 2014 Semantic Data Virtualization: Extracting More Value from Data Silos Featuring Syngenta's report on its successful pilot Webcast Agenda Overview of Problem and

More information

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc Big Data, Fast Data, Complex Data Jans Aasman Franz Inc Private, founded 1984 AI, Semantic Technology, professional services Now in Oakland Franz Inc Who We Are (1 (2 3) (4 5) (6 7) (8 9) (10 11) (12

More information

WHITE PAPER TOPIC DATE Enabling MaaS Open Data Agile Design and Deployment with CA ERwin. Nuccio Piscopo. agility made possible

WHITE PAPER TOPIC DATE Enabling MaaS Open Data Agile Design and Deployment with CA ERwin. Nuccio Piscopo. agility made possible WHITE PAPER TOPIC DATE Enabling MaaS Open Data Agile Design and Deployment with CA ERwin Nuccio Piscopo agility made possible Table of Contents Introduction 3 MaaS enables Agile Open Data Design 4 MaaS

More information

Agenda. Overview. Federation Requirements. Panlab IST034305 Teagle for Partners

Agenda. Overview. Federation Requirements. Panlab IST034305 Teagle for Partners Agenda Panlab IST034305 Teagle for Partners Sebastian Wahle, sebastian.wahle@fokus.fraunhofer.de Overview Testbed Federation Requirements Panlab Roles Federation Architecture Functional Components of Teagle

More information

Aiding the Data Integration in Medicinal Settings by Means of Semantic Technologies

Aiding the Data Integration in Medicinal Settings by Means of Semantic Technologies Aiding the Data Integration in Medicinal Settings by Means of Semantic Technologies Vit Novacek 1 Loredana Laera 2 Siegfried Handschuh 1 1 Digital Enterprise Research Institute (DERI) National University

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

Dementia Ambient Care: Multi-Sensing Monitoring for Intelligent Remote Management and Decision Support

Dementia Ambient Care: Multi-Sensing Monitoring for Intelligent Remote Management and Decision Support Dementia Ambient Care: Multi-Sensing Monitoring for Intelligent Remote Management and Decision Support Alexia Briassouli Informatics & Telematics Institute Introduction Instances of dementia increasing

More information

SmartLink: a Web-based editor and search environment for Linked Services

SmartLink: a Web-based editor and search environment for Linked Services SmartLink: a Web-based editor and search environment for Linked Services Stefan Dietze, Hong Qing Yu, Carlos Pedrinaci, Dong Liu, John Domingue Knowledge Media Institute, The Open University, MK7 6AA,

More information

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum Big Data Analysis John Domingue (STI International and The Open University) Project co-funded by the European Commission within the 7th Framework Program (Grant Agreement No. 257943) 1 The Data landscape

More information

Creating an RDF Graph from a Relational Database Using SPARQL

Creating an RDF Graph from a Relational Database Using SPARQL Creating an RDF Graph from a Relational Database Using SPARQL Ayoub Oudani, Mohamed Bahaj*, Ilias Cherti Department of Mathematics and Informatics, University Hassan I, FSTS, Settat, Morocco. * Corresponding

More information

urika! Unlocking the Power of Big Data at PSC

urika! Unlocking the Power of Big Data at PSC urika! Unlocking the Power of Big Data at PSC Nick Nystrom Director, Strategic Applications Pittsburgh Supercomputing Center February 1, 2013 nystrom@psc.edu 2013 Pittsburgh Supercomputing Center Big Data

More information

ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca

ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca USING ONTOLOGIES FOR GEOGRAPHIC INFORMATION INTEGRATION Frederico Torres Fonseca The Pennsylvania State University, USA Keywords: ontologies, GIS, geographic information integration, interoperability Contents

More information

Application of ontologies for the integration of network monitoring platforms

Application of ontologies for the integration of network monitoring platforms Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,

More information

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible Taming Big Data Variety with Semantic Graph Databases Evren Sirin CTO Complexible About Complexible Semantic Tech leader since 2006 (née Clark & Parsia) software, consulting W3C leadership Offices in DC

More information

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot www.etidaho.com (208) 327-0768 Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot 3 Days About this Course This course is designed for the end users and analysts that

More information

On the Requirements for Cooperative Assistance in the Medical Domain

On the Requirements for Cooperative Assistance in the Medical Domain On the Requirements for Cooperative Assistance in the Medical Domain L. Ardissono, A. Di Leva, G. Petrone, M. Segnan and M. Sonnessa Dipartimento di Informatica, Università di Torino, corso Svizzera 185,

More information

ezdi s semantics-enhanced linguistic, NLP, and ML approach for health informatics

ezdi s semantics-enhanced linguistic, NLP, and ML approach for health informatics ezdi s semantics-enhanced linguistic, NLP, and ML approach for health informatics Raxit Goswami*, Neil Shah* and Amit Sheth*, ** ezdi Inc, Louisville, KY and Ahmedabad, India. ** Kno.e.sis-Wright State

More information

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Ram Soma 2, Amol Bakshi 1, Kanwal Gupta 3, Will Da Sie 2, Viktor Prasanna 1 1 University of Southern California,

More information

Filtering the Web to Feed Data Warehouses

Filtering the Web to Feed Data Warehouses Witold Abramowicz, Pawel Kalczynski and Krzysztof We^cel Filtering the Web to Feed Data Warehouses Springer Table of Contents CHAPTER 1 INTRODUCTION 1 1.1 Information Systems 1 1.2 Information Filtering

More information

How To Create A Social Web With A Free And Open Source Web Browser (W3C)

How To Create A Social Web With A Free And Open Source Web Browser (W3C) Balancing Data Utility and Privacy Protection in the Socially Aware Data Cloud Yuh-Jong Hu hu@cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

RFI Summary: Executive Summary

RFI Summary: Executive Summary RFI Summary: Executive Summary On February 20, 2013, the NIH issued a Request for Information titled Training Needs In Response to Big Data to Knowledge (BD2K) Initiative. The response was large, with

More information

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA INDEX EXECUTIVE SUMMARY... 3 1. PREFACE... 5 1.1. Acronyms... 5

More information

Lift your data hands on session

Lift your data hands on session Lift your data hands on session Duration: 40mn Foreword Publishing data as linked data requires several procedures like converting initial data into RDF, polishing URIs, possibly finding a commonly used

More information

secure intelligence collection and assessment system Your business technologists. Powering progress

secure intelligence collection and assessment system Your business technologists. Powering progress secure intelligence collection and assessment system Your business technologists. Powering progress The decisive advantage for intelligence services The rising mass of data items from multiple sources

More information