fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries

Size: px
Start display at page:

Download "fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries"

Transcription

1 fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries Johan Montagnat CNRS, I3S lab, Modalis team on behalf of the CrEDIBLE consortium CNRS/UNS, laboratoire I3S (UMR7271), équipe MODALIS INRIA/CNRS/UNS, laboratoire I3S (UMR 7271), équipe Wimmics INSERM U1099, laboratoire LTSI, équipe MediCIS U. Picardie, laboratoire MIS, équipe Connaissances icube, journée défis masse de données Illkirch, 7/11/2014 1

2 Motivations Biomedical data High heterogeneity: images, clinical data, biomarkers, biology... Increasing amount / number of (open) sources Big Data Large-scale medical studies (statistical medical studies, epidemiology...) Data (re)analysis opportunities Multicentric studies, Translational research Centralized approaches encounter limitations Need for cross-factors analysis Linked Data Multiple data source kinds Large data volumes to transfer / archive / search Sensitive patient data / complex access control policies Need to adopt uniform data model & format Data is de facto distributed over acquisition centers icube, journée défis masse de données Illkirch, 7/11/2014 2

3 Biomedical data mediation & federation Data federation through distributed querying and query rewriting Client Query-based access to data Federator (query decomposition, planning & results federation) Remote sub-queries Site 1 Mediator (ETL or query rewriting) Mediator (ETL or query rewriting) Site 2 Heterogeneous databases schema mediation Medical data & metadata: raw data + models + processing results + models + provenance... icube, journée défis masse de données Illkirch, 7/11/2014 3

4 Domain ontology-based federation Medical domain ontology (reference model) Data querying Client Query-based access to data Federator Data alignment Remote sub-queries Site 1 Mediator Mediator Site 2 icube, journée défis masse de données Illkirch, 7/11/2014 4

5 Exploitation example: ANR GINSENG (epidemiology) Partnership with GINSENG consortium and Mnemotix SME Federation of heterogeneous epidemiology repositories Multiple epidemiology data acquisition networks Cross-correlation with external data (e.g. demographic: IGN) Mediation of the EHR (Electronic Health Record) data schema icube, journée défis masse de données Illkirch, 7/11/2014 5

6 Challenges and expertise Challenges Representation of data semantics for heterogeneous data sources biomedical ontology building Data federation distributed query engine Data mediation RDB2RDF, ontology alignment Partnership Distributed query engine Semantic representation Semantic reference I3S/Modalis INRIA/I3S/Wimmics LTSI/MediCIS MIS/Connaissances Medical applications icube, journée défis masse de données Illkirch, 7/11/2014 6

7 Scientific networking annual workshop Objectives October 2012 Semantic models usage becomes mainstream Medical systems are mostly centralized but strigent need to support multi-centric studies October 2013 Multidisciplinary workshop gathering international experts in biomedical data representation, semantic, distribution, federation, integration... Many distributed data federation engines Work on data partition schemes and data modeling Query language expressivity limitations October 2014 Towards Web-scale databases (bilion of tuples) R/W access and Semantic reasoning increasingly studied icube, journée défis masse de données Illkirch, 7/11/2014 7

8 Reference ontology 3-levels structure: foundational (DOLCE), core, domain Particular Domain-specific rules Inference abilities Endurant Languages Dataset Subjects acquisition equipment Centres DataTop ontology Current focus on measurements Derived relational schema Medical Image formats Action Dataset processings Dataset Expression acquisitions Conceptualization Inscription Studies Investigators Perdurant Medical image files Medical Datasets image expressions icube, journée défis masse de données Illkirch, 7/11/2014 Examinations 8

9 Ontology modules Modularized ontology to improve reuse and lightweightness ONL-MR-DA: MR Dataset Acquisition ONL-DP: Data Processing ONL-MSA: Mental State Assessment OntoVIP: Medical Image Simulation Wide diffusion icube, journée défis masse de données Illkirch, 7/11/2014 9

10 Data query and federation engine KGRAM (Knowledge Graph Abstract Machine) Semantic query engine: Full support of SPARQL1.1 Generic interface for heterogeneous backends Flexible architecture facilitating different deployment scenarios Mediation interface to access relational data Federated relational schema derived from the ontology icube, journée défis masse de données Illkirch, 7/11/

11 Distributed Query Processing Query federator decoupled from data sources Asynchronous querying of multiple data sources Query planning and parallel querying icube, journée défis masse de données Illkirch, 7/11/

12 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. FILTER (CONTAINS (?name, 'Bob')) } icube, journée défis masse de données Illkirch, 7/11/

13 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 icube, journée défis masse de données Illkirch, 7/11/

14 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q #1 #2 icube, journée défis masse de données Illkirch, 7/11/

15 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q #1 #2 icube, journée défis masse de données Illkirch, 7/11/

16 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 #1 #2 icube, journée défis masse de données Illkirch, 7/11/

17 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 #1 #2 icube, journée défis masse de données Illkirch, 7/11/

18 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' #1 #2 icube, journée défis masse de données Illkirch, 7/11/

19 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' #1 #2 icube, journée défis masse de données Illkirch, 7/11/

20 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' Q2'' #1 #2 icube, journée défis masse de données Illkirch, 7/11/

21 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' Q2'' #1 #2 icube, journée défis masse de données Illkirch, 7/11/

22 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' Q2'' #1 #2 icube, journée défis masse de données Illkirch, 7/11/

23 Distributed Query Processing KGRAM query processing Q SELECT?name?date WHERE {?x foaf:name?name.?x dbpedia:birthdate?date. Q1 FILTER (CONTAINS (?name, 'Bob')) } Q2 Asynchronous execution Interface KGRAM core Q Q1 Q2' Q2'' #1 #2 icube, journée défis masse de données Illkirch, 7/11/

24 Data analysis workflows Data-driven compute-intensive workflow engine icube, journée défis masse de données Illkirch, 7/11/

25 Ontology Provenance capture Concepts & Rules T1-MRI DataSet T2-MRI fmri... DataSetProcessing Segmentation Registration... A B Annotations Processing icube, journée défis masse de données Illkirch, 7/11/

26 Ontology Provenance capture Concepts & Rules T1-MRI DataSet T2-MRI fmri... DataSetProcessing Segmentation Registration... A B Img1 IsA T1-MRI Annotations Processing Img2 IsA T1-MRI icube, journée défis masse de données Illkirch, 7/11/

27 Ontology Provenance capture Concepts & Rules T1-MRI DataSet DataSetProcessing T2-MRI fmri... Segmentation Registration... A B Tool1 HasInput T1-MRI Tool1 IsA Registration Annotations Img1 IsA T1-MRI Img2 IsA T1-MRI Tool1 HasOutput Transfo Processing icube, journée défis masse de données Illkirch, 7/11/

28 Ontology Provenance capture Concepts & Rules T1-MRI DataSet T2-MRI fmri... DataSetProcessing Segmentation Registration... A B Img1 IsA T1-MRI Img2 IsA T1-MRI Tool1 HasInput T1-MRI Tool1 HasOutput Transfo Tool1 IsA Registration Annotations Img1 IsProcessedBy Tool1 Img2 IsProcessedBy Tool1 Processing Tool1 Produced Transfo1 Transfo1 IsA GlobalTransfo icube, journée défis masse de données Illkirch, 7/11/

29 Provenance summarization Fine-grained annotation traces generated at run-time Summary generated by inference rules application Produce relevant and human-tractable experiment summaries icube, journée défis masse de données Illkirch, 7/11/

30 Conclusions Query-based data federation grounded on semantic web standards (SPARQL, RDF, RDFS) Ontology-based Emphasis on query language expressivity Support for both horizontal and vertical data partitioning Broad applicability (given that ontologies are available) Reference model for data alignment and query terms Towards support of Multi-Centric studies Heterogeneous databases (data models, database engines) Inherently distibuted data sets Cross-domains (translational research) support through Linked Data icube, journée défis masse de données Illkirch, 7/11/

Disributed Query Processing KGRAM - Search Engine TOP 10

Disributed Query Processing KGRAM - Search Engine TOP 10 fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries Johan Montagnat CNRS, I3S lab, Modalis team on behalf of the CrEDIBLE

More information

fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Interrogation d'entrepôts distribués et hétérogènes

fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Interrogation d'entrepôts distribués et hétérogènes fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Interrogation d'entrepôts distribués et hétérogènes Johan Montagnat Alban Gaignard http://credible.i3s.unice.fr MI CNRS appel

More information

Fédération et analyse de données distribuées en imagerie biomédicale

Fédération et analyse de données distribuées en imagerie biomédicale Software technologies for integration of processes and data in neurosciences ConnaissancEs Distribuées en Imagerie BiomédicaLE Fédération et analyse de données distribuées en imagerie biomédicale Johan

More information

Publishing Linked Data Requires More than Just Using a Tool

Publishing Linked Data Requires More than Just Using a Tool Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,

More information

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens

Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many

More information

UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications

UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications Gaël de Chalendar CEA LIST F-92265 Fontenay aux Roses Gael.de-Chalendar@cea.fr 1 Introduction The main data sources

More information

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany joerg.brunsmann@fernuni-hagen.de

More information

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity

More information

A generic approach for data integration using RDF, OWL and XML

A generic approach for data integration using RDF, OWL and XML A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6

More information

Linked Science as a producer and consumer of big data in the Earth Sciences

Linked Science as a producer and consumer of big data in the Earth Sciences Linked Science as a producer and consumer of big data in the Earth Sciences Line C. Pouchard,* Robert B. Cook,* Jim Green,* Natasha Noy,** Giri Palanisamy* Oak Ridge National Laboratory* Stanford Center

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

Data Services @neurist and beyond

Data Services @neurist and beyond s @neurist and beyond Siegfried Benkner Department of Scientific Computing Faculty of Computer Science University of Vienna http://www.par.univie.ac.at Department of Scientific Computing Parallel Computing

More information

Supporting Change-Aware Semantic Web Services

Supporting Change-Aware Semantic Web Services Supporting Change-Aware Semantic Web Services Annika Hinze Department of Computer Science, University of Waikato, New Zealand a.hinze@cs.waikato.ac.nz Abstract. The Semantic Web is not only evolving into

More information

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial

More information

Models and Architecture for Smart Data Management

Models and Architecture for Smart Data Management 1 Models and Architecture for Smart Data Management Pierre De Vettor, Michaël Mrissa and Djamal Benslimane Université de Lyon, CNRS LIRIS, UMR5205, F-69622, France E-mail: firstname.surname@liris.cnrs.fr

More information

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology Hong-Linh Truong Institute for Software Science, University of Vienna, Austria truong@par.univie.ac.at Thomas Fahringer

More information

Databases & Data Infrastructure. Kerstin Lehnert

Databases & Data Infrastructure. Kerstin Lehnert + Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,

More information

OntoDBench: Ontology-based Database Benchmark

OntoDBench: Ontology-based Database Benchmark OntoDBench: Ontology-based Database Benchmark Stéphane Jean, Ladjel Bellatreche, Géraud Fokou, Mickaël Baron, and Selma Khouri LIAS/ISAE-ENSMA and University of Poitiers BP 40109, 86961 Futuroscope Cedex,

More information

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

LinkZoo: A linked data platform for collaborative management of heterogeneous resources LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research

More information

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery Dimitrios Kourtesis, Iraklis Paraskakis SEERC South East European Research Centre, Greece Research centre of the University

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Additional mechanisms for rewriting on-the-fly SPARQL queries proxy

Additional mechanisms for rewriting on-the-fly SPARQL queries proxy Additional mechanisms for rewriting on-the-fly SPARQL queries proxy Arthur Vaisse-Lesteven, Bruno Grilhères To cite this version: Arthur Vaisse-Lesteven, Bruno Grilhères. Additional mechanisms for rewriting

More information

Industry 4.0 and Big Data

Industry 4.0 and Big Data Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and

More information

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan

More information

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute

More information

FIPA agent based network distributed control system

FIPA agent based network distributed control system FIPA agent based network distributed control system V.Gyurjyan, D. Abbott, G. Heyes, E. Jastrzembski, C. Timmer, E. Wolin TJNAF, Newport News, VA 23606, USA A control system with the capabilities to combine

More information

SmartLink: a Web-based editor and search environment for Linked Services

SmartLink: a Web-based editor and search environment for Linked Services SmartLink: a Web-based editor and search environment for Linked Services Stefan Dietze, Hong Qing Yu, Carlos Pedrinaci, Dong Liu, John Domingue Knowledge Media Institute, The Open University, MK7 6AA,

More information

secure intelligence collection and assessment system Your business technologists. Powering progress

secure intelligence collection and assessment system Your business technologists. Powering progress secure intelligence collection and assessment system Your business technologists. Powering progress The decisive advantage for intelligence services The rising mass of data items from multiple sources

More information

BYODs & FAIR Data Stewardship

BYODs & FAIR Data Stewardship BYODs & FAIR Data Stewardship Luiz Olavo Bonino luiz.bonino@dtls.nl www.elixir-europe.org Summary FAIR Data stewardship Approach in NL BYOD FAIR Data tooling ecosystem Way of working (FAIR) Data Stewardship

More information

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France

More information

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA INDEX EXECUTIVE SUMMARY... 3 1. PREFACE... 5 1.1. Acronyms... 5

More information

urika! Unlocking the Power of Big Data at PSC

urika! Unlocking the Power of Big Data at PSC urika! Unlocking the Power of Big Data at PSC Nick Nystrom Director, Strategic Applications Pittsburgh Supercomputing Center February 1, 2013 nystrom@psc.edu 2013 Pittsburgh Supercomputing Center Big Data

More information

Software Design Document (SDD) Template

Software Design Document (SDD) Template (SDD) Template Software design is a process by which the software requirements are translated into a representation of software components, interfaces, and data necessary for the implementation phase.

More information

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF

More information

Data collection architecture for Big Data

Data collection architecture for Big Data Data collection architecture for Big Data a framework for a research agenda (Research in progress - ERP Sense Making of Big Data) Wout Hofman, May 2015, BDEI workshop 2 Big Data succes stories bias our

More information

Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition

Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition Vasant Honavar 1 and Doina Caragea 2 1 Artificial Intelligence Research Laboratory, Department of Computer Science, Iowa State

More information

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum Big Data Analysis John Domingue (STI International and The Open University) Project co-funded by the European Commission within the 7th Framework Program (Grant Agreement No. 257943) 1 The Data landscape

More information

bigdata Managing Scale in Ontological Systems

bigdata Managing Scale in Ontological Systems Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural

More information

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany. www.fokus.fraunhofer.

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany. www.fokus.fraunhofer. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany www.fokus.fraunhofer.de 1 Identification and Utilization of Components for a linked Open Data Platform

More information

Graph Database Performance: An Oracle Perspective

Graph Database Performance: An Oracle Perspective Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective

More information

Knowledge Management

Knowledge Management Knowledge Management INF5100 Autumn 2006 Outline Background Knowledge Management (KM) What is knowledge KM Processes Knowledge Management Systems and Knowledge Bases Ontologies What is an ontology Types

More information

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability

Core Enterprise Services, SOA, and Semantic Technologies: Supporting Semantic Interoperability Core Enterprise, SOA, and Semantic Technologies: Supporting Semantic Interoperability in a Network-Enabled Environment 2011 SOA & Semantic Technology Symposium 13-14 July 2011 Sven E. Kuehne sven.kuehne@nc3a.nato.int

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

Linked Statistical Data Analysis

Linked Statistical Data Analysis Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,

More information

SOA Success is Not a Matter of Luck

SOA Success is Not a Matter of Luck by Prasad Jayakumar, Technology Lead at Enterprise Solutions, Infosys Technologies Ltd SERVICE TECHNOLOGY MAGAZINE Issue L May 2011 Introduction There is nothing either good or bad, but thinking makes

More information

Fabien.Gandon@sophia.inria.fr. Semantic Web and Multi-Agent Approach to Corporate Memory Management

Fabien.Gandon@sophia.inria.fr. Semantic Web and Multi-Agent Approach to Corporate Memory Management Fabien Gandon, Rose Dieng-Kuntz, Olivier Corby, Alain Giboin Semantic Web and Multi- Approach to Corporate Memory Management Fabien Gandon, Rose Dieng-Kuntz, Olivier Corby, Alain Giboin Semantic Web and

More information

Amit Sheth & Ajith Ranabahu, 2010. Presented by Mohammad Hossein Danesh

Amit Sheth & Ajith Ranabahu, 2010. Presented by Mohammad Hossein Danesh Amit Sheth & Ajith Ranabahu, 2010 Presented by Mohammad Hossein Danesh 1 Agenda Introduction to Cloud Computing Research Motivation Semantic Modeling Can Help Use of DSLs Solution Conclusion 2 3 Motivation

More information

Ontology and automatic code generation on modeling and simulation

Ontology and automatic code generation on modeling and simulation Ontology and automatic code generation on modeling and simulation Youcef Gheraibia Computing Department University Md Messadia Souk Ahras, 41000, Algeria youcef.gheraibia@gmail.com Abdelhabib Bourouis

More information

OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT

OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT OAK Database optimizations and architectures for complex large data Ioana MANOLESCU-GOUJOT INRIA Saclay Île-de-France Université Paris Sud LRI UMR CNRS 8623 Plan 1. The team 2. Oak research at a glance

More information

«Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge

«Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge «Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge Shanoir: a solu-on for neuro- imaging data management Jus/ne Guillaumont, Isabelle

More information

Vertical Integration of Enterprise Industrial Systems Utilizing Web Services

Vertical Integration of Enterprise Industrial Systems Utilizing Web Services Vertical Integration of Enterprise Industrial Systems Utilizing Web Services A.P. Kalogeras 1, J. Gialelis 2, C. Alexakos 1, M. Georgoudakis 2, and S. Koubias 2 1 Industrial Systems Institute, Building

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

Open Ontology Repository Initiative

Open Ontology Repository Initiative Open Ontology Repository Initiative Frank Olken Lawrence Berkeley National Laboratory National Science Foundation folken@nsf.gov presented to CENDI/NKOS Workshop World Bank Sept. 11, 2008 Version 6.0 DISCLAIMER

More information

WHITE PAPER TOPIC DATE Enabling MaaS Open Data Agile Design and Deployment with CA ERwin. Nuccio Piscopo. agility made possible

WHITE PAPER TOPIC DATE Enabling MaaS Open Data Agile Design and Deployment with CA ERwin. Nuccio Piscopo. agility made possible WHITE PAPER TOPIC DATE Enabling MaaS Open Data Agile Design and Deployment with CA ERwin Nuccio Piscopo agility made possible Table of Contents Introduction 3 MaaS enables Agile Open Data Design 4 MaaS

More information

Data Grids. Lidan Wang April 5, 2007

Data Grids. Lidan Wang April 5, 2007 Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural

More information

The various steps in the solution approach are presented below.

The various steps in the solution approach are presented below. From Web 1.0 3.0: Is RDF access to RDB enough? Vipul Kashyap, Senior Medical Informatician, Partners Healthcare System, vkashyap1@partners.org Martin Flanagan, CTO, InSilico Discovery, mflanagan@insilicodiscovery.com

More information

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

BUSINESS VALUE OF SEMANTIC TECHNOLOGY BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director

More information

SCIENTIFIC workflows have recently emerged as a new

SCIENTIFIC workflows have recently emerged as a new IEEE TRANSACTIONS ON SERVICES COMPUTING, VOL. 2, NO. 1, JANUARY-MARCH 2009 79 A Reference Architecture for Scientific Workflow Management Systems and the VIEW SOA Solution Cui Lin, Student Member, IEEE,

More information

Building Semantic Content Management Framework

Building Semantic Content Management Framework Building Semantic Content Management Framework Eric Yen Computing Centre, Academia Sinica Outline What is CMS Related Work CMS Evaluation, Selection, and Metrics CMS Applications in Academia Sinica Concluding

More information

CAREER TRACKS PHASE 1 UCSD Information Technology Family Function and Job Function Summary

CAREER TRACKS PHASE 1 UCSD Information Technology Family Function and Job Function Summary UCSD Applications Programming Involved in the development of server / OS / desktop / mobile applications and services including researching, designing, developing specifications for designing, writing,

More information

Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN):

Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN): Enabling Collaboration Using the Biomedical Informatics Research Network (BIRN): Karl Helmer Ph.D. Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital June 4, 2010 BIRN

More information

Data Wrangling: From the Wild to the Lake

Data Wrangling: From the Wild to the Lake Data Wrangling: From the Wild to the Lake Ignacio Terrizzano Peter Schwarz Mary Roth John Colino IBM Research - Almaden 48 hours of video is uploaded to YouTube every minute Walmart processes million transactions

More information

Data Quality in Information Integration and Business Intelligence

Data Quality in Information Integration and Business Intelligence Data Quality in Information Integration and Business Intelligence Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies

More information

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE Big Data Europe The Big Data Aggregator The Big Data Aggregator: o A general-purpose architecture for processing Big Data o An implementation

More information

Distributed knowledge sharing and production through collaborative e-science platforms

Distributed knowledge sharing and production through collaborative e-science platforms Distributed knowledge sharing and production through collaborative e-science platforms PhD Defense - Alban Gaignard Advisor: Johan Montagnat CNRS, University of Nice Sophia Antipolis, I3S Laboratory, MODALIS

More information

Semantic Information on Electronic Medical Records (EMRs) through Ontologies

Semantic Information on Electronic Medical Records (EMRs) through Ontologies Semantic Information on Electronic Medical Records (EMRs) through Ontologies Suarez Barón M. J. Researcher, Research Center at Colombian School of Industrial Careers marcojaviersuarezbaron@gmail.com Bogotá,

More information

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG

ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES. Peter Haase fluid Operations AG ON DEMAND ACCESS TO BIG DATA THROUGH SEMANTIC TECHNOLOGIES Peter Haase fluid Operations AG fluid Operations(fluidOps) Linked Data& Semantic Technologies Enterprise Cloud Computing Software company founded

More information

Thomas Usländer Fraunhofer IITB

Thomas Usländer Fraunhofer IITB ORCHESTRA Day Stresa, 12 December 2007 ORCHESTRA Architecture - Behind the Scenes Thomas Usländer Fraunhofer IITB ORCHESTRA Consortium ORCHESTRA Ambition Analysis Maps Info Centre Archive Control centre

More information

Enable Location-based Services with a Tracking Framework

Enable Location-based Services with a Tracking Framework Enable Location-based Services with a Tracking Framework Mareike Kritzler University of Muenster, Institute for Geoinformatics, Weseler Str. 253, 48151 Münster, Germany kritzler@uni-muenster.de Abstract.

More information

In 2014, the Research Data group @ Purdue University

In 2014, the Research Data group @ Purdue University EDITOR S SUMMARY At the 2015 ASIS&T Research Data Access and Preservation (RDAP) Summit, panelists from Research Data @ Purdue University Libraries discussed the organizational structure intended to promote

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

The Ontological Approach for SIEM Data Repository

The Ontological Approach for SIEM Data Repository The Ontological Approach for SIEM Data Repository Igor Kotenko, Olga Polubelova, and Igor Saenko Laboratory of Computer Science Problems, Saint-Petersburg Institute for Information and Automation of Russian

More information

An Ontology-based Architecture for Integration of Clinical Trials Management Applications

An Ontology-based Architecture for Integration of Clinical Trials Management Applications An Ontology-based Architecture for Integration of Clinical Trials Management Applications Ravi D. Shankar, MS 1, Susana B. Martins, MD, MSc 1, Martin O Connor, MSc 1, David B. Parrish, MS 2, Amar K. Das,

More information

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc Big Data, Fast Data, Complex Data Jans Aasman Franz Inc Private, founded 1984 AI, Semantic Technology, professional services Now in Oakland Franz Inc Who We Are (1 (2 3) (4 5) (6 7) (8 9) (10 11) (12

More information

SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY

SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY SWAP: ONTOLOGY-BASED KNOWLEDGE MANAGEMENT WITH PEER-TO-PEER TECHNOLOGY M. EHRIG, C. TEMPICH AND S. STAAB Institute AIFB University of Karlsruhe 76128 Karlsruhe, Germany E-mail: {meh,cte,sst}@aifb.uni-karlsruhe.de

More information

Application of ontologies for the integration of network monitoring platforms

Application of ontologies for the integration of network monitoring platforms Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,

More information

Data Management in NeuroMat and the Neuroscience Experiments System (NES)

Data Management in NeuroMat and the Neuroscience Experiments System (NES) Data Management in NeuroMat and the Neuroscience Experiments System (NES) Kelly Rosa Braghetto Department of Computer Science Institute of Mathematics and Statistics - University of São Paulo 1st NeuroMat

More information

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Ram Soma 2, Amol Bakshi 1, Kanwal Gupta 3, Will Da Sie 2, Viktor Prasanna 1 1 University of Southern California,

More information

Leveraging ambient applications interactions with their environment to improve services selection relevancy

Leveraging ambient applications interactions with their environment to improve services selection relevancy Leveraging ambient applications interactions with their environment to improve services selection relevancy Gérald Rocher, Jean-Yves Tigli, Stéphane Lavirotte, Rahma Daikhi To cite this version: Gérald

More information

CREATING AND APPLYING KNOWLEDGE IN ELECTRONIC HEALTH RECORD SYSTEMS. Prof Brendan Delaney, King s College London

CREATING AND APPLYING KNOWLEDGE IN ELECTRONIC HEALTH RECORD SYSTEMS. Prof Brendan Delaney, King s College London CREATING AND APPLYING KNOWLEDGE IN ELECTRONIC HEALTH RECORD SYSTEMS Prof Brendan Delaney, King s College London www.transformproject.eu 7.5M European Commission March 2010-May 2015 Funded under the Patient

More information

Some Research Challenges for Big Data Analytics of Intelligent Security

Some Research Challenges for Big Data Analytics of Intelligent Security Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

Presente e futuro del Web Semantico

Presente e futuro del Web Semantico Sistemi di Elaborazione dell informazione II Corso di Laurea Specialistica in Ingegneria Telematica II anno 4 CFU Università Kore Enna A.A. 2009-2010 Alessandro Longheu http://www.diit.unict.it/users/alongheu

More information

IAAA Grupo de Sistemas de Información Avanzados

IAAA Grupo de Sistemas de Información Avanzados Upgrading maps with Linked Data Lopez Pellicer Pellicer, Francisco J Lacasta, Javier Rentería, Walter, Universidad de Zaragoza Barrera, Jesús Lopez de Larrinzar, Juan Agudo, Jose M GeoSpatiumLab The Linked

More information

How To Create A Social Web With A Free And Open Source Web Browser (W3C)

How To Create A Social Web With A Free And Open Source Web Browser (W3C) Balancing Data Utility and Privacy Protection in the Socially Aware Data Cloud Yuh-Jong Hu hu@cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

Enabling End User Access to Big Data in the O&G Industry

Enabling End User Access to Big Data in the O&G Industry Enabling End User Access to Big Data in the O&G Industry Johan W. Klüwer (DNV) and Michael Schmidt (fluidops) 1 / 28 HELLENIC REPUBLIC National and Kapodistrian University of Athens 2 / 28 . Paradigm Shift

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

DataBridges: data integration for digital cities

DataBridges: data integration for digital cities DataBridges: data integration for digital cities Thematic action line «Digital Cities» Ioana Manolescu Oak team INRIA Saclay and Univ. Paris Sud-XI Plan 1. DataBridges short history and overview 2. RDF

More information

Susanna-Assunta Sansone, PhD. Metadata WG3 chair. 3-workgroup@biocaddie.org

Susanna-Assunta Sansone, PhD. Metadata WG3 chair. 3-workgroup@biocaddie.org Susanna-Assunta Sansone, PhD Metadata WG3 chair 3-workgroup@biocaddie.org http://dx.doi.org/10.6084/m9.figshare.1362572 WG3 Metadata - Goals Define a set of metadata specifications that support intended

More information

Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data

Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data Simon Lambert Scientific Computing Department STFC UK Types of

More information

RFI Summary: Executive Summary

RFI Summary: Executive Summary RFI Summary: Executive Summary On February 20, 2013, the NIH issued a Request for Information titled Training Needs In Response to Big Data to Knowledge (BD2K) Initiative. The response was large, with

More information

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar

The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data Ravi Shankar Open access to clinical trials data advances open science Broad open access to entire clinical

More information

OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL

OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL Frédéric Dufaux, Michael Ansorge, and Touradj Ebrahimi Institut de Traitement des Signaux Ecole Polytechnique Fédérale de Lausanne (EPFL)

More information

CRM dig : A generic digital provenance model for scientific observation

CRM dig : A generic digital provenance model for scientific observation CRM dig : A generic digital provenance model for scientific observation Martin Doerr, Maria Theodoridou Institute of Computer Science, FORTH-ICS, Crete, Greece Abstract The systematic large-scale production

More information

Semantic Interoperability

Semantic Interoperability Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from

More information

Automatic Timeline Construction For Computer Forensics Purposes

Automatic Timeline Construction For Computer Forensics Purposes Automatic Timeline Construction For Computer Forensics Purposes Yoan Chabot, Aurélie Bertaux, Christophe Nicolle and Tahar Kechadi CheckSem Team, Laboratoire Le2i, UMR CNRS 6306 Faculté des sciences Mirande,

More information

MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management

MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database System in Energy Data Management Zhan Liu, Fabian Cretton, Anne Le Calvé, Nicole Glassey, Alexandre Cotting, Fabrice Chapuis

More information

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

More information

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.

More information

Lift your data hands on session

Lift your data hands on session Lift your data hands on session Duration: 40mn Foreword Publishing data as linked data requires several procedures like converting initial data into RDF, polishing URIs, possibly finding a commonly used

More information

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet PONTE Presentation CETIC Philippe Massonet EU Open Day, Cambridge, 31/01/2012 PONTE Description Efficient Patient Recruitment for Innovative Clinical Trials of Existing Drugs to other Indications Start

More information