Provenance-Centered Dataset of Drug-Drug Interactions
|
|
|
- Muriel Robertson
- 10 years ago
- Views:
Transcription
1 Provenance-Centered Dataset of Drug-Drug Interactions Juan M. Banda 1, Tobias Kuhn 2,3, Nigam H. Shah 1, and Michel Dumontier 1 1 Stanford University - Center for Biomedical Informatics Research, 1265 Welch Road, Stanford, CA, 94305, USA [email protected], [email protected], [email protected] 2 Department of Humanities, Social and Political Sciences, ETH Zurich, Switzerland 3 Department of Computer Science, VU University Amsterdam, Netherlands [email protected] Abstract. Over the years several studies have demonstrated the ability to identify potential drug-drug interactions via data mining from the literature (MEDLINE), electronic health records, public databases (Drugbank), etc. While each one of these approaches is properly statistically validated, they do not take into consideration the overlap between them as one of their decision making variables. In this paper we present LInked Drug-Drug Interactions (LIDDI), a public nanopublication-based RDF dataset with trusty URIs that encompasses some of the most cited prediction methods and sources to provide researchers a resource for leveraging the work of others into their prediction methods. As one of the main issues to overcome the usage of external resources is their mappings between drug names and identifiers used, we also provide the set of mappings we curated to be able to compare the multiple sources we aggregate in our dataset. Keywords: drug-drug interactions, nanopublications, data mining 1 Introduction Studies analyzing costs over time have shown that adverse drug reactions (ADRs) cost over $136 billion a year [13]. One significant cause of ADRs are drug-drug interactions (DDIs) which greatly affect older adults due to the multiple drugs they are taking [3]. A DDI occurs when the effect of any given drug is altered by another drug which results in an unpredictable effect. While new drugs, before market approval, are tested in both in vivo and in vitro methods [19], it is unfeasible to test their interactions with all other approved and experimental drugs. In the recent years, computational approaches have been trying to infer potential DDI signals using a wide variety of sources [6,9,16,17], however these methods produce thousands of statistically plausible predictions [6,8], thus making the task of testing them in an experimental setting impractical. We are therefore facing the need of combining data from various disparate sources, but at the same time have to remain aware of their provenance due
2 Fig. 1. Overview of the Linked Drug-Drug Interactions dataset and the data sources that it incorporates and their original formats to differences in individual quality and overall confidence. Depending on their primary source and their provenance path, two data entries about the same DDI might substantially increase our confidence about a relation. By relying on Semantic Web technologies such as RDF and more specifically nanopublications [7] these provenance paths can be represented in an explicit and uniform manner. In this paper we introduce LInked Drug-Drug Interactions (LIDDI), a dataset that consolidates multiple data sources in one cohesive linked place that allows researchers to have immediate access to multiple collections of DDI predictions from public databases and reports, biomedical literature, and methods that take different and sometimes more comprehensive approaches. With great potential for use in drug safety studies, LIDDI provides the linking components to branch from DDIs into individual properties of each drug via Drugbank and UMLS, as well as mappings to other types of biomedical ontologies which are used to describe drug-drug interactions. In terms of direct applications, the resources in our dataset allow researchers to quickly determine if there is support by other DDI sources for their own predictions, allowing them to evaluate them via consensus rather than only by independent statistical measures (AUROC values, odds ratios, etc). LIDDI offers all the mappings needed to bridge the incorporated resources into a single comparable entity, providing extra value for researchers looking to bridge the data sources we have connected, for example Drugbank drug identifiers to UMLS Concept Unique Identifiers (CUI)
3 to Medical Subject Headings (MeSH) codes, opening endless possibilities for reuse. To our knowledge, this is the first public dataset exemplifying the use of nanopublications for the integration of knowledge from multiple diverse data sources in the field of drug safety surveillance. We believe it will facilitate crossreferencing of results between different domains not previously available, as well as the enrichment of drugs involved in the DDIs thanks to the linkages provided. 2 Methods 2.1 Nanopublications with Trusty URIs Nanopublications [7] are a concept to use Semantic Web techniques (most importantly RDF and named graphs) to closely link data to their provenance and meta-data in a uniform manner. The vision is that small data packages should become the primary format for research outputs instead of narrative articles [15]. Technically, a nanopublication consists of an assertion graph with triples expressing an atomic statement (about drug-drug interactions in our case), a provenance graph that reports how this assertion came about (e.g. where it was extracted from or what mechanism was used to derive it), and a publication information graph that provides meta-data for the nanopublication (such as its creators and a timestamp). Trusty URIs [11,12] are a recent proposal to make URI links for digital artifacts verifiable, immutable, and permanent. Such URI identifiers contain a cryptographic hash value calculated on the content of the represented digital artifact i.e. RDF content of nanopublications in our case in a format-independent way. With trusty URIs, any reference to an artifact thereby comes with the possibility to verify with 100% confidence that a retrieved file really represents the correct and original state of that resource. 2.2 Data Schema We represent DDIs in the following way: We instantiate the DDI class from the Semantic science Integrated Ontology (SIO) [5]. We define a vocabulary in our own namespace, as is normally done with Bio2RDF [4], to assign domain-specific predicates to point from the DDI to each drug, and to the clinically-relevant event resulting from the drug interaction. This assertion is stored in the assertion graph of the nanopublication. We also create nanopublications to capture the mappings between each drug and our seven sources. We use BioPortal PURL identifiers for UMLS URIs. We use the PROV ontology [14] to capture the provenance of each DDI by linking the assertion graph URI to a PROV activity. This activity is linked to the software used to generate the drug and event mappings, as well as to the citation for the method, the generation time, and the dataset where the DDIs were extracted from. In this last case we also provide a direct link to the original dataset used. Figure 2 highlights our general schema with a particular example for a DDI that contains a drug-drug set.
4 Fig. 2. Dataset Schema shows two types of nanopublications included in the dataset. The first captures the DDI, and the second captures drug mappings.a third one not show captures the event mappings. 3 Data sources aggregated In this section we provide details about the transformations we performed on each of the data sources to be able to normalize them and use them together. Due to space constraints we provide a very high-level view of how each source procures and generates its data as this lies out of the scope of this dataset paper. All the mappings provided within our dataset for UMLS concept unique identifiers (CUI), RxNORM codes, MeSH codes and Drugbank identifiers, were done as an initial step once we acquired all our sources. Electronic Health Records (EHR). In [9] the authors used Stanford Translational Research Integrated Database Environment (STRIDE) dataset comprising 9 million unstructured clinical notes corresponding to 1 million patients that span a period of 18 years ( ) to mine DDIs from the EHR. Published as a comma delimited file in the supplemental materials, we use the highest confidence predictions as the initial basis of our dataset in terms of drugs (345) and events (10). This dataset provides drug-drug-event sets of UMLS concept unique identifiers (CUI), drug/event names in string literals. FDA Adverse Event Reporting System (FAERS). We analyzed over 3.2 million reports found on the FDA Adverse Event Reporting System (FAERS), since there are no established algorithmic ways of determining the statistical significance of a drug-drug interaction from FAERS reports without any additional external information, we set a threshold of each DDI to have at least ten appearances in different FAERS reports for it to appear on our dataset. In this data source, the drugs are given in string literals that normalized to RxNorm drug names and the events normalized to MeDRA. DrugBank. We used the RDF versions of Drugbank [18] made publicly available through Bio2RDF. The used version was dated as We proceeded to strip the Drugbank identifiers and their labels into string literals mapped
5 Fig. 3. Text-tagging pipeline. We build a term/concept dictionary from ontologies in UMLS. We then proceed to tag concepts found in any piece of text from any source via Unitex, resulting in a bag of UMLS CUIs for each input. into RxNorm/SNOMED string literals. In order to extract DDIs we used the drugbank vocabulary:drug-drug-interaction type to match interacting drugs, we then used the rdfs:label to extracted the event by concept tagging the indication section with a modified version of the NCBO annotator used in [9]. This concept tagging process is described on Figure 3. Drugs.com. In order to mine DDIs from this resource, we used a web-crawling agent developed in Python. We programmatically checked the individual drugs.com page for each of the 345 drugs in our set and extracted all their interacting drugs. For each of the interacting drugs we build the event part of our DDI and by annotating the complementary free-text section, the same way as on Figure 3, which describes the interaction (similar to the Drugbank indication). MEDLINE. In order to find potential DDIs signals in the biomedical literature, we have adapted a clever method developed by Avilach et al. [1], that allows for the use of a query with certain MeSH terms and qualifiers to find adverse events in the literature. We modified this query by adding a MeSH term for Drug interactions and modified the search to allow for two drugs instead of one from the original approach. These modifications of the method with some manipulation produces drug-drug-event sets. TWOSIDES. Provided as a download by Tatonetti et al. [16], TWOSIDES contains polypharmacy side effects for pairs of drugs. Mined from FDAs FAERS, this methods is designed to complement modern signal detection approaches, providing stratification, dampening or removing the effect of co-variates, without needing to divide drug-exposed reports into strata. This resource is available in a CSV format and features drugs in RxNorm normalized string literal form and events in UMLS CUI form. INferring Drug Interactions (INDI). Currently available on a website for checking drug-drug interactions and provided to us by the authors [6] in a comma separated file. This prediction method infers both pharmacodynamic and pharmacokinetic interactions based on their similarity to existing known adverse events that result from a common metabolizing enzyme (CYP). The provided resources consisted of drug-drug sets (no events) in UMLS CUI form.
6 Table 1. LIDDI DDI event distribution between sources Event Name EHR MEDLINE Drugbank Drugs.com FAERS TWOSIDES INDI* Protocol* Arrhythmia , ,632 NA* NA* Bradycardia , ,824 NA* NA* Hyperkalaemia 1, , ,840 NA* NA* Hypoglycaemia 1, , ,150 NA* NA* Long QT syndrome , NA* NA* Neutropenia 4, , ,702 NA* NA* Pancytopenia 1, , ,440 NA* NA* Parkinsonism , NA* NA* Rhabdomyolysis , ,264 NA* NA* Serotonin syndrome , ,094 NA* NA* Total: 11,966 1,248 4,260 43,160 1,716 32,830 8, Similarity-based Modeling Protocol (Protocol). Provided to us by the authors, Vilar et al. [17], this protocol integrates a gold standard of DDIs with drug similarity information extracted from sources like: 2D and 3D molecular structure, interaction profile, target similarities, and side-effect similarities. This method generates drug interaction candidates that are traceable to pharmacological or clinical effects. We were provided with the resulting drug-drug (no events) sets with Drugbank identifiers. 4 Statistics and Access Table 1 shows the total number of DDIs found with support in each of the multiple sources we integrated. Note that these are not unique DDIs in the sense that we have a drug1-drug2-event set that is equivalent to drug2-drug1- event set as the directionality does not matter, thus counting them twice. We left this reflexive drug drug interactions in our dataset to conform to the standard employed by bio2rdf s DrugBank repository, in order to not limit the discovery of any potential interaction when a query is performed on any given single drug. The dataset in its entirety contains a total of 98,085 nanopublications out of which 345 are used for drug mappings, 10 for event mappings and the remainder for DDIs extracted from the data sources we used. The dataset has a total of 392,340 graphs (four per nanopublications) and 2,051,959 triples, taking 723 MB in nquads representation. LIDDI can be accessed as a bulk download via figshare [2] and SPARQL endpoint at: dataset is also made available via the recently installed nanopublication server network [10]. The command np from the Java nanopublication library 1 can be used to download the dataset [20]: $ np get -c -o out.trig RA7SuQ0e661LJdKpt5EOS2DKykf1ht9LFmNaZtFSDMrXg 1
7 5 Potential for applications and future work The first application that this dataset has been used for involves ranking DDI predictions from EHR data. By taking the EHR data source published by [9] and using this resource, researchers are ranking predictions in order to determine which ones are more feasible for experimental evaluation. By leveraging such diverse resources, the researchers have been able to use the data sources as voting mechanisms for certain DDIs to be prioritized over others. Similar studies can be performed to determine the priority of the other data sources and their predictions just using LIDDI and selecting a different resource to evaluate. The authors of each data source could enhance their own predictions by using their method and consider other sources we compiled in this dataset as goldstandards or at least silver-standards as there is no de facto community-wide gold standard for drug safety when it comes to drug-drug interactions. Multiple applications can revolve around enhancing the DDIs listed here by enriching the drugs and events with properties available in the Drugbank, RxNorm and MeSH linked graphs that are not easily accessible via traditional means. This might lead to even further linkage of other resources tying all things back to a described DDI and helping to further explain its reason and impact. As more researchers become aware of this resource they should be encouraged to contribute to it by adding their own DDI predictions, thus enhancing the overall availability of DDI predictions sets for scientific comparisons. We will continue to contact authors and map their data sets into LIDDI to have a more diverse and rich set of resources. We also plan at some point incorporate more drugs and more events into the resource as they become available for it to have a wider coverage of the current drug space. References 1. P. Avillach, J.-C. Dufour, G. Diallo, F. Salvo, M. Joubert, F. Thiessard, F. Mougin, G. Trifirò, A. Fourrier-Réglat, A. Pariente, and M. Fieschi. Design and validation of an automated method to detect known adverse drug reactions in medline: a contribution from the eu adr project. Journal of the American Medical Informatics Association, 20(3): , J. M. Banda, T. Kuhn, N. H. Shah, and M. Dumontier. Liddi: Provenance-centered dataset of drug-drug interactions. figshare July R. L. Bushardt, E. B. Massey, T. W. Simpson, J. C. Ariail, and K. N. Simpson. Polypharmacy: Misleading, but manageable. Clinical Interventions in Aging, 3(2): , [pmid] Clin Interv Aging. 4. A. Callahan, J. Cruz-Toledo, P. Ansell, and M. Dumontier. Bio2rdf release 2: Improved coverage, interoperability and provenance of life science linked data. In The Semantic Web: Semantics and Big Data, pages Springer, M. Dumontier, C. J. Baker, J. Baran, A. Callahan, L. L. Chepelev, J. Cruz-Toledo, R. Nicholas, D. Rio, G. Duck, L. I. Furlong, et al. The semanticscience integrated ontology (sio) for biomedical research and knowledge discovery. J. Biomedical Semantics, 5:14, 2014.
8 6. A. Gottlieb, G. Y. Stein, Y. Oron, E. Ruppin, and R. Sharan. Indi: a computational framework for inferring drug interactions and their associated recommendations. Molecular Systems Biology, 8: , [pmid] Mol Syst Biol. 7. P. Groth, A. Gibson, and J. Velterop. The anatomy of a nano-publication. Information Services and Use, 30(1):51 56, K. Haerian, D. Varn, S. Vaidya, L. Ena, H. S. Chase, and C. Friedman. Detection of pharmacovigilance-related adverse events using electronic health records and automated methods. Clinical pharmacology and therapeutics, 92(2): , S. V. Iyer, R. Harpaz, P. LePendu, A. Bauer-Mehren, and N. H. Shah. Mining clinical text for signals of adverse drug-drug interactions. J Am Med Inform Assoc, 21(2):353 62, T. Kuhn, C. Chichester, M. Krauthammer, and M. Dumontier. Publishing without publishers: a decentralized approach to dissemination, retrieval, and archiving of data. In Proceedings of ISWC 2015, Lecture Notes in Computer Science. Springer, T. Kuhn and M. Dumontier. Trusty URIs: Verifiable, immutable, and permanent digital artifacts for linked data. In Proceedings of ESWC Springer, T. Kuhn and M. Dumontier. Making digital artifacts on the web verifiable and reliable. IEEE Transactions on Knowledge and Data Engineering, J. Lazarou, B. H. Pomeranz, and P. N. Corey. Incidence of adverse drug reactions in hospitalized patients: A meta-analysis of prospective studies. JAMA, 279(15): , /jama T. Lebo et al. PROV-O: The PROV ontology. Recommendation, W3C, B. Mons, H. van Haagen, C. Chichester, J. T. den Dunnen, G. van Ommen, E. van Mulligen, B. Singh, R. Hooft, M. Roos, J. Hammond, et al. The value of data. Nature genetics, 43(4): , N. P. Tatonetti, P. P. Ye, R. Daneshjou, and R. B. Altman. Data-driven prediction of drug effects and interactions. Science Translational Medicine, 4(125):125ra31 125ra31, /scitranslmed S. Vilar, E. Uriarte, L. Santana, T. Lorberbaum, G. Hripcsak, C. Friedman, and N. P. Tatonetti. Similarity-based modeling in large-scale prediction of drug-drug interactions. Nat. Protocols, 9(9): , D. S. Wishart, C. Knox, A. C. Guo, D. Cheng, S. Shrivastava, D. Tzur, B. Gautam, and M. Hassanali. Drugbank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Research, 36(Database issue):d901 D906, [pmid] Nucleic Acids Res. 19. L. Zhang, Y. Zhang, P. Zhao, and S.-M. Huang. Predicting drugdrug interactions: An fda perspective. The AAPS Journal, 11(2): , Linked Drug-Drug Interactions (LIDDI) dataset. Nanopublication index http: //np.inn.ac/ra7suq0e661ljdkpt5eos2dkykf1ht9lfmnaztfsdmrxg, 17 July 2015.
Knowledge discovery from biological Big Data : scalability issues
Knowledge discovery from biological Big Data : scalability issues Marie-Dominique Devignes, Malika Smaïl, Emmanuel Bresso, Adrien Coulet, Chedy Raïssi, Amedeo Napoli Université de Lorraine, LORIA laboratory
LinkZoo: A linked data platform for collaborative management of heterogeneous resources
LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research
AgroPortal. a proposition for ontologybased services in the agronomic domain
AgroPortal a proposition for ontologybased services in the agronomic domain Clément Jonquet, Esther Dzalé-Yeumo, Elizabeth Arnaud, Pierre Larmande Why ontologies? Why an ontology repository? 2 Biologist
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011)
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Purpose This Request
Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies
Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative
Towards Personalized Medicine: Leveraging Patient Similarity and Drug Similarity Analytics
Towards Personalized Medicine: Leveraging Patient Similarity and Drug Similarity Analytics Ping Zhang, PhD, Fei Wang, PhD, Jianying Hu, PhD, Robert Sorrentino, MD Healthcare Analytics Research Group, IBM
LDIF - Linked Data Integration Framework
LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany [email protected],
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
Connecting Basic Research and Healthcare Big Data
Elsevier Health Analytics WHS 2015 Big Data in Health Connecting Basic Research and Healthcare Big Data Olaf Lodbrok Managing Director Elsevier Health Analytics [email protected] t +49 89 5383 600
Linked Statistical Data Analysis
Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,
Security Issues for the Semantic Web
Security Issues for the Semantic Web Dr. Bhavani Thuraisingham Program Director Data and Applications Security The National Science Foundation Arlington, VA On leave from The MITRE Corporation Bedford,
The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar
The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data Ravi Shankar Open access to clinical trials data advances open science Broad open access to entire clinical
City Data Pipeline. A System for Making Open Data Useful for Cities. [email protected]
City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com
Experiences from a Large Scale Ontology-Based Application Development
Experiences from a Large Scale Ontology-Based Application Development Ontology Summit 2012 David Price, TopQuadrant Copyright 2012 TopQuadrant Inc 1 Agenda Customer slides explaining EPIM ReportingHub
DISCOVERING RESUME INFORMATION USING LINKED DATA
DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India [email protected] 2 Department of Computer
Supporting Change-Aware Semantic Web Services
Supporting Change-Aware Semantic Web Services Annika Hinze Department of Computer Science, University of Waikato, New Zealand [email protected] Abstract. The Semantic Web is not only evolving into
A leader in the development and application of information technology to prevent and treat disease.
A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today
> Semantic Web Use Cases and Case Studies
> Semantic Web Use Cases and Case Studies Case Study: Applied Semantic Knowledgebase for Detection of Patients at Risk of Organ Failure through Immune Rejection Robert Stanley 1, Bruce McManus 2, Raymond
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
Survey of Canadian and International Data Management Initiatives. By Diego Argáez and Kathleen Shearer
Survey of Canadian and International Data Management Initiatives By Diego Argáez and Kathleen Shearer on behalf of the CARL Data Management Working Group (Working paper) April 28, 2008 Introduction Today,
SHared Access Research Ecosystem (SHARE)
SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This
An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials
ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity
Using Open Source software and Open data to support Clinical Trial Protocol design
Using Open Source software and Open data to support Clinical Trial Protocol design Nikolaos Matskanis, Joseph Roumier, Fabrice Estiévenart {nikolaos.matskanis, joseph.roumier, fabrice.estievenart}@cetic.be
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article
Semantic Knowledge Management System. Paripati Lohith Kumar. School of Information Technology
Semantic Knowledge Management System Paripati Lohith Kumar School of Information Technology Vellore Institute of Technology University, Vellore, India. [email protected] Abstract The scholarly activities
Semantic Interoperability
Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from
Integration of Polish National Bibliography within the repository platform for science and humanities
Marcin Roszkowski Integration of Polish National Bibliography within the repository platform for science and humanities The best thing to do to your data will be thought of by somebody else W3C LLD Agenda
A Framework of User-Driven Data Analytics in the Cloud for Course Management
A Framework of User-Driven Data Analytics in the Cloud for Course Management Jie ZHANG 1, William Chandra TJHI 2, Bu Sung LEE 1, Kee Khoon LEE 2, Julita VASSILEVA 3 & Chee Kit LOOI 4 1 School of Computer
Industry 4.0 and Big Data
Industry 4.0 and Big Data Marek Obitko, [email protected] Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and
Model Driven Interoperability through Semantic Annotations using SoaML and ODM
Model Driven Interoperability through Semantic Annotations using SoaML and ODM JiuCheng Xu*, ZhaoYang Bai*, Arne J.Berre*, Odd Christer Brovig** *SINTEF, Pb. 124 Blindern, NO-0314 Oslo, Norway (e-mail:
Semantic Workflows and the Wings Workflow System
To Appear in AAAI Fall Symposium on Proactive Assistant Agents, Arlington, VA, November 2010. Assisting Scientists with Complex Data Analysis Tasks through Semantic Workflows Yolanda Gil, Varun Ratnakar,
DDI Lifecycle: Moving Forward Status of the Development of DDI 4. Joachim Wackerow Technical Committee, DDI Alliance
DDI Lifecycle: Moving Forward Status of the Development of DDI 4 Joachim Wackerow Technical Committee, DDI Alliance Should I Wait for DDI 4? No! DDI Lifecycle 4 is a long development process DDI Lifecycle
Secure Semantic Web Service Using SAML
Secure Semantic Web Service Using SAML JOO-YOUNG LEE and KI-YOUNG MOON Information Security Department Electronics and Telecommunications Research Institute 161 Gajeong-dong, Yuseong-gu, Daejeon KOREA
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge
Building Semantic Content Management Framework
Building Semantic Content Management Framework Eric Yen Computing Centre, Academia Sinica Outline What is CMS Related Work CMS Evaluation, Selection, and Metrics CMS Applications in Academia Sinica Concluding
ProteinQuest user guide
ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for
MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
MarkLogic Semantics in Healthcare and Life Sciences for LIDER The Only Enterprise NoSQL Database Search & Query ACID Transactions High Availability / Disaster Recovery Replication Government-grade Security
So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS
ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS Tina Hernandez-Boussard, PhD, MPH, MS Director, Surgical Health Services Research Unit Assistant Professor
Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment
Cross-Border Challenges in Informatics with a Focus on Disease Surveillance and Utilising Big-Data L. Stoicu-Tivadar et al. (Eds.) 2014 The authors. This article is published online with Open Access by
Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution
Semantic Exploration of Archived Lifecycle Metadata under Schema and Instance Evolution Jörg Brunsmann Faculty of Mathematics and Computer Science, University of Hagen, D-58097 Hagen, Germany [email protected]
We have big data, but we need big knowledge
We have big data, but we need big knowledge Weaving surveys into the semantic web ASC Big Data Conference September 26 th 2014 So much knowledge, so little time 1 3 takeaways What are linked data and the
SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA
SEMANTIC WEB BASED INFERENCE MODEL FOR LARGE SCALE ONTOLOGIES FROM BIG DATA J.RAVI RAJESH PG Scholar Rajalakshmi engineering college Thandalam, Chennai. [email protected] Mrs.
Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo
DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo Expected Outcomes You will learn: Basic concepts related to ontologies Semantic model Semantic web Basic features of RDF and RDF
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France
Data Publishing Workflows with Dataverse
Data Publishing Workflows with Dataverse Mercè Crosas, Ph.D. Twitter: @mercecrosas Director of Data Science Institute for Quantitative Social Science, Harvard University MIT, May 6, 2014 Intro to our Data
TRANSFoRm: Vision of a learning healthcare system
TRANSFoRm: Vision of a learning healthcare system Vasa Curcin, Imperial College London Theo Arvanitis, University of Birmingham Derek Corrigan, Royal College of Surgeons Ireland TRANSFoRm is partially
A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1
A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 Yannis Stavrakas Vassilis Plachouras IMIS / RC ATHENA Athens, Greece {yannis, vplachouras}@imis.athena-innovation.gr Abstract.
REACCH PNA Data Management Plan
REACCH PNA Data Management Plan Regional Approaches to Climate Change (REACCH) For Pacific Northwest Agriculture 875 Perimeter Drive MS 2339 Moscow, ID 83844-2339 http://www.reacchpna.org [email protected]
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY
ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute
PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet
PONTE Presentation CETIC Philippe Massonet EU Open Day, Cambridge, 31/01/2012 PONTE Description Efficient Patient Recruitment for Innovative Clinical Trials of Existing Drugs to other Indications Start
Dendro: collaborative research data management built on linked open data
Dendro: collaborative research data management built on linked open data João Rocha da Silva João Aguiar Castro Faculdade de Engenharia da Universidade do Porto/INESC TEC, Portugal, {joaorosilva,joaoaguiarcastro}@gmail.com
DATA MODEL FOR STORAGE AND RETRIEVAL OF LEGISLATIVE DOCUMENTS IN DIGITAL LIBRARIES USING LINKED DATA
DATA MODEL FOR STORAGE AND RETRIEVAL OF LEGISLATIVE DOCUMENTS IN DIGITAL LIBRARIES USING LINKED DATA María Hallo 1, Sergio Luján-Mora 2, and Alejandro Mate 3 1 Department of Computer Science, National
DC2AP Metadata Editor: A Metadata Editor for an Analysis Pattern Reuse Infrastructure
DC2AP Metadata Editor: A Metadata Editor for an Analysis Pattern Reuse Infrastructure Douglas Alves Peixoto, Lucas Francisco da Matta Vegi, Jugurta Lisboa-Filho Departamento de Informática, Universidade
Semantic Stored Procedures Programming Environment and performance analysis
Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski
Implementing Ontology-based Information Sharing in Product Lifecycle Management
Implementing Ontology-based Information Sharing in Product Lifecycle Management Dillon McKenzie-Veal, Nathan W. Hartman, and John Springer College of Technology, Purdue University, West Lafayette, Indiana
How To Write A Blog Post On Globus
Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, [email protected] Jim Pruyne, University of Chicago Computation Institute, [email protected]
CitationBase: A social tagging management portal for references
CitationBase: A social tagging management portal for references Martin Hofmann Department of Computer Science, University of Innsbruck, Austria [email protected] Ying Ding School of Library and Information Science,
An industry perspective on deployed semantic interoperability solutions
An industry perspective on deployed semantic interoperability solutions Ralph Hodgson, CTO, TopQuadrant SEMIC Conference, Athens, April 9, 2014 https://joinup.ec.europa.eu/community/semic/event/se mic-2014-semantic-interoperability-conference
THE SEMANTIC WEB AND IT`S APPLICATIONS
15-16 September 2011, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2011) 15-16 September 2011, Bulgaria THE SEMANTIC WEB AND IT`S APPLICATIONS Dimitar Vuldzhev
D5.3.2b Automatic Rigorous Testing Components
ICT Seventh Framework Programme (ICT FP7) Grant Agreement No: 318497 Data Intensive Techniques to Boost the Real Time Performance of Global Agricultural Data Infrastructures D5.3.2b Automatic Rigorous
HL7 V2 Implementation Guide Authoring Tool Proposal
HL7 V2 Authoring Tool Proposal Robert Snelick, NIST National Institute of Standards and Technology May 14 th 2012 Contact: [email protected] NIST and Veterans Administration Joint Project NIST will provide
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, [email protected] Abstract: Independent
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
A generic approach for data integration using RDF, OWL and XML
A generic approach for data integration using RDF, OWL and XML Miguel A. Macias-Garcia, Victor J. Sosa-Sosa, and Ivan Lopez-Arevalo Laboratory of Information Technology (LTI) CINVESTAV-TAMAULIPAS Km 6
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model
LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan
Extracting value from scientific literature: the power of mining full-text articles for pathway analysis
FOR PHARMA & LIFE SCIENCES WHITE PAPER Harnessing the Power of Content Extracting value from scientific literature: the power of mining full-text articles for pathway analysis Executive Summary Biological
JOURNAL OF COMPUTER SCIENCE AND ENGINEERING
Exploration on Service Matching Methodology Based On Description Logic using Similarity Performance Parameters K.Jayasri Final Year Student IFET College of engineering [email protected] R.Rajmohan
BRIDGing CDASH to SAS: How Harmonizing Clinical Trial and Healthcare Standards May Impact SAS Users Clinton W. Brownley, Cupertino, CA
BRIDGing CDASH to SAS: How Harmonizing Clinical Trial and Healthcare Standards May Impact SAS Users Clinton W. Brownley, Cupertino, CA ABSTRACT The Clinical Data Interchange Standards Consortium (CDISC),
Scope. Cognescent SBI Semantic Business Intelligence
Cognescent SBI Semantic Business Intelligence Scope...1 Conceptual Diagram...2 Datasources...3 Core Concepts...3 Resources...3 Occurrence (SPO)...4 Links...4 Statements...4 Rules...4 Types...4 Mappings...5
Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)
Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM) Extended Abstract Ioanna Koffina 1, Giorgos Serfiotis 1, Vassilis Christophides 1, Val Tannen
Publishing Linked Data Requires More than Just Using a Tool
Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,
Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge
Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge http://euadr-project.org PEDRO LOPES [email protected] University of Manchester October
Identify Disorders in Health Records using Conditional Random Fields and Metamap
Identify Disorders in Health Records using Conditional Random Fields and Metamap AEHRC at ShARe/CLEF 2013 ehealth Evaluation Lab Task 1 G. Zuccon 1, A. Holloway 1,2, B. Koopman 1,2, A. Nguyen 1 1 The Australian
FHIM Model Content Overview
FHIM Model Content Overview Federal Health Information Model (FHIM) and Associated Terminology Models Goal Produce a logical, health information model that supports semantic interoperability and that is
