PerCuro-A Semantic Approach to Drug Discovery. Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang

Size: px
Start display at page:

Download "PerCuro-A Semantic Approach to Drug Discovery. Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang"

Transcription

1 PerCuro-A Semantic Approach to Drug Discovery Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang Towards the fulfillment of the course Semantic Web CSCI 8350 Fall 2003 Under the guidance of Prof. Amit P. Sheth LSDIS Lab University of Georgia, Athens

2 PerCuro-A Semantic Approach to Drug Discovery Karthik Gomadam, Meenakshi Nagarajan, Hongyu Yang {karthik, nbmeena, hyy}uga.edu ABSTRACT Drug Discovery is a domain that presents an excellent case for the application of semantic techniques in Bioinformatics. The domain is one of the most ancient characterized by rich voluminous data. All the information in this domain serves as a basis for making decisions to advance and prioritize potential leads in the process of discovering a drug. Clearly, capture and integration of such information of a varied nature has been increasingly critical. The process of discovering a drug is therefore very complicated and requires investment of time, money and effort. The possible use of semantic techniques as a way to speed up the process of target identification has been our primary motivation for this project. Here we present a small prototype system, wherein we exploit the relationships between various domains in drug development while trying to answer questions significant to speeding the process of drug discovery. The software has been named PerCuro, which in Latin means to Cure. KEYWORDS Drug Discovery, Drug, Target, Pathogen, Chemical compound, molecules, functional group, Multi-Ontology, RDF, RQL INTRODUCTION The process of discovering a drug depends largely on the interactions of the receptor, pathogen and drug molecules. A disease is caused by a pathogen binding to a receptor. A drug used to cure a disease can either bind to a receptor or to a pathogen to yield suitable results. The process of a drug binding to a receptor is preferred, killing the pathogen at the site and preventing further influx of pathogens. This target chemical compound match is found based on several molecular and chemical properties of the two entities under question. This brings under consideration factors that determine how the two entities are potentially related. The use of semantic techniques to analyze such relationships is the crux of this project. Our approach towards speeding the drug discovery escalates to an effort towards multi-ontology integration. Implementation details discuss this in more detail. RELATED WORK Many companies have set out to build an integration platform providing ontologies and related tools focused on drug discovery. We will discuss the different approaches and compare it with the approach we have used in PerCuro. Most of the implementation

3 details of related approaches are not very clear because the product or idea is under development. BioWisdom BioWisdom s Discovery ontology products provide a comprehensive and detailed description of the life sciences domain. The Discovery Ontologies contain around 500,000 concepts describing targets, diseases, chemicals, anatomy, species, phenotypes, technologies, institutions and people. By use of an intuitive navigational facilities of an ontology browser, BioWisdom combines various indexing techniques and the database architecture of biomedical information systems. Sagitus Solutions Sagitus Solutions approach is towards exploiting the power of using different ontologies together. The OIL language that combines features from Frames and DLs and is designed to represent ontologies developed in different formalisms, allowing information transfer between them. By representing ontologies in OIL, Sagitus aims to facilitate data mining /management across data sources from genes to clinical trials. The vision is to develop and integrate new and existing ontologies to capture knowledge across the whole drug discovery process. 3 rd Millenium The claim is that meaningful understanding of biological pathways can be realized only through semantic integration of biological data within its biological concept. At 3 rd Millenium, a system that extracts high throughput biological data and places it in its biological context thereby enabling semantic integration and greater knowledge of biological pathways has been created. The purpose is to enable high throughput biology to measurable accelerate drug discovery. Netwrok Ineference Network Inference s Cerebra Server Platform combines with BioWisdom s comprehensive ontologies and deep domain expertise to enable discovery and innovation. They attempt to discover and show associations using a multi-relational ontology. Comparison with our work As opposed to Network Inference s use of one multi-relational ontology, our approach is to explore similar associations using multiple ontologies with potential multirelations in them. Our approach seems to be in line with that of Sagitus Solutions with the exception of use of OIL. Our toy ontologies are developed using Protégé and associations explored using RQL.

4 WHY ONTOLOGIES It is probably important to make a clear argument for why such an application would require the use of ontologies as opposed to continuing with traditional database approaches. The domain of drug discovery is inherently very complex more because of the multiple domains it encompasses. Ontologies provide for universal agreement of concepts within and across domains. Another reasoning is based on the fact that all the information being discovered is by exploring relationships between entities. Such an implementation would otherwise have been very complicated and expensive in terms of resources and time if implemented using database techniques. The idea is to show that this domain is composed of associations, some simple and some complex which can be best captured and traversed using ontologies and related techniques. INTER ONTOLOGY QUERYING The domain of drug discovery, as we have mentioned earlier is a very complex one comprising of several other complex domains. To name some of them Pharmacogenetics, Pharmacokinetics, molecular biology, genes and gene-function, immunology, macromolecule structure ontology, diseases, chemical compound, human anatomy etc. Creation and maintenance of a single ontology just for drug discovery would be very expensive. (any updates in the individual ontologies would mean updating the drug discovery ontology) This was our primary motivation of exploring associations using multiple ontologies instead of a single ontology. Given the diversity and complexity of biological information, it very unlikely that one single drug discovery ontology will be developed. However, individual ontologies are being developed and some of them are well on their way. PharmGKB, Tambis ontology, RiboWeb, EcoCvc, Molecular Biology Ontology, Gene Ontology, IMGT ontology, Star macromolecule structure ontology, GENAROM, EpoDB Controlled vocabulary are some of them. SOFTWARE ARCHITECTURE Our software architecture is diagrammatically represented below.

5 JSP Front End Control Class RQL Query Builder Inference Class RQL Query Engine Query language Ontologies Drug Disease Pathogen/ ASSUMPTIONS The aim of this project has been to show case what semantics can do for a domain like drug discovery. Towards this we have made certain assumptions that simplify the scope of our domain. With respect to ontology design, we have considered only pathological diseases. The chemical properties of molecules of drugs, receptors and pathogens that we have captured have been limited to those essential towards exploring certain associations. When considering multi ontology integration, it is necessary to consider the different formalisms in which the ontologies may have been developed. For the sake of simplicity, we have shown multi ontology integration of ontologies created using Protégé.

6 ONTOLOGY DESIGN A graph representation of our toy ontologies ( rdf and rdfs )have been shown below. SCENARIOS The following sections describe the four scenarios that we have used to show case certain associations in the domain of drug discovery that can be explored using semantic techniques. Query creation: Inter ontology querying is achieved by means of querying separate ontologies and making dynamic associations between concepts in the ontologies. The mapping of concepts is achieved by passing the output of one query (operating on one ontology) as the input to another query (operating on second ontology). Scenario1: Given a disease D; can we identify what functional groups that a drug trying to cure D must not contain? Description: A disease is caused by a pathogen binding to a receptor. A drug that has to cure the disease has to bind itself to the receptor site to stop further influx of pathogens and also kill the ones at the receptor site. This action hugely depends on the interactions of the drug and receptor molecules. The structural compatibility between two molecules is out of the scope of our data capture. However, the chemical properties of these molecules can be exploited to state whether they react favorable towards each other. If the receptor and drug molecules do not reach favorable, they produce toxins. A drug that ends up creating toxins a body in the process of curing a disease is not a good choice. Being able to make such eliminations at the stage of target identification reduces the number of chemicals that scientists have to consider in modeling a drug for that particular disease. The flow of information is shown in the picture below Look up the disease ontology to find the disease causing pathogen and receptor using the is_caused_by relationship Look up the pathogen and receptor ontology relation to get the func.grps for the specific receptor using the "has_func_group" Look up molecule class in drug ontology to find those functional groups that are toxic with the functional groups of the receptor. Drugs having those toxic functional groups cannot be used to cure the disease

7 Results:

8 Scenario2: Given a disease D; can we identify what functional groups that a drug trying to cure D can possibly contain? Description: A disease is caused by a pathogen binding to a receptor. A drug that has to cure the disease has to bind itself to the receptor site to stop further influx of pathogens and also kill the ones at the receptor site. This action hugely depends on the interactions of the drug and receptor molecules. The structural compatibility between two molecules is out of the scope of our data capture. However, the chemical properties of these molecules can be exploited to state whether they react favorably towards each other. If the receptor and drug molecules react favorably, they do not produce toxins. Structural compatibility is yet to be determined, but an elimination like this reduces the number of drugable compounds that a scientist has to consider in modeling a drug for that particular disease. The flow of information is shown in the picture below Look up at the disease ontology and get the receptor and the pathogen involved using the is caused by relation. Look up at the pathogen and receptor ontology using the "has_func_group" relation to get the func.grps for the specific receptor Select a function group and look up at drug-ontology using the "reacts_fav_with" relation to get the func.grps for the specific receptor Drugs having those favorable functional groups are potential cures for that disease. Results:

9

10 Scenario3: Given a specific disease D, whose cause in unknown, is there any existing drug that can suppress the disease symptoms? Description: This scenario explores any additional uses of an existing drug. When there is an outbreak of a new disease with undetermined causes, an existing drug that alleviates symptoms same as that of the new disease offers temporary relief to patients. Being able to answer such questions also explores new uses for a drug increasing its value Select a disease and find the symptoms of the Look up the drug ontology using the "has_indication" relation to get the conditions the drug is used to cure. Drugs used to alleviate the indications same as the symptoms for the disease are potential alleviators. Results:

11

12 Scenario4:Given a compound C, can it act as a drug for a particular disease D? Description: This scenario answers if a specific chemical compound can be used to cure a particular disease. The possible use of this scenario is to eliminate at this pass that this chemical compound cannot be used. An additional check of Lipinski s rule of five which are rules that test the drugability of a compound is also made in this scenario if the compound C is already Used as a drug for D Else, solve scenario 1 to check if C contains any of the toxic func groups. If so it cannot act as a drug for D If not, check if C obeys Lipinski s rules of drugability If step 3 returns yes, C can be used to cure D Results:

13 FUTURE WORK Although this project began as a means of applying semantics to drug discovery; the implementation forced us to look at it from an angle of multi-ontology integration with Drug Discovery as an example. Future work is to implement this idea using larger ontologies to also provide quantitative results on scalability. The inter-ontology querying is done by executing a sequence of RQL queries; the dynamically provide the mapping between concepts in the ontologies. The result of one query serves as the input for the second thus making an implicit mapping between the two entities that the input and output correspond to. Use of large scale ontologies will possibly mean a bigger result set and the cost of sequential querying to get to the desired result is therefore a bottleneck. Using some form of Logics and Rules to represent the mappings is one of our other considerations for future work. REFERENCES

14 ACKNOWLEDGEMENTS We would like to thank Prof. Philip Bowen, Chemistry Dept University of Georgia, Athens for sharing valuable time and ideas for the project. Prof Amit P.Sheth for his invaluable suggestions, continuous support and encouragement during the course of the project. The teaching assistants for the course - Kunal Verma, Cartic Ramakrishnan for all their time and suggestions and all students of the Semantic Web Course.

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

A leader in the development and application of information technology to prevent and treat disease.

A leader in the development and application of information technology to prevent and treat disease. A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today

More information

ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database

ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database Dina Vishnyakova 1,2, 4, *, Julien Gobeill 1,3,4, Emilie Pasche 1,2,3,4 and Patrick Ruch

More information

Improve Cooperation in R&D. Catalyze Drug Repositioning. Optimize Clinical Trials. Respect Information Governance and Security

Improve Cooperation in R&D. Catalyze Drug Repositioning. Optimize Clinical Trials. Respect Information Governance and Security SINEQUA FOR LIFE SCIENCES DRIVE INNOVATION. ACCELERATE RESEARCH. SHORTEN TIME-TO-MARKET. 6 Ways to Leverage Big Data Search & Content Analytics for a Pharmaceutical Company Improve Cooperation in R&D Catalyze

More information

Big Data and Text Mining

Big Data and Text Mining Big Data and Text Mining Dr. Ian Lewin Senior NLP Resource Specialist Ian.lewin@linguamatics.com www.linguamatics.com About Linguamatics Boston, USA Cambridge, UK Software Consulting Hosted content Agile,

More information

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences It s not information overload, it s filter failure. Clay Shirky Life Sciences organizations face the challenge

More information

Distributed Database for Environmental Data Integration

Distributed Database for Environmental Data Integration Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information

More information

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk Text Mining for Health Care and Medicine Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk The Need for Text Mining MEDLINE 2005: ~14M 2009: ~18M Overwhelming information in textual,

More information

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Roberto Todeschini Milano Chemometrics and QSAR Research Group - Dept. of

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION 1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful

More information

QLIKVIEW FOR LIFE SCIENCES. Partnering for Innovation and Sustainable Growth

QLIKVIEW FOR LIFE SCIENCES. Partnering for Innovation and Sustainable Growth QLIKVIEW FOR LIFE SCIENCES Partnering for Innovation and Sustainable Growth A BUSINESS MODEL BUILT FOR INSIGHT Success in today s life sciences industry requires insight into volumes of data, the sharing

More information

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials Pharmaceutical leader deploys TIBCO Spotfire enterprise analytics platform across its drug discovery organization

More information

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik Leading Genomics Diagnostic harma Discove Collab Shanghai Cambridge, MA Reykjavik Global leadership for using the genome to create better medicine WuXi NextCODE provides a uniquely proven and integrated

More information

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative

More information

Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics

Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics John D Antonio Practice Leader, Licensing & Competitive Intelligence About Paragon We help increase asset value from

More information

Find the signal in the noise

Find the signal in the noise Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

BIOINFORMATICS Supporting competencies for the pharma industry

BIOINFORMATICS Supporting competencies for the pharma industry BIOINFORMATICS Supporting competencies for the pharma industry ABOUT QFAB QFAB is a bioinformatics service provider based in Brisbane, Australia operating nationwide and internationally. QFAB was established

More information

Selbo 2 an Environment for Creating Electronic Content in Software Engineering

Selbo 2 an Environment for Creating Electronic Content in Software Engineering BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 9, No 3 Sofia 2009 Selbo 2 an Environment for Creating Electronic Content in Software Engineering Damyan Mitev 1, Stanimir

More information

Presente e futuro del Web Semantico

Presente e futuro del Web Semantico Sistemi di Elaborazione dell informazione II Corso di Laurea Specialistica in Ingegneria Telematica II anno 4 CFU Università Kore Enna A.A. 2009-2010 Alessandro Longheu http://www.diit.unict.it/users/alongheu

More information

University Uses Business Intelligence Software to Boost Gene Research

University Uses Business Intelligence Software to Boost Gene Research Microsoft SQL Server 2008 R2 Customer Solution Case Study University Uses Business Intelligence Software to Boost Gene Research Overview Country or Region: Scotland Industry: Education Customer Profile

More information

Delivering the power of the world s most successful genomics platform

Delivering the power of the world s most successful genomics platform Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE

More information

From Data to Foresight:

From Data to Foresight: Laura Haas, IBM Fellow IBM Research - Almaden From Data to Foresight: Leveraging Data and Analytics for Materials Research 1 2011 IBM Corporation The road from data to foresight is long? Consumer Reports

More information

Healthcare Measurement Analysis Using Data mining Techniques

Healthcare Measurement Analysis Using Data mining Techniques www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik

More information

Bio-IT World 2013 Best Practices Awards

Bio-IT World 2013 Best Practices Awards Published Resources for the Life Sciences 250 First Avenue, Suite 300, Needham, MA 02494 phone: 781-972-5400 fax: 781-972-5425 Bio-IT World 2013 Best Practices Awards Celebrating Excellence in Innovation

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Prof. Dr. Martin Hofmann-Apitius Head of the Department of Bioinformatics Fraunhofer Institute for Algorithms and

More information

Chemical safety and big data: the industry s demands

Chemical safety and big data: the industry s demands Chemical safety and big data: the industry s demands Richard CURRIE Senior Technical Expert; Group Leader & Global Predictive and Computational Toxicology Lead Valid results Useful results Credit Money/Grants

More information

Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR)

Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR) Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR) Enable Science in silico & Provide the Right Knowledge to the Right People at the Right Time to enable the

More information

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge

More information

Open PHACTS Workshop, February 2015. The Lilly Perspective: Challenges We Face & Tools We Need

Open PHACTS Workshop, February 2015. The Lilly Perspective: Challenges We Face & Tools We Need Open PHACTS Workshop, February 2015 The Lilly Perspective: Challenges We Face & Tools We Need María Jesús Blanco, Ph.D. Director, Advanced Portfolio Strategies Marta Piñeiro-Núñez, Ph.D. Director, Open

More information

Integrating Bioinformatics, Medical Sciences and Drug Discovery

Integrating Bioinformatics, Medical Sciences and Drug Discovery Integrating Bioinformatics, Medical Sciences and Drug Discovery M. Madan Babu Centre for Biotechnology, Anna University, Chennai - 600025 phone: 44-4332179 :: email: madanm1@rediffmail.com Bioinformatics

More information

HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - aniketb1@umbc.edu. CMSC 601 - Presentation

HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - aniketb1@umbc.edu. CMSC 601 - Presentation HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - aniketb1@umbc.edu CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous

More information

Pharmacology skills for drug discovery. Why is pharmacology important?

Pharmacology skills for drug discovery. Why is pharmacology important? skills for drug discovery Why is pharmacology important?, the science underlying the interaction between chemicals and living systems, emerged as a distinct discipline allied to medicine in the mid-19th

More information

Doctor of Philosophy in Computer Science

Doctor of Philosophy in Computer Science Doctor of Philosophy in Computer Science Background/Rationale The program aims to develop computer scientists who are armed with methods, tools and techniques from both theoretical and systems aspects

More information

An Ontology-based e-learning System for Network Security

An Ontology-based e-learning System for Network Security An Ontology-based e-learning System for Network Security Yoshihito Takahashi, Tomomi Abiko, Eriko Negishi Sendai National College of Technology a0432@ccedu.sendai-ct.ac.jp Goichi Itabashi Graduate School

More information

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Ping Zhang IBM T. J. Watson Research Center Pankaj Agarwal GlaxoSmithKline Zoran Obradovic Temple University Terms and

More information

PharmaPendium. The definitive source of best-in-class drug information

PharmaPendium. The definitive source of best-in-class drug information Please contact us for more information Americas E-Customer Service 360 Park Avenue South New York, NY 10010-1710 USA Tel: +1 888 615 4500 (+1 212 462 1978, if calling from outside the USA and Canada) Fax:

More information

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet PONTE Presentation CETIC Philippe Massonet EU Open Day, Cambridge, 31/01/2012 PONTE Description Efficient Patient Recruitment for Innovative Clinical Trials of Existing Drugs to other Indications Start

More information

Dr Alexander Henzing

Dr Alexander Henzing Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander

More information

What s New in Pathway Studio Web 11.1

What s New in Pathway Studio Web 11.1 1 1 What s New in Pathway Studio Web 11.1 Elseiver is pleased to announce the release of Pathway Studio Web 11.1 for all database subscriptions (Mammal, Mammal+ChemEffect+DiseaseFx, Plant). This release

More information

Big Data Management Assessed Coursework Two Big Data vs Semantic Web F21BD

Big Data Management Assessed Coursework Two Big Data vs Semantic Web F21BD Big Data Management Assessed Coursework Two Big Data vs Semantic Web F21BD Boris Mocialov (H00180016) MSc Software Engineering Heriot-Watt University, Edinburgh April 5, 2015 1 1 Introduction The purpose

More information

Extraction and Visualization of Protein-Protein Interactions from PubMed

Extraction and Visualization of Protein-Protein Interactions from PubMed Extraction and Visualization of Protein-Protein Interactions from PubMed Ulf Leser Knowledge Management in Bioinformatics Humboldt-Universität Berlin Finding Relevant Knowledge Find information about Much

More information

M110.726 The Nucleus M110.727 The Cytoskeleton M340.703 Cell Structure and Dynamics

M110.726 The Nucleus M110.727 The Cytoskeleton M340.703 Cell Structure and Dynamics of Biochemistry and Molecular Biology 1. Master the knowledge base of current biochemistry, molecular biology, and cellular physiology Describe current knowledge in metabolic transformations conducted

More information

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial

More information

Information Discovery on Electronic Medical Records

Information Discovery on Electronic Medical Records Information Discovery on Electronic Medical Records Vagelis Hristidis, Fernando Farfán, Redmond P. Burke, MD Anthony F. Rossi, MD Jeffrey A. White, FIU FIU Miami Children s Hospital Miami Children s Hospital

More information

Software Description Technology

Software Description Technology Software applications using NCB Technology. Software Description Technology LEX Provide learning management system that is a central resource for online medical education content and computer-based learning

More information

Understanding Web personalization with Web Usage Mining and its Application: Recommender System

Understanding Web personalization with Web Usage Mining and its Application: Recommender System Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,

More information

University of Medicine and Dentistry of New Jersey (UMDNJ)

University of Medicine and Dentistry of New Jersey (UMDNJ) University of Medicine and Dentistry of New Jersey (UMDNJ) Dual-Degree Program between the UMDNJ Graduate School of Biomedical Sciences (GSBS) And the UMDNJ School of Public Health (SPH) Leading to the:

More information

SmartLink: a Web-based editor and search environment for Linked Services

SmartLink: a Web-based editor and search environment for Linked Services SmartLink: a Web-based editor and search environment for Linked Services Stefan Dietze, Hong Qing Yu, Carlos Pedrinaci, Dong Liu, John Domingue Knowledge Media Institute, The Open University, MK7 6AA,

More information

Logical Semantic Warehouse - Developing Your Own Semantic Ecosystem Peter Lawrence, TopQuadrant

Logical Semantic Warehouse - Developing Your Own Semantic Ecosystem Peter Lawrence, TopQuadrant Logical Semantic Warehouse - Developing Your Own Semantic Ecosystem Peter Lawrence, TopQuadrant Semantic Ecosystem Solution Value Chain Enrich... searching and locating information using EVN to manage

More information

ProteinQuest user guide

ProteinQuest user guide ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for

More information

The Importance of Bioinformatics and Information Management

The Importance of Bioinformatics and Information Management A Graduate Program for Biological Information Specialists 1 Bryan Heidorn, Carole Palmer, and Dan Wright Graduate School of Library and Information Science University of Illinois at Urbana-Champaign UIUC

More information

Building Semantic Content Management Framework

Building Semantic Content Management Framework Building Semantic Content Management Framework Eric Yen Computing Centre, Academia Sinica Outline What is CMS Related Work CMS Evaluation, Selection, and Metrics CMS Applications in Academia Sinica Concluding

More information

Some Research Challenges for Big Data Analytics of Intelligent Security

Some Research Challenges for Big Data Analytics of Intelligent Security Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,

More information

We use Reaxys intensively for hit identification, hit-to-lead and lead optimization.

We use Reaxys intensively for hit identification, hit-to-lead and lead optimization. CASE STUDY Dr. Fabio C. Tucci, COO of Epigen Biosciences We use Reaxys intensively for hit identification, hit-to-lead and lead optimization. CREATING NEW ASSETS Epigen Biosciences is a start-up pharmaceutical

More information

Netherlands escience Center

Netherlands escience Center Netherlands escience Center ICT Synergy Hub, Amsterdam Research & Innovation in the Big Data Era CWI in Bedrijf Centrum Wiskunde & Informatica Op 5 oktober 2012 Prof. dr. Jacob de Vlieg ¹ ² 1. CEO & Scientific

More information

A Virtual Machine Searching Method in Networks using a Vector Space Model and Routing Table Tree Architecture

A Virtual Machine Searching Method in Networks using a Vector Space Model and Routing Table Tree Architecture A Virtual Machine Searching Method in Networks using a Vector Space Model and Routing Table Tree Architecture Hyeon seok O, Namgi Kim1, Byoung-Dai Lee dept. of Computer Science. Kyonggi University, Suwon,

More information

Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support

Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support Genomic CDS: an example of a complex ontology for pharmacogenetics and clinical decision support Matthias Samwald 1 1 Medical University of Vienna, Vienna, Austria matthias.samwald@meduniwien.ac.at Abstract.

More information

7/15/2015 THE CHALLENGE. Amazon, Google & Facebook have Big Data problems. in Oncology we have a Small Data Problem!

7/15/2015 THE CHALLENGE. Amazon, Google & Facebook have Big Data problems. in Oncology we have a Small Data Problem! The Power of Ontologies and Standardized Terminologies for Capturing Clinical Knowledge Peter E. Gabriel, MD, MSE AAPM Annual Meeting July 15, 2015 1 THE CHALLENGE Amazon, Google & Facebook have Big Data

More information

Cheminformatics and Pharmacophore Modeling, Together at Last

Cheminformatics and Pharmacophore Modeling, Together at Last Application Guide Cheminformatics and Pharmacophore Modeling, Together at Last SciTegic Pipeline Pilot Bridging Accord Database Explorer and Discovery Studio Carl Colburn Shikha Varma-O Brien Introduction

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of

More information

dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University

dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University Current protocol for chemical safety testing Short Term Tests for Genetic Toxicity Bacterial Reverse

More information

PODD. An Ontology Driven Architecture for Extensible Phenomics Data Management

PODD. An Ontology Driven Architecture for Extensible Phenomics Data Management PODD An Ontology Driven Architecture for Extensible Phenomics Data Management Gavin Kennedy Gavin Kennedy PODD Project Manager High Resolution Plant Phenomics Centre Canberra, Australia What is Plant Phenomics?

More information

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Big Data and Semantic Web in Manufacturing Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Outline Big data in Manufacturing Big data Analytics Semantic web technologies Case

More information

Agilent s Kalabie Electronic Lab Notebook (ELN) Product Overview ChemAxon UGM 2008 Agilent Software and Informatics Division Mike Burke

Agilent s Kalabie Electronic Lab Notebook (ELN) Product Overview ChemAxon UGM 2008 Agilent Software and Informatics Division Mike Burke Agilent s Kalabie Electronic Lab Notebook (ELN) Product Overview ChemAxon UGM 2008 Agilent Software and Informatics Division Mike Burke Kalabie: Extending the OpenLAB Architecture Agilent User Interface

More information

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10 1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom

More information

Georgia Department of Education

Georgia Department of Education Epidemiology Curriculum The Georgia Performance Standards are designed to provide students with the knowledge and skills for proficiency in science. The Project 2061 s Benchmarks for Science Literacy is

More information

Semantic Web Services for e-learning: Engineering and Technology Domain

Semantic Web Services for e-learning: Engineering and Technology Domain Web s for e-learning: Engineering and Technology Domain Krupali Shah and Jayant Gadge Abstract E learning has gained its importance over the traditional classroom learning techniques in past few decades.

More information

International Workshop on Big Data Analytics for Advanced Databases (BIGDATA, 2016)

International Workshop on Big Data Analytics for Advanced Databases (BIGDATA, 2016) International Workshop on Big Data Analytics for Advanced Databases (BIGDATA, 2016) Call for Papers AIM and SCOPE There is an exponential growth in digital data with unprecedented new platforms derived

More information

MOBILE ARCHITECTURE FOR DYNAMIC GENERATION AND SCALABLE DISTRIBUTION OF SENSOR-BASED APPLICATIONS

MOBILE ARCHITECTURE FOR DYNAMIC GENERATION AND SCALABLE DISTRIBUTION OF SENSOR-BASED APPLICATIONS MOBILE ARCHITECTURE FOR DYNAMIC GENERATION AND SCALABLE DISTRIBUTION OF SENSOR-BASED APPLICATIONS Marco Picone, Marco Muro, Vincenzo Micelli, Michele Amoretti, Francesco Zanichelli Distributed Systems

More information

EMBL Identity & Access Management

EMBL Identity & Access Management EMBL Identity & Access Management Rupert Lück EMBL Heidelberg e IRG Workshop Zürich Apr 24th 2008 Outline EMBL Overview Identity & Access Management for EMBL IT Requirements & Strategy Project Goal and

More information

Lightweight Data Integration using the WebComposition Data Grid Service

Lightweight Data Integration using the WebComposition Data Grid Service Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed

More information

Talking your Language. E-WorkBook 10 provides a one-platform, single source of truth without adding complexity to research

Talking your Language. E-WorkBook 10 provides a one-platform, single source of truth without adding complexity to research Talking your Language E-WorkBook 10 provides a one-platform, single source of truth without adding complexity to research Meet E-WorkBook 10 In January 2015 we launched E-WorkBook 10 - the next step in

More information

TopBraid Insight for Life Sciences

TopBraid Insight for Life Sciences TopBraid Insight for Life Sciences In the Life Sciences industries, making critical business decisions depends on having relevant information. However, queries often have to span multiple sources of information.

More information

Master of Science in Biomedical Sciences

Master of Science in Biomedical Sciences Master of Science in Biomedical Sciences Faculty of Medicine Good health is our greatest treasure. Understanding the human body, healthy and diseased, is the stepping stone to finding tools to improving

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources 1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools

More information

Digital libraries of the future and the role of libraries

Digital libraries of the future and the role of libraries Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their

More information

WORKSHOP ON TOPOLOGY AND ABSTRACT ALGEBRA FOR BIOMEDICINE

WORKSHOP ON TOPOLOGY AND ABSTRACT ALGEBRA FOR BIOMEDICINE WORKSHOP ON TOPOLOGY AND ABSTRACT ALGEBRA FOR BIOMEDICINE ERIC K. NEUMANN Foundation Medicine, Cambridge, MA 02139, USA Email: eneumann@foundationmedicine.com SVETLANA LOCKWOOD School of Electrical Engineering

More information

INSERM/ A. Bernheim. Overcoming clinical relapse in multiple myeloma by understanding and targeting the molecular causes of drug resistance

INSERM/ A. Bernheim. Overcoming clinical relapse in multiple myeloma by understanding and targeting the molecular causes of drug resistance A. Bernheim Overcoming clinical relapse in multiple myeloma by understanding and targeting the molecular causes of drug resistance OVER-MyR is funded by the European Commission within its FP7 specific

More information

High-Volume Data Warehousing in Centerprise. Product Datasheet

High-Volume Data Warehousing in Centerprise. Product Datasheet High-Volume Data Warehousing in Centerprise Product Datasheet Table of Contents Overview 3 Data Complexity 3 Data Quality 3 Speed and Scalability 3 Centerprise Data Warehouse Features 4 ETL in a Unified

More information

Describing Web Services for user-oriented retrieval

Describing Web Services for user-oriented retrieval Describing Web Services for user-oriented retrieval Duncan Hull, Robert Stevens, and Phillip Lord School of Computer Science, University of Manchester, Oxford Road, Manchester, UK. M13 9PL Abstract. As

More information

Big Data in Drug Discovery

Big Data in Drug Discovery Big Data in Drug Discovery David J. Wild Assistant Professor & Director, Cheminformatics Program Indiana University School of Informatics and Computing djwild@indiana.edu - http://djwild.info Epochs in

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02) Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we

More information

Building Decision Support through Dynamic Workflow Systems for Health Care Ontology Focus. Tanay Sharma B00098872 tsharma@dal.ca

Building Decision Support through Dynamic Workflow Systems for Health Care Ontology Focus. Tanay Sharma B00098872 tsharma@dal.ca Building Decision Support through Dynamic Workflow Systems for Health Care Ontology Focus By Tanay Sharma B00098872 tsharma@dal.ca Performed at Atlantic Innovation Fund St. Francis Xavier University Antigonish,

More information

Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper. By Erich A. Gombocz

Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper. By Erich A. Gombocz Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper By Erich A. Gombocz 2 Research Data Integration of Retrospective Studies for Prediction of Disease

More information

BIOSCIENCES COURSE TITLE AWARD

BIOSCIENCES COURSE TITLE AWARD COURSE TITLE AWARD BIOSCIENCES As a Biosciences undergraduate student at the University of Westminster, you will benefit from some of the best teaching and facilities available. Our courses combine lecture,

More information

Medicines for Neglected Diseases Workshop. Dennis Liotta, Ph.D. Director Emory Institute for Drug Discovery Atlanta, Georgia

Medicines for Neglected Diseases Workshop. Dennis Liotta, Ph.D. Director Emory Institute for Drug Discovery Atlanta, Georgia Medicines for Neglected Diseases Workshop Dennis Liotta, Ph.D. Director Emory Institute for Drug Discovery Atlanta, Georgia September 11, 2010 Poverty, Many People Poor health conditions High Morbidity

More information

Guidelines for Establishment of Contract Areas Computer Science Department

Guidelines for Establishment of Contract Areas Computer Science Department Guidelines for Establishment of Contract Areas Computer Science Department Current 07/01/07 Statement: The Contract Area is designed to allow a student, in cooperation with a member of the Computer Science

More information

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Big Data Mining Services and Knowledge Discovery Applications on Clouds Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy talia@dimes.unical.it Data Availability or Data Deluge? Some decades

More information

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence Augmented Search for Web Applications New frontier in big log data analysis and application intelligence Business white paper May 2015 Web applications are the most common business applications today.

More information

General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List

General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List GSA Schedule 66 Scientific Equipment and Services SIN 66-1000 Professional Scientific Services IHRC,

More information

Ontology-Based Discovery of Workflow Activity Patterns

Ontology-Based Discovery of Workflow Activity Patterns Ontology-Based Discovery of Workflow Activity Patterns Diogo R. Ferreira 1, Susana Alves 1, Lucinéia H. Thom 2 1 IST Technical University of Lisbon, Portugal {diogo.ferreira,susana.alves}@ist.utl.pt 2

More information

Driving SDN Adoption in Service Provider Networks

Driving SDN Adoption in Service Provider Networks WHITEPAPER Software Defined Networking (SDN) Driving SDN Adoption in Service Provider Networks This whitepaper provides an overview of key requirements and enablers for driving SDN adoption in Service

More information

Medical Decision Logic, Inc.

Medical Decision Logic, Inc. Medical Decision Logic, Inc. mdlogix Registries and Health Science: Applied Health Informatics Presentation Plan Mission, Goals, and Vision Theoretical Foundation (models) Pragmatic Foundation (cases)

More information

Explorer's Guide to the Semantic Web

Explorer's Guide to the Semantic Web Explorer's Guide to the Semantic Web THOMAS B. PASSIN 11 MANNING Greenwich (74 w. long.) contents preface xiii acknowledgments xv about this booh xvii The Semantic Web 1 1.1 What is the Semantic Web? 3

More information

TECHNICAL INSIGHTS TECHNOLOGY ALERT

TECHNICAL INSIGHTS TECHNOLOGY ALERT TECHNICAL INSIGHTS TECHNOLOGY ALERT UNLOCKING BREAST CANCER TUMORS' DRUG RESISTANCE A troubling phenomenon in breast cancer treatment is that some patients tumors become resistant to treatments over time.

More information

Job list - Research China

Job list - Research China Job list - Research China Job # Job Title Page 010499 Scientist for Mechanical Engineering P2-3 002590 Senior Scientist for Biomedical Engineering P4-5 022831 Research Scientist for MRI P6-7 010501 Scientist

More information