The Open PHACTS Discovery Platform Semantic data integration for Medicinal Chemists



Similar documents
Open PHACTS Data integration for all. Andrew Leach

BIG DATA EUROPE. Integrating Big Data, Software & Communities for Addressing Europe s Societal Challenges

Integrating pharmacological data

Logical Semantic Warehouse - Developing Your Own Semantic Ecosystem Peter Lawrence, TopQuadrant

Big Data Europe

Too Much Data or Too Little Cooperation? Tom Plasterer, PhD. Research & Development Information (RDI) Director, US Cross-Science

Corporate Presentation November, 2013

De novo design in the cloud from mining big data to clinical candidate

Cheminformatics and its Role in the Modern Drug Discovery Process

Big Data in Drug Discovery

We use Reaxys intensively for hit identification, hit-to-lead and lead optimization.

Big Data Problem? or Big Problem with Data? William Hayes, PhD SVP PlaCorm Dev, Selventa

How to create and interpret the predictive analysis of a compound

Data Visualization in Cheminformatics. Simon Xi Computational Sciences CoE Pfizer Cambridge

Diabetes and Drug Development

How To Understand Protein-Protein Interaction And Inhibitors

STRUCTURE-GUIDED, FRAGMENT-BASED LEAD GENERATION FOR ONCOLOGY TARGETS

Open Source Software in Life Science Research. Woodhead Publishing Series in Biomedicine

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences

TopQuadrant-Syngenta Webcast July 10, 2014 Semantic Data Virtualization: Extracting More Value from Data Silos

Integrating Medicinal Chemistry and Computational Chemistry: The Molecular Forecaster Approach

標 準 納 期 2 週 間 を 誇 るKINOMEscan 受 託 試 験 : 報 告 書 から 抽 出 できる 更 なるデータ? 日 本 事 業 担 当 大 嶺 青 河 DiscoveRx Corporation

Lead generation and lead optimisation:

Dr Alexander Henzing

Pharmacology skills for drug discovery. Why is pharmacology important?

dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University

DMPK: Experimentation & Data

Academic Drug Discovery in the Center for Integrative Chemical Biology and Drug Discovery

Exploiting the Pathogen box

THE BIOTECH & PHARMACEUTICAL INDUSTRY

High-Throughput Screening at The University of Chicago Cellular Screening Center. Sam Bettis Technical Director

Crossing the drug development divide

Computational Tools for Medicinal Chemists Increasing the Dimensions of Drug Discovery. Dr Robert Scoffin CEO

cansar: integrated cancer knowledgebase

Improving Health Outcomes Using Big Data

Biological importance of metabolites. Safety and efficacy aspects

Drug Discovery in China

ChemCloud - Chemical e-science Information Cloud. Adrian Paschke, Freie Universitaet Berlin Stephan Heineke, FIZ CHEMIE

Improve Cooperation in R&D. Catalyze Drug Repositioning. Optimize Clinical Trials. Respect Information Governance and Security

Accelerating Lead Generation: Emerging Technologies and Strategies

ProteinQuest user guide

Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR)

Network Webinar Series

CTC Technology Readiness Levels

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

Alterações empresariais sustentadas pelo conceito de engenharia do Produto Patrício Soares da Silva, MD, PhD

SEMANTIC DATA PLATFORM FOR HEALTHCARE. Dr. Philipp Daumke

Chemical safety and big data: the industry s demands

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

PIRAMAL DISCOVERY SOLUTIONS

OpenText Content Hub for Publishers

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences

Medicines for Neglected Diseases Workshop. Dennis Liotta, Ph.D. Director Emory Institute for Drug Discovery Atlanta, Georgia

Molecule Shapes. 1

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining

How To Understand The Chemistry Of A 2D Structure

BIOINFORMATICS Supporting competencies for the pharma industry

THOMSON REUTERS CORTELLIS FOR INFORMATICS. REUTERS/ Aly Song

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation

Thermodynamics of Mixing

Building a Unified Drug Discovery Database

Knowledge Graph: Connecting Big Data Semantics. Ying Ding Indiana University

Validated Cell-Based Assays for Rapid Screening and Functional Characterization of Therapeutic Monoclonal Antibodies

Netherlands escience Center

MSC IN MEDICINAL CHEMISTRY

Anforderungen der Life-Science Industrie an die Hochschulen. Hans Widmer Novartis Institutes for BioMedical Research

EMBL Identity & Access Management

Nathan Brown. The Application of Consensus Modelling and Genetic Algorithms to Interpretable Discriminant Analysis.

Data Warehouse Design for Pharmaceutical Drug Discovery Research

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials

Scientific Business Intelligence using Pipeline Pilot

THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC)

PerCuro-A Semantic Approach to Drug Discovery. Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang

Call 2014: High throughput screening of therapeutic molecules and rare diseases

HER2 Testing in Breast Cancer

Bachelor of Science in Pharmaceutical Sciences (BSPS) Program Overview and Internship Requirements

Why Open Drug Discovery Needs Four Simple Rules for Licensing Data and

A leader in the development and application of information technology to prevent and treat disease.

org.rn.eg.db December 16, 2015 org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank accession numbers.

BROMOscan Quantitative Ligand Binding Assays

Ingenuity Pathway Analysis (IPA )

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane

Cell Discovery 360: Explore more possibilities.

Corporate Overview. Dr Robert Scoffin CEO. http;// STAND NUMBER: 27

Eudendron: an Innovative Biotech Start-up

The Clinical Trials Process an educated patient s guide

FACT SHEET TESTETROL, A NOVEL ORALLY BIOACTIVE ANDROGEN

IO Informatics The Sentient Suite

University-wide Courses/Seminars These 4 courses are offered by the Graduate School:

A career on the science park

REGULATIONS FOR THE DEGREE OF BACHELOR OF PHARMACY IN CHINESE MEDICINE [PART-TIME] (BPharm[ChinMed])

Making semantics work in drug discovery

EVT Execute & EVT Innovate Leading drug discovery

Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper. By Erich A. Gombocz

ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database

Fragment-based lead generation of reversible inhibitors for lysine-specific demethylases

Roopa Ramamoorthi PhD, PMP, Associate Director, Partnering and Scientific Affairs

Data Integration and Decision-Making For Biomarkers Discovery, Validation and Evaluation. D. POLVERARI, CTO October

Transcription:

Pharmacoinformatics Research Group Department of Pharmaceutical Chemistry The Open PHACTS Discovery Platform Semantic data integration for Medicinal Chemists Gerhard F. Ecker Dept. of Pharmaceutical Chemistry, Univ Vienna Gerhard.f.ecker@univie.ac.at; www.openphacts.org

What is the selectivity profile of known p38 inhibitors? Let me compare MW, logp and PSA for known oxidoreductase inhibitors Find me compounds that inhibit targets in NFkB pathway assayed in only functional assays with a potency <1 μm ChEMBL DrugBank Gene Ontology Wikipathways GeneGo ChEBI Uniprot UMLS GVKBio ConceptWiki ChemSpider TrialTrove TR Integrity Approaching complex research questions needs integration of data sources

Why? Public Domain Drug Discovery Data: Pharma are accessing, processing, storing & re-processing Literature Patents PubChem Genbank Databases Downloads x Repeat @ each company Data Integration Data Analysis Firewalled Databases

The Innovative Medicines Initiative EC funded public-private partnership for pharmaceutical research Focus on key problems Efficacy, Safety, Education & Training, Knowledge Management The Open PHACTS Project Create a semantic integration hub ( Open Pharmacological Space ) Delivering services to support on-going drug discovery programs in pharma and public domain Not just another project; Leading academics in semantics, pharmacology and informatics, driven by solid industry business requirements 16 academic partners, 8 pharmaceutical companies, 4 biotechs Work split into clusters: Technical Build Scientific Drive Community & Sustainability The Project

The Need Number sum Nr of 1 Question 15 12 9 All oxidoreductase inhibitors active <100nM in both human and mouse 18 14 8 Given compound X, what is its predicted secondary pharmacology? What are the on and off,target safety concerns for a compound? What is the evidence and how reliable is that evidence (journal impact factor, KOL) for findings associated with a compound? 24 13 8 Given a target find me all actives against that target. Find/predict polypharmacology of actives. Determine ADMET profile of actives. 32 13 8 For a given interaction profile, give me compounds similar to it. 37 13 8 38 13 8 The current Factor Xa lead series is characterised by substructure X. Retrieve all bioactivity data in serine protease assays for molecules that contain substructure X. Retrieve all experimental and clinical data for a given list of compounds defined by their chemical structure (with options to match stereochemistry or not). 41 13 8 A project is considering Protein Kinase C Alpha (PRKCA) as a target. What are all the compounds known to modulate the target directly? What are the compounds that may modulate the target directly? i.e. return all cmpds active in assays where the resolution is at least at the level of the target family (i.e. PKC) both from structured assay databases and the literature. 44 13 8 Give me all active compounds on a given target with the relevant assay data 46 13 8 Give me the compound(s) which hit most specifically the multiple targets in a given pathway (disease) 59 14 8 Identify all known protein-protein interaction inhibitors Azzaoui et al., Drug Discov Today 2013, 843-52

Platform Explorer Apps API Standards

The User The Explorer is a user-friendly, full featured interface that allows scientists to explore and interrogate integrated biological and chemical data The API allows searches along the concepts of compound target pathway - disease

STANDARD_TYPE UNIT_COUNT ---------------- ------- STANDARD_TYPE STANDARD_UNITS COUNT(*) AC50 7 ------------------ ------------------ -------- Activity 421 IC50 nm 829448 EC50 39 IC50 ug.ml-1 41000 IC50 46 IC50 38521 ID50 42 IC50 ug/ml 2038 Ki 23 IC50 ug ml-1 509 Log IC50 4 IC50 mg kg-1 295 Log Ki 7 IC50 molar ratio 178 Potency 11 IC50 ug 117 log IC50 0 IC50 % 113 IC50 um well-1 52 IC50 p.p.m. 51 >5000 types IC50 ppm 36 IC50 um-1 25 IC50 nm kg-1 25 IC50 milliequivalent 22 IC50 kj m-2 20 Implemented using the Quantities, Dimension, Units, Types Ontology (http://www.qudt.org/) ~ 100 units

Licensing Chose John Wilbanks as consultant A framework built around STANDARD wellunderstood Creative Commons licences and how they interoperate Deal with the problems by: Interoperable licences Appropriate terms Declare expectations to users and data publishers One size won t fit all requirements

Explorer www.openphacts.org

Provenance

Further planned: Transporter taxonomy Receptor taxonomy ADME taxonomy Target Taxonomy

Example applications Advanced analytics ChemBioNavigator TargetDossier PharmaTrek UTOPIA Navigating at the interface of chemical and biological data with sorting and plotting options Interconnecting Open PHACTS with multiple target centric services. Exploring target similarity using diverse criteria Interactive Polypharmacology space of experimental annotations Semantic enrichment of scientific PDFs Predictions GARFIELD etox connector Prediction of target pharmacology based on the Similar Ensemble Approach Automatic extraction of data for building predictive toxicology models in etox project

PharmaTrek

ChemBioNavigator Developed by the University of Hamburg and BioSolveIT

Manuel Pastor

MOE

The power of workflows Give me compounds active against all targets in the Epidermal growth factor receptor (ErbB) signaling pathway that have a relevance to disease Ratnam et al., submitted

Pharmacoinformatics Research Group Department of Pharmaceutical Chemistry TRPV1 Transmembrane receptor Tetramer Non-selective cation channel Peripheral nervous system Activation influx of cations depolarization of membrane pain perception Prolonged activation desensitization Lee et al., J Comput Aided Mol Des (2011) 25: 317-37

Pharmacoinformatics Research Group Department of Pharmaceutical Chemistry TRPV1

Pharmacoinformatics Research Group Department of Pharmaceutical Chemistry TRPV1 htrpv1 2328 ligands from Open PHACTS capsaicin HEK293 Goldmann D, Mol Inform 2013.

Pharmacoinformatics Research Group Department of Pharmaceutical Chemistry Develop assay ontology Create tools for semiautomatic combination Propose reference compounds

Open PHACTS Project Partners www.openphacts.org Project Contacts: GSK Coordinator (Stefan Senger) Universität Wien Managing entity (Gerhard Ecker)