De novo design in the cloud from mining big data to clinical candidate

Similar documents
STRUCTURE-GUIDED, FRAGMENT-BASED LEAD GENERATION FOR ONCOLOGY TARGETS

Corporate Presentation November, 2013

We use Reaxys intensively for hit identification, hit-to-lead and lead optimization.

How To Understand The Chemistry Of A 2D Structure

Accelerating Lead Generation: Emerging Technologies and Strategies

Drug Discovery in China

The Open PHACTS Discovery Platform Semantic data integration for Medicinal Chemists

Eudendron: an Innovative Biotech Start-up

Academic Drug Discovery in the Center for Integrative Chemical Biology and Drug Discovery

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences

Computational Tools for Medicinal Chemists Increasing the Dimensions of Drug Discovery. Dr Robert Scoffin CEO

Cheminformatics and Pharmacophore Modeling, Together at Last

Exploiting the Pathogen box

Data Visualization in Cheminformatics. Simon Xi Computational Sciences CoE Pfizer Cambridge

Integrating Medicinal Chemistry and Computational Chemistry: The Molecular Forecaster Approach

Building a Unified Drug Discovery Database

Diabetes and Drug Development

Cheminformatics and its Role in the Modern Drug Discovery Process

Medicines for Neglected Diseases Workshop. Dennis Liotta, Ph.D. Director Emory Institute for Drug Discovery Atlanta, Georgia

Corporate Overview. Dr Robert Scoffin CEO. http;// STAND NUMBER: 27

Alterações empresariais sustentadas pelo conceito de engenharia do Produto Patrício Soares da Silva, MD, PhD

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials

Dr Alexander Henzing

MSC IN MEDICINAL CHEMISTRY

New Tools. Diseases. for Research on. R&D for Neglected Patients. Scientific Symposium. conference programme. In collaboration with.

From Data to Foresight:

Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics

THOMSON REUTERS CORTELLIS FOR INFORMATICS. REUTERS/ Aly Song

Lead generation and lead optimisation:

Netherlands escience Center

Pivot Park Screening Centre participates in novel 196 million pan-european drug discovery platform

Call 2014: High throughput screening of therapeutic molecules and rare diseases

PIRAMAL DISCOVERY SOLUTIONS

Matteo di Tommaso FDA-PhUSE March 2013 Vice President, Research Business Technology Chair, PRISME Forum

BIOINFORMATICS Supporting competencies for the pharma industry

CHEM-E4140 Selectivity 12. Pharma Business

The INFUSIS Project Data and Text Mining for In Silico Modeling

Use of Predictive ADME in Library Profiling and Lead Optimization

EVT Execute & EVT Innovate Leading drug discovery

BROMOscan Quantitative Ligand Binding Assays

New epigenetic agents: therapeutic approach in cancer

A leader in the development and application of information technology to prevent and treat disease.

Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR)

Validated Cell-Based Assays for Rapid Screening and Functional Characterization of Therapeutic Monoclonal Antibodies

Drug design Drug repositioning Virtual screening

The Clinical Trials Process an educated patient s guide

How To Understand Protein-Protein Interaction And Inhibitors

19 October Company Announcement Office Bendigo Stock Exchange PO Box 191 Flinders Lane, Melbourne Vic By ;

A career on the science park

TransCelerate's Role in Transforming Pharmaceutical Trials Presentation to PCORNet

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC)

Big Data in Drug Discovery

Making semantics work in drug discovery

Orphan Pharma: pathfinders for an increasingly specialised industry

Strategies and tactics for optimizing the Hit-to-Lead process and beyond A computational chemistry perspective

FACT SHEET TESTETROL, A NOVEL ORALLY BIOACTIVE ANDROGEN

Fragment-based lead generation of reversible inhibitors for lysine-specific demethylases

H Innovation. efficiency

The Aptuit Center for Drug Discovery & Development Verona, Italy. Uncommon Expertise. Exceptional Results.

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

Open PHACTS Workshop, February The Lilly Perspective: Challenges We Face & Tools We Need

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources

Predictive Analytics Certificate Program

Pharmacology skills for drug discovery. Why is pharmacology important?

Data Warehouse Design for Pharmaceutical Drug Discovery Research

A vaccine for rheumatoid arthritis

Open Innovation: An Imperative for the Pharmaceutical Industry. Berkeley Innovation Forum

Agenda Item 3. Discovery to Product Accelerator

Building innovative drug discovery alliances. Profitable. Growth Go

THE BIOTECH & PHARMACEUTICAL INDUSTRY

DMPK: Experimentation & Data

Making the most of academic drug target discoveries

Record 16.9m for Oxford spinout designing stem cell drugs to treat age- related disease

MYRIAD, HITACHI, ORACLE & FRIEDLI JOIN FORCES TO MAP THE ENTIRE HUMAN PROTEOME

The Leading Gene Through Screen CRO

Marubeni & Biotech Business

Technology and Expertise Add Operational Value to Medical Device Trials

GenScript USA Inc. Open Position List

Cloud-based Informatics Systems for Collaborative Network Research in Life Sciences

Outsourcing life science informatics solutions

Consensus Scoring to Improve the Predictive Power of in-silico Screening for Drug Design

Big Data An Opportunity or a Distraction? Signal or Noise?

MRC Technology Centre for Therapeutics Discovery

CDRD and the MSSC Seek Collaborative Projects to Discover and Develop New Treatments for Progressive Multiple Sclerosis

Towards Well-Defined ADCs (Antibody Drug Conjugates)

Forward step pby step with Action Plan 2016

Computational Chemistry and Cheminformatics in Drug Discovery Research

Combinatorial Chemistry and solid phase synthesis seminar and laboratory course

Creative Utilization of Existing Knowledge to Harness Innovation for the Neglected

University of Michigan College of Pharmacy Department of Medicinal Chemistry. Eleventh Annual Leroy B. Townsend Lectureship in Medicinal Chemistry

Putting IBM Watson to Work In Healthcare

Florida Translational Research Program (FTRP)

using the fully human ADLib system

Building Bridges to Long-Term Growth

Chemical safety and big data: the industry s demands

How to create and interpret the predictive analysis of a compound

A UNIFIED VIEW OF RESEARCH AND TRIAL DATA Red Hat JBoss Data Virtualization for the pharmaceutical and life sciences industry

Hitting the New Accelerated Pace of Digital Asset Management

Transcription:

De novo design in the cloud from mining big data to clinical candidate Jérémy Besnard Data Science For Pharma Summit 28 th January 2016

Overview the 3 bullet points Cloud based data platform that can efficiently capture and mine multiple data sources Platform to facilitate exchange with collaborators and integrate live data into our infrastructure Large scale machine learning to extract knowledge from this data to improve the decision making during drug discovery campaign

Cloud not just a trendy word

exscientia A young spin-out company from the University of Dundee Few employees working in multiple locations Built platform from revenues through partnerships and contracts with pharmaceutical companies We were able to work with our partners quickly and efficiently by deploying our infrastructure on the cloud

Big data Big compared to what? Philosophy is more important than the size Collect and use all the data rather than use small sample sets Accept messiness of data - benefits of using more data of variable quality outweigh the costs of using small, very exact data Accept we may not need to understand the physical basis of a correlation for the predictions to be useful ex scientia = from knowledge in latin

Sources of data Comprehensive exploration Platform integrated with proprietary methods delivers a global view of the polypharmacopaeia HTS & Patent Competitor Structural Fragments Literature derived SAR intel complexes Clinical Proprietary Automated Med-Chem Design Synthesis and Assay through preferred outsourced providers or collaborators

Live data Large datasets, public, commercial or corporate are important source of information but once a drug discovery project starts new information is generated and need to be integrated and exploited Learnt from the past but don t live in the past

Challenges of data flow - plumbing

Shared platform and automation Integration with ScienceCloud from Biovia to share chemistry and assay data with our collaborators Collaborator upload new data Upload to our collaborator for future decisions Automatically downloaded and integrated New information generated

DataScience Darwin meets big data Drug discovery is ultimately a high dimensional optimization problem Given the impossibly vast chemical space, brute force searches are inherently inefficient Darwinian processes are unreasonably efficient at finding solutions to high dimensional problems, whether it be evolutionary fitness in nature or drug discovery

Initial population selection De novo design algorithm Background knowledge Virtual enumeration Elite & random population selection Non-stop condition Predict properties Multiobjective prioritization Besnard et al. Automated design of ligands to polypharmacological profiles, Nature 492, 215 220 http://doi.org/10.1038/nature11691 Final population

Illustration of evolution <30 compounds required to discover, synthesize and patent Compound 27s, a selective D4 compound with early lead properties 2 generations 2 generations 2 generations 2 generations 2 generations >10,000 compounds evolved & scored for D4 and off-targets per generation, but only the few most promising compounds were synthesized screened. Compound 27s D4 K i =90nM Patent: PCT/GB2012/051194 / WO2012160392

Technology in practice Automated lead generation with rapid design cycle and efficient evolution to drug candidate profile Design Simultaneous design objectives deliver balanced compounds Learn Assay data informs next design cycle Make 10-30 compounds/cycle High information content Test in disease relevant assays

COLLABORATION PROJECTS

Metabolic Disorder Dual agonist for two unrelated targets Design against polypharmacology profiles Confirmatory 3D structures of both complexes

Bispecific Compounds Goal is to find first-in-class bispecific small molecule bispecific for two enzymes of unrelated families Process: Gather public and patent data to built models De novo design with evolutionary algorithm Docking of top ranked compounds to assess if the compounds could bind in the 2 targets In-vitro assay followed by crystallography

Structural validation X-ray crystallography of both structure complexes with the top prioritized compound and assay data confirm the design hypothesis Enzyme A IC 50 = 350nM Enzyme B IC 50 = 10nM

Psychiatric Disease Dual agonist for two distinct GPCRs Collaboration with Sumitomo Dainippon Pharma Design against polypharmacology profiles in vitro assessment Rapid delivery of candidate to in vivo safety study

Lead Identification Design, synthesis, assay: 5-15 compounds per 2-week cycle Design of 5 chemotypes Compounds synthesized Ease of synthesis Dual agonist activity Best Affinity GPCR selectivity* 25 80 nm 70 nm to lead optimization 5 30 45 70 nm 100 nm to lead optimization 5 multiple compounds <150nM at both targets. * <50% activity at 1uM over 20 GPCR receptors

Lead Optimization 80 further compounds for each prioritized scaffold Dual agonist <20nM target 1 <20nM target 2 scaffold designated as backup scaffold prioritized additional assays progressed further compounds made on this scaffold Candidate Seeking 40 compounds for prioritized scaffold Dual agonist <20nM target 1 <20nM target 2 solubility HERG >10uM GPCR selectivity DMPK

quality of compounds over 8 metrics Towards candidate nomination Successful bispecific project for CNS disease 2 chemotypes progressing to candidate selection (for Q2 2015) <400 compounds synthesized and assayed 12 month project circle size number of assays performed for each compound circle colour compound quality/proximity to objectives compounds synthesized over time

Faster, cleaner, lower cost Lead to Candidate $3.5M improved productivity $13.5M Hit to Lead Target to Hit Target to Hit Hit to Lead Lead Opt Target Standard cost ($million)* $1 $2.5 $10 Lead to Candidate 1.25 Years shorter timelines enhance efficiency 4.5 Years Hit to Lead Target to Hit Target Standard time (years) * Target to Hit Hit to Lead Lead Opt 1 1.5 2 *Reference cost & time model of R&D process from Eli Lilly: Paul, SM et al. Nature Rev Drug Disc. (2010), 9(3), 203-214

Disruptive approach Using the cloud as IT infrastructure we were able to develop our technology to mine, process and share information smoothly and efficiently Our technology can use this large set of information to deliver Intellectual Property to our clients in the form of compound designs and improve their hit discovery campaign Multiple targets is a realistic objective

Extended capabilities A disruptive system for automated medicinal chemistry Single target projects Improved side effect profiles Planned avoidance of anti-targets Bispecific small molecules Increased efficacy New therapeutic space Efficiency gains Faster project delivery More projects explored Phenotypic drug discovery Increased efficacy New therapeutic space

The team CEO Andrew Hopkins Chair of Medicinal Informatics (Dundee & Oxford) Raised $50 Million for research Author of highly cited papers CTO Jérémy Besnard Co-founder & Co-inventor CIO Richard Bickerton Co-founder. 8 years in Biotech. Trained by Sir Tom Blundell. Chemoinformatics Willem van Hoorn Molecular Informatics Adrian Schreyer COO Mark Swindells Previously Yamanouchi (Tsukuba, Japan) & CSO at Inpharmatica Ltd (UK). Raised over 40 Million in Venture Capital Chief Chemist Andy Bell Co-inventor of sildenafil (Viagra) & key contributor to voriconazole (Vfend) project.

THANK YOU