Making semantics work in drug discovery

Size: px
Start display at page:

Download "Making semantics work in drug discovery"

Transcription

1 Indiana University School of Informatics and Computing Making semantics work in drug discovery Information is cheap. Understanding is expensive (Karl Fast) David Wild, Assistant Professor and Director, Cheminformatics & Chemogenomics Research Group (CCRG) Indiana University School of Informatics and Computing

2 Stack of applying semantics in drug discovery & healthcare New biomedical insights Integrated knowledge discovery processes Integrative tools and algorithms Accessible networks of semantically integrated data Only now going mainstream Wild, D.J., Ding, Y., Sheth, A.P., Harland, L., Gifford, E.M., Lajiness, M.S. Systems Chemical Biology and the Semantic Web: what they mean for the future of drug discovery research, Drug Discovery Today, 2012, 17,

3 Chem2Bio2RDF.org semantically integrated data Chen, B., Dong. X., Jiao, D., Wang, H., Zhu, Q., Ding, Y., Wild, D.J. Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data. BMC Bioinformatics 2010, 11, 255.

4 We can answer many questions with SPARQL What pathways will troglitazone affect? PREFIX c2b2r: < chem2bio2rdf.owl#> PREFIX bp: < PREFIX rdfs: < PREFIX rdf: < select distinct?pathwayname?datasource from < where {?chemical rdfs:label "Troglitazone"^^xsd:string; c2b2r:hasinteraction?interaction.?interaction c2b2r:hastarget?target ; c2b2r:biologicalinterest true.?pathway c2b2r:ispathwayof?target ; bp:name?pathwayname ; bp:xref [c2b2r:identifiertype?datasource]. } Drug Gene Pathway

5 We can answer many questions with SPARQL What are possible multi-target MAPK Inhibitors? PREFIX pubchem: < resource/> PREFIX kegg: < PREFIX uniprot: < resource/> SELECT?compound_cid (count(?compound_cid) as? active_assays) FROM < FROM < FROM < Compound Bio- Assay WHERE {?bioassay pubchem:cid?compound_cid.?bioassay pubchem:outcome?activity. FILTER (? activity=2).?bioassay pubchem:score?score. FILTER (?score>50).?bioassay pubchem:gi?gi.?uniprot uniprot:gi?gi.?pathway kegg:protein?uniprot.?pathway kegg:pathway_name?pathway_name. FILTER regex(?pathway_name,"mapk signaling pathway","i"). Gene Gene } GROUP BY?compound_cid HAVING (count(*)>1) Pathway

6 Variety of expert GUI tools for searching

7 Stack of applying semantics in drug discovery & healthcare New biomedical insights Integrated knowledge discovery processes Integrative tools & algorithms Very little work done in this area Accessible networks of semantically integrated data Wild, D.J., Ding, Y., Sheth, A.P., Harland, L., Gifford, E.M., Lajiness, M.S. Systems Chemical Biology and the Semantic Web: what they mean for the future of drug discovery research, Drug Discovery Today, 2012, 17,

8 ChemBioSpace Association Search He, B., Tang, J., Ding, Y., Wang, H., Sun, Y., Shin, J.H., Chen, B., Moorthy, G., Qiu, J., Desai, P., Wild, D.J., Mining relational paths in biomedical data. PloS One, 2011, 6(12), e27506.

9 Semantic Linked Association Prediction Association score: Association significance: 9.06 x 10-6 => missing link predicted

10 SLAP significant subgraph Chen, B., Ding, Y., Wild, D.J. Assessing Drug Target Association using Semantic Linked Data. PLoS Computational Biology, 2012, 8(7), e

11 Compound/Target SLAP Virtual Screen - Troglitazone (C) 2012 DATA2DISCOVERY INC 11

12 SLAP Drug-Target Prediction Matrix

13 Bipartite repurposing graph created with Sci2

14 Assessing drug similarity from biological function Took 157 drugs with 10 known therapeutic indications, and created SLAP profils against 1,683 human targets Pearson correlation between profiles > 0.9 was used to create associations between drugs Drugs with the same therapeutic indication unsurprisingly cluster together also subcluster by MOA Some drugs with similar profile have different indications potential for use in drug repurposing?

15 Large-scale repurposing networks (C) 2012 DATA2DISCOVERY INC 15

16 Repurposing example anticonvulsant antiarrhythmic anticonvulsant antidepressive H1 antihistamine alpha / beta blocker used for CHF (C) 2012 DATA2DISCOVERY INC 16

17 Stack of applying semantics in drug discovery & healthcare New biomedical insights Integrated knowledge discovery processes What is the added value? Integrative tools & algorithms Accessible networks of semantically integrated data Wild, D.J., Ding, Y., Sheth, A.P., Harland, L., Gifford, E.M., Lajiness, M.S. Systems Chemical Biology and the Semantic Web: what they mean for the future of drug discovery research, Drug Discovery Today, 2012, 17,

18 Integrative virtual screening Ligand-based screening: QSAR, similarity, pharmacophore Structure-based screening: Molecular docking Semantic screening: Semantic association with targets and/or known ligands Look at top hits using each method, and fused hits using harmonic data fusion of ranked lists Currently being validated in PXR (Univ. Cincinnati) and Mtb (OSDD) projects) Pharmacophore Forest Random ROCS SLAP Fusion Pharmacophore Random Forest ROCS SLAP See Bioorg Med Chem Lett.2012 May 1;22(9): Fusion 1

19 MOA: Identifying cardiac side effects of Rosiglitazone Gene/Drug SAA2 APOE ADIPOQ CYP2C8 Strong Discussed PharmGKB Strong Discussed PharmGKB + Matador Strong Positive PharmGKB Strong Changes metabolism (CTD) Rosiglitazone Troglitazone Pioglitazone V. weak V. weak V. weak V. weak V. weak Strong Positive PharmGKB V. Weak Strong Changes metabolism (CTD)

20 MOA: Identifying differential LDL-lowering effect of Troglitazone

21 Parkinson Disease-Inflammation Network

22 Integrating phenotypic assays Wnt pathway Associated Assay Phenotypic Assay Resveratrol Known assay data Associated targets 22 (C) 2012 DATA2DISCOVERY INC

23 Tool prototypes: djwild.info / d2discovery.com

24 Semantic Technologies in Drug Discovery Most commercial organizations are still in the early adopter phase: with a big data integration problem and realizing semantic technologies are a better solution to this than relational databases Some companies are in the bowling alley and are moviong out of this phase. No-one is in the tornado Research (OpenPHACTS, etc) is well on the way to solving the data integration problem and is moving on to advanced searching, data mining and prediction

25 Google Knowledge Graph

26 Lessons & thoughts: adoption of semantics Semantic search is going mainstream Google Knowledge Graph, Facebook Graph Search, Linked Open Data (LOD), OpenPHACTS Now identified as top technology trend for 2013 (gartner.com/newsroom/id/ ) Everyone has a big data integration problem. Semantics now work well for this. Many pharma companies have semantic pilot programs but no-one has gone mainstream with semantics. However this is probably not far off. Switching from relational to semantic models seems revolutionary but can be done in an evolutionary fashion (D2R, etc), although there are some capacity issues and limitations with this approach. We need support for horizontal research in semantic prediction and data mining Based on huge hetergeneous graphs application of existing and new graph algorithms Very little work has been done so far on semantic prediction using heterogeneous, semantic graphs most work is siloed in graph theory, data mining, communities We need support for vertical research in big data / networks / semantics for translational medicine and drug discovery Semantic prediction, using all available data, shows strong promise for utility in areas such as drug-target prediction, off-target profiling, and drug repurposing. Semantics might have rapid adoption in healthcare (EMR s, PHR), and it should be important to be able to integrate from the molecular to patient level. Need to keep good alignment between disciplines Academic-Industry cross-silo colloabration essential: EU OpenPHACTS good example of success

Big Data in Drug Discovery

Big Data in Drug Discovery Big Data in Drug Discovery David J. Wild Assistant Professor & Director, Cheminformatics Program Indiana University School of Informatics and Computing djwild@indiana.edu - http://djwild.info Epochs in

More information

Knowledge Graph: Connecting Big Data Semantics. Ying Ding Indiana University

Knowledge Graph: Connecting Big Data Semantics. Ying Ding Indiana University Knowledge Graph: Connecting Big Data Semantics Ying Ding Indiana University Outline Vision Use Case: VIVO Ontology Use Case: Chem2Bio2RDF Challenges VISION Vision Changes in Search Strings vs. things Vision

More information

Cheminformatics and Pharmacophore Modeling, Together at Last

Cheminformatics and Pharmacophore Modeling, Together at Last Application Guide Cheminformatics and Pharmacophore Modeling, Together at Last SciTegic Pipeline Pilot Bridging Accord Database Explorer and Discovery Studio Carl Colburn Shikha Varma-O Brien Introduction

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

Computational Tools for Medicinal Chemists Increasing the Dimensions of Drug Discovery. Dr Robert Scoffin CEO

Computational Tools for Medicinal Chemists Increasing the Dimensions of Drug Discovery. Dr Robert Scoffin CEO Computational Tools for Medicinal Chemists Increasing the Dimensions of Drug Discovery Dr Robert Scoffin CE Agenda > Building Desktop Tools - A History > About Cresset BioMolecular Discovery > Fields,

More information

PREDICTIVE ANALYTICS: PROVIDING NOVEL APPROACHES TO ENHANCE OUTCOMES RESEARCH LEVERAGING BIG AND COMPLEX DATA

PREDICTIVE ANALYTICS: PROVIDING NOVEL APPROACHES TO ENHANCE OUTCOMES RESEARCH LEVERAGING BIG AND COMPLEX DATA PREDICTIVE ANALYTICS: PROVIDING NOVEL APPROACHES TO ENHANCE OUTCOMES RESEARCH LEVERAGING BIG AND COMPLEX DATA IMS Symposium at ISPOR at Montreal June 2 nd, 2014 Agenda Topic Presenter Time Introduction:

More information

De novo design in the cloud from mining big data to clinical candidate

De novo design in the cloud from mining big data to clinical candidate De novo design in the cloud from mining big data to clinical candidate Jérémy Besnard Data Science For Pharma Summit 28 th January 2016 Overview the 3 bullet points Cloud based data platform that can efficiently

More information

Laboratory Informatics Tools Integration Strategies for Drug Discovery: Integration of LIMS, ELN, CDS, and SDMS

Laboratory Informatics Tools Integration Strategies for Drug Discovery: Integration of LIMS, ELN, CDS, and SDMS 454852JLAXXX10.1177/2211068212454852Journal of Laboratory Automation XX(X)Machina and Wild 2012 Review Article Laboratory Informatics Tools Integration Strategies for Drug Discovery: Integration of LIMS,

More information

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Ping Zhang IBM T. J. Watson Research Center Pankaj Agarwal GlaxoSmithKline Zoran Obradovic Temple University Terms and

More information

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16 Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems

More information

Integrating Medicinal Chemistry and Computational Chemistry: The Molecular Forecaster Approach

Integrating Medicinal Chemistry and Computational Chemistry: The Molecular Forecaster Approach Integrating Medicinal Chemistry and Computational Chemistry: The Molecular Forecaster Approach Molecular Forecaster Inc. www.molecularforecaster.com Company Profile Founded in 2010 by Dr. Eric Therrien

More information

The INFUSIS Project Data and Text Mining for In Silico Modeling

The INFUSIS Project Data and Text Mining for In Silico Modeling The INFUSIS Project Data and Text Mining for In Silico Modeling Henrik Boström 1,2, Ulf Norinder 3, Ulf Johansson 4, Cecilia Sönströd 4, Tuve Löfström 4, Elzbieta Dura 5, Ola Engkvist 6, Sorel Muresan

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

Anforderungen der Life-Science Industrie an die Hochschulen. Hans Widmer Novartis Institutes for BioMedical Research

Anforderungen der Life-Science Industrie an die Hochschulen. Hans Widmer Novartis Institutes for BioMedical Research Anforderungen der Life-Science Industrie an die Hochschulen Hans Widmer Novartis Institutes for BioMedical Research There s nothing more extraordinary than a normal life 2 What does industry expect from

More information

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov Search and Data Mining: Techniques Applications Anya Yarygina Boris Novikov Introduction Data mining applications Data mining system products and research prototypes Additional themes on data mining Social

More information

Cheminformatics and its Role in the Modern Drug Discovery Process

Cheminformatics and its Role in the Modern Drug Discovery Process Cheminformatics and its Role in the Modern Drug Discovery Process Novartis Institutes for BioMedical Research Basel, Switzerland With thanks to my colleagues: J. Mühlbacher, B. Rohde, A. Schuffenhauer

More information

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives Dirk.Repsilber@oru.se 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför

More information

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices overview Pipeline Pilot Enterprise Server Pipeline Pilot Enterprise Server (PPES) is a powerful client-server platform that streamlines the integration and analysis of the vast quantities of data flooding

More information

PerCuro-A Semantic Approach to Drug Discovery. Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang

PerCuro-A Semantic Approach to Drug Discovery. Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang PerCuro-A Semantic Approach to Drug Discovery Final Project Report submitted by Meenakshi Nagarajan Karthik Gomadam Hongyu Yang Towards the fulfillment of the course Semantic Web CSCI 8350 Fall 2003 Under

More information

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials Pharmaceutical leader deploys TIBCO Spotfire enterprise analytics platform across its drug discovery organization

More information

Big Data Healthcare. Fei Wang Associate Professor Department of Computer Science and Engineering School of Engineering University of Connecticut

Big Data Healthcare. Fei Wang Associate Professor Department of Computer Science and Engineering School of Engineering University of Connecticut Big Data Healthcare Fei Wang Associate Professor Department of Computer Science and Engineering School of Engineering University of Connecticut Healthcare Is in Crisis Hersh, W., Jacko, J. A., Greenes,

More information

> Semantic Web Use Cases and Case Studies

> Semantic Web Use Cases and Case Studies > Semantic Web Use Cases and Case Studies Case Study: Applied Semantic Knowledgebase for Detection of Patients at Risk of Organ Failure through Immune Rejection Robert Stanley 1, Bruce McManus 2, Raymond

More information

Dr Alexander Henzing

Dr Alexander Henzing Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander

More information

CLUSTER ANALYSIS WITH R

CLUSTER ANALYSIS WITH R CLUSTER ANALYSIS WITH R [cluster analysis divides data into groups that are meaningful, useful, or both] LEARNING STAGE ADVANCED DURATION 3 DAY WHAT IS CLUSTER ANALYSIS? Cluster Analysis or Clustering

More information

Consensus Scoring to Improve the Predictive Power of in-silico Screening for Drug Design

Consensus Scoring to Improve the Predictive Power of in-silico Screening for Drug Design Consensus Scoring to Improve the Predictive Power of in-silico Screening for Drug Design Masato Okada Faculty of Science and Technology, Masato Tsukamoto Faculty of Pharmaceutical Sciences, Hayato Ohwada

More information

Clinical Genomics at Scale: Synthesizing and Analyzing Big Data From Thousands of Patients

Clinical Genomics at Scale: Synthesizing and Analyzing Big Data From Thousands of Patients Clinical Genomics at Scale: Synthesizing and Analyzing Big Data From Thousands of Patients Brandy Bernard PhD Senior Research Scientist Institute for Systems Biology Seattle, WA Dr. Bernard s research

More information

Healthcare Analytics. Aryya Gangopadhyay UMBC

Healthcare Analytics. Aryya Gangopadhyay UMBC Healthcare Analytics Aryya Gangopadhyay UMBC Two of many projects Integrated network approach to personalized medicine Multidimensional and multimodal Dynamic Analyze interactions HealthMask Need for sharing

More information

Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper. By Erich A. Gombocz

Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper. By Erich A. Gombocz Research Data Integration of Retrospective Studies for Prediction of Disease Progression A White Paper By Erich A. Gombocz 2 Research Data Integration of Retrospective Studies for Prediction of Disease

More information

Big Data An Opportunity or a Distraction? Signal or Noise?

Big Data An Opportunity or a Distraction? Signal or Noise? Big Data An Opportunity or a Distraction? Signal or Noise? Maya R. Said, Sc.D. SVP & Global Head, Oncology Policy & Market Access, Novartis 3rd International Systems Biomedicine Symposium Luxembourg, 28

More information

DATA SCIENCE ADVISING NOTES David Wild - updated May 2015

DATA SCIENCE ADVISING NOTES David Wild - updated May 2015 DATA SCIENCE ADVISING NOTES David Wild - updated May 2015 GENERAL NOTES Lots of information can be found on the website at http://datascience.soic.indiana.edu. Dr David Wild, Data Science Graduate Program

More information

Research IT Services. UCD IT Services

Research IT Services. UCD IT Services Research IT Services Survey Final Results - Brief Breakdown During October 2007, Research IT Services conducted a survey of the UCD Research Community in an effort to establish their requirements with

More information

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and

More information

The Open PHACTS Discovery Platform Semantic data integration for Medicinal Chemists

The Open PHACTS Discovery Platform Semantic data integration for Medicinal Chemists Pharmacoinformatics Research Group Department of Pharmaceutical Chemistry The Open PHACTS Discovery Platform Semantic data integration for Medicinal Chemists Gerhard F. Ecker Dept. of Pharmaceutical Chemistry,

More information

How To Use Data Analysis To Get More Information From A Computer Or Cell Phone To A Computer

How To Use Data Analysis To Get More Information From A Computer Or Cell Phone To A Computer Applying Big Data approaches to Competitive Intelligence challenges THOMSON REUTERS IP & SCIENCE PHARMA CI EUROPE CONFERENCE & EXHIBITION TIM MILLER 19 FEBRUARY 2014 BIG DATA, NOT JUST ABOUT VOLUMES Patient

More information

Integrated Biomedical and Clinical Research Informatics for Translational Medicine and Therapeutics

Integrated Biomedical and Clinical Research Informatics for Translational Medicine and Therapeutics Integrated Biomedical and Clinical Research Informatics for Translational Medicine and Therapeutics J. Richard Landis, PhD Robert M. Curley, MS Gregg Fromell, MD Center for Clinical Epidemiology and Biostatistics

More information

Leukemia Drug Pathway Analyzer

Leukemia Drug Pathway Analyzer Brochure More information from http://www.researchandmarkets.com/reports/2673689/ Leukemia Drug Pathway Analyzer Description: Extra value: One year of free online updates included with this product There

More information

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science Dr. Daisy Zhe Wang CISE Department University of Florida August 25th 2014 20 Review Overview of Data Science Why Data

More information

Course Requirements for the Ph.D., M.S. and Certificate Programs

Course Requirements for the Ph.D., M.S. and Certificate Programs Course Requirements for the Ph.D., M.S. and Certificate Programs PhD Program The PhD program in the Health Informatics subtrack inherits all course requirements of the Informatics PhD program, that is,

More information

Analysis of Illumina Gene Expression Microarray Data

Analysis of Illumina Gene Expression Microarray Data Analysis of Illumina Gene Expression Microarray Data Asta Laiho, Msc. Tech. Bioinformatics research engineer The Finnish DNA Microarray Centre Turku Centre for Biotechnology, Finland The Finnish DNA Microarray

More information

Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics

Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics Driving Innovation in Licensing Through Competitive Intelligence and Big Data Analytics John D Antonio Practice Leader, Licensing & Competitive Intelligence About Paragon We help increase asset value from

More information

Alterações empresariais sustentadas pelo conceito de engenharia do Produto Patrício Soares da Silva, MD, PhD

Alterações empresariais sustentadas pelo conceito de engenharia do Produto Patrício Soares da Silva, MD, PhD Alterações empresariais sustentadas pelo conceito de engenharia do Produto Patrício Soares da Silva, MD, PhD 1 Summary Hypothesis Generation Candidate Development Commercialization Target Identification

More information

Trends in Technology based Health Investments (A USA view)

Trends in Technology based Health Investments (A USA view) Trends in Technology based Health Investments (A USA view) Second UPM innovatech International Workshop Investing in High Technology for Success Center for Support of Technological Innovation CAIT Technical

More information

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

Application of Graph-based Data Mining to Metabolic Pathways

Application of Graph-based Data Mining to Metabolic Pathways Application of Graph-based Data Mining to Metabolic Pathways Chang Hun You, Lawrence B. Holder, Diane J. Cook School of Electrical Engineering and Computer Science Washington State University Pullman,

More information

Towards a National Health Knowledge Infrastructure (NHKI): The role of Semantics-based Knowledge Management vkashyap1@partners.org

Towards a National Health Knowledge Infrastructure (NHKI): The role of Semantics-based Knowledge Management vkashyap1@partners.org Towards a National Health Knowledge Infrastructure (NHKI): The role of Semantics-based Knowledge Management Vipul Kashyap and Tonya Hongsermeier Clinical Informatics R&D, Partners HealthCare System vkashyap1@partners.org

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Course Requirements for the Ph.D., M.S. and Certificate Programs

Course Requirements for the Ph.D., M.S. and Certificate Programs Health Informatics Course Requirements for the Ph.D., M.S. and Certificate Programs Health Informatics Core (6 s.h.) All students must take the following two courses. 173:120 Principles of Public Health

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Exploration and Visualization of Post-Market Data

Exploration and Visualization of Post-Market Data Exploration and Visualization of Post-Market Data Jianying Hu, PhD Joint work with David Gotz, Shahram Ebadollahi, Jimeng Sun, Fei Wang, Marianthi Markatou Healthcare Analytics Research IBM T.J. Watson

More information

IO Informatics The Sentient Suite

IO Informatics The Sentient Suite IO Informatics The Sentient Suite Our software, The Sentient Suite, allows a user to assemble, view, analyze and search very disparate information in a common environment. The disparate data can be numeric

More information

Data Mining Analysis of HIV-1 Protease Crystal Structures

Data Mining Analysis of HIV-1 Protease Crystal Structures Data Mining Analysis of HIV-1 Protease Crystal Structures Gene M. Ko, A. Srinivas Reddy, Sunil Kumar, and Rajni Garg AP0907 09 Data Mining Analysis of HIV-1 Protease Crystal Structures Gene M. Ko 1, A.

More information

LUCKY AHMED Department of Chemistry and Biochemistry Yale University, New Haven, CT 06511 Email: lucky.ahmed@yale.edu

LUCKY AHMED Department of Chemistry and Biochemistry Yale University, New Haven, CT 06511 Email: lucky.ahmed@yale.edu LUCKY AHMED Department of Chemistry and Biochemistry Yale University, New Haven, CT 06511 Email: lucky.ahmed@yale.edu EDUCATION PhD in Computational Chemistry Spring- Dissertation Title: Computational

More information

AP Biology Essential Knowledge Student Diagnostic

AP Biology Essential Knowledge Student Diagnostic AP Biology Essential Knowledge Student Diagnostic Background The Essential Knowledge statements provided in the AP Biology Curriculum Framework are scientific claims describing phenomenon occurring in

More information

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences

Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Paradigm Changes Affecting the Practice of Scientific Communication in the Life Sciences Prof. Dr. Martin Hofmann-Apitius Head of the Department of Bioinformatics Fraunhofer Institute for Algorithms and

More information

TECHNICAL INSIGHTS TECHNOLOGY ALERT

TECHNICAL INSIGHTS TECHNOLOGY ALERT TECHNICAL INSIGHTS TECHNOLOGY ALERT UNLOCKING BREAST CANCER TUMORS' DRUG RESISTANCE A troubling phenomenon in breast cancer treatment is that some patients tumors become resistant to treatments over time.

More information

Recent Developments in Chemoinformatics Education. Val Gillet University of Sheffield

Recent Developments in Chemoinformatics Education. Val Gillet University of Sheffield Recent Developments in Chemoinformatics Education Val Gillet University of Sheffield Chemoinformatics as a Discipline Chemical Information Systems and Services have been established for many years Chemical

More information

Focus on Your Future: PhD and MD-PhD Programs and Research Careers

Focus on Your Future: PhD and MD-PhD Programs and Research Careers Focus on Your Future: PhD and MD-PhD Programs and Research Careers TWD Program Director s Meeting Chantilly, VA June 4, 2015 Elise Covic, PhD (University of Chicago, Dean of Students MSTP) Sean Crosson,

More information

Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro

Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro Subgraph Patterns: Network Motifs and Graphlets Pedro Ribeiro Analyzing Complex Networks We have been talking about extracting information from networks Some possible tasks: General Patterns Ex: scale-free,

More information

Personalized Predictive Modeling and Risk Factor Identification using Patient Similarity

Personalized Predictive Modeling and Risk Factor Identification using Patient Similarity Personalized Predictive Modeling and Risk Factor Identification using Patient Similarity Kenney Ng, PhD 1, Jimeng Sun, PhD 2, Jianying Hu, PhD 1, Fei Wang, PhD 1,3 1 IBM T. J. Watson Research Center, Yorktown

More information

Russ College of Engineering and Technology. Revised 9/06. 2. Undergraduate GPA of 3.0/4.0 or equivalent.

Russ College of Engineering and Technology. Revised 9/06. 2. Undergraduate GPA of 3.0/4.0 or equivalent. Requirements and Guidelines for the Master of Science Degree in Biomedical Engineering Revised 9/06 I. Minimum Entrance Requirements Admission is subject to final approval by the BME Graduate Committee.

More information

Integration of genomic data into electronic health records

Integration of genomic data into electronic health records Integration of genomic data into electronic health records Daniel Masys, MD Affiliate Professor Biomedical & Health Informatics University of Washington, Seattle Major portion of today s lecture is based

More information

Linked Longitudinal Medical Record. Susie Stephens Co chair W3C Health Care & Life Science Interest Group

Linked Longitudinal Medical Record. Susie Stephens Co chair W3C Health Care & Life Science Interest Group Linked Longitudinal Medical Record Susie Stephens Co chair W3C Health Care & Life Science Interest Group Outline Secondary Use of Health Care Data Introduction to the Semantic Web Translational Medicine

More information

A leader in the development and application of information technology to prevent and treat disease.

A leader in the development and application of information technology to prevent and treat disease. A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today

More information

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology Programme Structure - the MSc outcome will require 180 credits total (full-time only) - 60

More information

THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC)

THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC) ABOUT THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC) The CCDC is the trusted research institution responsible for the 50-year old Cambridge Structural Database (CSD) and its applications. Used by thousands

More information

The data forest. Application. Application Application DATA. Office of Research

The data forest. Application. Application Application DATA. Office of Research The data forest DATA Unfortunately Data to the rescue The Rensselaer IDEA HPC: Computational Science and Engineering + Data Science and Predictive Analytics + Cognitive Computing + Perceptualization DATA

More information

The various steps in the solution approach are presented below.

The various steps in the solution approach are presented below. From Web 1.0 3.0: Is RDF access to RDB enough? Vipul Kashyap, Senior Medical Informatician, Partners Healthcare System, vkashyap1@partners.org Martin Flanagan, CTO, InSilico Discovery, mflanagan@insilicodiscovery.com

More information

Big Data and Healthcare Payers WHITE PAPER

Big Data and Healthcare Payers WHITE PAPER Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other

More information

Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM

Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM Uncovering Value in Healthcare Data with Cognitive Analytics Christine Livingston, Perficient Ken Dugan, IBM Conflict of Interest Christine Livingston Ken Dugan Has no real or apparent conflicts of interest

More information

GITIRBio: A Semantic and Distributed Service OrientedArchitecture for Bioinformatics Pipeline

GITIRBio: A Semantic and Distributed Service OrientedArchitecture for Bioinformatics Pipeline Luis F. Castillo1, Germán López-Gartner2, Gustavo A. Isaza1*, Mariana Sánchez2, Jeferson Arango1, Daniel Agudelo-Valencia2, Sergio Castaño1 1 Systems and Informatics Department, GITIR Research Group. Caldas

More information

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences It s not information overload, it s filter failure. Clay Shirky Life Sciences organizations face the challenge

More information

1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM)

1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM) 1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM) 2. Gene regulation tools and methods Regulatory sequences and motif discovery TF binding sites, microrna target prediction

More information

TRANSLATIONAL BIOINFORMATICS 101

TRANSLATIONAL BIOINFORMATICS 101 TRANSLATIONAL BIOINFORMATICS 101 JESSICA D. TENENBAUM Department of Bioinformatics and Biostatistics, Duke University Durham, NC 27715 USA Jessie.Tenenbaum@duke.edu SUBHA MADHAVAN Innovation Center for

More information

Integrating Bioinformatics, Medical Sciences and Drug Discovery

Integrating Bioinformatics, Medical Sciences and Drug Discovery Integrating Bioinformatics, Medical Sciences and Drug Discovery M. Madan Babu Centre for Biotechnology, Anna University, Chennai - 600025 phone: 44-4332179 :: email: madanm1@rediffmail.com Bioinformatics

More information

Division of Bioinformatics and Biostatistics

Division of Bioinformatics and Biostatistics Division of Bioinformatics and Biostatistics Weida Tong, Ph.D. The views presented do not necessarily reflect the views of the FDA. 1 Established on May 20 th, 2012. Three Branches: Division Overview Bioinformatics

More information

Big Data and Text Mining

Big Data and Text Mining Big Data and Text Mining Dr. Ian Lewin Senior NLP Resource Specialist Ian.lewin@linguamatics.com www.linguamatics.com About Linguamatics Boston, USA Cambridge, UK Software Consulting Hosted content Agile,

More information

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk Text Mining for Health Care and Medicine Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk The Need for Text Mining MEDLINE 2005: ~14M 2009: ~18M Overwhelming information in textual,

More information

Big RDF Data Partitioning and Processing using hadoop in Cloud

Big RDF Data Partitioning and Processing using hadoop in Cloud Big RDF Data Partitioning and Processing using hadoop in Cloud Tejas Bharat Thorat Dept. of Computer Engineering MIT Academy of Engineering, Alandi, Pune, India Prof.Ranjana R.Badre Dept. of Computer Engineering

More information

cansar: integrated cancer knowledgebase

cansar: integrated cancer knowledgebase in partnership with cansar: integrated cancer knowledgebase Bissan Al-Lazikani Cancer Research UK Cancer Therapeutics Unit 10 th Dec 2013 Sharing knowledge for drug discovery Resource to effectively integrate

More information

Pharmacology skills for drug discovery. Why is pharmacology important?

Pharmacology skills for drug discovery. Why is pharmacology important? skills for drug discovery Why is pharmacology important?, the science underlying the interaction between chemicals and living systems, emerged as a distinct discipline allied to medicine in the mid-19th

More information

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge

More information

Delivering real Information Management solutions that result in better, more efficient healthcare.

Delivering real Information Management solutions that result in better, more efficient healthcare. Delivering real Information Management solutions that result in better, more efficient healthcare. For more than 30 years, we have helped companies overcome challenges and identify opportunities to achieve

More information

Too Much Data or Too Little Cooperation? Tom Plasterer, PhD. Research & Development Information (RDI) Director, US Cross-Science

Too Much Data or Too Little Cooperation? Tom Plasterer, PhD. Research & Development Information (RDI) Director, US Cross-Science Too Much Data or Too Little Cooperation? Tom Plasterer, PhD. Research & Development Information (RDI) Director, US Cross-Science Approaching a Pharma Big Data Problem: Requirements of the CI Informatics

More information

Health Informatics Student Handbook

Health Informatics Student Handbook Health Informatics Student Handbook Interdisciplinary Graduate Program in Informatics (IGPI) Please see the Interdisciplinary Graduate Program in Informatics Handbook for more information regarding our

More information

Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA) shardy@nicta.com.au

Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA) shardy@nicta.com.au Data Analytics at NICTA Stephen Hardy National ICT Australia (NICTA) shardy@nicta.com.au NICTA Copyright 2013 Outline Big data = science! Data analytics at NICTA Discrete Finite Infinite Machine Learning

More information

Mining the Web of Linked Data with RapidMiner

Mining the Web of Linked Data with RapidMiner Mining the Web of Linked Data with RapidMiner Petar Ristoski, Christian Bizer, and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {petar.ristoski,heiko,chris}@informatik.uni-mannheim.de

More information

International Workshop on Big Data Analytics for Advanced Databases (BIGDATA, 2016)

International Workshop on Big Data Analytics for Advanced Databases (BIGDATA, 2016) International Workshop on Big Data Analytics for Advanced Databases (BIGDATA, 2016) Call for Papers AIM and SCOPE There is an exponential growth in digital data with unprecedented new platforms derived

More information

PHCO 701 Short title Molecular Pharmacology Long title Introduction to Molecular Pharmacology

PHCO 701 Short title Molecular Pharmacology Long title Introduction to Molecular Pharmacology PHCO 701 Short title Molecular Pharmacology Long title Introduction to Molecular Pharmacology Effective term Fall Course units/hours 3 credit hours M/W/F 9:00am-9:50am in 4007 Genetic Medicine Bldg. Grading

More information

Chapter 1. Data Analytics for Pharmaceutical Discoveries

Chapter 1. Data Analytics for Pharmaceutical Discoveries Data Analytics for Pharmaceutical Discoveries Shobeir Fakhraei, Eberechukwu Onukwugha, Lise Getoor Healthcare Data Analytics, CRC Press 2015 Chapter 1 Data Analytics for Pharmaceutical Discoveries Shobeir

More information

Powering Cutting Edge Research in Life Sciences with High Performance Computing

Powering Cutting Edge Research in Life Sciences with High Performance Computing A Point of View Powering Cutting Edge Research in Life Sciences with High Performance Computing High performance computing (HPC) is the foundation of pioneering research in life sciences. HPC plays a vital

More information

Ph.D. in Bioinformatics and Computational Biology Degree Requirements

Ph.D. in Bioinformatics and Computational Biology Degree Requirements Ph.D. in Bioinformatics and Computational Biology Degree Requirements Credits Students pursuing the doctoral degree in BCB must complete a minimum of 90 credits of relevant work beyond the bachelor s degree;

More information

Fulvio Gualtieri Department of Pharmaceutical Sciences, University of Florence, Italy

Fulvio Gualtieri Department of Pharmaceutical Sciences, University of Florence, Italy MEDICINAL CHEMISTRY Fulvio Gualtieri Department of Pharmaceutical Sciences, University of Florence, Italy Keywords: Drugs, medicinal chemistry, drug design, drug synthesis, drug development, pharmacokinetics,

More information

Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw

Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Healthcare data analytics Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Outline Data Science Enabling technologies Grand goals Issues Google flu trend Privacy Conclusion Analytics

More information

Indiana University School of Informatics and Computing

Indiana University School of Informatics and Computing SCHOOL OF INFORMATICS AND COMPUTING Exceed Expectations Our programs will exceed your expectations and help you exceed your own. Our students and faculty are breaking new ground every day: creating applications

More information

Medical Informatics An Overview Saudi Board For Community Medicine

Medical Informatics An Overview Saudi Board For Community Medicine Medical Informatics An Overview Saudi Board For Community Medicine Ahmed AlBarrak PhD Medical Informatics Associate Professor of Health Informatics, Family & Community Med, Chairman, Medical Informatics,

More information

SEARCHING BIOTECHNOLOGY INFORMATION IN THE 2010s: Section II (Databases & Search Strategies)

SEARCHING BIOTECHNOLOGY INFORMATION IN THE 2010s: Section II (Databases & Search Strategies) SEARCHING BIOTECHNOLOGY INFORMATION IN THE 2010s: Section II (Databases & Search Strategies) Luca Falciola, IP Manager, Promethera Biosciences Sardegna Ricerche-Univ. Cagliari (Sep. 15 th 2014) Databases

More information

A UNIFIED VIEW OF RESEARCH AND TRIAL DATA Red Hat JBoss Data Virtualization for the pharmaceutical and life sciences industry

A UNIFIED VIEW OF RESEARCH AND TRIAL DATA Red Hat JBoss Data Virtualization for the pharmaceutical and life sciences industry A UNIFIED VIEW OF RESEARCH AND TRIAL DATA Red Hat JBoss Data Virtualization for the pharmaceutical and life sciences industry TECHNOLOGY OVERVIEW UNLOCK THE FULL POTENTIAL OF ALL THE DATA ASSETS TO DRIVE

More information

Arizona and PSMs. Collaboration Across Arizona Universities for a Better Arizona

Arizona and PSMs. Collaboration Across Arizona Universities for a Better Arizona Arizona and PSMs Collaboration Across Arizona Universities for a Better Arizona Positioning Arizona and Its Research Universities: Science and Technology Core Competencies Assessment Battelle Based on

More information