Genevestigator Training



Similar documents
AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

School of Nursing. Presented by Yvette Conley, PhD

Thomson Reuters Biomarker Solutions: Hepatitis C Treatment Biomarkers and special considerations in patients with Asthma

Ingenuity Pathway Analysis (IPA )

ProteinQuest user guide

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

What s New in Pathway Studio Web 11.1

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

JustClust User Manual

SIMCA 14 MASTER YOUR DATA SIMCA THE STANDARD IN MULTIVARIATE DATA ANALYSIS

Dr Alexander Henzing

Analysis of Illumina Gene Expression Microarray Data

Targeting Specific Cell Signaling Pathways for the Treatment of Malignant Peritoneal Mesothelioma

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Discover more, discover faster. High performance, flexible NLP-based text mining for life sciences

Hierarchical Clustering Analysis

Delivering the power of the world s most successful genomics platform

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Big Data and Text Mining

dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University

BIOINFORMATICS Supporting competencies for the pharma industry

Core Facility Genomics

Big Data Visualization for Genomics. Luca Vezzadini Kairos3D

A Primer of Genome Science THIRD

Data Mining and Machine Learning in Bioinformatics

life science data mining

Pharmacology skills for drug discovery. Why is pharmacology important?

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

Cancer Genomics: What Does It Mean for You?

Gene expression analysis. Ulf Leser and Karin Zimmermann

MultiExperiment Viewer Quickstart Guide

PODD. An Ontology Driven Architecture for Extensible Phenomics Data Management

Survey of clinical data mining applications on big data in health informatics

> Semantic Web Use Cases and Case Studies

Integrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon

MediSapiens Ltd. Bio-IT solutions for improving cancer patient care. Because data is not knowledge. 19th of March 2015

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Personalized medicine in China s healthcare system

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

NIH Genomic Data Sharing (GDS) Policy Guidance Memo #2 1

Validation and Replication

Data deluge (and it s applications) Gianluigi Zanetti. Data deluge. (and its applications) Gianluigi Zanetti

CyTOF2. Mass cytometry system. Unveil new cell types and function with high-parameter protein detection

A leader in the development and application of information technology to prevent and treat disease.

Voluntary Genomic Data Submissions at the U.S. FDA

IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS

A Genetic Analysis of Rheumatoid Arthritis

Strategies in data integration to predict fish susceptibility to toxicants

How To Get A Grant From Kinesis

Methods for network visualization and gene enrichment analysis July 17, Jeremy Miller Scientist I jeremym@alleninstitute.org

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

GC3 Use cases for the Cloud

Kinexus has an in-house inventory of lysates prepared from 16 human cancer cell lines that have been selected to represent a diversity of tissues,

Protein Protein Interaction Networks

Guide for Data Visualization and Analysis using ACSN

How many of you have checked out the web site on protein-dna interactions?

Introduction To Real Time Quantitative PCR (qpcr)

University of Glasgow - Programme Structure Summary C1G MSc Bioinformatics, Polyomics and Systems Biology

Genetic profiles in relation to sports: a databased approach

Big Data. Tom Plunkett Senior Consultant

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining

The Future of the Electronic Health Record. Gerry Higgins, Ph.D., Johns Hopkins

TRANSLATIONAL BIOINFORMATICS 101

Exercise with Gene Ontology - Cytoscape - BiNGO

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis

Get More Value from Your Reference Data Make it Meaningful with TopBraid RDM

Autoimmunity and immunemediated. FOCiS. Lecture outline

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives

Genomics GENterprise

Biophysical and biochemical mechanisms of the biological effects of mobile phone radiation

Clinical and research data integration: the i2b2 FSM experience

G E N OM I C S S E RV I C ES

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Bioinformatics for cancer immunology and immunotherapy

Metodi Numerici per la Bioinformatica

Find the signal in the noise

Special report. Chronic Lymphocytic Leukemia (CLL) Genomic Biology 3020 April 20, 2006

Visualizing Networks: Cytoscape. Prat Thiru

Transcription:

Genevestigator Training Gent, 6 November 2012 Philip Zimmermann, Nebion AG

Goals Get to know Genevestigator What Genevestigator is for For who Genevestigator was created How to use Genevestigator for your research

Background: our company NEBION AG Spin-off company from the Swiss Federal Institute of Technology Zurich (ETH) Founded in 2008 Product: Genevestigator Services: curation of proprietary data data analysis services www.nebion.com

Number of datasets Value of public -omics data Thousands of -omics experiments have been published; soon millions! Highly specialized experiments Genome-wide coverage Wealth of information! 35000 30000 25000 20000 15000 10000 5000 Total number of datasets available in GEO 0 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 e.g. Publicly available microarray studies

Problem with public -omics data Not systematically curated No annotation using controlled vocabularies No normalization No quality control Rudimentary sample descriptions Many annotation errors!!! Therefore: Not easily exploitable! Extensive curation needed Methods and tools for global exploration are lacking

OUR SOLUTION Quality control Normalization Annotation Search Engine www.genevestigator.com 1. Professional biocuration of public data 2. Powerful search engine 3. User-friendly Web based tools

Interpreting your results Pathways, biological processes Networks Diseases GSEA, GO enrichment Literature Your results e.g. List of genes INTERPRETATION Compare with data from public and proprietary omics experiments. View/cluster expression by: - Tissue types, cell lines - Genotypes - Diseases, perturbations, cancers - etc.

Annotation ontologies mouse example Perturbations ontology Anatomy ontology SPACE Neoplasm ontology SPACE TIME CLINICAL PARAMETERS RESPONSE

Concept of meta-profiles

Deep data integration Proof of principle

Tissue type versus perturbations

Database content - overview Status: october 2013

Database content diseases 172 diseases from 28 disease areas 1000+ cancer types and subtypes Human and animal models

Genevestigator website www.genevestigator.com

Genevestigator is an online tool Website Java Client Application INTERNET Search Engine

Comparison with other products experiment Sample annotation via keywords Sample annotation with controlled vocabularies using ontologies proprietary public analysis of one or a few experiments Individual experiment normalization Combine results Global compendium normalization Search engine Analysis on deeply integrated data Genedata expressionist Qlucore Omics Explorer Partek, Omicsoft, etc. MeV, GenePattern, etc. Nextbio Oncomine Medisapiens Genelogic GENEVESTIGATOR

Application areas Functional genomics Bring gene expression into the context of thousands of conditions Find gene-gene and gene-condition associations Interpret lists of genes Target validation / drug repositioning Find out when, where, and in response to what a target is expressed Biomarker identification / validation Find genes highly specific for selected diseases or conditions

Demo

Analytical approach 1: Condition Search genes which conditions? Anatomy [space] Development [time] Stimulus / Mutation [response]

Case study 1 Drug repositioning Example of Glivec (Novartis) Glivec inhibits ABL (used for CML treatment) Glivec also inhibits KIT: in which conditions is this gene strongly up regulated? cell culture (107) non-cancer tissues (177) neoplasms (1040 / 850 subtypes)

Analytical approach 2: Gene Search conditions which genes? Anatomy [space] Development [time] Stimulus / Mutation [response]

Gene Search Identify genes that exhibit specific expression characteristics Anatomy Development Stimulus / Mutation

condition 1 condition 2 condition 3 condition 4 condition 5 condition 6 condition 7 condition 8 condition 9 condition 10 condition 11 condition 12 condition 13 condition 14 condition 15 condition 16 condition 17 Concept of gene search in Genevestigator gene 1 gene 2 gene 3 gene 4 gene 5 most biomarker search approaches look for the genes, which respond the most to a given condition gene 6 gene 7 gene 8 gene 9 gene 10 gene 11 gene 12 gene 13 gene 14 gene 15?? This condition may include multiple similar studies How these genes respond to other conditions is unknown, because they were not included into the analysis gene 16 gene 17

condition 1 condition 2 condition 3 condition 4 condition 5 condition 6 condition 7 condition 8 condition 9 condition 10 condition 11 condition 12 condition 13 condition 14 condition 15 condition 16 condition 17 Concept of gene search in Genevestigator gene 1 gene 2 gene 3 gene 4 gene 5 gene 6 gene 7 gene 8 gene 9 gene 10 gene 11 gene 12 gene 13 gene 14 gene 15 gene 16 gene 17 Genevestigator allows to find out how specific these genes are (Meta-Profile Analysis -> Stimulus/Mutation tools) Only few are responsive only to condition 9 (black arrows). All others are sensitive to one (grey arrows) or more other conditions.

condition 1 condition 2 condition 3 condition 4 condition 5 condition 6 condition 7 condition 8 condition 9 condition 10 condition 11 condition 12 condition 13 condition 14 condition 15 condition 16 condition 17 Concept of gene search in Genevestigator gene 3 gene 5 gene 7 gene 13 gene 17 gene 10 gene 2 gene 15 gene 9 gene 12 gene 4 gene 11 gene 16 gene 1 gene 6 gene 8 gene 14 The Genevestigator Biomarker Search tools identify genes that are specifically responsive to the chosen condition (they respond minimally to other conditions). These genes are not necessarily the ones with the strongest response to the chosen condition The Genevestigator Biomarker Search tools usually find other target candidates than classical tools, which analyze only a subset of experiments

condition 1 condition 2 condition 3 condition 4 condition 5 condition 6 condition 7 condition 8 condition 9 condition 10 condition 11 condition 12 condition 13 condition 14 condition 15 condition 16 condition 17 condition 18 condition 19 condition 20 condition 21 condition 22 condition 23 condition 24 condition 25 condition 26 condition 27 condition 28 condition 29 condition 30 condition 31 condition 32 condition 33 condition 34 condition 35 condition 36 condition 37 condition 38 condition 39 condition 40 condition 41 condition 42 condition 43 condition 44 condition 45 condition 46 condition 47 condition 48 condition 49 condition 50 condition 51 condition 52 condition 53 condition 54 condition 55 condition 56 condition 57 condition 58 condition 59 condition 60 condition 61 condition 62 condition 63 condition 64 condition 65 condition 66 condition 67 condition 68 condition 69 condition 70 condition 71 condition 72 condition 73 condition 74 condition 75 Concept of gene search in Genevestigator Imagine extending this to a much wider set of conditions you may find other conditions to which the set of genes respond target condition gene 3 gene 5 gene 7 gene 13 gene 17 gene 10 gene 2 gene 15 gene 9 gene 12 gene 4 gene 11 gene 16 gene 1 gene 6 gene 8 gene 14 other conditions to which the genes are responding

Example of co-regulation Similar mechanisms of action cause similar transcriptional effects target condition(s) Actinomycin-D vmyb Oncolytic herpes simplex virus Propiconazole Sapphyrin Echinomycin Cell cycle inhibition co-inducing conditions Chemical: ARC

Clustering tools Goal: to identify groups of genes that have similar expression characteristics Tools: Hierarchical clustering (with leaf ordering) Biclustering (BiMax algorithm)

RefGenes

RefGenes - validation Reference genes searched from a set of 197 mouse liver samples 4 candidates compared to 4 commonly used genes Validation experiment on mouse liver Experimental validation on 16 mouse liver samples (RT-qPCR) Selection using GeNorm genorm selection of the most stable reference genes within this experiment

RefGenes study experimental validation

RefGenes concept

RefGenes website www.refgenes.org

Summary Genevestigator is... A gene expression search engine High quality compendium of manually curated expression data The world s largest and growing collection of expression data A flexible software application (plug-in architecture) Web-based (Internet or Intranet) User-friendly and extremely fast

Summary: it s all about CONTEXT!