TUTORIAL From Protein-Protein Interactions (PPIs) to networks and pathways ???

Size: px
Start display at page:

Download "TUTORIAL From Protein-Protein Interactions (PPIs) to networks and pathways ??? http://bioinfow.dep.usal.es"

Transcription

1 TUTORIAL From Protein-Protein Interactions (PPIs) to networks and pathways Dr. Javier De Las Rivas Dr. Carlos Prieto Bioinformatics and Functional Genomics Research Group Cancer Research Center (CIC - IBMCC, CSIC / USAL) Salamanca, Spain De Las Rivas & Prieto - Tutorial ISMB/ECCB - June high-throughput data produce masive networks --The high-throughput proteomic techniques are producig a masive amount of new data and PPI networks are becaming quite complex.??? De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

2 --- QA and QC of PPI data --There is a clear concern in this research area about the large amount of false positives produced both by the high-throughput proteomic techniques but also by many low-throughput small scale experiments published that have not been properly reported or properly curated. Therefore, PPI data need a proper Quality Assesment (QA) and Quality Control (QC).??? De Las Rivas & Prieto - Tutorial ISMB/ECCB - June Protein-Protein Interactions (PPIs) --- QA and QC of PPI data --There is a clear concern in this research area about the large amount of false positives produced both by the high-throughput proteomic techniques but also by many low-throughput small scale experiments published that have not been properly reported or properly curated. Therefore, PPI data need a proper Quality Assesment (QA) and Quality Control (QC). Nature (2002) Mol. & Cell. Proteomics (2002) De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

3 Protein-Protein Interactions (PPIs) --- QA and QC of PPI data --- BMC Bioinformatics (2006) Bioinformatics (2006) Proteins (2006) Nat. Biotech. (2004) Many attempts to gain confidence 5 Protein-Protein Interactions (PPIs) --- QA and QC of PPI data --- Yu et al., Science (2008) completeness precision Cusick et al., Nature Methods (2009) New attempts to gain confidence Braun et al., Nature Methods (2009) based on experiments and unbiased HT De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

4 Protein-Protein Interactions (PPIs) --- QA and QC of PPI data --- Key problems within the protein-protein interaction networks (from PPI networks to the interactome) 1.- Still partial >>> incomplete 2.- Many false positives >>> noisy 3.- Many inconsistent annotations >>> inaccurate 4.- Lack of cell-type-specific assignment >>> mixed (essential for metazoa: human, mouse, etc) Key problems - PPI networks incomplete - noisy - inaccurate - mixed ERROR De Las Rivas & Prieto - Tutorial ISMB/ECCB - June Protein-Protein Interactions (PPIs) --- QA and QC of PPI data --- Key problems - PPI networks incomplete - noisy - inaccurate - mixed ERROR in 2008 known human proteome proteins (by intrg8, EBI) proteins involved in the human PPI from experimental data proteins (by APID, CIC) coverage 30.8 % Example of incomplete = Human PPI network De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

5 Protein-Protein Interactions (PPIs) --- QA and QC of PPI data --- Key problems - PPI networks incomplete - noisy - inaccurate - mixed ERROR Example of inaccurate = Lack of consistency between PPI DBs: BIND (Biomolecular Interaction Network Database) DIP (Database of Interacting Proteins) IntAct (Database system & analysis tools for PI data) De Las Rivas & Prieto - Tutorial ISMB/ECCB - June Protein-Protein Interactions (PPIs) --- improving PPI data --- We have developed a bioinformatic web platform to try to find consistency on the protein-protein interaction networks by unifiying well referenced experimental data. The tool is called APID and it allows full exploration of PPI information. APID (Agile Protein Interaction DataAnalizer) De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

6 Protein-Protein Interactions (PPIs) --- improving PPI data --- APID (Agile Protein Interaction DataAnalizer) BioGRID DIP HPRD MINT BIND IntAct At present 6 source PPI DBs were unified: BIND (Biomolecular Interaction Network DB) BioGRID (Biological Gral. Repository for Interaction Datasets) DIP (Database of Interacting Proteins) HPRD (Human Protein Reference Database) IntAct (Database system & analysis tools for PI data) MINT (Molecular Interactions Database) Data integration and unification by Sequence UniProt_ID PubMed_ID De Las Rivas & Prieto - Tutorial ISMB/ECCB - June Protein-Protein Interactions (PPIs) --- improving PPI data --- APID data integration and data unification by Sequence UniProt_ID PubMed_ID 12

7 Protein-Protein Interactions (PPIs) --- integration & unification of PPI data --- APID (Agile Protein Interaction DataAnalizer) ppi 22.5 % MINT 1017 ppi 1.75 % DIP ppi 40.9 % BioGRID ppi 63.3 % HPRD in 2008 human proteins proteins human interactions interactions BIND 9.66 % 5626 ppi IntAct 34.9 % ppi in 2007 human interactions interactions in 2008 human interactions interactions De Las Rivas & Prieto - Tutorial ISMB/ECCB - June integration & unification of PPI data --- Exploring the data of 2 major papers that applied high-throughput two-hybrid techniques to map the human interactome RUAL et al., Nature (2005) STELZL et al., Cell (2005) De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

8 --- integration & unification of PPI data --- Exploring the data of 2 major papers that applied high-throughput two-hybrid techniques to map the human interactome RUAL et al., Nature (2005) 1526 prot by Y2H (APID) STELZL et al., Cell (2005) 1665 prot by Y2H (APID) Only 294 proteins in common HUBs marked De Las Rivas & Prieto - Tutorial ISMB/ECCB - June integration & unification of PPI data --- Exploring the data of 2 major papers that applied high-throughput two-hybrid techniques to map the human interactome APID RUAL STELZL PA1B2 LRSM1 CDN1A ADT3 TCEA2 SGTA RGS2 RAB1A PP16A MARK3 CCD53 PSME3 PHC2 DVL3 PIN1 ZNF24 PRAF1 UBC9 MOB3 ARI2 MYST2 NRIF3 PA1B3 DAZP2 UT14A BAT3 P53 TRP13 ZBT16 SAT1 VIME EF1G UN119 MDFI 40 proteins most highly connected (>25 ppi) (for APID: include only the interactions by Y2H method) PSA1 NCK2 HSP7C EPS8 TTHY TBC17 De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

9 --- unified PPI data with Q-filters --In APID_2H: 1169 human proteins and 6237 pair-wise interactions (proven by 1 method) PPI network 1169 prot 6237 inter InterPro kinases 149 prot IPR IPR prot_kinase kinase_like GO-BP signal transduction 171 prot GO: unified PPI data with Q-filters --In APID_2H: 1169 human proteins and 1787 pair-wise interactions (proven by 2 methods) PPI network 1169 prot 1787 inter InterPro kinases 119 prot IPR IPR prot_kinase kinase_like GO-BP signal transduction 136 prot GO:7165 De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

10 --- unified PPI data with Q-filters method 2 methods 2 methods + ipfam indirect validation --- conclusions It is important to define well what is a protein-protein interaction (PPI) to avoid confusion with other types of biomolecular relationships or associations. 2.- PPI networks are scale free networks with different types of hubs that are critical nodes with specific topological characteristics. 3.- The two major high-throughput methods to find PPIs are: tandemaffinity purification and mass spectrometry (TAP-MS) and two-hybrid systems (2H). These methods provide quite different mapping of PPIs because TAP-MS is complex oriented but 2H is binary oriented. 4.- Due to the large proportion of false positives and the low overlapping of PPI data from diferent source databases, a major effort is needed in quality assessment (QA) and quality control (QC). Data integration and unification, including reference sets and scoring schemes is a good approach to achieve PPI data improvement. De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

11 --- conclusions The analyses of PPI networks allow to find clusters of nodes highly interconnected, that usually can be assigned to specific molecular machines like the proteasome. These clusters include a kind of critical nodes or hubs that have been called: intramolecular hubs. 6.- The analysis of PPI networks allow to find other type of critical nodes or hubs that are highly connected with many nodes that show low interconnection. These hubs can be usually assigned to molecular regulators included in pathways, like the signal transducers. These critical nodes that have been called: intermolecular hubs. De Las Rivas & Prieto - Tutorial ISMB/ECCB - June examples for the practicals of the tutorial A molecular machine: PROTEASOME, complex interaction and a clear subnetwork 2. A signaling pathway: NOTCH, integration of the pathway and the PPI network 3. Building a molecular machine: Pre-RIBOSOME, steps for its biogenesis and assemble De Las Rivas & Prieto - Tutorial ISMB/ECCB - June

Protein Protein Interactions (PPI) APID (Agile Protein Interaction DataAnalyzer)

Protein Protein Interactions (PPI) APID (Agile Protein Interaction DataAnalyzer) APID (Agile Protein Interaction DataAnalyzer) 23 APID (Agile Protein Interaction DataAnalyzer) Integrates and unifies 7 DBs: BIND, DIP, HPRD, IntAct, MINT, BioGRID. Includes 51,873 proteins 241,204 interactions

More information

A brief introduction to Cytoscape

A brief introduction to Cytoscape A brief introduction to Cytoscape Scientific day «Data mining of omics data» 5th Feb 2015 CGFB, Bordeaux yves.vandenbrouck@cea.fr (U1038, EDyP team, CEA-Grenoble) OUTLINES Concepts and context Cytoscape

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

0 value2. 3. Assign labels to the proteins in each interval of the ranked list

0 value2. 3. Assign labels to the proteins in each interval of the ranked list Ranked list of protein degrees in decreasing order 0 100 Proteins with few connections Proteins with many connections 1. Sort a random number between 80 and 98 (value1) 0 value1 2. Sort a random number

More information

How To Understand A Protein Network

How To Understand A Protein Network Graph-theoretical approaches for studying biological networks by Tijana Milenković Advancement Committee: Prof. Wayne Hayes Prof. Lan Huang Prof. Eric Mjolsness Prof. Zoran Nenadić Prof. Nataša Pržulj,

More information

CENG 734 Advanced Topics in Bioinformatics

CENG 734 Advanced Topics in Bioinformatics CENG 734 Advanced Topics in Bioinformatics Week 9 Text Mining for Bioinformatics: BioCreative II.5 Fall 2010-2011 Quiz #7 1. Draw the decompressed graph for the following graph summary 2. Describe the

More information

Prediction of Protein-Protein Interactions and Essential Genes Through Data Integration. Max Kotlyar

Prediction of Protein-Protein Interactions and Essential Genes Through Data Integration. Max Kotlyar Prediction of Protein-Protein Interactions and Essential Genes Through Data Integration by Max Kotlyar A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Graduate

More information

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge

More information

A Graph-Theoretic Analysis of the Human Protein-Interaction Network Using Multicore Parallel Algorithms

A Graph-Theoretic Analysis of the Human Protein-Interaction Network Using Multicore Parallel Algorithms A Graph-Theoretic Analysis of the Human Protein-Interaction Network Using Multicore Parallel Algorithms David A. Bader and Kamesh Madduri College of Computing Georgia Institute of Technology, Atlanta,

More information

國 立 交 通 大 學. 同 源 蛋 白 質 - 蛋 白 質 交 互 作 用 之 研 究 A Study of Homologous Protein-protein Interactions 生 物 科 技 學 系 博 士 論 文

國 立 交 通 大 學. 同 源 蛋 白 質 - 蛋 白 質 交 互 作 用 之 研 究 A Study of Homologous Protein-protein Interactions 生 物 科 技 學 系 博 士 論 文 國 立 交 通 大 學 生 物 科 技 學 系 博 士 論 文 同 源 蛋 白 質 - 蛋 白 質 交 互 作 用 之 研 究 A Study of Homologous Protein-protein Interactions 研 究 生 : 陳 俊 辰 Chen Chun-Chen 指 導 教 授 : 楊 進 木 博 士 Advisor: Yang Jinn-Moon, Ph. D. 中 華 民

More information

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34 Network Analysis BCH 5101: Analysis of -Omics Data 1/34 Network Analysis Graphs as a representation of networks Examples of genome-scale graphs Statistical properties of genome-scale graphs The search

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Tutorial for proteome data analysis using the Perseus software platform

Tutorial for proteome data analysis using the Perseus software platform Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information

More information

La Protéomique : Etat de l art et perspectives

La Protéomique : Etat de l art et perspectives La Protéomique : Etat de l art et perspectives Odile Schiltz Institut de Pharmacologie et de Biologie Structurale CNRS, Université de Toulouse, Odile.Schiltz@ipbs.fr Protéomique et Spectrométrie de Masse

More information

PPInterFinder A Web Server for Mining Human Protein Protein Interaction

PPInterFinder A Web Server for Mining Human Protein Protein Interaction PPInterFinder A Web Server for Mining Human Protein Protein Interaction Kalpana Raja, Suresh Subramani, Jeyakumar Natarajan Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar

More information

Network Analysis and System Biology with Omics Data. Zhenqiu Liu Samuel Oschin Comprehensive Cancer Institute Cedars Sinai Medical Center

Network Analysis and System Biology with Omics Data. Zhenqiu Liu Samuel Oschin Comprehensive Cancer Institute Cedars Sinai Medical Center Network Analysis and System Biology with Omics Data Zhenqiu Liu Samuel Oschin Comprehensive Cancer Institute Cedars Sinai Medical Center Outline System biology: core concepts and basic ideas Networks:

More information

Ingenuity Pathway Analysis (IPA )

Ingenuity Pathway Analysis (IPA ) ProductProfile Ingenuity Pathway Analysis (IPA ) For the analysis and interpretation of omics data IPA is a web-based software application for the analysis, integration, and interpretation of data derived

More information

EMBL Identity & Access Management

EMBL Identity & Access Management EMBL Identity & Access Management Rupert Lück EMBL Heidelberg e IRG Workshop Zürich Apr 24th 2008 Outline EMBL Overview Identity & Access Management for EMBL IT Requirements & Strategy Project Goal and

More information

Data, Measurements, Features

Data, Measurements, Features Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are

More information

More complete and more accurate interactomes for elucidating the mechanisms of complex diseases

More complete and more accurate interactomes for elucidating the mechanisms of complex diseases More complete and more accurate interactomes for elucidating the mechanisms of complex diseases Atanas Kamburov August 2011 Dissertation zur Erlangung des Grades eines Doktors der Naturwissenschaften (Dr.

More information

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF Tutorial for Proteomics Data Submission Katalin F. Medzihradszky Robert J. Chalkley UCSF Why Have Guidelines? Large-scale proteomics studies create huge amounts of data. It is impossible/impractical to

More information

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives Dirk.Repsilber@oru.se 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför

More information

Module 1. Sequence Formats and Retrieval. Charles Steward

Module 1. Sequence Formats and Retrieval. Charles Steward The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.

More information

Interaktionen von RNAs und Proteinen

Interaktionen von RNAs und Proteinen Sonja Prohaska Computational EvoDevo Universitaet Leipzig June 9, 2015 Studying RNA-protein interactions Given: target protein known to bind to RNA problem: find binding partners and binding sites experimental

More information

Life as a scientific database curator

Life as a scientific database curator Life as a scientific database curator Sandra Orchard EBI is an Outstation of the European Molecular Biology Laboratory. What is a database curator Curator OED - a keeper of a museum or other collection

More information

The EcoCyc Curation Process

The EcoCyc Curation Process The EcoCyc Curation Process Ingrid M. Keseler SRI International 1 HOW OFTEN IS THE GOLDEN GATE BRIDGE PAINTED? Many misconceptions exist about how often the Bridge is painted. Some say once every seven

More information

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) 820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

cansar: integrated cancer knowledgebase

cansar: integrated cancer knowledgebase in partnership with cansar: integrated cancer knowledgebase Bissan Al-Lazikani Cancer Research UK Cancer Therapeutics Unit 10 th Dec 2013 Sharing knowledge for drug discovery Resource to effectively integrate

More information

Novel Global Network Scores to Analyze Kinase Inhibitor Profiles

Novel Global Network Scores to Analyze Kinase Inhibitor Profiles The Fourth International Conference on Computational Systems Biology (ISB2010) Suzhou, China, September 9 11, 2010 Copyright 2010 ORSC & APORC, pp. 305 313 Novel Global Network Scores to Analyze Kinase

More information

Healthcare Analytics. Aryya Gangopadhyay UMBC

Healthcare Analytics. Aryya Gangopadhyay UMBC Healthcare Analytics Aryya Gangopadhyay UMBC Two of many projects Integrated network approach to personalized medicine Multidimensional and multimodal Dynamic Analyze interactions HealthMask Need for sharing

More information

Visualizing Networks: Cytoscape. Prat Thiru

Visualizing Networks: Cytoscape. Prat Thiru Visualizing Networks: Cytoscape Prat Thiru Outline Introduction to Networks Network Basics Visualization Inferences Cytoscape Demo 2 Why (Biological) Networks? 3 Networks: An Integrative Approach Zvelebil,

More information

Graph Theory and Networks in Biology

Graph Theory and Networks in Biology Graph Theory and Networks in Biology Oliver Mason and Mark Verwoerd March 14, 2006 Abstract In this paper, we present a survey of the use of graph theoretical techniques in Biology. In particular, we discuss

More information

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16 Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems

More information

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation Identification of rheumatoid arthritis and osterthritis patients by transcriptome-based rule set generation Bering Limited Report generated on September 19, 2014 Contents 1 Dataset summary 2 1.1 Project

More information

BIOINFORMATICS METHODS AND APPLICATIONS

BIOINFORMATICS METHODS AND APPLICATIONS FACULTY of ENGINEERING SCHOOL OF COMPUTER SCIENCE AND ENGINEERING BINF3010/9010 BIOINFORMATICS METHODS AND APPLICATIONS SESSION 1, 2015 Course staff Course Convener: Bruno Gaëta bgaeta@unsw.edu.au School

More information

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Focusing on results not data comprehensive data analysis for targeted next generation sequencing Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes

More information

Learning Predictive Models of Gene Dynamics. Nick Jones, Sumeet Agarwal

Learning Predictive Models of Gene Dynamics. Nick Jones, Sumeet Agarwal Learning Predictive Models of Gene Dynamics Nick Jones, Sumeet Agarwal An Aim of Systems Biology How about predicting what cells are going to do? Quantitatively predict the response of cells to multiple

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources 1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools

More information

ProteinQuest user guide

ProteinQuest user guide ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for

More information

Aiping Lu. Key Laboratory of System Biology Chinese Academic Society APLV@sibs.ac.cn

Aiping Lu. Key Laboratory of System Biology Chinese Academic Society APLV@sibs.ac.cn Aiping Lu Key Laboratory of System Biology Chinese Academic Society APLV@sibs.ac.cn Proteome and Proteomics PROTEin complement expressed by genome Marc Wilkins Electrophoresis. 1995. 16(7):1090-4. proteomics

More information

Genome Viewing. Module 2. Using Genome Browsers to View Annotation of the Human Genome

Genome Viewing. Module 2. Using Genome Browsers to View Annotation of the Human Genome Module 2 Genome Viewing Using Genome Browsers to View Annotation of the Human Genome Bert Overduin, Ph.D. PANDA Coordination & Outreach EMBL - European Bioinformatics Institute Wellcome Trust Genome Campus

More information

Quantitative proteomics background

Quantitative proteomics background Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran

More information

Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments

Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Mario Cannataro, Pietro Hiram Guzzi, Tommaso Mazza, and Pierangelo Veltri University Magna Græcia of Catanzaro, 88100

More information

Review Article A Review of Integration Strategies to Support Gene Regulatory Network Construction

Review Article A Review of Integration Strategies to Support Gene Regulatory Network Construction The Scientific World Journal Volume 2012, Article ID 435257, 12 pages doi:10.1100/2012/435257 The cientificworldjournal Review Article A Review of Integration Strategies to Support Gene Regulatory Network

More information

Data Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov

Data Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov Data Integration Lectures 16 & 17 Lectures Outline Goals for Data Integration Homogeneous data integration time series data (Filkov et al. 2002) Heterogeneous data integration microarray + sequence microarray

More information

Extraction and Visualization of Protein-Protein Interactions from PubMed

Extraction and Visualization of Protein-Protein Interactions from PubMed Extraction and Visualization of Protein-Protein Interactions from PubMed Ulf Leser Knowledge Management in Bioinformatics Humboldt-Universität Berlin Finding Relevant Knowledge Find information about Much

More information

Proteome Data Integration: Characteristics and Challenges

Proteome Data Integration: Characteristics and Challenges Proteome Data Integration: Characteristics and Challenges K. Belhajjame 1, S.M. Embury 1, H. Fan 2, C. Goble 1, H. Hermjakob 4, S.J. Hubbard 1, D. Jones 3, P. Jones 4, N. Martin 2, S. Oliver 1, C. Orengo

More information

Contents. Page 1 of 11

Contents. Page 1 of 11 Programme-specific Section of the Curriculum for the MSc Programme in Bioinformatics at the Faculty of Science, University of Copenhagen 2009 (Rev. 2015) Contents 1 Title, affiliation and language... 2

More information

NCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013

NCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013 NCBI resources III: GEO and ftp site Yanbin Yin Spring 2013 1 Homework assignment 2 Search colon cancer at GEO and find a data Series and perform a GEO2R analysis Write a report (in word or ppt) to include

More information

Molecular Bioinformatics for Diseases

Molecular Bioinformatics for Diseases PSB 2009 Molecular Bioinformatics for Diseases Tutorial Atul Butte, Maricel Kann, Yves Lussier Yanay Ofran, Predrag Radivojac January 5, 2009 Who and where we are Yves Atul Pedja Maricel Yanay All of us

More information

Introduction to Proteomics

Introduction to Proteomics Introduction to Proteomics Why Proteomics? Same Genome Different Proteome Black Swallowtail - larvae and butterfly Biological Complexity Yeast - a simple proteome 6,113 proteins = 344,855 tryptic peptides

More information

Frequently Asked Questions (FAQ)

Frequently Asked Questions (FAQ) Frequently Asked Questions (FAQ) Why screen your (therapeutic) antibody for cross-reactivity? Cross-reactivity of therapeutic antibodies leads to adverse effects and might render the antibody unsuitable

More information

CellNOpt: a brief overview

CellNOpt: a brief overview CellNOpt: a brief overview Thomas Cokelaer Aidan MacNamara Julio Saez-Rodriguez European Bioinformatics Institute Hinxton (Cambridge) U.K www.ebi.ac.uk/saezrodriguez Lausanne, Switzerland, April 2014 Outline

More information

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of

More information

Predicting gene functions from multiple biological sources using novel ensemble methods. Chandan K. Reddy* and Mohammad S. Aziz

Predicting gene functions from multiple biological sources using novel ensemble methods. Chandan K. Reddy* and Mohammad S. Aziz 184 Int. J. Data Mining and Bioinformatics, Vol. 12, No. 2, 2015 Predicting gene functions from multiple biological sources using novel ensemble methods Chandan K. Reddy* and Mohammad S. Aziz Department

More information

Graph Theory Approaches to Protein Interaction Data Analysis

Graph Theory Approaches to Protein Interaction Data Analysis Graph Theory Approaches to Protein Interaction Data Analysis Nataša Pržulj Technical Report 322/04 Department of Computer Science, University of Toronto Completed on September 8, 2003 Report issued on

More information

FACULTY OF MEDICAL SCIENCE

FACULTY OF MEDICAL SCIENCE Doctor of Philosophy in Biochemistry FACULTY OF MEDICAL SCIENCE Naresuan University 73 Doctor of Philosophy in Biochemistry The Biochemistry Department at Naresuan University is a leader in lower northern

More information

Genetomic Promototypes

Genetomic Promototypes Genetomic Promototypes Mirkó Palla and Dana Pe er Department of Mechanical Engineering Clarkson University Potsdam, New York and Department of Genetics Harvard Medical School 77 Avenue Louis Pasteur Boston,

More information

Gene expression analysis. Ulf Leser and Karin Zimmermann

Gene expression analysis. Ulf Leser and Karin Zimmermann Gene expression analysis Ulf Leser and Karin Zimmermann Ulf Leser: Bioinformatics, Wintersemester 2010/2011 1 Last lecture What are microarrays? - Biomolecular devices measuring the transcriptome of a

More information

Heat Shock Partially Dissociates the Overlapping Modules of the Yeast Protein-Protein Interaction Network: A Systems Level Model of Adaptation

Heat Shock Partially Dissociates the Overlapping Modules of the Yeast Protein-Protein Interaction Network: A Systems Level Model of Adaptation Heat Shock Partially Dissociates the Overlapping Modules of the Yeast Protein-Protein Interaction Network: A Systems Level Model of Adaptation Ágoston Mihalik, Peter Csermely* Department of Medical Chemistry,

More information

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing James D. Jackson Philip J. Hatcher Department of Computer Science Kingsbury Hall University of New Hampshire Durham,

More information

BIOINFORMATICS Supporting competencies for the pharma industry

BIOINFORMATICS Supporting competencies for the pharma industry BIOINFORMATICS Supporting competencies for the pharma industry ABOUT QFAB QFAB is a bioinformatics service provider based in Brisbane, Australia operating nationwide and internationally. QFAB was established

More information

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing

More information

Carlos III Networked Proteomics Platform

Carlos III Networked Proteomics Platform Carlos III Networked Proteomics Platform Carlos III Networked Proteomics Platform, ProteoRed-ISCIII is a National Network for the coordination, integration and development of the Spanish Proteomics Facilities

More information

Kinexus has an in-house inventory of lysates prepared from 16 human cancer cell lines that have been selected to represent a diversity of tissues,

Kinexus has an in-house inventory of lysates prepared from 16 human cancer cell lines that have been selected to represent a diversity of tissues, Kinexus Bioinformatics Corporation is seeking to map and monitor the molecular communications networks of living cells for biomedical research into the diagnosis, prognosis and treatment of human diseases.

More information

Journal of Biology. BioMed Central

Journal of Biology. BioMed Central Journal of Biology BioMed Central Open Access Research article Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae Teresa Reguly *, Ashton Breitkreutz *, Lorrie

More information

Understanding the dynamics and function of cellular networks

Understanding the dynamics and function of cellular networks Understanding the dynamics and function of cellular networks Cells are complex systems functionally diverse elements diverse interactions that form networks signal transduction-, gene regulatory-, metabolic-

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

GeneProf and the new GeneProf Web Services

GeneProf and the new GeneProf Web Services GeneProf and the new GeneProf Web Services Florian Halbritter florian.halbritter@ed.ac.uk Stem Cell Bioinformatics Group (Simon R. Tomlinson) simon.tomlinson@ed.ac.uk December 10, 2012 Florian Halbritter

More information

Information and Data Sharing Policy* Genomics:GTL Program

Information and Data Sharing Policy* Genomics:GTL Program Appendix 1 Information and Data Sharing Policy* Genomics:GTL Program Office of Biological and Environmental Research Office of Science Department of Energy Appendix 1 Final Date: April 4, 2008 Introduction

More information

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:

More information

DISCOVERY TOOL FOR GENOME-WIDE ASSOCIATION STUDIES

DISCOVERY TOOL FOR GENOME-WIDE ASSOCIATION STUDIES IPINBPA: AN INTEGRATIVE NETWORK-BASED FUNCTIONAL MODULE DISCOVERY TOOL FOR GENOME-WIDE ASSOCIATION STUDIES LILI WANG School of Computing, Queen s University 25 Union Street, Goodwin Hall, Kingston, Ontario,

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES THE NEW YORK GENOME CENTER NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. capabilities. N E X T- G E

More information

Global and Discovery Proteomics Lecture Agenda

Global and Discovery Proteomics Lecture Agenda Global and Discovery Proteomics Christine A. Jelinek, Ph.D. Johns Hopkins University School of Medicine Department of Pharmacology and Molecular Sciences Middle Atlantic Mass Spectrometry Laboratory Global

More information

Processing Genome Data using Scalable Database Technology. My Background

Processing Genome Data using Scalable Database Technology. My Background Johann Christoph Freytag, Ph.D. freytag@dbis.informatik.hu-berlin.de http://www.dbis.informatik.hu-berlin.de Stanford University, February 2004 PhD @ Harvard Univ. Visiting Scientist, Microsoft Res. (2002)

More information

Systems Biology through Data Analysis and Simulation

Systems Biology through Data Analysis and Simulation Biomolecular Networks Initiative Systems Biology through Data Analysis and Simulation William Cannon Computational Biosciences 5/30/03 Cellular Dynamics Microbial Cell Dynamics Data Mining Nitrate NARX

More information

Dr Alexander Henzing

Dr Alexander Henzing Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander

More information

Importance of Statistics in creating high dimensional data

Importance of Statistics in creating high dimensional data Importance of Statistics in creating high dimensional data Hemant K. Tiwari, PhD Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham History of Genomic Data

More information

How To Understand Protein-Protein Interaction And Inhibitors

How To Understand Protein-Protein Interaction And Inhibitors Protein-Protein Interactions and Inhibitors Alan Naylor Independent Consultant Optibrium Consultants Meeting Cambridge 27 th November 2012 Why PPI inhibitors? PPIs are involved in many biological / disease

More information

Data Integration and Knowledge Management within Oncotyrol A. Dander, R. Gallasch, S. Pabinger, H. Fiegl, Z. Trajanoski

Data Integration and Knowledge Management within Oncotyrol A. Dander, R. Gallasch, S. Pabinger, H. Fiegl, Z. Trajanoski Data Integration and Knowledge Management within Oncotyrol A. Dander, R. Gallasch, S. Pabinger, H. Fiegl, Z. Trajanoski Oncotyrol, Center for Personalized Cancer Medicine, Innsbruck, Austria Biocenter,

More information

Proteomics Standard Group

Proteomics Standard Group Proteomics Standard Group Bioinformatique et protéomique Montbonnot 1er juin 2006 yves.vandenbrouck@cea.fr 1 Proteomics : the state of play The volume of generated proteome data is rapidly increasing Movement

More information

Bioinformatics Grid - Enabled Tools For Biologists.

Bioinformatics Grid - Enabled Tools For Biologists. Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis

More information

Data Integration and Decision-Making For Biomarkers Discovery, Validation and Evaluation. D. POLVERARI, CTO October 06-07 2008

Data Integration and Decision-Making For Biomarkers Discovery, Validation and Evaluation. D. POLVERARI, CTO October 06-07 2008 Data Integration and Decision-Making For Biomarkers Discovery, Validation and Evaluation D. POLVERARI, CTO October 06-07 2008 Data integration definition and aims Definition : Data integration consists

More information

Linear Sequence Analysis. 3-D Structure Analysis

Linear Sequence Analysis. 3-D Structure Analysis Linear Sequence Analysis What can you learn from a (single) protein sequence? Calculate it s physical properties Molecular weight (MW), isoelectric point (pi), amino acid content, hydropathy (hydrophilic

More information

European Molecular Biology Laboratory Case Example

European Molecular Biology Laboratory Case Example European Molecular Biology Laboratory Case Example Dr. Silke Schumacher Director International Relations EMBL Member States Austria 1974 Denmark 1974 France 1974 Germany 1974 Israel 1974 Italy 1974 Netherlands

More information

PeptidomicsDB: a new platform for sharing MS/MS data.

PeptidomicsDB: a new platform for sharing MS/MS data. PeptidomicsDB: a new platform for sharing MS/MS data. Federica Viti, Ivan Merelli, Dario Di Silvestre, Pietro Brunetti, Luciano Milanesi, Pierluigi Mauri NETTAB2010 Napoli, 01/12/2010 Mass Spectrometry

More information

APPLICATION OF DATA MINING IN BIOINFORMATICS

APPLICATION OF DATA MINING IN BIOINFORMATICS APPLICATION OF DATA MINING IN BIOINFORMATICS KHALID RAZA Centre for Theoretical Physics, Jamia Millia Islamia, New Delhi-110025, India Abstract This article highlights some of the basic concepts of bioinformatics

More information

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis WHITE PAPER By InSyBio Ltd Konstantinos Theofilatos Bioinformatician, PhD InSyBio Technical Sales Manager August

More information

Denominazione insegnamento in italiano Denominazione insegnamento in inglese Tipologia dell esame (scritto- scritto/orale orale)

Denominazione insegnamento in italiano Denominazione insegnamento in inglese Tipologia dell esame (scritto- scritto/orale orale) Biochimica Cellulare II Cellular Biochemistry II e COMPREHENSION OF MECHANISMS REGULATING CELL CYCLE AND CELL PROLIFERATION IN EUKARYOTES. EVOLUTION OF MOLECULAR CIRCUITS THAT REGULATE GROWTH AND CELL

More information

Supplementary Information accompanying with the manuscript titled:

Supplementary Information accompanying with the manuscript titled: Supplementary Information accompanying with the manuscript titled: Tethering preferences of domain families cooccurring in multi domain proteins Smita Mohanty, Mansi Purvar, Naryanswamy Srinivasan* and

More information

Analysis of the colorectal tumor microenvironment using integrative bioinformatic tools

Analysis of the colorectal tumor microenvironment using integrative bioinformatic tools MLECNIK Bernhard & BINDEA Gabriela Analysis of the colorectal tumor microenvironment using integrative bioinformatic tools INSERM U872, Jérôme Galon Team15: Integrative Cancer Immunology Cordeliers Research

More information

Weill Graduate School of Medical Sciences of Cornell University Program in Pharmacology. Graduate School Curriculum

Weill Graduate School of Medical Sciences of Cornell University Program in Pharmacology. Graduate School Curriculum Weill Graduate School of Medical Sciences of Cornell University Program in Pharmacology Graduate School Curriculum Weill Graduate School of Medical Sciences Partnership between Weill Medical College and

More information

1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM)

1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM) 1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM) 2. Gene regulation tools and methods Regulatory sequences and motif discovery TF binding sites, microrna target prediction

More information

Cloud-Based Big Data Analytics in Bioinformatics

Cloud-Based Big Data Analytics in Bioinformatics Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large

More information

Analysis of ChIP-seq data in Galaxy

Analysis of ChIP-seq data in Galaxy Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers

More information

Data mining with Mascot Integra ASMS 2005

Data mining with Mascot Integra ASMS 2005 Data mining with Mascot Integra 1 What is Mascot Integra? Fully functional out-the-box solution for proteomics workflow and data management Support for all the major mass-spectrometry data systems Powered

More information

Exercise with Gene Ontology - Cytoscape - BiNGO

Exercise with Gene Ontology - Cytoscape - BiNGO Exercise with Gene Ontology - Cytoscape - BiNGO This practical has material extracted from http://www.cbs.dtu.dk/chipcourse/exercises/ex_go/goexercise11.php In this exercise we will analyze microarray

More information

Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics

Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics Christopher Benner, PhD Director, Integrative Genomics and Bioinformatics Core (IGC) idash Webinar,

More information