1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM)



Similar documents
Understanding the dynamics and function of cellular networks

Bioinformatics: Network Analysis

Feed Forward Loops in Biological Systems

Visualizing Networks: Cytoscape. Prat Thiru

transcription networks

Protein Protein Interaction Networks

Healthcare Analytics. Aryya Gangopadhyay UMBC

Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures

Distance Degree Sequences for Network Analysis

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

A Primer of Genome Science THIRD

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34

Business Intelligence and Process Modelling

Complex Networks Analysis: Clustering Methods

Graph Theory and Networks in Biology

Qualitative modeling of biological systems

Quantitative proteomics background

Data Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov

Application of Graph-based Data Mining to Metabolic Pathways

Genetomic Promototypes

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

How To Cluster Of Complex Systems

Social Media Mining. Network Measures

T cell Epitope Prediction

A MEASURE OF GLOBAL EFFICIENCY IN NETWORKS. Aysun Aytac 1, Betul Atay 2. Faculty of Science Ege University 35100, Bornova, Izmir, TURKEY

Bioinformatics Grid - Enabled Tools For Biologists.

Social Networks and Social Media

Boolean Network Models

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

Pathway Analysis : An Introduction

Master's projects at ITMO University. Daniil Chivilikhin PhD ITMO University

Unraveling protein networks with Power Graph Analysis

Social Media Mining. Graph Essentials

Introduction to Networks and Business Intelligence

IC05 Introduction on Networks &Visualization Nov

Structural constraints in complex networks

Cluster detection algorithm in neural networks

Organization of Complex Networks

Dmitri Krioukov CAIDA/UCSD

AP Biology Essential Knowledge Student Diagnostic

Multiprotocol Label Switching (MPLS)

Graph theoretic approach to analyze amino acid network

Dynamic Routing Protocols II OSPF. Distance Vector vs. Link State Routing

Hidden Markov Models in Bioinformatics. By Máthé Zoltán Kőrösi Zoltán 2006

Integrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

The Basics of Graphical Models

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis

Lecture 2.1 : The Distributed Bellman-Ford Algorithm. Lecture 2.2 : The Destination Sequenced Distance Vector (DSDV) protocol

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Statistical and computational challenges in networks and cybersecurity

A discussion of Statistical Mechanics of Complex Networks P. Part I

Machine Learning.

DeCyder Extended Data Analysis (EDA) Software

Bayesian networks - Time-series models - Apache Spark & Scala

Routing in packet-switching networks

Network Load Balancing Using Ant Colony Optimization

JustClust User Manual

Social Network Mining

Core Bioinformatics. Degree Type Year Semester Bioinformàtica/Bioinformatics OB 0 1

WORKSHOP ON TOPOLOGY AND ABSTRACT ALGEBRA FOR BIOMEDICINE

Bioinformatics: Network Analysis

Load balancing Static Load Balancing

A role of microrna in the regulation of telomerase? Yuan Ming Yeh, Pei Rong Huang, and Tzu Chien V. Wang

Option 1: empirical network analysis. Task: find data, analyze data (and visualize it), then interpret.

Social and Economic Networks: Lecture 1, Networks?

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis

Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG)

How To Understand The Network Of A Network

Analysis of Illumina Gene Expression Microarray Data

Qualitative analysis of regulatory networks

A General Framework for Weighted Gene Co-expression Network Analysis

Metabolic Network Analysis

Dynamic Network Analyzer Building a Framework for the Graph-theoretic Analysis of Dynamic Networks

Data, Measurements, Features

SCAN: A Structural Clustering Algorithm for Networks

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

Strategies for Optimizing Public Train Transport Networks in China: Under a Viewpoint of Complex Networks

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

DECENTRALIZED SCALE-FREE NETWORK CONSTRUCTION AND LOAD BALANCING IN MASSIVE MULTIUSER VIRTUAL ENVIRONMENTS

Tutorial for proteome data analysis using the Perseus software platform

Chapter 6: Biological Networks

Transcription:

1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM) 2. Gene regulation tools and methods Regulatory sequences and motif discovery TF binding sites, microrna target prediction 3. Technologies Microarrays Deep sequencing (RNAseq) 4. Clustering Unsupervized clustering (HCA,K means, PCA, SOM) Supervized clustering (classification) 5. Gene ontology, Pathways Databases, tools Over representation and enrichment analysis 6. Biomolecular networks Network analysis and characteristics 1

Reverse Engineering Input Temporal series of data Input Temporal series of data Reverse Engineering System Modeling Reverse Engineering Considerations Inferential versus predictive power Parameters are static Genes and encoded proteins build a unit Instantenous versus synchronous model Model have to be stable Solutions Boolean networks Differential equations Bayesian networks (conditional independence) 2

Boolean networks for sensitivity/robustness analysis CellNetAnalyzer Klemt A et al. BMC Systems Biology. 2007 Co expression network analysis 3

Similarity measures Correlation coefficient Mutual information Gene association network MICO Discretizing expression profiles groups of genes with identical profile REVEAL algorithm based on Mutual information Pparg Correlation Apmap Bogner Strauss et al. Cell Mol Life Sci. 2010 4

Adjacency function unweighted weighted Network measures Connectivity (degree) Clustering coefficient 5

Path length and network diameter A path is a sequence {x 1, x 2,, x n } such that (x 1,x 2 ), (x 2,x 3 ),, (x n 1,x n ) are edges of the graph. A closed path x n =x 1 on a graph is called a graph cycle or circuit. Shortest path between nodes Longest shortest path Different network representation 6

Different levels of networks Small world network Every node can be reached from every other by a small number of hops or steps High clustering coefficient and low mean shortest path length (random graphs don t necessarily have high clustering coefficients) Social networks, the Internet, and biological networks all exhibit small world network characteristics Six degrees of separation 7

Six degrees of separation Pilgrim Experiment Random people from Nebraska were to send a letter (via intermediaries) to a stock broker in Boston. Could only send to someone with whom they were on a first name basis Kevin Bacon Game Connect any actor to Kevin Bacon, by linking actors who have acted in the same movie Erdös Number Number of links required to connect scholars to Erdős, via co authorship of papers Kevin Bacon Game Kevin Bacon Mystic River (2003) Tim Robbins Code 46 (2003) Om Puri Yuva (2004) Rani Mukherjee Black (2005) Amitabh Bachchan 8

Scale free network follows a power law Many nodes with few connections Probability distribution few nodes with high connectivity clustering coefficient connectivity Scale free networks are robust Complex systems (cell, internet, social networks), are resilient to component failure Network topology plays an important role in this robustness (even if ~80% of nodes fail, the remaining ~20% still maintain network connectivity) Attack vulnerability if hubs are selectively targeted Essential genes/proteins tend to be hubs Cellular networks are assortative, hubs tend not to interact directly with other hubs. 9

Complex network models Scale free network Modular networks Metabolic networks We have seen that the cellular functionality can be partitioned into a collection of modules. Each module is a discrete entity of several elementary components which perform an identifiable task Modular network (Ci>) But, it was demonstrated that the degree distribution follows a power law Scale free network (power law) Modular network Scale free network Hierarchical network Ravasz et al.science. 2002 10

Hierarchical networks Network Power law Modules (C i ) Network motifs 11

Negative and positive autoregulation NAR PAR b a c NAR speeds up the response time of gene circuits NAR can reduce cell cell variation in protein levels PAR works in the opposite way Feedforward loop (FFL) Supress short signals Coherent and incoherent FFL 12