Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha

Similar documents
Bioinformatics: Network Analysis

Analyzing the Facebook graph?

Understanding the dynamics and function of cellular networks

Healthcare Analytics. Aryya Gangopadhyay UMBC

AP Biology Essential Knowledge Student Diagnostic

Graph Theory Approaches to Protein Interaction Data Analysis

Name Class Date. binomial nomenclature. MAIN IDEA: Linnaeus developed the scientific naming system still used today.

Cellular Respiration: Practice Questions #1

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

A discussion of Statistical Mechanics of Complex Networks P. Part I

1. Introduction Gene regulation Genomics and genome analyses Hidden markov model (HMM)

Introduction to Networks and Business Intelligence

DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS

Graph Theory and Networks in Biology

Structural constraints in complex networks

High Throughput Network Analysis

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34

Enzymes: Practice Questions #1

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations

Complex Networks Analysis: Clustering Methods

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display

The Shape of the Network. The Shape of the Internet. Why study topology? Internet topologies. Early work. More on topologies..

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

Social and Economic Networks: Lecture 1, Networks?

Graph theoretic approach to analyze amino acid network

Statistical mechanics of complex networks

Effects of node buffer and capacity on network traffic

Graph Mining and Social Network Analysis

Self similarity of complex networks & hidden metric spaces

MINFS544: Business Network Data Analytics and Applications

Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures

Localised Sex, Contingency and Mutator Genes. Bacterial Genetics as a Metaphor for Computing Systems

DECENTRALIZED SCALE-FREE NETWORK CONSTRUCTION AND LOAD BALANCING IN MASSIVE MULTIUSER VIRTUAL ENVIRONMENTS

Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Application of Graph-based Data Mining to Metabolic Pathways

Statistical Mechanics of Complex Networks

Enzymes. A. a lipid B. a protein C. a carbohydrate D. a mineral

Name Date Period. Keystone Review Enzymes

Dmitri Krioukov CAIDA/UCSD

A CONTENT STANDARD IS NOT MET UNLESS APPLICABLE CHARACTERISTICS OF SCIENCE ARE ALSO ADDRESSED AT THE SAME TIME.

AP BIOLOGY 2015 SCORING GUIDELINES

Disease Re-classification via Integration of Biological Networks

Replication Study Guide

Metabolic Network Analysis

Metabolic network analysis: Towards the construction of a meaningful network

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

Online Appendix to Social Network Formation and Strategic Interaction in Large Networks

An Alternative Web Search Strategy? Abstract

How To Understand A Protein Network

Some questions... Graphs

The E. coli Insulin Factory

Towards Modelling The Internet Topology The Interactive Growth Model

Dynamics of Biological Systems

Seventh Grade Science Content Standards and Objectives

Chapter 25: The History of Life on Earth

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Two Forms of Energy

Genetics Lecture Notes Lectures 1 2

XII. Biology, Grade 10

A MULTI-MODEL DOCKING EXPERIMENT OF DYNAMIC SOCIAL NETWORK SIMULATIONS ABSTRACT

Biology: Foundation Edition Miller/Levine 2010

Strategies for Optimizing Public Train Transport Networks in China: Under a Viewpoint of Complex Networks

Option 1: empirical network analysis. Task: find data, analyze data (and visualize it), then interpret.

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

A mixture model for random graphs

pencil. Vocabulary: 1. Reactant 2. Product 3. Activation energy 4. Catalyst 5. substrate 6. Chemical reaction Keep your textbooks when you are done

Ms. Campbell Protein Synthesis Practice Questions Regents L.E.

KNOWLEDGE NETWORK SYSTEM APPROACH TO THE KNOWLEDGE MANAGEMENT

Cellular Respiration Worksheet What are the 3 phases of the cellular respiration process? Glycolysis, Krebs Cycle, Electron Transport Chain.

transcription networks

Endocrine System: Practice Questions #1

Feed Forward Loops in Biological Systems

The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth

ModelingandSimulationofthe OpenSourceSoftware Community

green B 1 ) into a single unit to model the substrate in this reaction. enzyme

Applying Social Network Analysis to the Information in CVS Repositories

Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging

BCOR101 Midterm II Wednesday, October 26, 2005

Catalysis by Enzymes. Enzyme A protein that acts as a catalyst for a biochemical reaction.

Chapter 2 Polar Covalent Bonds; Acids and Bases

An Overview of Cells and Cell Research

Quick Hit Activity Using UIL Science Contests For Formative and Summative Assessments of Pre-AP and AP Biology Students

Complex networks theory for analyzing metabolic networks

10.1 The function of Digestion pg. 402

GENERATING AN ASSORTATIVE NETWORK WITH A GIVEN DEGREE DISTRIBUTION

Laboratory 5: Properties of Enzymes

Name: Date: Period: DNA Unit: DNA Webquest

Experimental Comparison of Symbolic Learning Programs for the Classification of Gene Network Topology Models

Lecture Objectives: Why study microbiology? What is microbiology? Roots of microbiology

A box-covering algorithm for fractal scaling in scale-free networks

Basic Scientific Principles that All Students Should Know Upon Entering Medical and Dental School at McGill

Distance Degree Sequences for Network Analysis

Load balancing in a heterogeneous computer system by self-organizing Kohonen network

KEY CONCEPT Organisms can be classified based on physical similarities. binomial nomenclature

Protein Protein Interaction Networks

Antibiotics: The difference between prokaryotic and eukaryotic cells, Biology AA, Teacher Leslie Hadaway, New lesson, Science

CHAPTER 6 GRIFFITH/HERSHEY/CHASE: DNA IS THE GENETIC MATERIAL IDENTIFICATION OF DNA DNA AND HEREDITY DNA CAN GENETICALLY TRANSFORM CELLS

How To Understand The Network Of A Network

Cellular Energy. 1. Photosynthesis is carried out by which of the following?

Sampling Biases in IP Topology Measurements

Transcription:

Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha

Complex network in cell Cellular processes generating mass & energy, transfering information, specifying cell fate Integrated through complex network of several constituents and reactions Can we look at such networks and learn something about biology and evolution?

Key findings Metabolic networks analyzed Compared among 43 species from all three domains of life (archaea, bacteria, eukarya) Noticed the same topologic scaling properties across all species Metabolic organization identical for living organisms, and is robust May extrapolate to other cellular networks

Introduction Fundamental design principles of cellular networks? Example: Dynamic interactions of various constituents impart robustness to cellular processes If one reaction did not happen optimally, does not necessarily mess up the whole process

Introduction Constituents of networks: DNA, RNA, proteins, small molecules High throughput biology has helped develop databases of networks, e.g., metabolic networks Maps are extremely complex Fundamental features of network topology?

Network models

Erdos-Renyi random graph Start with a fixed number of nodes, no edges Each pair of nodes connected by an edge with probability p This leads to a statistically homogeneous graph or network Most nodes have same degree

http://arxiv.org/pdf/cond-mat/0010278

Erdos-Renyi random graph Degree distribution is Poisson with strong peak at mean <k> Therefore, probability of finding a highly connected node decays exponentially

http://arxiv.org/pdf/cond-mat/0010278

Empirical graphs World-wide web, internet, social networks have been studied Serious deviations from random structure of E-R model Better described by scale-free networks

Scale-free networks Degree distribution P(k) follows power law distribution P(k) ~ k -x Scale-free networks are extremely heterogeneous: a few highly connected nodes (hubs) rest of the (less connected) nodes connect to hubs Generated by a process where new nodes are preferentially attached to already highdegree nodes

http://arxiv.org/pdf/cond-mat/0010278

What does this tell us? Difference between Erdos-Renyi and scale-free graphs arise from simple principles of how the graphs were created Therefore, understanding topological properties can tell us how the cellular networks were created

Data

Metabolic networks Core metabolic network of 43 different organisms (WIT database) 6 archaea, 32 bacteria, 5 eukarya Nodes = substrates, edges = metabolic reactions, additional nodes = enzymes Based on firmly established data from biochemical literature Sufficient data for statistical analysis

Example of a metabolic network http://www.avatar.se/strbio2001/metabolic/what.html Green boxes: known enzymes

Results

Topology Is the topology described by E-R model or by scale-free model? Observed that degree distribution follows a power law. Therefore, scalefree networks

http://arxiv.org/pdf/cond-mat/0010278 A. fulgidus E. coli C. elegans Average

Small-world property General feature of many complex networks: any two nodes can be connected by relatively short paths In metabolic network, a path is the biological pathway connecting two substrates Characterized by network diameter Shortest path, over all pairs of nodes

Small-world property For non-biological networks, the average degree is usually fixed This implies that network diameter increases logarithmically with new nodes being added Is this true of metabolic networks? That is, more complex bacterium (more substrates and enzymes) will have larger diameter?

Small-world property More complex bacterium (more substrates and enzymes) will have larger diameter? Observed: diameter same across all 43 species! A possible explanation: average degree must be higher for more complex organisms This is also verified.

http://arxiv.org/pdf/cond-mat/0010278 network diameter for different organisms average degree over different organisms

Hubs in network Power-law connectivity implies that a few hub nodes dominate the overall connectivity Sequential removal of hubs => diameter rises sharply Observed: metabolic networks show this phenomenon too

http://arxiv.org/pdf/cond-mat/0010278 Diameter after removing M substrates

Hubs in network At the same time, scale-free networks are robust to random errors In metabolic network, removal of randomly chosen substrates did not affect average distance between remaining nodes Fault tolerance to removal of metabolic enzymes also demonstrated through biological experiments

Hubs across networks Do the same substrates act as hubs in all organisms? Rank all substrates by their degrees Ranking of the top substrates is practically same across all species For every substrate present in all species, compute rank r (in terms of degree) in each species

Hubs across networks Compute mean <r> and standard deviation σ r of rank of each substrate Observed: σ r increases with <r> The top-ranking nodes (hubs) have relatively little variance across species

http://arxiv.org/pdf/cond-mat/0010278

Summary Other biological networks also hypothesized to be scale-free. Evolutionary selection of a robust and error tolerant architecture