Generating Biological Networks: A Software Tool
|
|
- Philomena Francis
- 7 years ago
- Views:
Transcription
1 Generating Biological Networks: A Software Tool Brian Mulhall July 27, 2012 Abstract The study of biological networks and the representation of those networks as graphs has created an opportunity to better fill the needs of researchers who may possess limited experience creating software but nonetheless require software created for a specific task. A problem we have identified as being worthwhile to address is in the need to standardize the set of graphs used by researchers in wide range of network science applications. Researchers trying to produce a set of consistent and comparable results require a software tool that can generate a graph tailored to their specific needs in both an efficient and accurate manner. This consistency can be achieved by creating a software that tool that allows the end user a degree of control over the networks used in their particular application. The approach taken to solve this problem is to sift through numerous graph generating algorithms and try to find or modify an existing algorithm to best suit our stated purpose. The Erdos-Renyi, Watts-Strogatz, Barabasi- Albert, Klemm-Euguilez, Newman and a number of algorithms based on the configuration model were all considered, but only three of these were chosen to implement and then conduct trials on. The efficacy of this approach was tested by running trials on all three algorithms and comparing the anticipated results to the actual results that were observed using the available network analysis software. The algorithm that initially appears to be the most promising is the Degree algorithm. The results that have been obtained to this point, while preliminary in nature give a good deal of insight into which of the three algorithms chosen are most suitable to the ambitious goal of controlling all four stated properties simultaneously. 1
2 1 Introduction This section will briefly define the properties of biological networks that have been targeted as those that should be reproducible in a random graph created by this software tool. The three algorithms that have been implemented will also be described in detail in the following section. This will serve as a means of orientating the reader to the methods and goals of this project. 1.1 Biological Networks Biological processes are best captured when they are represented in form of complex networks such as protein-protein inter-action and metabolic pathway networks. The study of biological networks, their modeling, analysis, and visualization are important tasks in life science today.[1] An understanding of these networks is essential in order to further the related work in fields ranging form social sciences to electrical engineering. The focus of this paper will be limited to biological networks and making sense of much of the data that is now being generated by the study of such networks. The increasing importance of biological networks is also evidenced by the expansion in the number of analytical tools and the quantifiable metrics used in network science, this fact coupled with the rise in publications and researchers working in the field adds credence to suggestion this area of research has a rising level of importance. This topic remains a rich area of research due to the fact that most biological networks are still far from being complete and it is difficult to fully understand the complexity of the relationships and the peculiarities of the data at the present time.[1] The generation of biological networks that are flexible enough to be used in a wide range of applications in the field of network science is the purpose of the research that we have done up to this point. We have concluded that in order to be useful these generated networks need to exhibit a set of properties that have been deemed critical to being truly representative of a real world network. One such property is the small world effect, which is the name given to the finding that the average distance between vertices in a network is small usually scaling logarithmically with the total number of vertices. Another common property is the power law degree distributions that many networks possess. This type of distribution is found in many real world networks and has been empirically found to have an exponent between 2 and 3. A power law degree distribution can be spotted with a linear best fit 2
3 curve when the degree of each node is placed on a log-log plot.the degree of a vertex in a network is the number of other vertices to which it is connected, and one finds that there are typically many vertices in a network with low degree and a small number with high degree, the precise distribution often following a power-law (heavy tailed).[3] A third property that many networks have in common is clustering, or network transitivity, which is the property that two vertices that are both neighbours of the same third vertex have a heightened probability of also being neighbours of one another. The clustering coefficient is one on a fully connected graph and has typical values in the range of 0.1 to 0.5 in many real-world networks.[3] Many real networks are observed to be fundamentally modular, meaning that the network can be seamlessly partitioned into a collection of modules.[4] The community structure of a any given network is also of importance to biological research. Many networks have been observed to possess subsets of vertices within which connections are dense, but between which connections are less dense.[3] The scale-free property existing simultaneously with the networks potential modularity creates a challenge for any random graph generation technique.in order to account for the coexistence of modularity, local clustering and scale-free topology in real systems, we have to assume that clusters combine in an iterative manner. It is possible that the communities themselves also join together to form larger and larger communities still, and that those larger communities are themselves joined together, and so on in a hierarchical structure. Being able to identify these communities could help us to understand these networks more effectively and therefore it is necessary to incorporate this property into any software tool which has our stated purpose.[3] 1.2 Klemm-Eguilez Model The Klemm-Eguilez graph generating model combines the three properties that are associated with being truly representative of real world networks.[3] This makes this model appealing for the purpose of generating networks that can be tuned by the user to exhibit the desired properties to varying degrees. The algorithm itself incorporates elements of preferential attachment which increases the likelihood of there being some nodes which have a higher degree thus leading to what is referred to as a scale free network (a network with a power law degree distribution). The mean geodesic distance is kept low along with a relatively high clustering coefficient. These properties can be theoretically tuned by altering the input parameter m, which can be seen 3
4 in the pseudo code listed below. The details of how the algorithm functions and what the algorithm offers us as we try to create a software tool that satisfies our stated purpose will be given below. The algorithm employs a list of active nodes in order to ensure a preferential attachment model. This allows for fine tuning of this approach so that the small-world and scale-free topology of the Klemm-Egulez network is maintained. It begins with the creation of a fully connected network of a size that is specified as an input parameter. The remaining nodes in the network are introduced sequentially along with edges that total in number equal to that initial network size. This allows that average degree of nodes to be controlled also by the user. This active node list is biased toward containing nodes with higher degrees. A given parameter mu, is the probability with which new edges are connected to non-active nodes. When new nodes are added to the network, each new edge is connected from the new node to either a node in the list of active nodes or with probability mu, to a randomly selected deactive node. The new node is added to the list of active nodes, and one node is then randomly chosen, with probability equal to the input parameter pd, for removal from the active node list. This choice is biased ability pd is biased toward nodes with a lower degree, so that the nodes with ability pd the highest degree are less likely to be chosen for removal. The combination of preferential attachment with the random selection of nodes outside the active nodes for new edges and the final step of edge switching which creates short cuts between any given node and a randomly selected node outside of its neighbourhood allows for a power law degree distribution as well as the small world property.[3] pd = the probability a node will be deactivated kj = the degree of node j //First create the fully connected initial network, and add its nodes to active nodes For: all nodes i = 1m_ Add: i to Active_Nodes For: all nodes j = i+1m If: Undirected Create a reciprocal edge between node i and node j Else: Create a directed edge from node i to node j 4
5 Create a directed edge from node j to node i EndFor EndFor //Second, iteratively, connect remaining nodes to active nodes or a random node with probability m, then remove one active node For: all nodes i = m + 1N For: all nodes j in Active_Nodes Set: Chance = a randomly chosen continuous variable between 0 and 1 If: m > Chance or the set of Deactivated_Nodes = 0 If: Undirected Create a reciprocal edge between i and node j Else: Create a directed edge from node i to node j Create a directed edge from node j to node i Else: Set: Connected = false While: Connected == false Set: node j = randomly chosen from the set of all Deactivated_Nodes Set: Chance = a uniform random number between 0 Set: E = sum(degrees of all Deactivated_Nodes) / E > Chance If: kj If: Undirected Create a reciprocal edge between i and node j Else: Create a directed edge from node i to node j Create a directed edge from node j to node i Set: Connected = true EndWhile 5
6 //Replace an active node with node i. Active nodes with lower degrees are more likely to be replaced. Set: node i as an Active_Node While: node j is not chosen Set: j = uniformly randomly chosen node from Active_Nodes kj ) kj ) / sum(1 / kj = (1 / kj Set: pd Set: Chance = a uniform random number between 0 and 1 > Chance If: pd Set: node j as chosen Remove: node j from Active_Nodes EndWhile EndFor //Third, if directed, use d to set undirected equivalency For: all nodes i = 1N For: all nodes j = the set of all nodes adjacent to node i Set: Chance = a uniform random variable between 0 and 1 If: d > Chance Set: node h = uniformly randomly chosen from the set of all nodes, excluding i and nodes adjacent to i Disconnect: directed edge from node i to node j Connect: directed edge from node i to node h EndFor EndFor Mathematical verification The undirected network of Klemm and Egulez (2002b) 1.3 Newman Algorithm The Newman algorithm generates a graph with tunable clustering coefficient and given degree sequence. This algorithm is a modification of the configuration which wont be discussed in this paper but is the foundation of two of the three algorithms used by this software tool. In this algorithm, for 6
7 each node v, the number of triangles it is part of and the number of single edges incident upon it that do not participate in any triangle are given as input parameters.the sum of all triangles in the graph in its entirety is a multiple of 3 and the sum of all single edges in the graph is a multiple of 2. As the algorithm is based on the configuration model, each node v can be seen as consisting of a given number of stubs and corners. Then a triangle is created by selecting three corners and the three corresponding nodes at random and joining them. This process is repeated until each corner is part of a unique triangle. An edge is created by selecting two stubs and two corresponding nodes at random and creating an edge between them. This process is repeated until each stub is part of a unique edge.[6] As we choose unconnected stubs or corners to form edges or triangles, we keep track of the number of unconnected stubs and corners for each node selected node. The number of unconnected stubs at a particular node is the difference between the initially given single edges and the current single edges at that node. It is also known as the residual single edge degree. Similarly, the number of unconnected corners at a node is the difference between the initially given triangle degree and the current triangle degree at that node. It is known as the residual triangle degree. Conf-1(n, s, t) 1 input: number of nodes n, single edge and triangle degree sequence arrays s and t 2 output: graph (V,E), where V is a set of nodes and E a is set of edges 3 V {1, 2,... n} 4 E 5 rtt 6 rss 7 T total number of trianlges to create 8 M total number of single edges to create 9 while T > 0 do Choose three distinct nodes u, v, w 11 if rt[u] > 0, rt[v] > 0 and rt[w] > 0 12 then E E {(u, v), (v, w), (u, w)} 13 rt[u]rt[u]1 14 rt[v]rt[v] 1 7
8 15 rt[w]rt[w] 1 16 T T 1 17 while M > 0 18 do Choose two distinct nodes u, v from 1 to n 19 if rs[u] > 0 and rs[v] > 0 20 then E E {(u, v)} 21 rs[u]rs[u]1 22 rs[v]rs[v] 1 23 M M 1 24 return (V,E) Figure 7.1: Algorithm Conf Degree Algorithm The Degree algorithm generates a random graph with a degree sequence and clustering coefficient given as input parameters. It takes the number of nodes, the degree sequence and the clustering coefficient as input and generates a simple random graph with these properties. The next step in the algorithm, which is outlined below in pseudo code, is to estimate the number of triangles needed to achieve the given clustering coefficient and then create those triangles. It is done by selecting three stubs at random and joining them to create a triangle much like the Newman algorithm which was described in the previous section. Once the estimated number of triangles are created, a pair of stubs are selected independently at random and joined to create additional edges. We also avoid any self-loops by discarding any iteration that does not produce distinct nodes for creating edges and triangles.[6] For creating a triangle, this algorithm chooses three distinct nodes with a probability proportional to their residual degree. If these three nodes have residual degrees greater than zero than a triangle is created and there residual degrees are updated appropriately as well a the remaining number of triangles and single edges left to create, otherwise the node selection is ignored. This process is repeated until either there no longer remain any triangles or single edges to create or the counter that is keeping track of the total number of iterations is exceeded and the algorithm is terminated short of the number of triangles and edges needed to create. 8
9 Deg(n,d,C) 1 input: number of nodes n, degree sequence array d and target clustering coefficient C 2 output: graph (V,E), where V is set of nodes and E is set of edges 3 V {1, 2,..., n} 4 E 5 M the numner of single ednges to create 6 rd d 7 T the number of trianlges to create 8 while T > 0 9 do Choose three distinct nodes with probability proportional to their residual degree 13 Check rd[u], rd[v] and rd[w]are greater than zero 14 Add edges needed to create a triangle between u,v and w 15 update rd[u], rd[v], rd[w], T and M appropriately. 16 T T 1 17 while M > 0 do Choose two distinct nodes with probability proportional to their residual degree then E E {(u, v)} 20 rd[u]rd[u]1 21 rd[v]rd[v] 1 22 M M 1 2 Methodology The experimental results were gathered by testing the software tool s ability to generate graphs with the required properties and how close those property values were to the ones that had been specified by the user or by using a 9
10 set of analytically determined values if there is not direct analogue between the input parameters and the property values. These trials were done in order to gain some insight into the tools efficacy and efficiency. The tests included timed trials during the construction of networks of various sizes. The networks were all generated with an expectation of a particular clustering coefficient as well as a particular degree distribution and the analytical results generated by Cytoscape (a software tool used for network analysis) were used to cross reference the empirical results with the anticipated results. The results of these tests would be used to off insight into which of the implemented algorithms is most appropriate to solve the stated problem of creating a set of standardized biological networks. 3 Experimental Results Degree Algorithm Table 1: Clustering Coefficient Initial Triangle Degree Anticipated Clustering Coefficient Newman Algorithm Table 2: Newman Algorithm Clustering Coefficient, Numbers of Nodes = 1000 and the initial single edge degree is for all nodes = 0 4 Discussion The results that have been obtained to this point while preliminary can offer some insight into which of the three algorithms that were chosen for implementation are best suited for creating networks that have tunable global properties. The Degree algorithm is an efficient algorithm being capable of creating graphs of large size in relatively short amount of time. This coupled with its performance when comparing the anticipated clustering coefficient to what was observed empirically as well being able to accept a degree sequence as an input parameter which guarantees a power law degree distribution if that is so desired by the user. 10
11 The Newman algorithm was also able to generate results efficiently and accurately when compared to the set of analytically determined clustering coefficient. The input triangle degree sequences were chosen carefully in order to make it possible to compare the resulting clustering coefficients to those calculated by hand. This does not lend itself to the practical use of this software but a more appropriate test was not devised at this time. The Klemm-Eugilez algorithm has a much more convoluted set of parameters that control the output. This makes it difficult to control any of the global properties. This algorithm seemed at first to have promise as it appeared able to offer control over the global properties of the network but at this point it now appears to have limited use in the formation of a viable solution to creating networks. Their are portions of code form the Klemm-Euguilez algorithm that may be used in a hybrid algorithm that could better preserve the small world properties of the network. The edge swapping portion of code can be appended to the existing Degree algorithm and the Newman algorithm and some short cuts can be made between nodes that otherwise have no edges between them. The use of an active and deactive list in the algorithm is an interesting approach to incorporating preferential attachment, to facilitate the creation of a nodes with a substantially higher degree. 5 Conclusion The algorithms that have been implemented up to this point in time have been optimized to run as efficiently as possible and a set of preliminary results were gathered for the production of this manuscript. The point that this research is at right now is the beginning stages of what is needed to be done in order to achieve a software tool that will be able to be used by researchers across the field of network science. The goal of creating a tool that can allow the user to tune both the clustering coefficient and the degree distribution, while maintaining a small world network with a hierarchical structure has not been achieved and is requires significant work to realize. The work that has been done up until now is a preliminary step towards that stated goal. 6 Future Work The continuation of this work would include the creation of an effective means of translating the set of properties that are interesting an understood 11
12 by a potential end user and the input parameters that are accepted by the algorithms that have been implemented to date. The following stages after that challenge has been overcome is the analysis of the generated graphs to verify that those graphs are indeed being created to reflect those set of inputs. Acknowledgements Brian Mulhall conducted this research as part of the BIOGRID initiatives in the Department of Computer Science and Engineering at the University of Connecticut. It is supported by the National Science Foundation REU grant OCI We are grateful to our mentor Saad Quadar for all his patience, guidance, and support throughout this project and to Professor Chun-Hsi Huang for giving us the opportunity to participate in this project. References [1] C. Bachmaier, Biological Networks [2] C. Herrera and P. J. Zufiria, Generating Scale-free Networks with Adjustable Clustering Coefficient Via Random Walks, pp [3] Girvan2002 M. Girvan and M. E. J. Newman, Community structure in social and biological networks., Proceedings of the National Academy of Sciences of the United States of America, vol. 99, no. 12, pp , Jun [4] S. Wuchty, The Architecture of Biological Networks, pp [5] B. J. Prettejohn, M. J. Berryman, and M. D. McDonnell, Methods for generating complex networks with selected structural properties for simulations: a review and tutorial for neuroscientists., Frontiers in computational neuroscience, vol. 5, no. March, p. 11, Jan [6] G. DAngelo and S. Ferretti, Simulation of scale-free networks, Proceedings of the Second International ICST Conference on Simulation Tools and Techniques, [7] N. Parikh, Generating Random Graphs with Tunable Clustering Coefficient Generating Random Graphs with Tunable Clustering Coefficient,
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA natarajan.meghanathan@jsums.edu
More informationChapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks
Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Imre Varga Abstract In this paper I propose a novel method to model real online social networks where the growing
More informationBig Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network
, pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and
More informationBioinformatics: Network Analysis
Bioinformatics: Network Analysis Graph-theoretic Properties of Biological Networks COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 Outline Architectural features Motifs, modules,
More informationModelingandSimulationofthe OpenSourceSoftware Community
ModelingandSimulationofthe OpenSourceSoftware Community Yongqin Gao, GregMadey Departmentof ComputerScience and Engineering University ofnotre Dame ygao,gmadey@nd.edu Vince Freeh Department of ComputerScience
More informationProtein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
More informationGraph theoretic approach to analyze amino acid network
Int. J. Adv. Appl. Math. and Mech. 2(3) (2015) 31-37 (ISSN: 2347-2529) Journal homepage: www.ijaamm.com International Journal of Advances in Applied Mathematics and Mechanics Graph theoretic approach to
More informationRandom graphs and complex networks
Random graphs and complex networks Remco van der Hofstad Honours Class, spring 2008 Complex networks Figure 2 Ye a s t p ro te in in te ra c tio n n e tw o rk. A m a p o f p ro tein p ro tein in tera c
More informationComplex Networks Analysis: Clustering Methods
Complex Networks Analysis: Clustering Methods Nikolai Nefedov Spring 2013 ISI ETH Zurich nefedov@isi.ee.ethz.ch 1 Outline Purpose to give an overview of modern graph-clustering methods and their applications
More informationTemporal Dynamics of Scale-Free Networks
Temporal Dynamics of Scale-Free Networks Erez Shmueli, Yaniv Altshuler, and Alex Sandy Pentland MIT Media Lab {shmueli,yanival,sandy}@media.mit.edu Abstract. Many social, biological, and technological
More informationIntroduction to Networks and Business Intelligence
Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 17th, 2015 Outline Network Science A Random History Network Analysis Network Topological
More informationDegree distribution in random Apollonian networks structures
Degree distribution in random Apollonian networks structures Alexis Darrasse joint work with Michèle Soria ALÉA 2007 Plan 1 Introduction 2 Properties of real-life graphs Distinctive properties Existing
More informationThe Open University s repository of research publications and other research outputs
Open Research Online The Open University s repository of research publications and other research outputs Online survey for collective clustering of computer generated architectural floor plans Conference
More informationNetwork Analysis. BCH 5101: Analysis of -Omics Data 1/34
Network Analysis BCH 5101: Analysis of -Omics Data 1/34 Network Analysis Graphs as a representation of networks Examples of genome-scale graphs Statistical properties of genome-scale graphs The search
More informationGraph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003
Graph models for the Web and the Internet Elias Koutsoupias University of Athens and UCLA Crete, July 2003 Outline of the lecture Small world phenomenon The shape of the Web graph Searching and navigation
More informationSmall-World Characteristics of Internet Topologies and Implications on Multicast Scaling
Small-World Characteristics of Internet Topologies and Implications on Multicast Scaling Shudong Jin Department of Electrical Engineering and Computer Science, Case Western Reserve University Cleveland,
More informationGraph Theory and Networks in Biology
Graph Theory and Networks in Biology Oliver Mason and Mark Verwoerd March 14, 2006 Abstract In this paper, we present a survey of the use of graph theoretical techniques in Biology. In particular, we discuss
More informationComplex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu
More informationGeneral Network Analysis: Graph-theoretic. COMP572 Fall 2009
General Network Analysis: Graph-theoretic Techniques COMP572 Fall 2009 Networks (aka Graphs) A network is a set of vertices, or nodes, and edges that connect pairs of vertices Example: a network with 5
More informationGreedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures
Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures Dmitri Krioukov, kc claffy, and Kevin Fall CAIDA/UCSD, and Intel Research, Berkeley Problem High-level Routing is
More informationCurrent Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary
Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:
More informationBig Ideas in Mathematics
Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards
More informationGenerating Hierarchically Modular Networks via Link Switching
Generating Hierarchically Modular Networks via Link Switching Susan Khor ABSTRACT This paper introduces a method to generate hierarchically modular networks with prescribed node degree list by link switching.
More informationPerformance Metrics for Graph Mining Tasks
Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics Supervised Learning Performance Metrics Unsupervised Learning Performance Metrics Optimizing Metrics Statistical
More informationSocial Media Mining. Network Measures
Klout Measures and Metrics 22 Why Do We Need Measures? Who are the central figures (influential individuals) in the network? What interaction patterns are common in friends? Who are the like-minded users
More informationNetwork (Tree) Topology Inference Based on Prüfer Sequence
Network (Tree) Topology Inference Based on Prüfer Sequence C. Vanniarajan and Kamala Krithivasan Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai 600036 vanniarajanc@hcl.in,
More informationNETZCOPE - a tool to analyze and display complex R&D collaboration networks
The Task Concepts from Spectral Graph Theory EU R&D Network Analysis Netzcope Screenshots NETZCOPE - a tool to analyze and display complex R&D collaboration networks L. Streit & O. Strogan BiBoS, Univ.
More informationDATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
More informationGraphs over Time Densification Laws, Shrinking Diameters and Possible Explanations
Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU 1 Introduction What can we do with graphs? What patterns
More informationThe Topology of Large-Scale Engineering Problem-Solving Networks
The Topology of Large-Scale Engineering Problem-Solving Networks by Dan Braha 1, 2 and Yaneer Bar-Yam 2, 3 1 Faculty of Engineering Sciences Ben-Gurion University, P.O.Box 653 Beer-Sheva 84105, Israel
More informationNetwork/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?
What is a Network? Network/Graph Theory Network = graph Informally a graph is a set of nodes joined by a set of lines or arrows. 1 1 2 3 2 3 4 5 6 4 5 6 Graph-based representations Representing a problem
More informationAnalyzing the Facebook graph?
Logistics Big Data Algorithmic Introduction Prof. Yuval Shavitt Contact: shavitt@eng.tau.ac.il Final grade: 4 6 home assignments (will try to include programing assignments as well): 2% Exam 8% Big Data
More informationCluster detection algorithm in neural networks
Cluster detection algorithm in neural networks David Meunier and Hélène Paugam-Moisy Institute for Cognitive Science, UMR CNRS 5015 67, boulevard Pinel F-69675 BRON - France E-mail: {dmeunier,hpaugam}@isc.cnrs.fr
More informationAn Alternative Web Search Strategy? Abstract
An Alternative Web Search Strategy? V.-H. Winterer, Rechenzentrum Universität Freiburg (Dated: November 2007) Abstract We propose an alternative Web search strategy taking advantage of the knowledge on
More informationEvaluation of a New Method for Measuring the Internet Degree Distribution: Simulation Results
Evaluation of a New Method for Measuring the Internet Distribution: Simulation Results Christophe Crespelle and Fabien Tarissan LIP6 CNRS and Université Pierre et Marie Curie Paris 6 4 avenue du président
More informationStrong and Weak Ties
Strong and Weak Ties Web Science (VU) (707.000) Elisabeth Lex KTI, TU Graz April 11, 2016 Elisabeth Lex (KTI, TU Graz) Networks April 11, 2016 1 / 66 Outline 1 Repetition 2 Strong and Weak Ties 3 General
More informationSPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH NATIONAL UNIVERSITY OF SINGAPORE
SPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH NATIONAL UNIVERSITY OF SINGAPORE 2012 SPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH (M.Sc., SFU, Russia) A THESIS
More informationPart 2: Community Detection
Chapter 8: Graph Data Part 2: Community Detection Based on Leskovec, Rajaraman, Ullman 2014: Mining of Massive Datasets Big Data Management and Analytics Outline Community Detection - Social networks -
More informationMetabolic Network Analysis
Metabolic Network nalysis Overview -- modelling chemical reaction networks -- Levels of modelling Lecture II: Modelling chemical reaction networks dr. Sander Hille shille@math.leidenuniv.nl http://www.math.leidenuniv.nl/~shille
More informationHow To Find Influence Between Two Concepts In A Network
2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation Influence Discovery in Semantic Networks: An Initial Approach Marcello Trovati and Ovidiu Bagdasar School of Computing
More informationA Review And Evaluations Of Shortest Path Algorithms
A Review And Evaluations Of Shortest Path Algorithms Kairanbay Magzhan, Hajar Mat Jani Abstract: Nowadays, in computer networks, the routing is based on the shortest path problem. This will help in minimizing
More informationHow To Cluster Of Complex Systems
Entropy based Graph Clustering: Application to Biological and Social Networks Edward C Kenley Young-Rae Cho Department of Computer Science Baylor University Complex Systems Definition Dynamically evolving
More informationOpen Source Software Developer and Project Networks
Open Source Software Developer and Project Networks Matthew Van Antwerp and Greg Madey University of Notre Dame {mvanantw,gmadey}@cse.nd.edu Abstract. This paper outlines complex network concepts and how
More informationGraph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis
Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.0, steen@cs.vu.nl Chapter 06: Network analysis Version: April 8, 04 / 3 Contents Chapter
More informationGraph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis. Contents. Introduction. Maarten van Steen. Version: April 28, 2014
Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R.0, steen@cs.vu.nl Chapter 0: Version: April 8, 0 / Contents Chapter Description 0: Introduction
More informationJoan Paola Cruz, Camilo Olaya
A SYSTEM DYNAMICS MODEL FOR STUDYING THE STRUCTURE OF NETWORK MARKETING ORGANIZATIONS Joan Paola Cruz, Camilo Olaya Universidad de los Andes, Dpto. de Ingeniería Industrial Carrera 1 N 18A 10 Bogotá, Colombia
More information1. Write the number of the left-hand item next to the item on the right that corresponds to it.
1. Write the number of the left-hand item next to the item on the right that corresponds to it. 1. Stanford prison experiment 2. Friendster 3. neuron 4. router 5. tipping 6. small worlds 7. job-hunting
More informationSocial Media Mining. Graph Essentials
Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures
More informationThe mathematics of networks
The mathematics of networks M. E. J. Newman Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI 48109 1040 In much of economic theory it is assumed that economic agents interact,
More informationB490 Mining the Big Data. 2 Clustering
B490 Mining the Big Data 2 Clustering Qin Zhang 1-1 Motivations Group together similar documents/webpages/images/people/proteins/products One of the most important problems in machine learning, pattern
More informationGraph Theory Approaches to Protein Interaction Data Analysis
Graph Theory Approaches to Protein Interaction Data Analysis Nataša Pržulj Technical Report 322/04 Department of Computer Science, University of Toronto Completed on September 8, 2003 Report issued on
More informationA discussion of Statistical Mechanics of Complex Networks P. Part I
A discussion of Statistical Mechanics of Complex Networks Part I Review of Modern Physics, Vol. 74, 2002 Small Word Networks Clustering Coefficient Scale-Free Networks Erdös-Rényi model cover only parts
More informationA mixture model for random graphs
A mixture model for random graphs J-J Daudin, F. Picard, S. Robin robin@inapg.inra.fr UMR INA-PG / ENGREF / INRA, Paris Mathématique et Informatique Appliquées Examples of networks. Social: Biological:
More informationDistance Degree Sequences for Network Analysis
Universität Konstanz Computer & Information Science Algorithmics Group 15 Mar 2005 based on Palmer, Gibbons, and Faloutsos: ANF A Fast and Scalable Tool for Data Mining in Massive Graphs, SIGKDD 02. Motivation
More informationEfficient Crawling of Community Structures in Online Social Networks
Efficient Crawling of Community Structures in Online Social Networks Network Architectures and Services PVM 2011-071 Efficient Crawling of Community Structures in Online Social Networks For the degree
More informationRandom Map Generator v1.0 User s Guide
Random Map Generator v1.0 User s Guide Jonathan Teutenberg 2003 1 Map Generation Overview...4 1.1 Command Line...4 1.2 Operation Flow...4 2 Map Initialisation...5 2.1 Initialisation Parameters...5 -w xxxxxxx...5
More informationAn Empirical Study of Two MIS Algorithms
An Empirical Study of Two MIS Algorithms Email: Tushar Bisht and Kishore Kothapalli International Institute of Information Technology, Hyderabad Hyderabad, Andhra Pradesh, India 32. tushar.bisht@research.iiit.ac.in,
More informationAlgebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard
Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express
More informationDistributed Dynamic Load Balancing for Iterative-Stencil Applications
Distributed Dynamic Load Balancing for Iterative-Stencil Applications G. Dethier 1, P. Marchot 2 and P.A. de Marneffe 1 1 EECS Department, University of Liege, Belgium 2 Chemical Engineering Department,
More informationVisualization and Modeling of Structural Features of a Large Organizational Email Network
Visualization and Modeling of Structural Features of a Large Organizational Email Network Benjamin H. Sims Statistical Sciences (CCS-6) Email: bsims@lanl.gov Nikolai Sinitsyn Physics of Condensed Matter
More informationwith functions, expressions and equations which follow in units 3 and 4.
Grade 8 Overview View unit yearlong overview here The unit design was created in line with the areas of focus for grade 8 Mathematics as identified by the Common Core State Standards and the PARCC Model
More informationSubgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro
Subgraph Patterns: Network Motifs and Graphlets Pedro Ribeiro Analyzing Complex Networks We have been talking about extracting information from networks Some possible tasks: General Patterns Ex: scale-free,
More informationAsking Hard Graph Questions. Paul Burkhardt. February 3, 2014
Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)
More informationCreating, Solving, and Graphing Systems of Linear Equations and Linear Inequalities
Algebra 1, Quarter 2, Unit 2.1 Creating, Solving, and Graphing Systems of Linear Equations and Linear Inequalities Overview Number of instructional days: 15 (1 day = 45 60 minutes) Content to be learned
More informationDmitri Krioukov CAIDA/UCSD
Hyperbolic geometry of complex networks Dmitri Krioukov CAIDA/UCSD dima@caida.org F. Papadopoulos, M. Boguñá, A. Vahdat, and kc claffy Complex networks Technological Internet Transportation Power grid
More informationEnterprise Organization and Communication Network
Enterprise Organization and Communication Network Hideyuki Mizuta IBM Tokyo Research Laboratory 1623-14, Shimotsuruma, Yamato-shi Kanagawa-ken 242-8502, Japan E-mail: e28193@jp.ibm.com Fusashi Nakamura
More informationMODEL SELECTION FOR SOCIAL NETWORKS USING GRAPHLETS
MODEL SELECTION FOR SOCIAL NETWORKS USING GRAPHLETS JEANNETTE JANSSEN, MATT HURSHMAN, AND NAUZER KALYANIWALLA Abstract. Several network models have been proposed to explain the link structure observed
More informationGraph/Network Visualization
Graph/Network Visualization Data model: graph structures (relations, knowledge) and networks. Applications: Telecommunication systems, Internet and WWW, Retailers distribution networks knowledge representation
More informationA QuestionPro Publication
How to effectively conduct an online survey A QuestionPro Publication Steps in Preparing an Online Questionnaire How to Effectively Conduct an Online Survey By: Vivek Bhaskaran Co-Founder Survey Analytics
More information! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)
E6893 Big Data Analytics Lecture 10: Linked Big Data Graph Computing (II) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and
More informationAn approach of detecting structure emergence of regional complex network of entrepreneurs: simulation experiment of college student start-ups
An approach of detecting structure emergence of regional complex network of entrepreneurs: simulation experiment of college student start-ups Abstract Yan Shen 1, Bao Wu 2* 3 1 Hangzhou Normal University,
More informationEffects of node buffer and capacity on network traffic
Chin. Phys. B Vol. 21, No. 9 (212) 9892 Effects of node buffer and capacity on network traffic Ling Xiang( 凌 翔 ) a), Hu Mao-Bin( 胡 茂 彬 ) b), and Ding Jian-Xun( 丁 建 勋 ) a) a) School of Transportation Engineering,
More informationPROBABILITY SECOND EDITION
PROBABILITY SECOND EDITION Table of Contents How to Use This Series........................................... v Foreword..................................................... vi Basics 1. Probability All
More informationNetwork Analysis For Sustainability Management
Network Analysis For Sustainability Management 1 Cátia Vaz 1º Summer Course in E4SD Outline Motivation Networks representation Structural network analysis Behavior network analysis 2 Networks Over the
More informationThe Basics of FEA Procedure
CHAPTER 2 The Basics of FEA Procedure 2.1 Introduction This chapter discusses the spring element, especially for the purpose of introducing various concepts involved in use of the FEA technique. A spring
More informationBusiness Intelligence and Process Modelling
Business Intelligence and Process Modelling F.W. Takes Universiteit Leiden Lecture 7: Network Analytics & Process Modelling Introduction BIPM Lecture 7: Network Analytics & Process Modelling Introduction
More informationA Software Architecture for a Photonic Network Planning Tool
A Software Architecture for a Photonic Network Planning Tool Volker Feil, Jan Späth University of Stuttgart, Institute of Communication Networks and Computer Engineering Pfaffenwaldring 47, D-70569 Stuttgart
More informationAn Analysis of Social Network-Based Sybil Defenses
An Analysis of Social Network-Based Sybil Defenses ABSTRACT Bimal Viswanath MPI-SWS bviswana@mpi-sws.org Krishna P. Gummadi MPI-SWS gummadi@mpi-sws.org Recently, there has been much excitement in the research
More informationUSE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS
USE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA natarajan.meghanathan@jsums.edu ABSTRACT This
More informationFault Analysis in Software with the Data Interaction of Classes
, pp.189-196 http://dx.doi.org/10.14257/ijsia.2015.9.9.17 Fault Analysis in Software with the Data Interaction of Classes Yan Xiaobo 1 and Wang Yichen 2 1 Science & Technology on Reliability & Environmental
More informationFast Sequential Summation Algorithms Using Augmented Data Structures
Fast Sequential Summation Algorithms Using Augmented Data Structures Vadim Stadnik vadim.stadnik@gmail.com Abstract This paper provides an introduction to the design of augmented data structures that offer
More informationMining Social Network Graphs
Mining Social Network Graphs Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata November 13, 17, 2014 Social Network No introduc+on required Really? We s7ll need to understand
More informationCommon Core State Standards for Mathematics Accelerated 7th Grade
A Correlation of 2013 To the to the Introduction This document demonstrates how Mathematics Accelerated Grade 7, 2013, meets the. Correlation references are to the pages within the Student Edition. Meeting
More informationDynamic Network Analyzer Building a Framework for the Graph-theoretic Analysis of Dynamic Networks
Dynamic Network Analyzer Building a Framework for the Graph-theoretic Analysis of Dynamic Networks Benjamin Schiller and Thorsten Strufe P2P Networks - TU Darmstadt [schiller, strufe][at]cs.tu-darmstadt.de
More informationCharacter Image Patterns as Big Data
22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,
More informationWhy? A central concept in Computer Science. Algorithms are ubiquitous.
Analysis of Algorithms: A Brief Introduction Why? A central concept in Computer Science. Algorithms are ubiquitous. Using the Internet (sending email, transferring files, use of search engines, online
More informationPerformance Level Descriptors Grade 6 Mathematics
Performance Level Descriptors Grade 6 Mathematics Multiplying and Dividing with Fractions 6.NS.1-2 Grade 6 Math : Sub-Claim A The student solves problems involving the Major Content for grade/course with
More informationSome questions... Graphs
Uni Innsbruck Informatik - 1 Uni Innsbruck Informatik - 2 Some questions... Peer-to to-peer Systems Analysis of unstructured P2P systems How scalable is Gnutella? How robust is Gnutella? Why does FreeNet
More informationParallel Ray Tracing using MPI: A Dynamic Load-balancing Approach
Parallel Ray Tracing using MPI: A Dynamic Load-balancing Approach S. M. Ashraful Kadir 1 and Tazrian Khan 2 1 Scientific Computing, Royal Institute of Technology (KTH), Stockholm, Sweden smakadir@csc.kth.se,
More informationDATA ANALYSIS IN PUBLIC SOCIAL NETWORKS
International Scientific Conference & International Workshop Present Day Trends of Innovations 2012 28 th 29 th May 2012 Łomża, Poland DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS Lubos Takac 1 Michal Zabovsky
More informationStructure of a large social network
PHYSICAL REVIEW E 69, 036131 2004 Structure of a large social network Gábor Csányi 1, * and Balázs Szendrői 2, 1 TCM Group, Cavendish Laboratory, University of Cambridge, Madingley Road, Cambridge CB3
More informationParallel Algorithms for Small-world Network. David A. Bader and Kamesh Madduri
Parallel Algorithms for Small-world Network Analysis ayssand Partitioning atto g(s (SNAP) David A. Bader and Kamesh Madduri Overview Informatics networks, small-world topology Community Identification/Graph
More informationLoad balancing in a heterogeneous computer system by self-organizing Kohonen network
Bull. Nov. Comp. Center, Comp. Science, 25 (2006), 69 74 c 2006 NCC Publisher Load balancing in a heterogeneous computer system by self-organizing Kohonen network Mikhail S. Tarkov, Yakov S. Bezrukov Abstract.
More informationJava Modules for Time Series Analysis
Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series
More informationGENERATING AN ASSORTATIVE NETWORK WITH A GIVEN DEGREE DISTRIBUTION
International Journal of Bifurcation and Chaos, Vol. 18, o. 11 (2008) 3495 3502 c World Scientific Publishing Company GEERATIG A ASSORTATIVE ETWORK WITH A GIVE DEGREE DISTRIBUTIO JI ZHOU, XIAOKE XU, JIE
More informationInteger Operations. Overview. Grade 7 Mathematics, Quarter 1, Unit 1.1. Number of Instructional Days: 15 (1 day = 45 minutes) Essential Questions
Grade 7 Mathematics, Quarter 1, Unit 1.1 Integer Operations Overview Number of Instructional Days: 15 (1 day = 45 minutes) Content to Be Learned Describe situations in which opposites combine to make zero.
More informationAn Introduction to APGL
An Introduction to APGL Charanpal Dhanjal February 2012 Abstract Another Python Graph Library (APGL) is a graph library written using pure Python, NumPy and SciPy. Users new to the library can gain an
More informationMANAGEMENT SCIENCE. Special Issue on Complex Systems Luis A. Nunes Amaral, Brian Uzzi, Editors
A JOURNAL OF THE INSTITUTE FOR OPERATIONS RESEARCH AND THE MANAGEMENT SCIENCES MANAGEMENT SCIENCE Volume 53 Number 7 July 7 Special Issue on Complex Systems Luis A. Nunes Amaral, Brian Uzzi, Editors Amaral,
More informationStandard for Software Component Testing
Standard for Software Component Testing Working Draft 3.4 Date: 27 April 2001 produced by the British Computer Society Specialist Interest Group in Software Testing (BCS SIGIST) Copyright Notice This document
More informationTRU Math Conversation Guide
Release Version Alpha TRU Math Conversation Guide Module A: Contextual Algebraic Tasks This TRU Math Conversation Guide, Module A: Contextual Algebraic Tasks is a product of The Algebra Teaching Study
More information