SGL: Stata graph library for network analysis
|
|
|
- Ambrose Flynn
- 10 years ago
- Views:
Transcription
1 SGL: Stata graph library for network analysis Hirotaka Miura Federal Reserve Bank of San Francisco Stata Conference Chicago 2011 The views presented here are my own and do not necessarily represent the views of the Federal Reserve System or any person affiliated with it.
2 Introduction What is network analysis? Network analysis is an application of network theory, which is a subfield of graph theory, and is concerned with analyzing relational data. Some questions network analysis tries to address is how important, or how central are the actors in the network and how concentrated is the network. Example usages of network analysis include: Determining the importance of a web page using Google s PageRank. Examining communication networks in intelligence and computer security. Solving transportation problems that involve flow of traffic or commodities. Addressing the too-connected-to-fail problem in financial networks. Analyzing social relationships between individuals in social network analysis. 2 / 41
3 Introduction Outline Outline Modeling relational data Matrix representations Centrality measures Clustering coefficient Stata implementation Conclusion 3 / 41
4 Modeling relational data Definitions Graph model A graph model representing a network G = (V, E) consists of a set of vertices V and a set of edges E. V equals the number of vertices. E equals the number of edges. An edge is defined as a link between two vertices i and j, not necessarily distinct, that has vertex i on one end and vertex j on the other. An edge may be directed or undirected and may also be weighted with differing edge values or have all equal edge values of one in which case the network is said to be unweighted. 4 / 41
5 Modeling relational data Definitions Special cases Special types of vertices and edges exist for which standard graph algorithms are not designed to handle or there simply does not exist routines to accommodate such types. Thus the following types of vertices and edges are currently excluded from analysis: Isolated vertex - a vertex that is not attached to any edges. Parallel edges - two or more edges that connect the same pair of vertices. Self-loop - an edge connecting vertex i to itself. Zero or negative weighted edge. 5 / 41
6 Modeling relational data Storing data Storing relational data A variety of storage types are available for capturing relational data: Adjacency matrix Adjacency list Core SGL algorithms use this structure. Edge list Most suited for storing relational data in Stata, as it allows the use of options such as if exp and in range. Plus others such as the Compressed Sparse Row format for efficient storage and access. 6 / 41
7 Modeling relational data Storing data Example storage types Undirected unweighted network. Drawn using NETPLOT (Corten, 2011). 3 Adjacency matrix Adjacency list Edge list Vertex Neighbor(s) / 41
8 Modeling relational data Using joinby Creating an edge list using joinby (1) We illustrate a method of creating an edge list using the joinby command and example datasets used in [D] Data-Management Reference Manual, child.dta and parent.dta. Directed edges can represent parent vertices providing care to child vertices.. use child, clear (Data on Children). list family ~ d child_id x1 x use parent, clear (Data on Parents). list family ~ d parent ~ d x1 x / 41
9 Modeling relational data Using joinby Creating an edge list using joinby (2). sort family_id. joinby family_id using child. list parent_id child_id parent ~ d child_id Drawn using NETPLOT (Corten, 2011). 9 / 41
10 Matrix representations Matrix representations 10 / 41
11 Matrix representations Adjacency matrix Adjacency matrix Adjacency matrix A for unweighted networks is defined as a V V matrix with A ij entries being equal to one if an edge connects vertices i and j and zero otherwise. A ii entries are set to zero. Matrix A is symmetric if the network is undirected. For directed networks, rows of matrix A represent outgoing edges and columns represent incoming edges. The convention of denoting X ij entries as an edge from i to j is adopted for all matrices. For weighted networks, A ij entries are equal to the weight of the edge connecting vertices i and j. 11 / 41
12 Matrix representations Distance matrix Distance matrix Distance matrix D is defined as a V V matrix with D ij entries being equal to the length of the shortest path between vertices i and j. A path is defined as a way of reaching vertex j starting from vertex i using a combination of edges that do not go through a particular vertex more than once. If no path connects vertices i and j, D ij is set to missing. Signifies what is sometimes referred to as an infinite path. D ii is set to zero. For undirected networks, matrix D is symmetric. 12 / 41
13 Matrix representations Path matrix Path matrix Path matrix P is defined as a V V matrix with P ij entries being equal to the number of shortest paths between vertices i and j. If no paths exist between vertices i and j, P ij is set to zero. P ii is set to one. P matrix is symmetric for undirected networks. 13 / 41
14 Matrix representations Example Example matrices Undirected unweighted network. Drawn using NETPLOT (Corten, 2011). 3 Adjacency matrix Distance matrix Path matrix / 41
15 Centrality measures Centrality measures 15 / 41
16 Centrality measures Degree centrality Degree centrality (1) Undirected network Degree centrality measures the importance of a vertex by the number of connection the vertex has if the network is unweighted, and by the aggregate of the weights of edges connected to the vertex if the network is weighted (Freeman, 1978). For an undirected network, degree centrality for vertex i is defined as 1 V 1 A ij (1) j( i) where the leading divisor is adjusted for the exclusion of the j = i term. 16 / 41
17 Centrality measures Degree centrality Degree centrality (2) Directed network Directed networks may entail vertices having different number of incoming and outgoing edges, and thus we have out-degree and in-degree centrality. Out-degree centrality for vertex i is defined similarly to equation (1). For in-degree, we simply transpose the adjacency matrix: 1 V 1 A ij. (2) j( i) 17 / 41
18 Centrality measures Degree centrality Example Undirected unweighted network Centrality comparison G (0.33) A (0.33) Centrality Vertex E (0.50) D (0.33) C (0.50) A B F C G E D F (0.33) B (0.33) Degree Closeness Betweenness Eigenvector Katz-Bonacich Degree centrality in parentheses. Figure from Jackson (2008). Drawn using NETPLOT (Corten, 2011). 18 / 41
19 Centrality measures Closeness centrality Closeness centrality Closeness centrality provides higher centrality scores to vertices that are situated closer to members of their component, or the set of reachable vertices, by taking the inverse of the average shortest paths as a measure of proximity (Freeman, 1978). That is, closeness centrality for vertex i is defined as ( V 1) j( i) D, (3) ij which reflects how vertices with smaller average shortest path lengths receive higher centrality scores than those that are situated farther away from members of their component. 19 / 41
20 Centrality measures Closeness centrality Example Undirected unweighted network Centrality comparison G (0.40) A (0.40) Centrality Vertex E (0.55) D (0.60) C (0.55) A B F C G E D F (0.40) B (0.40) Degree Closeness Betweenness Eigenvector Katz-Bonacich Closeness centrality in parentheses. Figure from Jackson (2008). Drawn using NETPLOT (Corten, 2011). 20 / 41
21 Centrality measures Betweenness centrality Betweenness centrality Betweenness centrality bestows larger centrality scores on vertices that lie on a higher proportion of shortest paths linking vertices other than itself. Let P ij denote the number of shortest paths from vertex i to j. Let P ij (k) denote the number of shortest paths from vertex i to j that vertex k lies on. Then following Anthonisse (1971) and Freeman (1977), betweenness centrality measure for vertex k is defined as ij:i j,k ij P ij (k) P ij. (4) To normalize (4), divide by ( V 1)( V 2), the maximum number of paths a given vertex could lie on between pairs of other vertices. 21 / 41
22 Centrality measures Betweenness centrality Example Undirected unweighted network Centrality comparison G (0.00) A (0.00) Centrality Vertex E (0.53) D (0.60) C (0.53) A B F C G E D F (0.00) B (0.00) Degree Closeness Betweenness Eigenvector Katz-Bonacich Normalized betweenness centrality in parentheses. Figure from Jackson (2008). Drawn using NETPLOT (Corten, 2011). 22 / 41
23 Centrality measures Eigenvector centrality Eigenvector centrality (1) Eigenvector centrality can provide an indication on how important a vertex is by having the property of being large if a vertex has many neighbors, important neighbors, or both (Bonacich, 1972). For an undirected network with adjacency matrix A, centrality of vertex i, x i, can be expressed as x i = λ 1 j A ij x j (5) which can be rewritten as λx = Ax. (6) The convention is to use the eigenvector corresponding to the dominant eigenvalue of A. 23 / 41
24 Centrality measures Eigenvector centrality Eigenvector centrality (2) For directed networks, the general concern is in obtaining a centrality measure based on how often a vertex is being pointed to and the importance of neighbors associated with the incoming edges. Thus with a slight modification to equation (6), eigenvector centrality is redefined as λx = A x (7) where A is the transposed adjacency matrix. There are several shortcomings to the eigenvector centrality: A vertex with no incoming edges will always have centrality of zero. Vertices with neighbors that all have zero incoming edges will also have zero centrality since the sum in equation (5), j A ijx j, will not have any terms. The Katz-Bonacich centrality, a variation of the eigenvector centrality, seeks to address these issues. 24 / 41
25 Centrality measures Eigenvector centrality Example Undirected unweighted network Centrality comparison G (0.33) A (0.33) Centrality Vertex E (0.45) D (0.38) C (0.45) A B F C G E D F (0.33) B (0.33) Degree Closeness Betweenness Eigenvector Katz-Bonacich Eigenvector centrality in parentheses. Figure from Jackson (2008). Drawn using NETPLOT (Corten, 2011). 25 / 41
26 Centrality measures Katz-Bonacich centrality Katz-Bonacich centrality The additional inclusion of a free parameter (also referred to as a decay factor) and a vector of exogenous factors into equation (7): Avoids the exclusion of vertices with zero incoming edges. Allows connection values to decay over distance. Attributed to Katz (1953), Bonacich (1987), and Bonacich and Lloyd (2001). Centrality measure is defined as a solution to the equation x = αa x + β (8) where α is the free parameter and β is the vector of exogenous factors which can vary or be constant across vertices. For the centrality measure to converge properly, absolute value of α must be less than the absolute value of the inverse of the dominant eigenvalue of A. A positive α allows vertices with important neighbors to have higher status while a negative α value reduces the status. 26 / 41
27 Centrality measures Katz-Bonacich centrality Example Undirected unweighted network Centrality comparison G (3.99) A (3.99) Centrality Vertex E (5.06) D (4.34) C (5.06) A B F C G E D Degree Closeness Betweenness Eigenvector Katz-Bonacich* F (3.99) B (3.99) * Maximum α = 0.43 (0.33 used). Exogenous factors set to one for all vertices. Katz-Bonacich centrality in parentheses. Figure from Jackson (2008). Drawn using NETPLOT (Corten, 2011). 27 / 41
28 Clustering coefficient Clustering coefficient 28 / 41
29 Clustering coefficient Introduction Clustering coefficient Clustering coefficient is one way of gauging how tightly connected a network is. The general idea is to consider transitive relations: If vertex j is connected to vertex i, and i is connected to k, then j is also connected to k. Global clustering coefficients provide indication on the degree of concentration of the entire network and consists of overall and average clustering coefficients. Overall clustering coefficient is equal to all observed transitive relations divided by all possible transitive relations in the network. Average clustering coefficient involves applying the definition of overall clustering coefficient at the vertex level, then averaging across all the vertices. 29 / 41
30 Clustering coefficient Overall Overall clustering coefficient For an undirected unweighted adjacency matrix A, overall clustering coefficient is defined as A ji A ik A jk c o (A) = i;j i;k j;k i i;j i;k j;k i A ji A ik (9) where the numerator represents the sum over i of all closed triplets in which transitivity holds, and the denominator represents the sum over i of all possible triplets. 30 / 41
31 Clustering coefficient Local and average Local and average clustering coefficient With a slight modification in notation, local clustering coefficient for vertex i is defined as A ji A ik A jk c i (A) = j i;k j;k i j i;k j;k i which leads to the average clustering coefficient: c a (A) = 1 V A ji A ik (10) c i (A). (11) By convention, c i (A) = 0 if vertex i has zero or only one link. i 31 / 41
32 Clustering coefficient Generalized methods Generalized clustering coefficient Building upon the works of Barrat et al. (2004), Opsahl and Panzarasa (2009) propose generalized methods. Clustering coefficients for vertex i based on weighted adjacency matrix W and corresponding unweighted adjacency matrix A are calculated as c i (W) = j i;k j;k i j i;k j;k i ωa jk ω (12) where ω equals (W ji + W ik )/2 for arithmetic mean, W ji W ik for geometric mean, max(w ji, W ik ) for maximum, and min(w ji, W ik ) for minimum. For unweighted networks, W = A and the four types of clustering coefficients are all equal. 32 / 41
33 Clustering coefficient Generalized methods Example Undirected unweighted network G (1.00) A (1.00) E (0.33) D (0.00) C (0.33) Average clustering coefficient: 0.67 Overall clustering coefficient: 0.55 F (1.00) B (1.00) Local clustering coefficients in parentheses. Figure from Jackson (2008). Drawn using NETPLOT (Corten, 2011). 33 / 41
34 Stata implementation Stata implementation 34 / 41
35 Stata implementation Using network and netsummarize network and netsummarize commands We demonstrate the use of network and netsummarize commands on a dataset of 15th-century Florentine marriages from Padgett and Ansell (1993) to compute betweenness and eigenvector centrality measures. network generates vectors of betweenness and eigenvector centralities. netsummarize merges vectors to Stata dataset.. use florentine_marriages, clear (15th Florentine marriages 15th-century > (Padgett and Ansell 1993)) marriages Edge. list list stored as Stata dataset v1 v2 1. Peruzzi Castellan 2. Peruzzi Strozzi 3. Peruzzi Bischeri 4. Castellan Strozzi 5. Castellan Barbadori 6. Strozzi Ridolfi 7. Strozzi Bischeri 8. Bischeri Guadagni 9. Barbadori Medici 10. Ridolfi Medici 11. Ridolfi Tornabuon 12. Medici Tornabuon 13. Medici Albizzi 14. Medici Salviati 15. Medici Acciaiuol 16. Tornabuon Guadagni 17. Guadagni Lambertes 18. Guadagni Albizzi 19. Albizzi Ginori 20. Salviati Pazzi 35 / 41
36 Stata implementation Using network and netsummarize Computing network centrality measures. // Generate betweenness centrality.. network v1 v2, measure(betweenness) name(b,replace) Breadth-first search algorithm (15 vertices) Breadth-first search algorithm completed Betweenness centrality calculation (15 vertices) Betweenness centrality calculation completed matrix b saved in Mata. netsummarize b/((rows(b)-1)*(rows(b)-2)), > generate(betweenness) statistic(rowsum). // Generate eigenvector centrality.. network v1 v2, measure(eigenvector) name(e,replace) matrix e saved in Mata. netsummarize e, generate(eigenvector) statistic(rowsum) 36 / 41
37 Stata implementation Using network and netsummarize Data description (1). describe, full Contains data from florentine_marriages.dta obs: 20 15th century Florentine marriages > (Padgett and Ansell 1993) vars: Dec :45 size: 1,160 (99.9% of memory free) storage display value variable name type format label variable label v1 str9 %9s v2 str9 %9s betweenness_source float %9.0g rowsum of Mata matrix b/((rows(b)-1)*(rows(b)-2)) betweenness_target float %9.0g rowsum of Mata matrix b/((rows(b)-1)*(rows(b)-2)) eigenvector_source float %9.0g rowsum of Mata matrix e eigenvector_target float %9.0g rowsum of Mata matrix e Sorted by: Note: dataset has changed since last saved 37 / 41
38 Stata implementation Using network and netsummarize Data description (2). list v1 v2 betweenness_source betweenness_target eigenvector_source eigenve > ctor_target v1 v2 betwee ~ e betwee ~ t eigenv ~ e eigenv ~ t 1. Peruzzi Castellan Peruzzi Strozzi Peruzzi Bischeri Castellan Strozzi Castellan Barbadori Strozzi Ridolfi Strozzi Bischeri Bischeri Guadagni Barbadori Medici Ridolfi Medici Ridolfi Tornabuon Medici Tornabuon Medici Albizzi Medici Salviati Medici Acciaiuol Tornabuon Guadagni Guadagni Lambertes Guadagni Albizzi Albizzi Ginori Salviati Pazzi / 41
39 Stata implementation Using network and netsummarize Network visualization Betweenness centrality Eigenvector centrality Ginori (0.00) Ginori (0.07) Lambertes (0.00) Albizzi (0.21) Guadagni (0.25) Acciaiuol (0.00) Tornabuon (0.09) Medici (0.52) Bischeri (0.10) Salviati (0.14) Ridolfi (0.11) Pazzi (0.00) Lambertes (0.09) Albizzi (0.24) Guadagni (0.29) Acciaiuol (0.13) Tornabuon (0.33) Medici (0.43) Bischeri (0.28) Salviati (0.15) Ridolfi (0.34) Pazzi (0.04) Strozzi (0.10) Peruzzi (0.02) Barbadori (0.09) Strozzi (0.36) Peruzzi (0.28) Barbadori (0.21) Castellan (0.05) Castellan (0.26) Centrality scores in parentheses. Drawn using NETPLOT (Corten, 2011). 39 / 41
40 Conclusion Conclusion Different types of matrices, centrality measures, and clustering coefficients can be generated from information retrieved from relational data. The command network provides access to SGL functions that generate network measures based on edge list stored in Stata. The postcomputation command netsummarize allows the user to generate standard and customized network measures which are merged into Stata dataset. Future developments include: More efficient algorithms. Designing functions for additional network measures. Optimizing SGL in Mata. 40 / 41
41 References References I Anthonisse, J. M. (1971): The rush in a directed graph, Discussion paper, Stichting Mathematisch Centrum, Amsterdam, Technical Report BN 9/71. Barrat, A., M. Barthélemy, R. Pastor-Satorras, and A. Vespignani (2004): The architecture of complex weighted networks, Proceedings of the National Academy of Sciences, 101(11), Bonacich, P. (1972): Factoring and weighting approaches to status scores and clique identification, Journal of Mathematical Sociology 2, pp (1987): Power and Centrality: A Family of Measures, The American Journal of Sociology, 92(5), Bonacich, P., and P. Lloyd (2001): Eigenvector-like measures of centrality for asymmetric relations, Social Networks, 23(3), Corten, R. (2011): Visualization of social networks in Stata using multidimensional scaling, Stata Journal, 11(1), Freeman, L. C. (1977): A set of measures of centrality based on betweenness, Sociometry, 40(1), (1978): Centrality in social networks: Conceptual clarification, Social Networks, (1), Jackson, M. O. (2008): Social and Economic Networks. Princeton University Press. Katz, L. (1953): A New Status Index Derived from Sociometric Analysis, Psychometrika, 18, Opsahl, T., and P. Panzarasa (2009): Clustering in weighted networks, Social Networks, 31(2), Padgett, J. F., and C. K. Ansell (1993): Robust action and the rise of the Medici, , The American Journal of Sociology, 98(6), / 41
Visualization of Social Networks in Stata by Multi-dimensional Scaling
Visualization of Social Networks in Stata by Multi-dimensional Scaling Rense Corten Department of Sociology/ICS Utrecht University The Netherlands [email protected] April 12, 2010 1 Introduction Social network
DATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
Follow links for Class Use and other Permissions. For more information send email to: [email protected]
COPYRIGHT NOTICE: Matthew O. Jackson: Social and Economic Networks is published by Princeton University Press and copyrighted, 2008, by Princeton University Press. All rights reserved. No part of this
Eigenvector-like measures of centrality for asymmetric relations
Social Networks 23 (2001) 191 201 Eigenvector-like measures of centrality for asymmetric relations Phillip Bonacich, Paulette Lloyd Department of Sociology, University of California at Los Angeles, 2201
Social Media Mining. Network Measures
Klout Measures and Metrics 22 Why Do We Need Measures? Who are the central figures (influential individuals) in the network? What interaction patterns are common in friends? Who are the like-minded users
Social Network Analysis: Lecture 4-Centrality Measures
Social Network Analysis: Lecture 4-Centrality Measures Donglei Du ([email protected]) Faculty of Business Administration, University of New Brunswick, NB Canada Fredericton E3B 9Y2 Donglei Du (UNB) Social Network
Social network and smoking: a pilotstudyamonghigh-schoolstudents
2013 Italian Stata Users Group meeting Social network and smoking: a pilotstudyamonghigh-schoolstudents Bruno Federico Firenze, 14 novembre 2013 The social pattern of smoking Smoking is an important risk
Practical Graph Mining with R. 5. Link Analysis
Practical Graph Mining with R 5. Link Analysis Outline Link Analysis Concepts Metrics for Analyzing Networks PageRank HITS Link Prediction 2 Link Analysis Concepts Link A relationship between two entities
Social Media Mining. Graph Essentials
Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures
Algorithms for representing network centrality, groups and density and clustered graph representation
COSIN IST 2001 33555 COevolution and Self-organization In dynamical Networks Algorithms for representing network centrality, groups and density and clustered graph representation Deliverable Number: D06
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA [email protected]
HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy
SOCIAL NETWORK ANALYSIS University of Amsterdam, Netherlands Keywords: Social networks, structuralism, cohesion, brokerage, stratification, network analysis, methods, graph theory, statistical models Contents
Mining Social-Network Graphs
342 Chapter 10 Mining Social-Network Graphs There is much information to be gained by analyzing the large-scale data that is derived from social networks. The best-known example of a social network is
Walk-Based Centrality and Communicability Measures for Network Analysis
Walk-Based Centrality and Communicability Measures for Network Analysis Michele Benzi Department of Mathematics and Computer Science Emory University Atlanta, Georgia, USA Workshop on Innovative Clustering
Part 2: Community Detection
Chapter 8: Graph Data Part 2: Community Detection Based on Leskovec, Rajaraman, Ullman 2014: Mining of Massive Datasets Big Data Management and Analytics Outline Community Detection - Social networks -
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
STATISTICS AND DATA ANALYSIS IN GEOLOGY, 3rd ed. Clarificationof zonationprocedure described onpp. 238-239
STATISTICS AND DATA ANALYSIS IN GEOLOGY, 3rd ed. by John C. Davis Clarificationof zonationprocedure described onpp. 38-39 Because the notation used in this section (Eqs. 4.8 through 4.84) is inconsistent
How To Understand And Solve Algebraic Equations
College Algebra Course Text Barnett, Raymond A., Michael R. Ziegler, and Karl E. Byleen. College Algebra, 8th edition, McGraw-Hill, 2008, ISBN: 978-0-07-286738-1 Course Description This course provides
Graph Processing and Social Networks
Graph Processing and Social Networks Presented by Shu Jiayu, Yang Ji Department of Computer Science and Engineering The Hong Kong University of Science and Technology 2015/4/20 1 Outline Background Graph
How To Understand The Network Of A Network
Roles in Networks Roles in Networks Motivation for work: Let topology define network roles. Work by Kleinberg on directed graphs, used topology to define two types of roles: authorities and hubs. (Each
An Introduction to APGL
An Introduction to APGL Charanpal Dhanjal February 2012 Abstract Another Python Graph Library (APGL) is a graph library written using pure Python, NumPy and SciPy. Users new to the library can gain an
General Network Analysis: Graph-theoretic. COMP572 Fall 2009
General Network Analysis: Graph-theoretic Techniques COMP572 Fall 2009 Networks (aka Graphs) A network is a set of vertices, or nodes, and edges that connect pairs of vertices Example: a network with 5
Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014
Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)
Network Analysis: Lecture 1. Sacha Epskamp 02-09-2014
: Lecture 1 University of Amsterdam Department of Psychological Methods 02-09-2014 Who are you? What is your specialization? Why are you here? Are you familiar with the network perspective? How familiar
arxiv:1201.3120v3 [math.na] 1 Oct 2012
RANKING HUBS AND AUTHORITIES USING MATRIX FUNCTIONS MICHELE BENZI, ERNESTO ESTRADA, AND CHRISTINE KLYMKO arxiv:1201.3120v3 [math.na] 1 Oct 2012 Abstract. The notions of subgraph centrality and communicability,
USE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS
USE OF EIGENVALUES AND EIGENVECTORS TO ANALYZE BIPARTIVITY OF NETWORK GRAPHS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA [email protected] ABSTRACT This
Social and Technological Network Analysis. Lecture 3: Centrality Measures. Dr. Cecilia Mascolo (some material from Lada Adamic s lectures)
Social and Technological Network Analysis Lecture 3: Centrality Measures Dr. Cecilia Mascolo (some material from Lada Adamic s lectures) In This Lecture We will introduce the concept of centrality and
IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*
IE 680 Special Topics in Production Systems: Networks, Routing and Logistics* Rakesh Nagi Department of Industrial Engineering University at Buffalo (SUNY) *Lecture notes from Network Flows by Ahuja, Magnanti
! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)
E6893 Big Data Analytics Lecture 10: Linked Big Data Graph Computing (II) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and
Analyzing Big Data: Social Choice & Measurement
Analyzing Big Data: Social Choice & Measurement John W. Patty Elizabeth Maggie Penn August 30, 2014 Social scientists commonly construct indices to reduce and summarize complex, multidimensional data.
Why? A central concept in Computer Science. Algorithms are ubiquitous.
Analysis of Algorithms: A Brief Introduction Why? A central concept in Computer Science. Algorithms are ubiquitous. Using the Internet (sending email, transferring files, use of search engines, online
SECTIONS 1.5-1.6 NOTES ON GRAPH THEORY NOTATION AND ITS USE IN THE STUDY OF SPARSE SYMMETRIC MATRICES
SECIONS.5-.6 NOES ON GRPH HEORY NOION ND IS USE IN HE SUDY OF SPRSE SYMMERIC MRICES graph G ( X, E) consists of a finite set of nodes or vertices X and edges E. EXMPLE : road map of part of British Columbia
MATH 551 - APPLIED MATRIX THEORY
MATH 55 - APPLIED MATRIX THEORY FINAL TEST: SAMPLE with SOLUTIONS (25 points NAME: PROBLEM (3 points A web of 5 pages is described by a directed graph whose matrix is given by A Do the following ( points
Operation Count; Numerical Linear Algebra
10 Operation Count; Numerical Linear Algebra 10.1 Introduction Many computations are limited simply by the sheer number of required additions, multiplications, or function evaluations. If floating-point
APPM4720/5720: Fast algorithms for big data. Gunnar Martinsson The University of Colorado at Boulder
APPM4720/5720: Fast algorithms for big data Gunnar Martinsson The University of Colorado at Boulder Course objectives: The purpose of this course is to teach efficient algorithms for processing very large
Arithmetic and Algebra of Matrices
Arithmetic and Algebra of Matrices Math 572: Algebra for Middle School Teachers The University of Montana 1 The Real Numbers 2 Classroom Connection: Systems of Linear Equations 3 Rational Numbers 4 Irrational
1 Introduction to Matrices
1 Introduction to Matrices In this section, important definitions and results from matrix algebra that are useful in regression analysis are introduced. While all statements below regarding the columns
Linear Algebra and TI 89
Linear Algebra and TI 89 Abdul Hassen and Jay Schiffman This short manual is a quick guide to the use of TI89 for Linear Algebra. We do this in two sections. In the first section, we will go over the editing
Analyzing Big Data: Social Choice & Measurement
Analyzing Big Data: Social Choice & Measurement John W. Patty Elizabeth Maggie Penn September 27, 2014 Social scientists commonly construct indices to reduce and summarize complex, multidimensional data.
V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005
V. Adamchik 1 Graph Theory Victor Adamchik Fall of 2005 Plan 1. Basic Vocabulary 2. Regular graph 3. Connectivity 4. Representing Graphs Introduction A.Aho and J.Ulman acknowledge that Fundamentally, computer
Practical statistical network analysis (with R and igraph)
Practical statistical network analysis (with R and igraph) Gábor Csárdi [email protected] Department of Biophysics, KFKI Research Institute for Nuclear and Particle Physics of the Hungarian Academy of
Solving Simultaneous Equations and Matrices
Solving Simultaneous Equations and Matrices The following represents a systematic investigation for the steps used to solve two simultaneous linear equations in two unknowns. The motivation for considering
The University of Chicago Press is collaborating with JSTOR to digitize, preserve and extend access to American Journal of Sociology.
Power and Centrality: A Family of Measures Author(s): Phillip Bonacich Reviewed work(s): Source: American Journal of Sociology, Vol. 92, No. 5 (Mar., 1987), pp. 1170-1182 Published by: The University of
The PageRank Citation Ranking: Bring Order to the Web
The PageRank Citation Ranking: Bring Order to the Web presented by: Xiaoxi Pang 25.Nov 2010 1 / 20 Outline Introduction A ranking for every page on the Web Implementation Convergence Properties Personalized
Statistical Analysis of Complete Social Networks
Statistical Analysis of Complete Social Networks Introduction to networks Christian Steglich [email protected] median geodesic distance between groups 1.8 1.2 0.6 transitivity 0.0 0.0 0.5 1.0 1.5 2.0
Similar matrices and Jordan form
Similar matrices and Jordan form We ve nearly covered the entire heart of linear algebra once we ve finished singular value decompositions we ll have seen all the most central topics. A T A is positive
Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs
CSE599s: Extremal Combinatorics November 21, 2011 Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs Lecturer: Anup Rao 1 An Arithmetic Circuit Lower Bound An arithmetic circuit is just like
Network Analysis Basics and applications to online data
Network Analysis Basics and applications to online data Katherine Ognyanova University of Southern California Prepared for the Annenberg Program for Online Communities, 2010. Relational data Node (actor,
Hadoop SNS. renren.com. Saturday, December 3, 11
Hadoop SNS renren.com Saturday, December 3, 11 2.2 190 40 Saturday, December 3, 11 Saturday, December 3, 11 Saturday, December 3, 11 Saturday, December 3, 11 Saturday, December 3, 11 Saturday, December
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION
A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS Kyoungjin Park Alper Yilmaz Photogrammetric and Computer Vision Lab Ohio State University [email protected] [email protected] ABSTRACT Depending
Manifold Learning Examples PCA, LLE and ISOMAP
Manifold Learning Examples PCA, LLE and ISOMAP Dan Ventura October 14, 28 Abstract We try to give a helpful concrete example that demonstrates how to use PCA, LLE and Isomap, attempts to provide some intuition
Analysis of Algorithms, I
Analysis of Algorithms, I CSOR W4231.002 Eleni Drinea Computer Science Department Columbia University Thursday, February 26, 2015 Outline 1 Recap 2 Representing graphs 3 Breadth-first search (BFS) 4 Applications
CONTROLLABILITY. Chapter 2. 2.1 Reachable Set and Controllability. Suppose we have a linear system described by the state equation
Chapter 2 CONTROLLABILITY 2 Reachable Set and Controllability Suppose we have a linear system described by the state equation ẋ Ax + Bu (2) x() x Consider the following problem For a given vector x in
Graph/Network Visualization
Graph/Network Visualization Data model: graph structures (relations, knowledge) and networks. Applications: Telecommunication systems, Internet and WWW, Retailers distribution networks knowledge representation
The Singular Value Decomposition in Symmetric (Löwdin) Orthogonalization and Data Compression
The Singular Value Decomposition in Symmetric (Löwdin) Orthogonalization and Data Compression The SVD is the most generally applicable of the orthogonal-diagonal-orthogonal type matrix decompositions Every
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS 1. SYSTEMS OF EQUATIONS AND MATRICES 1.1. Representation of a linear system. The general system of m equations in n unknowns can be written a 11 x 1 + a 12 x 2 +
December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B. KITCHENS
December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B KITCHENS The equation 1 Lines in two-dimensional space (1) 2x y = 3 describes a line in two-dimensional space The coefficients of x and y in the equation
Nonlinear Iterative Partial Least Squares Method
Numerical Methods for Determining Principal Component Analysis Abstract Factors Béchu, S., Richard-Plouet, M., Fernandez, V., Walton, J., and Fairley, N. (2016) Developments in numerical treatments for
BookTOC.txt. 1. Functions, Graphs, and Models. Algebra Toolbox. Sets. The Real Numbers. Inequalities and Intervals on the Real Number Line
College Algebra in Context with Applications for the Managerial, Life, and Social Sciences, 3rd Edition Ronald J. Harshbarger, University of South Carolina - Beaufort Lisa S. Yocco, Georgia Southern University
SPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH NATIONAL UNIVERSITY OF SINGAPORE
SPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH NATIONAL UNIVERSITY OF SINGAPORE 2012 SPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH (M.Sc., SFU, Russia) A THESIS
(67902) Topics in Theory and Complexity Nov 2, 2006. Lecture 7
(67902) Topics in Theory and Complexity Nov 2, 2006 Lecturer: Irit Dinur Lecture 7 Scribe: Rani Lekach 1 Lecture overview This Lecture consists of two parts In the first part we will refresh the definition
SOLVING LINEAR SYSTEMS
SOLVING LINEAR SYSTEMS Linear systems Ax = b occur widely in applied mathematics They occur as direct formulations of real world problems; but more often, they occur as a part of the numerical analysis
Social network analysis with R sna package
Social network analysis with R sna package George Zhang iresearch Consulting Group (China) [email protected] [email protected] Social network (graph) definition G = (V,E) Max edges = N All possible
Network (Tree) Topology Inference Based on Prüfer Sequence
Network (Tree) Topology Inference Based on Prüfer Sequence C. Vanniarajan and Kamala Krithivasan Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai 600036 [email protected],
Introduction to Matrix Algebra
Psychology 7291: Multivariate Statistics (Carey) 8/27/98 Matrix Algebra - 1 Introduction to Matrix Algebra Definitions: A matrix is a collection of numbers ordered by rows and columns. It is customary
Warshall s Algorithm: Transitive Closure
CS 0 Theory of Algorithms / CS 68 Algorithms in Bioinformaticsi Dynamic Programming Part II. Warshall s Algorithm: Transitive Closure Computes the transitive closure of a relation (Alternatively: all paths
Linear Programming. March 14, 2014
Linear Programming March 1, 01 Parts of this introduction to linear programming were adapted from Chapter 9 of Introduction to Algorithms, Second Edition, by Cormen, Leiserson, Rivest and Stein [1]. 1
The Characteristic Polynomial
Physics 116A Winter 2011 The Characteristic Polynomial 1 Coefficients of the characteristic polynomial Consider the eigenvalue problem for an n n matrix A, A v = λ v, v 0 (1) The solution to this problem
Algebra Unpacked Content For the new Common Core standards that will be effective in all North Carolina schools in the 2012-13 school year.
This document is designed to help North Carolina educators teach the Common Core (Standard Course of Study). NCDPI staff are continually updating and improving these tools to better serve teachers. Algebra
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS
THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS O.U. Sezerman 1, R. Islamaj 2, E. Alpaydin 2 1 Laborotory of Computational Biology, Sabancı University, Istanbul, Turkey. 2 Computer Engineering
Linear Programming I
Linear Programming I November 30, 2003 1 Introduction In the VCR/guns/nuclear bombs/napkins/star wars/professors/butter/mice problem, the benevolent dictator, Bigus Piguinus, of south Antarctica penguins
Course on Social Network Analysis Graphs and Networks
Course on Social Network Analysis Graphs and Networks Vladimir Batagelj University of Ljubljana Slovenia V. Batagelj: Social Network Analysis / Graphs and Networks 1 Outline 1 Graph...............................
A comparative study of social network analysis tools
Membre de Membre de A comparative study of social network analysis tools David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry International Workshop on Web Intelligence and Virtual Enterprises
3. INNER PRODUCT SPACES
. INNER PRODUCT SPACES.. Definition So far we have studied abstract vector spaces. These are a generalisation of the geometric spaces R and R. But these have more structure than just that of a vector space.
Introduction to Principal Component Analysis: Stock Market Values
Chapter 10 Introduction to Principal Component Analysis: Stock Market Values The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from
Network-Based Tools for the Visualization and Analysis of Domain Models
Network-Based Tools for the Visualization and Analysis of Domain Models Paper presented as the annual meeting of the American Educational Research Association, Philadelphia, PA Hua Wei April 2014 Visualizing
Factor Analysis. Chapter 420. Introduction
Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.
Matrix Multiplication
Matrix Multiplication CPS343 Parallel and High Performance Computing Spring 2016 CPS343 (Parallel and HPC) Matrix Multiplication Spring 2016 1 / 32 Outline 1 Matrix operations Importance Dense and sparse
Matrix Differentiation
1 Introduction Matrix Differentiation ( and some other stuff ) Randal J. Barnes Department of Civil Engineering, University of Minnesota Minneapolis, Minnesota, USA Throughout this presentation I have
A Primer on Index Notation
A Primer on John Crimaldi August 28, 2006 1. Index versus Index notation (a.k.a. Cartesian notation) is a powerful tool for manipulating multidimensional equations. However, there are times when the more
Predicting Influentials in Online Social Networks
Predicting Influentials in Online Social Networks Rumi Ghosh Kristina Lerman USC Information Sciences Institute WHO is IMPORTANT? Characteristics Topology Dynamic Processes /Nature of flow What are the
Data Structure [Question Bank]
Unit I (Analysis of Algorithms) 1. What are algorithms and how they are useful? 2. Describe the factor on best algorithms depends on? 3. Differentiate: Correct & Incorrect Algorithms? 4. Write short note:
The mathematics of networks
The mathematics of networks M. E. J. Newman Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI 48109 1040 In much of economic theory it is assumed that economic agents interact,
ANALYTIC HIERARCHY PROCESS (AHP) TUTORIAL
Kardi Teknomo ANALYTIC HIERARCHY PROCESS (AHP) TUTORIAL Revoledu.com Table of Contents Analytic Hierarchy Process (AHP) Tutorial... 1 Multi Criteria Decision Making... 1 Cross Tabulation... 2 Evaluation
FURTHER VECTORS (MEI)
Mathematics Revision Guides Further Vectors (MEI) (column notation) Page of MK HOME TUITION Mathematics Revision Guides Level: AS / A Level - MEI OCR MEI: C FURTHER VECTORS (MEI) Version : Date: -9-7 Mathematics
Cloud Computing is NP-Complete
Working Paper, February 2, 20 Joe Weinman Permalink: http://www.joeweinman.com/resources/joe_weinman_cloud_computing_is_np-complete.pdf Abstract Cloud computing is a rapidly emerging paradigm for computing,
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a
A linear combination is a sum of scalars times quantities. Such expressions arise quite frequently and have the form
Section 1.3 Matrix Products A linear combination is a sum of scalars times quantities. Such expressions arise quite frequently and have the form (scalar #1)(quantity #1) + (scalar #2)(quantity #2) +...
SCAN: A Structural Clustering Algorithm for Networks
SCAN: A Structural Clustering Algorithm for Networks Xiaowei Xu, Nurcan Yuruk, Zhidan Feng (University of Arkansas at Little Rock) Thomas A. J. Schweiger (Acxiom Corporation) Networks scaling: #edges connected
Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis
Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.0, [email protected] Chapter 06: Network analysis Version: April 8, 04 / 3 Contents Chapter
