Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis



Similar documents
Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis. Contents. Introduction. Maarten van Steen. Version: April 28, 2014

Social Media Mining. Graph Essentials

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Distance Degree Sequences for Network Analysis

Social Media Mining. Network Measures

Course on Social Network Analysis Graphs and Networks

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014

BOUNDARY EDGE DOMINATION IN GRAPHS

A discussion of Statistical Mechanics of Complex Networks P. Part I

Small Maximal Independent Sets and Faster Exact Graph Coloring

V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005

A MEASURE OF GLOBAL EFFICIENCY IN NETWORKS. Aysun Aytac 1, Betul Atay 2. Faculty of Science Ege University 35100, Bornova, Izmir, TURKEY

Graph Mining and Social Network Analysis

On the independence number of graphs with maximum degree 3

Introduction to Graph Mining

GRAPH THEORY LECTURE 4: TREES

Part 2: Community Detection

Tracers Placement for IP Traceback against DDoS Attacks

SEMITOTAL AND TOTAL BLOCK-CUTVERTEX GRAPH

A Study of Sufficient Conditions for Hamiltonian Cycles

Simple Graphs Degrees, Isomorphism, Paths

What cannot be measured on the Internet? Yvonne-Anne Pignolet, Stefan Schmid, G. Trédan. Misleading stars

Chapter Load Balancing. Approximation Algorithms. Load Balancing. Load Balancing on 2 Machines. Load Balancing: Greedy Scheduling

Extracting Information from Social Networks

Distributed Computing over Communication Networks: Maximal Independent Set

A 2-factor in which each cycle has long length in claw-free graphs

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

Midterm Practice Problems

! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?

Mining Social Network Graphs

Chapter 6: Graph Theory

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

CIS 700: algorithms for Big Data

Link Prediction in Social Networks

Finding and counting given length cycles

Seminar. Path planning using Voronoi diagrams and B-Splines. Stefano Martina

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

Graph Theory Problems and Solutions

Option 1: empirical network analysis. Task: find data, analyze data (and visualize it), then interpret.

Class One: Degree Sequences

MATHEMATICAL ENGINEERING TECHNICAL REPORTS. An Improved Approximation Algorithm for the Traveling Tournament Problem

Zachary Monaco Georgia College Olympic Coloring: Go For The Gold

Why? A central concept in Computer Science. Algorithms are ubiquitous.

Random graphs with a given degree sequence

Gephi Tutorial Visualization

Practical Graph Mining with R. 5. Link Analysis

Analysis of Algorithms, I

Generalized Induced Factor Problems

Handout #Ch7 San Skulrattanakulchai Gustavus Adolphus College Dec 6, Chapter 7: Digraphs

Frans J.C.T. de Ruiter, Norman L. Biggs Applications of integer programming methods to cages

136 CHAPTER 4. INDUCTION, GRAPHS AND TREES

! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. !-approximation algorithm.

Complex Networks Analysis: Clustering Methods

Graph/Network Visualization

Total colorings of planar graphs with small maximum degree

Outline 2.1 Graph Isomorphism 2.2 Automorphisms and Symmetry 2.3 Subgraphs, part 1

Distributed Computing over Communication Networks: Topology. (with an excursion to P2P)


How To Understand The Network Of A Network

Mining Social-Network Graphs

Graph theory and network analysis. Devika Subramanian Comp 140 Fall 2008

Graph theoretic techniques in the analysis of uniquely localizable sensor networks

Graph theoretic approach to analyze amino acid network

Discrete Mathematics & Mathematical Reasoning Chapter 10: Graphs

Inet-3.0: Internet Topology Generator

Systems and Algorithms for Big Data Analytics

Reductions & NP-completeness as part of Foundations of Computer Science undergraduate course

Graph Mining and Social Network Analysis. Data Mining; EECS 4412 Darren Rolfe + Vince Chu

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*

Topological Properties

Euler Paths and Euler Circuits

Chapter 1. Introduction

Outline. NP-completeness. When is a problem easy? When is a problem hard? Today. Euler Circuits

Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. #-approximation algorithm.

Extremal Wiener Index of Trees with All Degrees Odd

Protein Protein Interaction Networks

Ph.D. Thesis. Judit Nagy-György. Supervisor: Péter Hajnal Associate Professor

The Open University s repository of research publications and other research outputs

Cycles and clique-minors in expanders

Misleading Stars: What Cannot Be Measured in the Internet?

DECENTRALIZED SCALE-FREE NETWORK CONSTRUCTION AND LOAD BALANCING IN MASSIVE MULTIUSER VIRTUAL ENVIRONMENTS

Every tree contains a large induced subgraph with all degrees odd

Graph Theory and Complex Networks

SCAN: A Structural Clustering Algorithm for Networks

Graph Processing and Social Networks

Greedy Routing on Hidden Metric Spaces as a Foundation of Scalable Routing Architectures

1. Nondeterministically guess a solution (called a certificate) 2. Check whether the solution solves the problem (called verification)

Transcription:

Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.0, steen@cs.vu.nl Chapter 06: Network analysis Version: April 8, 04

/ 3 Contents Chapter Description 0: Introduction History, background 0: Foundations Basic terminology and properties of graphs 03: Extensions Directed & weighted graphs, colorings 04: Network traversal Walking through graphs (cf. traveling) 05: Trees Graphs without cycles; routing algorithms 06: Network analysis Basic metrics for analyzing large graphs 07: Random networks Introduction modeling real-world networks 08: Computer networks The Internet & WWW seen as a huge graph 09: Social networks Communities seen as graphs

Network analysis Introduction Observation In real-world situations, graphs (or networks) may become very large, making it difficult to (visually) discover properties we need network analysis tools. Vertex degrees: Consider the distribution of degrees: how many vertices have high degrees versus the number of vertices with low degrees. Distance statistics: Focus on where vertices are positioned in the network: far away from each other, central in the network, etc. Clustering: To what extent are my neighbors also adjacent to each other? Centrality: Are there vertices that are more important than others? 3 / 3

Network analysis 6. Vertex degree Vertex degree Question Can you visually observe real (nonisomorphic) differences? 4 / 3

5 / 3 Network analysis 6. Vertex degree Vertex degree: Histogram (a): (b): n = 00, m = 300 n = 00, m = 37 30 5 0 (a) (b) 5 0 5 3 4 5 6 7 8 9 0 3

6 / 3 Network analysis 6. Vertex degree Vertex degree: Ranked histogram 0 9 8 7 6 5 4 0 8 6 4 0 40 60 80 00 0 40 60 80 00

7 / 3 Network analysis 6. Distance statistics Distance statistics Definition G is connected, d(u,v) is distance between vertices u and v: the length of a shortest path between u and v. Eccentricity ε(u): Radius rad(g): Diameter diam(g): max{d(u, v) v V (G)} min{ε(u) u V (G)} max{d(u, v) u, v V (G)} Note Note that these definitions apply to directed as well as undirected graphs.

8 / 3 Network analysis 6. Distance statistics Path lengths Definition G is connected with vertex V ; d(u) is average length of shortest paths from u to any other vertex v: d(u) def = V d(u,v) v V,v u The average path length d(g): d(g) def = V u V d(u) = V V d(u,v) u,v V,u v

Network analysis 6. Distance statistics Path lengths Definition The characteristic path length is the median over all d(u). Note The median over n nondecreasing values x,x,...,x n : n odd x (n+)/ n even (x n/ + x n/+ )/ The median separates the higher values from the lower values into two equally-sized subsets. Example {3,4,4,6,0,6,} [0,,3,4,4,6,6] M = x (7+)/ = x 4 = 4 9 / 3

0 / 3 Network analysis 6. Distance statistics Example distance statistics 4 5 3 5 3 7 3 5 6 Vertex 3 4 5 6 7 ε(u) v u d(u,v) d(u) 0 5 3 3 7 7 3.50 0 5 4 7 3 7 3.67 3 5 5 0 7 4 3 7 6 4.33 4 3 7 0 6 9 5 9 3 5.33 5 3 4 4 6 0 6 6 4 4.00 6 7 7 9 6 0 5 9 36 6.00 7 3 3 5 5 0 5 9 3.7

Network analysis 6.3 Clustering coefficient Clustering coefficient Observation Many networks show a high degree of clustering: my neighbors are each other s neighbors. Note An extreme case is formed by having all my neighbors be adjacent to each other neighbors form a complete graph. Question What is the other extreme case? / 3

Network analysis 6.3 Clustering coefficient Clustering coefficient Definition G is simple, connected, undirected. Vertex v V (G) with neighborset N(v). Let n v = N(v). Note: max. number of edges between neighbors is ( n v ). Let m v is number of edges in subgraph induced by N(v): m v = E(G[N(v)]). Clustering coefficient cc(v): cc(v) def = { mv / ( n v ) = m v n v (n v ) if δ(v) > undefined otherwise / 3

Network analysis 6.3 Clustering coefficient Clustering coefficient Definition G is simple, connected and undirected. Let V def = {v V (G) δ(v) > }. Clustering coefficient CC(G) for G: CC(G) def = V v V cc(v) 3 / 3

Network analysis 6.3 Clustering coefficient Clustering coefficient: triangles Definition A triangle is a complete (sub)graph with exactly 3 vertices. A triple is a (sub)graph with exactly 3 vertices and edges. Definition G is simple and connected with n (G) distinct triangles and n Λ (G) distinct triples. The network transitivity τ(g) def = n (G)/n Λ (G). Notation A triple at v: v is incident to both edges ( in the middle ). n Λ (v) : number of triples at v. 4 / 3

5 / 3 Network analysis 6.3 Clustering coefficient Clustering coefficient: example 5 3 4 5 3 7 5 3 6 Vertex: 3 4 5 6 7 cc: /3 0 /3 undefined /3 n Λ : 3 3 3 0 6 Vertex N() = {,5,7}; E(G[N()]) = 5,7 cc() = 3 Triples at : G[{,,5}],G[{,,7}],G[{5,,7}]

6 / 3 Network analysis 6.3 Clustering coefficient Clustering coefficient versus transitivity Observation Let n (v) be the number of triangles of which v is member cc(v) = n (v) n Λ (v) n Λ (v) = ( δ(v) ) n (G) = 3 v V n (v) (Note: V = {v V δ(v) > })

7 / 3 Network analysis 6.3 Clustering coefficient Clustering coefficient versus transitivity x x x v v v 3... v n v k v v v 3... v n y G k = G[{x,y,v,v,...,v k }] : cc(u) = y if u = v,...,v k ) = k k(k+) = k+ if u = x or u = y k ( k+ CC(G k ) = k + ( k + + k ) = k + k + 4 k + 3k + lim k CC(G k) = y

8 / 3 Network analysis 6.3 Clustering coefficient Clustering coefficient versus transitivity x v v v 3... v n G k = G[{x,y,v,v,...,v k }] { if u = v,...,v k n Λ (u) = ) ( = k+ ) if u = x,y ( δ(u) y τ(g k ) = n (G k ) n Λ (u) = k = k(k + ) + k k + lim τ(g k) = 0 k

Network analysis 6.4 Centrality Centrality Issue Are there any vertices that are more important than the others? Definition G is (strongly) connected. The center C(G) is the set of vertices with minimal eccentricity: Intuition C(G) def = {v V (G) ε(v) = rad(g)} At the center means at minimal distance to the farthest node. 9 / 3

0 / 3 Network analysis 6.4 Centrality Vertex centrality Definition G is (strongly) connected. The (eccentricity based) vertex centrality c E (u) of u: c E (u) def = ε(u) Intuition The higher the centrality, the closer to the center of a graph.

Network analysis 6.4 Centrality Closeness Definition G is (strongly) connected. The closeness c C (u) of u: c C (u) def = v V (G) d(u,v) Intuition How close is a vertex to all other nodes? / 3

/ 3 Network analysis 6.4 Centrality Centrality: example 5 3 4 5 3 7 5 3 6 Vertex: 3 4 5 6 7 ε(u) 7 7 7 9 6 9 5 d(u, ) 7 3 4 37 9 c C (u): 0.048 0.045 0.037 0.03 0.04 0.07 0.034

Network analysis 6.4 Centrality Betweenness Intuition Important vertices are those whose removal significantly increases the distance between other vertices. Example: cut vertices. Definition G is simple and (strongly) connected. S(x,y) is set of shortest paths between x and y. S(x,u,y) S(x,y) paths that pass through u. Betweenness centrality c B (u) of u: c B (u) def = x y u S(x,u,y) S(x, y) 3 / 3