Course on Social Network Analysis Graphs and Networks



Similar documents
Social Media Mining. Graph Essentials

V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005

Pajek. Program for Analysis and Visualization of Large Networks. Reference Manual List of commands with short explanation version 2.

Tools and Techniques for Social Network Analysis

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*

Discrete Mathematics & Mathematical Reasoning Chapter 10: Graphs

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis

Handout #Ch7 San Skulrattanakulchai Gustavus Adolphus College Dec 6, Chapter 7: Digraphs

Analysis of Algorithms, I

A. Mrvar: Network Analysis using Pajek 1. Cluster

! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis. Contents. Introduction. Maarten van Steen. Version: April 28, 2014

An overview of Software Applications for Social Network Analysis

BOUNDARY EDGE DOMINATION IN GRAPHS

Social Media Mining. Network Measures

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

Every tree contains a large induced subgraph with all degrees odd

Practical Graph Mining with R. 5. Link Analysis

Systems and Algorithms for Big Data Analytics

A comparative study of social network analysis tools

Distributed Computing over Communication Networks: Maximal Independent Set

Equivalence Concepts for Social Networks

Midterm Practice Problems

Graph/Network Visualization

Class One: Degree Sequences

Graph theory and network analysis. Devika Subramanian Comp 140 Fall 2008

Part 2: Community Detection

Institut für Informatik Lehrstuhl Theoretische Informatik I / Komplexitätstheorie. An Iterative Compression Algorithm for Vertex Cover

5.1 Bipartite Matching

Graph Theory Problems and Solutions

Triangle deletion. Ernie Croot. February 3, 2010

An Empirical Study of Two MIS Algorithms

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014

DATA ANALYSIS II. Matrix Algorithms

Groups and Positions in Complete Networks

CIS 700: algorithms for Big Data

Simplified External memory Algorithms for Planar DAGs. July 2004

Semi-Matchings for Bipartite Graphs and Load Balancing

Scheduling Shop Scheduling. Tim Nieberg

Follow links for Class Use and other Permissions. For more information send to:

Social Networking Analytics

136 CHAPTER 4. INDUCTION, GRAPHS AND TREES

Types of Degrees in Bipolar Fuzzy Graphs

Graph Mining and Social Network Analysis

A 2-factor in which each cycle has long length in claw-free graphs

Chapter 1. Introduction

Distance Degree Sequences for Network Analysis

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Global secure sets of trees and grid-like graphs. Yiu Yu Ho

Zachary Monaco Georgia College Olympic Coloring: Go For The Gold

A MEASURE OF GLOBAL EFFICIENCY IN NETWORKS. Aysun Aytac 1, Betul Atay 2. Faculty of Science Ege University 35100, Bornova, Izmir, TURKEY

Discrete Applied Mathematics. The firefighter problem with more than one firefighter on trees

Minimum degree condition forcing complete graph immersion

On the independence number of graphs with maximum degree 3

Math 179: Graph Theory

CS 598CSC: Combinatorial Optimization Lecture date: 2/4/2010

CMPSCI611: Approximating MAX-CUT Lecture 20

Walk-Based Centrality and Communicability Measures for Network Analysis

SCAN: A Structural Clustering Algorithm for Networks

Graphical degree sequences and realizations

On Integer Additive Set-Indexers of Graphs

Complex Networks Analysis: Clustering Methods

Evaluating Software Products - A Case Study

Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs

All trees contain a large induced subgraph having all degrees 1 (mod k)

Mining Social-Network Graphs

2. (a) Explain the strassen s matrix multiplication. (b) Write deletion algorithm, of Binary search tree. [8+8]

Approximation Algorithms

Small Maximal Independent Sets and Faster Exact Graph Coloring

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

Degree Hypergroupoids Associated with Hypergraphs

Outline 2.1 Graph Isomorphism 2.2 Automorphisms and Symmetry 2.3 Subgraphs, part 1

CSE331: Introduction to Networks and Security. Lecture 8 Fall 2006

An Introduction to APGL

Discrete Mathematics Problems

Social network analysis with R sna package

Nimble Algorithms for Cloud Computing. Ravi Kannan, Santosh Vempala and David Woodruff

Connectivity and cuts

Network Analysis Basics and applications to online data

OPTIMAL DESIGN OF DISTRIBUTED SENSOR NETWORKS FOR FIELD RECONSTRUCTION

in R Binbin Lu, Martin Charlton National Centre for Geocomputation National University of Ireland Maynooth The R User Conference 2011

Social and Technological Network Analysis. Lecture 3: Centrality Measures. Dr. Cecilia Mascolo (some material from Lada Adamic s lectures)

SGL: Stata graph library for network analysis

A discussion of Statistical Mechanics of Complex Networks P. Part I

Outline. NP-completeness. When is a problem easy? When is a problem hard? Today. Euler Circuits

Disjoint Compatible Geometric Matchings

Simple Graphs Degrees, Isomorphism, Paths

Discrete Mathematics. Hans Cuypers. October 11, 2007

A Study of Sufficient Conditions for Hamiltonian Cycles

GRAPH THEORY LECTURE 4: TREES

Finding and counting given length cycles

NETZCOPE - a tool to analyze and display complex R&D collaboration networks

Transcription:

Course on Social Network Analysis Graphs and Networks Vladimir Batagelj University of Ljubljana Slovenia

V. Batagelj: Social Network Analysis / Graphs and Networks 1 Outline 1 Graph............................... 1 2 Graph / Sets............................ 2 3 Graph / Neighbors........................ 3 4 Graph / Matrix.......................... 4 5 Subgraph............................. 5 6 Graph characteristics....................... 6 7 Walks............................... 7 8 Shortest paths........................... 8 9 Equivalence relations and Partitions............... 9 10 Connectivity............................ 10 11 Reduction............................. 11 12 Biconnectivity........................... 12

V. Batagelj: Social Network Analysis / Graphs and Networks 2 13 Special graphs........................... 13 14 Krebs Internet Industry Companies............... 14 15 Cores............................... 15 17 6-core of Krebs Internet Industry Companies.......... 17 18 Networks............................. 18 19 Computational Geometry Valued Core.............. 19 20 Sources.............................. 20

V. Batagelj: Social Network Analysis / Graphs and Networks 1 Graph actor vertex, node relation line, edge, arc, link, tie arc = directed line, (a, d) a is the initial vertex, d is the terminal vertex. edge = undirected line, (c: d) c and d are end vertices.

V. Batagelj: Social Network Analysis / Graphs and Networks 2 Graph / Sets V = {a, b, c, d, e, f, g, h, i, j, k, l} A = {(a, b), (a, d), (a, f), (b, a), (b, f), (c, b), (c, c), (c, g), (c, g), (e, c), (e, f), (e, h), (f, k), (h, d), (h, l), (j, h), (l, e), (l, g), (l, h)} E = {(b: e), (c: d), (e: g), (f: h)} G = (V, A, E) L = A E A = undirected graph; E = directed graph. Pajek: GraphSet; TinaSet.

V. Batagelj: Social Network Analysis / Graphs and Networks 3 Graph / Neighbors N A (a) = {b, d, f} N A (b) = {a, f} N A (c) = {b, c, g, g} N A (e) = {c, f, h} N A (f) = {k} N A (h) = {d, l} N A (j) = {h} N A (l) = {e, g, h} N E (e) = {b, g} N E (c) = {d} N E (f) = {h} Pajek: GraphList; TinaList. N(v) = N A (v) N E (v), = N out (v), N in (v) Star in v, S(v) is the set of all lines with v as their initial vertex.

V. Batagelj: Social Network Analysis / Graphs and Networks 4 Graph / Matrix a b c d e f g h i j k l a 0 1 0 1 0 1 0 0 0 0 0 0 b 1 0 0 0 1 1 0 0 0 0 0 0 c 0 1 1 1 0 0 2 0 0 0 0 0 d 0 0 1 0 0 0 0 0 0 0 0 0 e 0 1 1 0 0 1 1 1 0 0 0 0 f 0 0 0 0 0 0 0 1 0 0 1 0 g 0 0 0 0 1 0 0 0 0 0 0 0 h 0 0 0 1 0 1 0 0 0 0 0 1 i 0 0 0 0 0 0 0 0 0 0 0 0 j 0 0 0 0 0 0 0 1 0 0 0 0 k 0 0 0 0 0 0 0 0 0 0 0 0 l 0 0 0 0 1 0 1 1 0 0 0 0 Pajek: GraphMat; TinaMat, picture picture. Graph G is simple if in the corresponding matrix all entries are 0 or 1.

V. Batagelj: Social Network Analysis / Graphs and Networks 5 Subgraph A subgraph H = (V, L ) of a given graph G = (V, L) is a graph which set of lines is a subset of set of lines of G, L L, its vertex set is a subset of set of vertices of G, V V, and it contains all end-vertices of L. A subgraph can be induced by a given subset of vertices or lines.

V. Batagelj: Social Network Analysis / Graphs and Networks 6 Graph characteristics number of vertices n = V number of lines m = L degree of vertex v, deg(v) = number of lines with v as end-vertex; indegree of vertex v, indeg(v) = number of lines with v as terminal vertex (end-vertex is both initial and terminal); outdegree of vertex v, outdeg(v) = number of lines with v as initial vertex. v V n = 12, m = 23, indeg(e) = 3, outdeg(e) = 5, deg(e) = 6 indeg(v) = outdeg(v) = A + 2 E, deg(v) = 2 L E 0 v V v V

V. Batagelj: Social Network Analysis / Graphs and Networks 7 Walks length s of the walk s is the number of lines it contains. s = (j, h, l, g, e, f, h, l, e, c, b, a) s = 11 A walk is closed iff its initial and terminal vertex coincide. If we don t consider the direction of the lines in the walk we get a semiwalk or chain. trail walk with all lines different path walk with all vertices different cycle closed walk with all internal vertices different A graph is acyclic if it doesn t contain any cycle.

V. Batagelj: Social Network Analysis / Graphs and Networks 8 Shortest paths A shortest path from u to v is also called a geodesic from u to v. Its length is denoted by d(u, v). If there is no walk from u to v then d(u, v) =. d(j, a) = (j, h, d, c, b, a) = 5 d(a, j) = ˆd(u, v) = max(d(u, v), d(v, u)) is a distance. The diameter of a graph equals to the distance between the most distant pair of vertices: D = max u,v V d(u, v).

V. Batagelj: Social Network Analysis / Graphs and Networks 9 Equivalence relations and Partitions Let C = {C i } be a set of subsets of V, C i V. C is a partition of V iff i C i = V and for i j, C i C j =. A relation R on V is an equivalence relation iff it is reflexive v V : vrv, symmetric u, v V : urv vru, and transitive u, v, z V : urz zrv urv. Each equivalence relation determines a partition into equivalence classes [v] = {u : vru}. Each partition C determines an equivalence relation urv C C : u C v C. k-neighbors of v is the set of vertices on distance k from v, N k (v) = {u v : d(v, u) = k}. The set of all k-neighbors, k = 0, 1,... of v is a partition of V. k-neighborhood of v, N (k) (v) = {u v : d(v, u) k}.

V. Batagelj: Social Network Analysis / Graphs and Networks 10 Connectivity Vertex u is reachable from vertex v iff there exists a walk with initial vertex v and terminal vertex u. Vertex v is weakly connected with vertex u iff there exists a semiwalk with v and u as its end-vertices. Vertex v is strongly connected with vertex u iff they are mutually reachable. Weak and strong connectivity are equivalence relations. Equivalence classes induce weak/strong components.

V. Batagelj: Social Network Analysis / Graphs and Networks 11 Reduction If we shrink every strong component of a given graph into a vertex, delete all loops and identify parallel arcs the obtained reduced graph is acyclic. For every acyclic graph an ordering / level function i : V IN exists s.t. (u, v) A i(u) < i(v).

V. Batagelj: Social Network Analysis / Graphs and Networks 12 Biconnectivity Vertices u and v are biconnected iff they are connected (in both directions) by two independent (no common internal vertex) paths. Biconnectivity determines a partition of the set of lines. A vertex is an articulation vertex iff its deletion increases the number of weak components in a graph.

V. Batagelj: Social Network Analysis / Graphs and Networks 13 Special graphs A weakly connected graph G is a tree iff it doesn t contain loops and semicycles of length at least 3. A graph G = (V, L) is bipartite iff its set of vertices V can be partitioned into two sets V 1 and V 2 such that every line from L has one end-vertex in V 1 and the other in V 2. A simple undirected graph is complete, K n, iff it contains all possible edges.

V. Batagelj: Social Network Analysis / Graphs and Networks 14 Krebs Internet Industry Companies This network shows a subset of the total internet industry during the period from 1998 to 2001, n = 219, m = 631. Two companies are connected with a line if they have announced a joint venture, strategic alliance or other partnership (red - content, blue - infrastructure, yellow - commerce). Network source: http://www.orgnet.com/netindustry.html.

V. Batagelj: Social Network Analysis / Graphs and Networks 15 Cores The notion of core was introduced by Seidman in 1983. A subgraph H = (W, L W ) induced by the set W in a graph G = (V, L) is a k-core or a core of order k iff v W : deg H (v) k, and H is a maximum subgraph with this property.

V. Batagelj: Social Network Analysis / Graphs and Networks 16... Cores The core of maximum order is also called the main core. The core number of vertex v is the highest order of a core that contains this vertex. The degree deg(v) can be: in-degree, out-degree, in-degree + out-degree,..., determining different types of cores. The cores are nested: i < j = H j H i Cores are not necessarily connected subgraphs. The notion of cores can be generalized to valued cores.

V. Batagelj: Social Network Analysis / Graphs and Networks 17 6-core of Krebs Internet Industry Companies

V. Batagelj: Social Network Analysis / Graphs and Networks 18 Networks A graph with additional information on vertices and/or lines is called a network. In Pajek this information is represented using vectors (numerical properties of vertices) and partitions (categorical/nominal properties of vertices). Numerical values can be assigned also to lines line values. Example: Authors collaboration network based on the Computational Geometry Database. Two authors are linked with an edge, iff they wrote a common paper. The weight of the edge is the number of publications they wrote together. Problem of cleaning. Different names: Pankaj K. Agarwal, P. Agarwal, Pankaj Agarwal, and P.K. Agarwal. n = 9072, m = 13567/22577 n = 7343, m = 11898.

V. Batagelj: Social Network Analysis / Graphs and Networks 19 Computational Geometry Valued Core

V. Batagelj: Social Network Analysis / Graphs and Networks 20 Sources Vladimir Batagelj, Andrej Mrvar: Pajek. http://vlado.fmf.uni-lj.si/pub/networks/pajek/ Vladimir Batagelj: Slides on network analysis. http://vlado.fmf.uni-lj.si/pub/networks/doc/ Vladimir Batagelj: Papers on network analysis. http://vlado.fmf.uni-lj.si/vlado/vladodat.htm Andrej Mrvar: Social network analysis. http://mrvar.fdv.uni-lj.si/sola/info4/programe.htm B. Jones: Computational geometry database. http://compgeom.cs.uiuc.edu/ jeffe/compgeom/biblios.html ftp://ftp.cs.usask.ca/pub/geometry/.