Mathematical Foundations of Social Network Analysis

Similar documents
Strength of Weak Ties, Structural Holes, Closure and Small Worlds. Steve Borgatti MGT 780, Spring 2010 LINKS Center, U of Kentucky

Social Media Mining. Graph Essentials

Week 3. Network Data; Introduction to Graph Theory and Sociometric Notation

Strong and Weak Ties

Handout #Ch7 San Skulrattanakulchai Gustavus Adolphus College Dec 6, Chapter 7: Digraphs

Course on Social Network Analysis Graphs and Networks

V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005

Network Analysis For Sustainability Management

Midterm Practice Problems

Understanding Sociograms

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

Network Analysis of Project Communication Based on Graph Theory

Discrete Mathematics & Mathematical Reasoning Chapter 10: Graphs

Practical Graph Mining with R. 5. Link Analysis

Network (Tree) Topology Inference Based on Prüfer Sequence

Lecture 17 : Equivalence and Order Relations DRAFT

CSV886: Social, Economics and Business Networks. Lecture 2: Affiliation and Balance. R Ravi ravi+iitd@andrew.cmu.edu

Social Media Mining. Network Measures

THE ROLE OF SOCIOGRAMS IN SOCIAL NETWORK ANALYSIS. Maryann Durland Ph.D. EERS Conference 2012 Monday April 20, 10:30-12:00

Equivalence Concepts for Social Networks

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

Chapter 6: Graph Theory

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

Lecture 16 : Relations and Functions DRAFT

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Competitive Analysis of On line Randomized Call Control in Cellular Networks

Social Networking Analytics

Metabolic Network Analysis

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*

A comparative study of social network analysis tools

! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)

Topology and Topological Rules Geometric properties that are maintained in spatial databases

Graph/Network Visualization

Graph theoretic approach to analyze amino acid network

Network Analysis Basics and applications to online data

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis. Contents. Introduction. Maarten van Steen. Version: April 28, 2014

DATA ANALYSIS II. Matrix Algorithms

Business Intelligence and Process Modelling

Protographs: Graph-Based Approach to NetFlow Analysis. Jeff Janies RedJack FloCon 2011

Sociology and CS. Small World. Sociology Problems. Degree of Separation. Milgram s Experiment. How close are people connected? (Problem Understanding)

Information Flow and the Locus of Influence in Online User Networks: The Case of ios Jailbreak *

1. Write the number of the left-hand item next to the item on the right that corresponds to it.

Network Analysis: Lecture 1. Sacha Epskamp

OPERATING SYSTEMS DEADLOCKS

How To Understand The Network Of A Network

My work provides a distinction between the national inputoutput model and three spatial models: regional, interregional y multiregional

A Review And Evaluations Of Shortest Path Algorithms

ALGEBRA. sequence, term, nth term, consecutive, rule, relationship, generate, predict, continue increase, decrease finite, infinite

Performance of networks containing both MaxNet and SumNet links

A Network Approach to Define Modularity of Components in Complex Products

Complexity Theory. IE 661: Scheduling Theory Fall 2003 Satyaki Ghosh Dastidar

What is SNA? A sociogram showing ties

Statistical Analysis of Complete Social Networks

Chinese postman problem

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan

System Interconnect Architectures. Goals and Analysis. Network Properties and Routing. Terminology - 2. Terminology - 1

Pythagorean Theorem: Proof and Applications

Mining Social Network Graphs

Problem Set 7 Solutions

Network theory and systemic importance

Network Theory: 80/20 Rule and Small Worlds Theory

Weather Direct Displays show Lost Forecast (blank boxes in the picture icons)

Scheduling. Getting Started. Scheduling 79

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?

Social Networks and Social Media

Graphical degree sequences and realizations

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.

Graph Theory Problems and Solutions

Cpt S 223. School of EECS, WSU

SGL: Stata graph library for network analysis

Simple Graphs Degrees, Isomorphism, Paths

Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs

Introduction to Metropolitan Area Networks and Wide Area Networks

136 CHAPTER 4. INDUCTION, GRAPHS AND TREES

Chapter 1. Introduction

Examining graduate committee faculty compositions- A social network analysis example. Kathryn Shirley and Kelly D. Bradley. University of Kentucky

Social Network Analysis: Introduzione all'analisi di reti sociali

6.852: Distributed Algorithms Fall, Class 2

2. (a) Explain the strassen s matrix multiplication. (b) Write deletion algorithm, of Binary search tree. [8+8]

A GRAPH-THEORETIC DEFINITION OF A SOCIOMETRIC CLIQUE *

Social network analysis with R sna package

Distributed Computing over Communication Networks: Maximal Independent Set

6.3 Conditional Probability and Independence

Extremal Wiener Index of Trees with All Degrees Odd

Networks and Paths. The study of networks in mathematics began in the middle 1700 s with a famous puzzle called the Seven Bridges of Konigsburg.

6. Vectors Scott Surgent (surgent@asu.edu)

Euler Paths and Euler Circuits

Analysis of Algorithms, I

: RICH NETWORKS: EVALUATING UNIVERSITY-HIGH SCHOOLS PARTNERSHIPS USING GRAPH ANALYSIS

Level 2 Routing: LAN Bridges and Switches

Zachary Monaco Georgia College Olympic Coloring: Go For The Gold

Network Analysis of a Large Scale Open Source Project

Computer Network. Interconnected collection of autonomous computers that are able to exchange information

Decision Mathematics D1. Advanced/Advanced Subsidiary. Friday 12 January 2007 Morning Time: 1 hour 30 minutes. D1 answer book

A linear combination is a sum of scalars times quantities. Such expressions arise quite frequently and have the form

The Basics of Graphical Models

Solutions Brief for Financial Institutions A Solutions Brief By Shoretel

Introduction to Networks

Transcription:

Mathematical Foundations of Social Network Analysis Steve Borgatti

Three Representations Graphs Matrices Relations

Graphs Networks represented mathematically as graphs A graph G(V,E) consists of Set of nodes vertices V representing actors Set of lines edges E representing ties An edge is an unordered pair of nodes (u,v) Nodes u and v adjacent if (u,v) E So E is subset of set of all pairs of nodes Typically drawn without arrow heads

Digraphs Digraph D(V,E) consists of Set of nodes V Set of directed arcs E An arc is an ordered pair of nodes (u,v) (u,v) E indicates u sends arc to v (u,v) E does not imply that (v,u) E Ties drawn with arrow heads, which can be in both directions Bob Betsy Biff Bonnie Betty

Directed vs undirected graphs Undirected relations Attended meeting with Communicates daily with Directed relations Lent money to Logically vs empirically directed ties Empirically, even undirected relations can be non-symmetric due to measurement error Bob Biff Bonnie Betty Betsy

Strength of Tie We can attach values to ties, representing quantitative attributes Strength of relationship Information capacity of tie Rates of flow or traffic across tie Distances between nodes Probabilities of passing on information Frequency of interaction Valued graphs or vigraphs Bob 3 6 Jane 8 1 Betsy

Adjacency Matrices Friendship Jim Jill Jen Joe Jim - 1 0 1 Jill 1-1 0 Jen 0 1-1 Joe 1 0 1 - Jen 1 Jill 9 3 Jim Proximity Jim Jill Jen Joe Jim - 3 9 2 Jill 3-1 15 Jen 9 1-3 Joe 2 15 3-3 15 Joe 2

Walks, Trails, Paths Path: can t repeat node 10 12 1-2-3-4-5-6-7-8 Not 7-1-2-3-7-4 11 8 9 Trail: can t repeat line 1-2-3-1-7-8 2 7 Not 7-1-2-7-1-4 Walk: unrestricted 3 1 6 1-2-3-1-2-7-1-7-1 4 5

Things that move thru networks Used goods Money Packages Personnel Orders Innovations Practices Gossip / information E-mail Infections Attitudes Influence? Borgatti, S.P. 2005. Centrality and network flow. Social Networks. 27(1): 55-71.

Used Goods Process Canonical example: passing along used paperback novel Single object in only one place at a time Doesn t (usually) travel between same pair twice Could be received by the same person twice A--B--C--B--D--E--B--F--C... Travels along graph-theoretic trails

Package Delivery Process Example: package delivered by postal service Single object at only one place at one time Map of network enables the intelligent object to select only the shortest paths to all destinations Travels along shortest paths (geodesics)

Mooch Process Examples Obnoxious homeless relative who visits for six months until kick out and moves to next relative Personnel flows between firms In just one place at a time Doesn t repeat a node (bridges burned) Travels along paths

Monetary Exchange Process Canonical example: specific dollar bill moving through the economy Single object in only one place at a time Can travel between same pair more than once A--B--C--B--C--D--E--B--C--B--C...

Gossip Process Example: juicy story moving through informal network Multiple copies exist simultaneously Person tells only one person at a time* Doesn t travel between same pair twice Can reach same person multiple times * More generally, they tell a very limited number at a time.

E-Mail Process Example: forwarded jokes and virus warnings e-mail viruses themselves Multiple copies exist simultaneously All (or many) connected nodes told simultaneously Except, perhaps, the immediate source

Influence Process Example: attitude formation Multiple copies exist simultaneously Multiple simultaneous transmission, even between the same pairs of nodes

Viral Infection Process Example: virus which activates effective immunological response or which kills host Multiple copies may exist simultaneously Cannot revisit a node A--B--C--E--D--F... Travels along graph-theoretic paths

Package Delivery Process Example: package delivered by postal service Single object at only one place at one time Map of network enables the intelligent object to select only the shortest paths to all destinations

Two Properties of Flow Processes Sequence type: path, trail, walk path: can t revisit node nor edge (tie) trail: can revisit node but not edges walk: can revisit edges & nodes Deterministic vs non-deterministic blind vs guided always chooses best route; aware of map Combine into 4-way traversal type property: geodesics, paths, Copyright trails, 2006 Steve walks Borgatti

Traversal Dimension of Flow Sequence type: path, trail, walk path: can t revisit node nor edge (tie) trail: can revisit node but not edges walk: can revisit edges & nodes Deterministic vs non-deterministic blind vs guided always chooses best route; aware of map Combine into 4-way traversal type property: geodesics, paths, trails, walks

Flow Method Property Duplication vs transfer (copy vs move) transfer/move: only one place at one time duplication/copy: multiple copies exist Serial vs parallel duplication serial: only one transmission at a time parallel: broadcast to all surrounding nodes Combine into method 3-way property: parallel dup., serial dup., transfer

Simplified Typology goods information geodesics paths trails walks parallel duplication <no process> internet nameserver e-mail broadcast attitude influencing serial duplication mitotic reproduction viral infection gossip emotional support transfer package delivery mooch used goods money exchange *Note: Names not to be taken too seriously. Markov

Implications for SNA Measures like centrality are constructed by counting things like paths or walks Which measure is sensible for a given application depends on how things flow Different measures for different processes Most common flow processes do not have measures designed for them Currently misapply std measures to all situations

Components Maximal sets of nodes in which every node can reach every other by some path (no matter how long) A connected graph has just one component It is relations (types of tie) that define different networks, not components. A graph that has two components remains one (disconnected) graph.

Components in Directed Graphs Strong component There is a directed path from each member of the component to every other Weak component There is an undirected path (a weak path) from every member of the component to every other Is like symmetrizing via maximum method

A network with 4 components Who you go to so that you can say I ran it by, and she says... Recent acquisition Older acquisitions Original company Data drawn from Cross, Borgatti & Parker 2001.

Length & Distance Length of a path is number of links it has Distance between two nodes is length of shortest path (aka geodesic) 11 2 10 8 7 12 9 3 1 6 4 5

Austin Powers: The spy who shagged me Robert Wagner Let s make it legal Wild Things What Price Glory Barry Norton A Few Good Man Monsieur Verdoux

Implications of Distance For something flowing across links, distance correlates Inversely with probability of arrival Inversely with time until arrival Positively with amount of distortion or change

Geodesic Distance Matrix 0 1 2 4 3 3 4 g 1 0 1 3 2 2 3 f 2 1 0 2 1 1 2 e 4 3 2 0 1 2 3 d 3 2 1 1 0 1 2 c 3 2 1 2 1 0 1 b 4 3 2 3 2 1 0 a g f e d c b a

Independent Paths A set of paths is node-independent if they share no nodes (except beginning and end) They are line-independent if they share no lines S T 2 node-independent paths from S to T 3 line-independent paths from S to T

Cutpoint A node which, if deleted, would increase the number of components Bob Biff Bonnie Bill Betty Betsy

Bridge s a b d e c f A tie that, if removed, would increase the g number of components h i p j m k l q n r o

Local Bridge of Degree K A tie that connects nodes that would otherwise be at least k steps apart A B

Granovetter Transitivity A C B D

Granovetter s SWT Theory Bridges are sources of novel information Only weak ties can be bridges Strong ties create g-transitivity Two nodes connected by a strong tie will have mutual acquaintances (ties to same 3 rd parties) Ties that are part of transitive triples cannot be bridges or local bridges Therefore, only weak ties can be bridges Weak ties are sources of novel information

In other words Strong ties are embedded in tight homophilous and homogeneous clusters You don t often hear new stuff from your strong ties because Strong ties embedded in tight homogeneous and homophilous clusters that recirculate same old stories You know all the same people as your strong ties Weak ties a source of novel information Strong ties are locally cohesive but weak ties are globally cohesive