Social and Technological Network Analysis. Lecture 3: Centrality Measures. Dr. Cecilia Mascolo (some material from Lada Adamic s lectures)



Similar documents
Social Media Mining. Network Measures

How To Understand The Network Of A Network

Social and Economic Networks: Lecture 1, Networks?

DATA ANALYSIS II. Matrix Algorithms

Practical Graph Mining with R. 5. Link Analysis

SGL: Stata graph library for network analysis

Social Media Mining. Graph Essentials

Predicting Influentials in Online Social Networks

Social Networks and Social Media

Network Analysis Basics and applications to online data

Walk-Based Centrality and Communicability Measures for Network Analysis

Social network analysis with R sna package

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

Warshall s Algorithm: Transitive Closure

NodeXL for Network analysis Demo/hands-on at NICAR 2012, St Louis, Feb 24. Peter Aldhous, San Francisco Bureau Chief.

Course on Social Network Analysis Graphs and Networks

Microblogging Queries on Graph Databases: An Introspection

Part 2: Community Detection

Max Flow. Lecture 4. Optimization on graphs. C25 Optimization Hilary 2013 A. Zisserman. Max-flow & min-cut. The augmented path algorithm

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Complex Networks Analysis: Clustering Methods

Strong and Weak Ties

Graph Processing and Social Networks

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*

Discrete Mathematics & Mathematical Reasoning Chapter 10: Graphs

Distributed Computing over Communication Networks: Maximal Independent Set

V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005

Data Structures and Algorithms Written Examination

Load balancing in a heterogeneous computer system by self-organizing Kohonen network

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging

5.1 Bipartite Matching

Systems and Algorithms for Big Data Analytics

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

Why? A central concept in Computer Science. Algorithms are ubiquitous.

Definition Given a graph G on n vertices, we define the following quantities:

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?

Reconstruction and Analysis of Twitter Conversation Graphs

Extracting Information from Social Networks

Network-based spam filter on Twitter

Dynamic Network Resources Allocation in Grids through a Grid Network Resource Broker

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding

Graph Mining Techniques for Social Media Analysis

Statistical and computational challenges in networks and cybersecurity

The Clar Structure of Fullerenes

Follow links for Class Use and other Permissions. For more information send to:

in R Binbin Lu, Martin Charlton National Centre for Geocomputation National University of Ireland Maynooth The R User Conference 2011

Social network and smoking: a pilotstudyamonghigh-schoolstudents

Reductions & NP-completeness as part of Foundations of Computer Science undergraduate course

Practical statistical network analysis (with R and igraph)

Finding and counting given length cycles

Network Analysis of a Large Scale Open Source Project

Graph/Network Visualization

Distributed Computing over Communication Networks: Topology. (with an excursion to P2P)

DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS

Hidden Markov Models

Analysis of Web Archives. Vinay Goel Senior Data Engineer

Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs

Structural constraints in complex networks

! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)

Project Report BIG-DATA CONTENT RETRIEVAL, STORAGE AND ANALYSIS FOUNDATIONS OF DATA-INTENSIVE COMPUTING. Masters in Computer Science

Temporal Dynamics of Scale-Free Networks

Dynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute

Internet Firewall CSIS Packet Filtering. Internet Firewall. Examples. Spring 2011 CSIS net15 1. Routers can implement packet filtering

Protographs: Graph-Based Approach to NetFlow Analysis. Jeff Janies RedJack FloCon 2011

A simpler and better derandomization of an approximation algorithm for Single Source Rent-or-Buy

Segmentation & Clustering

CSE 326, Data Structures. Sample Final Exam. Problem Max Points Score 1 14 (2x7) 2 18 (3x6) Total 92.

Option 1: empirical network analysis. Task: find data, analyze data (and visualize it), then interpret.

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

1 o Semestre 2007/2008

On Orchestrating Virtual Network Functions

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014

Social Network Discovery based on Sensitivity Analysis

Algorithms for representing network centrality, groups and density and clustered graph representation

Information Network or Social Network? The Structure of the Twitter Follow Graph

Lecture 2.1 : The Distributed Bellman-Ford Algorithm. Lecture 2.2 : The Destination Sequenced Distance Vector (DSDV) protocol

Network-Based Tools for the Visualization and Analysis of Domain Models

Mining Social-Network Graphs

Beyond the Stars: Revisiting Virtual Cluster Embeddings

Graph Mining and Social Network Analysis

Max Flow, Min Cut, and Matchings (Solution)

What is SNA? A sociogram showing ties

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Do All Birds Tweet the Same? Characterizing Twitter Around the World

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

A number of tasks executing serially or in parallel. Distribute tasks on processors so that minimal execution time is achieved. Optimal distribution

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis

Scheduling Shop Scheduling. Tim Nieberg

Evaluation of a New Method for Measuring the Internet Degree Distribution: Simulation Results

Social Network Mining

Bandwidth Allocation in a Network Virtualization Environment

Manifold Learning Examples PCA, LLE and ISOMAP

Transcription:

Social and Technological Network Analysis Lecture 3: Centrality Measures Dr. Cecilia Mascolo (some material from Lada Adamic s lectures)

In This Lecture We will introduce the concept of centrality and the various measures which have been associated to this concept. We will show an applicahon.

Centrality Finding out which is the most central node is important: It could help disseminahng informahon in the network faster It could help stopping epidemics It could help protechng the network from breaking

Centrality: visually Centrality can have various meanings: Y X Y X Y X X Y indegree outdegree betweenness closeness

Degree Centrality When is the number of connechons the best centrality measure? o people who will do favors to you o people you can talk to / have a beer with

NormalizaHon Divide for the max number of nodes (N- 1)

Freeman s Network Centrality How do we calculate the value of centrality of the network To check how much variahon there is among the nodes (heterogeneity?) Max value of the above: when network is a star: 1 node has C=N- 1 and all others (N- 1) have 1. C D = g & Max value of Degree Centrality in the Network "C # D (n * )!C D (i) $ i=1 % [(N!1)(N! 2)]

Freeman Network Centrality Explained ExplanaHon of the denominator: In the star topology one node has degree N- 1 and all other nodes have degree of 1: 0+((n- 1)- 1)*n- 1= (n- 2) *(n- 1)

Freeman s Network Centrality C D = 1.0 C D = 0.167 4+4+4+4+4/5*4 1+0+0+0+1/4*3=1/6 1+1+0+1+0+1+1/6*5=5/30 C D = 0.167

Examples: Financial Networks

When is Degree Centrality not so good?

When is Degree Centrality not so good (2)? Ability to broker between groups Likelihood that informahon originahng anywhere in the network reaches you

Betweenness Centrality IntuiHon: how many pairs of individuals would have to go through you in order to reach one another in the minimum number of hops? who has higher betweenness, X or Y? Y X X Y

Betweenness (Formally) C B (i) = " j!k g jk (i) / g jk Where g jk (i)= the number of shortest paths connechng jk passing through i g jk = total number of shortest paths Usually normalized by: C B ' (i) = C B (i ) /[(n "1)(n " 2) /2] number of pairs of vertices excluding the vertex itself

Betweenness: Example n A lies between no two other verhces n B lies between A and 3 other verhces: C, D, and E n C lies between 4 pairs of verhces (A,D),(A,E), (B,D),(B,E) n note that there are no alternahve paths for these pairs to take, so C gets full credit A B C D E

Facebook Example (Adamic) Color(from blue to red) is betweenness Size is degree.

Closeness Centrality What if it is not so important to have many direct friends? Or be between others But one shll wants to be in the middle of things, not too far from the center

Closeness Centrality (Formally) Closeness is based on the length of the average shortest path between a vertex and all verhces in the graph # N & C c (i) = %" d(i, j) ( $ % j=1 '( )1! C C ' (i) = (C C (i)) /(N "1)

Closeness: Example A B C D E C c ' (A) = $ & & & & % N " j=1 ' d(a, j) ) ) N #1 ) ) ( #1 $ = 1+ 2 + 3 + 4 ' % & 4 ( ) #1 $ = 10 % & 4 ' ( ) #1 = 0.4

(1+1+2+3+4+4/6)^- 1=6/15=0.4 Examples

Example: Facebook (Adamic) Degree is the size Color is closeness

Eigenvector Centrality Degree Centrality depends on having many connechons: but what if these connechons are preay isolated? A central node should be one connected to powerful nodes Neighbourhood of Xv Adjacency Matrix of the graph

Eigenvector Centrality Algorithm 1. Start by assigning centrality score of 1 to all nodes (v_i = 1 for all i in the network) 2. Recompute scores of each node as weighted sum of centralihes of all nodes in a node's neighborhood: v_i = sum_{j \in N} a_{ij}*v_j 3. Normalize v by dividing each value by the largest value 4. Repeat steps 2 and 3 unhl values of v stop changing.

Example

Katz Centrality Closeness counts the number of shortest paths, but one could count the number of paths. Alpha is an aaenuahon factor A^k (ij) indicates if ij are connected by k- 1 hops

ApplicaHon of Centrality Measures to Big Data

People Tweet While They Watch

Repeated (retweet) content starts with RT Address other users with an @ Rich Media embeds via links Tags start with # RT: @jowyang If you are watching the debate you re invited to parhcipate in #tweetdebate Here is the 411 hap://hnyurl.com/3jdy67 ANATOMY OF A TWEET

Tweet Crawl Three hashtags: #current #debate08 #tweetdebate 97 mins debate + 53 mins following = 2.5 hours total. 3,238 tweets from 1,160 people. 1,824 tweets from 647 people during the debate. 1,414 tweets from 738 people post debate. 577 @ menhons (reciprocity!) 266 menhons during the debate 311 a?erwards. Low RT: 24 retweets in total 6 during 18 aperwards.

Volume of Tweets

Discussion Segments

Cluster of tags

Twiaer MenHons John Tweets: Hey @mary, my person is winning! Large node size= High eigenvector centrality

AutomaHc Discovery through Centrality High Eigenvector Centrality Figures on Twiaer from the First US PresidenHal Debate of 2008.

Centrality

Sinks High in- degree but poor centrality: @current, who ran the Hack the Debate program, and @jowyang, an employee of Forrester Research who uses Twiaer as a personal, not corporately related microblog.

Tweets to Terms

Twiaer as ReacHon

Summary We have introduced various measures of centrality and explained the pros and cons We have illustrated one example of use of centrality in a Twiaer related example

References Kleinberg s book: Chapter 3. Ack: L. Adamic s slides M. Newmann. Networks. Oxford University Press. April 2010. David A. Shamma, Lyndon Kennedy, and Elizabeth F. Churchill. 2009. Tweet the debates: understanding community annota;on of uncollected sources. In Proceedings of the first SIGMM workshop on Social media (WSM '09). ACM, New York, NY, USA