A comparative study of social network analysis tools



Similar documents
A comparative study of social network analysis tools

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan

Tools and Techniques for Social Network Analysis

Graph Visualization Tools: A Comparative Analysis

Temporal Visualization and Analysis of Social Networks

Graph/Network Visualization

An overview of Software Applications for Social Network Analysis

Open Source Software Developer and Project Networks

A Survey on Social Network Analysis for Counter- Terrorism

Social Network Analysis: Visualization Tools

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

Visualization of Communication Patterns in Collaborative Innovation Networks. Analysis of some W3C working groups

Visualizing bibliometric networks

Practical Graph Mining with R. 5. Link Analysis

Social Media Mining. Graph Essentials

Visualization of Social Networks in Stata by Multi-dimensional Scaling

Course on Social Network Analysis Graphs and Networks

Social Network Analysis: Introduzione all'analisi di reti sociali

Practical statistical network analysis (with R and igraph)

Software for Social Network Analysis

An Introduction to APGL

Copyright 2008, Lada Adamic. School of Information University of Michigan

Recent Large Graph Visualization Tools : A Review

Option 1: empirical network analysis. Task: find data, analyze data (and visualize it), then interpret.

NetworkX: Network Analysis with Python

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

Week 3. Network Data; Introduction to Graph Theory and Sociometric Notation

Introduction to Networks and Business Intelligence

Cluster detection algorithm in neural networks

Visual Mining of Multi-Modal Social Networks at Different Abstraction Levels

Groups and Positions in Complete Networks

Software tools for Complex Networks Analysis. Fabrice Huet, University of Nice Sophia- Antipolis SCALE (ex-oasis) Team

Department of Cognitive Sciences University of California, Irvine 1

Graphs, Networks and Python: The Power of Interconnection. Lachlan Blackhall - lachlan@repositpower.com

Social Network Analysis and Usability of our Own

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

DATA ANALYSIS II. Matrix Algorithms

Network-Based Tools for the Visualization and Analysis of Domain Models

Network VisualizationS

Proximity Analysis of Social Network using Skip Graph

Visualizing Complexity in Networks: Seeing Both the Forest and the Trees

Graph Processing and Social Networks

Journal of Statistical Software

Efficient Crawling of Community Structures in Online Social Networks

Graph Mining Techniques for Social Media Analysis

Statistical Analysis of Complete Social Networks

Subgraph Patterns: Network Motifs and Graphlets. Pedro Ribeiro

My work provides a distinction between the national inputoutput model and three spatial models: regional, interregional y multiregional

Gephi Tutorial Quick Start

Statistical and computational challenges in networks and cybersecurity

Social Networking Analytics

CS Data Science and Visualization Spring 2016

Web Data Visualization

IC05 Introduction on Networks &Visualization Nov

Detecting Communities in Science Blogs

Algorithms for representing network centrality, groups and density and clustered graph representation

Parallel Algorithms for Small-world Network. David A. Bader and Kamesh Madduri

NetworkX: Network Analysis with Python

SGL: Stata graph library for network analysis

Visualizing Networks: Cytoscape. Prat Thiru

Analyzing (Social Media) Networks with NodeXL

SOCIAL NETWORK ANALYSIS. ANZEA conference 2012 Jenny Long

Data Visualisation and Statistical Analysis Within the Decision Making Process

Social Network Mining

Open Source Visualization with OpenGraphiti. Thibault Reuille Andrew Hay

Distance Degree Sequences for Network Analysis

Graph Mining and Social Network Analysis

Exploratory Facebook Social Network Analysis

ALBERTA. Social Network Analysis for the Assessment of Learning UNIVERSITY OF. Osmar R. Zaïane Professor & Scientific Director of AICML

Evaluating Software Products - A Case Study

Visual Analysis of Complex Networks for Business Intelligence with Gephi

Sociology and CS. Small World. Sociology Problems. Degree of Separation. Milgram s Experiment. How close are people connected? (Problem Understanding)

Analyzing Actors and Their Discussion Topics by Semantic Social Network Analysis

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis

Mining Social Network Graphs

Dynamic Network Analyzer Building a Framework for the Graph-theoretic Analysis of Dynamic Networks

HIGH PERFORMANCE BIG DATA ANALYTICS

NETZCOPE - a tool to analyze and display complex R&D collaboration networks

NetworkX: Network Analysis with Python

Business Intelligence and Process Modelling

Towards automatic discovery of co-authorship networks in the Brazilian academic areas

Network Analysis of Clinical Trials on Depression: Implications for Comparative Effectiveness Research

LAB 1 Intro to Ucinet & Netdraw

Department of Social Systems and Management Discussion Paper Series

Socio-semantic network data visualization

Técnicas y Herramientas de Apoyo a la investigación (THA) II. Técnicas de Investigación Cualitativa. Social network analysis (SNA)

Sébastien Heymann

Exploration and Visualization of Post-Market Data

What is SNA? A sociogram showing ties

ForceAtlas2, A Graph Layout Algorithm for Handy Network Visualization

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

TIBCO Spotfire Network Analytics 1.1. User s Manual

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

SCAN: A Structural Clustering Algorithm for Networks

Group CRM: a New Telecom CRM Framework from Social Network Perspective

JustClust User Manual

Supporting Force Structure Review through graph visualisation and capability view improvements

Social Media Mining. Network Measures

A Performance Evaluation of Open Source Graph Databases. Robert McColl David Ediger Jason Poovey Dan Campbell David A. Bader

Visualization and Modeling of Structural Features of a Large Organizational Network

Transcription:

Membre de Membre de A comparative study of social network analysis tools David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry International Workshop on Web Intelligence and Virtual Enterprises 2 (2010)

Outline 2/26

Context Context Definition (Wikipedia) A social network is a social structure made up of individuals called "nodes," which are tied by one or more specific types of interdependency, such as friendship, common interest, etc. Sociologic analysis Sociological works (Moreno 1934, Milgram 1967, Cartwright and Harary, 1977) Web 2.0 : Renewed interest from the Web based social networks websites development. 3/26

Context Context: Social network in business For the Gartner Institute: By 2014, social networking services will replace e-mail as the primary vehicle for interpersonal communications for 20 percent of business users. (Gartner 2008) Social network analysis is getting mature. Some applications in business: Workflow study to adapt management to the real flow in a company; Identify key actors, ie. for viral marketing. These applications need adapted software. 4/26

Context Context: social networks and analysis software Network analysis software A previous statistical analysis oriented survey (Huisman & Van Duijn, 2003) Networks and needs are changing Size Complex graphs Necessity to make a new benchmark 5/26

6/26

Expected functionalities of network analysis software Expected functionalities of network analysis software 1. Representation 2. Visualization 3. Characterization by indicators 4. Community detection 7/26

Expected functionalities of network analysis software 1. Network representation as graph (Cartwright and Harary, 1977) Link orientation Undirected links (edges, ex: co-authorship) Directed (arcs, ex: e-mails sent, Enron dataset) Weight on edges With typed nodes (ex. bipartite network) 1 3 3 2 8/26

Expected functionalities of network analysis software 1. Network representation as graph *Vertices 5 *Edges 1 2 1 4 2 3 2 4 3 4 3 5 4 5 Connections (.net file format) Edge list 1 2 4 3 5 1 2, 4 2 1, 2, 4 3 2, 4, 5 4 2, 3, 5 5 3, 4 Adjacency list 1 2 3 4 5 1 0 1 0 1 0 2 1 0 1 1 0 3 0 1 0 1 1 4 1 1 1 0 1 5 0 0 1 1 0 Adjacency matrix 9/26

Expected functionalities of network analysis software 2. Visualization Aim: give a visual representation of the graph, with different approaches: Fish eye Centered on an actor Force driven visualization layouts Fruchterman Reingold (1984) Iterative algorithm Random layout F-R convergence 10/26

Expected functionalities of network analysis software 3. Characterization by indicators Global indicators at network level by: Number of nodes Number of edges Diameter Density Local indicators at node level: Number of neighboors degree 1 2 4 3 5 Distance Length of the shortest path 11/26

Expected functionalities of network analysis software 3. Characterization by indicators : how to decide if an actor is «central»? Many ways to determine central actors. Ex: Betweenness centrality Which node is the most likely to be an intermediary for a random communication? higher betweenness centrality Selection depends on what they are needed for. 12/26

Expected functionalities of network analysis software 4. Community detection Community: A set of actors having strong connexions. Community detection algorithms Newman Girvan (Newman and Girvan, 2002) Walktrap (Latapy & Pons, 2005) 13/26

14/26

Benchmark Benchmark methodology Required points: A social network analysis point of view Scalability Free for educational purposes A balance between well established software and newer ones, based on recent development standards (ergonomics, modularity and data portability). Datasets: Zachary s karate-club, DBLP 15/26

Benchmark Software comparison criteria Input/output formats Custom attribute handling Bipartite graphs specific functions Longitudinal analysis Visualization Indicators Community detection 16/26

Benchmark Studied software Gephi is an interactive visualization and exploration platform. GUESS is dedicated to visualization purposes, with several layouts. Tulip can handle over 1 million vertices and 4 millions edges. It has visualization, clustering and extension by plug-ins capabilities. GraphViz is mainly for graph visualization. UCInet is not free. It uses Pajek and Netdraw for visualization. It is specialized in statistical and matricial analysis. It calculates indicators (such as triad census, Freeman betweenness) and performs hierarchical clustering. Pajek is a Windows program for analysis and visualization of large networks. It is freely available, for noncommercial use. igraph is a free software package for creating and manipulating graphs. It also implements algorithms for some recent network analysis methods. NetworkX is a package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. JUNG, for Java Universal Network/Graph Framework, is mainly developed for creating interactive graphs in Java GUIs, JUNG has been extended with some SNA metrics. 17/26

Benchmark Selected software Stand-alone software Pajek http://pajek.imfm.si/doku.php Gephi http://gephi.org/ Libraries igraph http://igraph.sourceforge.net/ NetworkX http://networkx.lanl.gov/ 18/26

Pajek (Vladimir Batagelj and Andrej Mrvar) Benchmark Development started in 1996 Data mining oriented Many graph operators available Fast Exports 3D visualization Macro Supports matrices, adjacency lists and arcs lists oriented input files 19/26

Benchmark Gephi (Bastian M., Heymann S., Jacomy M.) Development started in 2008 Interactive GUI Uses Java Recent scriptability improvements «Photoshop for graphs» with customizable visualization Supports the main file formats for networks Improvable by plugins Community detection still experimental 20/26

Benchmark NetworkX (Brandes U., Erlebach T.) Python Bipartite graphs ready Attribute-friendly 1,000,000 nodes wide networks can be handled. Lacks in community detection algorithms Relies on other software for visualization >>> import networkx as nx >>> G=nx.Graph() >>> G.add_node("spam") >>> G.add_edge(1,2) >>> print(g.nodes()) [1, 2, 'spam'] >>> print(g.edges()) [(1, 2)] >>> G.degree(1) 1 21/26

Benchmark Igraph (Csárdi G., Nepusz T.) For R (a statistical environment) and Python. The low level routines are written in C. GUI available for R. Community detection ready. Not custom attributesfriendly > g <- graph.ring(10) > degree(g) [1] 2 2 2 2 2 2 2 2 2 2 > g2 <- erdos.renyi.game(1000, 10/1000) > degree.distribution(g2) [1] 0.000 0.000 0.002 0.009 0.020 0.039 0.064 0.107 0.111 0.115 0.118 [21] 0.003 0.001 22/26

How to choose the right tool? Pajek Gephi NetworkX igraph Input/output + ++ + + Attribute handling + + ++ - - Bipartite graphs + - + + Temporality + + + - Benchmark Visualization ++ ++ - ++ Indicators + + ++ ++ Clustering + - - - - ++ ++ Mature functionality - - Not available or weak 23/26

Benchmark Feature comparison Temporality Input / output Clustering Visualization igraph Pajek Bipartite NetworkX Gephi Attribute handling Indicators 24/26

25/26

Conclusion Conclusion Many domains, many approaches, many software (sociology, computer science, mathematics and physics). Functionalities to develop in the future (e.g. for decision support): Temporality awareness Links and nodes attributes analysis Hierarchical graphs 26/26

27/26

Bibliography Gartner http://www.gartner.com/it/page.jsp?id=1293114 Gartner Hype Cycle for Social Software, 2008 Fortunato, S. (2009). Community detection in graphs. Physics Reports, 103. Retrieved from http://arxiv.org/abs/0906.0612.pons, P., & Latapy, M. (2005). Computing communities in large networks using random walks. Computer and Information Sciences-ISCIS 2005. Retrieved from http://www.springerlink.com/index/p312811313637372.pdf. Newman, M., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical review E. Retrieved from http://link.aps.org/doi/10.1103/physreve.69.026113. Kamada, T., & Kawai, S. (1989). An algorithm for drawing general undirected graphs. Information processing letters, 31(12), 7--15. Retrieved from http://linkinghub.elsevier.com/retrieve/pii/0020019089901026. 28/26

Bibliography (2) Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine* 1. Computer networks and ISDN systems. Retrieved from http://linkinghub.elsevier.com/retrieve/pii/s016975529800110x. Fruchterman, T. M., & Reingold, E. M. (1991). Graph Drawing by Force-directed Placement. Huisman, M., & Van Duijn, M. (2003). Software for social network analysis. In Models and methods in social network analysis (p. 270 316). Freeman, L. (1979). Centrality in Social Networks Conceptual Clarification. Social Networks. 29/26