Network VisualizationS



Similar documents
Gephi focus on data import. Clément Levallois Gephi Support Team and Assist.

Graph/Network Visualization

Network-Based Tools for the Visualization and Analysis of Domain Models

Option 1: empirical network analysis. Task: find data, analyze data (and visualize it), then interpret.

How to Create an Overlay Map of Science Using the Web of Science

A Tutorial on dynamic networks. By Clement Levallois, Erasmus University Rotterdam

ADVANCED VISUALIZATION

A comparative study of social network analysis tools

Part 2: Community Detection

NodeXL for Network analysis Demo/hands-on at NICAR 2012, St Louis, Feb 24. Peter Aldhous, San Francisco Bureau Chief.

Tableau's data visualization software is provided through the Tableau for Teaching program.

Exploratory Facebook Social Network Analysis

Executive Dashboard. User Guide

Developing Business Intelligence and Data Visualization Applications with Web Maps

Visualizing Networks: Cytoscape. Prat Thiru

Big Data in Pictures: Data Visualization

Beyond Web Application Log Analysis using Apache TM Hadoop. A Whitepaper by Orzota, Inc.

Gephi Tutorial Quick Start

Social Network Analysis: Visualization Tools

Sensors talk and humans sense Part II

RA MODEL VISUALIZATION WITH MICROSOFT EXCEL 2013 AND GEPHI

Apigee Insights Increase marketing effectiveness and customer satisfaction with API-driven adaptive apps

Network Metrics, Planar Graphs, and Software Tools. Based on materials by Lala Adamic, UMichigan

Security visualisation

NakeDB: Database Schema Visualization

Gephi Tutorial Visualization

Comparative Analysis Report:

Hierarchical Data Visualization. Ai Nakatani IAT 814 February 21, 2007

World Trade Analysis

An Introduction to Web Metrics. Paul G. Strupp, Ph.D.

Visualizing bibliometric networks

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Socio-semantic network data visualization

Social Network Mining

Impress Funders and Make Mission and Message Clear: Easy Data Visualization and Infographics. May 10,

Extracting Information from Social Networks

Spotfire and Tableau Positioning. Summary

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34

The 4 Pillars of Technosoft s Big Data Practice

MicroStrategy Desktop

Social Media Mining. Network Measures

MARS STUDENT IMAGING PROJECT

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

and BI Services Overview CONTACT W: E: M: +385 (91) A: Lastovska 23, Zagreb, Croatia

Introduction to Networks and Business Intelligence

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

Create Cool Lumira Visualization Extensions with SAP Web IDE Dong Pan SAP PM and RIG Analytics Henry Kam Senior Product Manager, Developer Ecosystem

Introduction... 1 Welcome Screen... 2 Map View Generating a map Map View Basic Map Features... 4

Visualisation in the Google Cloud

Outline. What is Big data and where they come from? How we deal with Big data?

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Analysis of Stock Symbol Co occurrences in Financial Articles

An overview of Software Applications for Social Network Analysis

ProteinQuest user guide

Creating a Network Graph with Gephi

OECD.Stat Web Browser User Guide

Capturing Meaningful Competitive Intelligence from the Social Media Movement

TABLEAU COURSE CONTENT. Presented By 3S Business Corporation Inc Call us at : Mail us at : info@3sbc.com

Data Visualization. Scientific Principles, Design Choices and Implementation in LabKey. Cory Nathe Software Engineer, LabKey

Graphs, Networks and Python: The Power of Interconnection. Lachlan Blackhall - lachlan@repositpower.com

Data Structures and Algorithms Written Examination

TIBCO Spotfire Network Analytics 1.1. User s Manual

Protographs: Graph-Based Approach to NetFlow Analysis. Jeff Janies RedJack FloCon 2011

Graph Theory and Complex Networks: An Introduction. Chapter 06: Network analysis

Network Analysis of a Large Scale Open Source Project

Examining graduate committee faculty compositions- A social network analysis example. Kathryn Shirley and Kelly D. Bradley. University of Kentucky

V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005

Starting User Guide 11/29/2011

in R Binbin Lu, Martin Charlton National Centre for Geocomputation National University of Ireland Maynooth The R User Conference 2011

GEOGRAPHIC INFORMATION SYSTEMS CERTIFICATION

with your eyes: Considerations when visualizing information Joshua Mitchell & Melissa Rands, RISE

Data Visualisation and Its Application in Official Statistics. Olivia Or Census and Statistics Department, Hong Kong, China

Effective Big Data Visualization

Graph Visualization Tools: A Comparative Analysis

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

Visualizing Large, Complex Data

Network Analysis For Sustainability Management

Understanding Data: A Comparison of Information Visualization Tools and Techniques

<Insert Picture Here> Web 2.0 Data Visualization with JSF. Juan Camilo Ruiz Senior Product Manager Oracle Development Tools

IC05 Introduction on Networks &Visualization Nov

Practical statistical network analysis (with R and igraph)

Analysis of Algorithms, I

DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS

How To Understand The Network Of A Network

Predictive Analytics for Procurement Lead Time Forecasting at Lockheed Martin Space Systems

Introduction to Exploratory Data Analysis

GAZETRACKERrM: SOFTWARE DESIGNED TO FACILITATE EYE MOVEMENT ANALYSIS

Big Data for Investment Research Management

Client Overview. Engagement Situation. Key Requirements

Geo-Localization of KNIME Downloads

Network Analysis Basics and applications to online data

Class One: Degree Sequences

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations

KAIST Business School

MINFS544: Business Network Data Analytics and Applications

Transcription:

Network VisualizationS When do they make sense? Where to start? Clement Levallois, Assist. Prof. EMLYON Business School v. 1.1, January 2014

Bio notes Education in economics, management, history of science (Ph.D.) Turned to digital methods for research. data visualization, network analysis, natural language processing, web applications and more. Member of the Gephi Community Support team Gephi certified trainer https://marketplace.gephi.org/service/data-analysis/ Contact, feedback welcome: on twitter @seinecle or www.clementlevallois.net

1. See your datasets differently 2. Insights to be gained 3. How to work with connected data 4. Good practices and red flags

A note on the terminology X A node (also called a vertex, plural vertices). An edge (or a tie, or a link ) Y Y was born in Y s marital status is Y s latitude is Y s longitude is Nodes attributes (NB: edges can have attributes too!) 3 A directed network (the direction of the relation matters) An undirected network (the direction of the relation does not matter) A weighted network (the edges have a strength represented by a numerical value)

1. SEE YOUR DATASETS DIFFERENTLY

We are accustomed to geospatial data, and how to visualize them on maps!

We are accustomed to time series / economic data, and how to represent them on charts!

We are accustomed to social network data, and how to represent them as graphs!

second example of a social network representation that makes intuitive sense.

Network data visualization does not stop at social networks or server logs. When any form of bond exists between observations, these observations can be analyzed from the point of view of forming a network. We have been so much trained to look at observations as independent entities, to be analyzed through aggregation that we have lost sight that many datasets are actually rich of connections between individual data points, worth exploring.

Co-occurrences (Muesli ingredients) credits: moritz.stefaner.eu Müsli Ingredient Network

Co-occurrences (all ingredients) Map of flavors 2 flavors are connected If they frequently appear in common recipes.

Co-occurrences (concepts)

Co-location Map of product categories 2 products are connected If they are frequently exported by the same countries

Similarity

2 entities are connected if they they sit close geographically they perform the same kind of actions they share antecedents they possess common characteristics they connect to common entities they transact with each other they are referred to in pair they refer to common entities or combinations of the above

2. INSIGHTS TO BE GAINED

1. Community detection A community / cluster is a group of nodes more densely connected with each other than with the rest of the network. How to define more densely connected, and how to calculate it efficiently, is research in progress. Two options: - Disjoint communities - Overlapping communities => Research on disjoint communities is currently more advanced.

2. Sense of evolution, dynamics Dynamics in networks covers: Topology Creation and deletion of nodes and edges Attributes Change in the values of the quantitative and qualitative attributes of nodes and edges Network analysis is focusing on the topology Network visualization (with Gephi) accommodates both.

3. Overlay analysis Social network of scientists Colored by they research interests. Colored by they geographical location

4. The picture-examining eye is the best finder we have of the wholly unanticipated. Exploratory analysis: Visualizing the network stimulates the process of forming hypotheses. Needs confirmatory analysis at a later stage: Either with network statistics, or with traditional forms of statistics / numerical analysis.

3. HOW TO WORK WITH CONNECTED DATA

The general idea

The steps YOU START WITH List / table of observations 1. Extract connected entities (a network!) 2. Format this network in a conventional standard Not many solutions! Many solutions 3. Display the network with a software package / in the browser Many solutions

1 Extract connections through similarities John Anna Lili Example: Gaze Simple program that creates networks from similarities (available from www.clementlevallois.net)

2 Format the network in a standard A simple list of edges in a csv file is enough: A,B B,C C,A. Programming packaging for richer network formats (that include attributes for nodes and edges, and dynamics) R, Python, Java

3 Choosing network viz tools size matters Do you prefer web browsers or desktop apps? Web browsers Desktop apps More than ~ 300 nodes? More than ~ 1,000 nodes? yes no yes no vivagraph.js (animated graphs) Seadragon (static pics) sigma.js, d3.js, gexf.js, jit.js, arbor.js, cytoscape.js Gephi GraphInsight Vosviewer Cytoscape NodeXL Netdraw Pajek Visone

3 Choosing network viz tools for dynamic networks? In web browsers or desktop apps? In web browsers Desktop apps No solution? Gephi Experimental: NodeXL Cytoscape

3 Choosing network viz tools for spatialized networks? In web browsers or desktop apps? in web browsers Desktop apps (no standard solution?) But see: http://www.leydesdorff. net/maps/is2009.html Gephi NodeXL (experimental?)

GOOD PRACTICES AND RED FLAGS 4 pieces of advice

1 this is *visual* analytics - making a network readable is not a sin - add labels, captions - curved versus straight edges - color codes this is not infographics data are sacred - size and position of nodes and edges are determined by a computational procedure - choices in this domain depend on design (viz should be readable, engaging and insightful) but not at the expense of methodological soundness.

2 Same network, different centralities A: degree centrality = local connectivity => nodes having many edges are central B: closeness centrality = geographic middle => nodes that are close to all other nodes are central C: betweenness centrality = connectivity => nodes that lay on many shortest paths are central (a shortest path is the quickest way to go from one node to another) Legend: D: Eigenvector centrality = authority => nodes that are connected to highly ranked nodes are central (recursive approach). - centrality + Ref: http://en.wikipedia.org/wiki/file:centrality.svg

3 Directed, undirected, weighted networks: it matters! 1: directed network 2: same network, but undirected G F E D C A B G F Legend: E D C A B - centrality +

4 The problem of hairballs - when networks fail to reveal structure

5 Come with a purpose - yes, the picture can yield unexpected insights - does not mean that visualizations are a substitute for working at formulating a research question.

CONCLUSION To go further

#dataviz, #datavis, #infovis, #infoviz

Credits Migration map Martin de Wulf, 2010 http://migrationsmap.net GapMinder Gapminder.org LinkedIn Map http://inmaps.linkedinlabs.com Facebook network of friendships Paul Butler, 2010. https://www.facebook.com/note.php?note_id=469716398919 Musli ingredient network Moritz Stefaner, 2012 http://moritz.stefaner.eu/projects/musli-ingredient-network/ Flavor network Yong-Yeol Ahn, Sebastian E. Ahnert, James P. Bagrow & Albert-László Barabási, 2011 http://www.nature.com/srep/2011/111215/srep00196/fig_tab/srep00196_f2.html

Credits Map of terms Levallois, Clithero, Smidts, Wouters and Huettel (2012). http://www.nature.com/nrn/journal/v13/n11/full/nrn3354.html Social network of scientists Levallois (private data). Map of product categories The Observatory of Economic Complexity, Alexander Simoes (2012). http://atlas.media.mit.edu/ Map of science Rafols, Porter and Leydesdorff (2010). http://www.leydesdorff.net/overlaytoolkit/