Introduction to Networks and Business Intelligence

Similar documents
MINFS544: Business Network Data Analytics and Applications

Complex Networks Analysis: Clustering Methods

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

Network Theory: 80/20 Rule and Small Worlds Theory

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?

A discussion of Statistical Mechanics of Complex Networks P. Part I

Graph Mining Techniques for Social Media Analysis

IC05 Introduction on Networks &Visualization Nov

Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging

Effects of node buffer and capacity on network traffic

Healthcare Analytics. Aryya Gangopadhyay UMBC

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Dmitri Krioukov CAIDA/UCSD

Network-Based Tools for the Visualization and Analysis of Domain Models

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks

Graphs, Networks and Python: The Power of Interconnection. Lachlan Blackhall - lachlan@repositpower.com

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34

Temporal Dynamics of Scale-Free Networks

DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS

Bioinformatics: Network Analysis

Social Media Mining. Network Measures

Degree distribution in random Apollonian networks structures

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014

The average distances in random graphs with given expected degrees

Random graphs and complex networks

The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth

Social Network Mining

! E6893 Big Data Analytics Lecture 10:! Linked Big Data Graph Computing (II)

The mathematics of networks

ModelingandSimulationofthe OpenSourceSoftware Community

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

A comparative study of social network analysis tools

V. Adamchik 1. Graph Theory. Victor Adamchik. Fall of 2005

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

An Interest-Oriented Network Evolution Mechanism for Online Communities

Walk-Based Centrality and Communicability Measures for Network Analysis

The Structure and Function of Complex Networks

Six Degrees of Separation in Online Society

Strong and Weak Ties

The Topology of Large-Scale Engineering Problem-Solving Networks

Graph Theory and Networks in Biology

DECENTRALIZED SCALE-FREE NETWORK CONSTRUCTION AND LOAD BALANCING IN MASSIVE MULTIUSER VIRTUAL ENVIRONMENTS

Statistical mechanics of complex networks

Discovering Determinants of Project Participation in an Open Source Social Network

Network Analysis For Sustainability Management

Extracting Information from Social Networks

Social Networks and Social Media

PUBLIC TRANSPORT SYSTEMS IN POLAND: FROM BIAŁYSTOK TO ZIELONA GÓRA BY BUS AND TRAM USING UNIVERSAL STATISTICS OF COMPLEX NETWORKS

Social Media Mining. Graph Essentials

The Structure of Growing Social Networks

GENERATING AN ASSORTATIVE NETWORK WITH A GIVEN DEGREE DISTRIBUTION

Scientific Collaboration Networks in China s System Engineering Subject

Graph theory and network analysis. Devika Subramanian Comp 140 Fall 2008

MINING COMMUNITIES OF BLOGGERS: A CASE STUDY

Open Source Software Developer and Project Networks

Algorithms for representing network centrality, groups and density and clustered graph representation

1. Write the number of the left-hand item next to the item on the right that corresponds to it.

Practical Graph Mining with R. 5. Link Analysis

1 Six Degrees of Separation

Complex Network Analysis of Brain Connectivity: An Introduction LABREPORT 5

KNOWLEDGE NETWORK SYSTEM APPROACH TO THE KNOWLEDGE MANAGEMENT

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

Graph Mining and Social Network Analysis

Network Analysis Basics and applications to online data

Parallel Algorithms for Small-world Network. David A. Bader and Kamesh Madduri

Combining Spatial and Network Analysis: A Case Study of the GoMore Network

Online Appendix to Social Network Formation and Strategic Interaction in Large Networks

Social Network Analysis using Graph Metrics of Web-based Social Networks


Robustness of Spatial Databases: Using Network Analysis on GIS Data Models

Social Network Analysis Measuring, Mapping, and Modeling Collections of Connections

SPANNING CACTI FOR STRUCTURALLY CONTROLLABLE NETWORKS NGO THI TU ANH NATIONAL UNIVERSITY OF SINGAPORE

A scalable multilevel algorithm for graph clustering and community structure detection

Recent Progress in Complex Network Analysis. Models of Random Intersection Graphs

ONLINE SOCIAL NETWORK ANALYTICS

Transcription:

Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 17th, 2015

Outline Network Science A Random History Network Analysis Network Topological Analysis: Random, Scale-Free, and Small-world Networks Node level analysis Link Analysis Network Visualization Network-based Business Intelligence Application 2

Network Science Network science is an interdisciplinary academic field which studies complex networks such as information networks, biological networks, cognitive and semantic networks, and social networks. It draws on theories and methods including (Wiki) Graph theory from mathematics, e.g., Small-world Statistical mechanics from physics, e.g., Rich get richer, Data mining and information visualization from computer science, Inferential modeling from statistics, e.g., Collaborative filtering Social structure from sociology, e.g., weak tie, structural holes network science can be defined as "the study of network representations of physical, biological, and social phenomena leading to predictive models of these phenomena. 3

A Random History: Math, Psychology, Sociology The study of networks has emerged in diverse disciplines as a means of analyzing complex relational data. Network science has its root in Graph Theory. Seven Bridges of Königsberg written by Leonhard Euler in 1736. Vertices, Edges, Nodes, Links, a branch of mathematics that studies the properties of pairwise relations in a network structure Social Network Analysis Jacob Moreno, a psychologist, developed the Sociogram and to precisely describe the interpersonal structure of a group. Jacob s experiment is the first to use Social Network Analysis and was published in the New York Times (April 3, 1933, page 17). Stanley Milgram (Small World Experiment: Six Degrees of Separation, 4 1960s). Facebook: 5.28 steps in 2008, 4.74 in 2011.

Jacob Moreno s experiment on Friendship Network 5

Now Nodes Links Social network People Friendship, kinship, collaboration Inter-organizational network Complex Networks in the Real World Companies Citation network Documents/authors Citations Internet Routers/computers Wire, cable WWW Web pages hyperlink Strategic alliance, buyer-seller relation, joint venture Biochemical network Genes/proteins Regulatory effect 6

Examples of Real-World Complex Networks 7 A collaboration network of physicists (size < 1K) Source: (Newman & Girvan, 2004) The Internet (size > 150K), Source: Lumeta Corp., The Internet Mapping Project

Network Analysis: Topology Analysis Network Topology Analysis takes a macro perspective to study the physical properties of network structures. Network topological measures include: Size, Density, Average Degree, Average Path Length: on average, the number of steps it takes to get from one member of the network to another. Diameter Clustering Coefficient: a measure of an "all-my-friends-know-eachother" property; small-world feature CC( i) CC i 1 2Ei k ( k 1) i i ClusteringCoeff ( i) k i = C d (i) = # of neighbors of node i E i = # of links actually exist between k i nodes 8

Topology Analysis: Three Topology Models Random Network Erdős Rényi Random Graph model used for generating random graphs in which edges are set between nodes with equal probabilities 9

Topology Analysis: Three Topology Models Small-World Network Watts-Strogatz Small World Model used for generating graphs with small-world properties large clustering coefficients 10

Topology Analysis: Three Topology Models Scale-Free Network Barabási Albert (BA) Preferential Attachment Model A network model used to demonstrate a preferential attachment or a "richget-richer" effect. an edge is most likely to attach to nodes with higher degrees. Power-law degree distribution 11

Network Analysis: Topology Analysis Topology Average Path Length (L) Clustering Coefficient (CC) Degree Distribution (P(k)) Random Graph L rand ~ ln N ln k CC rand k N Poisson Dist.: P( k) e k k k! k Small World (Watts & Strogatz, 1998) L sw L rand CC sw CC rand Similar to random graph Scale-Free network L SF L rand Power-law Distribution: P(k) ~ k - k : Average degree 12

Network Scientists Paul Erdős (Random graph model) Duncan Watts (Small-World model) A.-L. Barabási (Scale-Free model); Linked Mark Newman (SW and SF models) 13

Network Analysis: Node-level Analysis Node Centrality can be viewed as a measure of influence or importance of nodes in a network. Degree the number of links that a node possesses in a network. In a directed network, one must differentiate between in-links and out-links by calculating in-degree and out-degree. Betweeness the number of shortest paths in a network that traverse through that node. Closeness the average distance that each node is from all other nodes in the network 14

Example: Centrality Measures of Bin Laden in a Global Terrorist Network Degree Betweenness 60 5000 50 4500 4000 40 30 20 10 0 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 The changes in the degree, betweenness and closeness of the node bin Laden from 1989 to 2002 3500 3000 2500 2000 1500 1000 500 400 350 300 250 200 150 100 50 0 0 1989 1990 1989 1990 1991 1992 1993 1994 1995 1996 Closeness 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 1997 1998 1999 2000 2001 2002 15

Findings and Possible Explanations The changes described in the above figure show that From 1994 to 1996, bin Laden s betweenness decreased a lot and then increased until 2001 In 1994, The Saudi government revoked his citizenship and expelled him from the country In 1995, he then went to Khartoum, Sudan, but under U.S. pressure was expelled Again In 1996, bin Laden returned to Afghanistan established camps and refuge there From 1998 to 1999, there is another sharp decrease in betweenness After 1998 bombings of the United States embassies around world, President Bill Clinton ordered a freeze on assets linked to bin Laden Since then, bin Laden was officially listed as one of the FBI Ten Most Wanted Fugitives and FBI Most Wanted Terrorists In August 1998, the U.S. military launched an assassination but failed to harm bin Laden but killed 19 other people In 1999, United States convinced the United Nations to impose sanctions against Afghanistan in an attempt to force the Taliban to extradite him 16

Network Analysis: Link Analysis Link analysis focuses on the prediction of link formations between a pair of nodes based on various network factors. Its applications include: Finance: Insurance fraud detections E-commerce: recommendation systems, e.g., Amazon Internet Search Engine: Google PageRank Law Enforcement: Crime link predictions 17

Network Visualization: Expert Partition of the Collaboration Network Weapons of massive destruction Criminal justice An international terrorism conf. Terrorism in Europe Not well-defined group Legal perspective of terrorism Rand Corp. Historical and policy perspective of terrorism 18

19

Network-based Business Applications Facebook: People you may know 20