School of Computer Science Carnegie Mellon Graph Mining, self-similarity and power laws

Size: px
Start display at page:

Download "School of Computer Science Carnegie Mellon Graph Mining, self-similarity and power laws"

Transcription

1 Graph Mining, self-similarity and power laws Christos Faloutsos University

2 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 2

3 Motivating questions: How does the Internet look like? What constitutes a normal social network? network value of a customer? [Domingos+] which gene/species affects the others the most? DoE/DoD, 2007 C. Faloutsos 3

4 Problem #1 - topology How does the Internet look like? Any rules? DoE/DoD, 2007 C. Faloutsos 4

5 Solution#1: Rank exponent R A1: Power law in the degree distribution [SIGCOMM99] internet domains log(degree) ibm.com att.com log(rank) DoE/DoD, 2007 C. Faloutsos 5

6 Power laws In- and out-degree distribution of web sites [Barabasi], [IBM-CLEVER] log(freq) from [Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew Tomkins ] log indegree DoE/DoD, 2007 C. Faloutsos 6

7 epinions.com count who-trusts-whom [Richardson + Domingos, KDD 2001] (out) degree DoE/DoD, 2007 C. Faloutsos 7

8 Even more power laws: web hit counts [w/ A. Montgomery] Web Site Traffic log(count) Zipf yahoo.com log(freq) DoE/DoD, 2007 C. Faloutsos 8

9 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 9

10 Problem#2: evolution Given a graph: how will it look like, next year? [from Lumeta: ISPs 6/1999] DoE/DoD, 2007 C. Faloutsos 10

11 Evolution of diameter? Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(n))) i.e.., slowly increasing with network size Q: What is happening, in reality? DoE/DoD, 2007 C. Faloutsos 11

12 Evolution of diameter? Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(n))) i.e.., slowly increasing with network size Q: What is happening, in reality? A: It shrinks(!!), towards a constant value DoE/DoD, 2007 C. Faloutsos 12

13 Shrinking diameter ArXiv physics papers and their citations [Leskovec+05a] DoE/DoD, 2007 C. Faloutsos 13

14 Shrinking diameter ArXiv: who wrote what DoE/DoD, 2007 C. Faloutsos 14

15 Shrinking diameter U.S. patents citing each other DoE/DoD, 2007 C. Faloutsos 15

16 Shrinking diameter Autonomous systems DoE/DoD, 2007 C. Faloutsos 16

17 Temporal evolution of graphs N(t) nodes; E(t) edges at time t suppose that N(t+1) = 2 * N(t) Q: what is your guess for E(t+1) =? 2 * E(t) DoE/DoD, 2007 C. Faloutsos 17

18 Temporal evolution of graphs N(t) nodes; E(t) edges at time t suppose that N(t+1) = 2 * N(t) Q: what is your guess for E(t+1) =? x2 * E(t) A: over-doubled! DoE/DoD, 2007 C. Faloutsos 18

19 Densification Power Law ArXiv: Physics papers and their citations E(t) 1.69 N(t) DoE/DoD, 2007 C. Faloutsos 19

20 Densification Power Law U.S. Patents, citing each other E(t) 1.66 N(t) DoE/DoD, 2007 C. Faloutsos 20

21 Densification Power Law Autonomous Systems E(t) 1.18 N(t) DoE/DoD, 2007 C. Faloutsos 21

22 Densification Power Law ArXiv: who wrote what E(t) 1.15 N(t) DoE/DoD, 2007 C. Faloutsos 22

23 Summary of laws Static graphs power law degrees; power law eigenvalues communities (within communities) small diameters Dynamic graphs shrinking diameter densification power law DoE/DoD, 2007 C. Faloutsos 23

24 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 24

25 Problem#3: Generators Q: what local behavior can generate such graphs? DoE/DoD, 2007 C. Faloutsos 25

26 Problem#3: Generators Q: what local behavior can generate such graphs? A1: Preferential attachment [Barabasi+] A2: copying model [Kleinberg+] A3: forest-fire model [Leskovec+] A4: Kronecker [Leskovec+] A5: Economic reasons [Papadimitriou+] DoE/DoD, 2007 C. Faloutsos 26

27 Problem#3: Generators Q: what local behavior can generate such graphs? A1: Preferential attachment A2: copying model A3: forest-fire model A4: Kronecker A5: Economic reasons power law degree + communities + DPL + easily parallilizable DoE/DoD, 2007 C. Faloutsos 27

28 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 28

29 Problem#4: influence propagation how do influence/rumors/viruses propagate? what is the best customer to market to? DoE/DoD, 2007 C. Faloutsos 29

30 Problem#4: influence propagation how do influence/rumors/viruses propagate? tipping point [Kleinberg+] first eigenvalue -> epidemic threshold [Chakrabarti+] what is the best customer to market to? network value of a customer [Domingos+] DoE/DoD, 2007 C. Faloutsos 30

31 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 31

32 Problem #5: communities how to find natural communities, quickly? how to find strange /suspicious/valuable edges? DoE/DoD, 2007 C. Faloutsos 32

33 Problem #5: communities how to find natural communities, quickly? network flow [Flake+] node/edge betweeness (~ stress ) cross-associations [Chakrabarti+] 2 nd eigenvalue; METIS [Karypis+] random walks [Newman] etc etc etc DoE/DoD, 2007 C. Faloutsos 33

34 Problem #5: communities connection sub-graphs [Faloutsos+] DoE/DoD, 2007 C. Faloutsos 34

35 Problem #5: communities connection sub-graphs [Faloutsos+] DoE/DoD, 2007 C. Faloutsos 35

36 Problem #5: communities connection sub-graphs [Faloutsos+] BANKS system [Chakrabarti+] ObjectRank [Papakonstantinou+] DoE/DoD, 2007 C. Faloutsos 36

37 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 37

38 Problem #6: local patterns Which sub-graphs are common/frequent? H C C N C C C H N C molecule #1 molecule #M DoE/DoD, 2007 C. Faloutsos 38

39 Problem #6: local patterns Which sub-graphs are common/frequent? H C C N C C C H N C molecule #1 molecule #M DoE/DoD, 2007 C. Faloutsos 39

40 Problem #6: local patterns Clever extensions of Association Rules (= frequent itemsets / market basket analysis) [Jiawei Han+] [Jian Pei+] [G. Karypis] [M. Zaki+]... DoE/DoD, 2007 C. Faloutsos 40

41 Overview Achievements Challenges time evolving graphs multi-graphs DoE/DoD, 2007 C. Faloutsos 41

42 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/1/2005 DoE/DoD, 2007 C. Faloutsos 42

43 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/2/2005 DoE/DoD, 2007 C. Faloutsos 43

44 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/3/2005 DoE/DoD, 2007 C. Faloutsos 44

45 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/4/2005???? DoE/DoD, 2007 C. Faloutsos 45

46 Overview Achievements Challenges time evolving graphs multi-graphs DoE/DoD, 2007 C. Faloutsos 46

47 Multi-graphs Patterns/outliers? friends co-authors DoE/DoD, 2007 C. Faloutsos 47

48 Multi-graphs Relational Learning link prediction, feature extraction [Jensen+] etc Dis-ambiguation, de-duplication [Getoor+] etc friends co-authors DoE/DoD, 2007 C. Faloutsos 48

49 source person School of Computer Science Promising solution: tensors I x J x K destination person type of contact DoE/DoD, 2007 C. Faloutsos 49

50 Tensors: ~SVD, for >=3 modes C I x J x K Ix R K x R Jx R ¼ A R x R x R B = + + Foil: from Tamara Kolda (Sandia) DoE/DoD, 2007 C. Faloutsos 50

51 Specially Structured Tensors Tucker Tensor Kruskal Tensor Our Notation Our Notation core W K x T K x R W I x J x K Ix R Jx S I x J x K w 1 Ix R w R Jx R = U R x S x T V = = u 1 v 1 U + + V R x R x R u R v R Foil: from Tamara Kolda (Sandia) DoE/DoD, 2007 C. Faloutsos 51

52 Conclusions - achievements Surprising patterns in graphs power laws; communities small/shrinking diameters simple, local behavior can lead to such patterns (eg., preferential attachment, etc) fast algorithms for communities/partitioning fast algorithms for frequent subgraphs DoE/DoD, 2007 C. Faloutsos 52

53 Conclusions - next steps multi-graphs time-evolving graphs scalability (graph sampling) (large graph visualization) DoE/DoD, 2007 C. Faloutsos 53

54 Conclusions - philosophically: deep connections with self-similarity, cellular automata (~agents), and fractals DoE/DoD, 2007 C. Faloutsos 54

55 Resources Manfred Schroeder Chaos, Fractals and Power Laws, 1991 A-L. Barabasi, Linked, 2002 D. Watts, Six Degrees, 2004 DoE/DoD, 2007 C. Faloutsos 55

56 References R. Albert, H. Jeong, and A.-L. Barabási, Diameter of the World Wide Web. Nature 401, (1999) A. Fabrikant, E. Koutsoupias, and C. Papadimitriou. Heuristically Optimized Tradeoffs: A New Paradigm for Power Laws in the Internet. Int. Colloquium on Automata, Languages, and Programming (ICALP), Malaga, Spain, July DoE/DoD, 2007 C. Faloutsos 56

57 References [sigcomm99] Michalis Faloutsos, Petros Faloutsos and Christos Faloutsos, What does the Internet look like? Empirical Laws of the Internet Topology, SIGCOMM 1999 [Leskovec 05] Jure Leskovec, Jon M. Kleinberg, Christos Faloutsos: Graphs over time: densification laws, shrinking diameters and possible explanations. KDD 2005: DoE/DoD, 2007 C. Faloutsos 57

58 References [brite] Alberto Medina, Anukool Lakhina, Ibrahim Matta, and John Byers. BRITE: An Approach to Universal Topology Generation. MASCOTS '01 Xifeng Yan, Xianghong Jasmine Zhou, Jiawei Han: Mining closed relational graphs with connectivity constraints. KDD 2005: DoE/DoD, 2007 C. Faloutsos 58

59 Thank you! Contact info: christos <at> cs.cmu.edu www. cs.cmu.edu /~christos (w/ papers, datasets, code, etc) DoE/DoD, 2007 C. Faloutsos 59

Graph Mining Techniques for Social Media Analysis

Graph Mining Techniques for Social Media Analysis Graph Mining Techniques for Social Media Analysis Mary McGlohon Christos Faloutsos 1 1-1 What is graph mining? Extracting useful knowledge (patterns, outliers, etc.) from structured data that can be represented

More information

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU 1 Introduction What can we do with graphs? What patterns

More information

The Shape of the Network. The Shape of the Internet. Why study topology? Internet topologies. Early work. More on topologies..

The Shape of the Network. The Shape of the Internet. Why study topology? Internet topologies. Early work. More on topologies.. The Shape of the Internet Slides assembled by Jeff Chase Duke University (thanks to and ) The Shape of the Network Characterizing shape : AS-level topology: who connects to whom Router-level topology:

More information

CMU SCS Large Graph Mining Patterns, Tools and Cascade analysis

CMU SCS Large Graph Mining Patterns, Tools and Cascade analysis Large Graph Mining Patterns, Tools and Cascade analysis Christos Faloutsos CMU Roadmap Introduction Motivation Why big data Why (big) graphs? Patterns in graphs Tools: fraud detection on e-bay Conclusions

More information

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003 Graph models for the Web and the Internet Elias Koutsoupias University of Athens and UCLA Crete, July 2003 Outline of the lecture Small world phenomenon The shape of the Web graph Searching and navigation

More information

Thank you! NetMine Data mining on networks IIS -0209107 AWSOM. Outline. Proposed method. Goals

Thank you! NetMine Data mining on networks IIS -0209107 AWSOM. Outline. Proposed method. Goals NetMine Data mining on networks IIS -0209107 Christos Faloutsos (CMU) Michalis Faloutsos (UCR) Peggy Agouris George Kollios Fillia Makedon Betty Salzberg Anthony Stefanidis Thank you! NSF-IDM 04 C. Faloutsos

More information

Expansion Properties of Large Social Graphs

Expansion Properties of Large Social Graphs Expansion Properties of Large Social Graphs Fragkiskos D. Malliaros 1 and Vasileios Megalooikonomou 1,2 1 Computer Engineering and Informatics Department University of Patras, 26500 Rio, Greece 2 Data

More information

Computer Network Topologies: Models and Generation Tools

Computer Network Topologies: Models and Generation Tools Consiglio Nazionale delle Ricerche Technical Report n. 5/200 Computer Network Topologies: Models and Generation Tools Giuseppe Di Fatta, Giuseppe Lo Presti 2, Giuseppe Lo Re CE.R.E. Researcher 2 CE.R.E.,

More information

CMU SCS Mining Billion-Node Graphs - Patterns and Algorithms

CMU SCS Mining Billion-Node Graphs - Patterns and Algorithms Mining Billion-Node Graphs - Patterns and Algorithms Christos Faloutsos CMU Thank you! Panos Chrysanthis Ling Liu Vladimir Zadorozhny Prashant Krishnamurthy C. Faloutsos (CMU) 2 Resource Open source system

More information

The ebay Graph: How Do Online Auction Users Interact?

The ebay Graph: How Do Online Auction Users Interact? The ebay Graph: How Do Online Auction Users Interact? Yordanos Beyene, Michalis Faloutsos University of California, Riverside {yordanos, michalis}@cs.ucr.edu Duen Horng (Polo) Chau, Christos Faloutsos

More information

On Generating Graphs with Prescribed Vertex Degrees for Complex Network Modeling 1. College of Computing Georgia Institute of Technology

On Generating Graphs with Prescribed Vertex Degrees for Complex Network Modeling 1. College of Computing Georgia Institute of Technology On Generating Graphs with Prescribed Vertex Degrees for Complex Network Modeling 1 Milena Mihail Nisheeth K. Vishnoi College of Computing Georgia Institute of Technology Abstract Graph models for real-world

More information

CMU SCS Mining Large Graphs and Tensors - Patterns, Tools and Discoveries.

CMU SCS Mining Large Graphs and Tensors - Patterns, Tools and Discoveries. Mining Large Graphs and Tensors - Patterns, Tools and Discoveries. Christos Faloutsos CMU Thank you! Nikos Sidiropoulos Kuo-Chu Chang Zhi (Gerry) Tian C. Faloutsos (CMU) 2 Roadmap Introduction Motivation

More information

Introduction to Graph Mining

Introduction to Graph Mining Introduction to Graph Mining What is a graph? A graph G = (V,E) is a set of vertices V and a set (possibly empty) E of pairs of vertices e 1 = (v 1, v 2 ), where e 1 E and v 1, v 2 V. Edges may contain

More information

Analyzing the Facebook graph?

Analyzing the Facebook graph? Logistics Big Data Algorithmic Introduction Prof. Yuval Shavitt Contact: shavitt@eng.tau.ac.il Final grade: 4 6 home assignments (will try to include programing assignments as well): 2% Exam 8% Big Data

More information

(B ) Empirical observation: Shrinking diameters: The effective diameter is, in many cases, actually decreasing as the network grows.

(B ) Empirical observation: Shrinking diameters: The effective diameter is, in many cases, actually decreasing as the network grows. Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations Jure Leskovec Carnegie Mellon University jure@cs.cmu.edu Jon Kleinberg Cornell University kleinber@cs.cornell.edu Christos

More information

How To Find Out How A Graph Densifies

How To Find Out How A Graph Densifies Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations Jure Leskovec Carnegie Mellon University jure@cs.cmu.edu Jon Kleinberg Cornell University kleinber@cs.cornell.edu Christos

More information

Graph Mining and Social Network Analysis

Graph Mining and Social Network Analysis Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann

More information

Jure Leskovec (@jure) Stanford University

Jure Leskovec (@jure) Stanford University Jure Leskovec (@jure) Stanford University KDD Summer School, Beijing, August 2012 8/10/2012 Jure Leskovec (@jure), KDD Summer School 2012 2 Graph: Kronecker graphs Graph Node attributes: MAG model Graph

More information

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network , pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and

More information

An Alternative Web Search Strategy? Abstract

An Alternative Web Search Strategy? Abstract An Alternative Web Search Strategy? V.-H. Winterer, Rechenzentrum Universität Freiburg (Dated: November 2007) Abstract We propose an alternative Web search strategy taking advantage of the knowledge on

More information

Studying E-mail Graphs for Intelligence Monitoring and Analysis in the Absence of Semantic Information

Studying E-mail Graphs for Intelligence Monitoring and Analysis in the Absence of Semantic Information Studying E-mail Graphs for Intelligence Monitoring and Analysis in the Absence of Semantic Information Petros Drineas, Mukkai S. Krishnamoorthy, Michael D. Sofka Bülent Yener Department of Computer Science,

More information

The average distances in random graphs with given expected degrees

The average distances in random graphs with given expected degrees Classification: Physical Sciences, Mathematics The average distances in random graphs with given expected degrees by Fan Chung 1 and Linyuan Lu Department of Mathematics University of California at San

More information

The Internet Is Like A Jellyfish

The Internet Is Like A Jellyfish The Internet Is Like A Jellyfish Michalis Faloutsos UC Riverside Joint work with: Leslie Tauro, Georgos Siganos (UCR) Chris Palmer(CMU) Big Picture: Modeling the Internet Topology Traffic Protocols Routing,

More information

Evolutionary Network Analysis: A Survey

Evolutionary Network Analysis: A Survey Evolutionary Network Analysis: A Survey CHARU AGGARWAL, IBM T. J. Watson Research Center KARTHIK SUBBIAN, University of Minnesota Evolutionary network analysis has found an increasing interest in the literature

More information

AN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS

AN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS Chapter 1 AN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS Charu C. Aggarwal IBM T. J. Watson Research Center Hawthorne, NY 10532 charu@us.ibm.com Abstract The advent of online social networks has been

More information

arxiv:cs.dm/0204001 v1 30 Mar 2002

arxiv:cs.dm/0204001 v1 30 Mar 2002 A Steady State Model for Graph Power Laws David Eppstein Joseph Wang arxiv:cs.dm/0000 v 0 Mar 00 Abstract Power law distribution seems to be an important characteristic of web graphs. Several existing

More information

A SURVEY OF PROPERTIES AND MODELS OF ON- LINE SOCIAL NETWORKS

A SURVEY OF PROPERTIES AND MODELS OF ON- LINE SOCIAL NETWORKS International Conference on Mathematical and Computational Models PSG College of Technology, Coimbatore Copyright 2009, Narosa Publishing House, New Delhi, India A SURVEY OF PROPERTIES AND MODELS OF ON-

More information

Why Rumors Spread Fast in Social Networks

Why Rumors Spread Fast in Social Networks Why Rumors Spread Fast in Social Networks Benjamin Doerr 1, Mahmoud Fouz 2, and Tobias Friedrich 1,2 1 Max-Planck-Institut für Informatik, Saarbrücken, Germany 2 Universität des Saarlandes, Saarbrücken,

More information

With the increasing availability of large social network data, there is

With the increasing availability of large social network data, there is Learning and Predicting the Evolution of Social Networks Björn Bringmann, Katholieke Universiteit Leuven Michele Berlingerio, ISTI-CNR Francesco Bonchi and Aristides Gionis, Yahoo Research Graph Evolution

More information

An Analysis of Verifications in Microblogging Social Networks - Sina Weibo

An Analysis of Verifications in Microblogging Social Networks - Sina Weibo An Analysis of Verifications in Microblogging Social Networks - Sina Weibo Junting Chen and James She HKUST-NIE Social Media Lab Dept. of Electronic and Computer Engineering The Hong Kong University of

More information

Foundations of Multidimensional Network Analysis

Foundations of Multidimensional Network Analysis 2 International Conference on Advances in Social Networks Analysis and Mining Foundations of Multidimensional Network Analysis Michele Berlingerio Michele Coscia 2 Fosca Giannotti Anna Monreale 2 Dino

More information

A dynamic model for on-line social networks

A dynamic model for on-line social networks A dynamic model for on-line social networks A. Bonato 1, N. Hadi, P. Horn, P. Pra lat 4, and C. Wang 1 1 Ryerson University, Toronto, Canada abonato@ryerson.ca, cpwang@ryerson.ca Wilfrid Laurier University,

More information

Online Estimating the k Central Nodes of a Network

Online Estimating the k Central Nodes of a Network Online Estimating the k Central Nodes of a Network Yeon-sup Lim, Daniel S. Menasché, Bruno Ribeiro, Don Towsley, and Prithwish Basu Department of Computer Science UMass Amherst, Raytheon BBN Technologies

More information

Towards Modeling Legitimate and Unsolicited Email Traffic Using Social Network Properties

Towards Modeling Legitimate and Unsolicited Email Traffic Using Social Network Properties Towards Modeling Legitimate and Unsolicited Email Traffic Using Social Network Properties Farnaz Moradi Tomas Olovsson Philippas Tsigas Computer Science and Engineering Chalmers University of Technology,

More information

Some questions... Graphs

Some questions... Graphs Uni Innsbruck Informatik - 1 Uni Innsbruck Informatik - 2 Some questions... Peer-to to-peer Systems Analysis of unstructured P2P systems How scalable is Gnutella? How robust is Gnutella? Why does FreeNet

More information

Internet Topology of the Power-Laws

Internet Topology of the Power-Laws Power-Laws and the AS-level Internet Topology Georgos Siganos,Michalis Faloutsos,Petros Faloutsos,Christos Faloutsos Abstract In this paper, we study and characterize the topology of the Internet at the

More information

Introduction to Networks and Business Intelligence

Introduction to Networks and Business Intelligence Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 17th, 2015 Outline Network Science A Random History Network Analysis Network Topological

More information

On Realistic Network Topologies for Simulation

On Realistic Network Topologies for Simulation On Realistic Network Topologies for Simulation Oliver Heckmann, Michael Piringer, Jens Schmitt, Ralf Steinmetz Multimedia Communications (KOM), Department of Electronic Engineering & Information Technology

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu

More information

Social Networks and Social Media

Social Networks and Social Media Social Networks and Social Media Social Media: Many-to-Many Social Networking Content Sharing Social Media Blogs Microblogging Wiki Forum 2 Characteristics of Social Media Consumers become Producers Rich

More information

Characterizing and Modelling Clustering Features in AS-Level Internet Topology

Characterizing and Modelling Clustering Features in AS-Level Internet Topology Characterizing and Modelling Clustering Features in AS-Level Topology Yan Li, Jun-Hong Cui, Dario Maggiorini and Michalis Faloutsos yan.li@uconn.edu, jcui@engr.uconn.edu, dario@dico.unimi.it, michalis@cs.ucr.edu

More information

Analyzing and modelling the AS-level Internet topology

Analyzing and modelling the AS-level Internet topology Analyzing and modelling the AS-level Internet topology Shi Zhou & Raul J. Mondragon Department of Electronic Engineering Queen Mary, University of London Mile End Road, London, E1 4NS, United Kingdom Email:

More information

Mobile Call Graphs: Beyond Power-Law and Lognormal Distributions

Mobile Call Graphs: Beyond Power-Law and Lognormal Distributions Mobile Call Graphs: Beyond Power-Law and Lognormal Distributions Mukund Seshadri Sprint Burlingame, California, USA mukund.seshadri@sprint.com Jean Bolot Sprint Burlingame, California, USA bolot@sprint.com

More information

Homophily in Online Social Networks

Homophily in Online Social Networks Homophily in Online Social Networks Bassel Tarbush and Alexander Teytelboym Department of Economics, University of Oxford bassel.tarbush@economics.ox.ac.uk Department of Economics, University of Oxford

More information

Community Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer

Community Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer Community Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer 1 Content What is Community Detection? Motivation Defining a community Methods to find communities Overlapping communities

More information

Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging

Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging Mathematical Problems in Engineering, Article ID 578713, 6 pages http://dx.doi.org/10.1155/2014/578713 Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of

More information

Part 2: Community Detection

Part 2: Community Detection Chapter 8: Graph Data Part 2: Community Detection Based on Leskovec, Rajaraman, Ullman 2014: Mining of Massive Datasets Big Data Management and Analytics Outline Community Detection - Social networks -

More information

Social Network Mining

Social Network Mining Social Network Mining Data Mining November 11, 2013 Frank Takes (ftakes@liacs.nl) LIACS, Universiteit Leiden Overview Social Network Analysis Graph Mining Online Social Networks Friendship Graph Semantics

More information

Mathematical Challenges in Cybersecurity

Mathematical Challenges in Cybersecurity SANDIA REPORT SAND 2009-0805 Unlimited Release Printed February 2009 Mathematical Challenges in Cybersecurity Daniel M. Dunlavy, Bruce Hendrickson, and Tamara G. Kolda Prepared by Sandia National Laboratories

More information

Cost effective Outbreak Detection in Networks

Cost effective Outbreak Detection in Networks Cost effective Outbreak Detection in Networks Jure Leskovec Joint work with Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, and Natalie Glance Diffusion in Social Networks One of

More information

SOCIAL NETWORK DATA ANALYTICS

SOCIAL NETWORK DATA ANALYTICS SOCIAL NETWORK DATA ANALYTICS SOCIAL NETWORK DATA ANALYTICS Edited by CHARU C. AGGARWAL IBM T. J. Watson Research Center, Yorktown Heights, NY 10598, USA Kluwer Academic Publishers Boston/Dordrecht/London

More information

NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks

NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks Shashank Pandit, Duen Horng Chau, Samuel Wang, Christos Faloutsos Carnegie Mellon University Pittsburgh, PA 15213, USA

More information

Search in BigData2 - When Big Text meets Big Graph 1. Introduction State of the Art on Big Data

Search in BigData2 - When Big Text meets Big Graph 1. Introduction State of the Art on Big Data Search in BigData 2 - When Big Text meets Big Graph Christos Giatsidis, Fragkiskos D. Malliaros, François Rousseau, Michalis Vazirgiannis Computer Science Laboratory, École Polytechnique, France {giatsidis,

More information

Who Links to Whom: Mining Linkage between Web Sites

Who Links to Whom: Mining Linkage between Web Sites Who Links to Whom: Mining Linkage between Web Sites Krishna Bharat, Bay-Wei Chang, Monika Henzinger Google Inc. Mountain View, CA 94043, USA {krishna,bay,monika}@google.com Matthias Ruhl MIT Laboratory

More information

Distance Degree Sequences for Network Analysis

Distance Degree Sequences for Network Analysis Universität Konstanz Computer & Information Science Algorithmics Group 15 Mar 2005 based on Palmer, Gibbons, and Faloutsos: ANF A Fast and Scalable Tool for Data Mining in Massive Graphs, SIGKDD 02. Motivation

More information

Tutorial, IEEE SERVICE 2014 Anchorage, Alaska

Tutorial, IEEE SERVICE 2014 Anchorage, Alaska Tutorial, IEEE SERVICE 2014 Anchorage, Alaska Big Data Science: Fundamental, Techniques, and Challenges (Data Mining on Big Data) 2014. 6. 27. By Neil Y. Yen Presented by Incheon Paik University of Aizu

More information

Graph structure in the web

Graph structure in the web Graph structure in the web Andrei Broder 1, Ravi Kumar 2, Farzin Maghoul 1, Prabhakar Raghavan 2, Sridhar Rajagopalan 2, Raymie Stata 3, Andrew Tomkins 2, Janet Wiener 3 1: AltaVista Company, San Mateo,

More information

Comparing Trade-off Based Models of the Internet

Comparing Trade-off Based Models of the Internet Fundamenta Informaticae 92 (2009) 363 372 363 DOI 10.3233/FI-2009-79 IOS Press Comparing Trade-off Based Models of the Internet Anthony Spatharis, Ilias Foudalis, Martha Sideri Computer Science Department

More information

Dynamics of Large Networks. Jurij Leskovec

Dynamics of Large Networks. Jurij Leskovec Dynamics of Large Networks Jurij Leskovec September 2008 CMU-ML-08-111 Machine Learning Department School of Computer Science Carnegie Mellon University Pittsburgh, PA Thesis Committee: Christos Faloutsos,

More information

Mining Large Graphs. ECML/PKDD 2007 tutorial. Part 2: Diffusion and cascading behavior

Mining Large Graphs. ECML/PKDD 2007 tutorial. Part 2: Diffusion and cascading behavior Mining Large Graphs ECML/PKDD 2007 tutorial Part 2: Diffusion and cascading behavior Jure Leskovec and Christos Faloutsos Machine Learning Department Joint work with: Lada Adamic, Deepay Chakrabarti, Natalie

More information

Preserving Privacy and Frequent Sharing Patterns for Social Network Data Publishing

Preserving Privacy and Frequent Sharing Patterns for Social Network Data Publishing Preserving Privacy and Frequent Sharing Patterns for Social Network Data Publishing Benjamin C. M. Fung CIISE, Concordia University Montreal, QC, Canada fung@ciise.concordia.ca Yan an Jin Huazhong University

More information

DualIso: Scalable Subgraph Pattern Matching On Large Labeled Graphs SUPPLEMENT. Computer Science Department

DualIso: Scalable Subgraph Pattern Matching On Large Labeled Graphs SUPPLEMENT. Computer Science Department DualIso: Scalable Subgraph Pattern Matching On Large Labeled Graphs SUPPLEMENT Matthew Saltz, Ayushi Jain, Abhishek Kothari, Arash Fard, John A. Miller and Lakshmish Ramaswamy Computer Science Department

More information

DATA ANALYSIS II. Matrix Algorithms

DATA ANALYSIS II. Matrix Algorithms DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where

More information

Communication Dynamics of Blog Networks

Communication Dynamics of Blog Networks Communication Dynamics of Blog Networks Mark Goldberg CS Department, RPI 0 8th Street Troy, NY goldberg@cs.rpi.edu Konstantin Mertsalov CS Department, RPI 0 8th Street Troy, NY mertsk2@cs.rpi.edu Stephen

More information

Complex Networks Analysis: Clustering Methods

Complex Networks Analysis: Clustering Methods Complex Networks Analysis: Clustering Methods Nikolai Nefedov Spring 2013 ISI ETH Zurich nefedov@isi.ee.ethz.ch 1 Outline Purpose to give an overview of modern graph-clustering methods and their applications

More information

Looking at the Blogosphere Topology through Different Lenses

Looking at the Blogosphere Topology through Different Lenses Looking at the Blogosphere Topology through Different Lenses Xiaolin Shi Dept. of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI 48109 shixl@umich.edu Belle Tseng NEC

More information

An Incremental Super-Linear Preferential Internet Topology Model

An Incremental Super-Linear Preferential Internet Topology Model An Incremental Super-Linear Preferential Internet Topology Model Extended Abstract Sagy Bar 1, Mira Gonen 2, and Avishai Wool 3 1 School of Electrical Engineering, Tel Aviv University, Ramat Aviv 69978,

More information

Part 1: Link Analysis & Page Rank

Part 1: Link Analysis & Page Rank Chapter 8: Graph Data Part 1: Link Analysis & Page Rank Based on Leskovec, Rajaraman, Ullman 214: Mining of Massive Datasets 1 Exam on the 5th of February, 216, 14. to 16. If you wish to attend, please

More information

Power Law of Predictive Graphs

Power Law of Predictive Graphs Internet Mathematics Vol. 1, No. 1: 91-114 The Average Distance in a Random Graph with Given Expected Degrees FanChungandLinyuanLu Abstract. Random graph theory is used to examine the small-world phenomenon

More information

Degree distribution of the FKP network model

Degree distribution of the FKP network model Degree distribution of the FKP network model Noam Berger 1, Béla Bollobás 2,3, Christian Borgs 4, Jennifer Chayes 4, and Oliver Riordan 3 1 Department of Statistics, University of California, Berkeley,

More information

Link Prediction on Evolving Data using Tensor Factorization

Link Prediction on Evolving Data using Tensor Factorization Link Prediction on Evolving Data using Tensor Factorization Stephan Spiegel 1, Jan Clausen 1, Sahin Albayrak 1, and Jérôme Kunegis 2 1 DAI-Labor, Technical University Berlin Ernst-Reuter-Platz 7, 10587

More information

Network Theory: 80/20 Rule and Small Worlds Theory

Network Theory: 80/20 Rule and Small Worlds Theory Scott J. Simon / p. 1 Network Theory: 80/20 Rule and Small Worlds Theory Introduction Starting with isolated research in the early twentieth century, and following with significant gaps in research progress,

More information

Scaling Properties of the Internet Graph

Scaling Properties of the Internet Graph Scaling Properties of the Internet Graph Aditya Akella Shuchi Chawla Arvind Kannan Srinivasan Seshan Computer Science Department Carnegie Mellon University Pittsburgh, PA 53 {aditya,shuchi}@cs.cmu.edu

More information

CS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance

CS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance CS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance Susanne Halstead, Daniel Serrano, Scott Proctor 6 December 2014 1 Abstract The Behance social network allows professionals of diverse

More information

Community Mining from Multi-relational Networks

Community Mining from Multi-relational Networks Community Mining from Multi-relational Networks Deng Cai 1, Zheng Shao 1, Xiaofei He 2, Xifeng Yan 1, and Jiawei Han 1 1 Computer Science Department, University of Illinois at Urbana Champaign (dengcai2,

More information

Efficient Topological OLAP on Information Networks

Efficient Topological OLAP on Information Networks Efficient Topological OLAP on Information Networks Qiang Qu 1, Feida Zhu 2, Xifeng Yan 3, Jiawei Han 4, Philip S. Yu 5, and Hongyan Li 1 1 {quqiang,lihy}@cis.pku.edu.cn, 2 fdzhu@smu.edu.sg, 3 xyan@cs.ucsb.edu,

More information

Q. Yan, X. Huang School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing, China, 100876 Email: yq_10@sohu.

Q. Yan, X. Huang School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing, China, 100876 Email: yq_10@sohu. JOURNAL OF NETWORKS, VOL. 3, NO. 7, JULY 28 1 fuser Behavior and Topology Analysis Q. Yan, X. Huang School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing, China,

More information

Information Network or Social Network? The Structure of the Twitter Follow Graph

Information Network or Social Network? The Structure of the Twitter Follow Graph Information Network or Social Network? The Structure of the Twitter Follow Graph Seth A. Myers, Aneesh Sharma, Pankaj Gupta, and Jimmy Lin Twitter, Inc. @seth_a_myers @aneeshs @pankaj @lintool ABSTRACT

More information

MINING COMMUNITIES OF BLOGGERS: A CASE STUDY

MINING COMMUNITIES OF BLOGGERS: A CASE STUDY MINING COMMUNITIES OF BLOGGERS: A CASE STUDY ON CYBER-HATE Jennifer Xu Bentley College Waltham, MA, USA jxu@bentley.edu Michael Chau University of Hong Kong Pokfulam, Hong Kong mchau@business.hku.hk Abstract

More information

Dynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute

Dynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute Dynamics of information spread on networks Kristina Lerman USC Information Sciences Institute Information spread in online social networks Diffusion of activation on a graph, where each infected (activated)

More information

Analysis of Internet Topologies

Analysis of Internet Topologies Analysis of Internet Topologies Ljiljana Trajković ljilja@cs.sfu.ca Communication Networks Laboratory http://www.ensc.sfu.ca/cnl School of Engineering Science Simon Fraser University, Vancouver, British

More information

A scalable multilevel algorithm for graph clustering and community structure detection

A scalable multilevel algorithm for graph clustering and community structure detection A scalable multilevel algorithm for graph clustering and community structure detection Hristo N. Djidjev 1 Los Alamos National Laboratory, Los Alamos, NM 87545 Abstract. One of the most useful measures

More information

Temporal Visualization and Analysis of Social Networks

Temporal Visualization and Analysis of Social Networks Temporal Visualization and Analysis of Social Networks Peter A. Gloor*, Rob Laubacher MIT {pgloor,rjl}@mit.edu Yan Zhao, Scott B.C. Dynes *Dartmouth {yan.zhao,sdynes}@dartmouth.edu Abstract This paper

More information

On the Bursty Evolution of Online Social Networks

On the Bursty Evolution of Online Social Networks On the Bursty Evolution of Online Social Networks Sabrina Gaito, Matteo Zignani, Gian Paolo Rossi, Alessandra Sala Xiaohan Zhao, Haitao Zheng, Ben Y. Zhao Computer Science, Universitá degli Studi di Milano

More information

Hadoop Based Link Prediction Performance Analysis

Hadoop Based Link Prediction Performance Analysis Hadoop Based Link Prediction Performance Analysis Yuxiao Dong, Casey Robinson, Jian Xu Department of Computer Science and Engineering University of Notre Dame Notre Dame, IN 46556, USA Email: ydong1@nd.edu,

More information

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?

Network/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like? What is a Network? Network/Graph Theory Network = graph Informally a graph is a set of nodes joined by a set of lines or arrows. 1 1 2 3 2 3 4 5 6 4 5 6 Graph-based representations Representing a problem

More information

-Duplication of Time-Varying Graphs

-Duplication of Time-Varying Graphs -Duplication of Time-Varying Graphs François Queyroi Sorbonne Universités, UPMC Univ Paris 06, UMR 7606, LIP6, F-75005, Paris CNRS, UMR 7606, LIP6, F-75005, Paris, France francois.queyroi@lip6.fr ABSTRACT.

More information

Efficient target control of complex networks based on preferential matching

Efficient target control of complex networks based on preferential matching Efficient target control of complex networks based on preferential matching Xizhe Zhang 1, Huaizhen Wang 1, Tianyang Lv 2,3 1 (School of Computer Science and Engineering, Northeastern University, Shenyang110819,

More information

Maximizing Restorable Throughput in MPLS Networks

Maximizing Restorable Throughput in MPLS Networks Maximizing Restorable Throughput in MPLS Networks Reuven Cohen Gabi Nakibly Technion Israel Institute of Technology, Computer Science, Haifa, Israel Abstract MPLS recovery mechanisms are increasing in

More information

MANAGING AND MINING GRAPH DATA

MANAGING AND MINING GRAPH DATA MANAGING AND MINING GRAPH DATA MANAGING AND MINING GRAPH DATA Edited by CHARU C. AGGARWAL IBM T. J. Watson Research Center, Yorktown Heights, NY 10598, USA HAIXUN WANG Microsoft Research Asia, Beijing,

More information

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 08: Computer networks Version: March 3, 2011 2 / 53 Contents

More information

Jellyfish: A Conceptual Model for the AS Internet Topology

Jellyfish: A Conceptual Model for the AS Internet Topology Jellyfish: A Conceptual Model for the AS Internet Topology Georgos Siganos U. C. Riverside siganos@cs.ucr.edu Sudhir L. Tauro U. C. Riverside stauro@cs.ucr.edu Michalis Faloutsos U. C. Riverside michalis@cs.ucr.edu

More information

Top Top 10 Algorithms in Data Mining

Top Top 10 Algorithms in Data Mining ICDM 06 Panel on Top Top 10 Algorithms in Data Mining 1. The 3-step identification process 2. The 18 identified candidates 3. Algorithm presentations 4. Top 10 algorithms: summary 5. Open discussions ICDM

More information

Mining the Temporal Dimension of the Information Propagation

Mining the Temporal Dimension of the Information Propagation Mining the Temporal Dimension of the Information Propagation Michele Berlingerio, Michele Coscia 2, and Fosca Giannotti 3 IMT-Lucca, Lucca, Italy 2 Dipartimento di Informatica, Pisa, Italy {name.surname}@isti.cnr.it

More information

Social Media Mining. Graph Essentials

Social Media Mining. Graph Essentials Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures

More information

Top 10 Algorithms in Data Mining

Top 10 Algorithms in Data Mining Top 10 Algorithms in Data Mining Xindong Wu ( 吴 信 东 ) Department of Computer Science University of Vermont, USA; 合 肥 工 业 大 学 计 算 机 与 信 息 学 院 1 Top 10 Algorithms in Data Mining by the IEEE ICDM Conference

More information

Evolution of the Internet AS-Level Ecosystem

Evolution of the Internet AS-Level Ecosystem Evolution of the Internet AS-Level Ecosystem Srinivas Shakkottai, Marina Fomenkov 2, Ryan Koga 2, Dmitri Krioukov 2, and kc claffy 2 Texas A&M University, College Station, USA, sshakkot@tamu.edu, 2 Cooperative

More information

Analysis of Internet Topologies: A Historical View

Analysis of Internet Topologies: A Historical View Analysis of Internet Topologies: A Historical View Mohamadreza Najiminaini, Laxmi Subedi, and Ljiljana Trajković Communication Networks Laboratory http://www.ensc.sfu.ca/cnl Simon Fraser University Vancouver,

More information

SCAN: A Structural Clustering Algorithm for Networks

SCAN: A Structural Clustering Algorithm for Networks SCAN: A Structural Clustering Algorithm for Networks Xiaowei Xu University of Arkansas at Little Rock xwxu@ualr.edu Nurcan Yuruk, Zhidan Feng University of Arkansas at Little Rock {nxyuruk, zxfeng@ualr.edu}

More information