School of Computer Science Carnegie Mellon Graph Mining, self-similarity and power laws
|
|
- Eugene McCarthy
- 8 years ago
- Views:
Transcription
1 Graph Mining, self-similarity and power laws Christos Faloutsos University
2 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 2
3 Motivating questions: How does the Internet look like? What constitutes a normal social network? network value of a customer? [Domingos+] which gene/species affects the others the most? DoE/DoD, 2007 C. Faloutsos 3
4 Problem #1 - topology How does the Internet look like? Any rules? DoE/DoD, 2007 C. Faloutsos 4
5 Solution#1: Rank exponent R A1: Power law in the degree distribution [SIGCOMM99] internet domains log(degree) ibm.com att.com log(rank) DoE/DoD, 2007 C. Faloutsos 5
6 Power laws In- and out-degree distribution of web sites [Barabasi], [IBM-CLEVER] log(freq) from [Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew Tomkins ] log indegree DoE/DoD, 2007 C. Faloutsos 6
7 epinions.com count who-trusts-whom [Richardson + Domingos, KDD 2001] (out) degree DoE/DoD, 2007 C. Faloutsos 7
8 Even more power laws: web hit counts [w/ A. Montgomery] Web Site Traffic log(count) Zipf yahoo.com log(freq) DoE/DoD, 2007 C. Faloutsos 8
9 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 9
10 Problem#2: evolution Given a graph: how will it look like, next year? [from Lumeta: ISPs 6/1999] DoE/DoD, 2007 C. Faloutsos 10
11 Evolution of diameter? Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(n))) i.e.., slowly increasing with network size Q: What is happening, in reality? DoE/DoD, 2007 C. Faloutsos 11
12 Evolution of diameter? Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(n))) i.e.., slowly increasing with network size Q: What is happening, in reality? A: It shrinks(!!), towards a constant value DoE/DoD, 2007 C. Faloutsos 12
13 Shrinking diameter ArXiv physics papers and their citations [Leskovec+05a] DoE/DoD, 2007 C. Faloutsos 13
14 Shrinking diameter ArXiv: who wrote what DoE/DoD, 2007 C. Faloutsos 14
15 Shrinking diameter U.S. patents citing each other DoE/DoD, 2007 C. Faloutsos 15
16 Shrinking diameter Autonomous systems DoE/DoD, 2007 C. Faloutsos 16
17 Temporal evolution of graphs N(t) nodes; E(t) edges at time t suppose that N(t+1) = 2 * N(t) Q: what is your guess for E(t+1) =? 2 * E(t) DoE/DoD, 2007 C. Faloutsos 17
18 Temporal evolution of graphs N(t) nodes; E(t) edges at time t suppose that N(t+1) = 2 * N(t) Q: what is your guess for E(t+1) =? x2 * E(t) A: over-doubled! DoE/DoD, 2007 C. Faloutsos 18
19 Densification Power Law ArXiv: Physics papers and their citations E(t) 1.69 N(t) DoE/DoD, 2007 C. Faloutsos 19
20 Densification Power Law U.S. Patents, citing each other E(t) 1.66 N(t) DoE/DoD, 2007 C. Faloutsos 20
21 Densification Power Law Autonomous Systems E(t) 1.18 N(t) DoE/DoD, 2007 C. Faloutsos 21
22 Densification Power Law ArXiv: who wrote what E(t) 1.15 N(t) DoE/DoD, 2007 C. Faloutsos 22
23 Summary of laws Static graphs power law degrees; power law eigenvalues communities (within communities) small diameters Dynamic graphs shrinking diameter densification power law DoE/DoD, 2007 C. Faloutsos 23
24 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 24
25 Problem#3: Generators Q: what local behavior can generate such graphs? DoE/DoD, 2007 C. Faloutsos 25
26 Problem#3: Generators Q: what local behavior can generate such graphs? A1: Preferential attachment [Barabasi+] A2: copying model [Kleinberg+] A3: forest-fire model [Leskovec+] A4: Kronecker [Leskovec+] A5: Economic reasons [Papadimitriou+] DoE/DoD, 2007 C. Faloutsos 26
27 Problem#3: Generators Q: what local behavior can generate such graphs? A1: Preferential attachment A2: copying model A3: forest-fire model A4: Kronecker A5: Economic reasons power law degree + communities + DPL + easily parallilizable DoE/DoD, 2007 C. Faloutsos 27
28 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 28
29 Problem#4: influence propagation how do influence/rumors/viruses propagate? what is the best customer to market to? DoE/DoD, 2007 C. Faloutsos 29
30 Problem#4: influence propagation how do influence/rumors/viruses propagate? tipping point [Kleinberg+] first eigenvalue -> epidemic threshold [Chakrabarti+] what is the best customer to market to? network value of a customer [Domingos+] DoE/DoD, 2007 C. Faloutsos 30
31 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 31
32 Problem #5: communities how to find natural communities, quickly? how to find strange /suspicious/valuable edges? DoE/DoD, 2007 C. Faloutsos 32
33 Problem #5: communities how to find natural communities, quickly? network flow [Flake+] node/edge betweeness (~ stress ) cross-associations [Chakrabarti+] 2 nd eigenvalue; METIS [Karypis+] random walks [Newman] etc etc etc DoE/DoD, 2007 C. Faloutsos 33
34 Problem #5: communities connection sub-graphs [Faloutsos+] DoE/DoD, 2007 C. Faloutsos 34
35 Problem #5: communities connection sub-graphs [Faloutsos+] DoE/DoD, 2007 C. Faloutsos 35
36 Problem #5: communities connection sub-graphs [Faloutsos+] BANKS system [Chakrabarti+] ObjectRank [Papakonstantinou+] DoE/DoD, 2007 C. Faloutsos 36
37 Overview Achievements global patterns and laws (static/dynamic) generators influence propagation communities; graph partitioning local patterns: frequent subgraphs Challenges DoE/DoD, 2007 C. Faloutsos 37
38 Problem #6: local patterns Which sub-graphs are common/frequent? H C C N C C C H N C molecule #1 molecule #M DoE/DoD, 2007 C. Faloutsos 38
39 Problem #6: local patterns Which sub-graphs are common/frequent? H C C N C C C H N C molecule #1 molecule #M DoE/DoD, 2007 C. Faloutsos 39
40 Problem #6: local patterns Clever extensions of Association Rules (= frequent itemsets / market basket analysis) [Jiawei Han+] [Jian Pei+] [G. Karypis] [M. Zaki+]... DoE/DoD, 2007 C. Faloutsos 40
41 Overview Achievements Challenges time evolving graphs multi-graphs DoE/DoD, 2007 C. Faloutsos 41
42 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/1/2005 DoE/DoD, 2007 C. Faloutsos 42
43 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/2/2005 DoE/DoD, 2007 C. Faloutsos 43
44 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/3/2005 DoE/DoD, 2007 C. Faloutsos 44
45 Time evolving graphs Q: what will happen next? (eg., on a traffic matrix?) 1/4/2005???? DoE/DoD, 2007 C. Faloutsos 45
46 Overview Achievements Challenges time evolving graphs multi-graphs DoE/DoD, 2007 C. Faloutsos 46
47 Multi-graphs Patterns/outliers? friends co-authors DoE/DoD, 2007 C. Faloutsos 47
48 Multi-graphs Relational Learning link prediction, feature extraction [Jensen+] etc Dis-ambiguation, de-duplication [Getoor+] etc friends co-authors DoE/DoD, 2007 C. Faloutsos 48
49 source person School of Computer Science Promising solution: tensors I x J x K destination person type of contact DoE/DoD, 2007 C. Faloutsos 49
50 Tensors: ~SVD, for >=3 modes C I x J x K Ix R K x R Jx R ¼ A R x R x R B = + + Foil: from Tamara Kolda (Sandia) DoE/DoD, 2007 C. Faloutsos 50
51 Specially Structured Tensors Tucker Tensor Kruskal Tensor Our Notation Our Notation core W K x T K x R W I x J x K Ix R Jx S I x J x K w 1 Ix R w R Jx R = U R x S x T V = = u 1 v 1 U + + V R x R x R u R v R Foil: from Tamara Kolda (Sandia) DoE/DoD, 2007 C. Faloutsos 51
52 Conclusions - achievements Surprising patterns in graphs power laws; communities small/shrinking diameters simple, local behavior can lead to such patterns (eg., preferential attachment, etc) fast algorithms for communities/partitioning fast algorithms for frequent subgraphs DoE/DoD, 2007 C. Faloutsos 52
53 Conclusions - next steps multi-graphs time-evolving graphs scalability (graph sampling) (large graph visualization) DoE/DoD, 2007 C. Faloutsos 53
54 Conclusions - philosophically: deep connections with self-similarity, cellular automata (~agents), and fractals DoE/DoD, 2007 C. Faloutsos 54
55 Resources Manfred Schroeder Chaos, Fractals and Power Laws, 1991 A-L. Barabasi, Linked, 2002 D. Watts, Six Degrees, 2004 DoE/DoD, 2007 C. Faloutsos 55
56 References R. Albert, H. Jeong, and A.-L. Barabási, Diameter of the World Wide Web. Nature 401, (1999) A. Fabrikant, E. Koutsoupias, and C. Papadimitriou. Heuristically Optimized Tradeoffs: A New Paradigm for Power Laws in the Internet. Int. Colloquium on Automata, Languages, and Programming (ICALP), Malaga, Spain, July DoE/DoD, 2007 C. Faloutsos 56
57 References [sigcomm99] Michalis Faloutsos, Petros Faloutsos and Christos Faloutsos, What does the Internet look like? Empirical Laws of the Internet Topology, SIGCOMM 1999 [Leskovec 05] Jure Leskovec, Jon M. Kleinberg, Christos Faloutsos: Graphs over time: densification laws, shrinking diameters and possible explanations. KDD 2005: DoE/DoD, 2007 C. Faloutsos 57
58 References [brite] Alberto Medina, Anukool Lakhina, Ibrahim Matta, and John Byers. BRITE: An Approach to Universal Topology Generation. MASCOTS '01 Xifeng Yan, Xianghong Jasmine Zhou, Jiawei Han: Mining closed relational graphs with connectivity constraints. KDD 2005: DoE/DoD, 2007 C. Faloutsos 58
59 Thank you! Contact info: christos <at> cs.cmu.edu www. cs.cmu.edu /~christos (w/ papers, datasets, code, etc) DoE/DoD, 2007 C. Faloutsos 59
Graph Mining Techniques for Social Media Analysis
Graph Mining Techniques for Social Media Analysis Mary McGlohon Christos Faloutsos 1 1-1 What is graph mining? Extracting useful knowledge (patterns, outliers, etc.) from structured data that can be represented
More informationGraphs over Time Densification Laws, Shrinking Diameters and Possible Explanations
Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU 1 Introduction What can we do with graphs? What patterns
More informationThe Shape of the Network. The Shape of the Internet. Why study topology? Internet topologies. Early work. More on topologies..
The Shape of the Internet Slides assembled by Jeff Chase Duke University (thanks to and ) The Shape of the Network Characterizing shape : AS-level topology: who connects to whom Router-level topology:
More informationCMU SCS Large Graph Mining Patterns, Tools and Cascade analysis
Large Graph Mining Patterns, Tools and Cascade analysis Christos Faloutsos CMU Roadmap Introduction Motivation Why big data Why (big) graphs? Patterns in graphs Tools: fraud detection on e-bay Conclusions
More informationGraph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003
Graph models for the Web and the Internet Elias Koutsoupias University of Athens and UCLA Crete, July 2003 Outline of the lecture Small world phenomenon The shape of the Web graph Searching and navigation
More informationThank you! NetMine Data mining on networks IIS -0209107 AWSOM. Outline. Proposed method. Goals
NetMine Data mining on networks IIS -0209107 Christos Faloutsos (CMU) Michalis Faloutsos (UCR) Peggy Agouris George Kollios Fillia Makedon Betty Salzberg Anthony Stefanidis Thank you! NSF-IDM 04 C. Faloutsos
More informationExpansion Properties of Large Social Graphs
Expansion Properties of Large Social Graphs Fragkiskos D. Malliaros 1 and Vasileios Megalooikonomou 1,2 1 Computer Engineering and Informatics Department University of Patras, 26500 Rio, Greece 2 Data
More informationComputer Network Topologies: Models and Generation Tools
Consiglio Nazionale delle Ricerche Technical Report n. 5/200 Computer Network Topologies: Models and Generation Tools Giuseppe Di Fatta, Giuseppe Lo Presti 2, Giuseppe Lo Re CE.R.E. Researcher 2 CE.R.E.,
More informationCMU SCS Mining Billion-Node Graphs - Patterns and Algorithms
Mining Billion-Node Graphs - Patterns and Algorithms Christos Faloutsos CMU Thank you! Panos Chrysanthis Ling Liu Vladimir Zadorozhny Prashant Krishnamurthy C. Faloutsos (CMU) 2 Resource Open source system
More informationThe ebay Graph: How Do Online Auction Users Interact?
The ebay Graph: How Do Online Auction Users Interact? Yordanos Beyene, Michalis Faloutsos University of California, Riverside {yordanos, michalis}@cs.ucr.edu Duen Horng (Polo) Chau, Christos Faloutsos
More informationOn Generating Graphs with Prescribed Vertex Degrees for Complex Network Modeling 1. College of Computing Georgia Institute of Technology
On Generating Graphs with Prescribed Vertex Degrees for Complex Network Modeling 1 Milena Mihail Nisheeth K. Vishnoi College of Computing Georgia Institute of Technology Abstract Graph models for real-world
More informationCMU SCS Mining Large Graphs and Tensors - Patterns, Tools and Discoveries.
Mining Large Graphs and Tensors - Patterns, Tools and Discoveries. Christos Faloutsos CMU Thank you! Nikos Sidiropoulos Kuo-Chu Chang Zhi (Gerry) Tian C. Faloutsos (CMU) 2 Roadmap Introduction Motivation
More informationIntroduction to Graph Mining
Introduction to Graph Mining What is a graph? A graph G = (V,E) is a set of vertices V and a set (possibly empty) E of pairs of vertices e 1 = (v 1, v 2 ), where e 1 E and v 1, v 2 V. Edges may contain
More informationAnalyzing the Facebook graph?
Logistics Big Data Algorithmic Introduction Prof. Yuval Shavitt Contact: shavitt@eng.tau.ac.il Final grade: 4 6 home assignments (will try to include programing assignments as well): 2% Exam 8% Big Data
More information(B ) Empirical observation: Shrinking diameters: The effective diameter is, in many cases, actually decreasing as the network grows.
Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations Jure Leskovec Carnegie Mellon University jure@cs.cmu.edu Jon Kleinberg Cornell University kleinber@cs.cornell.edu Christos
More informationHow To Find Out How A Graph Densifies
Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations Jure Leskovec Carnegie Mellon University jure@cs.cmu.edu Jon Kleinberg Cornell University kleinber@cs.cornell.edu Christos
More informationGraph Mining and Social Network Analysis
Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
More informationJure Leskovec (@jure) Stanford University
Jure Leskovec (@jure) Stanford University KDD Summer School, Beijing, August 2012 8/10/2012 Jure Leskovec (@jure), KDD Summer School 2012 2 Graph: Kronecker graphs Graph Node attributes: MAG model Graph
More informationBig Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network
, pp.273-284 http://dx.doi.org/10.14257/ijdta.2015.8.5.24 Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network Gengxin Sun 1, Sheng Bin 2 and
More informationAn Alternative Web Search Strategy? Abstract
An Alternative Web Search Strategy? V.-H. Winterer, Rechenzentrum Universität Freiburg (Dated: November 2007) Abstract We propose an alternative Web search strategy taking advantage of the knowledge on
More informationStudying E-mail Graphs for Intelligence Monitoring and Analysis in the Absence of Semantic Information
Studying E-mail Graphs for Intelligence Monitoring and Analysis in the Absence of Semantic Information Petros Drineas, Mukkai S. Krishnamoorthy, Michael D. Sofka Bülent Yener Department of Computer Science,
More informationThe average distances in random graphs with given expected degrees
Classification: Physical Sciences, Mathematics The average distances in random graphs with given expected degrees by Fan Chung 1 and Linyuan Lu Department of Mathematics University of California at San
More informationThe Internet Is Like A Jellyfish
The Internet Is Like A Jellyfish Michalis Faloutsos UC Riverside Joint work with: Leslie Tauro, Georgos Siganos (UCR) Chris Palmer(CMU) Big Picture: Modeling the Internet Topology Traffic Protocols Routing,
More informationEvolutionary Network Analysis: A Survey
Evolutionary Network Analysis: A Survey CHARU AGGARWAL, IBM T. J. Watson Research Center KARTHIK SUBBIAN, University of Minnesota Evolutionary network analysis has found an increasing interest in the literature
More informationAN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS
Chapter 1 AN INTRODUCTION TO SOCIAL NETWORK DATA ANALYTICS Charu C. Aggarwal IBM T. J. Watson Research Center Hawthorne, NY 10532 charu@us.ibm.com Abstract The advent of online social networks has been
More informationarxiv:cs.dm/0204001 v1 30 Mar 2002
A Steady State Model for Graph Power Laws David Eppstein Joseph Wang arxiv:cs.dm/0000 v 0 Mar 00 Abstract Power law distribution seems to be an important characteristic of web graphs. Several existing
More informationA SURVEY OF PROPERTIES AND MODELS OF ON- LINE SOCIAL NETWORKS
International Conference on Mathematical and Computational Models PSG College of Technology, Coimbatore Copyright 2009, Narosa Publishing House, New Delhi, India A SURVEY OF PROPERTIES AND MODELS OF ON-
More informationWhy Rumors Spread Fast in Social Networks
Why Rumors Spread Fast in Social Networks Benjamin Doerr 1, Mahmoud Fouz 2, and Tobias Friedrich 1,2 1 Max-Planck-Institut für Informatik, Saarbrücken, Germany 2 Universität des Saarlandes, Saarbrücken,
More informationWith the increasing availability of large social network data, there is
Learning and Predicting the Evolution of Social Networks Björn Bringmann, Katholieke Universiteit Leuven Michele Berlingerio, ISTI-CNR Francesco Bonchi and Aristides Gionis, Yahoo Research Graph Evolution
More informationAn Analysis of Verifications in Microblogging Social Networks - Sina Weibo
An Analysis of Verifications in Microblogging Social Networks - Sina Weibo Junting Chen and James She HKUST-NIE Social Media Lab Dept. of Electronic and Computer Engineering The Hong Kong University of
More informationFoundations of Multidimensional Network Analysis
2 International Conference on Advances in Social Networks Analysis and Mining Foundations of Multidimensional Network Analysis Michele Berlingerio Michele Coscia 2 Fosca Giannotti Anna Monreale 2 Dino
More informationA dynamic model for on-line social networks
A dynamic model for on-line social networks A. Bonato 1, N. Hadi, P. Horn, P. Pra lat 4, and C. Wang 1 1 Ryerson University, Toronto, Canada abonato@ryerson.ca, cpwang@ryerson.ca Wilfrid Laurier University,
More informationOnline Estimating the k Central Nodes of a Network
Online Estimating the k Central Nodes of a Network Yeon-sup Lim, Daniel S. Menasché, Bruno Ribeiro, Don Towsley, and Prithwish Basu Department of Computer Science UMass Amherst, Raytheon BBN Technologies
More informationTowards Modeling Legitimate and Unsolicited Email Traffic Using Social Network Properties
Towards Modeling Legitimate and Unsolicited Email Traffic Using Social Network Properties Farnaz Moradi Tomas Olovsson Philippas Tsigas Computer Science and Engineering Chalmers University of Technology,
More informationSome questions... Graphs
Uni Innsbruck Informatik - 1 Uni Innsbruck Informatik - 2 Some questions... Peer-to to-peer Systems Analysis of unstructured P2P systems How scalable is Gnutella? How robust is Gnutella? Why does FreeNet
More informationInternet Topology of the Power-Laws
Power-Laws and the AS-level Internet Topology Georgos Siganos,Michalis Faloutsos,Petros Faloutsos,Christos Faloutsos Abstract In this paper, we study and characterize the topology of the Internet at the
More informationIntroduction to Networks and Business Intelligence
Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 17th, 2015 Outline Network Science A Random History Network Analysis Network Topological
More informationOn Realistic Network Topologies for Simulation
On Realistic Network Topologies for Simulation Oliver Heckmann, Michael Piringer, Jens Schmitt, Ralf Steinmetz Multimedia Communications (KOM), Department of Electronic Engineering & Information Technology
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)
More informationComplex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics
Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics Zhao Wenbin 1, Zhao Zhengxu 2 1 School of Instrument Science and Engineering, Southeast University, Nanjing, Jiangsu
More informationSocial Networks and Social Media
Social Networks and Social Media Social Media: Many-to-Many Social Networking Content Sharing Social Media Blogs Microblogging Wiki Forum 2 Characteristics of Social Media Consumers become Producers Rich
More informationCharacterizing and Modelling Clustering Features in AS-Level Internet Topology
Characterizing and Modelling Clustering Features in AS-Level Topology Yan Li, Jun-Hong Cui, Dario Maggiorini and Michalis Faloutsos yan.li@uconn.edu, jcui@engr.uconn.edu, dario@dico.unimi.it, michalis@cs.ucr.edu
More informationAnalyzing and modelling the AS-level Internet topology
Analyzing and modelling the AS-level Internet topology Shi Zhou & Raul J. Mondragon Department of Electronic Engineering Queen Mary, University of London Mile End Road, London, E1 4NS, United Kingdom Email:
More informationMobile Call Graphs: Beyond Power-Law and Lognormal Distributions
Mobile Call Graphs: Beyond Power-Law and Lognormal Distributions Mukund Seshadri Sprint Burlingame, California, USA mukund.seshadri@sprint.com Jean Bolot Sprint Burlingame, California, USA bolot@sprint.com
More informationHomophily in Online Social Networks
Homophily in Online Social Networks Bassel Tarbush and Alexander Teytelboym Department of Economics, University of Oxford bassel.tarbush@economics.ox.ac.uk Department of Economics, University of Oxford
More informationCommunity Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer
Community Detection Proseminar - Elementary Data Mining Techniques by Simon Grätzer 1 Content What is Community Detection? Motivation Defining a community Methods to find communities Overlapping communities
More informationResearch Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging
Mathematical Problems in Engineering, Article ID 578713, 6 pages http://dx.doi.org/10.1155/2014/578713 Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of
More informationPart 2: Community Detection
Chapter 8: Graph Data Part 2: Community Detection Based on Leskovec, Rajaraman, Ullman 2014: Mining of Massive Datasets Big Data Management and Analytics Outline Community Detection - Social networks -
More informationSocial Network Mining
Social Network Mining Data Mining November 11, 2013 Frank Takes (ftakes@liacs.nl) LIACS, Universiteit Leiden Overview Social Network Analysis Graph Mining Online Social Networks Friendship Graph Semantics
More informationMathematical Challenges in Cybersecurity
SANDIA REPORT SAND 2009-0805 Unlimited Release Printed February 2009 Mathematical Challenges in Cybersecurity Daniel M. Dunlavy, Bruce Hendrickson, and Tamara G. Kolda Prepared by Sandia National Laboratories
More informationCost effective Outbreak Detection in Networks
Cost effective Outbreak Detection in Networks Jure Leskovec Joint work with Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, and Natalie Glance Diffusion in Social Networks One of
More informationSOCIAL NETWORK DATA ANALYTICS
SOCIAL NETWORK DATA ANALYTICS SOCIAL NETWORK DATA ANALYTICS Edited by CHARU C. AGGARWAL IBM T. J. Watson Research Center, Yorktown Heights, NY 10598, USA Kluwer Academic Publishers Boston/Dordrecht/London
More informationNetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks
NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks Shashank Pandit, Duen Horng Chau, Samuel Wang, Christos Faloutsos Carnegie Mellon University Pittsburgh, PA 15213, USA
More informationSearch in BigData2 - When Big Text meets Big Graph 1. Introduction State of the Art on Big Data
Search in BigData 2 - When Big Text meets Big Graph Christos Giatsidis, Fragkiskos D. Malliaros, François Rousseau, Michalis Vazirgiannis Computer Science Laboratory, École Polytechnique, France {giatsidis,
More informationWho Links to Whom: Mining Linkage between Web Sites
Who Links to Whom: Mining Linkage between Web Sites Krishna Bharat, Bay-Wei Chang, Monika Henzinger Google Inc. Mountain View, CA 94043, USA {krishna,bay,monika}@google.com Matthias Ruhl MIT Laboratory
More informationDistance Degree Sequences for Network Analysis
Universität Konstanz Computer & Information Science Algorithmics Group 15 Mar 2005 based on Palmer, Gibbons, and Faloutsos: ANF A Fast and Scalable Tool for Data Mining in Massive Graphs, SIGKDD 02. Motivation
More informationTutorial, IEEE SERVICE 2014 Anchorage, Alaska
Tutorial, IEEE SERVICE 2014 Anchorage, Alaska Big Data Science: Fundamental, Techniques, and Challenges (Data Mining on Big Data) 2014. 6. 27. By Neil Y. Yen Presented by Incheon Paik University of Aizu
More informationGraph structure in the web
Graph structure in the web Andrei Broder 1, Ravi Kumar 2, Farzin Maghoul 1, Prabhakar Raghavan 2, Sridhar Rajagopalan 2, Raymie Stata 3, Andrew Tomkins 2, Janet Wiener 3 1: AltaVista Company, San Mateo,
More informationComparing Trade-off Based Models of the Internet
Fundamenta Informaticae 92 (2009) 363 372 363 DOI 10.3233/FI-2009-79 IOS Press Comparing Trade-off Based Models of the Internet Anthony Spatharis, Ilias Foudalis, Martha Sideri Computer Science Department
More informationDynamics of Large Networks. Jurij Leskovec
Dynamics of Large Networks Jurij Leskovec September 2008 CMU-ML-08-111 Machine Learning Department School of Computer Science Carnegie Mellon University Pittsburgh, PA Thesis Committee: Christos Faloutsos,
More informationMining Large Graphs. ECML/PKDD 2007 tutorial. Part 2: Diffusion and cascading behavior
Mining Large Graphs ECML/PKDD 2007 tutorial Part 2: Diffusion and cascading behavior Jure Leskovec and Christos Faloutsos Machine Learning Department Joint work with: Lada Adamic, Deepay Chakrabarti, Natalie
More informationPreserving Privacy and Frequent Sharing Patterns for Social Network Data Publishing
Preserving Privacy and Frequent Sharing Patterns for Social Network Data Publishing Benjamin C. M. Fung CIISE, Concordia University Montreal, QC, Canada fung@ciise.concordia.ca Yan an Jin Huazhong University
More informationDualIso: Scalable Subgraph Pattern Matching On Large Labeled Graphs SUPPLEMENT. Computer Science Department
DualIso: Scalable Subgraph Pattern Matching On Large Labeled Graphs SUPPLEMENT Matthew Saltz, Ayushi Jain, Abhishek Kothari, Arash Fard, John A. Miller and Lakshmish Ramaswamy Computer Science Department
More informationDATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
More informationCommunication Dynamics of Blog Networks
Communication Dynamics of Blog Networks Mark Goldberg CS Department, RPI 0 8th Street Troy, NY goldberg@cs.rpi.edu Konstantin Mertsalov CS Department, RPI 0 8th Street Troy, NY mertsk2@cs.rpi.edu Stephen
More informationComplex Networks Analysis: Clustering Methods
Complex Networks Analysis: Clustering Methods Nikolai Nefedov Spring 2013 ISI ETH Zurich nefedov@isi.ee.ethz.ch 1 Outline Purpose to give an overview of modern graph-clustering methods and their applications
More informationLooking at the Blogosphere Topology through Different Lenses
Looking at the Blogosphere Topology through Different Lenses Xiaolin Shi Dept. of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI 48109 shixl@umich.edu Belle Tseng NEC
More informationAn Incremental Super-Linear Preferential Internet Topology Model
An Incremental Super-Linear Preferential Internet Topology Model Extended Abstract Sagy Bar 1, Mira Gonen 2, and Avishai Wool 3 1 School of Electrical Engineering, Tel Aviv University, Ramat Aviv 69978,
More informationPart 1: Link Analysis & Page Rank
Chapter 8: Graph Data Part 1: Link Analysis & Page Rank Based on Leskovec, Rajaraman, Ullman 214: Mining of Massive Datasets 1 Exam on the 5th of February, 216, 14. to 16. If you wish to attend, please
More informationPower Law of Predictive Graphs
Internet Mathematics Vol. 1, No. 1: 91-114 The Average Distance in a Random Graph with Given Expected Degrees FanChungandLinyuanLu Abstract. Random graph theory is used to examine the small-world phenomenon
More informationDegree distribution of the FKP network model
Degree distribution of the FKP network model Noam Berger 1, Béla Bollobás 2,3, Christian Borgs 4, Jennifer Chayes 4, and Oliver Riordan 3 1 Department of Statistics, University of California, Berkeley,
More informationLink Prediction on Evolving Data using Tensor Factorization
Link Prediction on Evolving Data using Tensor Factorization Stephan Spiegel 1, Jan Clausen 1, Sahin Albayrak 1, and Jérôme Kunegis 2 1 DAI-Labor, Technical University Berlin Ernst-Reuter-Platz 7, 10587
More informationNetwork Theory: 80/20 Rule and Small Worlds Theory
Scott J. Simon / p. 1 Network Theory: 80/20 Rule and Small Worlds Theory Introduction Starting with isolated research in the early twentieth century, and following with significant gaps in research progress,
More informationScaling Properties of the Internet Graph
Scaling Properties of the Internet Graph Aditya Akella Shuchi Chawla Arvind Kannan Srinivasan Seshan Computer Science Department Carnegie Mellon University Pittsburgh, PA 53 {aditya,shuchi}@cs.cmu.edu
More informationCS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance
CS224W Project Report: Finding Top UI/UX Design Talent on Adobe Behance Susanne Halstead, Daniel Serrano, Scott Proctor 6 December 2014 1 Abstract The Behance social network allows professionals of diverse
More informationCommunity Mining from Multi-relational Networks
Community Mining from Multi-relational Networks Deng Cai 1, Zheng Shao 1, Xiaofei He 2, Xifeng Yan 1, and Jiawei Han 1 1 Computer Science Department, University of Illinois at Urbana Champaign (dengcai2,
More informationEfficient Topological OLAP on Information Networks
Efficient Topological OLAP on Information Networks Qiang Qu 1, Feida Zhu 2, Xifeng Yan 3, Jiawei Han 4, Philip S. Yu 5, and Hongyan Li 1 1 {quqiang,lihy}@cis.pku.edu.cn, 2 fdzhu@smu.edu.sg, 3 xyan@cs.ucsb.edu,
More informationQ. Yan, X. Huang School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing, China, 100876 Email: yq_10@sohu.
JOURNAL OF NETWORKS, VOL. 3, NO. 7, JULY 28 1 fuser Behavior and Topology Analysis Q. Yan, X. Huang School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing, China,
More informationInformation Network or Social Network? The Structure of the Twitter Follow Graph
Information Network or Social Network? The Structure of the Twitter Follow Graph Seth A. Myers, Aneesh Sharma, Pankaj Gupta, and Jimmy Lin Twitter, Inc. @seth_a_myers @aneeshs @pankaj @lintool ABSTRACT
More informationMINING COMMUNITIES OF BLOGGERS: A CASE STUDY
MINING COMMUNITIES OF BLOGGERS: A CASE STUDY ON CYBER-HATE Jennifer Xu Bentley College Waltham, MA, USA jxu@bentley.edu Michael Chau University of Hong Kong Pokfulam, Hong Kong mchau@business.hku.hk Abstract
More informationDynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute
Dynamics of information spread on networks Kristina Lerman USC Information Sciences Institute Information spread in online social networks Diffusion of activation on a graph, where each infected (activated)
More informationAnalysis of Internet Topologies
Analysis of Internet Topologies Ljiljana Trajković ljilja@cs.sfu.ca Communication Networks Laboratory http://www.ensc.sfu.ca/cnl School of Engineering Science Simon Fraser University, Vancouver, British
More informationA scalable multilevel algorithm for graph clustering and community structure detection
A scalable multilevel algorithm for graph clustering and community structure detection Hristo N. Djidjev 1 Los Alamos National Laboratory, Los Alamos, NM 87545 Abstract. One of the most useful measures
More informationTemporal Visualization and Analysis of Social Networks
Temporal Visualization and Analysis of Social Networks Peter A. Gloor*, Rob Laubacher MIT {pgloor,rjl}@mit.edu Yan Zhao, Scott B.C. Dynes *Dartmouth {yan.zhao,sdynes}@dartmouth.edu Abstract This paper
More informationOn the Bursty Evolution of Online Social Networks
On the Bursty Evolution of Online Social Networks Sabrina Gaito, Matteo Zignani, Gian Paolo Rossi, Alessandra Sala Xiaohan Zhao, Haitao Zheng, Ben Y. Zhao Computer Science, Universitá degli Studi di Milano
More informationHadoop Based Link Prediction Performance Analysis
Hadoop Based Link Prediction Performance Analysis Yuxiao Dong, Casey Robinson, Jian Xu Department of Computer Science and Engineering University of Notre Dame Notre Dame, IN 46556, USA Email: ydong1@nd.edu,
More informationNetwork/Graph Theory. What is a Network? What is network theory? Graph-based representations. Friendship Network. What makes a problem graph-like?
What is a Network? Network/Graph Theory Network = graph Informally a graph is a set of nodes joined by a set of lines or arrows. 1 1 2 3 2 3 4 5 6 4 5 6 Graph-based representations Representing a problem
More information-Duplication of Time-Varying Graphs
-Duplication of Time-Varying Graphs François Queyroi Sorbonne Universités, UPMC Univ Paris 06, UMR 7606, LIP6, F-75005, Paris CNRS, UMR 7606, LIP6, F-75005, Paris, France francois.queyroi@lip6.fr ABSTRACT.
More informationEfficient target control of complex networks based on preferential matching
Efficient target control of complex networks based on preferential matching Xizhe Zhang 1, Huaizhen Wang 1, Tianyang Lv 2,3 1 (School of Computer Science and Engineering, Northeastern University, Shenyang110819,
More informationMaximizing Restorable Throughput in MPLS Networks
Maximizing Restorable Throughput in MPLS Networks Reuven Cohen Gabi Nakibly Technion Israel Institute of Technology, Computer Science, Haifa, Israel Abstract MPLS recovery mechanisms are increasing in
More informationMANAGING AND MINING GRAPH DATA
MANAGING AND MINING GRAPH DATA MANAGING AND MINING GRAPH DATA Edited by CHARU C. AGGARWAL IBM T. J. Watson Research Center, Yorktown Heights, NY 10598, USA HAIXUN WANG Microsoft Research Asia, Beijing,
More informationGraph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks
Graph Theory and Complex Networks: An Introduction Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 08: Computer networks Version: March 3, 2011 2 / 53 Contents
More informationJellyfish: A Conceptual Model for the AS Internet Topology
Jellyfish: A Conceptual Model for the AS Internet Topology Georgos Siganos U. C. Riverside siganos@cs.ucr.edu Sudhir L. Tauro U. C. Riverside stauro@cs.ucr.edu Michalis Faloutsos U. C. Riverside michalis@cs.ucr.edu
More informationTop Top 10 Algorithms in Data Mining
ICDM 06 Panel on Top Top 10 Algorithms in Data Mining 1. The 3-step identification process 2. The 18 identified candidates 3. Algorithm presentations 4. Top 10 algorithms: summary 5. Open discussions ICDM
More informationMining the Temporal Dimension of the Information Propagation
Mining the Temporal Dimension of the Information Propagation Michele Berlingerio, Michele Coscia 2, and Fosca Giannotti 3 IMT-Lucca, Lucca, Italy 2 Dipartimento di Informatica, Pisa, Italy {name.surname}@isti.cnr.it
More informationSocial Media Mining. Graph Essentials
Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures
More informationTop 10 Algorithms in Data Mining
Top 10 Algorithms in Data Mining Xindong Wu ( 吴 信 东 ) Department of Computer Science University of Vermont, USA; 合 肥 工 业 大 学 计 算 机 与 信 息 学 院 1 Top 10 Algorithms in Data Mining by the IEEE ICDM Conference
More informationEvolution of the Internet AS-Level Ecosystem
Evolution of the Internet AS-Level Ecosystem Srinivas Shakkottai, Marina Fomenkov 2, Ryan Koga 2, Dmitri Krioukov 2, and kc claffy 2 Texas A&M University, College Station, USA, sshakkot@tamu.edu, 2 Cooperative
More informationAnalysis of Internet Topologies: A Historical View
Analysis of Internet Topologies: A Historical View Mohamadreza Najiminaini, Laxmi Subedi, and Ljiljana Trajković Communication Networks Laboratory http://www.ensc.sfu.ca/cnl Simon Fraser University Vancouver,
More informationSCAN: A Structural Clustering Algorithm for Networks
SCAN: A Structural Clustering Algorithm for Networks Xiaowei Xu University of Arkansas at Little Rock xwxu@ualr.edu Nurcan Yuruk, Zhidan Feng University of Arkansas at Little Rock {nxyuruk, zxfeng@ualr.edu}
More information