Analysis of Internet Topologies



Similar documents
Analysis of Internet Topologies: A Historical View

Towards Modelling The Internet Topology The Interactive Growth Model

Inet-3.0: Internet Topology Generator

The Internet Is Like A Jellyfish

Internet Topology. Yihua He Michalis Faloutsos University of California, Riverside. Glossary 2.

The Joint Degree Distribution as a Definitive Metric of the Internet AS-level Topologies

Some Examples of Network Measurements

DATA ANALYSIS II. Matrix Algorithms

On Characterizing BGP Routing Table Growth Tian Bu, Lixin Gao, and Don Towsley University of Massachusetts, Amherst, MA 01003

AN INITIAL PEER CONFIGURATION ALGORITHM

Mining Network Traffic Data

Link-Rank: A Graphical Tool for Capturing BGP Routing Dynamics

Quality of Service Routing Network and Performance Evaluation*

Router Scheduling Configuration Based on the Maximization of Benefit and Carried Best Effort Traffic

BGP Routing Stability of Popular Destinations

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

BGP Prefix Hijack: An Empirical Investigation of a Theoretical Effect Masters Project

An Overview of Solutions to Avoid Persistent BGP Divergence

On Routing Asymmetry in the Internet

International Journal of Advanced Research in Computer Science and Software Engineering

Graph Mining Techniques for Social Media Analysis

Design and Experiments of small DDoS Defense System using Traffic Deflecting in Autonomous System

Inter-domain Routing

Jellyfish: A Conceptual Model for the AS Internet Topology

Performance Analysis of AQM Schemes in Wired and Wireless Networks based on TCP flow

Traceroute-Based Topology Inference without Network Coordinate Estimation

Distributed Out-bound Load Balancing in Inter-AS Routing by Random Matchings

BGP route propagation. Internet AS relationships, Routing policy on Internet paths. Example of commercial relationship. Transit vs.

G.Vijaya kumar et al, Int. J. Comp. Tech. Appl., Vol 2 (5),

Anonym: A Tool for Anonymization of the Internet Traffic

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Observingtheeffectof TCP congestion controlon networktraffic

Distributed Out-bound Load Balancing in Inter-AS Routing by Random Matchings

Graphs over Time Densification Laws, Shrinking Diameters and Possible Explanations

EFFECT OF TRANSFER FILE SIZE ON TCP-ADaLR PERFORMANCE: A SIMULATION STUDY

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Search Heuristics for Load Balancing in IP-networks

Measurement Study on the Internet reachability. 3.1 Introduction. 3. Internet Backbone

Exterior Gateway Protocols (BGP)

Algorithms and Tools for Anonymization of the Internet Traffic

Algorithms for representing network centrality, groups and density and clustered graph representation

Stability of QOS. Avinash Varadarajan, Subhransu Maji

NETZCOPE - a tool to analyze and display complex R&D collaboration networks

Performance Analysis of VoIP Codecs over Wi-Fi and WiMAX Networks

ENSC 427: COMMUNICATION NETWORKS Simulation and Analysis of Wi-Fi Performance in campus network

Network Level Multihoming and BGP Challenges

Enhanced Multiple Routing Configurations For Fast IP Network Recovery From Multiple Failures

Distance Degree Sequences for Network Analysis

Inter-domain Routing. Outline. Border Gateway Protocol

A Network Recovery Scheme for Node or Link Failures using Multiple Routing Configurations

Intelligent Worms: Searching for Preys

Towards a Next- Generation Inter-domain Routing Protocol. L. Subramanian, M. Caesar, C.T. Ee, M. Handley, Z. Mao, S. Shenker, and I.

EFFICIENT DETECTION IN DDOS ATTACK FOR TOPOLOGY GRAPH DEPENDENT PERFORMANCE IN PPM LARGE SCALE IPTRACEBACK

Tracers Placement for IP Traceback against DDoS Attacks

Characterization and Design of Effective BGP AS-PATH Prepending

Decentralized Utility-based Sensor Network Design

Transcription:

Analysis of Internet Topologies Ljiljana Trajković ljilja@cs.sfu.ca Communication Networks Laboratory http://www.ensc.sfu.ca/cnl School of Engineering Science Simon Fraser University, Vancouver, British Columbia Canada

Roadmap Internet topology and the datasets Spectrum of a graph and power-laws Power-laws and the Internet topology Spectral analysis of the Internet graph Conclusions and references 2010-2011 CAS Society: DLP talk 2

Roadmap Internet topology and the datasets Spectrum of a graph and power-laws Power-laws and the Internet topology Spectral analysis of the Internet graph Conclusions and references 2010-2011 CAS Society: DLP talk 3

lhr (535,102 nodes and 601,678 links) 2010-2011 CAS Society: DLP talk 4

Internet graph Internet is a network of Autonomous Systems: groups of networks sharing the same routing policy identified with Autonomous System Numbers (ASN) Autonomous System Numbers: http://www.iana.org/assignments/as-numbers Internet topology on AS-level: the arrangement of ASes and their interconnections Analyzing the Internet topology and finding properties of associated graphs rely on mining data and capturing information about Autonomous Systems (ASes). 2010-2011 CAS Society: DLP talk 5

Internet routing protocol Border Gateway Protocol (BGP): inter-as protocol used to exchange network reachability information among BGP systems reachability information is stored in routing tables 2010-2011 CAS Society: DLP talk 6

Internet AS-level data Source of data are routing tables: Route Views: http://www.routeviews.org most participating ASes reside in North America RIPE (Réseaux IP européens): http://www.ripe.net/ris most participating ASes reside in Europe The BGP routing tables are collected from multiple geographically distributed BGP Cisco routers and Zebra servers. Analyzed datasets were collected at 00:00 am on July 31, 2003 and 00:00 am on July 31, 2008. 2010-2011 CAS Society: DLP talk 7

Internet AS-level data Data used in prior research (partial list): Faloutsos, 1999 Chang, 2001 Vukadinovic, 2001 Gkantsidis, 2003 Route Views Yes Yes Yes Yes RIPE No Yes No Yes These research results have been used in developing Internet simulation tools: power-laws are employed to model and generate Internet topologies: BA model, BRITE, Inet2 2010-2011 CAS Society: DLP talk 8

Data sets Concerns about the use of the two datasets: different observations about AS degrees: power-law distribution: Route Views [Faloutsos, 1999] Weibull distribution: Route Views + RIPE [Chang, 2001] data completeness: RIPE dataset contains ~ 40% more AS connections and 2% more ASs than Route Views [Chang, 2001] 2010-2011 CAS Society: DLP talk 9

Route Views and RIPE: statistics Route Views and RIPE samples collected on May 30, 2003 Number of AS paths Probed ASs AS pairs Route Views 6,398,912 15,418 34,878 RIPE 6,375,028 15,433 35,225 AS pair: a pair of connected ASs 15,369 probed ASs (99.7%) in both datasets are identical 29,477 AS pairs in Route Views (85%) and in RIPE (84%) are identical 2010-2011 CAS Society: DLP talk 10

Degree distributions: 2003 data Consider all ASs with assigned AS numbers AS degree distribution in Route Views and RIPE datasets: 2010-2011 CAS Society: DLP talk 11

Route Views RIPE Core ASs ASs with largest degrees AS Degree AS Degree 1 701 2595 701 2448 2 1239 2569 1239 1784 3 7018 1999 7018 1638 4 3561 1036 209 861 5 1 999 3561 705 6 209 863 3356 673 7 3356 662 3549 612 8 3549 617 702 580 9 702 562 2914 561 10 2914 556 1 489 11 6461 498 4589 482 12 4513 468 6461 476 13 4323 315 8220 450 14 16631 294 3303 429 15 6347 291 13237 412 16 8220 289 6730 313 17 3257 277 4323 305 18 4766 263 3257 305 19 3786 263 16631 296 20 7132 258 6347 281 2010-2011 CAS Society: DLP talk 12

Route Views RIPE AS Degree AS Degree 1 701 2595 701 2448 Core ASs 2 3 4 1239 7018 3561 2569 1999 1036 1239 7018 209 1784 1638 861 ASs with largest degrees 16 of the core ASs in Route Views and RIPE are identical 5 6 7 8 9 1 209 3356 3549 702 999 863 662 617 562 3561 3356 3549 702 2914 705 673 612 580 561 Core ASs in Route Views have larger degrees than core ASs in RIPE 10 11 12 2914 6461 4513 556 498 468 1 4589 6461 489 482 476 13 4323 315 8220 450 14 16631 294 3303 429 15 6347 291 13237 412 16 8220 289 6730 313 17 3257 277 4323 305 18 4766 263 3257 305 19 3786 263 16631 296 20 7132 258 6347 281 2010-2011 CAS Society: DLP talk 13

Internet topology Datasets are collected from Border Gateway Protocols (BGP) routing tables. The Internet topology is characterized by the presence of various power-laws observed when considering: node degree vs. node rank node degree frequency vs. degree number of nodes within a number of hops vs. number of hops eigenvalues of the adjacency matrix and the normalized Laplacian matrix vs. the order of the eigenvalues Faloutsos et al., 1999 and Siganos et al., 2003 2010-2011 CAS Society: DLP talk 14

Roadmap Internet topology and the datasets Power-laws and spectrum of a graph Power-laws and the Internet topology Spectral analysis of the Internet graph Conclusions and references 2010-2011 CAS Society: DLP talk 15

Spectrum of a graph Normalized Laplacian matrix NL(G): NL( i, 1 j) = 0 1 d d i j 0 d i and d j are degrees of node i and j, respectively The spectrum of NL(G) is the collection of all eigenvalues and contains 0 for every connected graph component. if if i = i and j and d otherwise i j are adjacent Chung et al., 1997 2010-2011 CAS Society: DLP talk 16

Power laws: node degree vs. rank The graph nodes v are sorted in decreasing order based on their degrees d v and are indexed with a sequence of numbers indicating their ranks r v. The (r v, d v ) pairs are plotted on the log-log scale. The power-law implies: d R v r v where v is the node number and R is the node degree powerlaw exponent., 2010-2011 CAS Society: DLP talk 17

Power laws: CCDF of a node degree The frequency of a node degree is equal to the number of nodes having the same degree. The complementary cumulative distribution function (CCDF) D d of a node degree d is equal to the number of nodes having degree less than or equal to d, divided by the number of nodes. The power-law implies: D, D where D is the CCDF power-law exponent. d d 2010-2011 CAS Society: DLP talk 18

Power laws: eigenvalues The eigenvalues λ i of the adjacency matrix and the normalized Laplacian matrix are sorted in decreasing order and plotted versus the associated increasing sequence of numbers i representing the order of the eigenvalue. The power-law for the adjacency matrix implies: λ ai i ε, The power-law for the normalized Laplacian matrix implies: λ Li where ε and L are the eigenvalue power-law exponents. i L, 2010-2011 CAS Society: DLP talk 19

Analysis of datasets Calculated and plotted on a log-log scale are: node degree vs. node rank frequency of node degree vs. node degree eigenvalues vs. index Linear regression is used to determine the correlation coefficient between the regression line and the plotted data. A high correlation coefficient between the regression line and the plotted data indicates the existence of a power-law, which implies that node degree, frequency of node degree, and eigenvalues follow a power-law dependency on the rank, node degree, and index, respectively. 2010-2011 CAS Society: DLP talk 20

Analysis of datasets The power-law exponents are calculated from the linear regression lines 10 a x b, with segment a and slope b when plotted on a log-log scale. Source of data are routing tables: Route Views: http://www.routeviews.org RIPE (Réseaux IP européens): http://www.ripe.net/ris Faloutsos et al., 1999 and Chen et al., 2002 2010-2011 CAS Society: DLP talk 21

Roadmap Internet topology and the datasets Spectrum of a graph and power-laws Power-laws and the Internet topology Spectral analysis of the Internet graph Conclusions and references 2010-2011 CAS Society: DLP talk 22

Route Views 2003 dataset The node degree power-law exponent R = -0.7325 The correlation coefficient = -0.9661 2010-2011 CAS Society: DLP talk 23

Route Views 2008 dataset The node degree power-law exponent R = -0.7712 The correlation coefficient = -0.9721 2010-2011 CAS Society: DLP talk 24

Confidence intervals: node degree 2010-2011 CAS Society: DLP talk 25

Confidence intervals Six samples were randomly selected from 2003 and 2008 Route Views and RIPE datasets Each dataset is smaller than 30, with unknown standard deviation T-distribution was used to predict the confidence interval at 95% confidence level X t / 2( s / n ) < μ < X tx / 2( s / n ) x + t x / X : the sample mean 2 : the t-distribution s : sample standard deviation n : number of samples μ : population mean 2010-2011 CAS Society: DLP talk 26

Route Views 2003 dataset CCDF power-law exponent D = -1.2519 The correlation coefficient = -0.9810 2010-2011 CAS Society: DLP talk 27

Route Views 2008 dataset CCDF power-law exponents D = -1.3696 The correlation coefficients = -0.9626 2010-2011 CAS Society: DLP talk 28

Confidence intervals: CCDF 2010-2011 CAS Society: DLP talk 29

Route Views 2003 dataset The eigenvalue power-law exponents ε = -0.5173 The correlation coefficient = -0.9990 2010-2011 CAS Society: DLP talk 30

Route Views 2008 dataset The eigenvalue power-law exponent ε = -0.4860 The correlation coefficient = -0.9882 2010-2011 CAS Society: DLP talk 31

Confidence intervals: adjacency matrix 2010-2011 CAS Society: DLP talk 32

Route Views 2003 dataset The eigenvalue power-law exponent L = -0.0198 The correlation coefficient = -0.9564 2010-2011 CAS Society: DLP talk 33

Route Views 2008 dataset The eigenvalue power-law exponent L = -0.0177 The correlation coefficient = -0.9782 2010-2011 CAS Society: DLP talk 34

Confidence intervals: normalized Laplacian matrix 2010-2011 CAS Society: DLP talk 35

Roadmap Internet topology and the datasets Spectrum of a graph and power-laws Power-laws and the Internet topology Spectral analysis of the Internet graph Conclusions and references 2010-2011 CAS Society: DLP talk 36

Spectral analysis of Internet graphs We calculate the second smallest and the largest eigenvalues and associated eigenvectors of normalized Laplacian matrix. Each element of an eigenvector is associated with the AS having the same index. ASes are sorted in the ascending order based on the eigenvector values and the sorted AS vector is then indexed. The connectivity status is equal to one if the AS is connected to another AS or zero if the AS is isolated or is absent from the routing table. 2010-2011 CAS Society: DLP talk 37

Spectral analysis of Internet graphs The second smallest eigenvalue, called "algebraic connectivity" of a normalized Laplacian matrix, is related to the connectivity characteristic of the graph. Elements of the eigenvector corresponding to the largest eigenvalue of the normalized Laplacian matrix tend to be positioned close to each other if they correspond to AS nodes with similar connectivity patterns constituting clusters. Gkantsidis et al., 2003 2010-2011 CAS Society: DLP talk 38

Characteristic valuation: example The second smallest eigenvector: 0.1, 0.3, -0.2, 0 AS1(0.1), AS2(0.3), AS3(-0.2), AS4(0) Sort ASs by element value: AS3, AS4, AS1, AS2 AS3 and AS1 are connected connectivity status 1 0 AS3 AS4 AS1 AS2 index of elements 2010-2011 CAS Society: DLP talk 39

Spectral analysis: observations The second smallest eigenvector: separates connected ASs from disconnected ASs Route Views and RIPE datasets are similar on a coarser scale The largest eigenvector: reveals highly connected clusters Route Views and RIPE datasets differ on a finer scale 2010-2011 CAS Society: DLP talk 40

Route Views 2003 dataset Spectral views of the AS connectivity based on the second smallest eigenvalue. 2010-2011 CAS Society: DLP talk 41

Route Views 2008 dataset Spectral views of the AS connectivity based on the second smallest eigenvalue. 2010-2011 CAS Society: DLP talk 42

Route Views 2003 dataset Spectral views of the AS connectivity based on the largest eigenvalue. 2010-2011 CAS Society: DLP talk 43

Route Views 2008 dataset Spectral views of the AS connectivity based on the largest eigenvalue. 2010-2011 CAS Society: DLP talk 44

Roadmap Internet topology and the datasets Spectrum of a graph and power-laws Power-laws and the Internet topology Spectral analysis of the Internet graph Conclusions and references 2010-2011 CAS Society: DLP talk 45

Conclusions We have evaluated collected data from the Route Views and RIPE projects and have confirmed the presence of power-laws in graphs capturing the AS-level Internet topology. Spectral analysis techniques revealed distinct clustering characteristics of Route Views and RIPE datasets The analysis also captured historical trends in the development of the Internet topology over the past five years. In spite of the Internet growth, increasing number of users, and the deployment of new network elements, powerlaw exponents have not changed substantially. 2010-2011 CAS Society: DLP talk 46

Conclusions These power-law exponents do not capture every property of a graph and are only one measure used to characterize the Internet. However, spectral analysis based on the normalized Laplacian matrix indicated visible changes in the clustering of AS nodes and the AS connectivity. 2010-2011 CAS Society: DLP talk 47

References M. Najiminaini, L. Subedi, and Lj. Trajkovic, "Analysis of Internet topologies: a historical view," presented at IEEE International Symposium Circuits and Systems, Taipei, Taiwan, May 2009. J. Chen and Lj. Trajkovic, "Analysis of Internet topology data," Proc. IEEE International Symposium on Circuits and Systems, Vancouver, BC, Canada, May 2004, vol. IV, pp. 629 632. M. Faloutsos, P. Faloutsos, and C. Faloutsos, On power-law relationships of the Internet topology, Proc. ACM SIGCOMM, Computer Communication Review, vol. 29, no. 4, pp. 251-262, Sept. 1999. G. Siganos, M. Faloutsos, P. Faloutsos, and C. Faloutsos, "Power-laws and the AS-level Internet topology," IEEE/ACM Trans. Networking, vol. 11, no. 4, pp. 514 524, Aug. 2003. A. Medina, I. Matta, and J. Byers, "On the origin of power laws in Internet topologies," Proc. ACM SIGCOMM 2000, Computer Communication Review, vol. 30, no. 2, pp. 18 28, Apr. 2000. 2010-2011 CAS Society: DLP talk 48

References L. Gao, "On inferring autonomous system relationships in the Internet," IEEE/ACM Trans. Networking, vol. 9, no. 6, pp. 733 745, Dec. 2001. D. Vukadinovic, P. Huang, and T. Erlebach, On the Spectrum and Structure of Internet Topology Graphs, in H. Unger et al., editors, Innovative Internet Computing Systems, LNCS2346, pp. 83 96. Springer, Berlin, Germany, 2002. Q. Chen, H. Chang, R. Govindan, S. Jamin, S. Shenker, and W. Willinger, "The origin of power laws in Internet topologies revisited," Proc. INFOCOM, New York, NY, USA, Apr. 2002, pp. 608 617. H. Chang, R. Govindan, S. Jamin, S. Shenker, and W. Willinger, "Towards capturing representative AS-level Internet topologies," Proc. of ACM SIGMETRICS 2002, New York, NY, June 2002, pp. 280 281. H. Tangmunarunkit, R. Govindan, S. Jamin, S. Shenker, and W. Willinger, "Network topology generators: degree-based vs. structural," Proc. ACM SIGCOMM, Computer Communication Review, vol. 32, no. 4, pp. 147 159, Oct. 2002. 2010-2011 CAS Society: DLP talk 49

References C. Gkantsidis, M. Mihail, and E. Zegura, "Spectral analysis of Internet topologies, Proc. of Infocom 2003, San Francisco, CA, Mar. 2003, vol. 1, pp. 364 374. S. Jaiswal, A. Rosenberg, and D. Towsley, "Comparing the structure of power-law graphs and the Internet AS graph," Proc. 12th IEEE International Conference on Network Protocols, Washington DC, Aug. 2004, pp. 294 303. F. R. K. Chung, Spectral Graph Theory. Providence, Rhode Island: Conference Board of the Mathematical Sciences, 1997, pp. 2 6. M. Fiedler, "Algebraic connectivity of graphs," Czech. Math. J., vol. 23, no. 2, pp. 298 305, 1973. 2010-2011 CAS Society: DLP talk 50

lhr (535,102 nodes and 601,678 links) 2010-2011 CAS Society: DLP talk 51

lhr (535,102 nodes and 601,678 links) 2010-2011 CAS Society: DLP talk 52

lhr (535,102 nodes and 601,678 links) 2010-2011 CAS Society: DLP talk 53

Round-trip time measurements (63,631 nodes and 63,630 links) 2010-2011 CAS Society: DLP talk 54

Round-trip time measurements (63,631 nodes and 63,630 links) 2010-2011 CAS Society: DLP talk 55

Resources CAIDA: The Cooperative Association for Internet Data Analysis http://www.caida.org/home/ Walrus - Gallery: Visualization & Navigation http://www.caida.org/tools/visualization/walrus/gallery1/ Walrus - Gallery: Abstract Art http://www.caida.org/tools/visualization/walrus/gallery2/ 2010-2011 CAS Society: DLP talk 56