The Network Structure of Hard Combinatorial Landscapes



Similar documents
Introduced by Stuart Kauffman (ca. 1986) as a tunable family of fitness landscapes.

How To Understand The Network Of A Network

A discussion of Statistical Mechanics of Complex Networks P. Part I

Complex Networks Analysis: Clustering Methods

A Study of Local Optima in the Biobjective Travelling Salesman Problem

PUBLIC TRANSPORT SYSTEMS IN POLAND: FROM BIAŁYSTOK TO ZIELONA GÓRA BY BUS AND TRAM USING UNIVERSAL STATISTICS OF COMPLEX NETWORKS

Complex Network Visualization based on Voronoi Diagram and Smoothed-particle Hydrodynamics

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Social Media Mining. Network Measures

GEOENGINE MSc in Geomatics Engineering (Master Thesis) Anamelechi, Falasy Ebere

Graph theoretic approach to analyze amino acid network

Cluster Analysis: Advanced Concepts

Lecture 3. Linear Programming. 3B1B Optimization Michaelmas 2015 A. Zisserman. Extreme solutions. Simplex method. Interior point method

MuACOsm A New Mutation-Based Ant Colony Optimization Algorithm for Learning Finite-State Machines

Overview. Swarms in nature. Fish, birds, ants, termites, Introduction to swarm intelligence principles Particle Swarm Optimization (PSO)

Cluster detection algorithm in neural networks

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

Using Data Mining for Mobile Communication Clustering and Characterization

Load balancing in a heterogeneous computer system by self-organizing Kohonen network

A Non-Linear Schema Theorem for Genetic Algorithms

General Network Analysis: Graph-theoretic. COMP572 Fall 2009

The Topology of Large-Scale Engineering Problem-Solving Networks

Time-Dependent Complex Networks:

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS

Routing in packet-switching networks

Programming Using Python

The Dynamics of a Genetic Algorithm on a Model Hard Optimization Problem

Seminar. Path planning using Voronoi diagrams and B-Splines. Stefano Martina

Distance Degree Sequences for Network Analysis

Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging

DECENTRALIZED SCALE-FREE NETWORK CONSTRUCTION AND LOAD BALANCING IN MASSIVE MULTIUSER VIRTUAL ENVIRONMENTS

Data Preprocessing. Week 2

Outline. NP-completeness. When is a problem easy? When is a problem hard? Today. Euler Circuits


Distributed Dynamic Load Balancing for Iterative-Stencil Applications

The architecture of complex weighted networks

MAPPING OF ARBITRARY TRAFFIC DEMAND AND NETWORK TOPOLOGY ON A MESH OF RINGS NETWORK

The Joint Degree Distribution as a Definitive Metric of the Internet AS-level Topologies

Structural constraints in complex networks

Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Optimal Planning with ACO

The Application of Bayesian Optimization and Classifier Systems in Nurse Scheduling

Chapter ML:XI (continued)

Search Heuristics for Load Balancing in IP-networks

Graph Theory and Complex Networks: An Introduction. Chapter 08: Computer networks

Genetic Algorithm Based Interconnection Network Topology Optimization Analysis

Open Source Software Developer and Project Networks

Evaluation of Different Task Scheduling Policies in Multi-Core Systems with Reconfigurable Hardware

A Robust Method for Solving Transcendental Equations

Data Warehousing und Data Mining

ModelingandSimulationofthe OpenSourceSoftware Community

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

Product Geometric Crossover for the Sudoku Puzzle

Degree distribution in random Apollonian networks structures

Course: Model, Learning, and Inference: Lecture 5

Expanding the CASEsim Framework to Facilitate Load Balancing of Social Network Simulations

Lecture 10: Regression Trees

Ant Colony Optimization and Constraint Programming

Graph Processing and Social Networks

A Learning Based Method for Super-Resolution of Low Resolution Images

A Comparison of General Approaches to Multiprocessor Scheduling

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

1 o Semestre 2007/2008

Research Paper Business Analytics. Applications for the Vehicle Routing Problem. Jelmer Blok

Inet-3.0: Internet Topology Generator

Sensors & Transducers 2015 by IFSA Publishing, S. L.

An ACO Approach to Solve a Variant of TSP

Lecture 9: Introduction to Pattern Analysis

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

Temporal Dynamics of Scale-Free Networks

Healthcare Analytics. Aryya Gangopadhyay UMBC

Evaluation of a New Method for Measuring the Internet Degree Distribution: Simulation Results

Bioinformatics: Network Analysis

Design and Analysis of ACO algorithms for edge matching problems

Travis Goodwin & Sanda Harabagiu

Master's projects at ITMO University. Daniil Chivilikhin PhD ITMO University

GENERATING AN ASSORTATIVE NETWORK WITH A GIVEN DEGREE DISTRIBUTION

Cellular Automaton: The Roulette Wheel and the Landscape Effect

Transcription:

The Network Structure of Hard Combinatorial Landscapes Marco Tomassini 1, Sebastien Verel 2, Gabriela Ochoa 3 1 University of Lausanne, Lausanne, Switzerland 2 University of Nice Sophia-Antipolis, France 3 University of Nottingham, Nottingham,UK

Motivation Use the tools of network analysis for studying: How? The structure of combinatorial fitness landscapes Problem (search) difficulty in combinatorial optimization Design effective search algorithms Mapping combinatorial landscapes to networks Conduct a network analysis Relate (and exploit?) network features to search operators the more we know of the statistical properties of a class of fitness landscapes, the better equipped will we be for the design of effective search algorithms for such landscapes L. Barnett, U. Sussex, DPhil Diss. 2003 2

Outline Fitness landscapes Features of fitness landscapes relevant to heuristic search Landscapes as networks Inherent networks (energy surfaces) Adapting this idea to combinatorial landscapes How to define the nodes and the edges? Definitions and methods Results of the analysis Basins of attraction Network statistics and features 3

Fitness landscapes Describe dynamics of adaptation in Nature (Wright, 1932). Later, describe dynamics of evolutionary algorithms Search/Evolution: adaptivewalk over a Landscape 3 Components L = (S,d,f) Search space Neighborhood relation Fitness function (height) 4

Features of landscapes relevant to heuristic search Number, fitness, distribution of local optima or peaks Topology of the basins of attraction Fitness differences between neighboring points (ruggedness) Presence and structure of neutral networks (terrains with equal fitness) M. Fuji, Japan Earth pyramids, Tyrol, Italy M. Auyantepui, Venezuela 5 (Angel Falls, Highest Waterfall)

Landscapes as networks (energy surfaces) Inherent Network Vertices: energy minima Edges: two nodes are connected if the energy barrier separating them is sufficiently low (transition state) (a) Model of 2D energy surface (b) Contour plot, partition of the configuration space into basins of attraction surrounding minima (c) landscape as a network J. P. K. Doye, The network topology of a potential energy landscape: a static scale-free network, Phys. Rev. Lett., 88, 2002 6

Landscapes as networks (combinatorial landscapes) Definitions Local optimum: is a solution s*, such that for all s in V(s), f(s)<f(s*) In practice: found by running a best-improvement local search (LS) LS defines a mapping from the search space S to the set of locally optimal solutions S* Basin of attraction: b i of a local optimum i is the set of configurations s in S, s.t. LS from s, will end in i binary strings of length N. Neighbourhood V(s) defined by the 1-move or bit flip operator V(s) = N 7

Landscapes as networks (combinatorial landscapes) Local optima Network G = (S*,E) Nodes: S* (a) Edges: notion of connectivity between basins e ij between i and j, if there is at least a pair of direct neighbours s i and s j, s.t. s i in b i, s j in b j (GECCO, 2008) weights w ij is attached to the edges, account for transition probabilities between basins (ALIFE, 2008, Phys. Rev. E,2008) Two weights are needed, oriented transition graph (b) 8

Definition - edge weight 9

NK landscapes (Kauffman, 93) A string s of N spins or genes represented by binary variables s i {0, 1} A real stochastic function Φ : s R + K (0 K < N) determines how many other spins in the string influence a given spin s i The value of Φ is the average contribution of all the spins N=5, K = 2, Adjacent interaction 10

NK landscapes ctd. K from 0 to N 1, NK landscapes can be tuned from smooth to rugged (easy to difficult respectively) K = 0 no correlations, Φ is an additive function, and there is a single maximum K = N 1 landscape completely random, the expected number of local optima is 2 N /(N + 1) Intermediate values of K interpolate between these two extreme cases and have a variable degree of epistasis (i.e. gene interaction) 11

Methods Extracted and analysed networks for N=14, 16 and 18, K=2, 4,,N-2, N-1 (30 random instances for each case) We measured: Statistics on basins sizes and fitness of optima Network features: clustering coefficient, shortest path to the global optimum, weight distribution, disparity, boundary of basins 12

Analysis of basins (Global optimum basin size versus K) Trend: the basin shrinks very quickly with increasing K. for higher K, more difficult for a search algorithm to locate the basin of attraction of the global optimum Size of the basin corresponding to the global maximum for each K 13

Analysis of basins basin size Trend: small number of large basin, large number of small basin log-normal cumulative distribution slope of correlation increases with K when K large : basin sizes are nearly equals Cumulative distribution of basins sizes for N=18 and K=4 14

Analysis of basins basin size Trend: small number of large basin, large number of small basin log-normal cumulative distribution slope of correlation increases with K when K large : basin sizes are nearly equals Regression lines for N=18 and different values of K 15

Analysis of basins (fitness vs. basin size) Trend: clear positive correlation between the fitness values of maxima and their basins' sizes On average, the global optimum easier to find than one other local optimum But more difficult to find, as the number of local optima increases exponentially with increasing K Correlation fitness of local optima vs. their corresponding basins sizes 16

General network statistics (Example data for N = 16, avg. of 30 instances) K # nodes # edges clustering disparity path length 2 33 516 0.96 0.326 56 6 460 41,791 0.79 0.084 170 10 1,470 16,2139 0.53 0.050 206 15 3,868 32,1203 0.35 0.039 200 Clustering: For high K, transition between a given pair of neighboring basins is less likely to occur Disparity (i.e. dishomogeneity of nodes with a given degree): For high K the transitions to other basins tend to become equally likely, an indication of the randomness of the landscape 17

Weight distribution Weights (transition prob. between neighbouring basins) are small For high K the decay is faster Low K has longer tails On average, the transition probabilities are higher for low K distribution of the network weights wij for outgoing edges with j i in log-x scale, N = 18 18

Weight distribution remain in the same basin Weights to remains in the same are large compare to Wij with i!= j Wii are higher for low K Easier to leave the basin for high K : high exploration But : number of local optima increases fast with K Average weight Wii according to the parameter N and K 19

Weight distribution Size of basin interior Do the basins look like a montain with interior and border? solution is in the interior if all neighbors are in the same basin Interior is very small Nearly all solution are in the border Average of the mean size of basins interiors 20

Shortest path length between local optima Average distance (shortest path) between nodes Increase with N (# of nodes increase exponentially) For a given N, increase with K up to K = 10, then stagnates 21

Shortest path length to global optima Average path length to the global optimum from all the other basins More relevant for optimisation Increase steadily with increasing K 22

Summary Proposed characterization of combinatorial landscapes as networks New findings about basin s structure Related some network features to search difficulty Future? Sampling techniques (instead of exhaustive enumeration) Neutral and more realistic combinatorial landscapes 23

References M. Barthelemy, A. Barrat, R. Pastor-Satorras, and A. Vespignani, Characterization and modeling of weighted networks, Physica A 346 (2005), 34 43. J. P. K. Doye, The network topology of a potential energy landscape: a static scale-free network, Phys. Rev. Lett. 88 (2002), 238701. J. P. K. Doye and C. P. Massen, Characterizing the network topology of the energy landscapes of atomic clusters, J. Chem. Phys. 122 (2005), 084105. G. Ochoa, M. Tomassini, and S. Verel, A study of NK landscape sand basins and local optima networks, Genetic and Evolutionary Computation Conference, GECCO 2008, Proceedings, ACM, 2008 S. Verel, G. Ochoa, M. Tomassini (2008) The Connectivity of NK Landscapes' Basins: A Network Analysis, Artificial Life XI, MIT Press, Cambridge, MA, pp. 648-655. M. Tomassini, S. Verel, G. Ochoa (2008) Complex-network analysis of combinatorial spaces: the landscape case of NK landscapes. Physical Review Letter E, 78 (6), 066114, 2008. 24