LaPIe: Collective Communications adapted to Grid Environments

Transcription

1 LaPIe: Collective Communications adapted to Grid Environments Luiz Angelo Barchet-Estefanel Thesis Supervisor: M Denis TRYSTRAM Co-Supervisor: M Grégory MOUNIE ID-IMAG Laboratory Grenoble - France LaPIe: Collective Communications adapted to Grid Environments 1 / 60

2 Introduction to Parallel Processing Fact The demand for computing power will always grow up There are two options to increase the available computer power: LaPIe: Collective Communications adapted to Grid Environments 2 / 60

3 Introduction to Parallel Processing Fact The demand for computing power will always grow up There are two options to increase the available computer power: Buy a bigger computer - $$$$$ LaPIe: Collective Communications adapted to Grid Environments 2 / 60

4 Introduction to Parallel Processing Fact The demand for computing power will always grow up There are two options to increase the available computer power: Buy a bigger computer - $$$$$ Use several computers Parallel Processing Divide a problem into multiple fragments that can be executed in parallel LaPIe: Collective Communications adapted to Grid Environments 2 / 60

5 Introduction to Grids/Metacomputing Definition Aggregation of geographically distributed computers Mainly clusters of computers Fact The Grid hardware already exists Interconnexion of several clusters and NOWs The Grid software only emerges Most difficulties come from the resource heterogeneity LaPIe: Collective Communications adapted to Grid Environments 3 / 60

6 Communications in a Grid Influence of resource heterogeneity Geographically distributed systems Different communication latencies Heterogeneous communication infrastructures Transfer bandwidth LaPIe: Collective Communications adapted to Grid Environments 4 / 60

7 Example: GRID 5000 Latency Bandwidth* Myrinet 10 µs 250 MB/s Giga Ethernet 50 µs 120 MB/s WAN Connection 5000 µs MB/s average bandwidth for a 32MB message sent with MPI LaPIe: Collective Communications adapted to Grid Environments 5 / 60

8 Example: GRID 5000 Latency Bandwidth Myrinet 10 µs 250 MB/s Giga Ethernet 50 µs 120 MB/s WAN Connection 5000 µs MB/s average bandwidth for a 32MB message sent with MPI LaPIe: Collective Communications adapted to Grid Environments 6 / 60

9 Collective Communications Definition Collective communication is defined as communication that involves a group of processes Different communication patterns LaPIe: Collective Communications adapted to Grid Environments 7 / 60

10 Collective Communications Definition Collective communication is defined as communication that involves a group of processes Different communication patterns Most programming environments include collective communication primitives PVM, MPI, Athapascan, etc. Consensus, Group Membership, etc. LaPIe: Collective Communications adapted to Grid Environments 7 / 60

11 Collective Communications Impact of communication heterogeneity Absence of a single efficient strategy LaPIe: Collective Communications adapted to Grid Environments 8 / 60

12 Collective Communications Impact of communication heterogeneity Absence of a single efficient strategy Performance depends on: communication pattern network characteristics operation parameters (# of nodes, message size, etc.) LaPIe: Collective Communications adapted to Grid Environments 8 / 60

13 Overview of this work Our goal: improve communication scheduling on grid environments through the use of an hierarchical network modelling provide efficient grid-aware collective communication operations What we need: qualitative knowledge of the network topology detect network heterogeneity quantitative knowledge of the network interconnexions identify latency and bandwidth among different nodes LaPIe: Collective Communications adapted to Grid Environments 9 / 60

14 Overview of this work Our approach: use hybrid algorithms dynamic scheduling of inter-cluster communication efficient static algorithms for intra-cluster communication Technical validation: evaluation through synthetic experiences performances are close to those experienced by real applications fast prototyping LaPIe: Collective Communications adapted to Grid Environments 10 / 60

15 Outline 1 Optimising Collective Communications LaPIe: Collective Communications adapted to Grid Environments 11 / 60

16 Optimising Collective Communications Objective: minimise the overall execution time improve data distribution reduce communications through slow links LaPIe: Collective Communications adapted to Grid Environments 12 / 60

17 Optimising Collective Communications Objective: minimise the overall execution time improve data distribution reduce communications through slow links Heterogeneous Systems - Grids communication scheduling according to the network characteristics LaPIe: Collective Communications adapted to Grid Environments 12 / 60

18 Optimising Collective Communications Objective: minimise the overall execution time improve data distribution reduce communications through slow links Heterogeneous Systems - Grids communication scheduling according to the network characteristics NP-Complete no accurate analytical models are available LaPIe: Collective Communications adapted to Grid Environments 12 / 60

19 Hierarchical Structure Flat Tree approach LaPIe: Collective Communications adapted to Grid Environments 13 / 60

20 Hierarchical Structure Flat Tree approach Objective: minimise distant communications LaPIe: Collective Communications adapted to Grid Environments 13 / 60

21 Hierarchical Structure Flat Tree approach Objective: minimise distant communications Communication is divided in two layers LaPIe: Collective Communications adapted to Grid Environments 13 / 60

22 Hierarchical Structure Flat Tree approach Objective: minimise distant communications Communication is divided in two layers Distant nodes LaPIe: Collective Communications adapted to Grid Environments 13 / 60

23 Hierarchical Structure Flat Tree approach Objective: minimise distant communications Communication is divided in two layers Distant nodes Local nodes LaPIe: Collective Communications adapted to Grid Environments 13 / 60

24 Hierarchical Structure Flat Tree approach Objective: minimise distant communications Communication is divided in two layers Distant nodes Local nodes ECO (Lowekamp 96) - PVM library MagPIe (Kielmann 99) - MPI library LaPIe: Collective Communications adapted to Grid Environments 13 / 60

25 Analysis of this approach Advantages Easy to implement Minimises communication across slow links Limitations Too tight scheduling communication hierarchy does not make difference between links capacities/latencies The root process handles all long distance transmissions does not explore parallel transmissions LaPIe: Collective Communications adapted to Grid Environments 14 / 60

26 Multi-layered Hierarchy Multi-layered communications LaPIe: Collective Communications adapted to Grid Environments 15 / 60

27 Multi-layered Hierarchy Multi-layered communications Structured according to the relative performance of each layer WAN > MAN > LAN > SMP LaPIe: Collective Communications adapted to Grid Environments 15 / 60

28 Multi-layered Hierarchy Multi-layered communications Structured according to the relative performance of each layer WAN > MAN > LAN > SMP MPICH-G2 (Karonis 02) - MPI library LaPIe: Collective Communications adapted to Grid Environments 15 / 60

29 Analysis of this approach Advantages More flexible structure Based on the relative communication performance Limitation Hierarchy does not takes into account the communication cost inside each cluster LaPIe: Collective Communications adapted to Grid Environments 16 / 60

30 Analysis of this approach Advantages More flexible structure Based on the relative communication performance Limitation Hierarchy does not takes into account the communication cost inside each cluster LaPIe: Collective Communications adapted to Grid Environments 16 / 60

31 How to improve Grid communications Is it possible to better schedule communications in a grid environment? Dynamically generated hierarchy network parameters, message size and communication pattern Fully Grid-aware includes the communication cost inside each cluster LaPIe: Collective Communications adapted to Grid Environments 17 / 60

32 Our Approach Simplify the network description focus on topology discovery and clustering Augment the information about clusters performance performance models to predict the communication cost Improve the usage of multi-layered hierarchy grid-aware scheduling heuristics LaPIe: Collective Communications adapted to Grid Environments 18 / 60

33 Outline 1 Optimising Collective Communications LaPIe: Collective Communications adapted to Grid Environments 19 / 60

34 Approaches Locality of the nodes User-defined mappings Network discovery tools LaPIe: Collective Communications adapted to Grid Environments 20 / 60

35 Approaches Locality of the nodes User-defined mappings Network discovery tools Locality of the nodes Simple Does not express clusters internal heterogeneity Does not consider interconnection parameters LaPIe: Collective Communications adapted to Grid Environments 20 / 60

36 Approaches Locality of the nodes User-defined mappings Network discovery tools User defined topology Expensive and hard to do Sufficiently accurate (?) Normally falls back to the locality of the nodes LaPIe: Collective Communications adapted to Grid Environments 20 / 60

37 Approaches Locality of the nodes User-defined mappings Network discovery tools Some network tools NWS - measures latency and bandwidth between nodes REMOS - uses SNMP to construct a low-level topology TopoMon - identifies shared links LaPIe: Collective Communications adapted to Grid Environments 20 / 60

38 What we need Application-level topology discovery identification of homogeneous islands fast deployment LaPIe: Collective Communications adapted to Grid Environments 21 / 60

39 Topology Discovery First Phase: identify network heterogeneity use of NWS-like tools LaPIe: Collective Communications adapted to Grid Environments 22 / 60

40 Topology Discovery First Phase: identify network heterogeneity use of NWS-like tools construct a n n distance matrix latency LaPIe: Collective Communications adapted to Grid Environments 22 / 60

41 Details How to minimise the probing time Latency measure is short enough to not disturb the network Schedule parallel probes among independent pairs LaPIe: Collective Communications adapted to Grid Environments 23 / 60

42 Topology Discovery Second Phase: clustering use of a clustering algorithm (ECO) Tolerance factor ρ = 30% LaPIe: Collective Communications adapted to Grid Environments 24 / 60

43 Topology Discovery Second Phase: clustering use of a clustering algorithm (ECO) Tolerance factor ρ = 30% Formatted output (magpie_clusters file) LaPIe: Collective Communications adapted to Grid Environments 24 / 60

44 Topology Description cluster 0 process cluster 1 process cluster 2 process cluster 3 process 24 cluster 4 process 27 cluster 5 process LaPIe: Collective Communications adapted to Grid Environments 25 / 60

45 Topology Discovery Third Phase: obtaining network parameters Reduced set of measures one node from each cluster LaPIe: Collective Communications adapted to Grid Environments 26 / 60

46 Topology Discovery Third Phase: obtaining network parameters Reduced set of measures one node from each cluster O(C 2 ) measures LaPIe: Collective Communications adapted to Grid Environments 26 / 60

47 Topology Discovery Third Phase: obtaining network parameters Reduced set of measures one node from each cluster O(C 2 ) measures Merge of this information with network topology LaPIe: Collective Communications adapted to Grid Environments 26 / 60

48 Example: the IDPOT cluster LaPIe: Collective Communications adapted to Grid Environments 27 / 60

49 Outline Optimising Collective Communications Cost model Broadcast Validating the models 1 Optimising Collective Communications LaPIe: Collective Communications adapted to Grid Environments 28 / 60

50 Cost model Broadcast Validating the models Modelling Collective Communications We use plogp cost model (Kielmann et al.) Number of processes - P Latency - L Communication gap - g(m) Send and receive overhead - os(m), or(m) g(m) os(m) g(m) g(m) L or(m) L LaPIe: Collective Communications adapted to Grid Environments 29 / 60

51 Advantages of plogp Cost model Broadcast Validating the models gap Measured Gap between GdX and IDPOT clusters microseconds Message size (bytes) LaPIe: Collective Communications adapted to Grid Environments 30 / 60

52 Cost model Broadcast Validating the models Comparing with the Hockney model gap Measured Gap between GdX and IDPOT clusters microseconds Message size (bytes) plogp allows a theoretical modelling that is close to the reality LaPIe: Collective Communications adapted to Grid Environments 31 / 60

53 Example: modelling MPI_Bcast Cost model Broadcast Validating the models Definition One process (root) send the same message to every process in the group LaPIe: Collective Communications adapted to Grid Environments 32 / 60

54 Example: modelling MPI_Bcast Cost model Broadcast Validating the models Definition One process (root) send the same message to every process in the group Strategies Flat Tree Binary Tree Binomial Tree Chain (pipeline)... LaPIe: Collective Communications adapted to Grid Environments 32 / 60

58 Cost model Broadcast Validating the models MPI_Bcast Modelling on Homogeneous Clusters Implementation Strategy Communication Model Flat Tree (P 1) g(m) + L Flat Tree with Rendez-vous (P 1) g(m) + 2 g(1) + 3 L Segmented Flat Tree (P 1) (g(s) k) + L Binomial Tree log 2 P g(m) + log 2 P L Binomial Tree with Rendez-vous log 2 P g(m) + log 2 P (2 g(1) + 3 L) Segmented Binomial Tree log 2 P g(s) k + log 2 P L Binary Tree log 2 P (2 g(m) + L) Chain (P 1) (g(m) + L) Chain with Rendez-vous (P 1) (g(m) + 2 g(1) + 3 L) Segmented Chain (Pipeline) (P 1) (g(s) + L) + (g(s) (k 1)) LaPIe: Collective Communications adapted to Grid Environments 33 / 60

59 Cost model Broadcast Validating the models MPI_Bcast Modelling on Homogeneous Clusters Implementation Strategy Communication Model Flat Tree (P 1) g(m) + L Flat Tree with Rendez-vous (P 1) g(m) + 2 g(1) + 3 L Segmented Flat Tree (P 1) (g(s) k) + L Binomial Tree log 2 P g(m) + log 2 P L Binomial Tree with Rendez-vous log 2 P g(m) + log 2 P (2 g(1) + 3 L) Segmented Binomial Tree log 2 P g(s) k + log 2 P L Binary Tree log 2 P (2 g(m) + L) Chain (P 1) (g(m) + L) Chain with Rendez-vous (P 1) (g(m) + 2 g(1) + 3 L) Segmented Chain (Pipeline) (P 1) (g(s) + L) + (g(s) (k 1)) LaPIe: Collective Communications adapted to Grid Environments 33 / 60

60 Flat Tree Broadcast Cost model Broadcast Validating the models The simplest one - (P 1) g(m) + L normally used with a few nodes (bad performance) prediction error < 2% Completion time (s) MPI_Bcast Flat Tree Myrinet Flat Prediction e+060 1e Message size (bytes) Number of nodes LaPIe: Collective Communications adapted to Grid Environments 34 / 60

61 Binomial Tree Broadcast Cost model Broadcast Validating the models log 2 P g(m) + log 2 P L prediction error < 5% Completion time (s) e+060 1e Message size (bytes) MPI_Bcast Binomial Tree Myrinet Binomial Prediction Number of nodes LaPIe: Collective Communications adapted to Grid Environments 35 / 60

62 Segmented Chain Broadcast Cost model Broadcast Validating the models (P 1) (g(s) + L) + (g(s) (k 1)) Performance depends on the segment size Dependent on the performance of all nodes MPI_Bcast Segmented Chain (Pipeline) Myrinet Completion time (s) e+060 1e Message size (bytes) Pipeline 16ko Prediction Number of nodes LaPIe: Collective Communications adapted to Grid Environments 36 / 60

63 Choosing the best strategy Cost model Broadcast Validating the models Comparison MPI_Bcast 25 machines Myrinet Flat Flat prediction Chain Chain prediction Binomial Binomial prediction Completion time (s) e e+06 Message size (bytes) LaPIe: Collective Communications adapted to Grid Environments 37 / 60

64 Cost model Broadcast Validating the models Choosing the best strategy - small messages Comparison MPI_Bcast 25 machines Myrinet Flat Flat prediction Chain Chain prediction Binomial Binomial prediction Completion time (s) Message size (bytes) LaPIe: Collective Communications adapted to Grid Environments 38 / 60

65 Outline Optimising Collective Communications Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation 1 Optimising Collective Communications LaPIe: Collective Communications adapted to Grid Environments 39 / 60

66 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Grid-Aware Collective Communication Scheduling Communications in a Heterogeneous Environment exhaustive search genetic algorithms (Vorakosit) simulated annealing (Vadhiyar) LaPIe: Collective Communications adapted to Grid Environments 40 / 60

67 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Grid-Aware Collective Communication Scheduling Communications in a Heterogeneous Environment exhaustive search genetic algorithms (Vorakosit) simulated annealing (Vadhiyar) operation specific optimisations pipelined broadcasts (Beaumont et al.) balanced trees (Burger et al.). LaPIe: Collective Communications adapted to Grid Environments 40 / 60

68 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Grid-Aware Collective Communication Scheduling Communications in a Heterogeneous Environment exhaustive search genetic algorithms (Vorakosit) simulated annealing (Vadhiyar) operation specific optimisations pipelined broadcasts (Beaumont et al.) balanced trees (Burger et al.). optimisation heuristics FEF and ECEF (Bhat) LaPIe: Collective Communications adapted to Grid Environments 40 / 60

69 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Grid-Aware Collective Communication Why to use an hierarchical scheduling reduces the search space LaPIe: Collective Communications adapted to Grid Environments 41 / 60

70 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Grid-Aware Collective Communication Why to use an hierarchical scheduling reduces the search space each cluster may use different strategies binomial, chain, etc. LaPIe: Collective Communications adapted to Grid Environments 41 / 60

71 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Grid-Aware Collective Communication Why to use an hierarchical scheduling reduces the search space each cluster may use different strategies binomial, chain, etc. this approach may be employed also with other communication patterns LaPIe: Collective Communications adapted to Grid Environments 41 / 60

72 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Broadcast - Optimisation Heuristics Fastest Edge First -FEF (Bhat) objective: select the sender that can reach a new receiver earlier strategy: find the edge with the minimum latency min i A, j B L i,j Drawback this strategy may overload a single sender LaPIe: Collective Communications adapted to Grid Environments 42 / 60

73 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Broadcast - Optimisation Heuristics Earliest Completing Edge First - ECEF (Bhat) objective: select the fastest available sender to reach a new receiver strategy: take into account the Ready Time and the transfer time Weakness (?) min (RT i + g i,j (m) + L i,j ) i A, j B Can the receiver contribute to the broadcast? LaPIe: Collective Communications adapted to Grid Environments 43 / 60

74 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Broadcast - Optimisation Heuristics Earliest Completing Edge First with lookahead - ECEFLA (Bhat) objective: select the fastest available sender to reach a good receiver a node that can contribute with message diffusion strategy: use a lookahead function to evaluate the usefulness of a receiver min (RT i + g i,j (m) + L i,j + F j ) ; i A, j B LaPIe: Collective Communications adapted to Grid Environments 44 / 60

75 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Broadcast - Optimisation Heuristics Earliest Completing Edge First with lookahead - ECEFLA (Bhat) objective: select the fastest available sender to reach a good receiver a node that can contribute with message diffusion strategy: use a lookahead function to evaluate the usefulness of a receiver min (RT i + g i,j (m) + L i,j + F j ) ; F j = min (g j,k(m) + L j,k ) i A, j B P k B LaPIe: Collective Communications adapted to Grid Environments 44 / 60

76 Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Broadcast - Optimisation Heuristics Common characteristics of these heuristics Give priority to fast links Question: maximise the number of potential senders Can a previous knowledge on intra-cluster communications improve the efficiency of these heuristics? T k - communication time inside a cluster LaPIe: Collective Communications adapted to Grid Environments 45 / 60

77 Specific Heuristics Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation ECEFLA-t simple extension of the ECEFLA heuristic objective: select the fastest available sender to reach a good receiver a cluster contacted by this node may finish in the smallest time quickly reduces the number of clusters to contact min (RT i +g i,j (m)+l i,j +F j ) ; F j = min (g j,k(m)+l j,k +T k ) i A, j B P k B LaPIe: Collective Communications adapted to Grid Environments 46 / 60

78 Drawbacks in a Grid System Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation All these strategies always try to contact first the fastest clusters/nodes Communications to distant/slow clusters are delayed This extra delay may augment the makespan Balance communication: Give some priority to slow clusters Still keep trying to reach the largest number of nodes maximise the number of data sources LaPIe: Collective Communications adapted to Grid Environments 47 / 60

79 Specific Heuristics Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation ECEFLA-T - tries to balance the scheduling objective: select a receiver whose cost to contact the slowest cluster is still reduced sender is the fastest one that can reach the slowest cluster strategy: the lookahead function maximises the search min (RT i+g i,j (m)+l i,j +F j ) ; F j = max (g j,k(m)+l j,k +T k ) i A, j B P k B Drawback slow clusters will be contacted only after no fast cluster remains LaPIe: Collective Communications adapted to Grid Environments 48 / 60

80 Specific Heuristics Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Bottom-Up gives priority to slow clusters objective: prevent a supplementary delay for the slow clusters strategy: search for the slowest cluster still not contacted Drawback max (min (g i,j(m) + L i,j + T j )) P j B P i A does not improve the number of data sources LaPIe: Collective Communications adapted to Grid Environments 49 / 60

81 Comparing Strategies Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Simulations Use of simulations to obtain the average performance of each strategy average of runs Random values between: minimum maximum gap i,j 0.10 s 0.60 s IDPOT-icluster2 IDPOT-GdX latency i,j s s IDPOT-icluster2 GdX-Rennes T i 0.02 s 3 s 1 MB Myrinet 1 MB Fast Ethernet LaPIe: Collective Communications adapted to Grid Environments 50 / 60

82 Comparing Strategies Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation 6 5 Flat Tree FEF ECEF ECEF Lookahead ECEFla t ECEFLA T BottomUp 1MB Broadcast in a Grid Environment average of iterations Completion time (s) Number of clusters LaPIe: Collective Communications adapted to Grid Environments 51 / 60

83 A Large Scale Grid Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Flat Tree FEF ECEF ECEF Lookahead ECEFla t ECEFLA T BottomUp 1MB Broadcast in a Grid Environment Average time of iteration Completion time (s) Number of clusters LaPIe: Collective Communications adapted to Grid Environments 52 / 60

84 A Close Look Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation ECEF ECEF Lookahead ECEFla t ECEFla T 1MB Broadcast in a Grid Environment average of iterations Completion time (s) Number of clusters LaPIe: Collective Communications adapted to Grid Environments 53 / 60

85 Hit Rate Optimising Collective Communications Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation A different metric to evaluate the heuristics MB Broadcast in a Grid Environment Hit Rate iterations ECEF ECEF Lookahead ECEFla t ECEFla T Completion time (s) Number of clusters LaPIe: Collective Communications adapted to Grid Environments 54 / 60

86 Experimental validation Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation 88 machines, 6 homogeneous clusters (3 IDPOT,2 GdX, Toulouse) 12 Flat FEF ECEF ECEFLA 10 ECEFLAt ECEFLAT BottomUp Broadcast in a Grid Predicted Times Completion Time (s) e e+06 2e e+06 3e e+06 4e e+06 Message size (Bytes) LaPIe: Collective Communications adapted to Grid Environments 55 / 60

87 Experimental Validation Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Completion time (s) Flat FEF ECEF ECEFLA ECEFLAt ECEFLAT BottomUp Broadcast on a 78 machines grid Measured Times e e+06 2e e+06 3e e+06 4e e+06 Message size (Bytes) LaPIe: Collective Communications adapted to Grid Environments 56 / 60

88 Experimental Validation Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Completion time (s) 12 Flat FEF ECEF ECEFLA 10 ECEFLAt ECEFLAT BottomUp LAM Binomial Broadcast on a 78 machines grid Measured Times e e+06 2e e+06 3e e+06 4e e+06 Message size (Bytes) LaPIe: Collective Communications adapted to Grid Environments 57 / 60

89 Conclusions Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Scheduling communications on a grid environment Hierarchical communication reduces the optimisation complexity Multi-layered communication with hybrid algorithms LaPIe: Collective Communications adapted to Grid Environments 58 / 60

90 Conclusions Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Scheduling communications on a grid environment Hierarchical communication reduces the optimisation complexity Multi-layered communication with hybrid algorithms efficient well known intra-cluster strategies LaPIe: Collective Communications adapted to Grid Environments 58 / 60

91 Conclusions Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Scheduling communications on a grid environment Hierarchical communication reduces the optimisation complexity Multi-layered communication with hybrid algorithms efficient well known intra-cluster strategies dynamically scheduled inter-cluster communications LaPIe: Collective Communications adapted to Grid Environments 58 / 60

92 Conclusions Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Scheduling communications on a grid environment Hierarchical communication reduces the optimisation complexity Multi-layered communication with hybrid algorithms efficient well known intra-cluster strategies dynamically scheduled inter-cluster communications Importance of Topology Discovery Helps to better describe the real network Prevents mistakes induced by manual configuration Simplify further optimisation tasks LaPIe: Collective Communications adapted to Grid Environments 58 / 60

93 Future Works Scheduling Strategies Optimisation Heuristics Simulation Experimental Validation Extend our experiments More experiments on a grid environment Compare with other heuristics and optimisation techniques Evaluate the impact on the performance of real applications LaPIe: Collective Communications adapted to Grid Environments 59 / 60