Branch-and-Bound with Peer-to-Peer for Large-Scale Grids
|
|
|
- Lucas Russell
- 10 years ago
- Views:
Transcription
1 Branch-and-Bound with Peer-to-Peer for Large-Scale Grids Alexandre di Costanzo INRIA - CNRS - I3S - Université de Sophia Antipolis Ph.D. defense Friday 12th October 2007 Advisor: Prof. Denis Caromel
2 The Big Picture 2
3 The Big Picture Objective 2
4 The Big Picture Objective Solving combinatorial optimization problems with Grids 2
5 The Big Picture Objective Solving combinatorial optimization problems with Grids Approach 2
6 The Big Picture Objective Solving combinatorial optimization problems with Grids Approach Parallel Branch-and-Bound and Peer-to-Peer 2
7 The Big Picture Objective Solving combinatorial optimization problems with Grids Approach Parallel Branch-and-Bound and Peer-to-Peer Contributions 2
8 The Big Picture Objective Solving combinatorial optimization problems with Grids Approach Parallel Branch-and-Bound and Peer-to-Peer Contributions 1. Branch-and-Bound framework for Grids 2
9 The Big Picture Objective Solving combinatorial optimization problems with Grids Approach Parallel Branch-and-Bound and Peer-to-Peer Contributions 1. Branch-and-Bound framework for Grids 2. Peer-to-Peer Infrastructure for Grids 2
10 The Big Picture Objective Solving combinatorial optimization problems with Grids Approach Parallel Branch-and-Bound and Peer-to-Peer Contributions 1. Branch-and-Bound framework for Grids 2. Peer-to-Peer Infrastructure for Grids 3. Large-scale experiments 2
11 Agenda Context, Problem, and Related Work Contributions Branch-and-Bound Framework for Grids Desktop Grid with Peer-to-Peer Mixing Desktops & s Perspectives & Conclusion 3
12 Context 4
13 Context Combinatorial Optimization Problems (COPs) costly to solve (finding the best solution) 4
14 Context Combinatorial Optimization Problems (COPs) costly to solve (finding the best solution) Branch-and-Bound (BnB) well adapted for solving COPs relatively easy to provide parallel version 4
15 Context Combinatorial Optimization Problems (COPs) costly to solve (finding the best solution) Branch-and-Bound (BnB) well adapted for solving COPs relatively easy to provide parallel version Grid Computing large pool of resources large-scale environment 4
16 Branch-and-Bound 5
17 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution 5
18 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 5
19 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: 5
20 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: Branching: split in sub-problems 5
21 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: Branching: split in sub-problems Bounding: compute lower/upper bounds (objective function) 5
22 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: Branching: split in sub-problems Bounding: compute lower/upper bounds (objective function) Pruning: eliminate bad branches 5
23 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: Branching: split in sub-problems Bounding: compute lower/upper bounds (objective function) Pruning: eliminate bad branches Well adapted for solving COPs [Papadimitriou 98] 5
24 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: Branching: split in sub-problems Bounding: compute lower/upper bounds (objective function) Pruning: eliminate bad branches Well adapted for solving COPs [Papadimitriou 98] return the best combinaison out of all 5
25 Branch-and-Bound Consists of a partial enumeration of all feasible solutions and returns the guaranteed optimal solution Feasible solutions are organized as a tree: search-tree 3 operations: Branching: split in sub-problems Bounding: compute lower/upper bounds (objective function) Pruning: eliminate Branching: split in bad sub-problems branches Bounding: compute local lower/upper bounds Well adapted for solving COPs [Papadimitriou 98] Pruning: local lower bound higher than the global one Not generated and explored parts of the tree return the best combinaison out of all 5
26 Parallel Branch-and-Bound 6
27 Parallel Branch-and-Bound COPs are difficult to solve: enumeration size & NP-hard class 6
28 Parallel Branch-and-Bound COPs are difficult to solve: enumeration size & NP-hard class Many studies on parallel approach [Gendron 94, Crainic 06] node-based: parallel bounding on sub-problems tree-based: building the tree in parallel multi-search: several trees are generated in parallel 6
29 Parallel Branch-and-Bound COPs are difficult to solve: enumeration size & NP-hard class Many studies on parallel approach [Gendron 94, Crainic 06] node-based: parallel bounding on sub-problems tree-based: building the tree in parallel multi-search: several trees are generated in parallel Tree-based is the most studied [Authié 95, Crainic 06] the solution tree is not known beforehand no part of the tree may be estimated at compilation tasks are dynamically generated task allocations to processors must be done dynamically distributing issues, such as load-balancing & information sharing 6
30 Parallel Branch-and-Bound COPs are difficult to solve: enumeration size & NP-hard class Many studies on parallel approach [Gendron 94, Crainic 06] node-based: parallel bounding on sub-problems tree-based: building the tree in parallel multi-search: several trees are generated in parallel Tree-based is the most studied [Authié 95, Crainic 06] the solution tree is not known beforehand no part of the tree may be estimated at compilation tasks are dynamically generated task allocations to processors must be done dynamically distributing issues, such as load-balancing & information sharing Sharing a global bound for optimizing the prune operation 6
31 Parallel Branch-and-Bound COPs are difficult to solve: enumeration size & NP-hard class Many studies on parallel approach [Gendron 94, Crainic 06] node-based: parallel bounding on sub-problems tree-based: building the tree in parallel multi-search: several trees are generated in parallel Tree-based is the most studied [Authié 95, Crainic 06] the solution tree is not known beforehand no part of the tree may be estimated at compilation tasks are dynamically generated task allocations to processors must be done dynamically distributing issues, such as load-balancing & information sharing Sharing a global bound for optimizing the prune operation 6
32 Parallel Branch-and-Bound COPs are difficult to solve: enumeration size & NP-hard class Many studies on parallel approach [Gendron 94, Crainic 06] node-based: parallel bounding on sub-problems Proposition: Parallel BnB + Grid tree-based: building the tree in parallel multi-search: several trees are generated in parallel Tree-based is the most known by users & Related difficulties are known Tree-based is the most studied [Authié 95, Crainic 06] the solution tree is not known beforehand no part of the tree may be estimated at compilation tasks are dynamically generated task allocations to processors must be done dynamically distributing issues, such as load-balancing & information sharing Sharing a global bound for optimizing the prune operation 6
33 BnB Related Work Frameworks Algorithms Parallelization Machines PUBB Low-level, Basic B&B SPMD /PVM BOB++ Low-level, Basic B&B SPMD /MPI PPBB-Lib Basic B&B SPMD /PVM PICO Basic B&B, Mixed-integer LP hier. master-worker /MPI MallBa Low-level, Basic B&B SPMD /MPI ZRAM Low-level, Basic B&B SPMD /PVM Low-level ALPS/BiCePS Basic B&B hier. master-worker /MPI Mixed-integer LP Branch&Price&Cut Metacomputing MW Basic B&B master-worker Grids/Condor Symphony Mixed-integer LP, Branch&Price&Cut master-worker /PVM 7
34 BnB Related Work Frameworks Algorithms Parallelization Machines PUBB Low-level, Basic B&B SPMD /PVM BOB++ Low-level, Basic B&B SPMD /MPI PPBB-Lib Basic B&B SPMD /PVM PICO Basic B&B, Mixed-integer LP hier. master-worker /MPI MallBa Low-level, Basic B&B SPMD /MPI ZRAM Low-level, Basic B&B SPMD /PVM Low-level ALPS/BiCePS Basic B&B hier. master-worker /MPI Mixed-integer LP Branch&Price&Cut Metacomputing MW Basic B&B master-worker Grids/Condor Symphony Mixed-integer LP, Branch&Price&Cut master-worker /PVM Most Previous work are based on SPMD and target clusters better for sharing bounds 7
35 Grid Computing 8
36 Grid Computing Distributed shared computing infrastructure multi-institutional virtual organization 8
37 Grid Computing Distributed shared computing infrastructure multi-institutional virtual organization Provide large pool of resources 8
38 Grid Computing Distributed shared computing infrastructure multi-institutional virtual organization Provide large pool of resources New challenges geographically distributed deployment scalability communication fault-tolerance multiple administrative domains, heterogeneity, high performance, programming model, etc. 8
39 Grid Computing Distributed shared computing infrastructure multi-institutional virtual organization Provide large pool of resources New challenges geographically distributed deployment scalability communication fault-tolerance multiple administrative domains, heterogeneity, high performance, programming model, etc. Parallel BnB related work: SPMD Grids are not adapted for SPMD (heterogeneity, latency, etc.) 8
40 Grid Computing Grids involve new concepts for development & execution of applications 9
41 Grid Computing Grids involve new concepts for development & execution of applications Grid Fabric Schedulers, Networking, OS Federated Hardware Resources 9
42 Grid Computing Grids involve new concepts for development & execution of applications Grid Middleware Infrastructure Grid Fabric Super-Schedulers, Resource Trading, Information, Security, etc. Schedulers, Networking, OS Federated Hardware Resources 9
43 Grid Computing Grids involve new concepts for development & execution of applications Grid Programming Grid Middleware Infrastructure Grid Fabric Models, Tools, High-Level Access to Middleware Super-Schedulers, Resource Trading, Information, Security, etc. Schedulers, Networking, OS Federated Hardware Resources 9
44 Grid Computing Grids involve new concepts for development & execution of applications Grid Applications & Portals Grid Programming Grid Middleware Infrastructure Grid Fabric Models, Tools, High-Level Access to Middleware Super-Schedulers, Resource Trading, Information, Security, etc. Schedulers, Networking, OS Federated Hardware Resources 9
45 Grid Applications & Portals Grid Computing Grids involve new concepts for development & execution of applications Grid Programming Grid Middleware Infrastructure Grid Fabric Models, Tools, Branch-and-Bound API High-Level Access to Middleware Super-Schedulers, Resource Trading, Information, Security, etc. Schedulers, Networking, OS Federated Hardware Resources 9
46 Challenges Combinatorial Optimization Problems costly to solve 10
47 Challenges Combinatorial Optimization Problems costly to solve Branch-and-Bound adapted for solving COPs easy to parallelize 10
48 Challenges Combinatorial Optimization Problems costly to solve Branch-and-Bound adapted for solving COPs easy to parallelize Grid Computing huge number of resources 10
49 Challenges Combinatorial Optimization Problems costly to solve Branch-and-Bound adapted for solving COPs easy to parallelize + Grid Computing huge number of resources Use Branch-and-Bound on Grids for solving COPs 10
50 Challenges Combinatorial Optimization Problems costly to solve Branch-and-Bound adapted for solving COPs easy to parallelize + Grid Computing huge number of resources Use Branch-and-Bound on Grids for solving COPs Efficient communications with Grids is difficult Problem with sharing bounds 10
51 Agenda Context, Problem, and Related Work Contributions Branch-and-Bound Framework for Grids Desktop Grid with Peer-to-Peer Mixing Desktops & s Perspectives & Conclusion 11
52 BnB for Grids Related Work 12
53 BnB for Grids Related Work Aida et al. focus on the design: hierarchical master-worker scales on Grids 12
54 BnB for Grids Related Work Aida et al. focus on the design: hierarchical master-worker scales on Grids Foster et al. fully decentralized approach: communication overhead 12
55 BnB for Grids Related Work Aida et al. focus on the design: hierarchical master-worker scales on Grids Foster et al. fully decentralized approach: communication overhead ParadisEO focus on meta-heuristics (not exact solution): master-worker 12
56 BnB for Grids Related Work Aida et al. focus on the design: hierarchical master-worker scales on Grids Foster et al. fully decentralized approach: communication overhead ParadisEO focus on meta-heuristics (not exact solution): master-worker Skeletons with farm or divide-and-conquer Satin for divide-and-conquer 12
57 BnB on Grids: Problems and Solutions 13
58 BnB on Grids: Problems and Solutions 13
59 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
60 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
61 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
62 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
63 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
64 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
65 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
66 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
67 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
68 BnB on Grids: Problems and Solutions Latency Scalability Solution tree size Share the best bounds Faults Asynchronous communications Hierarchical masterworker Dynamically generated by splitting tasks Efficient parallelism & communication Fault-tolerance 13
69 BnB on Grids: Problems and Solutions Asynchronous Latency communications Scalability Solution tree size Share the best bounds Hierarchical masterworker Objective: hide Grid difficulties to users Especially communication problems Dynamically generated by splitting tasks Efficient parallelism & communication Faults Fault-tolerance 13
70 BnB Framework - Entities and Roles 14
71 BnB Framework - Entities and Roles Root Task: implemented by users objective-function and splitting/branching operation 14
72 BnB Framework - Entities and Roles Root Task: implemented by users objective-function and splitting/branching operation Master: Entry Point splits the problem in tasks collects partial-results the best solution 14
73 BnB Framework - Entities and Roles Root Task: implemented by users objective-function and splitting/branching operation Master: Entry Point splits the problem in tasks collects partial-results the best solution Sub-Master: intermediary between master and workers 14
74 BnB Framework - Entities and Roles Root Task: implemented by users objective-function and splitting/branching operation Master: Entry Point splits the problem in tasks collects partial-results the best solution Sub-Master: intermediary between master and workers : computes tasks 14
75 BnB Framework - Architecture Master Submaster Submaster 15
76 BnB Framework - Architecture Problem Master Submaster Submaster 15
77 BnB Framework - Architecture Problem Master First splitting Submaster Submaster 15
78 BnB Framework - Architecture Problem Master Pending tasks First splitting Submaster Submaster 15
79 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster 15
80 BnB Framework - Architecture Problem Master Pending tasks Ask task Submaster Submaster 15
81 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster 15
82 BnB Framework - Architecture Problem Master Pending tasks Ask task Submaster Submaster 15
83 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster 15
84 BnB Framework - Architecture Problem Master Pending tasks Got task Submaster Submaster 15
85 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster 15
86 BnB Framework - Architecture Problem Master Pending tasks Got task Submaster Submaster 15
87 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster 15
88 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster Computing 15
89 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster Sending result Computing 15
90 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster Computing 15
91 BnB Framework - Architecture Problem Master Pending tasks Submaster Submaster Computing 15
92 BnB Framework - Architecture Problem Master Pending tasks Partial results Submaster Submaster Computing 15
93 BnB Framework - Solutions 16
94 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) 16
95 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) 16
96 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound 16
97 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound - previous work shows that not scale [Aida 2003] 16
98 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound - previous work shows that not scale [Aida 2003] Solution 2: Message framework (Enterprise Service Bus) 16
99 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound - previous work shows that not scale [Aida 2003] Solution 2: Message framework (Enterprise Service Bus) - Grid middleware dependent / Good for SOA 16
100 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound - previous work shows that not scale [Aida 2003] Solution 2: Message framework (Enterprise Service Bus) - Grid middleware dependent / Good for SOA Solution 3: Broadcasting 16
101 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound - previous work shows that not scale [Aida 2003] Solution 2: Message framework (Enterprise Service Bus) - Grid middleware dependent / Good for SOA Solution 3: Broadcasting - 1 to n communication cannot scale 16
102 BnB Framework - Solutions Context ProActive Java Grid middleware: latency asynchronous communication underlaying Grid infrastructure deployment framework (abstraction) Implement the tree-based parallel Master-worker architecture Problem: workers need to share bounds difficult to adapt SPMD for Grids (heterogeneity, distribution, etc.) Solution 1: Master keeps the bound - previous work shows that not scale [Aida 2003] Solution 2: Message framework (Enterprise Service Bus) - Grid middleware dependent / Good for SOA Solution 3: Broadcasting - 1 to n communication cannot scale hierarchical broadcasting scale [Baduel 05] 16
103 Organizing Communications for Broadcasting 17
104 Organizing Communications for Broadcasting Idea: Grids are composed of clusters organizing s in groups 17
105 Organizing Communications for Broadcasting Idea: Grids are composed of clusters organizing s in groups clusters are high-performance communication environments 17
106 Organizing Communications for Broadcasting Idea: Grids are composed of clusters organizing s in groups clusters are high-performance communication environments 17
107 Organizing Communications for Broadcasting Idea: Grids are composed of clusters organizing s in groups clusters are high-performance communication environments Solution: add a new entity for organizing communications: Leader Leader is a chose by the Master for each group 17
108 Organizing Communications for Broadcasting Idea: Grids are composed of clusters organizing s in groups clusters are high-performance communication environments Solution: add a new entity for organizing communications: Leader Leader is a chose by the Master for each group 17
109 Organizing Communications for Broadcasting Idea: Grids are composed of clusters organizing s in groups clusters are high-performance communication environments Solution: add a new entity for organizing communications: Leader Leader is a chose by the Master for each group Process to update Bounds: 1. the broadcasts the new Bound inside its group 2. the group Leader broadcasts the new Bound to all Leaders 3. each Leader broadcasts the new value inside their groups 17
110 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader 18
111 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader 18
112 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader 18
113 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader 18
114 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader new best bound! 18
115 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader new best bound! 18
116 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader new best bound! 18
117 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader new best bound! 18
118 BnB framework - Communications for Sharing Bound Leader Submaster Submaster Leader Leader Leader new best bound! 18
119 BnB framework - Communications for Sharing Bound Grid middleware provides communications Leader Submaster Submaster Leader Leader Leader new best bound! 18
120 BnB Search Strategies The improvement of the Bound 1. depends of how it is shared (communication) 2. depends of how the search-tree is generated: Classical Depth-First Search Breadth-First Search Contribution First-In-First-Out (FIFO) Priority open API... 19
121 BnB Framework: Fault-Tolerance 20
122 BnB Framework: Fault-Tolerance Manage user exception computation stopped 20
123 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop 20
124 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop fault handled by (sub-)master 20
125 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop fault handled by (sub-)master Leader fault master choose new one 20
126 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop fault handled by (sub-)master Leader fault master choose new one Sub-master fault manager change a worker to sub-master 20
127 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop fault handled by (sub-)master Leader fault master choose new one Sub-master fault manager change a worker to sub-master Master fault back-up file 20
128 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop fault handled by (sub-)master Leader fault master choose new one Sub-master fault manager change a worker to sub-master Master fault back-up file Load-Balancing is natural with master-worker 20
129 BnB Framework: Fault-Tolerance Manage user exception computation stopped For us, a fault is a Failed Stop fault handled by (sub-)master Leader fault master choose new one Sub-master fault manager change a worker to sub-master Master fault back-up file Load-Balancing is natural with master-worker the framework provides a function to get the number of free s users use it to decide branching 20
130 Grid BnB [HiPC07] Features 21
131 Grid BnB [HiPC07] Features Design 21
132 Grid BnB [HiPC07] Features Asynchronous communications Hierarchical master-worker with com. Dynamic task splitting Efficient communications with groups Fault-tolerance Design 21
133 Grid BnB [HiPC07] Features Asynchronous communications Hierarchical master-worker with com. Dynamic task splitting Efficient communications with groups Fault-tolerance Design Users 21
134 Grid BnB [HiPC07] Features Asynchronous communications Hierarchical master-worker with com. Dynamic task splitting Efficient communications with groups Fault-tolerance Design Hidden parallelism and Grid difficulties API for COPs Ease of deployment Principally tree-based Implementing and testing search strategies Focus on objective function Users 21
135 Grid BnB [HiPC07] Features Asynchronous communications Hierarchical master-worker with com. Dynamic task splitting Design Efficient communications with groups Fault-tolerance Validate and Test Grid'BnB by experiments Hidden parallelism and Grid difficulties API for COPs Ease of deployment Principally tree-based Implementing and testing search strategies Focus on objective function Users 21
136 Flow-Shop Experiments NP-hard permutation optimization problem J = {j1, j2,... jn} ji = { oi1, oi2,... oim } M = {m1,m2,...mm} m 4 o 1,4 o 2,4 o 3,4 o 4,4 m 3 o 1,3 o 2,3 o 3,3 o 4,3 m 2 m 1 o 1,2 o 2,2 o 3,2 o 4,2 o 1,1 o 2,1 o 3,1 o 4,1 Execution time 22
137 Single : Search Strategies Flow-Shop: 16 Jobs / 20 Machines Time in minutes CPUs FIFO Depth-first search Breadth-first search Priority 23
138 Single : Communications Flow-Shop: 16 Jobs / 20 Machines Time in minutes Ratio Factor CPUs With Group Communications Ratio (With Com / No Com) Without Group Communications 24
139 Grid'5000: the French Grid Lille Rennes Orsay Nancy Bordeaux Lyon Grenoble Toulouse Sophia 9 sites - 14 clusters CPUs 25
140 Grid Experimentations up to 621 CPUs on 5 sites Flow-Shop: 17 Jobs / 17 Machines Time in minutes CPUs Execution Time 26
141 Grid Experimentations up to 621 CPUs on 5 sites Flow-Shop: 17 Jobs / 17 Machines Flow-Shop: 17 Jobs / 17 Machines 0,3 0,25 Time in in minutes CPUs CPUs Execution Time 0,2 0,15 0,1 0,05 % of explored search tree Execution Time 26 Total Work
142 Grid Experimentations up to 621 CPUs on 5 sites Flow-Shop: 17 Jobs / 17 Machines Flow-Shop: 17 Jobs / 17 Machines 0,3 0,25 Time in in minutes Speedup anomalies [Lai 84, Li 86] CPUs CPUs Execution Time 0,2 0,15 0,1 0,05 % of explored search tree Execution Time 26 Total Work
143 Speedup Anomalies & Efficiency Parallel tree-based speedup may sometimes quite spectacular (> or < linear)[mans 95] Speedup Anomalies in BnB [Roucairol 87, Lai 84, Li 86] speedup depends of how the tree is dynamically built Efficiency: is a related measure computed as the speedup divided by the number of processors. 27
144 Speedup Anomalies & Efficiency Efficiency 1,2 Flow-Shop: 17 Jobs / 17 Machines Parallel tree-based speedup may sometimes quite 1 spectacular (> or < linear)[mans 95] 0,8 Speedup Anomalies in BnB [Roucairol 87, Lai 84, Li 86] 0,6 speedup depends of how the tree is dynamically built Efficiency: 0,4 is a related measure computed as the speedup divided by the number of processors. 0, CPUs 27
145 Grid'BnB: Results 28
146 Grid'BnB: Results Experimentally validate our BnB framework for Grids validity of organizing communications scalability on Grid (up to 621 CPUs on 5 sites) 28
147 Grid'BnB: Results Experimentally validate our BnB framework for Grids validity of organizing communications scalability on Grid (up to 621 CPUs on 5 sites) 28
148 Grid'BnB: Results Experimentally validate our BnB framework for Grids validity of organizing communications scalability on Grid (up to 621 CPUs on 5 sites) Problems: deployment on Grids is difficult dynamically acquiring new resources is difficult popularity of Grid'5000 cannot mix Grid'5000 and under-utilized lab desktops 28
149 Grid'BnB: Results Experimentally validate our BnB framework for Grids validity of organizing communications scalability on Grid (up to 621 CPUs on 5 sites) Grid middleware needs a better supporting of Problems: dynamic infrastructure deployment on Grids is difficult dynamically acquiring new resources is difficult popularity of Grid'5000 cannot mix Grid'5000 and under-utilized lab desktops 28
150 Agenda Context, Problem, and Related Work Contributions Branch-and-Bound Framework for Grids Desktop Grid with Peer-to-Peer Mixing Desktops & s Perspectives & Conclusion 29
151 Peer-to-Peer as Grid Middleware Grid Computing and Peer-to-Peer share a common goal: sharing resources [Foster 03, Goux 00] Grid related work: [Globus] providing computational resources - installing/deploying Grid middleware is difficult P2P related work: [Gnutella, Freenet, DHT] - focusing on sharing data & mono-application dynamic & easy to deploy 30
152 Peer-to-Peer as Grid Middleware Grid Computing and Peer-to-Peer share a common goal: sharing resources [Foster 03, Goux 00] Grid related work: [Globus] providing computational resources - installing/deploying Grid middleware is difficult P2P related work: [Gnutella, Freenet, DHT] - focusing on sharing data & mono-application dynamic & easy to deploy Objective: provide a P2P infrastructure for Grids and sharing computational resources 30
153 Which P2P Architecture? Master- centralized - targets only desktops good for embarrassingly parallel Pure/Unstructured Peer-to-Peer (Gnutella) - flooding problems supports high-churn (good for Desktop Grids) supports many kind of application (data, computational) Hybrid Peer-to-Peer (JXTA) uses central servers limits the flooding - has to manage churn Structured Peer-to-Peer (Chord) - high cost for managing churn efficient for data sharing (Distributed Hash Table) computing task asking for a task Master sending a task Unstructured P2P B A Resource Y Hybrid P2P B Resource Y Peer A T' T'' T''' pending tasks sending a partial result R' B R'' R''' partial results Flooding A Resource Y Flooding Peers in acquaintance A Peer K54 Server N51 N48 N56 N42 B -> Y... B Resource Y m N38 N1 computing task Peer Peer lookup(k54) N8+4 N14 Peers in acquaint N8 Message N14 query N32 m=6 N21 Finger table N8+1 N8+8 N8+16 N8+32 N14 N8+2 N14 N21 N32 N42 Peer Query 31
154 Which P2P Architecture? Master- centralized - targets only desktops good for embarrassingly parallel Pure/Unstructured Peer-to-Peer (Gnutella) - flooding problems supports high-churn (good for Desktop Grids) supports many kind of application (data, computational) Hybrid Peer-to-Peer (JXTA) uses central servers limits the flooding - has to manage churn Structured Peer-to-Peer (Chord) - high cost for managing churn efficient for data sharing (Distributed Hash Table) computing task asking for a task Master sending a task Unstructured P2P Pure Peer-to-Peer is the most adapted B Needs to avoid the flooding problem A Resource Y Hybrid P2P B Resource Y Peer A T' T'' T''' pending tasks sending a partial result R' B R'' R''' partial results Flooding A Resource Y Flooding Peers in acquaintance A Peer K54 Server N51 N48 N56 N42 B -> Y... B Resource Y m N38 N1 computing task Peer Peer lookup(k54) N8+4 N14 Peers in acquaint N8 Message N14 query N32 m=6 N21 Finger table N8+1 N8+8 N8+16 N8+32 N14 N8+2 N14 N21 N32 N42 Peer Query 31
155 Contribution & Positioning Grid Applications & Portals Branch-and-Bound API Grid Programming Grid Middleware Infrastructure Grid Fabric 32
156 Contribution & Positioning Grid Applications & Portals Branch-and-Bound API Grid Programming P2P Infrastructure Grid Middleware Infrastructure Grid Fabric 32
157 The Peer-to-Peer Infrastructure [CMST06] 33
158 The Peer-to-Peer Infrastructure [CMST06] Pure Peer-to-Peer overlay network Using it as Grid middleware infrastructure The proposed solution to avoid the flooding problem: 3 node-request protocols: 1 node: Random walk algorithm n nodes: Breadth-First-Search (BFS) algorithm with acknowledgement max nodes: BFS without acknowledgement Best-effort 33
159 INRIA Sophia P2P Desktop Grid Need to validate the infrastructure with desktops 260 desktops at INRIA Sophia lab No disturbing normal users: running in low priority working schedules: 24/24 50 machines (INRIA-2424) night/weekend 260 machines (INRIA-ALL) 34
160 INRIA Sophia P2P Desktop Grid Need to validate the infrastructure with desktops 260 desktops at INRIA Sophia lab No disturbing normal users: running in low priority working schedules: 24/24 50 machines (INRIA-2424) night/weekend 260 machines (INRIA-ALL) Deployed our P2P infrastructure as permanent Desktop Grid at INRIA Sophia 34
161 INRIA Sophia P2P Desktop Grid INRIA Sophia Antipolis - Desktop Machines Network INRIA Sub Network Need to validate the infrastructure with desktops 260 desktops at INRIA Sophia lab No disturbing normal users: running in low priority working schedules: 24/24 50 machines (INRIA-2424) night/weekend 260 machines (INRIA-ALL) INRIA Sub Network Deployed Desktop Machines our - INRIA-2424 P2P infrastructure Desktop as Machines permanent - INRIA-All Aquaintance Desktop Grid at INRIA Sophia 34 INRIA Sub Network INRIA Sub Network
162 Long Running Experiments with the P2P Desktop Grid Context: ETSI Grid Plugtests contest n-queens n-queens: embarrassingly parallel / CPU intensive / masterworker 35
163 Long Running Experiments with the P2P Desktop Grid Context: ETSI Grid Plugtests contest n-queens n-queens: embarrassingly parallel / CPU intensive / masterworker How many solution for 25 queens? 35
164 n-queens Experiment Results Total # of Tasks 12,125,199 Task Computation World Record [Sloane Integers Sequence A000170] What we learn from this experiments: validate the workability of the infrastructure validate the robustness of the infrastructure hard to forecast machine's performances '' Computation Time 185 days Cumulated Time 53 years # of Desktop Machines 260 Total of Solution Found 2,207,893,435,808,352 2 quadrillions
165 Agenda Context, Problem, and Related Work Contributions Branch-and-Bound Framework for Grids Desktop Grid with Peer-to-Peer Mixing Desktops & s Perspectives & Conclusion 37
166 Mixing Desktops & s [PARCO07] Grid'BnB Parallel BnB Framework for Grid P2P Infrastructure as Grid Midlleware 38
167 Mixing Desktops & s [PARCO07] Grid'BnB Parallel BnB Framework for Grid + P2P Infrastructure as Grid Midlleware API for solving COPs & Dynamic Grid Infrastructure 38
168 Mixing Desktops & s [PARCO07] Grid'BnB Parallel BnB Framework for Grid + P2P Infrastructure as Grid Midlleware API for solving COPs & Dynamic Grid Infrastructure Validate by experiments on a Grid of Desktops & s 38
169 Mixing Desktops & s [PARCO07] Grid'BnB Parallel BnB Framework for Grid + P2P Infrastructure as Grid Midlleware API for solving COPs & Dynamic Grid Infrastructure Validate by experiments on a Grid of Desktops & s New Problems: firewalls forwarder sharing clusters with the P2P infrastructure 38
170 Sharing 's Nodes with P2P Peer Host shared node Peer sharing local node 39
171 Sharing 's Nodes with P2P Host Host shared node... shared node Peer Host shared node Host shared node Host shared node Peer sharing local node Host Peer Peer sharing remote nodes 39
172 Testbed INRIA Sophia - Desktops G5K Platform Orsay GDX Lyon Sophia Toulouse 40
173 Testbed INRIA Sophia - Desktops G5K Platform Orsay GDX Peer Peer Node Node Lyon Peer Peer Node Node Sophia Toulouse Peer Node 40
174 Testbed INRIA Sophia - Desktops RMI/SSH G5K Platform Node Peer Peer Node Orsay GDXNode Node Node Node Node Peer Node Peer Node Node Lyon Node Peer Node Peer Node Node Node Sophia Node Node Node Node Peer Toulouse Peer Node 40
175 Large-Scale Experiments Goal: validate the infrastructure by experiments With n-queens: no communication between workers test the workability of the infrastructure With flow-shop: communications between workers test solving COPs 41
176 N-Queens Results Lyon Sophia Bord. Orsay Toulouse Total Number of CPUs for Experimentations ,49 39,01 63,68 190,63 238, Lyon Sophia Orsay Lyon Sophia Bordeaux Sophia Time in minutes !050 Total!of CPUs NQueens with n=22 INRIA-ALL G5K Sophia G5K Orsay G5K Lyon G5K Bordeaux G5K Toulouse 42
177 Flow-Shop Results Total Number of CPUs for Experimentations ,15 59,14 56,03 61,31 83,73 86, , Lyon 2456 Sophia Bord. Nancy Sophia Sophia Orsay Sophia Bordeaux Orsay Rennes 186 Toulouse ,667 65, ,333 Time in minutes Total # of CPUs Flow-Shop 17 jobs / 17 machines INRIA-2424 G5K Sophia G5K Orsay G5K Rennes G5K Lyon G5K Bordeaux G5K Nancy G5K Toulouse 43
178 Total Number of CPUs for Experimentations Flow-Shop Results Sophia Bord. Nancy Rennes 61, , , , , , Sophia With desktops 86, & clusters speedup anomalies occur Sophia more 55 Lyon Sophia Bordeaux 146 Orsay Orsay Toulouse ,667 65, ,333 Time in minutes Total # of CPUs Flow-Shop 17 jobs / 17 machines INRIA-2424 G5K Sophia G5K Orsay G5K Rennes G5K Lyon G5K Bordeaux G5K Nancy G5K Toulouse 43
179 Mixing - Analysis N-Queens problem scales well up to 1007 CPUs 349 Desktops + 5 s Flow-Shop up to 628 CPUs 42 Desktops + 5 s worse performances than using only clusters (anomalies are more frequents) Experimented in closed environments -- security Grid'5000 platform's success hard for running long exp. 44
180 P2P as a Meta-Grid infrastructure 45
181 P2P as a Meta-Grid infrastructure Observation from Grid'5000: 300 nodes available for 2 minutes provide best-effort queue 45
182 P2P as a Meta-Grid infrastructure Observation from Grid'5000: 300 nodes available for 2 minutes provide best-effort queue Idea: take benefit from these nodes [Condor] by hand: not easy with the P2P infrastructure: deploying peers in best-effort queue 45
183 P2P as a Meta-Grid infrastructure Observation from Grid'5000: 300 nodes available for 2 minutes provide best-effort queue Idea: take benefit from these nodes [Condor] by hand: not easy with the P2P infrastructure: deploying peers in best-effort queue Result: a permanent P2P infrastructure over Grid'
184 P2P as a Meta-Grid infrastructure Observation from Grid'5000: 300 nodes available for 2 minutes provide best-effort queue Idea: take benefit from these nodes [Condor] by hand: not easy with the P2P infrastructure: deploying peers in best-effort queue Result: a permanent P2P infrastructure over Grid'5000 P2P infrastructure as a dynamic Grid middleware 45
185 P2P with N-Queens on Grid' Experimentation: n22.log # of CPUs # of Tasks # of workers by minutes tasks computed by minutes Experimentation time in minutes 1384 CPUs - 9 sites 46
186 P2P with N-Queens on Grid' Experimentation: n22.log # of CPUs 1000 Embarrassingly applications can take benefit from Meta-Grid Infrastructure # of Tasks # of workers by minutes tasks computed by minutes Experimentation time in minutes 1384 CPUs - 9 sites 46
187 Agenda Context, Problem, and Related Work Contributions Branch-and-Bound Framework for Grids Desktop Grid with Peer-to-Peer Mixing Desktops & s Perspectives & Conclusion 47
188 Summary Grid'BnB: a BnB framework for Grids communication between workers organizing workers in groups Grid infrastructure based on P2P architecture mixing desktops and clusters deployed at INRIA Sophia lab 48
189 Perspectives Peer-to-Peer Infrastructure: Job Scheduler Resource Localization (PhD) Large-Scale Experiments: International Grid: France, Japan, and Netherlands Grid Pugtests Deployment: Contracts in Grids (GCM Standard) Industrialization: P2P Desktop Resource Virtualization CPER - P2P 1M /4years to professionalize the INRIA Grid 49
190 Conclusion 50
191 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers 50
192 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers 50
193 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers Peer-to-Peer as Grid infrastructure mixing desktops and clusters 50
194 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers Peer-to-Peer as Grid infrastructure mixing desktops and clusters 50
195 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers Peer-to-Peer as Grid infrastructure mixing desktops and clusters Tested and experimented 50
196 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers Peer-to-Peer as Grid infrastructure mixing desktops and clusters Tested and experimented Available in open source 50
197 Conclusion Branch-and-Bound for Grids solve COPs hide Grid difficulties communication between workers Peer-to-Peer as Grid infrastructure mixing desktops and clusters Tested and experimented Available in open source Provides framework and infrastructure to hide Grid difficulties 50
198 [SCCC05] Balancing Active Objects on a Peer to Peer Infrastructure Javier Bustos-Jimenez, Denis Caromel, Alexandre di Costanzo, Mario Leyton and Jose M. Piquer. Proceedings of the XXV International Conference of the Chilean Computer Science Society (SCCC 2005), Valdivia, Chile, November [HIC06] Executing Hydrodynamic Simulation on Desktop Grid with ObjectWeb ProActive Denis Caromel, Vincent Cavé, Alexandre di Costanzo, Céline Brignolles, Bruno Grawitz, and Yann Viala. HIC2006: Proceedings of the 7th International Conference on HydroInformatics, Nice, France, September [HiPC07] A Parallel Branch & Bound Framework for Grids Denis Caromel, Alexandre di Costanzo, Laurent Baduel, and Satoshi Matsuoka. Grid BnB: HiPC 07, Goa, India, December [CMST06] ProActive: an Integrated platform for programming and running applications on grids and P2P systems Denis Caromel, Christian Delbé, Alexandre di Costanzo, and Mario Leyton. Journal on Computational Methods in Science and Technology, volume 12, [PARCO07] Peer-to-Peer for Computational Grids: Mixing s and Desktop Machines Denis Caromel, Alexandre di Costanzo, and Clément Mathieu. Parallel Computing Journal on Large Scale Grid, [FGCS07] Peer-to-Peer and Fault-tolerance: Towards Deployment-based Technical Services Denis Caromel, Alexandre di Costanzo, and Christian Delbé. Journal Future Generation Computer Systems, to appear, and Workshops, Master report,...
Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed
Enabling Large-Scale Testing of IaaS Cloud Platforms on the Grid 5000 Testbed Sébastien Badia, Alexandra Carpen-Amarie, Adrien Lèbre, Lucas Nussbaum Grid 5000 S. Badia, A. Carpen-Amarie, A. Lèbre, L. Nussbaum
Cluster, Grid, Cloud Concepts
Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of
Using Peer to Peer Dynamic Querying in Grid Information Services
Using Peer to Peer Dynamic Querying in Grid Information Services Domenico Talia and Paolo Trunfio DEIS University of Calabria HPC 2008 July 2, 2008 Cetraro, Italy Using P2P for Large scale Grid Information
CLEVER: a CLoud-Enabled Virtual EnviRonment
CLEVER: a CLoud-Enabled Virtual EnviRonment Francesco Tusa Maurizio Paone Massimo Villari Antonio Puliafito {ftusa,mpaone,mvillari,apuliafito}@unime.it Università degli Studi di Messina, Dipartimento di
Asynchronous Peer-to- Peer Web Services and Firewalls
Asynchronous Peer-to- Peer Web s and Firewalls Aleksander Slominski 1, Alexandre di Costanzo 2 Dennis Gannon 1, Denis Caromel 2 1) Indiana University, 2) INRIA Sophia Antipolis Motivation! SOAP: better
System Models for Distributed and Cloud Computing
System Models for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Classification of Distributed Computing Systems
GridCOMP ProActive/GCM tutorial and and Hands-On Grid Programming
GridCOMP ProActive/GCM tutorial and and Hands-On Grid Programming Cédric Dalmasso, Antonio Cansado and Denis Caromel INRIA - OASIS Team INRIA -- CNRS -- I3S -- Univ. of Nice Sophia-Antipolis, IUF IV Grid@Work,
Principles and characteristics of distributed systems and environments
Principles and characteristics of distributed systems and environments Definition of a distributed system Distributed system is a collection of independent computers that appears to its users as a single
The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland
The Lattice Project: A Multi-Model Grid Computing System Center for Bioinformatics and Computational Biology University of Maryland Parallel Computing PARALLEL COMPUTING a form of computation in which
Comparison on Different Load Balancing Algorithms of Peer to Peer Networks
Comparison on Different Load Balancing Algorithms of Peer to Peer Networks K.N.Sirisha *, S.Bhagya Rekha M.Tech,Software Engineering Noble college of Engineering & Technology for Women Web Technologies
A PROXIMITY-AWARE INTEREST-CLUSTERED P2P FILE SHARING SYSTEM
A PROXIMITY-AWARE INTEREST-CLUSTERED P2P FILE SHARING SYSTEM Dr.S. DHANALAKSHMI 1, R. ANUPRIYA 2 1 Prof & Head, 2 Research Scholar Computer Science and Applications, Vivekanandha College of Arts and Sciences
Peer-to-Peer Computing
Quang Hieu Vu Mihai Lupu Beng Chin Ooi Peer-to-Peer Computing Principles and Applications Springer 1 Introduction 1 1.1 Peer-to-Peer Computing 1 1.2 Potential, Benefits, and Applications 3 1.3 Challenges
Varalakshmi.T #1, Arul Murugan.R #2 # Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam
A Survey on P2P File Sharing Systems Using Proximity-aware interest Clustering Varalakshmi.T #1, Arul Murugan.R #2 # Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam
High Throughput Computing on P2P Networks. Carlos Pérez Miguel [email protected]
High Throughput Computing on P2P Networks Carlos Pérez Miguel [email protected] Overview High Throughput Computing Motivation All things distributed: Peer-to-peer Non structured overlays Structured
Grid Computing Vs. Cloud Computing
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 577-582 International Research Publications House http://www. irphouse.com /ijict.htm Grid
Load Balancing: Toward the Infinite Network
Load Balancing: Toward the Infinite Network Javier Bustos-Jiménez, Denis Caromel {Javier.Bustos,Denis.Caromel}@sophia.inria.fr INRIA Sophia-Antipolis, France José M. Piquer [email protected] DCC, Universidad
8 Conclusion and Future Work
8 Conclusion and Future Work This chapter concludes this thesis and provides an outlook on future work in the area of mobile ad hoc networks and peer-to-peer overlay networks 8.1 Conclusion Due to the
Provisioning and Resource Management at Large Scale (Kadeploy and OAR)
Provisioning and Resource Management at Large Scale (Kadeploy and OAR) Olivier Richard Laboratoire d Informatique de Grenoble (LIG) Projet INRIA Mescal 31 octobre 2007 Olivier Richard ( Laboratoire d Informatique
Load Balancing: Towards the Infinite Network and Beyond
Load Balancing: Towards the Infinite Network and Beyond Javier Bustos-Jiménez 1,2,3, Denis Caromel 2, and José M. Piquer 1 1 Departamento de Ciencias de la Computación, Universidad de Chile. Blanco Encalada
How To Create A P2P Network
Peer-to-peer systems INF 5040 autumn 2007 lecturer: Roman Vitenberg INF5040, Frank Eliassen & Roman Vitenberg 1 Motivation for peer-to-peer Inherent restrictions of the standard client/server model Centralised
Symmetric Multiprocessing
Multicore Computing A multi-core processor is a processing system composed of two or more independent cores. One can describe it as an integrated circuit to which two or more individual processors (called
Interconnect Efficiency of Tyan PSC T-630 with Microsoft Compute Cluster Server 2003
Interconnect Efficiency of Tyan PSC T-630 with Microsoft Compute Cluster Server 2003 Josef Pelikán Charles University in Prague, KSVI Department, [email protected] Abstract 1 Interconnect quality
Peer-VM: A Peer-to-Peer Network of Virtual Machines for Grid Computing
Peer-VM: A Peer-to-Peer Network of Virtual Machines for Grid Computing (Research Proposal) Abhishek Agrawal ([email protected]) Abstract This proposal discusses details about Peer-VM which is a peer-to-peer
Storage Virtualization from clusters to grid
Seanodes presents Storage Virtualization from clusters to grid Rennes 4th october 2007 Agenda Seanodes Presentation Overview of storage virtualization in clusters Seanodes cluster virtualization, with
Classic Grid Architecture
Peer-to to-peer Grids Classic Grid Architecture Resources Database Database Netsolve Collaboration Composition Content Access Computing Security Middle Tier Brokers Service Providers Middle Tier becomes
RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT
RESEARCH ISSUES IN PEER-TO-PEER DATA MANAGEMENT Bilkent University 1 OUTLINE P2P computing systems Representative P2P systems P2P data management Incentive mechanisms Concluding remarks Bilkent University
International journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online http://www.ijoer.
RESEARCH ARTICLE ISSN: 2321-7758 GLOBAL LOAD DISTRIBUTION USING SKIP GRAPH, BATON AND CHORD J.K.JEEVITHA, B.KARTHIKA* Information Technology,PSNA College of Engineering & Technology, Dindigul, India Article
A P2P SERVICE DISCOVERY STRATEGY BASED ON CONTENT
A P2P SERVICE DISCOVERY STRATEGY BASED ON CONTENT CATALOGUES Lican Huang Institute of Network & Distributed Computing, Zhejiang Sci-Tech University, No.5, St.2, Xiasha Higher Education Zone, Hangzhou,
Software Concepts. Uniprocessor Operating Systems. System software structures. CIS 505: Software Systems Architectures of Distributed Systems
CIS 505: Software Systems Architectures of Distributed Systems System DOS Software Concepts Description Tightly-coupled operating system for multiprocessors and homogeneous multicomputers Main Goal Hide
UPS battery remote monitoring system in cloud computing
, pp.11-15 http://dx.doi.org/10.14257/astl.2014.53.03 UPS battery remote monitoring system in cloud computing Shiwei Li, Haiying Wang, Qi Fan School of Automation, Harbin University of Science and Technology
How To Understand The Concept Of A Distributed System
Distributed Operating Systems Introduction Ewa Niewiadomska-Szynkiewicz and Adam Kozakiewicz [email protected], [email protected] Institute of Control and Computation Engineering Warsaw University of
Cellular Computing on a Linux Cluster
Cellular Computing on a Linux Cluster Alexei Agueev, Bernd Däne, Wolfgang Fengler TU Ilmenau, Department of Computer Architecture Topics 1. Cellular Computing 2. The Experiment 3. Experimental Results
Distributed Systems LEEC (2005/06 2º Sem.)
Distributed Systems LEEC (2005/06 2º Sem.) Introduction João Paulo Carvalho Universidade Técnica de Lisboa / Instituto Superior Técnico Outline Definition of a Distributed System Goals Connecting Users
Using Content-Addressable Networks for Load Balancing in Desktop Grids (Extended Version)
Using Content-Addressable Networks for Load Balancing in Desktop Grids (Extended Version) Jik-Soo Kim, Peter Keleher, Michael Marsh, Bobby Bhattacharjee and Alan Sussman UMIACS and Department of Computer
Volunteer Computing, Grid Computing and Cloud Computing: Opportunities for Synergy. Derrick Kondo INRIA, France
Volunteer Computing, Grid Computing and Cloud Computing: Opportunities for Synergy Derrick Kondo INRIA, France Outline Cloud Grid Volunteer Computing Cloud Background Vision Hide complexity of hardware
An approach to grid scheduling by using Condor-G Matchmaking mechanism
An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr
Distributed Systems. REK s adaptation of Prof. Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 1
Distributed Systems REK s adaptation of Prof. Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 1 1 The Rise of Distributed Systems! Computer hardware prices are falling and power increasing.!
Adapting Distributed Hash Tables for Mobile Ad Hoc Networks
University of Tübingen Chair for Computer Networks and Internet Adapting Distributed Hash Tables for Mobile Ad Hoc Networks Tobias Heer, Stefan Götz, Simon Rieche, Klaus Wehrle Protocol Engineering and
Proposal of Dynamic Load Balancing Algorithm in Grid System
www.ijcsi.org 186 Proposal of Dynamic Load Balancing Algorithm in Grid System Sherihan Abu Elenin Faculty of Computers and Information Mansoura University, Egypt Abstract This paper proposed dynamic load
High Performance Computing. Course Notes 2007-2008. HPC Fundamentals
High Performance Computing Course Notes 2007-2008 2008 HPC Fundamentals Introduction What is High Performance Computing (HPC)? Difficult to define - it s a moving target. Later 1980s, a supercomputer performs
5. Peer-to-peer (P2P) networks
5. Peer-to-peer (P2P) networks PA191: Advanced Computer Networking I. Eva Hladká Slides by: Tomáš Rebok Faculty of Informatics Masaryk University Autumn 2015 Eva Hladká (FI MU) 5. P2P networks Autumn 2015
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 349 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 349 Load Balancing Heterogeneous Request in DHT-based P2P Systems Mrs. Yogita A. Dalvi Dr. R. Shankar Mr. Atesh
A SURVEY ON MAPREDUCE IN CLOUD COMPUTING
A SURVEY ON MAPREDUCE IN CLOUD COMPUTING Dr.M.Newlin Rajkumar 1, S.Balachandar 2, Dr.V.Venkatesakumar 3, T.Mahadevan 4 1 Asst. Prof, Dept. of CSE,Anna University Regional Centre, Coimbatore, [email protected]
LSKA 2010 Survey Report Job Scheduler
LSKA 2010 Survey Report Job Scheduler Graduate Institute of Communication Engineering {r98942067, r98942112}@ntu.edu.tw March 31, 2010 1. Motivation Recently, the computing becomes much more complex. However,
Chapter 18: Database System Architectures. Centralized Systems
Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and
PEER TO PEER FILE SHARING USING NETWORK CODING
PEER TO PEER FILE SHARING USING NETWORK CODING Ajay Choudhary 1, Nilesh Akhade 2, Aditya Narke 3, Ajit Deshmane 4 Department of Computer Engineering, University of Pune Imperial College of Engineering
Peer-to-Peer: an Enabling Technology for Next-Generation E-learning
Peer-to-Peer: an Enabling Technology for Next-Generation E-learning Aleksander Bu lkowski 1, Edward Nawarecki 1, and Andrzej Duda 2 1 AGH University of Science and Technology, Dept. Of Computer Science,
GAMS, Condor and the Grid: Solving Hard Optimization Models in Parallel. Michael C. Ferris University of Wisconsin
GAMS, Condor and the Grid: Solving Hard Optimization Models in Parallel Michael C. Ferris University of Wisconsin Parallel Optimization Aid search for global solutions (typically in non-convex or discrete)
An Introduction to Peer-to-Peer Networks
An Introduction to Peer-to-Peer Networks Presentation for MIE456 - Information Systems Infrastructure II Vinod Muthusamy October 30, 2003 Agenda Overview of P2P Characteristics Benefits Unstructured P2P
Distributed Systems and Recent Innovations: Challenges and Benefits
Distributed Systems and Recent Innovations: Challenges and Benefits 1. Introduction Krishna Nadiminti, Marcos Dias de Assunção, and Rajkumar Buyya Grid Computing and Distributed Systems Laboratory Department
Simplest Scalable Architecture
Simplest Scalable Architecture NOW Network Of Workstations Many types of Clusters (form HP s Dr. Bruce J. Walker) High Performance Clusters Beowulf; 1000 nodes; parallel programs; MPI Load-leveling Clusters
Linux Performance Optimizations for Big Data Environments
Linux Performance Optimizations for Big Data Environments Dominique A. Heger Ph.D. DHTechnologies (Performance, Capacity, Scalability) www.dhtusa.com Data Nubes (Big Data, Hadoop, ML) www.datanubes.com
STUDY AND SIMULATION OF A DISTRIBUTED REAL-TIME FAULT-TOLERANCE WEB MONITORING SYSTEM
STUDY AND SIMULATION OF A DISTRIBUTED REAL-TIME FAULT-TOLERANCE WEB MONITORING SYSTEM Albert M. K. Cheng, Shaohong Fang Department of Computer Science University of Houston Houston, TX, 77204, USA http://www.cs.uh.edu
Audio networking. François Déchelle ([email protected]) Patrice Tisserand ([email protected]) Simon Schampijer (schampij@ircam.
Audio networking François Déchelle ([email protected]) Patrice Tisserand ([email protected]) Simon Schampijer ([email protected]) IRCAM Distributed virtual concert project and issues network protocols
A distributed system is defined as
A distributed system is defined as A collection of independent computers that appears to its users as a single coherent system CS550: Advanced Operating Systems 2 Resource sharing Openness Concurrency
Web Service Based Data Management for Grid Applications
Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications
Distributed Computing over Communication Networks: Topology. (with an excursion to P2P)
Distributed Computing over Communication Networks: Topology (with an excursion to P2P) Some administrative comments... There will be a Skript for this part of the lecture. (Same as slides, except for today...
Flauncher and DVMS Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically
Flauncher and Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically Daniel Balouek, Adrien Lèbre, Flavien Quesnel To cite this version: Daniel Balouek,
Grid Scheduling Architectures with Globus GridWay and Sun Grid Engine
Grid Scheduling Architectures with and Sun Grid Engine Sun Grid Engine Workshop 2007 Regensburg, Germany September 11, 2007 Ignacio Martin Llorente Javier Fontán Muiños Distributed Systems Architecture
Peer-to-Peer Networks. Chapter 6: P2P Content Distribution
Peer-to-Peer Networks Chapter 6: P2P Content Distribution Chapter Outline Content distribution overview Why P2P content distribution? Network coding Peer-to-peer multicast Kangasharju: Peer-to-Peer Networks
Speeding up MATLAB and Simulink Applications
Speeding up MATLAB and Simulink Applications 2009 The MathWorks, Inc. Customer Tour 2009 Today s Schedule Introduction to Parallel Computing with MATLAB and Simulink Break Master Class on Speeding Up MATLAB
GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources
GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources Jack Dongarra University of Tennessee and Oak Ridge National Laboratory 2/25/2006 1 Overview Grid/NetSolve
A Comparison of Distributed Systems: ChorusOS and Amoeba
A Comparison of Distributed Systems: ChorusOS and Amoeba Angelo Bertolli Prepared for MSIT 610 on October 27, 2004 University of Maryland University College Adelphi, Maryland United States of America Abstract.
LOAD BALANCING WITH PARTIAL KNOWLEDGE OF SYSTEM
LOAD BALANCING WITH PARTIAL KNOWLEDGE OF SYSTEM IN PEER TO PEER NETWORKS R. Vijayalakshmi and S. Muthu Kumarasamy Dept. of Computer Science & Engineering, S.A. Engineering College Anna University, Chennai,
Object Request Reduction in Home Nodes and Load Balancing of Object Request in Hybrid Decentralized Web Caching
2012 2 nd International Conference on Information Communication and Management (ICICM 2012) IPCSIT vol. 55 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V55.5 Object Request Reduction
Learn Oracle WebLogic Server 12c Administration For Middleware Administrators
Wednesday, November 18,2015 1:15-2:10 pm VT425 Learn Oracle WebLogic Server 12c Administration For Middleware Administrators Raastech, Inc. 2201 Cooperative Way, Suite 600 Herndon, VA 20171 +1-703-884-2223
Multilevel Communication Aware Approach for Load Balancing
Multilevel Communication Aware Approach for Load Balancing 1 Dipti Patel, 2 Ashil Patel Department of Information Technology, L.D. College of Engineering, Gujarat Technological University, Ahmedabad 1
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
Branch-and-Price Approach to the Vehicle Routing Problem with Time Windows
TECHNISCHE UNIVERSITEIT EINDHOVEN Branch-and-Price Approach to the Vehicle Routing Problem with Time Windows Lloyd A. Fasting May 2014 Supervisors: dr. M. Firat dr.ir. M.A.A. Boon J. van Twist MSc. Contents
Load balancing; Termination detection
Load balancing; Termination detection Parallel and Distributed Computing Department of Computer Science and Engineering (DEI) Instituto Superior Técnico November 14, 2013 CPD (DEI / IST) Parallel and Distributed
A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment
A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment Arshad Ali 3, Ashiq Anjum 3, Atif Mehmood 3, Richard McClatchey 2, Ian Willers 2, Julian Bunn
Introduction to Computer Networks
Introduction to Computer Networks Chen Yu Indiana University Basic Building Blocks for Computer Networks Nodes PC, server, special-purpose hardware, sensors Switches Links: Twisted pair, coaxial cable,
PEER-TO-PEER (P2P) systems have emerged as an appealing
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 4, APRIL 2009 595 Histogram-Based Global Load Balancing in Structured Peer-to-Peer Systems Quang Hieu Vu, Member, IEEE, Beng Chin Ooi,
CHAPTER 1: OPERATING SYSTEM FUNDAMENTALS
CHAPTER 1: OPERATING SYSTEM FUNDAMENTALS What is an operating? A collection of software modules to assist programmers in enhancing efficiency, flexibility, and robustness An Extended Machine from the users
Virtual machine interface. Operating system. Physical machine interface
Software Concepts User applications Operating system Hardware Virtual machine interface Physical machine interface Operating system: Interface between users and hardware Implements a virtual machine that
