Load balancing; Termination detection

Similar documents

Load balancing; Termination detection

Load balancing Static Load Balancing

Load Balancing and Termination Detection

Load Balancing and Termination Detection

Load Balancing and Termination Detection

Chapter 7 Load Balancing and Termination Detection

Load Balancing Techniques

Static Load Balancing

A number of tasks executing serially or in parallel. Distribute tasks on processors so that minimal execution time is achieved. Optimal distribution

Praktikum Wissenschaftliches Rechnen (Performance-optimized optimized Programming)

Load Balancing Algorithms for Peer to Peer and Client Server Distributed Environments

PPD: Scheduling and Load Balancing 2

LOAD BALANCING TECHNIQUES

A Comparative Performance Analysis of Load Balancing Algorithms in Distributed System using Qualitative Parameters

Various Schemes of Load Balancing in Distributed Systems- A Review

Load Balancing. Load Balancing 1 / 24

Design and Implementation of Efficient Load Balancing Algorithm in Grid Environment

Dynamic Load Balancing in a Network of Workstations

Keywords Load balancing, Dispatcher, Distributed Cluster Server, Static Load balancing, Dynamic Load balancing.

Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6

A Study on the Application of Existing Load Balancing Algorithms for Large, Dynamic, Heterogeneous Distributed Systems

Comparative Study of Load Balancing Algorithms

A Review of Customized Dynamic Load Balancing for a Network of Workstations

Decentralized Task-Aware Scheduling for Data Center Networks

CSE 4351/5351 Notes 7: Task Scheduling & Load Balancing

Performance Analysis of Load Balancing Algorithms in Distributed System

Design and Implementation of Distributed Process Execution Environment

CHAPTER 5 WLDMA: A NEW LOAD BALANCING STRATEGY FOR WAN ENVIRONMENT

Scheduling Allowance Adaptability in Load Balancing technique for Distributed Systems

Operatin g Systems: Internals and Design Principle s. Chapter 10 Multiprocessor and Real-Time Scheduling Seventh Edition By William Stallings

A Dynamic Approach for Load Balancing using Clusters

Lecture Outline Overview of real-time scheduling algorithms Outline relative strengths, weaknesses

Tools Page 1 of 13 ON PROGRAM TRANSLATION. A priori, we have two translation mechanisms available:

A Comparison of Dynamic Load Balancing Algorithms

Design of an Optimized Virtual Server for Efficient Management of Cloud Load in Multiple Cloud Environments

Expanding the CASEsim Framework to Facilitate Load Balancing of Social Network Simulations

A Survey on Load Balancing Techniques Using ACO Algorithm

Comparison on Different Load Balancing Algorithms of Peer to Peer Networks

1. The subnet must prevent additional packets from entering the congested region until those already present can be processed.

A Clustered Approach for Load Balancing in Distributed Systems

Scheduling. Scheduling. Scheduling levels. Decision to switch the running process can take place under the following circumstances:

Routing in packet-switching networks

A Robust Dynamic Load-balancing Scheme for Data Parallel Application on Message Passing Architecture

How To Compare Load Sharing And Job Scheduling In A Network Of Workstations

International Journal of Scientific & Engineering Research, Volume 6, Issue 4, April ISSN

Feedback guided load balancing in a distributed memory environment

Mizan: A System for Dynamic Load Balancing in Large-scale Graph Processing

Interconnection Networks. Interconnection Networks. Interconnection networks are used everywhere!

Journal of Theoretical and Applied Information Technology 20 th July Vol.77. No JATIT & LLS. All rights reserved.

COMPARATIVE STUDY ON LOAD BALANCING TECHNIQUES IN DISTRIBUTED SYSTEMS

SCALABILITY AND AVAILABILITY

Spring 2011 Prof. Hyesoon Kim

Local Area Networks transmission system private speedy and secure kilometres shared transmission medium hardware & software

Chapter 15: Distributed Structures. Topology

Operating Systems. III. Scheduling.

A Review on an Algorithm for Dynamic Load Balancing in Distributed Network with Multiple Supporting Nodes with Interrupt Service

QoS Parameters. Quality of Service in the Internet. Traffic Shaping: Congestion Control. Keeping the QoS

Xiaoqiao Meng, Vasileios Pappas, Li Zhang IBM T.J. Watson Research Center Presented by: Payman Khani

Decentralized Dynamic Load Balancing: The Particles Approach

The Asterope compute cluster

GEDAE TM - A Graphical Programming and Autocode Generation Tool for Signal Processor Applications

Load Balancing in cloud computing

AN EFFICIENT LOAD BALANCING APPROACH IN CLOUD SERVER USING ANT COLONY OPTIMIZATION

Discrete-Event Simulation

Adaptive Load Balancing Method Enabling Auto-Specifying Threshold of Node Load Status for Apache Flume

Overview of Network Hardware and Software. CS158a Chris Pollett Jan 29, 2007.

An Introduction to Parallel Computing/ Programming

MuleSoft Blueprint: Load Balancing Mule for Scalability and Availability

Quality of Service (QoS)) in IP networks

APPENDIX 1 USER LEVEL IMPLEMENTATION OF PPATPAN IN LINUX SYSTEM

Parallel and Distributed Computing Programming Assignment 1

Energy Efficient MapReduce

LOAD BALANCING ALGORITHM REVIEW s IN CLOUD ENVIRONMENT

Classification - Examples

Scheduling Task Parallelism" on Multi-Socket Multicore Systems"

Application Monitoring in the Grid with GRM and PROVE *

E) Modeling Insights: Patterns and Anti-patterns

Key Components of WAN Optimization Controller Functionality

A novel load balancing algorithm for computational grid

walberla: Towards an Adaptive, Dynamically Load-Balanced, Massively Parallel Lattice Boltzmann Fluid Simulation

Load Balancing in Periodic Wireless Sensor Networks for Lifetime Maximisation

Research on Job Scheduling Algorithm in Hadoop

Dynamic Load Balancing in Charm++ Abhinav S Bhatele Parallel Programming Lab, UIUC

Enhanced Power Saving for IEEE WLAN with Dynamic Slot Allocation

Proposal of Dynamic Load Balancing Algorithm in Grid System

Network management and QoS provisioning - QoS in the Internet

Global Load Balancing and Primary Backup Approach for Fault Tolerant Scheduling in Computational Grid

Cloud Computing. Lectures 10 and 11 Map Reduce: System Perspective

CPU SCHEDULING (CONT D) NESTED SCHEDULING FUNCTIONS

High Throughput Computing on P2P Networks. Carlos Pérez Miguel

The International Journal Of Science & Technoledge (ISSN X)

Transcription:

Load balancing; Termination detection Parallel and Distributed Computing Department of Computer Science and Engineering (DEI) Instituto Superior Técnico November 13, 2014 CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 1 / 23

Outline Load Balancing static dynamic centralized decentralized Termination Detection Debugging CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 2 / 23

Foster s Design Methodology Development of scalable parallel algorithms by delaying machine-dependent decisions to later stages. Four steps: partitioning communication agglomeration mapping CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 3 / 23

Mapping Examples discussed in previous classes: circuit satisfiability sieve of Eratosthenes all pairs shortest paths matrix-vector multiplication So far, all primitive tasks require same amount of computation. Aggregation + Mapping strategy: create one task per processor and distribute primitive tasks evenly. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 4 / 23

Mapping Examples discussed in previous classes: circuit satisfiability sieve of Eratosthenes all pairs shortest paths matrix-vector multiplication So far, all primitive tasks require same amount of computation. Aggregation + Mapping strategy: create one task per processor and distribute primitive tasks evenly. What if we can not predict beforehand the amount of computation required per primitive task? CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 4 / 23

Load Balancing P 4 P 4 P 3 P 3 P 2 P 2 P 1 P 1 P 0 P 0 Perfect load balancing t Imperfect load balancing t CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 5 / 23

Load Balancing P 4 P 4 P 3 P 3 P 2 P 2 P 1 P 1 P 0 P 0 Perfect load balancing t Imperfect load balancing t Parallel execution time defined by last task to finish. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 5 / 23

Load Balancing P 4 P 4 P 3 P 3 P 2 P 2 P 1 P 1 P 0 P 0 Perfect load balancing t Imperfect load balancing t Parallel execution time defined by last task to finish. Parallel time minimized when load distributed evenly. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 5 / 23

Load Balancing Static load balancing: assignment of tasks to processors decided during program development mapping problem. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 6 / 23

Load Balancing Static load balancing: assignment of tasks to processors decided during program development mapping problem. Dynamic load balancing: assignment of tasks to processors decided during runtime CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 6 / 23

Static Load Balancing Substantial amount of research has been dedicated to static load balancing. Problem Statement (Partition Problem) Given a number of tasks, each with a given computational effort, and a number of partitions (processors), assign each to a partition such that the maximum computational effort over all partitions is minimized. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 7 / 23

Static Load Balancing Substantial amount of research has been dedicated to static load balancing. Problem Statement (Partition Problem) Given a number of tasks, each with a given computational effort, and a number of partitions (processors), assign each to a partition such that the maximum computational effort over all partitions is minimized. NP-Hard combinatorial problem. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 7 / 23

Static Load Balancing Substantial amount of research has been dedicated to static load balancing. Problem Statement (Partition Problem) Given a number of tasks, each with a given computational effort, and a number of partitions (processors), assign each to a partition such that the maximum computational effort over all partitions is minimized. NP-Hard combinatorial problem. Yet, pseudo-polynomial time solution - The Easiest Hard Problem. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 7 / 23

Limitations of Static Load Balancing Limitations of static load balancing: CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 8 / 23

Limitations of Static Load Balancing Limitations of static load balancing: computational effort of each task may not be known a-priori some problems have an indeterminate number of steps to complete CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 8 / 23

Limitations of Static Load Balancing Limitations of static load balancing: computational effort of each task may not be known a-priori some problems have an indeterminate number of steps to complete programs are subject to variable communication delays CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 8 / 23

Limitations of Static Load Balancing Limitations of static load balancing: computational effort of each task may not be known a-priori some problems have an indeterminate number of steps to complete programs are subject to variable communication delays performance of each processor may vary, and may not be known beforehand Dynamic load balancing overcomes these issues by making the division of the load dependent on the actual runtimes. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 8 / 23

Limitations of Static Load Balancing Limitations of static load balancing: computational effort of each task may not be known a-priori some problems have an indeterminate number of steps to complete programs are subject to variable communication delays performance of each processor may vary, and may not be known beforehand Dynamic load balancing overcomes these issues by making the division of the load dependent on the actual runtimes. Penalty: additional overhead due to task management. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 8 / 23

Dynamic Load Balancing Dynamic load balancing can be accomplished through: centralized management one process (master) is responsible for assigning tasks to slave processes decentralized management all processes are equal and divide work among them cooperatively CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 9 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution when idle, slave process requests a task to the master CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution when idle, slave process requests a task to the master master selects tasks among those ready to run and sends to slave CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution when idle, slave process requests a task to the master master selects tasks among those ready to run and sends to slave tasks of larger size or importance are executed first CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution when idle, slave process requests a task to the master master selects tasks among those ready to run and sends to slave tasks of larger size or importance are executed first if tasks are all the same, a FIFO queue may be used CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution when idle, slave process requests a task to the master master selects tasks among those ready to run and sends to slave tasks of larger size or importance are executed first if tasks are all the same, a FIFO queue may be used specialized slaves can be considered CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: master holds all tasks of the application new tasks may be generated during execution when idle, slave process requests a task to the master master selects tasks among those ready to run and sends to slave tasks of larger size or importance are executed first if tasks are all the same, a FIFO queue may be used specialized slaves can be considered master can also hold global data CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 10 / 23

Centralized Dynamic Load Balancing Work Pool or Processor Farm model: Work pool T 5 Master Process T 1 T 2 T 7 T 4 T n T 3 T 6 Send Task P 0 Request Task (possibly, submit new task) P 1 P 2 P p Slave Processes CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 11 / 23

Centralized Dynamic Load Balancing Limitations of centralized dynamic load balancing: CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 12 / 23

Centralized Dynamic Load Balancing Limitations of centralized dynamic load balancing: Master can become a bottleneck as it can only issue one task at a time. not a significant problem if few slaves and/or computationally intensive tasks for finer-grained tasks and many slaves, distribute task management CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 12 / 23

Decentralized Dynamic Load Balancing Initial approach can be simply to evolve the centralized system into a tree-like task management scheme: main master distribute tasks to be managed by second-level masters each second-level master behaves as in the centralized method Naturally, this model can be extended to as many levels as desired. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 13 / 23

Decentralized Dynamic Load Balancing Fully distributed work pool: each process starts with a given number of tasks, but may send or receive new tasks to handle. Receiver-initiated method: process request new tasks when it has few or no tasks to do better in high load systems Sender-initiated method: process under heavy load send tasks to processes willing to accept them A mixture of both approaches is possible. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 14 / 23

Decentralized Dynamic Load Balancing Fully distributed work pool: each process starts with a given number of tasks, but may send or receive new tasks to handle. Receiver-initiated method: process request new tasks when it has few or no tasks to do better in high load systems Sender-initiated method: process under heavy load send tasks to processes willing to accept them A mixture of both approaches is possible. Which process to send/request? CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 14 / 23

Decentralized Dynamic Load Balancing Fully distributed work pool: each process starts with a given number of tasks, but may send or receive new tasks to handle. Receiver-initiated method: process request new tasks when it has few or no tasks to do better in high load systems Sender-initiated method: process under heavy load send tasks to processes willing to accept them A mixture of both approaches is possible. Which process to send/request? round-robin random CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 14 / 23

Mapping Strategy Decision tree for mapping strategy: Static number of tasks Dynamic number of tasks Structured communication pattern Unstructured communication pattern Frequent communication between tasks Many tasks with little communication Roughly constant computation time per task Computation per task varies Create one task per processor (agglomerate tasks to minimize communication) Cyclically map tasks to processors to balance load Use a static load balancing algorithm Use a decentralized dynamic load balancing algorithm Use a centralized dynamic load balancing algorithm CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 15 / 23

Termination Detection When can we be sure that the computation has finished? CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing Typically termination is easy to detect has layout of application is fixed and well controlled. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing Typically termination is easy to detect has layout of application is fixed and well controlled. Centralized dynamic load balancing CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing Typically termination is easy to detect has layout of application is fixed and well controlled. Centralized dynamic load balancing Also easy for master to recognize termination: task queue is empty every slave is idle, having requested another task, and without generating any new task CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing Typically termination is easy to detect has layout of application is fixed and well controlled. Centralized dynamic load balancing Also easy for master to recognize termination: task queue is empty every slave is idle, having requested another task, and without generating any new task Alternatively, if a slave indicates that a solution has been found, master can terminate all slaves. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing Typically termination is easy to detect has layout of application is fixed and well controlled. Centralized dynamic load balancing Also easy for master to recognize termination: task queue is empty every slave is idle, having requested another task, and without generating any new task Alternatively, if a slave indicates that a solution has been found, master can terminate all slaves. Decentralized dynamic load balancing CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection When can we be sure that the computation has finished? Static load balancing Typically termination is easy to detect has layout of application is fixed and well controlled. Centralized dynamic load balancing Also easy for master to recognize termination: task queue is empty every slave is idle, having requested another task, and without generating any new task Alternatively, if a slave indicates that a solution has been found, master can terminate all slaves. Decentralized dynamic load balancing Not so trivial... CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 16 / 23

Termination Detection In general, termination at time t requires the following conditions: 1 local termination conditions exist for all processes at time t 2 there are no messages in transit at time t Second condition is what makes this problem difficult. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 17 / 23

Termination Detection In general, termination at time t requires the following conditions: 1 local termination conditions exist for all processes at time t 2 there are no messages in transit at time t Second condition is what makes this problem difficult. Wait long enough before assuming program has finished? CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 17 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) every time a process receives a task sends an acknowledgment, with the exception of the parent CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) every time a process receives a task sends an acknowledgment, with the exception of the parent acknowledgment to parent is only sent when process is ready to become inactive CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) every time a process receives a task sends an acknowledgment, with the exception of the parent acknowledgment to parent is only sent when process is ready to become inactive process is ready to become inactive when CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) every time a process receives a task sends an acknowledgment, with the exception of the parent acknowledgment to parent is only sent when process is ready to become inactive process is ready to become inactive when local termination conditions exist (all tasks finished) CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) every time a process receives a task sends an acknowledgment, with the exception of the parent acknowledgment to parent is only sent when process is ready to become inactive process is ready to become inactive when local termination conditions exist (all tasks finished) it has transmitted all acknowledgments for tasks it received CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages each process has two states: active and inactive. processes start inactive they become active when they receive first task sender of tasks becomes parent of process process may receive many other tasks while active (computation itself need not be a tree) every time a process receives a task sends an acknowledgment, with the exception of the parent acknowledgment to parent is only sent when process is ready to become inactive process is ready to become inactive when local termination conditions exist (all tasks finished) it has transmitted all acknowledgments for tasks it received it has received all acknowledgments for tasks it has sent out When first process becomes idle, computation can terminate. CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 18 / 23

Termination Detection Using Acknowledgment Messages Parent Process Inactive Final Ack First Task Ack Task Other processes Active CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 19 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate when P 0 becomes white, it sends a white token to P 1 CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate when P 0 becomes white, it sends a white token to P 1 the token is passed to the next process in the ring when the process finishes, and: if the color of the process is black, the token becomes black otherwise, the token keeps the same color CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate when P 0 becomes white, it sends a white token to P 1 the token is passed to the next process in the ring when the process finishes, and: if the color of the process is black, the token becomes black otherwise, the token keeps the same color a process P i becomes black if it sends a message to process P j and j < i CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate when P 0 becomes white, it sends a white token to P 1 the token is passed to the next process in the ring when the process finishes, and: if the color of the process is black, the token becomes black otherwise, the token keeps the same color a process P i becomes black if it sends a message to process P j and j < i a black process becomes white when it passes the token CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate when P 0 becomes white, it sends a white token to P 1 the token is passed to the next process in the ring when the process finishes, and: if the color of the process is black, the token becomes black otherwise, the token keeps the same color a process P i becomes black if it sends a message to process P j and j < i a black process becomes white when it passes the token when P 0 receives a black token, it passes a white token CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Termination Detection Ring Termination Algorithm processes become white when they terminate when P 0 becomes white, it sends a white token to P 1 the token is passed to the next process in the ring when the process finishes, and: if the color of the process is black, the token becomes black otherwise, the token keeps the same color a process P i becomes black if it sends a message to process P j and j < i a black process becomes white when it passes the token when P 0 receives a black token, it passes a white token when P 0 receives a white token, computation can terminate CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 20 / 23

Debugging Most common tool for debugging of MPI programs: CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 21 / 23

Debugging Most common tool for debugging of MPI programs: printf! CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 21 / 23

Debugging Most common tool for debugging of MPI programs: printf! Program runs slower. Difficulties for parallel debugging? CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 21 / 23

Debugging Most common tool for debugging of MPI programs: printf! Program runs slower. Difficulties for parallel debugging? redirection of stdio: MPI takes care of this by delaying some tasks, behavior of program may change significantly CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 21 / 23

Debugging Most common tool for debugging of MPI programs: printf! Program runs slower. Difficulties for parallel debugging? redirection of stdio: MPI takes care of this by delaying some tasks, behavior of program may change significantly Don t forget fflush! CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 21 / 23

Debugging Most common tool for debugging of MPI programs: printf! Program runs slower. Difficulties for parallel debugging? redirection of stdio: MPI takes care of this by delaying some tasks, behavior of program may change significantly Don t forget fflush! Debuggers can be used to exam execution of individual processes. However: they change program execution times significantly cannot capture relevant aspects of parallel computations such as timing of events CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 21 / 23

Review Load Balancing static dynamic centralized decentralized Termination Detection Debugging CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 22 / 23

Next Class OpenMP vs/with MPI CPD (DEI / IST) Parallel and Distributed Computing 17 2014-11-13 23 / 23