Key Words: Dynamic Load Balancing, and Distributed System

DYNAMIC ROTATING LOAD BALANCING ALGORITHM IN DISTRIBUTED SYSTEMS ROSE SULEIMAN AL DAHOUD ALI ISSA OTOUM Al-Zaytoonah University Al-Zaytoonah University Neelain University rosesuleiman@yahoo.com aldahoud@alzaytoonah.edu.jo issaotoum2k@yahoo.com ABSTRACT Load Balancing in a distributed system is an important process to reduce delays and improve response times in order to speed up applications and results. Different approaches to Load Balancing have different advantages and disadvantages. Classical approaches to load balancing are quite good and mostly efficient, but in many circumstances, the overheads incurred from load balancing are too high and therefore become ineffective. Dynamic Rotating Load Balancing Algorithm in Distributed Systems is proposed in this paper. This new algorithm has much lower overheads and faster response times when compared to the classical approaches, as shown in the data obtained from the simulations done to test this approach. It is also scalable and efficient regardless of the size of the network used. INTRODUCTION This paper is a study about dynamic load balancing in a distributed system. It compares the various approaches to Load Balancing in distributed system, with focus on dynamic approaches in general; some processes are discussed in detail to clarify issues concerning factors contributing to load and the mechanisms used in any balancing process. The paper explains the new algorithm and how it is applied, with discussion of the simulation process used to compare the classical approach to the new- proposed- approach. Also it gives the results and conclusions drawn from the study and further proposals. The tables and diagrams at the end of the study give comparisons between the various approaches and the results of the simulations done as part of the study, with references outlined after that. Key Words: Dynamic Load Balancing, and Distributed System 1- DYNAMIC LOAD DISTRIBUTION 1-1.Load distribution Load distribution seeks to improve the performance of a distributed system, usually in terms of response time or resource availability, by allocating workload amongst a set of cooperating hosts. This division of system load can take place statically or dynamically: 1-2. Dynamic load distribution Dynamic load distribution is designed to overcome the problems of unknown or un-characterizable workloads, non-pervasive scheduling and runtime variation (any situation where the availability of hosts, the composition of the workload or the interaction of human beings can alter resource requirements or availability). Dynamic load distribution systems typically monitor the workload and hosts for any factors that may affect the choice of the most appropriate assignment and distribute jobs accordingly. This very difference between static and dynamic forms of load distribution is the source of the power and interest in dynamic load distribution. The objectives of this thesis lie entirely within the domain of dynamic load balancing. For brevity, I will take the more general term of load distribution to stipulate only the dynamic form. [1, 2, 7, 11] The Degree of Load Distribution Load Sharing: This is the coarsest form of load distribution. Load may only be placed on idle hosts, and can be viewed as binary, where a host is either idle or busy.

Load Balancing: Where load sharing is the coarsest form of load distribution, load balancing is the finest. Load balancing attempts to ensure that the workload on each host is within a small degree (or balance criterion) of the workload present on every other host in the system. Load Leveling: Load leveling occupies the ground between the two extremes of load sharing and load balancing. Rather than trying to obtain a strictly even distribution of load across all hosts, or simply utilizing idle hosts, load leveling seeks to avoid congestion on any one host. Other schemes such as, MOSIX, which could be considered load balancing systems, are in fact load leveling, as the balancing phase occurs periodically. 1-3. Previous Load Distribution Taxonomies There are numerous existing taxonomies available for the classification of load distribution, including Wang and Morris, Casavant and Kuhl and Jacqmot and Milgrom. 2- THE PROPOSED NEW ALGORITHM: A network is made up of nodes connected together in a certain configuration; the configuration will not matter for our purposes here. The nodes are arranged logically from 1 to n; where n is the total number of nodes. We can view the network, regardless of its size, as consisting of adjacent pairs of nodes or triplets, the nodes that are nearest each other are considered as the pair. This breaks up the large network into a number of small networks, the Load Balancing can then be done within the small networks (consisting of only two or three nodes). This grouping has the effect of both reducing the number of messages exchanged and also the physical distances between nodes as they are chosen to be adjacent. This has the effect of reducing the overhead when Load Balancing is done and thus makes for a more efficient process with faster response times and quicker task achievement. To do the Load Balancing in a dynamic manner, we need to set the criteria for the start of Load Balancing Process, and also to have a mechanism for changing the configuration of the network each time load balancing is done. 2-1. Criteria for Load Balancing: 1) When a certain number of tasks (queue length) at the node is reached- i.e. a threshold. 2) Periodically. 2-2. Selection of nodes: Any number of nodes can make up a group, but limiting the size to only 2 or 3 gives us the advantages of low overhead, reduction of job thrashing as well as other advantages; like scalability, robustness, stability and efficiency. The selection of nodes, for 3 nodes per cluster, or for 2 nodes per cluster can be done using the code: The code for the process, shown here in C++: Cycle =1 While (true) For (i=0;i< groups ; i++) For (j=0;j< clustersize ; j++) X=i*clustersize+j+cycle; If (x>nodes) C[i+1][j+1] = x % Nodes Else C[i+1][j+1] = x; The Load Balancing Code is placed in this area If (++cycle>nodes) cycle=1; In this configuration, the load over the groups consisting of nodes I;j;k : NetLoad = int ((load I+ load j)/2), for 2 node groups NetLoad = int ((load I+ load j+ load k)/3), for 3 node groups

2-3.Load Balancing steps: Step 1: Calculation of local load: We need to calculate local load ( i.e. load at each node): Factors of load: Load (L) is directly proportional to : Average queue length(q( avg )). Response time(t resp ( avg )) Average waiting time(t w ( avg )) Load (L) is inversely proportional to : Number of nodes, n. Mathematically: L α (Q( avg )) * (t resp ( avg ))* (t w ( avg )) / n Multiplying by a constant (c) makes this an equation: L = (Q( a )) * (t resp ( a ))* (t w ( a ))*c / n (t resp )= t release t arrival (t w )= t seize t arrival (t resp ( avg ))=the sum of (t resp )/ number of jobs= sum( t release t arrival )/ n j (t w ( avg ))=sum((t w )/number of jobs=sum(t seize t arrival )/ n j (Q( avg )) of node n= the sum of jobs at time (m) The constants, obtained from experimental values as shown in previous studies (Zhou, Kara; 1994): * tasks arrive at nodes in a Poisson distribution manner. * task size follows exponential distribution. time to send message time to receive message time for job sending time for job receiving time to send result time to receive result 0.00001 s (10 ms) 0.00001 s (10 ms) 0.00005 s (50 ms) 0.00005 s (50 ms) 0.00001 s (10 ms) 0.00001 s (10 ms) Step 2: Calculation of total load: 1.For central load balancing approach: Check load of all nodes and distribute load accordingly. 2.For Distributed load balancing approach: Check tables of loads and distribute load accordingly. 3.For rotating algorithm: Loads are calculated locally and the total load for each cluster is added and averaged; if threshold is exceeded then load is distributed by transfer of jobs to neighboring nodes in each cluster then the rotation of nodes is done. Step 3: Trigger of load balancing: 2 mechanisms: 1) Threshold. 2) Periodically. Step 4: Response Times Response times were calculated for different number of nodes by the simulation code for the various approaches of balancing Diagram illustrating concept of Rotational approach

Considerations: To Compare: 1) Centralized approach. 2) Distributed approach. 3) New algorithm. We have to consider the following issues first: The different approaches have different overheads depending on various aspects of the system. They each have advantages and disadvantages. To have a fair comparison with no bias would be quite difficult as the various aspects of the overhead and other delay factors will not be constant for each approach, but if we try to keep all the aspects of the simulations used to do the comparisons constant, apart from those that are inherently different, then the bias will be kept to a minimum. Some variables may be applicable only to some approaches and not in others. Results of the simulation: A simulation was constructed to do the following: 1) Perform as a network with multiple processors- i.e. consisting of n nodes. this n can be varied for the purpose of the study. 2) A set task was given equally to all approaches; the task was divided into smaller tasks in exponential distribution manner and then distributed to the nodes set up in the simulation. 3) The process of load balancing was done for each approach of simulation. 4) The results obtained were plotted as response times vs. number of nodes for all the simulation processes to compare the prospective response times. Mathematical Considerations: Arrival rate = (load * Number of processors) (Required number of processors *Average execution time) The Load is directly proportional to: Arrival rate Mean service time ( = Total service time / number of tasks ) The Load is inversely proportional to number of processors So, mathematical expression to describe Load: Load = Arrival rate * mean service time * (1/number of processors) * constant

The tasks were split into jobs by the exponential distribution method. The results obtained are given in the following graphs : Graph1: Rotational-2nodes, Central, Distributed & No Balancing Graph2: Rotational-3nodes, Central, Distributed & No Balancing

Graph3: Rotational-2nodes, Rotational-3nodes & No Balancing Graph4: Rotational-2nodes, Rotational-3nodes,Central, Distributed& No Balancing ANALYSIS OF RESULTS: The results show the following observations: 1) Plotting the results taken from the simulation for the average response times vs. the number of nodes for No Load Balancing, Central approach, Distributed approach, Rotational approach for 2 nodes per cluster we see that : The response times are better for Rotational 2 which is better than central approach which is better than distributed approach, all better than no Load Balancing.[Graph1] 2) Plotting the results taken from the simulation for the average response times vs. the number of nodes for No Load Balancing, Central approach, Distributed approach, Rotational approach for 3 nodes per cluster we see that : The response times are better for Rotational 3 which is better than central approach which is better than distributed approach, all better than no Load Balancing.[Graph2]

3) Plotting the results taken from the simulation for the average response times vs. the number of nodes for Rotational approach for 3 nodes per cluster & Rotational approach for 2 nodes per cluster we see that : The response times are better for Rotational 3 than Rotational 2, all better than no Load Balancing.[Graph3] 4) To see the effect of increasing the number of nodes and how the behavior changes, Graph 7 shows that as the number of nodes increase, the average response times start increasing after a certain point for the distributed approach and the central approach until eventually they reach a point where no balancing is better. While the rotational approach continues to give better response times regardless of the number of nodes used.[graph4] Conclusions: 1) From the analysis of results we find that the simulations done show that the proposed new algorithm has much lower overheads and faster response times when compared to the classical approaches, as shown in the data obtained from the simulations done to test this approach. 2) It is also scalable and efficient regardless of the size of the network used: as we see that when number of nodes increases it still gives good response times, which is not found in other approaches. 3) Using this approach gives better results because there are virtually no overheads compared to classical approaches. 4) Using clusters of 3 nodes gives better performance than 2 nodes per cluster, this is related to the fact that 3 nodes are better at handling the local load than 2 nodes because of simple mathematical rules involved. References 1) Al_Dahoud Ali, Giacomo Gioffi,. Ramtha : The manager algorithm for Dynamic Load Balancing in Distributed Systems. 2) Al_Dahoud, A. Dynamic Load Balancing in Distributed Systems, Ph.D. Thesis, National Technical University of Ukraine, Kiev, Ukraine, 1996. 3) Andrews. Tanenbaum, Modern Operating Systems, Prentice _ Hall International, Inc, Newjersy, 1992. 4) Allan, R., Michael, L. and Miron, L. Condor Technical Summary, Computer Sciences Department, University of Wisconsin-Madison, 1991. 5) Anna, H. Load Balancing in Distributed Systems: A Summary, Performance Evaluation Review, Vol. 16 #2-4 February 1989, pp. 17-19. 6) Silberschatz A., Galvin. Peter B., Operating System Concepts, Fifth Edition, John Wiley & Sons, Inc., 1999. 7) SANDEEP GUPTA, Dynamic Load Balancing, project website, internet site. Page opened, April 2003. 8) Averill, L. and Kelton W. Simulation Modeling And Analysis, McGraw_HiII, New York, USA, 1991. 9) Bruce, L., Hosseini S., Vairavan K. and Gregory, W. Performance Characteristics ora Load Balancing Algorithm, Journal of Parallel and Distributed Computing, 31, 1995, pp. 159-165. 10) Chi-Chung, H. and Samuel, C. Allocating Task Interaction Graph to Processors in Heterogeneous Networks, IEEE Transactions on Parallel and Distributed Systems, vol. 8, No.9, September 1997, pp. 908-925. 11) Chi-Chung, H. And Samuel, C. Hydrodynamic Load Balancing, IEEE Transactions on Parallel and Distributed Systems, Vol. 10, No.11, November 1999, pp. 1118-1137. 12) Chi-Chung, H. And Samuel, C. Theoretical Analysis of the Heterogeneous Dynamic Load Balancing Problem Using a Hydrodynamic Approach, Department of Computer Science, The Hong Kong University of Science and Technology, Technical Report, March 5, 1996.

13) Chi-Chung, H. And Samuel, C. Theoretical Analysis of the Heterogeneous Dynamic Load Balancing Problem Using a Hydrodynamic Approach, Journal of Parallel and Distributed Computing, 43, 1997, pp. 139-146. 14) Dasgupta, P., Majumder A. and Rhattacharya, P. V_THR : An Adaptive Load Balancing Algorithm, Journal of Parallel And Distributed Computing, 42, 1997, pp. 101-108. 15) Douglas, T., Jim, B. and Miron, L. Condor, Computer sciences Department, University of Wisconsin, 2001. 16) Flavio, R. and Anurag, K. Adaptive Optimal Load Balancing in a Non-homogeneous Multiserver System with a Central Job Scheduler, IEEE Transactions on Computers, vol. 39, No.10, October 1990. 17) Herbert, K. and Andreas, W. Comparison of Dynamic Load Balancing Strategies, Lehrstuhl fur Informatik IL 1990. 18) Jerrell, W. and Stephen, T. A Practical Approach to Dynamic Load Balancing, IEEE Transactions on Parallel and Distributed Systems, Vol. 9, No.3, March 1998, pp. 235-248. 19) Jie, L. and Hisao, K. Optimal Static Load Balancing in Network Configurations With Two- Way Traffic, Journal of Parallel And Distributed Computing, 23, 1994, pp. 364-375. 20) Kai, H. and DeGroot, D. Parallel Processing for Supercomputers and Artificial Intelligence, McGraw-Hill Publishing Company, USA,1989. 21) Mohammed, Z., wei, L. and Srinivasan, P. Customized Dynamic Load Balancing for a Network of Workstations, Journalof Parallel And Distributed Computing, 43, 1997, pp. 156-162. 22) Mourad, K. A Global Plan Policy for Coherent Cooperation in Distributed Dynamic Load Balancing Algorithms, University of Leeds -School of Computer Studies, Research Report Series, Report 94.21, July 1994. 23) Mourad, K. Using Dynamic Load Balancing in Distributed Information Systems, University of Leeds -School of Computer Studies, Research Report Series, Report 94.18, May 1994. 24) Tanenbaurn, A. Distributed Operating Systems, Prentice-Hall International, New Jersey, USA, 1995. 25) Xiaotie, D., Hai-Ning, L., Junsheng, L. And Ring, X. Competitive Analysis of Network Load Balancing, Journal of Parallel And Distributed Computing, 40, 1997, pp. 162-172. 26) Chao-Ju, H. And Kang, S. Implementation of Decentralized Load Sharing in Networked Workstations Using the Condor Package, Journal of Parallel And Distributed Computing,40, 1997, pp. 173-184.