In Proceedings of Performance Tools 98 Lecture Notes in Computer Science, Vol. 1468, pp. 231-242, September 1998



Similar documents
How To Choose Between The Two Of The Four Task Assignment Policies In A Distributed Server System

SIMULATION OF LOAD BALANCING ALGORITHMS: A Comparative Study

Optimization for QoS on Web-Service-Based Systems with Tasks Deadlines 1

Students will benefit from pencils with erasers, if possible since revisions are part of learning.

Analyzing Mission Critical Voice over IP Networks. Michael Todd Gardner

Scheduling. Yücel Saygın. These slides are based on your text book and on the slides prepared by Andrew S. Tanenbaum


Payment minimization and Error-tolerant Resource Allocation for Cloud System Using equally spread current execution load

Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations

Operating Systems. III. Scheduling.

Lecture Outline Overview of real-time scheduling algorithms Outline relative strengths, weaknesses

How To Use A Cloud For A Local Cluster

Process Scheduling CS 241. February 24, Copyright University of Illinois CS 241 Staff

IMPROVED LOAD BALANCING MODEL BASED ON PARTITIONING IN CLOUD COMPUTING

Chapter 2: Linear Equations and Inequalities Lecture notes Math 1010

Internet measurements at complexnetworks.fr

How To Model A System

Load Balancing. Load Balancing 1 / 24

A Job Self-Scheduling Policy for HPC Infrastructures

Sums of Independent Random Variables

A Model of Optimum Tariff in Vehicle Fleet Insurance

IP-COM Automatic Call Distributor (ACD)

Comparative Analysis of Load Balancing Algorithms in Cloud Computing

NOVEL PRIORITISED EGPRS MEDIUM ACCESS REGIME FOR REDUCED FILE TRANSFER DELAY DURING CONGESTED PERIODS

A Task-Based Adaptive-TTL approach for Web Server Load Balancing *

Load Balancing Techniques

Elevator Simulation and Scheduling: Automated Guided Vehicles in a Hospital

Road Map. Scheduling. Types of Scheduling. Scheduling. CPU Scheduling. Job Scheduling. Dickinson College Computer Science 354 Spring 2010.

Systems Modelling and Simulation (Lab session 3)

Fair load-balance on parallel systems for QoS 1

Justifying Simulation. Why use simulation? Accurate Depiction of Reality. Insightful system evaluations

An Approach to Load Balancing In Cloud Computing

Data Science at U of U

A Practical Guide to Trade Policy Analysis.

SCHEDULING IN CLOUD COMPUTING

Solving Exponential Equations

Deciding which process to run. (Deciding which thread to run) Deciding how long the chosen process can run

Multiobjective Multicast Routing Algorithm

MA 1125 Lecture 14 - Expected Values. Friday, February 28, Objectives: Introduce expected values.

Waiting Times Chapter 7

VOIP TRAFFIC SHAPING ANALYSES IN METROPOLITAN AREA NETWORKS. Rossitza Goleva, Mariya Goleva, Dimitar Atamian, Tashko Nikolov, Kostadin Golev

Basic Multiplexing models. Computer Networks - Vassilis Tsaoussidis

Teo Chung Piaw National University of Singapore. Copyright 2010, SAS Institute Inc. All rights reserved.

Efficient DNS based Load Balancing for Bursty Web Application Traffic

Financial Time Series Analysis (FTSA) Lecture 1: Introduction

Observingtheeffectof TCP congestion controlon networktraffic

RESOURCE ALLOCATION FOR INTERACTIVE TRAFFIC CLASS OVER GPRS

A Load Balancing Model Based on Cloud Partitioning for the Public Cloud

International Journal of Innovative Research in Science, Engineering and Technology Vol. 2, Issue 5, May 2013

International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March ISSN

Webpage: Volume 3, Issue XI, Nov ISSN

Graph models for the Web and the Internet. Elias Koutsoupias University of Athens and UCLA. Crete, July 2003

Important Notice: In case of divergent interpretation, the original Turkish text shall prevail.

Example 1: Dear Abby. Stat Camp for the Full-time MBA Program

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 4, July-Aug 2014

Is Truck Queuing Productive? Study of truck & shovel operations productivity using simulation platform MineDES

COMMUTATION INSTRUCTIONS

Modeling and Simulation of Queuing Scheduling Disciplines on Packet Delivery for Next Generation Internet Streaming Applications

Sample Solutions for Assignment 2.

Prof. Olivier de Weck

Load Balancing & Termination

OPCFailover in Action

LBPerf: An Open Toolkit to Empirically Evaluate the Quality of Service of Middleware Load Balancing Services

Pr(X = x) = f(x) = λe λx

Fruit Machine. Level. Activity Checklist Follow these INSTRUCTIONS one by one. Test Your Project Click on the green flag to TEST your code

A New Extension of the Exponential Distribution

Takeover Suggestion A Registrar Redundancy Handling Optimization. for Reliable Server Pooling Systems

System Architecture Performance Modeling with SystemC

Chapter 7 - Roots, Radicals, and Complex Numbers

Analysis of Delayed Reservation Scheme in Server-based QoS Management Network

A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster

A Multiple Access Protocol for Multimedia Transmission over Wireless Networks

INSURANCE RISK THEORY (Problems)

CloudSimDisk: Energy-Aware Storage Simulation in CloudSim

Comparative Analysis of Congestion Control Algorithms Using ns-2

TRAFFIC ENGINEERING OF DISTRIBUTED CALL CENTERS: NOT AS STRAIGHT FORWARD AS IT MAY SEEM. M. J. Fischer D. A. Garbin A. Gharakhanian D. M.

Real Time Bus Monitoring System by Sharing the Location Using Google Cloud Server Messaging

Modeling and Simulation of a Pacing Engine for Proactive Campaigns in Contact Center Environment

ARTIFICIAL NEURAL NETWORKS FOR ADAPTIVE MANAGEMENT TRAFFIC LIGHT OBJECTS AT THE INTERSECTION

A Hybrid Load Balancing Policy underlying Cloud Computing Environment

Optimizing Contact Center Performance with Operational Reports Document Version: 1.0 October 2015

IST 301. Class Exercise: Simulating Business Processes

Advanced Operating Systems CS428

Call Center Statistics Web Monitor. Call Center Statistics Web Monitor

REAL TIME OPERATING SYSTEMS. Lesson-18:

UNITY CALL CENTER REPORTING. Version 1.0

Automatic Call Distribution For Business and Call Centers

Operating System: Scheduling

Introduction. The Supplement shows results of locking using a range of smoothing parameters α, and checkerboard tests.

Performance Evaluation of Dynamic Bandwidth Allocation Algorithm for XG-PON Using Traffic Monitoring and Intra- Scheduling

A Load Balancing Method in SiCo Hierarchical DHT-based P2P Network

AN EFFICIENT DISTRIBUTED CONTROL LAW FOR LOAD BALANCING IN CONTENT DELIVERY NETWORKS

Operating Systems, 6 th ed. Test Bank Chapter 7

Modelling Emergence of Money

12.6 Logarithmic and Exponential Equations PREPARING FOR THIS SECTION Before getting started, review the following:

Feature Highlights: Automatic Call Distribution (ACD) CTI with E1, T1, PSTN, & IP. Support SS7 Signaling. Screen Popup. Find Last Agent.

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

Chapter 1: Order of Operations, Fractions & Percents

LivePerson Chat Reports

Betting with the Kelly Criterion

Transcription:

In Proceedings of Performance Tools 98 OnChoosingaTaskAssignmentPolicyfora Lecture Notes in Computer Science, Vol. 1468, pp. 231-242, September 1998 MorHarchol-Balter?;1,MarkE.Crovella??;2,andCristinaD.Murta???;2 DistributedServerSystem 2DepartmentofComputerScience,BostonUniversity 1LaboratoryforComputerScience,MIT harchol@theory.lcs.mit.edu Abstract.Weconsideradistributedserversystemmodelandaskwhich fcrovella,murtag@bu.edu policiescommonlyproposedforsuchdistributedserversystems:round- vicedemandisknowninadvance.weconsiderfourtaskassignment hostprocessestasksinfirst-come-first-serveorderandthetask'sser- Robin,Random,Size-Based,inwhichalltaskswithinagivesizerange policyshouldbeusedforassigningtaskstohosts.inourmodeleach areassignedtoaparticularhost,anddynamic-least-work-remaining, criticallyonthevariabilityinthetasksizedistribution.inparticular decisionofwhichtaskassignmentpolicyisbest.wendthatnooneof inwhichataskisassignedtothehostwiththeleastoutstandingwork. wendthatwhenthetasksizesarenothighlyvariable,thedynamic theabovetaskassignmentpoliciesisbestandthattheanswerdepends Ourgoalistounderstandtheinuenceoftasksizevariabilityonthe policyispreferable.howeverwhentasksizesshowthedegreeofvariabilitymorecharacteristicofempiricallymeasuredcomputerworkloads,the Size-Basedpolicyisthebestchoice.Weusetheresultingobservations performthedynamicpolicybyalmost2ordersofmagnitudeandcan outperformothertaskassignmentpoliciesbymanyordersofmagnitude, toargueinfavorofaspecicsize-basedpolicy,sita-e,thatcanout- 1Introduction underarealistictasksizedistribution. ofthistrendincludedistributedwebservers,distributeddatabaseservers,and distributeddesignsbecauseoftheirscalabilityandcost-eectiveness.examples Tobuildhigh-capacityserversystems,developersareincreasinglyturningto highperformancecomputingclusters.insuchasystem,requestsforservicearrive assigningtaskstohostmachinesisknownasthetaskassignmentpolicy. andmustbeassignedtooneofthehostmachinesforprocessing.therulefor???supportedbyagrantfromcapes,brazil.permanentaddress:depto.deinformatica,universidadefederaldoparana,curitiba,pr81531,brazil.??supportedinpartbynsfgrantsccr-9501822andccr-9706685.?supportedbythensfpostdoctoralfellowshipinthemathematicalsciences.

systeminwhicheachincomingtaskisimmediatelyassignedtoahostmachine, andeachhostmachineprocessesitsassignedtasksinrst-come-rst-served (FCFS)order.Wealsoassumethatthetask'sservicedemandisknowninadvance.Ourmotivationforconsideringthismodelisthatitisanabstractionof Inthispaperweconcentrateontheparticularmodelofadistributedserver someexistingdistributedservers,describedinsection3. tributedserversystems:round-robin,inwhichtasksareassignedtohostsina cyclicalfashion;random,inwhicheachtaskisassignedtoeachhostwithequal probability;size-based,inwhichalltaskswithinacertainsizerangearesent toanindividualhost;anddynamic(alsoknownasleast-work-remaining)in Weconsiderfourtaskassignmentpoliciescommonlyproposedforsuchdis- whichanincomingtaskisassignedtothehostwiththeleastamountofoutstandingworklefttodo(basedonthesumofthesizesofthosetasksinthe queue). ofwhichtaskassignmentpolicyisbest.wearemotivatedinthisrespectby inmanymeasurementsofcomputerworkloads.inparticular,measurementsof theincreasingevidenceforhighvariabilityintasksizedistributions,witnessed manycomputerworkloadshavebeenshowntotaheavy-taileddistributions Ourgoalistostudytheinuenceoftasksizevariabilityonthedecision analysisoranalyticapproximations.weshowthatthevariabilityofthetasksize withveryhighvariance,asdescribedinsection2.2. distributionmakesacrucialdierenceinchoosingataskassignmentpolicy,and thatworkswellunderconditionsofhightasksizevariance. weusetheresultingobservationstoargueforaspecictaskassignmentpolicy Incomparingtaskassignmentpolicies,wemakeuseofsimulationsandalso 2BackgroundandPreviousWork ied,butmanybasicquestionsremainopen.inthecasewheretasksizesare 2.1FundamentalResultsinTaskAssignment Theproblemoftaskassignmentinamodellikeourshasbeenextensivelystudtion,theoptimalityofShortest-Linetaskassignmentpolicy(sendthetaskto unknown,thefollowingresultsexist:underanexponentialtasksizedistribu- thehostwiththeshortestqueue)wasprovenbywinston[14]andextended byweber[12]toincludetasksizedistributionswithnondecreasingfailurerate. TheactualperformanceoftheShortest-Linepolicyisnotknownexactly,butis approximatedbynelsonandphillips[9].infactasthevariabilityofthetask alentoptimalityandperformanceresultshavenotbeendevelopedforthetask sizedistributiongrows,theshortest-linepolicyisnolongeroptimal,whitt[13]. assignmentproblem,tothebestofourknowledge.forthescenarioinwhichthe agesofthetaskscurrentlyservingareknown,weber[12]hasshownthatthe Shortest-Expected-Delayruleisoptimalfortasksizedistributionswithincreasingfailurerate,andWhitt[13]hasshownthatthereexisttasksizedistributions Inthecasewheretheindividualtasksizesareknown,asinourmodel,equiv- forwhichtheshortest-expected-delayruleisnotoptimal.

1 Distribution of process lifetimes (log plot) (fraction of processes with duration > T) 1/2 1/4 1/8 Fig.1.MeasureddistributionofUNIXprocessCPUlifetimes,from[5].Dataindicates 1/16 fractionofjobswhosecpusevicedemandsexceedtseconds,asafunctionoft. 1/32 1/64 2.2Measurementsoftasksizedistributionsincomputer applications 1 2 4 8 16 32 64 Duration (T secs.) Manyapplicationenvironmentsshowamixtureoftasksizesspanningmanyordersofmagnitude.Insuchenvironmentstherearetypicallymanysmalltasks, andfewerlargetasks.muchpreviousworkhasusedtheexponentialdistribution poormodelandthataheavy-taileddistributionismoreaccurate.ingenerala heavy-taileddistributionisoneforwhichprfx>xgx?;where0<<2. surementsindicatethatformanyapplicationstheexponentialdistributionisa tocapturethisvariability,asdescribedinsection2.1.however,recentmea- 1.Decreasingfailurerate:Inparticular,thelongerataskhasrun,thelonger Tasksizesfollowingaheavy-taileddistributionshowthefollowingproperties: 3.Thepropertythataverysmallfraction(<1%)oftheverylargesttasks 2.Innitevariance(andif1,innitemean). makeupalargefraction(half)oftheload.wewillrefertothisimportant itisexpectedtocontinuerunning. Thelowertheparameter,themorevariablethedistribution,andthemore tasksthatcomprisehalftheload. pronouncedistheheavy-tailedproperty,i.e.thesmallerthefactionoflarge propertythroughoutthepaperastheheavy-tailedproperty. measureddistributionofcpurequirementsofoveramillionunixprocesses, takenfrompaper[5].thisdistributioncloselytsthecurve Asaconcreteexample,Figure1depictsgraphicallyonalog-logplotthe ronments,includinginstructional,reasearch,andadministrativeenvironments. In[5]itisshownthatthisdistributionispresentinavarietyofcomputingenvi- PrfProcessLifetime>Tg=1=T:

Infact,heavy-taileddistributionsappeartotmanyrecentmeasurementsof computingsystems.theseinclude,forexample: {UnixprocessCPUrequirements,measuredatUCBerkeley:1[5]. {UnixprocessCPUrequirementsmeasuredatBellcore:11:25[8]. {SizesoflesstoredinUnixlesystems:[7]. {SizesoflestransferredthroughtheWeb:1:11:3[1,3]. {I/Otimes:[11]. servicerequirements. typicallytendstobecloseto1,whichrepresentsveryhighvariabilityintask Inmostofthesecaseswhereestimatesofweremade,12.Infact, {SizesofFTPtransfersintheInternet::91:1[10]. 3ModelandProblemFormulation Weareconcernedwiththefollowingmodelofadistributedserver.Theserver iscomposedofhhosts,eachwithequalprocessingpower.tasksarrivetothe systemaccordingtoapoissonprocesswithrate.whenataskarrivestothe system,itisinspectedbyadispatcherfacilitywhichassignsittooneofthe tasks. hostsforservice.weassumethedispatcherfacilityknowsthesizeofthetask. ThetasksassignedtoeachhostareservedinFCFSorder,andtasksarenot preemptible.weassumethatprocessingpoweristheonlyresourceusedby ontheirjob'sprocessingdemand.ifthejobexceedsthatdemand,itiskilled. batchdistributedcomputingfacilityatmit'slaboratoryforcomputerscience. Thexolasfacilityhasadispatcherfrontendwhichassignseachjobtooneof Xolasconsistsof4identicalmultiprocessorhosts.Usersspecifyanupperbound Theabovemodelforadistributedserverwasinitiallyinspiredbythexolas thehostsforservice.theuserisgivenanupperboundonthetimetheirjob willhavetowaitinthequeue,basedonthesumofthesizesofthejobsinthat queue.thejobsqueuedateachhostareeachruntocompletioninfcfsorder. terizedbythreeparameters:,theexponentofthepowerlaw;k,thesmallest sult,wemodeltasksizesusingadistributionthatfollowsapowerlaw,buthas possibleobservation;andp,thelargestpossibleobservation.theprobability anupperbound.werefertothisdistributionasaboundedpareto.itischarac- Weassumethattasksizesshowsomemaximum(butlarge)value.Asare- massfunctionfortheboundedparetob(k;p;)isdenedas: f(x)= 1?(k=p)x??1kxp: k andvaryovertherange0to2inordertoobservetheeectofchanging ThroughoutthispaperwemodeltasksizesusingaB(k;p;)distribution, (1) variabilityofthedistribution.tofocusontheeectofchangingvariance,we keepthedistributionalmeanxed(at3000)andthemaximumvaluexed(at

10 14 Second Moment of Bounded Pareto Distribution 10 power law f(x) w/ exponent α 1 10 12 10 11 Fig.2.ParametersoftheBoundedParetoDistribution(left);SecondMomentof 10 10 B(k;1010;)asafunctionof,whenEfXg=3000(right). 9 Numberofhosts Systemload 8 Meanservicetime h=8. =:8. 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 k p Taskarrivalprocess EfXg=3000timeunits Poissonwithrate=1=EfXgh=:0021 Maximumtaskservicetime parameter Minimumtaskservicetime p=1010timeunits chosensothatmeantaskservicetimestays 0<2 constantasvaries(0<k1500) tasks/unittime p=1010).inordertokeepthemeanconstant,weadjustkslightlyaschanges Table1.Parametersusedinevaluatingtaskassignmentpolicies thisdistributionwillstillshowveryhighvariabilityifkp.forexample, (0<k1500).TheaboveparametersaresummarizedinTable1. isnotaheavy-taileddistributioninthesensewehavedenedabove.however, Figure2(right)showsthesecondmomentEX2 functionofforp=1010,wherekischosentokeepefxgconstantat3000,(0< NotethattheBoundedParetodistributionhasallitsmomentsnite.Thus,it declines.furthermore,theboundedparetodistributionalsostillexhibitsthe k1500).thegureshowsthatthesecondmomentexplodesexponentiallyas heavy-tailedpropertyand(tosomeextent)thedecreasingfailurerateproperty ofthisdistributionasa oftheunboundedparetodistribution. thebesttaskassignmentpolicy.thefollowingfourarecommonchoices: Random:anincomingtaskissenttohostiwithprobability1=h.Thispolicy Giventheabovemodelofadistributedserversystem,weaskhowtoselect Round-Robin:tasksareassignedtohostsincyclicalfashionwiththeith taskbeingassignedtohostimodh.thispolicyalsoequalizestheexpected equalizestheexpectednumberoftasksateachhost.

Size-Based:Eachhostservestaskswhoseservicedemandfallsinadesignated largetasks. range.thispolicyattemptstokeepsmalltasksfromgetting\stuck"behind timesthanrandom. numberoftasksateachhost,andtypicallyhaslessvariabilityininterarrival Dynamic:Eachincomingtaskisassignedtothehostwiththesmallestamount ofoutstandingwork,whichisthesumofthesizesofthetasksinthehost's queueplustheworkremainingonthattaskcurrentlybeingserved.this tasksizes.theeectivenessofthesetaskassignmentschemeswillbemeasured Inthispaperwecomparethesepoliciesasafunctionofthevariabilityof systemstandpointattemptstoachieveinstantaneousloadbalance. policyisoptimalfromthestandpointofanindividualtask,andfroma 3.1ANewSize-BasedTaskAssignmentPolicy:SITA-E intermsofmeanwaitingtimeandmeanslowdown,whereatask'sslowdownis Beforedelvingintosimulationandanalyticresults,weneedtospecifyafew itswaitingtimedividedbyitsservicedemand.allmeansareper-taskaverages. moreparametersofthesize-basedpolicy. taskissenttotheappropriatehostbasedonitssize.inpracticethesizeranges fortasksofsizebetween15minutesand3hours,a6-hourqueue,a12-hour associatedwiththehostsareoftenchosensomewhatarbitrarily.theremightbe a15-minutequeuefortasksofsizebetween0and15minutes,a3-hourqueue Insize-basedtaskassignment,asizerangeisassociatedwitheachhostanda queueandan18-hourqueue,forexample.(thisexampleisusedinpracticeat thecornelltheorycenteribmsp2jobscheduler[6].) ment,whichwerefertoassita-e SizeIntervalTaskAssignmentwithEqual Load.Theideaissimple:denethesizerangeassociatedwitheachhostsuch thatthetotalwork(load)directedtoeachhostisthesame.themotivationfor doingthisisthatbalancingtheloadminimizesmeanwaitingtime. Inthispaperwechooseamoreformalalgorithmforsize-basedtaskassign- iseasytoobtainbymaintainingahistogram(inthedispatcherunit)ofalltask theexpectedworkdirectedtoeachhostisthesame.thetasksizedistribution sizeswitnessedoveraperiodoftime. thetasksizedistributiontodenethecutopoints(deningtheranges)sothat Themechanismforachievingbalancedexpectedloadatthehostsistouse ofhosts.thenwedetermine\cutopoints"xi,i=0:::hwherek=x0<x1< p(possiblyequaltoinnity)denotethelargesttasksize,andhbethenumber functionoftasksizeswithnitemeanm.letkdenotethesmallesttasksize, x2<:::<xh?1<xh=p,suchthat Moreprecisely,letF(x)=PrfXxgdenotethecumulativedistribution andassigntotheithhostalltasksranginginsizefromxi?1toxi. Zx1 x0=kxdf(x)=zx2 x1xdf(x)==zxh=p xh?1xdf(x)=mh=rpkxdf(x) h

10 8 Simulated mean waiting time 10 5 Simulated mean slowdown 10 7 Random. Round Robin 10 4 Random... Dynamic. Round Robin 10 6 SITA E 10 3... Dynamic SITA E mean.intheremainderofthepaperwewillalwaysassumethetasksizedistributionistheboundedparetodistribution,b(k;p;). SITA-Easdenedcanbeappliedtoanytasksizedistributionwithnite policiesviasimulation.simulationparametersareasshownintable1. 4SimulationResults InthissectionwecomparetheRandom,Round-Robin,SITA-E,andDynamic outsetwellbelowthetruemean;thetruemeanisn'tachieveduntilenoughlarge below[2].thisoccursbecausetherunningaverageoftasksizesistypicallyatthe tasksarrive.theconsequenceforasystemlikeourownisthatsimulationoutputs dicultbecausethesystemapproachessteadystateveryslowlyandusuallyfrom Simulatingaserversystemwithheavy-tailed,highlyvariableservicetimesis appearmoreoptimisticthantheywouldinsteady-state.tomakeoursimulation only.eachdatapointshowninourplotsistheaverageof400independentruns, measurementslesssensitivetothestartuptransient,werunoursimulationfor 4105arrivalsandthencapturedatafromthenextsinglearrivaltothesystem eachofwhichstartedfromanemptysystem. ability).asdescribedinsection2.2,valuesintherange1.0to1.3tendtobe commoninempiricalmeasurementsofcomputingsystems. Weconsidervaluesintherange1.1(highvariability)to1.9(lowervari- 10 2 10 5 Fig.3.MeanWaitingTime(a)andMeanSlowdown(b)underSimulationofFour 10 1 TaskAssignmentPoliciesasaFunctionof. (b) 10 4 10 0 10 3 Figure3showstheperformanceofthesystemforallfourpolicies,asafunctionof(notethelogarithmicscaleontheyaxis).Figure3(a)showsmean waitingtimeand3(b)showsmeanslowdown.belowwesimplysummarizethese 10 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 results;inthenextsection,wewilluseanalysistoexplaintheseresults.

androundrobinpoliciesissimilar,andthatbothcasesperformmuchmore poorlythantheothertwo(sita-eanddynamic).asdeclines,bothofthe performancemetricsundertherandomandround-robinpoliciesexplodeapproximatelyexponentially.thisgivesanindicationofthesevereimpactsthat Firstofall,observethattheperformanceofthesystemundertheRandom heavy-tailedworkloadscanhaveinsystemswithnaivetaskassignmentpolicies. namicisontheorderof100timesbetterforbothmetricswhencomparedto RandomandRoundRobin.Forlarge,thismeansthatDynamicperformsquite well withmeanslowdownlessthan1.howeverasthevariabilityintasksize increases(as!1),dynamicisunabletomaintaingoodperformance.ittoo TheDynamicpolicyshowsthebenetsofinstantaneousloadbalancing.Dy- suersfromroughlyexponentialexplosioninperformancemetricsasdeclines. undersita-eisrelativelyunchanged,withmeanslowdownalwaysbetween2 formancemetricsforrandom,roundrobin,anddynamicallexplode,sita-e's three.overtheentirerangeofvaluesstudied,theperformanceofthesystem and3.thisisthemoststrikingaspectofourdata:inarangeofinwhichper- Incontrast,thebehaviorofSITA-Eisquitedierentfromthatoftheother performanceremainsremarkablyinsensitivetoincreaseintasksizevariability. canbeontheorderof100timesbetterthanthatofdynamic. morecharacteristicofempiricalmeasurements(1:1),sita-e'sperformance mentexhibitsbetterperformance;butwhentasksizesshowthevariabilitythatis Asaresultwendthatwhentasksizeislessvariable,Dynamictaskassign- SITA-EbecomespreferabletoDynamicoveralargerrangeof. acrosstherangeoffrom1.1to1.9isdiculttounderstandusingthetoolsof In[4]wesimulatearangeofloads()andshowthatasloadincreases, simulationalone.forthatreasonthenextsectiondevelopsanalysisofsita-e andtheotherpolicies,andusesthatanalysistoexplainsita-e'sperformance. TheremarkableconsistencyofsystemperformanceundertheSITA-Epolicy Tounderstandthedierencesbetweentheperformanceofthefourtaskassignmentpolicies,weprovideafullanalysisoftheRound-Robin,Random,and 5AnalysisofTaskAssignmentPolicies formulabelowwhichanalyzesthem/g/1fcfsqueue: SITA-Epolicies,andanapproximationoftheDynamicpolicy. IntheanalysisbelowwewillrepeatedlymakeuseofthePollaczek-Kinchin wheredenotestherateofthearrivalprocess,xdenotestheservicetime EfWaitingTimeg=EX2 EfSlowdowng=EfW=Xg=EfWgEX?1 =2(1?) [Pollaczek-Kinchinformula] distribution,anddenotestheutilization(=efxg).theslowdownformulas followfromthefactthatwandxareindependentforafcfsqueue.

tailed,thesecondmomentoftheservicetimeexplodes,asshowninfigure2. RandomTaskAssignment.TheRandompolicysimplyperformsBernoullisplittingontheinputstream,withtheresultthateachhostbecomesanindependenerallypoorbecausethesecondmomentoftheB(k;p;)ishigh. metricsareproportionaltothesecondmomentofb(k;p;).performanceisgen- is,i=.sothepollaczek-kinchinformulaappliesdirectly,andallperformance M=B(k;p;)=1queue.Theloadattheithhost,isequaltothesystemload,that thesecondmomentoftheservicetime.recallthatiftheworkloadisheavy- ObservethateverymetricforthesimpleFCFSqueueisdependentonEX2, RoundRobin.TheRoundRobinpolicysplitstheincomingstreamsoeachhost formanceclosetotherandomcasesinceitstillseeshighvariabilityinservice seesaneh=b(k;p;)=1queue,withutilizationi=.thissystemhasper- SITA-E.TheSITA-EpolicyalsoperformsBernoullisplittingonthearrival times,whichdominatesperformance. stream(whichfollowsfromourassumptionthattasksizesareindependent).by thedenitionofsita-e,i=.howeverthetasksizesateachqueuearedeterminedbytheparticularvaluesoftheintervalcutos,fxig;i=0;:::;h.infact, hostiseesam=b(xi?1;xi;)=1queue.thereasonforthisisthatpartitioning oftheresultingregionstounitprobabilityyieldsanewsetofboundedpareto theboundedparetodistributionintocontiguousregionsandrenormalizingeach distributions.in[4]weshowhowtocalculatethesetofxisfortheb(k;p;) distribution,andwepresenttheresultingformulasthatprovidefullanalysisof thesystemunderthesita-epolicyforalltheperformancemetrics. knownapproximationsfortheperformancemetricsofthem/g/hqueue[15]: performedthesimulationstudy.however,in[4]weprovethatadistributed assignmentisactuallyequivalenttoanm/g/hqueue.fortunately,thereexist Dynamic.TheDynamicpolicyisnotanalyticallytractable,whichiswhywe systemofthetypeinthispaperwithhhostswhichperformsdynamictask wherexdenotestheservicetimedistributionandqdenotesthenumberin queue.what'simportanttoobservehereisthatthemeanqueuelength,and thereforethemeanwaitingtimeandmeanslowdown,areallproportionaltothe EQM=G=h =EQM=M=h EX2 =EfXg2; andround-robintaskassignmentpolicies. secondmomentoftheservicetimedistribution,aswasthecasefortherandom icyoverthewholerangeof.figure5againshowstheseanalytically-derived assignmentpoliciesoverarangeofvalues.figure4showstheanalyticallyderivedmeanwaitingtimeandmeanslowdownofthesystemundereachpol- metrics,butonlyovertherangeof12,whichistherangeofcorrespondingtomostempiricalmeasurementsofprocesslifetimesandlesizes(see Section2.2).(Notethat,becauseofslowsimulationconvergenceasdescribedat thebeginningofsection4,simulationvaluesaregenerallylowerthananalytic Usingtheaboveanalysiswecancomputetheperformanceoftheabovetask predictions;howeverallsimulationtrendsagreewithanalysis).

boththeseguresgrowsworseasdecreases,wheretheperformancecurves followthesameshapeasthesecondmomentoftheboundedparetodistribution,showninfigure2.thisisexpectedsincetheperformanceofrandomand Dynamicisdirectlyproportionaltothesecondmomentoftheservicetimedistribution.Bycontast,lookingatFigure5weseethatintherange1<<2, themeanwaitingtimeandespeciallymeanslowdownunderthesita-epolicy isremarkablyconstant,withmeanslowdownsaround3,whereasrandomand Dynamicexplodeinthisrange.TheinsensitivityofSITA-E'sperformanceto FirstobservethattheperformanceoftheRandomandDynamicpoliciesin inthisrangeisthemoststrikingpropertyofoursimulationsandanalysis. Analytically derived mean waiting time 10 10 Random 10 9 SITA E... Dynamic Approximation 10 8 Analytically derived mean slowdown 10 30 Random SITA E 10 25... Dynamic Approximation 10 20 10 7 10 15 10 6 0<2. Fig.4.Analysisofmeanwaitingtimeandmeanslowdownoverwholerangeof, 10 10 10 5 10 4 10 5 10 3 10 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 10 10 Analytically derived mean waiting time: Upclose 10 9 10 8 Random SITA E... Dynamic Approximation Analytically derived mean slowdown: Upclose 10 7 Random 10 6 SITA E... Dynamic Approximation 10 5 10 7 10 4 10 6 10 3 rangeof,12. Fig.5.Analysisofmeanwaitingtimeandmeanslowdownoverempiricallyrelevant 10 5 10 2 10 4 10 1 10 3 10 0 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2

adynamicpolicyexplodes?acarefulanalysisoftheperformanceofsita-eat eachqueueofthesystem(see[4])leadsustothefollowinganswers: 1.Bylimitingtherangeoftasksizesateachhost,SITA-Egreatlyreduces WhydoesSITA-Eperformsowellinaregionoftasksizevariabilitywherein 2.Whenloadisbalanced,themajorityoftasksareassignedtothelow-numbered thevarianceofthetasksizedistributionwitnessedbythelowered-numbered hosts,therebyimprovingperformanceatthesehosts.infacttheperformance atmosthostsissuperiortothatofanm/m/1queuewithutilization. 3.Furthermore,meanslowdownisimprovedbecausesmalltasksobserveproportionatelylowerwaitingtimes. hosts,whicharethehostswiththebestperformance.thisisintensiedby theheavy-tailedpropertywhichimpliesthatveryfewtasksareassignedto highnumberedhosts. systemperformanceeventuallydeterioratesbadly.thereasonisthatasoverallvariabilityintasksizesincreases,eventuallyevenhost1willwitnesshigh Forthecaseof1,showninFigure4,evenundertheSITA-Epolicy, overwhichsita-eshowsgoodperformance.forexample,whenthenumberof hostsis32,sita-e'sperformancedoesnotdeteriorateuntil:8. variability.furtheranalysis[4]indicatesthataddinghostscanextendtherange 6Conclusion Dynamic(sendingthetasktothehostwiththeleastremainingwork). inuenceswhichtaskassignmentpolicyisbestinadistributedsystem.we Inthispaperwehavestudiedhowthevariabilityofthetasksizedistribution onthevariabilityoftasksizedistribution.whenthetasksizesarenothighly considerfourpolicies:random,round-robin,sita-e(asize-basedpolicy),and variable,thedynamicpolicyispreferable.however,whentasksizesshowthe SITA-Eisbest. degreeofvariabilitymorecharacteristicofempiricalmeasurements(1), Wendthatthebestchoiceoftaskassignmentpolicydependscritically teristicofempiricalmeasurements,sita-eoutperformsdynamicbycloseto2 severalordersofmagnitude.andintherangeoftasksizevariabilitycharac- large:randomandround-robinareinferiortobothsita-eanddynamicby Themagnitudeofthedierenceinperformanceofthesepoliciescanbequite fourpoliciesgleanedfromouranalysis: theirperformanceisdirectlyproportionaltothesecondmomentofthetask OuranalysisoftheRandom,Round-RobinandDynamicpoliciesshowsthat Moreimportantthantheaboveresults,though,istheinsightsaboutthese sizedistribution,whichexplainswhytheirperformancedeterioratesasthetask sizevariabilityincreases.thus,eventhedynamicpolicy,whichcomesclosesto achievinginstantaneousloadbalanceanddirectseachtasktothehostwhere

itwaitstheleast,isnotcapableofcompensatingfortheeectofincreasing varianceinthetasksizedistribution. SITA-Epolicywhichisasimpleformalizationofsize-basedpolicies,denedto afullanalysisofthesita-epolicy,leadingtoa3-foldcharacterizationofits equalizetheexpectedloadateachhost.thisformalizationallowsustoobtain power:(i)bylimitingtherangeoftasksizesateachhost,sita-egreatlyreduces Tounderstandwhysize-basedpoliciesaresopowerful,weintroducethe aresenttothesubsetthehostshavingthebestperformance.(iii)meanslowdown thevariabilityofthetasksizedistributionwitnessedbyeachhost{thereby isimprovedbecausesmalltasksobserveproportionatelylowerwaitingtimes. These3propertiesallowSITA-Etoperformverywellinaregionoftasksize improvingtheperformanceatthehost.(ii)whenloadisbalanced,mosttasks variabilityinwhichthedynamicpolicybreaksdown. References 1.M.E.CrovellaandA.Bestavros.Self-similarityinWorldWideWebtrac:Evidenceandpossiblecauses.IEEE/ACMTransactionsonNetworking,5(6):835{846, 3.M.E.Crovella,M.S.Taqqu,andA.Bestavros.Heavy-tailedprobabilitydistributionsintheworldwideweb.InAPracticalGuideToHeavyTails,pages1{23. December1997. 2.M.E.CrovellaandL.Lipsky.Long-lastingtransientconditionsinsimulationswith heavy-tailedworkloads.in1997wintersimulationconference,1997. 4.M.Harchol-Balter,M.E.Crovella,andC.D.Murta.Onchoosingataskassignmentpolicyforadistributedserversystem.TechnicalReportMIT-LCS-TR-757, Chapman&Hall,NewYork,1998. 6.S.Hotovy,D.Schneider,andT.O'Donnell.Analysisoftheearlyworkloadonthe 5.M.Harchol-BalterandA.Downey.Exploitingprocesslifetimedistributionsfor CornellTheoryCenterIBMSP2.TechnicalReport96TR234,CTC,Jan.1996. dynamicloadbalancing.acmtransactionsoncomputersystems,15(3),1997. MITLaboratoryforComputerScience,1998. 7.G.Irlam.Unixlesizesurvey.http://www.base.com/gordoni/ufs93.html,1994. 8.W.E.LelandandT.J.Ott.Load-balancingheuristicsandprocessbehavior.In 10.V.PaxsonandS.Floyd.Wide-areatrac:ThefailureofPoissonmodeling. 9.R.D.NelsonandT.K.Philips.Anapproximationforthemeanresponsetimefor ProceedingsofPerformanceandACMSigmetrics,pages54{69,1986. 11.D.L.PetersonandD.B.Adams.FractalpatternsinDASDI/Otrac.InCMG IEEE/ACMTransactionsonNetworking,pages226{244,June1995. Evaluation,17:123{139,1998. shortestqueueroutingwithgeneralinterarrivalandservicetimes.performance 12.R.W.Weber.Ontheoptimalassignmentofcustomerstoparallelservers.Journal 13.WardWhitt.Decidingwhichqueuetojoin:Somecounterexamples.Operations Proceedings,December1996. ofappliedprobability,15:406{413,1978. 15.R.W.Wol.StochasticModelingandtheTheoryofQueues.PrenticeHall,1989. 14.W.Winston.Optimalityoftheshortestlinediscipline.JournalofAppliedProbability,14:181{189,1977. Research,34(1):226{244,January1986.