# Combinatorial Algorithms for Data Migration to Minimize Average Completion Time

Save this PDF as:

Size: px
Start display at page:

## Transcription

1 Combinatorial Algorithms for Data Migration to Minimize Average Completion Time Rajiv Gandhi 1 and Julián Mestre 1 Department of Computer Science, Rutgers University-Camden, Camden, NJ Research partially supported by Rutgers University Research Council Grant. Max-Planck-Institute für Informatik, Saarbrücken, Germany. Research done at the University of Maryland; supported by NSF Awards CCR and CCF , and the University of Maryland Dean s Dissertation Fellowship. Abstract. The data migration problem is to compute an efficient plan for moving data stored on devices in a network from one configuration to another. It is modeled by a transfer graph, where vertices represent the storage devices, and edges represent data transfers required between pairs of devices. Each vertex has a non-negative weight, and each edge has a processing time. A vertex completes when all the edges incident on it complete; the constraint is that two edges incident on the same vertex cannot be processed simultaneously. The objective is to minimize the sum of weighted completion times of all vertices. Kim (Journal of Algorithms, 55:4-57, 005 ) gave an LProunding 3-approximation algorithm when edges have unit processing times. We give a more efficient primal-dual algorithm that achieves the same approximation guarantee. When edges have arbitrary processing times we give a primal-dual 5.83-approximation algorithm. We also study a variant of the open shop scheduling problem. This is a special case of the data migration problem in which the transfer graph is bipartite and the objective is to minimize the sum of completion times of edges. We present a simple algorithm that achieves an approximation ratio of 1.414, thus improving the approximation given by Gandhi et al. (ACM Transaction on Algorithms, (1):116-19, 006). We show that the analysis of our algorithm is almost tight. 1 Introduction The data migration problem arises in large storage systems, such as Storage Area Networks [13], where a dedicated network of disks is used to store multimedia data. As the data access pattern changes over time, the load across the disks needs to be rebalanced so as to continue providing efficient service. This is done by computing a new data layout and then migrating data to convert the initial data layout to the target data layout. While migration is being performed, the storage system is running suboptimally, therefore it is important to compute a data migration schedule that converts the initial layout to the target layout quickly. This problem can be modeled as a transfer graph [15], in which the vertices represent the storage disks and an edge between two vertices u and v corresponds to a data object that must be transferred from u to v, or vice-versa. Each edge has a processing time that represents the transfer time of a data object between the disks corresponding to the end points of the edge. An important constraint is that any disk can be involved in at most one transfer at any time. Several variations of the data migration problem have been studied. These variations arise either due to different objective functions or due to additional constraints. One common objective function is to minimize the makespan of the migration schedule, i.e., the time by which all migrations complete. Coffman et al. [4] show that when the edges have unit processing times, the problem reduces to edge coloring of the transfer (multi)graph of the system. The best approximation algorithm known for minimum edge coloring [18] then yields an algorithm for data migration with unit edge processing times, whose makespan is 1.1χ + 0.8, A preliminary version of the paper appeared in the Proceedings of the 9th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 006.

2 where χ is the chromatic index of the graph. Approximation algorithms are also developed [9, 1, 13, 14] for generalizations of the makespan minimization problem in which there are storage constraints on disks and constraints on how the data can be transferred. The data migration problem has also been studied with the objective of minimizing the sum of weighted completion time over all storage disks. Kim [15] proved that the problem is NP-hard when edges have unit processing times and showed that Graham s list scheduling algorithm [8], when guided by an optimal solution to a linear programming relaxation, gives an approximation ratio of 3. Gandhi et al. [6] show that Kim s analysis is tight. Very recently, Mestre [17] improves the approximation ratio to 1 + φ, where φ is the Golden ratio; his approach builds on the algorithm presented in this paper. When edges have arbitrary processing times Kim [15] gave a 9-approximate solution. Gandhi et al. [6] improved the approximation factor to They present two algorithms each achieving an approximation ratio of 5.83 and show that combining the two solutions yields an approximation ratio of A problem related to the data migration problem is open shop scheduling. In this problem, we have a set of jobs, J, and a set of machines M 1,..., M m. Each job J j J consists of a set of m j operations. For 1 i m j, operation o j,i has processing time p j,i and must be processed on M φ(j,i). Each machine can process a single operation at any time, and two operations that belong to the same job cannot be processed simultaneously. Each job J j has a positive weight, w j and the objective is to minimize the sum of weighted completion times of all jobs. This problem is a special case of the data migration problem [6]. Open shop scheduling problem has been studied in [3, 1, 0, 1]. There has also been interest in the study of data migration problem with the objective function being to minimize the average completion time over all data transfers. This corresponds to minimizing the average edge completion time in the transfer graph. For arbitrary edge processing times, several constant factor approximation algorithms [11, 15, 7] are known with the best approximation factor being 7.68 [7]. For the case of unit processing times, Bar-Noy et al. [] showed that the problem is NP-hard and gave a simple -approximation algorithm. When restricted to bipartite graphs, the latter problem becomes a variant of open shop scheduling in which the operations have unit processing times and the objective is to minimize the sum of completion times of operations; for this problem Gandhi et al. [6] give a approximate solution that uses a sum coloring algorithm due to Halldórsson et al. [11]. 1.1 Our Contribution First we study the data migration problem with the objective of minimizing the average completion time over all storage disks. Kim [15] gave approximation algorithms for this problem: A 3-approximation when edges have unit processing times and a 9-approximation for the general case. Kim s algorithms round the solution produced by a linear programming relaxation for the problem. This algorithm involves solving a linear program with an exponential number of constraints, though there are equivalent linear programs with a polynomial number of constraints (cf. [7]). Gandhi et al. [6] show that Kim s algorithm can not give an approximation guarantee better than 3. The best approximation guarantee for the general case is 5.03 [6] which is obtained by combining solutions to two algorithms each of which yields an approximation guarantee of At a very high level all known algorithms for this problem comprise of the following two steps: (i) label the edges, (ii) consider the edges for scheduling in sorted order of their labels. The algorithms in [15, 6] label the edges based on the fractional completion times in the linear programming solution. In this work we present simple and efficient primal-dual algorithms, which constitute the first purely combinatorial algorithms for the problem. A novel aspect of our approach is that, unlike typical primal-dual algorithms, the primal solution is not constructed using (relaxed) complementary slackness. Rather, in step (i), the dual update assigns labels to the edges. These labels are used, in step (ii), to guide a scheduling procedure, which is almost identical to that in [15] for unit edge processing times and that in [6] for arbitrary edge processing times. In the analysis, the cost of the schedule is related to the cost of the dual solution via the labels, yielding a 3-approximation when edges have unit processing times, and a 5.83-approximation when edges have arbitrary processing times. The second problem we study is the data migration problem with the objective of minimizing the sum of completion times of edges. In other words, given a graph G = (V, E) we want to partition the edge set

3 E into matchings M 1, M,..., so as to minimize i i M i. A schedule is minimal if M i is maximal with respect to G \ j<i M j. Bar-Noy et al. [] show that any minimal schedule is -approximate. In this work, we introduce the notion of strongly minimal schedules. A schedule is strongly minimal if for all b 1, the b-matching j b M j is maximal in G, where b-matching is a subset of edges in G such that each vertex has at most b of its incident edges in the matching. We show that any strongly minimal schedule is - approximate, and that such schedule always exist for bipartite graphs and can be computed in polynomial time. Data migration in bipartite graphs is equivalent to a variant of open shop scheduling in which we want to minimize the sum of operation completion times. Marx [16] has shown that the problem is APX-hard. Gandhi et al. [6] show that using the sum-coloring algorithm of Halldórsson et al. [11] one can obtain a approximation guarantee. Thus, we improve this ratio to 1.414, though our guarantee does not extend to the objective of minimizing the sum of weighted edge completion times. We also show that the analysis is almost tight by giving an example on which the algorithm gives a approximate solution. Data Migration Problem We are given a graph G = (V, E). All our graphs are multigraphs, for simplicity, however, we drop the multi prefix when talking about graphs and sets of edges. Let E(u) denote the set of edges incident on a vertex u. The vertices and edges in G are jobs to be completed. Each vertex v has weight w v and each edge has processing time p e. The completion time, C e, of edge e is simply the time at which its processing is completed. The completion time, C v, of vertex v is the latest completion time of any edge in E(v). The crucial constraint is that two edges incident on the same vertex cannot be processed at the same time. The objective is to minimize v V w vc v. K q Fig. 1. An instance for the data migration problem. To get more insight into the problem, consider the following natural and intuitive algorithm for the case when edges have unit processing times: Process the edges in any order scheduling them as early as possible without creating conflicts with the edges scheduled so far. While this algorithm gives a solution that is at most twice the cost of optimal for min e C e [], the following example shows that for the objective of min v C v it may produce a solution with cost Ω( 3 n) times the optimum. Consider a complete graphs on q vertices. To this, attach q copies of K 1, q, so that each node in K q is the center of one of the stars, as shown in Figure 1. The optimal solution first schedules the stars in parallel and then the edges in K q, with a total cost of Θ(q ). On the other hand, a minimal solution may schedule the edges in K q before the stars, incurring an overall cost of Θ(q.5 ). Since the graph has Θ(q 1.5 ) vertices, this shows that a minimal schedule can be a factor Ω( 3 n) away from the optimum..1 A Linear Programming Relaxation The linear programming relaxation for the data migration problem was given by Kim [15]. Such relaxations have been proposed earlier by Wolsey [3] and Queyranne [19] for single machine scheduling problems and by Schulz [] and Hall et al. [10] for parallel machines and flow shop problems. For the purpose of clarity, we state only portions of the LP relaxation relevant for obtaining the primal-dual algorithm. 3

4 For a vertex v, let C v represent the completion time of v. Let N(u) represent the set of neighbors of vertex u and E(u) the set of edges incident on u. For any edge e = (u, v), we use p e to denote the processing time of e. For any set F E, let p(f ) = e F p e. subject to S(u) min v V w v C v p e C v p(s(u)) + p(s(u) ) u V, S(u) E(u) (1) C v p(e(v)) v V () The justification for constraints (1) is as follows. By the problem definition, no two edges incident on the same vertex can be scheduled at the same time. Consider any ordering of the edges in S(u) E(u). If an edge e S(u) is the jth edge to be scheduled among the edges in S(u) then, setting C j = C e and p j = p e, we get S(u) j=1 p j C j S(u) j=1 p j j p k = k=1 S(u) j=1 j k=1 p j p k = p(s(u)) + p(s(u) ). Combining this with the fact that C v C e, for all v V and e E(v) gives us the set of constraints (1). The dual LP contains a variable y S(u) for each set S(u), corresponding to constraint (1), and a variable z v for each v V, corresponding to constraint (). The dual LP is given below. max u V S(u) E(u) p(s(u)) + p(s(u) ) y S(u) + v V p(e(v))z v 3 Algorithm subject to z v + y S(u) p e w v v V (3) y S(u) 0 u V, S(u) E(u) (4) z v 0 v V (5) The algorithm consists of two phases labeling and scheduling. In the labeling phase, the vertices receive labels which are then used to label the edges. The scheduling phase uses the edge labels to decide the order in which the edges must be considered for processing. 3.1 Labeling Phase The high level idea of the labeling algorithm is as follows. Initially, each vertex is unlabeled and the dual variables are set to 0. The algorithm proceeds in iterations. In each iteration we find a vertex x which maximizes the total processing times of edges going from x to unlabeled neighbors of x, call this set of edges S(x). If there exists an unlabeled node h such that p(e(h)) > p(s(x)), then assign h a label of p(s(x)) and raise z h until constraint (3) is tight, namely, z h = w h y S(u) p e. e=(u,h) 4

5 Otherwise, the value of the dual variable y S(x) is increased until the dual constraint (3) is met with equality for some unlabeled neighbor v of x. In other words, y S(x) assumes the smallest value such that for some vertex v N(x) such that (x, v) S(x) we have y S(u) p e = w v. All unlabeled neighbors of x for which the above equality holds are labeled p(s(x)). Note that the dual variables are incremented in such a way that no constraint in the dual LP is violated. The label of vertex u is denoted by l(u). The label l(e) of an edge e = (u, v) is the pair (l(u), l(v)). For any two edges e = (u, v) and f = (u, v ), l(e) l(f) if: (i) min{l(u), l(v)} < min{l(u ), l(v )}, or (ii) min{l(u), l(v)} = min{l(u ), l(v )} and max{l(u), l(v)} max{l(u ), l(v )}. Label(G = (V, E)) 1 for each v V do l(v) nil // v is unlabeled 3 while there exists an unlabeled vertex do 4 let x be a vertex maximizing p(s(x)), where S(x) = {(x, v) E(x) l(v) = nil} 5 let h be an unlabeled vertex maximizing p(e(h)) 6 if p(e(h)) > p(s(x)) then 7 z h w h 8 l(h) p(s(x)) // h is now labeled 9 else n o w 10 y S(x) min P v e=(x,v) pe 11 for each v N(x) s.t. l(v) = nil do 1 w v w v y S(x) Pe=(x,v) pe 13 if w v = 0 then 14 l(v) p(s(x)) // v is now labeled 15 for each e = (u, v) E do 16 l(e) (l(u), l(v)) 17 return l Fig.. Labeling procedure. The pseudo-code for the labeling phase is given in Figure. We now state and prove some important properties of the labels and the dual solution generated by our algorithm. Lemma 1. Consider an edge e = (v, w). Let F e (w) = {f E(w) l(f) l(e)}. Then, p(f e (w)) l(v). Proof. Let N e (w) = {y N(w) (w, y) F e (w)}. Note that e F e (w) and for each vertex y N e (w) we have l(y) l(v). Consider the iteration of the algorithm in which the first vertex in N e (w) is labeled, and let y be that vertex. At the beginning of this iteration the sum of the processing times of edges in E(w) whose other endpoints are unlabeled is at least p(f e (w)). Notice that x is chosen (line 4 in the pseudo-code Label) to be the vertex with the maximum value of p(s(x)), where S(x) is set of edges incident to x whose other endpoints are unlabeled. Therefore, it must be the case that p(f e (w)) p(s(x)) = l(y). Lemma. For any h V, if z h > 0 then p(e(h)) > l(h). 5

6 Proof. In the pseudo-code, variable z h is set in line 7. In order to reach line 7, the condition of if statement in line 6 must be satisfied. Thus, p(e(h)) > l(h). Lemma 3. For any v V and S(x) such that (x, v) S(x), if y S(x) > 0, max{l(v), p(e(v))} p(s(x)). Proof. Consider the iteration in which y S(x) was set. In the pseudo-code, variable y S(x) is set in line 10. In order to reach line 10, it must be that in this iteration all unlabeled vertices have degree at most p(s(x)). In particular, since v was unlabeled at the beginning of the iteration, p(e(v)) p(s(x)). If vertex v is labeled in this iteration then l(v) = p(s(x)). Otherwise v is labeled in a later iteration. Since the value of the labels assigned by the algorithm only decreases with time, we have l(v) p(s(x)). 3. Scheduling edges with unit processing times The scheduling phase is almost the same as the list scheduling algorithm [8] used by Kim [15]. The only difference is in the entity that is used to decide the order in which the edges are processed. We use the edge labels generated by algorithm Label to decide the ordering whereas Kim [15] uses the completion times of edges as returned by the linear programming solution. The edges are sorted in increasing order of their labels. The edges are then processed in this order. When processing (u, v) E, the edge is scheduled at the earliest time such that no edge incident upon u or v is already scheduled at that time. The pseudo-code is given in Figure 3. Schedule(G = (V, E), l) 1 sort edges in lexicographic order of their labels for each e = (u, v) E, processed in sorted order do 3 schedule e as early as possible without creating conflicts with edges scheduled so far. Fig. 3. Scheduling with unit processing times In order to gain some intuition, we trace the execution of label and schedule on the unweighted instance from Figure 1. The transfer graph consists of q stars (K 1, q ) whose centers are fully connected, i.e., they induce a K q. The procedure label starts by choosing a star center a and sets the labels of its neighbors to q + q 1; then a gets a label of q + 1 and the remaining nodes get a label of q. As we saw at the beginning of Section the optimal solution first schedules the star edges and then schedules the K q edges. When schedule sorts the edge set, only the star edges incident on a may come before some of the K q edges; thus the algorithm produces a schedule that is close to optimal. Let a be the first node chosen by label and let A be the set of its neighbors. In any schedule the collective finishing time of nodes in A must be at least A / = (q + q 1) /. The finishing time of A in our schedule will be charged to this quantity. To avoid overcharging, the nodes in A are labeled with A = q + q 1. Let b a be a star center and B be its q star neighbors. When label choses b the set of unlabeled neighbors of b is B. Their collective finishing time in any solution is at least B / = q/. Since this quantity should pay for the finishing time of B, these nodes are given a label of B = q. Notice that the budget to pay for A is O(q) larger than the budget to pay for B. Hence, in our schedule we would like the nodes in B to finish earlier than the nodes in A. Indeed, the procedure schedule is set up so that the finishing time of a vertex is not much greater than its given label. 3.3 Analysis when edges have unit processing times Let C v be the completion time of vertex v in our algorithm. Recall that E(v) denotes the set of edges incident on a vertex v, and N(v) the set of neighbors of v. 6

7 Lemma 4. For each v V, C v l(v) + E(v) 1. Proof. Let e v = (w, v) be the last edge to finish among the edges in E(v). Recall from Lemma 1 that F ev (w) = {f E(w) l(f) l(e v )}. Observe that because of the order in which the edges are scheduled, we have C v F ev (w) + E(v) 1. From Lemma 1 we know that p(f ev (w)) = F ev (w) l(v). Substituting this in the above expression completes the proof. Theorem 1. The data migration problem with edges having unit processing times has a 3-approximate primal-dual algorithm. Proof. Let G = (V, E) be an instance of the data migration problem. The cost of the schedule found by our algorithm is given by w v (l(v) + E(v) ) [ using Lemma 4 ] v V w v Cv v V = v V w v l(v) + v V w v E(v) Clearly v V w v E(v) OP T (G). Thus, if we can show that v V w vl(v) OP T (G) we are done. To that end, we relate v V w vl(v) to the cost of the dual feasible solution obtained by the algorithm, which we denote by DF S(G). ( z v + ) y S(u) l(v) [ every vertex is tight ] v V w v l(v) = v V = v V v V v V = v V = v V z v l(v) + v V z v E(v) + v V z v E(v) + v V z v E(v) + z v E(v) + DF S(G) y S(u) l(v) u V e S(u) S(u) E(u) u V S(u) E(u) y S(u) l(v) [ using Lemma ] y S(u) S(u) [ using Lemma 3 ] y S(u) S(u) y S(u) S(u) Since DF S(G) is a lower bound on OP T (G), it follows that v V w vl(v) OP T (G). 3.4 Scheduling edges with arbitrary processing times The scheduling phase is essentially the same as in [6]. The only difference is in the order in which the edges are scheduled we decide the order using the labels on the edges, whereas in [6], the ordering is based 7

8 on the completion times of the edges in the optimal linear programming solution. For completeness, we restate the scheduling algorithm here as described in [6]. In the scheduling phase, each edge e = (u, v) waits for W e time units before it can be scheduled, where W e = β max{p(f e (u)), p(f e (v))}. In the above expression, F e (u) = {f E(u) l(f) l(e)} and F e (v) = {f E(v) l(f) l(e)}. When processing (u, v) E, the edge is scheduled at the earliest time such that no edge incident upon u or v is already scheduled at that time. When e is being processed, we say that e is active. Once it becomes active, it remains active for p e time steps, after which it is finished. A not-yet-active edge can be waiting only if none of its neighboring edges are active; otherwise, it is said to be delayed. Thus, at any time, an edge is in one of four modes: delayed, waiting, active, or finished. When adding new active edges, among those that have done their waiting duty, the algorithm uses the labels on edges as priorities. The precise rules are given in the pseudo-code below. Let wait(e, t) denote the number of time steps that e has waited until the end of time step t. Let Active(t) be the set of active edges during time step t. Let C e ( C u ) be the completion time of edge e (vertex u) in our algorithm. The pseudo code for the algorithm, as it appears in Figure 4, would run in pseudo-polynomial time; however, it is easy to implement the algorithm in strongly polynomial time, by increasing t in each iteration by the smallest remaining processing time of an active edge. Schedule(G = (V, E), l, W ) 1 sort edges (u, v) E in lexicographic order of their labels. t 0 3 F inished Active(t) 4 for each e E do 5 wait(e, t) 0 6 while (F inished E) do 7 t t Active(t) {e e Active(t 1) and e Active(t p e)} 9 for each edge e Active(t 1) \ Active(t) do 10 Ce e t 1 11 F inished F inished {e} // e is finished 1 for each edge e = (u, v) E \ (Active(t) F inished) 13 processed in non-decreasing order of their labels do 14 if (Active(t) (E(u) E(v)) = ) and (wait(e, t 1) = W e) then 15 Active(t) Active(t) {e} // e is active 16 for each edge e = (u, v) E \ (Active(t) F inished) do 17 if (Active(t) (E(u) E(v)) = ) then 18 wait(e, t) wait(e, t 1) + 1 // e is waiting 19 else wait(e, t) wait(e, t 1) // e is delayed 0 return C e Fig. 4. Scheduling with arbitrary processing times. 3.5 Analysis when edges have arbitrary processing times Consider a vertex x and an edge e = (x, y). Let B e (x) = {f E(x) l(f) > l(e), C f < C e }, i.e., edges in E(x) that have a greater label than that of e, but finish before e in our algorithm. Recall that F e (x) = {f E(x) l(f) l(e)}. Note that e F e (x). Let F e (x) = F e (x) \ {e}. We analyze the completion time of an arbitrary but fixed vertex v V. Without loss of generality, let e v = (v, w) be the edge that finishes last among the edges in E(v). We analyze our algorithm for the case 8

9 where all edges in F ev (v) F ev (w) finish before e v in our algorithm. If this is not true then our results can only improve. Let C v be the completion time of vertex v in our algorithm. Lemma 5. For each v V, C v β max{p(f ev (v)), l(v)} + p(e(v)) + p(f ev (w)) + p(b ev (w)). Proof. Observe that when e v is in delayed mode it must be that some edge in F ev (v) B ev (v) F ev (w) B ev (w) must be active. Hence, we have C v = C ev W ev + p(f ev (v)) + p(b ev (v)) + p(f ev (w)) + p(b ev (w)) = β max{p(f ev (v)), p(f ev (w))} + p(e(v)) + p(f ev (w)) + p(b ev (w)) β max{p(f ev (v)), l(v)} + p(e(v)) + p(f ev (w)) + p(b ev (w)) The last expression follows using Lemma 1. Lemma 6. For any vertex v V, p(f ev (w)) + p(b ev (w)) max{p(f ev (v)), l(v)} + 1 β p(e(v)). Proof. Let f = (w, z) B ev (w) be an edge with the largest label. When edge f is waiting, it must be that e v is waiting or some edge in E(v) is being processed. Thus we have β ( p(f ev (w)) + p(b ev (w)) ) W f W ev + p(e(v)) = β max{p(f ev (v)), p(f ev (w))} + p(e(v)) β max{p(f ev (v)), l(v))} + p(e(v)) The last inequality follows from Lemma 1. Lemma 7. For any vertex v V, C v (1 + β) max{p(e(v)), l(v)} + (1 + 1 β ) p(e(v)) Proof. The claim follows from Lemmas 5 and 6, and the fact that p(f ev (v)) p(e(v)). Theorem. The data migration problem with edges having arbitrary processing times has a 5.83-approximate primal-dual algorithm. Proof. Let G = (V, E) be an instance of the data migration problem. The cost of the schedule found by our algorithm is given by ( w v ((1 + β) max{p(e(v)), l(v)} ) ) p(e(v)) [ using Lemma 7 ] β v V w v Cv v V = (1 + β) v V w v max{p(e(v)), l(v)} + Clearly, v V w v p(e(v)) OP T (G). Now, suppose ( ) w v p(e(v)) β v V w v max{p(e(v)), l(v)} OP T (G). (6) v V It follows that w v Cv v V ( 3 + β + 1 ) OP T (G). β 9

10 The right hand side in the above expression is minimized for β = 1, which gives a ratio of It all boils down to showing (6). This is done by relating the left hand side of (6) to the cost of the dual feasible solution obtained by the algorithm, which we denote by DF S(G). w v max{p(e(v)), l(v)} v V = v V v V v V = v V = v V ( z v + DF S(G) z v p(e(v)) + v V z v p(e(v)) + v V z v p(e(v)) + z v p(e(v)) + y S(u) p e ) max{p(e(v)), l(v)} [ all v V are tight ] u V e S(u) S(u) E(u) u V S(u) E(u) y S(u) p e max{p(e(v)), l(v)} [ using Lemma ] y S(u) p e p(s(u)) [ using Lemma 3 ] y S(u) p e p(s(u)) y S(u) p(s(u)) Since DF S(G) is a lower bound on OP T (G), it follows that v V w v max{p(e(v)), l(v)} OP T (G). 4 Minimizing Sum of Edge Completion Times The problem of scheduling the edges (with unit processing times) of a graph to minimize the sum of their completion times can be cast as an edge coloring problem: Given G = (V, E) we want to partition the edge set E into matchings M 1,..., M k as to minimize i i M i. Indeed, this problem is also known as minimum sum edge coloring. Bar-Noy et al. [] show that any minimal schedule is -approximate. In a minimal schedule every matching M i is maximal with respect to G \ j<i M j. The main result of this section is to identify a stronger minimality requirement that results in a better approximation guarantee. Definition 1. A schedule M 1,..., M k of G is said to be strongly minimal if, for all 1 b k, the b-matching i b M i is maximal with respect to G. Theorem 3. Any strongly minimal schedule is -approximate. Proof. The high level idea of the proof is to assign every edge to at least one of its endpoints. Each vertex is responsible for paying for the cost of the edges assigned to it. In order to pay for this cost each vertex charges a lower bound on the completion time of the edges assigned to it. Let (u, v) M i, we say endpoint u is full if u is matched in all M j, j < i. We consider the endpoints of edges in M 1 to be full. Notice that every edge (u, v) M i must have at least one full endpoint, otherwise j<i M j + (u, v) would be a valid (i 1)-matching, which contradicts the fact that the schedule is strongly minimal. If both endpoints of (u, v) are full then the edge is half-assigned to u and v. Otherwise the edge is fully-assigned to the one full endpoint. Every vertex u is responsible for the cost of edges assigned to it. If an edge is half-assigned to u, then u pays for half of its completion time; if the edge is fully-assigned then u pays in full. Let s 1 and s be 10

11 the number of half-assigned and fully-assigned edges to u respectively. Notice that all edges assigned to u must belong to M j for some j s 1 + s. Think of u as paying 1 of the completion time of all edges assigned to it, plus an additional 1 for the fully-assigned edges, which in the worst case will be scheduled the latest, u must pay 1 s 1+s i=1 i + 1 s 1+s i=s 1+1 Vertex u will pay this amount by charging the completion time (in the optimal solution) of the edges assigned to it. Fully-assigned edges are charged a soon-to-be-determined ρ factor, and half-assigned edges are charged ρ. This constitutes u s budget. How fast can the optimal solution possibly schedule these edges? u s budget ρ s 1+s Notice that every edge is charged at most to an extent of ρ: fully-assigned edges are charged ρ once, from a single endpoint, and half-assigned edges are charged ρ twice, once from each endpoint. Thus, strongly minimal schedules are ρ-approximate. The discrepancy between the upper and lower bound on the completion times of edges assigned to u is due to fully-assigned edges which are scheduled the latest in the upper bound, and the earliest in the lower bound. We need to determine the smallest ρ such that u s budget is enough to cover u s payment, namely i=1 i + ρ s i=1 i. i. (s 1 + s )(s 1 + s + 1) 4 + (s 1 + s + 1)s 4 ρ (s 1 + s )(s 1 + s + 1) 4 + ρ s (s + 1). 4 Or equivalently, (s 1 + s ) + (s 1 + s )s ρ(s 1 + s ) + ρs + (ρ 1)(s 1 + s ). Let α = s s 1+s. Since ρ > 1, the above follows, provided 1 + α α 1 + α ρ. The left hand side is maximized for α = 1, which yields ρ While strongly minimal schedules are not guaranteed to exist for general graphs, we now show that in bipartite graphs they always exist and can be computed in polynomial time. The bipartite is an interesting and nontrivial case: It is a variant of the open shop scheduling problem in which we want to minimize the sum of completion time of operations [6]. The problem is APX-hard [16] and the best known approximation guarantee for it is [6]. Theorem 4. The procedure find strongly minimal is a -approximation for minimizing the sum of completion times of open shop with unit processing times. scheduling. Proof. In each iteration, the procedure find strongly minimal computes a matching incident to the maximum degree vertices of G and removes the matching from G. This continues until all edges have been removed. The matchings found are then scheduled in reverse order. Because the degree of G decreases by one with each iteration, the algorithm finishes after iterations, here is the degree of the original graph. Let us argue that the schedule found is strongly minimal. Let e M i and b < i, we want to show that e cannot be added to j b M j without violating the b-matching property. Let G be the remaining graph when M i was computed. One of the endpoint of e must have degree i in G, let u be that endpoint. After 11

12 find strongly minimal(g) 1 for i down to 1 do M i a matching incident to all vertices of G with degree i 3 G G \ M i 4 return M 1, M,..., M Fig. 5. Computing a strongly minimal schedule. removing M i the degree of u becomes i 1, and thus u must be matched in M i 1. In general u will be matched in all M j<i. Therefore, the degree of u in j b M j is b, which in turn means the b-matching is maximal with respect to e. In bipartite graphs a matching incident to all the maximum degree vertices always exists and can be computed in polynomial time (cf. [5]). Together with Theorem 3, this finishes the proof. 4.1 An almost tight example While at first sight the analysis of the approximation factor of strongly minimal schedules may seem too pessimistic, it turns out it is almost tight. Consider the following bipartite graph with vertices u 1,..., u n on one side and vertices v 1,..., v n on the other side of the bipartition. There is an edge (u i, v j ) E if and only if i j. It is not difficult to show that the optimal schedule uses matchings M k = {(u i, v i+k 1 ) i n k + 1} and has cost n i=1 i (n i + 1) = 1 6 n3 + 3n + n. Now suppose we run find strongly minimal. Initially the maximum degree vertices are u 1 and v n, and the algorithm finds the matching M n consisting of (u 1, v n ) and (u n +1, v n ). After removing M n the maximum degree vertices are u 1, u, v n 1, and v n. In general the algorithm may find, for n < k n, M k = {(u i, v i+k n 1 ) i n k + 1} {(u j k+ n +1, v j ) j k}. After these matchings are removed from the graph we are left with a complete bipartite graph on u 1,... u n and v n +1,... v n, thus M k = n for all 1 k n. Therefore, the cost of this strongly minimal schedule is n n n. The ratio of the cost of the optimal and strongly minimal solutions approaches as n. Compare this to the approximation guarantee of obtained in Theorem LP formulation gap Let us now study the inherent limitations of the lower bounding technique used to prove Theorem 3. The lower bound used there can be generalized as follows: For any subset of edges S(u) incident on a vertex u, we know that any feasible schedule must spend at least S(u) ( S(u) +1) time on these edges. We can charge ( the cost incurred by this set of edges, a factor y S(u) 0. If for every edge e the total charge S(u) : e S(u) S(u)) y on e is at most 1, then S(u) ( S(u) +1) S(u) y S(u) offers a lower bound on the cost an optimal schedule. The best such lower bound corresponds to the optimal solution of the following dual linear program. 1

13 Fig. 6. LP formulation gap examples for general and bipartite instances for min P e Ce. Integral finishing times appear in black; fractional finishing times appear in gray. subject to max u V S(u) E(U) S(u) ( S(u) + 1) y S(u) y S(u) 1 e E (7) y S(u) 0 u V, S(u) E(u) In hindsight, the proof of Theorem 3 can be viewed as a case of dual-fitting in which constraint (7) is violated a factor. To determine how good a lower bound the dual offers, we derive the primal LP and study its LP fomulation gap, that is, the ratio of the cost of the best optimal schedule to the cost of the best fractional schedule. (Here we do not talk about integrality gap because the LP formulations are not exact even if integrality is inforced.) Theorem 5. The LP formulation gap of the LP below is at least 4 3 in general graphs and at least 10 9 bipartite graphs. in min e E C e subject to e S(u) C e C e 0 S(u) ( S(u) + 1) u V, S(u) E(u) (8) e E Proof. For general graphs, consider a triangle. The optimal solution schedules one edge at the time, and incurs a cost of 6. The LP can schedule all edges at C e = 1.5, with a cost of 4.5. Thus, the LP formulation gap for this graph is 4 3. For the bipartite case (our example is in fact a tree) consider a spider with three legs of length two. The graph is shown in Figure 6 along with the edge completion times of an optimal schedule (in black and to the left) and of the optimal LP solution (in gray and to the right). Optimum schedules three edges in M 1, two in M and one in M 3, with a total cost of 10. On the other hand, the LP solution manages to schedule all edges in two rounds, with a total cost of 9. Thus the LP formulation gap for bipartite graphs is at least

14 4.3 Limitations of strongly minimal schedules We conclude this section with a note on the limitations of strongly minimal schedules. One common generalization of our scheduling problem is to minimize the weighted sum of completion times. In this setting the proof of Theorem 3 does not go through as we make crucial use of the fact that the edges have uniform weight. It would be natural to hope that the following slight modification of find strongly minimal would produce good schedules: Instead of finding any matching incident to the maximum degree vertices, find one with minimum weight. Unfortunately, the following bipartite example shows that strongly minimal schedules are just not suited for the weighted case. Take a path of length four and replace each edge with a copy of K t,t. The edges in the first and the last K t,t have weight 1, and the ones in the middle have weight 0. The optimal solution schedules the first and the last K t,t in the first t rounds and the remaining edges are scheduled in the next t rounds, with a total cost of t (t + 1). On the other hand, any strongly minimal solution can schedule at most t edges with weight 1 per round, thus incurring a total cost of t (t + 1). The ratio of the cost of the two solutions approaches as t. Acknowledgements: We thank Yoo-Ah Kim for useful discussions. References 1. E. Anderson, J. Hall, J. Hartline, M. Hobbes, A. Karlin, J. Saia, R. Swaminathan, and J. Wilkes. An Experimental Study of Data Migration Algorithms. Proc. of the Workshop on Algorithm Engineering, pages , A. Bar-Noy, M. Bellare, M. M. Halldórsson, H. Shachnai, and T. Tamir. On Chromatic Sums and Distributed Resource Allocation. Information and Computation, 140:183-0, S. Chakrabarti, C. A. Phillips, A. S. Schulz, D. B. Shmoys, C. Stein, and J. Wein. Improved Scheduling Problems For Minsum Criteria. Proc. of the 3rd International Colloquium on Automata, Languages, and Programming, LNCS 1099, , E. G. Coffman, M. R. Garey, D. S. Johnson, and A. S. LaPaugh. Scheduling File Transfers. SIAM Journal on Computing, 14(3): , H. Gabow and O. Kariv. Algorithms for edge coloring bipartite graphs and multigraphs. SIAM Journal of Computing, 11(1), February R. Gandhi, M. M. Halldórsson, G. Kortsarz, and H. Shachnai. Improved Results for Data Migration and Openshop Scheduling. ACM Transactions on Algorithms, (1):116-19, R. Gandhi, M. M. Halldórsson, G. Kortsarz, and H. Shachnai. Improved Bounds for Scheduling Conflicting Jobs with Minsum Criteria. Proc. of the Second Workshop on Approximation and Online Algorithms, 68-8, R. Graham. Bounds for certain multiprocessing anomalies. Bell System Technical Journal, 45: , J. Hall, J. Hartline, A. Karlin, J. Saia, and J. Wilkes. On Algorithms for Efficient Data Migration. Proc. of the 1th ACM-SIAM Symposium on Discrete Algorithms, 60-69, L. Hall, A. S. Schulz, D. B. Shmoys, and J. Wein. Scheduling to Minimize Average Completion Time: Off-line and On-line Approximation Algorithms. Mathematics of Operations Research, : , M. M. Halldórsson, G. Kortsarz, and H. Shachnai. Sum Coloring Interval Graphs and k-claw Free Graphs with Applications for Scheduling Dependent Jobs. Algorithmica, 37:187-09, H. Hoogeveen, P. Schuurman, and G. Woeginger. Non-approximability Results For Scheduling Problems with Minsum Criteria. Proc. of the 6th International Conference on Integer Programming and Combinatorial Optimization, LNCS 141, , S. Khuller, Y. Kim, and Y. C. Wan. Algorithms for Data Migration with Cloning. SIAM Journal on Computing, 33():448461, S. Khuller and A. Malekian and Y. Kim.. Improved Algorithms for Data Migration. In Proc. of the 9th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, , Y. Kim. Data Migration to Minimize the Average Completion Time. Journal of Algorithms,55:4-57, D. Marx. Complexity results for minimum sum edge coloring. Manuscript, J. Mestre. Adaptive Local Ratio. Proc. of the 19th ACM-SIAM Symposium on Discrete Algorithms,

15 18. T. Nishizeki and K. Kashiwagi. On the 1.1 edge-coloring of multigraphs. SIAM Journal on Discrete Mathematics, 3(3): , M. Queyranne. Structure of a Simple Scheduling Polyhedron. Mathematical Programming, 58:63-85, M. Queyranne and M. Sviridenko. A ( + ɛ)-approximation Algorithm for Generalized Preemptive Open Shop Problem with Minsum Objective. Journal of Algorithms, 45:0-1, M. Queyranne and M. Sviridenko. Approximation Algorithms for Shop Scheduling Problems with Minsum Objective. Journal of Scheduling, 5:87-305, 00.. A. S. Schulz. Scheduling to Minimize Total Weighted Completion Time: Performance Guarantees of LPbased Heuristics and Lower Bounds. In Proc. of the 5th International Conference on Integer Programming and Combinatorial Optimization, LNCS 1084, , L. Wolsey. Mixed Integer Programming Formulations for Production Planning and Scheduling Problems. Invited talk at the 1th International Symposium on Mathematical Programming, MIT, Cambridge,

### Improved Results for Data Migration and Open Shop Scheduling

Improved Results for Data Migration and Open Shop Scheduling Rajiv Gandhi 1, Magnús M. Halldórsson, Guy Kortsarz 1, and Hadas Shachnai 3 1 Department of Computer Science, Rutgers University, Camden, NJ

### Improved Algorithms for Data Migration

Improved Algorithms for Data Migration Samir Khuller 1, Yoo-Ah Kim, and Azarakhsh Malekian 1 Department of Computer Science, University of Maryland, College Park, MD 20742. Research supported by NSF Award

### Data Migration in Heterogeneous Storage Systems

011 31st International Conference on Distributed Computing Systems Data Migration in Heterogeneous Storage Systems Chadi Kari Department of Computer Science and Engineering University of Connecticut Storrs,

### ! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. #-approximation algorithm.

Approximation Algorithms 11 Approximation Algorithms Q Suppose I need to solve an NP-hard problem What should I do? A Theory says you're unlikely to find a poly-time algorithm Must sacrifice one of three

### Minimal Cost Reconfiguration of Data Placement in a Storage Area Network

Minimal Cost Reconfiguration of Data Placement in a Storage Area Network Hadas Shachnai Gal Tamir Tami Tamir Abstract Video-on-Demand (VoD) services require frequent updates in file configuration on the

### Approximation Algorithms

Approximation Algorithms or: How I Learned to Stop Worrying and Deal with NP-Completeness Ong Jit Sheng, Jonathan (A0073924B) March, 2012 Overview Key Results (I) General techniques: Greedy algorithms

### 2.3 Scheduling jobs on identical parallel machines

2.3 Scheduling jobs on identical parallel machines There are jobs to be processed, and there are identical machines (running in parallel) to which each job may be assigned Each job = 1,,, must be processed

### Definition 11.1. Given a graph G on n vertices, we define the following quantities:

Lecture 11 The Lovász ϑ Function 11.1 Perfect graphs We begin with some background on perfect graphs. graphs. First, we define some quantities on Definition 11.1. Given a graph G on n vertices, we define

### Applied Algorithm Design Lecture 5

Applied Algorithm Design Lecture 5 Pietro Michiardi Eurecom Pietro Michiardi (Eurecom) Applied Algorithm Design Lecture 5 1 / 86 Approximation Algorithms Pietro Michiardi (Eurecom) Applied Algorithm Design

### Approximated Distributed Minimum Vertex Cover Algorithms for Bounded Degree Graphs

Approximated Distributed Minimum Vertex Cover Algorithms for Bounded Degree Graphs Yong Zhang 1.2, Francis Y.L. Chin 2, and Hing-Fung Ting 2 1 College of Mathematics and Computer Science, Hebei University,

Approximation Algorithms Chapter Approximation Algorithms Q. Suppose I need to solve an NP-hard problem. What should I do? A. Theory says you're unlikely to find a poly-time algorithm. Must sacrifice one

### ! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. !-approximation algorithm.

Approximation Algorithms Chapter Approximation Algorithms Q Suppose I need to solve an NP-hard problem What should I do? A Theory says you're unlikely to find a poly-time algorithm Must sacrifice one of

### Fairness in Routing and Load Balancing

Fairness in Routing and Load Balancing Jon Kleinberg Yuval Rabani Éva Tardos Abstract We consider the issue of network routing subject to explicit fairness conditions. The optimization of fairness criteria

### Algorithm Design and Analysis

Algorithm Design and Analysis LECTURE 27 Approximation Algorithms Load Balancing Weighted Vertex Cover Reminder: Fill out SRTEs online Don t forget to click submit Sofya Raskhodnikova 12/6/2011 S. Raskhodnikova;

### Approximation Algorithms: LP Relaxation, Rounding, and Randomized Rounding Techniques. My T. Thai

Approximation Algorithms: LP Relaxation, Rounding, and Randomized Rounding Techniques My T. Thai 1 Overview An overview of LP relaxation and rounding method is as follows: 1. Formulate an optimization

### A class of on-line scheduling algorithms to minimize total completion time

A class of on-line scheduling algorithms to minimize total completion time X. Lu R.A. Sitters L. Stougie Abstract We consider the problem of scheduling jobs on-line on a single machine and on identical

### Offline sorting buffers on Line

Offline sorting buffers on Line Rohit Khandekar 1 and Vinayaka Pandit 2 1 University of Waterloo, ON, Canada. email: rkhandekar@gmail.com 2 IBM India Research Lab, New Delhi. email: pvinayak@in.ibm.com

### An Experimental Study of Data Migration Algorithms

An Experimental Study of Data Migration Algorithms Eric Anderson 1,JoeHall 2, Jason Hartline 2, Michael Hobbs 1, Anna R. Karlin 2, Jared Saia 2, Ram Swaminathan 1, and John Wilkes 1 1 Storage Systems Program,

### Online Scheduling with Bounded Migration

Online Scheduling with Bounded Migration Peter Sanders, Naveen Sivadasan, and Martin Skutella Max-Planck-Institut für Informatik, Saarbrücken, Germany, {sanders,ns,skutella}@mpi-sb.mpg.de Abstract. Consider

### 8.1 Min Degree Spanning Tree

CS880: Approximations Algorithms Scribe: Siddharth Barman Lecturer: Shuchi Chawla Topic: Min Degree Spanning Tree Date: 02/15/07 In this lecture we give a local search based algorithm for the Min Degree

### Analysis of Approximation Algorithms for k-set Cover using Factor-Revealing Linear Programs

Analysis of Approximation Algorithms for k-set Cover using Factor-Revealing Linear Programs Stavros Athanassopoulos, Ioannis Caragiannis, and Christos Kaklamanis Research Academic Computer Technology Institute

### Topic: Greedy Approximations: Set Cover and Min Makespan Date: 1/30/06

CS880: Approximations Algorithms Scribe: Matt Elder Lecturer: Shuchi Chawla Topic: Greedy Approximations: Set Cover and Min Makespan Date: 1/30/06 3.1 Set Cover The Set Cover problem is: Given a set of

### Lecture 3: Linear Programming Relaxations and Rounding

Lecture 3: Linear Programming Relaxations and Rounding 1 Approximation Algorithms and Linear Relaxations For the time being, suppose we have a minimization problem. Many times, the problem at hand can

### Scheduling Shop Scheduling. Tim Nieberg

Scheduling Shop Scheduling Tim Nieberg Shop models: General Introduction Remark: Consider non preemptive problems with regular objectives Notation Shop Problems: m machines, n jobs 1,..., n operations

### The Conference Call Search Problem in Wireless Networks

The Conference Call Search Problem in Wireless Networks Leah Epstein 1, and Asaf Levin 2 1 Department of Mathematics, University of Haifa, 31905 Haifa, Israel. lea@math.haifa.ac.il 2 Department of Statistics,

### The Goldberg Rao Algorithm for the Maximum Flow Problem

The Goldberg Rao Algorithm for the Maximum Flow Problem COS 528 class notes October 18, 2006 Scribe: Dávid Papp Main idea: use of the blocking flow paradigm to achieve essentially O(min{m 2/3, n 1/2 }

### The Relative Worst Order Ratio for On-Line Algorithms

The Relative Worst Order Ratio for On-Line Algorithms Joan Boyar 1 and Lene M. Favrholdt 2 1 Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark, joan@imada.sdu.dk

### Lecture 7: Approximation via Randomized Rounding

Lecture 7: Approximation via Randomized Rounding Often LPs return a fractional solution where the solution x, which is supposed to be in {0, } n, is in [0, ] n instead. There is a generic way of obtaining

### CSC2420 Spring 2015: Lecture 3

CSC2420 Spring 2015: Lecture 3 Allan Borodin January 22, 2015 1 / 1 Announcements and todays agenda Assignment 1 due next Thursday. I may add one or two additional questions today or tomorrow. Todays agenda

### Permutation Betting Markets: Singleton Betting with Extra Information

Permutation Betting Markets: Singleton Betting with Extra Information Mohammad Ghodsi Sharif University of Technology ghodsi@sharif.edu Hamid Mahini Sharif University of Technology mahini@ce.sharif.edu

### Week 5 Integral Polyhedra

Week 5 Integral Polyhedra We have seen some examples 1 of linear programming formulation that are integral, meaning that every basic feasible solution is an integral vector. This week we develop a theory

### Completion Time Scheduling and the WSRPT Algorithm

Completion Time Scheduling and the WSRPT Algorithm Bo Xiong, Christine Chung Department of Computer Science, Connecticut College, New London, CT {bxiong,cchung}@conncoll.edu Abstract. We consider the online

### Weighted Sum Coloring in Batch Scheduling of Conflicting Jobs

Weighted Sum Coloring in Batch Scheduling of Conflicting Jobs Leah Epstein Magnús M. Halldórsson Asaf Levin Hadas Shachnai Abstract Motivated by applications in batch scheduling of jobs in manufacturing

### Scheduling to Minimize Power Consumption using Submodular Functions

Scheduling to Minimize Power Consumption using Submodular Functions Erik D. Demaine MIT edemaine@mit.edu Morteza Zadimoghaddam MIT morteza@mit.edu ABSTRACT We develop logarithmic approximation algorithms

### princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming Lecturer: Sanjeev Arora

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming Lecturer: Sanjeev Arora Scribe: One of the running themes in this course is the notion of

### Approximating Minimum Bounded Degree Spanning Trees to within One of Optimal

Approximating Minimum Bounded Degree Spanning Trees to within One of Optimal ABSTACT Mohit Singh Tepper School of Business Carnegie Mellon University Pittsburgh, PA USA mohits@andrew.cmu.edu In the MINIMUM

### A Near-linear Time Constant Factor Algorithm for Unsplittable Flow Problem on Line with Bag Constraints

A Near-linear Time Constant Factor Algorithm for Unsplittable Flow Problem on Line with Bag Constraints Venkatesan T. Chakaravarthy, Anamitra R. Choudhury, and Yogish Sabharwal IBM Research - India, New

Load balancing of temporary tasks in the l p norm Yossi Azar a,1, Amir Epstein a,2, Leah Epstein b,3 a School of Computer Science, Tel Aviv University, Tel Aviv, Israel. b School of Computer Science, The

### Ecient approximation algorithm for minimizing makespan. on uniformly related machines. Chandra Chekuri. November 25, 1997.

Ecient approximation algorithm for minimizing makespan on uniformly related machines Chandra Chekuri November 25, 1997 Abstract We obtain a new ecient approximation algorithm for scheduling precedence

### 11. APPROXIMATION ALGORITHMS

11. APPROXIMATION ALGORITHMS load balancing center selection pricing method: vertex cover LP rounding: vertex cover generalized load balancing knapsack problem Lecture slides by Kevin Wayne Copyright 2005

### Bargaining Solutions in a Social Network

Bargaining Solutions in a Social Network Tanmoy Chakraborty and Michael Kearns Department of Computer and Information Science University of Pennsylvania Abstract. We study the concept of bargaining solutions,

### Primal-Dual Schema for Capacitated Covering Problems

Primal-Dual Schema for Capacitated Covering Problems Tim Carnes and David Shmoys Cornell University, Ithaca NY 14853, USA Abstract. Primal-dual algorithms have played an integral role in recent developments

### GENERATING LOW-DEGREE 2-SPANNERS

SIAM J. COMPUT. c 1998 Society for Industrial and Applied Mathematics Vol. 27, No. 5, pp. 1438 1456, October 1998 013 GENERATING LOW-DEGREE 2-SPANNERS GUY KORTSARZ AND DAVID PELEG Abstract. A k-spanner

### Nan Kong, Andrew J. Schaefer. Department of Industrial Engineering, Univeristy of Pittsburgh, PA 15261, USA

A Factor 1 2 Approximation Algorithm for Two-Stage Stochastic Matching Problems Nan Kong, Andrew J. Schaefer Department of Industrial Engineering, Univeristy of Pittsburgh, PA 15261, USA Abstract We introduce

### Proximal mapping via network optimization

L. Vandenberghe EE236C (Spring 23-4) Proximal mapping via network optimization minimum cut and maximum flow problems parametric minimum cut problem application to proximal mapping Introduction this lecture:

### Ph.D. Thesis. Judit Nagy-György. Supervisor: Péter Hajnal Associate Professor

Online algorithms for combinatorial problems Ph.D. Thesis by Judit Nagy-György Supervisor: Péter Hajnal Associate Professor Doctoral School in Mathematics and Computer Science University of Szeged Bolyai

### Class constrained bin covering

Class constrained bin covering Leah Epstein Csanád Imreh Asaf Levin Abstract We study the following variant of the bin covering problem. We are given a set of unit sized items, where each item has a color

### Weighted Sum Coloring in Batch Scheduling of Conflicting Jobs

Weighted Sum Coloring in Batch Scheduling of Conflicting Jobs Leah Epstein Magnús M. Halldórsson Asaf Levin Hadas Shachnai Abstract Motivated by applications in batch scheduling of jobs in manufacturing

### An Empirical Study of Two MIS Algorithms

An Empirical Study of Two MIS Algorithms Email: Tushar Bisht and Kishore Kothapalli International Institute of Information Technology, Hyderabad Hyderabad, Andhra Pradesh, India 32. tushar.bisht@research.iiit.ac.in,

### A constant-factor approximation algorithm for the k-median problem

A constant-factor approximation algorithm for the k-median problem Moses Charikar Sudipto Guha Éva Tardos David B. Shmoys July 23, 2002 Abstract We present the first constant-factor approximation algorithm

### Scheduling Parallel Jobs with Linear Speedup

Scheduling Parallel Jobs with Linear Speedup Alexander Grigoriev and Marc Uetz Maastricht University, Quantitative Economics, P.O.Box 616, 6200 MD Maastricht, The Netherlands. Email: {a.grigoriev,m.uetz}@ke.unimaas.nl

### SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH

31 Kragujevac J. Math. 25 (2003) 31 49. SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH Kinkar Ch. Das Department of Mathematics, Indian Institute of Technology, Kharagpur 721302, W.B.,

### Discrete Applied Mathematics. The firefighter problem with more than one firefighter on trees

Discrete Applied Mathematics 161 (2013) 899 908 Contents lists available at SciVerse ScienceDirect Discrete Applied Mathematics journal homepage: www.elsevier.com/locate/dam The firefighter problem with

### List Scheduling in Order of α-points on a Single Machine

List Scheduling in Order of α-points on a Single Machine Martin Skutella Fachbereich Mathematik, Universität Dortmund, D 4422 Dortmund, Germany martin.skutella@uni-dortmund.de http://www.mathematik.uni-dortmund.de/

### Guessing Game: NP-Complete?

Guessing Game: NP-Complete? 1. LONGEST-PATH: Given a graph G = (V, E), does there exists a simple path of length at least k edges? YES 2. SHORTEST-PATH: Given a graph G = (V, E), does there exists a simple

### Energy Efficient Monitoring in Sensor Networks

Energy Efficient Monitoring in Sensor Networks Amol Deshpande, Samir Khuller, Azarakhsh Malekian, Mohammed Toossi Computer Science Department, University of Maryland, A.V. Williams Building, College Park,

### Small Maximal Independent Sets and Faster Exact Graph Coloring

Small Maximal Independent Sets and Faster Exact Graph Coloring David Eppstein Univ. of California, Irvine Dept. of Information and Computer Science The Exact Graph Coloring Problem: Given an undirected

### A simpler and better derandomization of an approximation algorithm for Single Source Rent-or-Buy

A simpler and better derandomization of an approximation algorithm for Single Source Rent-or-Buy David P. Williamson Anke van Zuylen School of Operations Research and Industrial Engineering, Cornell University,

### Steiner Tree Approximation via IRR. Randomized Rounding

Steiner Tree Approximation via Iterative Randomized Rounding Graduate Program in Logic, Algorithms and Computation μπλ Network Algorithms and Complexity June 18, 2013 Overview 1 Introduction Scope Related

### Bicolored Shortest Paths in Graphs with Applications to Network Overlay Design

Bicolored Shortest Paths in Graphs with Applications to Network Overlay Design Hongsik Choi and Hyeong-Ah Choi Department of Electrical Engineering and Computer Science George Washington University Washington,

### 3. Linear Programming and Polyhedral Combinatorics

Massachusetts Institute of Technology Handout 6 18.433: Combinatorial Optimization February 20th, 2009 Michel X. Goemans 3. Linear Programming and Polyhedral Combinatorics Summary of what was seen in the

### Near Optimal Solutions

Near Optimal Solutions Many important optimization problems are lacking efficient solutions. NP-Complete problems unlikely to have polynomial time solutions. Good heuristics important for such problems.

### Approximation Algorithms. Scheduling. Approximation algorithms. Scheduling jobs on a single machine

Approximation algorithms Approximation Algorithms Fast. Cheap. Reliable. Choose two. NP-hard problems: choose 2 of optimal polynomial time all instances Approximation algorithms. Trade-off between time

### Network File Storage with Graceful Performance Degradation

Network File Storage with Graceful Performance Degradation ANXIAO (ANDREW) JIANG California Institute of Technology and JEHOSHUA BRUCK California Institute of Technology A file storage scheme is proposed

### Distributed Computing over Communication Networks: Maximal Independent Set

Distributed Computing over Communication Networks: Maximal Independent Set What is a MIS? MIS An independent set (IS) of an undirected graph is a subset U of nodes such that no two nodes in U are adjacent.

### Duplicating and its Applications in Batch Scheduling

Duplicating and its Applications in Batch Scheduling Yuzhong Zhang 1 Chunsong Bai 1 Shouyang Wang 2 1 College of Operations Research and Management Sciences Qufu Normal University, Shandong 276826, China

### A binary search algorithm for a special case of minimizing the lateness on a single machine

Issue 3, Volume 3, 2009 45 A binary search algorithm for a special case of minimizing the lateness on a single machine Nodari Vakhania Abstract We study the problem of scheduling jobs with release times

### Labeling outerplanar graphs with maximum degree three

Labeling outerplanar graphs with maximum degree three Xiangwen Li 1 and Sanming Zhou 2 1 Department of Mathematics Huazhong Normal University, Wuhan 430079, China 2 Department of Mathematics and Statistics

Online Adwords Allocation Shoshana Neuburger May 6, 2009 1 Overview Many search engines auction the advertising space alongside search results. When Google interviewed Amin Saberi in 2004, their advertisement

### Scheduling Single Machine Scheduling. Tim Nieberg

Scheduling Single Machine Scheduling Tim Nieberg Single machine models Observation: for non-preemptive problems and regular objectives, a sequence in which the jobs are processed is sufficient to describe

### ARTICLE IN PRESS. European Journal of Operational Research xxx (2004) xxx xxx. Discrete Optimization. Nan Kong, Andrew J.

A factor 1 European Journal of Operational Research xxx (00) xxx xxx Discrete Optimization approximation algorithm for two-stage stochastic matching problems Nan Kong, Andrew J. Schaefer * Department of

### Every tree contains a large induced subgraph with all degrees odd

Every tree contains a large induced subgraph with all degrees odd A.J. Radcliffe Carnegie Mellon University, Pittsburgh, PA A.D. Scott Department of Pure Mathematics and Mathematical Statistics University

### The power of -points in preemptive single machine scheduling

JOURNAL OF SCHEDULING J. Sched. 22; 5:121 133 (DOI: 1.12/jos.93) The power of -points in preemptive single machine scheduling Andreas S. Schulz 1; 2; ; and Martin Skutella 1 Massachusetts Institute of

### Diversity Coloring for Distributed Data Storage in Networks 1

Diversity Coloring for Distributed Data Storage in Networks 1 Anxiao (Andrew) Jiang and Jehoshua Bruck California Institute of Technology Pasadena, CA 9115, U.S.A. {jax, bruck}@paradise.caltech.edu Abstract

### An Approximation Algorithm for Bounded Degree Deletion

An Approximation Algorithm for Bounded Degree Deletion Tomáš Ebenlendr Petr Kolman Jiří Sgall Abstract Bounded Degree Deletion is the following generalization of Vertex Cover. Given an undirected graph

### A Sublinear Bipartiteness Tester for Bounded Degree Graphs

A Sublinear Bipartiteness Tester for Bounded Degree Graphs Oded Goldreich Dana Ron February 5, 1998 Abstract We present a sublinear-time algorithm for testing whether a bounded degree graph is bipartite

### Mean Ramsey-Turán numbers

Mean Ramsey-Turán numbers Raphael Yuster Department of Mathematics University of Haifa at Oranim Tivon 36006, Israel Abstract A ρ-mean coloring of a graph is a coloring of the edges such that the average

### Minimum Makespan Scheduling

Minimum Makespan Scheduling Minimum makespan scheduling: Definition and variants Factor 2 algorithm for identical machines PTAS for identical machines Factor 2 algorithm for unrelated machines Martin Zachariasen,

### Uniform Multicommodity Flow through the Complete Graph with Random Edge-capacities

Combinatorics, Probability and Computing (2005) 00, 000 000. c 2005 Cambridge University Press DOI: 10.1017/S0000000000000000 Printed in the United Kingdom Uniform Multicommodity Flow through the Complete

### 5.1 Bipartite Matching

CS787: Advanced Algorithms Lecture 5: Applications of Network Flow In the last lecture, we looked at the problem of finding the maximum flow in a graph, and how it can be efficiently solved using the Ford-Fulkerson

### Analysis of Algorithms, I

Analysis of Algorithms, I CSOR W4231.002 Eleni Drinea Computer Science Department Columbia University Thursday, February 26, 2015 Outline 1 Recap 2 Representing graphs 3 Breadth-first search (BFS) 4 Applications

### Key words. multi-objective optimization, approximate Pareto set, bi-objective shortest path

SMALL APPROXIMATE PARETO SETS FOR BI OBJECTIVE SHORTEST PATHS AND OTHER PROBLEMS ILIAS DIAKONIKOLAS AND MIHALIS YANNAKAKIS Abstract. We investigate the problem of computing a minimum set of solutions that

### Permutation Betting Markets: Singleton Betting with Extra Information

Permutation Betting Markets: Singleton Betting with Extra Information Mohammad Ghodsi Sharif University of Technology ghodsi@sharif.edu Hamid Mahini Sharif University of Technology mahini@ce.sharif.edu

### The chromatic spectrum of mixed hypergraphs

The chromatic spectrum of mixed hypergraphs Tao Jiang, Dhruv Mubayi, Zsolt Tuza, Vitaly Voloshin, Douglas B. West March 30, 2003 Abstract A mixed hypergraph is a triple H = (X, C, D), where X is the vertex

### On the k-path cover problem for cacti

On the k-path cover problem for cacti Zemin Jin and Xueliang Li Center for Combinatorics and LPMC Nankai University Tianjin 300071, P.R. China zeminjin@eyou.com, x.li@eyou.com Abstract In this paper we

### Large induced subgraphs with all degrees odd

Large induced subgraphs with all degrees odd A.D. Scott Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, England Abstract: We prove that every connected graph of order

### High degree graphs contain large-star factors

High degree graphs contain large-star factors Dedicated to László Lovász, for his 60th birthday Noga Alon Nicholas Wormald Abstract We show that any finite simple graph with minimum degree d contains a

### Scheduling Parallel Jobs with Monotone Speedup 1

Scheduling Parallel Jobs with Monotone Speedup 1 Alexander Grigoriev, Marc Uetz Maastricht University, Quantitative Economics, P.O.Box 616, 6200 MD Maastricht, The Netherlands, {a.grigoriev@ke.unimaas.nl,

### Constant Factor Approximation Algorithm for the Knapsack Median Problem

Constant Factor Approximation Algorithm for the Knapsack Median Problem Amit Kumar Abstract We give a constant factor approximation algorithm for the following generalization of the k-median problem. We

### Compressing Forwarding Tables for Datacenter Scalability

TECHNICAL REPORT TR12-03, TECHNION, ISRAEL 1 Compressing Forwarding Tables for Datacenter Scalability Ori Rottenstreich, Marat Radan, Yuval Cassuto, Isaac Keslassy, Carmi Arad, Tal Mizrahi, Yoram Revah

### Minimizing the Number of Machines in a Unit-Time Scheduling Problem

Minimizing the Number of Machines in a Unit-Time Scheduling Problem Svetlana A. Kravchenko 1 United Institute of Informatics Problems, Surganova St. 6, 220012 Minsk, Belarus kravch@newman.bas-net.by Frank

### Finding and counting given length cycles

Finding and counting given length cycles Noga Alon Raphael Yuster Uri Zwick Abstract We present an assortment of methods for finding and counting simple cycles of a given length in directed and undirected

### Practical Guide to the Simplex Method of Linear Programming

Practical Guide to the Simplex Method of Linear Programming Marcel Oliver Revised: April, 0 The basic steps of the simplex algorithm Step : Write the linear programming problem in standard form Linear

### Why? A central concept in Computer Science. Algorithms are ubiquitous.

Analysis of Algorithms: A Brief Introduction Why? A central concept in Computer Science. Algorithms are ubiquitous. Using the Internet (sending email, transferring files, use of search engines, online

### Chapter 13: Binary and Mixed-Integer Programming

Chapter 3: Binary and Mixed-Integer Programming The general branch and bound approach described in the previous chapter can be customized for special situations. This chapter addresses two special situations:

### Classification - Examples

Lecture 2 Scheduling 1 Classification - Examples 1 r j C max given: n jobs with processing times p 1,...,p n and release dates r 1,...,r n jobs have to be scheduled without preemption on one machine taking

### Exponential time algorithms for graph coloring

Exponential time algorithms for graph coloring Uriel Feige Lecture notes, March 14, 2011 1 Introduction Let [n] denote the set {1,..., k}. A k-labeling of vertices of a graph G(V, E) is a function V [k].

### Application Placement on a Cluster of Servers (extended abstract)

Application Placement on a Cluster of Servers (extended abstract) Bhuvan Urgaonkar, Arnold Rosenberg and Prashant Shenoy Department of Computer Science, University of Massachusetts, Amherst, MA 01003 {bhuvan,

### CSC2420 Fall 2012: Algorithm Design, Analysis and Theory

CSC2420 Fall 2012: Algorithm Design, Analysis and Theory Allan Borodin November 15, 2012; Lecture 10 1 / 27 Randomized online bipartite matching and the adwords problem. We briefly return to online algorithms

### ONLINE DEGREE-BOUNDED STEINER NETWORK DESIGN. Sina Dehghani Saeed Seddighin Ali Shafahi Fall 2015

ONLINE DEGREE-BOUNDED STEINER NETWORK DESIGN Sina Dehghani Saeed Seddighin Ali Shafahi Fall 2015 ONLINE STEINER FOREST PROBLEM An initially given graph G. s 1 s 2 A sequence of demands (s i, t i ) arriving