Adding Inferred Constraints Leads to Runtime Inefficiencies in the Optimization of Piped Water Networks

Similar documents
Scheduling Algorithm with Optimization of Employee Satisfaction

Water Supply. Simulation of Water Distribution Networks. Mohammad N. Almasri. Optimal Design of Water Distribution Networks Mohammad N.

Chapter 13: Binary and Mixed-Integer Programming

Abstract Title: Planned Preemption for Flexible Resource Constrained Project Scheduling

Discuss the size of the instance for the minimum spanning tree problem.

A Mathematical Programming Solution to the Mars Express Memory Dumping Problem

5 INTEGER LINEAR PROGRAMMING (ILP) E. Amaldi Fondamenti di R.O. Politecnico di Milano 1

Chapter 10: Network Flow Programming

CHAPTER 3 SECURITY CONSTRAINED OPTIMAL SHORT-TERM HYDROTHERMAL SCHEDULING

CHAPTER 3 WATER DISTRIBUTION SYSTEM MAINTENANCE OPTIMIZATION PROBLEM

Quantitative Analysis BA 452

OPTIMIZED STAFF SCHEDULING AT SWISSPORT

Approximation Algorithms

Student Project Allocation Using Integer Programming

OPTIMAL RESERVOIR OPERATION FOR IRRIGATION OF CROPS USING GENETIC ALGORITHM: A CASE STUDY OF SUKHI RESERVOIR PROJECT

Smart Graphics: Methoden 3 Suche, Constraints

Scenario: Optimization of Conference Schedule.

INTEGER PROGRAMMING. Integer Programming. Prototype example. BIP model. BIP models

Integrating Benders decomposition within Constraint Programming

Discrete Optimization

An Improved ACO Algorithm for Multicast Routing

A Network Flow Approach in Cloud Computing

! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. !-approximation algorithm.

Proximal mapping via network optimization

Centre for Technology Alternatives for Rural Areas. Presentation at TEQIP workshop on Water 12 th September, 2014 at IIT Bombay

Adaptive Linear Programming Decoding

A Scalable Approach for QoS-based Web Service Selection

Offline sorting buffers on Line

Linear Programming. Widget Factory Example. Linear Programming: Standard Form. Widget Factory Example: Continued.

SYSM 6304: Risk and Decision Analysis Lecture 5: Methods of Risk Analysis

A CP Scheduler for High-Performance Computers

! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. #-approximation algorithm.

How To Solve A Minimum Set Covering Problem (Mcp)

Strategic planning in LTL logistics increasing the capacity utilization of trucks

Clustering and scheduling maintenance tasks over time

CSO Modelling Considering Moving Storms and Tipping Bucket Gauge Failures M. Hochedlinger 1 *, W. Sprung 2,3, H. Kainz 3 and K.

Ant Colony Optimization and Constraint Programming

Lecture 3. Linear Programming. 3B1B Optimization Michaelmas 2015 A. Zisserman. Extreme solutions. Simplex method. Interior point method

1 Introduction. Linear Programming. Questions. A general optimization problem is of the form: choose x to. max f(x) subject to x S. where.

Optimizing the hydraulic designing of pressurized irrigation network on the lands of village Era by using the computerized model WaterGems

Branch-and-Price Approach to the Vehicle Routing Problem with Time Windows

Solving NP Hard problems in practice lessons from Computer Vision and Computational Biology

Optimization models for targeted offers in direct marketing: exact and heuristic algorithms

A MODEL TO SOLVE EN ROUTE AIR TRAFFIC FLOW MANAGEMENT PROBLEM:

Linear Programming. March 14, 2014

An Overview Of Software For Convex Optimization. Brian Borchers Department of Mathematics New Mexico Tech Socorro, NM

Factoring & Primality

Hybrid Data Envelopment Analysis and Neural Networks for Suppliers Efficiency Prediction and Ranking

The Problem of Scheduling Technicians and Interventions in a Telecommunications Company

NEMDE INTERCONNECTOR NON-PHYSICAL LOSS MODEL CHANGES - BUSINESS SPECIFICATION

Waste Collection Vehicle Routing Problem Considering Similarity Pattern of Trashcan

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

STUDY OF PROJECT SCHEDULING AND RESOURCE ALLOCATION USING ANT COLONY OPTIMIZATION 1

A* Algorithm Based Optimization for Cloud Storage

International Journal of Software and Web Sciences (IJSWS)

Transportation Polytopes: a Twenty year Update

Comparison of algorithms for automated university scheduling

Linear Programming. Solving LP Models Using MS Excel, 18

Cloud Branching. Timo Berthold. joint work with Domenico Salvagnin (Università degli Studi di Padova)

Analysis of Algorithms I: Optimal Binary Search Trees

InvGen: An Efficient Invariant Generator

A Branch and Bound Algorithm for Solving the Binary Bi-level Linear Programming Problem

A Brief Study of the Nurse Scheduling Problem (NSP)

A Genetic Algorithm Approach for Solving a Flexible Job Shop Scheduling Problem

A Working Knowledge of Computational Complexity for an Optimizer

A Column Generation Model for Truck Routing in the Chilean Forest Industry

On the Interaction and Competition among Internet Service Providers

Three Effective Top-Down Clustering Algorithms for Location Database Systems

STOCHASTIC MODELLING OF WATER DEMAND USING A SHORT-TERM PATTERN-BASED FORECASTING APPROACH

Towards a Benchmark Suite for Modelica Compilers: Large Models

24. The Branch and Bound Method

11. APPROXIMATION ALGORITHMS

5.1 Bipartite Matching

Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications

Solutions to Homework 6

Offline 1-Minesweeper is NP-complete

International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering

A Flexible Mixed Integer Programming framework for Nurse Scheduling

Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm

HYBRID GENETIC ALGORITHM PARAMETER EFFECTS FOR OPTIMIZATION OF CONSTRUCTION RESOURCE ALLOCATION PROBLEM. Jin-Lee KIM 1, M. ASCE

Experiences Using Water Network Analysis Modeling for Leak Localization

Tree based ensemble models regularization by convex optimization

A Metaheuristic Optimization Algorithm for Binary Quadratic Problems

A numerically adaptive implementation of the simplex method

Linear Programming I

A Comparison of General Approaches to Multiprocessor Scheduling

Why? A central concept in Computer Science. Algorithms are ubiquitous.

Efficient and Robust Allocation Algorithms in Clouds under Memory Constraints

Genetic Algorithm Based Interconnection Network Topology Optimization Analysis

Module 7: Hydraulic Design of Sewers and Storm Water Drains. Lecture 7 : Hydraulic Design of Sewers and Storm Water Drains

White Paper FPGA Performance Benchmarking Methodology

A New Method for Estimating Maximum Power Transfer and Voltage Stability Margins to Mitigate the Risk of Voltage Collapse

Analysis of Approximation Algorithms for k-set Cover using Factor-Revealing Linear Programs

A SIMULATION MODEL FOR RESOURCE CONSTRAINED SCHEDULING OF MULTIPLE PROJECTS

OPRE 6201 : 2. Simplex Method

Transcription:

Adding Inferred Constraints Leads to Runtime Inefficiencies in the Optimization of Piped Water Networks Nikhil Hooda Department of Computer Science and Engineering Indian Institute of Technology, Bombay nikhilh@cse.iitb.ac.in Om Damani Department of Computer Science and Engineering Indian Institute of Technology, Bombay damani@cse.iitb.ac.in ABSTRACT Pressure-constrained piped water network cost optimization is an important problem with significant real world impact. The main design decision to be made for a piped water network is the choice of pipe diameters from a set of commercially available pipe diameters. Rural water networks in the developing world are typically branched networks with a single water source. For such networks, existing methods either solve the constrainedoptimization problem heuristically, or optimally solve the specific case of single pipe segment per link. In the present study, we discuss two approaches to solve the general problem optimally. We show that the approach that attempts to model constraints capturing the inferred structure of the optimal solution results in much worse runtime performance compared to a more general approach. The general formulation proposed is optimal as well as faster than both the previous formulations for the specific problem of one pipe segment per link and the heuristic approach used to solve the general problem. We have implemented the general formulation in a constraint optimization tool and have made it publicly available. Categories and Subject Descriptors G.1.6 [Optimization]: Constrained optimization General Terms Design, Performance. Keywords Problem Modeling, Network Optimization, Water Network, Optimal Solution Structure, Inferred Constraints 1 INTRODUCTION Piped water network cost optimization has been studied for more than 30 years now. Several constrained optimization techniques from Linear Programming [1] to Genetic Algorithms [14] to newer meta-heuristics like Tabu Search [3] and Shuffled Frog Leaping Algorithm [4] have been employed to solve various variations of the cost optimization problem. In this work, we focus on the cost optimization of rural piped water networks. These networks are typically gravity fed, since reliable electricity supply is not a given. Pumps are deployed only at the water source and not in the rest of the distribution network. Acyclic (branched) networks are common since the redundancy provided by cyclic (looped) networks is an unaffordable luxury. One of the most important aspects in the design of these systems is the choice of pipe diameters from a discrete set of commercially available pipe diameters. In general, each link (connection between two nodes) can consist of several pipe segments of differing diameters. Larger the pipe diameters, better the service (pressure), but higher is the capital cost. The branched piped water network cost optimization problem is the selection of pipe diameters that minimize the system cost while providing the requisite service (pressure at demand points). BRANCH [12] is an optimization tool by the World Bank that attempts to minimize pipe cost for branched pipe networks with a single water source. Though it has limited capabilities in terms of number of pipes (at most 125), and does not guarantee optimal solution, it is used ([10], [15], [16], [17]) in the developing world as the only alternatives are expensive commercial tools like WATERGEMS [18]. Even WATERGEMS does not guarantee optimal solution since it uses a genetic algorithm. In [13], an ILP formulation is proposed for the special case of one pipe diameter per link. While, ILP in general is -hard, we found that for network sizes of real world interest, optimal solution to the special formulation could be computed in reasonable time. This means that currently one can either get an optimal solution for the special case of one piped segment per link [13] or get a non-optimal solution for the general case of multiple pipe segments per link [12, 18]. In this work, our aim is to come up with a formulation that solves the general formulation while still maintaining optimality. For the general problem, in addition to the user provided pressure and flow related constraints, following additional constraint is inferred in [5]: in the optimal solution, each link consists of at most two pipe segments of adjacent diameters from the available diameter set. We take two approaches to the multiple pipes per link formulation. In the first approach, we use the fact that the optimal solution will contain at most two pipe segments per link [5]. In the other approach, we make no assumption on the number of pipe segments per link. Surprisingly, the latter formulation s runtime performance is much better than the former. Of course, both formulations produce the same solution (the optimal solution). But knowing the structure of the optimal solution and trying to capture that as a constraint led to a much worse runtime performance! The second formulation solves the problem optimally and efficiently. Its solution indeed only contains at most two pipe diameters for each link in the network. The structure that we were trying to enforce for the first formulation has come out naturally in the second. The second formulation not only manages to optimally solve the general case, it also has a runtime performance that is better than both the heuristic approach to the general problem [12] as well as the optimal approach to the specific problem [13]. Using the second general formulation we have implemented this constraint optimization model, in a water network design system called JT (name anonymized). The overall goal for JT is wide reaching and will attempt to solve several network design

constraints like source selection, storage location/capacity, choice of pipe diameters, water supply scheduling, cost allocation etc. The scope of the present study is restricted to determining the pipe diameters in a single source acyclic (branched) network. In summary, we make the following contributions in this work: We present a general formulation that is optimal as well as faster than both the optimal approach for the specific problem (one pipe per link) and the heuristic approach for the general problem (multiple pipes per link). We demonstrate that quite counter-intuitively, modeling of inferred constraints that capture the structure of the optimal solution worsens the runtime performance. We have implemented this constraint optimization model in JT system for the design of piped water networks. The rest of the paper is structured as follows. Section 2 describes the problem formulation. Section 3 is a brief description of the environment used to build the JT system. Section 4 describes the six networks that we test our model on. Section 5 presents the performance comparison results. Conclusions and future work directions are presented in Section 6. 2 PROBLEM FORMULATION 2.1 Problem Definition The typical rural piped water network consists of a source MBR (mass balancing reservoir), demand nodes, and pipes connecting them. If required, pumps are used to supply water from a water body to the MBR which then feeds the remaining network by gravity. A gravity fed network has its MBR located at a higher elevation so that no pumping is required for the downstream network. An example network is shown in Figure 1. It consists of 5 nodes and 4 links. Node 1 is the source for the network, and nodes 2, 4 and 5 are the demand points. Node 3 is a zero demand node. Such nodes are typically introduced at intermediate points of high elevation. Figure 1: Gen_5 network Nodes in the network have water demands and minimum pressure requirements that must be maintained. The design process involves selecting the diameters for the pipes that must be used to supply these nodes. Lower the diameter, lower is the cost of the pipes. But lower diameters cause higher friction-losses in the pipes which may lead to insufficient pressures at the demand nodes. Therefore the optimization goal is to minimize pipe cost under the constraint of minimum pressure requirements at the demand nodes. The choice of diameter is to be made from a discrete set of commercial pipes that are available. Rural piped water networks are typically branched (acyclic) as compared to urban networks which are frequently looped. In our present study we restrict ourselves to single source acyclic piped water networks. Since the network is acyclic, flow in each link can be computed easily from the node demands. The pumping from source to MBR is not considered. Instead we assume a source that is able to maintain the supply of water at the given head (head at a point is the height to which water naturally rises. The pressure at a point is the difference between its head and elevation.). Formally, we are solving the following optimization problem: 2.1.1 Input Source node: head Node: elevation, water demand, min pressure requirement Link: start/end node, length Commercial pipe diameter: cost per unit length 2.1.2 Output Length and diameter of pipe segments for each link 2.1.3 Objective Minimize total pipe cost 2.1.4 Constraints Pressure at each node must exceed min. pressure specified Water demand must be met at each node Pipe diameters can only take values from provided commercial pipe diameters 2.2 One Pipe Per Link [13] has an Integer Linear Programming (ILP) formulation for the problem where only one pipe diameter is used per link. It uses binary variables to represent the choice of commercial pipe to be made for each link. Henceforth we use the term OnePipe model while referring to this formulation. 2.2.1 The Objective Function: The objective function to be minimized is the total cost of the pipes chosen for the links in the network. O(. ) = L i C i (D i ) i=1 O(.): The total pipe cost which is a function of the pipe diameters chosen for each link : The number of links in the network Di: Pipe diameter for the link i Li: Length of link i Ci: /length of link i, a function of the pipe diameter Di The diameters Di can only be chosen from the set of available commercial pipe diameters. This restriction is represented via binary variables xij. The modified objective function is: O(. ) = L i C ij (D ij )x ij i=1 = of commercially available pipe diameters xij = 1 if the i th link uses the j th pipe diameter, else 0 2.2.2 Pipe constraint: For each link the objective function has terms for each of the available commercial pipe diameters. Since each link is to be installed with only one diameter, it means that exactly one of the binary variables corresponding to each link must be one. Therefore we get the following pipe constraint for each link i: x ij = 1

2.2.3 Node constraint: At each node a min. amount of pressure needs to be maintained. The pressure at any node is calculated from the headloss in the pipes connecting the node to the reference node i.e. the source for the network. Therefore the pressure constraint for each node n is: P n H R E n HL i i S n Pn: The min. pressure that must be maintained at node n HR: The head supplied by reference node En: The elevation of node n Sn: Set of pipes that connect node n to the reference node R HLi: Headloss in link i. It is modeled as summation over HLij headloss if the pipe diameter j is chosen. We use the Hazen- Williams formula for headloss which is given by: 1.852 flow 10.68 L i i roughness j HL ij = 4.87 diameter j As with the objective cost function, we introduce xij to linearize the node constraint which gives us: P n H R E n HL ij x ij i S n Now for a given (i, j), HLij is just a constant, so we get a linear constraint with xij as our variables. 2.3 Two Pipes Per Link The previous formulation assumes a single pipe diameter for each link in the network. But this might not be (and usually isn t) the optimal solution. In the most general case, we could have several pipes of varying diameters for a single link in the network. In [5] it is shown that in the optimal solution each link will consist of at most two pipe segments of adjacent diameters. The above will hold if the commercial pipe cost is a convex function of a power of its diameter, which is the case in practice. To incorporate this knowledge about optimal solution structure we modify the objective function as given next. 2.3.1 The Objective Function: To generalize the formulation in order to account for multiple pipes per link we introduce two new continuous variables for each link: li1 and li2. The previous boolean variable representing the choice of commercial pipe for each link is now broken into two: xij1 and xij2. Here li1/ li2 represent the length of the two pipes for link i and xij1/ xij2 represent the choice of diameters for the two pipes. The modified objective function is: O(. ) = C ij1 (D ij1 )l i1 x ij1 + C ij2 (D ij2 )l i2 x ij2 i=1 This formulation is not linear since we have terms like li1xij1 both of which are variables. We linearize this equation by introducing: z ij1 = l i1 x ij1 This nonlinear equality is represented by the following set of linear inequalities: z ij1 L i x ij1 z ij1 l i1 z ij1 l i1 L i (1 x ij1 ) A similar set of inequalities will be introduced for zij2 as well. This linearization is possible since z is a product of a continuous variable and a boolean variable. Our new objective function: O(. ) = C ij1 (D ij1 )z ij1 + C ij2 (D ij2 )z ij2 i=1 2.3.2 Pipe Constraint As before, the choice of pipe diameter must be made from the available commercial pipe diameters. x ij1 = 1 Similar equalities hold for xij2. We also have following additional constraint for each link i. l i1 + l i2 = L i 2.3.3 Node Constraint The node constraint for minimum pressure requirement now changes since length of each pipe segment is no longer a constant. The headloss HLi in each link i now becomes: HL ij x ij1 l i1 + HL ij x ij2 l i2 flow 10.68 i roughness j HL ij = 4.87 diameter j 1.852 This is possible since HLij is linear in the length of the pipe. Therefore the new node constraint is: P n H R E n HL ij (z ij1 + z ij2 ) i S n 2.3.4 Performance On testing the new formulation, the runtime performance is found to be very poor. Whereas for the OnePipe model a 100 node network can be solved in 1.5 seconds, the new formulation cannot solve optimally even a 10 node network before getting timed out at 100 seconds. The reason for this is the large number of new constraints that are added to the system. Previously all constraints were linear in the number of nodes. We had only one constraint for each link and one for each node (for an acyclic network number of links is one less than the number of nodes). But now, in order to introduce zij1 and zij2, we have introduced 6 new constraints for each (link, pipe diameter) combination. 2.4 Proposed Model In the previous formulation in order to introduce multiple pipes per link, we determine the lengths of the two pipes in the link and independently assign a commercial pipe diameter to each. This causes a blowup in the number of constraints. To overcome this constraint blowup, we decided to ignore the two pipe segment structure of the original solution and modeled each link as being made up of pipes for each possible commercial pipe diameter. All that needs to be determined is the length of each component. No explicit choice is being made as to which commercial pipe diameter is chosen. The pipe component is chosen if it has non-zero length. We introduce continuous variables lij and do away with the binary choice variables xij. Each link is made up of components

corresponding to the pipe diameters. lij then represents the length of each of these components. 2.4.1 The Objective Function: We replace the terms li*xij with lij in our objective cost: O(. ) = C ij (D ij )l ij i=1 2.4.2 Pipe constraint: Now the pipe constraint simply reduces to: 4.1 Real World Networks 4.1.1 Network 1: Mokhada This is a 37 node network, designed to provide water to 20 villages in the Mokhada taluka of Thane district [10]. Figure 2 shows the villages and road network mapped out on Google Earth. l ij = L i 2.4.3 Node constraint: In the node constraint we replace xij * Li with lij: P n H R E n HL ij l ij i S n We now have a more general formulation for our pipe diameter optimization problem. We have the added benefit of not restricting the solution to two pipes per link. If the necessary convexity conditions [5] are met, our solution should naturally contain only two pipes per link with adjacent diameters. As in the case of OnePipe the number of constraints is linear in the number of nodes and the performance turns out to be much better. Table 1 shows the size of model in terms of variables and constraints for the two approaches. Table 1: Model Size (m is the number of pipe diameters which is taken as 10) Two Pipe Approach General Approach Node Constraints Variables Constraints Variables n 6n + 8nm 2n + 4nm 2n nm 10 860 420 20 200 100 8600 4200 200 2000 For a 100 node network we have around 200 inequalities for the general formulation. But for the two-pipe model, a network with just 10 nodes the number of constraints is around 800! Also by eliminating binary variables and replacing them with continuous variables we have converted what is an Integer Linear Program (ILP) to a Linear Program (LP). While ILPs are hard [11], LPs can be solved in Polynomial time [6]. This is very significant since we are able to solve the LP formulation of a 1000 node network in two seconds even though it contains 2000 constraints. 3 JT System Description The above model has been implemented in the JT system. The implementation is done using Java 7 [8] and GLPK 4.55[7] Linear Program Solver. Java ILP 1.2a [9] is used as the Java interface to the GLPK library. The system description and download is available at http://www.cse.iitb.ac.in/~nikhilh/jaltantra 4 Input Networks We have tested the OnePipe and our proposed model as well as BRANCH over six different networks. Three of these are real world examples from villages of Thane district in Maharashtra, India. The other three networks are artificially generated to test the system performance over a range of network sizes. Figure 2: Mokhada Network Layout 4.1.2 Network 2: Shahpur This is a 21 node network, designed to provide drinking water to 12 villages in the Shahpur taluka of Thane district. 4.1.3 Network 3: Khardi This is an 11 node network, designed to provide drinking water to Khardi town and nearby villages in Thane district [15]. 4.2 Synthetic Networks 4.2.1 Network 4: Gen_5 Gen_5 is a small 5 node network designed to help analyze the difference between a single pipe per link approach versus a multiple pipe per link approach. Figure 1 shows the network layout and Tables 2, 3 and 4 contain the node, link, and commercial pipe information. Table 2: Gen_5 Node Data Node Elevation Demand (L/s) 1 356-6.66 2 346 1.16 3 332 0.00 4 325 3.50 5 296 2.00 Link Table 3: Gen_5 Link Data Start Node End Node Link Length 1 1 2 1561 2 2 3 1231 3 2 5 3589 4 3 4 2135

Table 4: Gen_5 Commercial Pipe Data Diameter (mm) per metre (Rs) 63 107 75 150 90 215 110 319 125 413 140 517 180 851 250 1500 280 1900 315 2400 4.2.2 Networks 5 and 6: Gen_100 and Gen_1000 Networks 5 and 6 contain 100 and 1000 nodes respectively. They were randomly generated to test the performance and scalability of the proposed model. 5 Experimental Results We solve the optimization problem for these six test networks using BRANCH software, OnePipe model as well as our general model. Both the OnePipe model and the proposed model are tested using the GLPK LP solver library. A timeout of 100 seconds is used. If the timeout period elapses, the solver outputs the best feasible result found so far rather than the global optimum. 5.1.1 Gen_5 Network The results of running our proposed model on Gen_5 are shown in Tables 5 and 6. Note that as expected, the maximum number of pipes per link is two and that the diameters for them are indeed adjacent in the list of commercial pipes. Node 1 being the water supply source is shown as having negative demand. In Table 7 we compare the output of the different models for Gen_5. Since the proposed model solves the problem optimally, it has better cost than both BRANCH and the OnePipe model. Table 5: Gen_5 Node Results Demand Elevation Head (L/s) 1-6.66 356 356 0 2 1.16 346 353 7 3 0.00 332 341 9 4 3.50 325 332 7 5 2.00 296 324 28 Node Pressure Table 6: Gen_5 Link Results Link Flow (L/s) Diameter (mm) Length Headloss 1 6.66 125 140 563 998 1.48 1.52 2 3.5 75 1231 11.86 3 2 63 3589 28.68 4 3.5 75 90 118 2017 1.13 8.00 Table 7: Comparison of Pipe Results for Gen_5 BRANCH OnePipe Model Proposed Model Pipe Length Dia Length Dia Length Dia (mm) (mm) (mm) 1 519 125 1561 140 563 125 1042 140 998 140 2 1231 75 1231 75 1231 75 3 3589 63 3589 63 3589 63 4 52 75 2135 90 118 75 2083 90 2017 90 (1k Rs) 1958 2020 1949 5.1.2 Performance comparison over all six networks Table 8 presents the performance in terms of objective cost as well as running time of the three methods over all 6 test networks. For the Gen_100 network, BRANCH terminated with a memory overflow message. Since the stated maximum number of nodes is 125, the Gen_1000 network is not run on BRANCH. The time taken by both the OnePipe model and our proposed model is less than half a second for the first four networks. But the ILP vs. LP nature is borne out for the two larger networks. In fact, for the Gen_1000 the OnePipe model also gets timed out. Being optimal, our proposed model indeed outperforms both BRANCH and the OnePipe model for all six networks. In terms of objective cost. Both BRANCH and our proposed model use a more general formulation and hence better cost results are to be expected. Our model performs better than BRANCH since we use a LP formulation that is solved optimally whereas BRANCH uses a heuristic approach. In fact, for two of the first four networks even the OnePipe model performs better than BRANCH. Table 8: Comparison of Performance over all six networks BRANCH OnePipe Model Proposed Model Network of Nodes (1000 Rs) (1000 Rs) (1000 Rs) Mokhada 37 24,652 119380 24,317 429 24,181 387 Shahpur 21 29,082 34878 29,523 356 28,895 318 Khardi 11 21,281 4207 21,267 256 21,184 247 Gen_5 5 1958 1105 2020 243 1949 228 Gen_100 100 - - 93,718 1531 90,512 628 Gen_1000 1000 - - 5,80,578 out 5,62,564 2023

5.1.3 Choice of the LP solver library To select an LP solver library, we tested the performance of the two popular open source LP solvers: GLPK[7] and lp_solve[2]. For the LP formulation of our proposed model, there is no significant difference in runtimes. However in the case of the ILP formulation of the OnePipe model, GLPK performs much better for larger networks. The results for both, over all six networks for the OnePipe formulation are given in Table 9. In the case of Gen_1000, the lp_solve library cannot even find a feasible solution before the timeout period of 100 seconds. Table 9: Comparison of the two LP solvers for OnePipe model lp_solve GLPK Network No. of Nodes (1k Rs) (1k Rs) Mokhada 37 24,317 388 24,317 429 Shahpur 21 29,523 296 29,523 356 Khardi 11 21,267 228 21,267 256 Gen_5 5 2020 281 2,020 243 Gen_100 100 94,483 out 93,718 1531 Gen_1000 1000 - - 5,80,578 out 6 Conclusions For the cost optimization of piped water networks, we have presented a general formulation that is optimal as well as faster than both the previous formulations for the specific problem of one pipe segment per link, and the heuristic approach used in available software. In the process, we found that, quite counter-intuitively, modeling of inferred constraints that capture the structure of the optimal solution worsens the runtime performance. We have implemented our solution in a water network design system JT. Currently the model developed for JT only allows for single source branched networks. Future work would lie in generalizing the networks to be multi-sourced as well as looped. More pipe water network components like valves and pumps can be part of the network design. Reservoir costs are an important component of the capital cost of a piped water network. Reservoir locations and elevations are currently inputs to the optimizer. But these can also be part of the optimization problem by adding the reservoir costs to the objective function and adding relevant constraints. 7 References [1] Alperovits, E., and U. Shamir. "Design of optimal water distribution systems." Water resources research 13.6 (1977): 885-900. [2] Berkelaar, Michel, Kjell Eikland, and Peter Notebaert. "lp solve 5.5, open source (mixed-integer) linear programming system. Software, May 1 2004." [3] da Conceicao Cunha, Maria, and Luisa Ribeiro. "Tabu search algorithms for water network optimization." European Journal of Operational Research 157.3 (2004): 746-758. [4] Eusuff, Muzaffar M., and Kevin E. Lansey. "Optimization of water distribution network design using the shuffled frog leaping algorithm." Journal of Water Resources Planning and Management 129.3 (2003): 210-225. [5] Fujiwara, Okitsugu, and Debashis Dey. "Two adjacent pipe diameters at the optimal solution in the water distribution network models." Water Resources Research 23.8 (1987): 1457-1460. [6] Karmarkar, Narendra. "A new polynomial-time algorithm for linear programming." Proceedings of the sixteenth annual ACM symposium on Theory of Computing. ACM, 1984. [7] GLPK (GNU Linear Programming Kit). https://www.gnu.org/software/glpk/ (September, 2014) [8] Java 7 https://java.com/en/download/index.jsp (September, 2014) [9] Java ILP Java interface to ILP solvers. http://javailp.sourceforge.net/ (September, 2014) [10] Nikhil Hooda, Rajaram Desai, and Om P. Damani. Design and Optimization of Piped Water Network for Tanker Fed Villages in Mokhada Taluka Technical Report No. TR- CSE-2013e-55, 2013, Dept. of Computer Science and Engineering, IIT Bombay. [11] Papadimitriou, Christos H. "On the complexity of integer programming." Journal of the ACM (JACM) 28.4 (1981): 765-768. [12] Prasad M. Modak, Juzer Dhoonia, A Computer Program in Quick BASIC for the Least Design of Branched Water Distribution Networks. UNDP/ World Bank, Asia Water Supply and Sanitation Sector Development Project, RAS/86/160, December 1991. [13] Samani, Hossein MV, and Alireza Mottaghi. "Optimization of water distribution networks using integer linear programming." Journal of Hydraulic Engineering 132.5 (2006): 501-509. [14] Savic, Dragan A., and Godfrey A. Walters. "Genetic algorithms for least-cost design of water distribution networks." Journal of Water Resources Planning and Management 123.2 (1997): 67-77. [15] V. Choudhary, O. Damani, R. Desai, A. Joshi, M. Kanwat, M. Kaushal, S. Kharole, Y. Pawde, P. Rathore, and M. Sohoni. Redesigning Khardi Rural Piped Water Network Scheme for Sustainability Technical Report No. TR-CSE- 2013-56, 2013, Dept. of Computer Science and Engineering, IIT Bombay. [16] Vyas, Janki H., Narendra J. Shrimali, and Mukesh A. Modi. Optimization of Dhrafad Regional Water Supply Scheme using Branch 3.0., IJIRSET 2014 [17] Yugandhara Lad, J S Main, and S D Chawathe. Optimization of Hydraulic Design of Water Supply Tree Network Based on Present Worth Analysis. Journal of Indian Water Works Association. 2012 [18] Wu, Zheng Yi. "Genetic algorithm plays a role in municipal water systems." ACM SIGEVOlution 1.4 (2006): 2-8.