Nonlinear Optimization: Algorithms 3: Interior-point methods
|
|
|
- Matilda Miles
- 9 years ago
- Views:
Transcription
1 Nonlinear Optimization: Algorithms 3: Interior-point methods INSEAD, Spring 2006 Jean-Philippe Vert Ecole des Mines de Paris Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.1/32
2 Outline Inequality constrained minimization Logarithmic barrier function and central path Barrier method Feasibility and phase I methods Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.2/32
3 Inequality constrained minimization Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.3/32
4 Setting We consider the problem: minimize f(x) subject to g i (x) 0, i = 1,...,m, Ax = b, f and g are supposed to be convex and twice continuously differentiable. A is a p n matrix of rank p < n (i.e., fewer equality constraints than variables, and independent equality constraints). We assume f is finite and attained at x Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.4/32
5 Strong duality hypothesis We finally assume the problem is strictly feasible, i.e., there exists x with g i (x) < 0,i = 1,...,m, and Ax = 0. This means that Slater s constraint qualification holds = strong duality holds and dual optimum is attained, i.e., there exists λ R p and µ R m which together with x satisfy the KKT conditions: f (x ) + m i=1 Ax = b g i (x ) 0, i = 1,...,m µ 0 µ i g i (x ) + A λ = 0 µ ig i (x ) = 0, i = 1,...,m. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.5/32
6 Examples Many problems satisfy these conditions, e.g.: LP, QP, QCQP Entropy maximization with linear inequality constraints minimize subject to n x i log x i i=1 Fx g Ax = b. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.6/32
7 Examples (cont.) To obtain differentiability of the objective and constraints we might reformulate the problem, e.g: ( ) minimize a i x + b i max i=1,...,n with nondifferentiable objective is equivalent to the LP: minimize t subject to a i x + b t, i = 1,...,m. Ax = b. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.7/32
8 Overview Interior-point methods solve the problem (or the KKT conditions) by applying Newton s method to a sequence of equality-constrained problems. They form another level in the hierarchy of convex optimization algorithms: Linear equality constrained quadratic problems (LCQP) are the simplest (set of linear equations that can be solved analytically) Newton s method: reduces linear equality constrained convex optimization problems (LCCP) with twice differentiable objective to a sequence of LCQP. Interior-point methods reduce a problem with linear equality and inequality constraints to a sequence of LCCP. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.8/32
9 Logarithmic barrier function and central path Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.9/32
10 Problem reformulation Our goal is to approximately formulate the inequality constrained problem as an equality constrained problem to which Newton s method can be applied. To this end we first hide the inequality constraint implicit in the objective: m minimize f(x) + subject to Ax = b, i=1 I (g i (x)) where I : R R is the indicator function for nonpositive reals: { 0 if u 0, I (u) = + if u > 0. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.10/32
11 Logarithmic barrier The basic idea of the barrier method is to approximate the indicator function I by the convex and differentiable function Î (u) = 1 t log( u), u < 0, where t > 0 is a parameter that sets the accuracy of the prediction. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.11/32
12 Problem reformulation Subsituting Î for I in the optimization problem gives the approximation: m minimize f(x) + subject to Ax = b, i=1 1 t log ( g i(x)) The objective function of this problem is convex and twice differentiable, so Newton s method can be used to solve it. Of course this problem is just an approximation to the original problem. We will see that the quality of the approximation of the solution increases when t increases. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.12/32
13 Logarithmic barrier function The function φ(x) = m log ( g i (x)) i=1 is called the logarithmic barrier or log barrier for the original optimization problem. Its domain is the set of points that satisfy all inequality constraints strictly, and it grows without bound if g i (x) 0 for any i. Its gradient and Hessian are given by: φ(x) = 2 φ(x) = m i=1 m i=1 1 g i (x) g i(x), 1 g i (x) 2 g i(x) g i (x) + m i=1 1 g i (x) 2 g i (x). Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.13/32
14 Central path Our approximate problem is therefore (equivalent to) the following problem: minimize tf(x) + φ(x) subject to Ax = b. We assume for now that this problem can be solved via Newton s method, in particular that it has a unique solution x (t) for each t > 0. The central path is the set of solutions, i.e.: {x (t) t > 0}. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.14/32
15 Characterization of the central path A point x (t) is on the central path if and only if: it is strictly feasible, i.e., satisfies: Ax (t) = b, g i (x (t)) < 0, i = 1,...,m. there exists a ˆλ R p such that: 0 = t f (x (t)) + φ (x (t)) + A ˆλ m = t f (x 1 (t)) + g i (x (t)) g i (x (t)) + A ˆλ. i=1 Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.15/32
16 Example: LP central path The log barrier for a LP: is given by minimize c x subject to Ax b, φ(x) = m log i=1 ( ) b i a i x where a i is the ith row of A. Its derivatives are:, φ(x) = m i=1 1 b i a i xa i, 2 φ(x) = m i=1 1 ( bi a i x) 2 a ia i. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.16/32
17 Example (cont.) The derivatives can be rewritten more compactly: φ(x) = A d, 2 φ(x) = A diag(d) 2 A, where d R m is defined by d i = 1/ ( b i a i x). The centrality condition for x (t) is: tc + A d = 0 = at each point on the central path, φ(x) is parallel to c. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.17/32
18 Dual points on central path Remember that x = x (t) if there exists a w such that t f (x (t)) + m i=1 1 g i (x (t)) g i (x (t)) + A ˆλ = 0, Ax = b. Let us now define: µ i(t) = 1 tg i (x (t)), i = 1,...,m, λ (t) = ˆλ t. We claim that the pair λ (t),µ (t) is dual feasible. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.18/32
19 Dual points on central path (cont.) Indeed: µ (t) > 0 because g i (x (t)) < 0 x (t) minimizes the Lagrangian L (x,λ (t),µ (t)) = f(x)+ m i=1 µ i (t)g i(x)+λ (t) (Ax b). Therefore the dual function q (µ (t),λ (t)) is finite and: q (µ (t),λ (t)) = L (x (t),λ (t),µ (t)) = f (x (t)) m t Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.19/32
20 Convergence of the central path From the equation: q (µ (t),λ (t)) = f (x (t)) m t we deduce that the duality gap associated with x (t) and the dual feasible pair λ (t),µ (t) is simply m/t. As an important consequence we have: f (x (t)) f m t This confirms the intuition that f (x (t)) f if t. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.20/32
21 Interpretation via KKT conditions We can rewrite the conditions for x to be on the central path by the existence of λ,µ such that: 1. Primal constraints: g i (x) 0, Ax = b 2. Dual constraints : µ 0 3. approximate complementary slackness: µ i g i (x) = 1/t 4. gradient of Lagrangian w.r.t. x vanishes: f(x) + m i=1 µ i g i (x) + A λ = 0 The only difference with KKT is that 0 is replaced by 1/t in 3. For large t, the point on the central path almost satisfies the KKT conditions. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.21/32
22 The barrier method Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.22/32
23 Motivations We have seen that the point x (t) is m/t-suboptimal. In order to solve the optimization problem with a guaranteed specific accuracy ɛ > 0, it suffices to take t = m/ɛ and solve the equality constrained problem: minimize m ɛ f(x) + φ(x) subject to Ax = b by Newton s method. However this only works for small problems, good starting points and moderate accuracy. It is rarely, if ever, used. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.23/32
24 Barrier method given strictly feasible x, t = t (0) > 0,µ > 1, tolerance ɛ > 0. repeat 1. Centering step: compute x (t) by minimizing tf + φ, subject to Ax = b 2. Update: x := x (t). 3. Stopping criterion: quit if m/t < ɛ. 4. Increase t: t := µt. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.24/32
25 Barrier method: Centering Centering is usually done with Newton s method, starting at current x Inexact centering is possible, since the goal is only to obtain a sequence of points x (k) that converges to an optimal point. In practice, however, the cost of computing an extremely accurate minimizer of tf 0 + φ as compared to the cost of computing a good minimizer is only marginal. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.25/32
26 Barrier method: choice of µ The choice of µ involves a trade-off For small µ, the initial point of each Newton process is good and few Newton iterations are required; however, many outer loops (update of t) are required. For large µ, many Newton steps are required after each update of t, since the initial point is probably not very good. However few outer loops are required. In practice µ = works well. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.26/32
27 Barrier method: choice of t (0) The choice of t (0) involves a simple trade-off if t (0) is chosen too large, the first outer iteration will require too many Newton iterations if t (0) is chosen too small, the algorithm will require extra outer iterations Several heuristics exist for this choice. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.27/32
28 Example: LP in inequality form m = 100 inequalities, n = 50 variables. start with x on central paht (t (0) = 1, duality gap 100), terminates when t = 10 8 (gap 10 6 ) centering uses Newton s method with backtracking total number of Newton iterations not very sensitive for µ > 10 Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.28/32
29 Example: A family of standard LP minimize c x subject to Ax = b, x 0 for A R m 2m. Test for m = 10,...,1000: The number of iterations grows very slowly as m ranges over a 100 : 1 ratio. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.29/32
30 Feasibility and phase I methods The barrier method requires a strictly feasible starting point x (0) : g i (x (0)) < 0,i = 1,...,m Ax (0) = 0. When such a point is not known, the barrier method is preceded by a preliminary stage, called phase I, in which a strictly feasible point is computed. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.30/32
31 Basic phase I method minimize s subject to g i (x) s, i = 1,...,m, Ax = b, this problem is always strictly feasible (choose any x, and s large enough). apply the barrier method to this problem = phase I optimization problem. If x,s feasible with s < 0 then x is strictly feasible for the initial problem If f > 0 then the original problem is infeasible. Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.31/32
32 Primal-dual interior-point methods A variant of the barrier method, more efficient when high accurary is needed update primal and dual variables at each iteration: no distinction between inner and outer iterations often exhibit superlinear asymptotic convergence search directions can be interpreted as Newton directions for modified KKT conditions can start at infeasible points cost per iteration same as barrier method Nonlinear optimization c 2006 Jean-Philippe Vert, ([email protected]) p.32/32
Duality in General Programs. Ryan Tibshirani Convex Optimization 10-725/36-725
Duality in General Programs Ryan Tibshirani Convex Optimization 10-725/36-725 1 Last time: duality in linear programs Given c R n, A R m n, b R m, G R r n, h R r : min x R n c T x max u R m, v R r b T
Nonlinear Programming Methods.S2 Quadratic Programming
Nonlinear Programming Methods.S2 Quadratic Programming Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard A linearly constrained optimization problem with a quadratic objective
Numerisches Rechnen. (für Informatiker) M. Grepl J. Berger & J.T. Frings. Institut für Geometrie und Praktische Mathematik RWTH Aachen
(für Informatiker) M. Grepl J. Berger & J.T. Frings Institut für Geometrie und Praktische Mathematik RWTH Aachen Wintersemester 2010/11 Problem Statement Unconstrained Optimality Conditions Constrained
4.6 Linear Programming duality
4.6 Linear Programming duality To any minimization (maximization) LP we can associate a closely related maximization (minimization) LP. Different spaces and objective functions but in general same optimal
Summer course on Convex Optimization. Fifth Lecture Interior-Point Methods (1) Michel Baes, K.U.Leuven Bharath Rangarajan, U.
Summer course on Convex Optimization Fifth Lecture Interior-Point Methods (1) Michel Baes, K.U.Leuven Bharath Rangarajan, U.Minnesota Interior-Point Methods: the rebirth of an old idea Suppose that f is
Lecture 3. Linear Programming. 3B1B Optimization Michaelmas 2015 A. Zisserman. Extreme solutions. Simplex method. Interior point method
Lecture 3 3B1B Optimization Michaelmas 2015 A. Zisserman Linear Programming Extreme solutions Simplex method Interior point method Integer programming and relaxation The Optimization Tree Linear Programming
2.3 Convex Constrained Optimization Problems
42 CHAPTER 2. FUNDAMENTAL CONCEPTS IN CONVEX OPTIMIZATION Theorem 15 Let f : R n R and h : R R. Consider g(x) = h(f(x)) for all x R n. The function g is convex if either of the following two conditions
Linear Programming Notes V Problem Transformations
Linear Programming Notes V Problem Transformations 1 Introduction Any linear programming problem can be rewritten in either of two standard forms. In the first form, the objective is to maximize, the material
Support Vector Machines
Support Vector Machines Charlie Frogner 1 MIT 2011 1 Slides mostly stolen from Ryan Rifkin (Google). Plan Regularization derivation of SVMs. Analyzing the SVM problem: optimization, duality. Geometric
Support Vector Machine (SVM)
Support Vector Machine (SVM) CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Margin concept Hard-Margin SVM Soft-Margin SVM Dual Problems of Hard-Margin
10. Proximal point method
L. Vandenberghe EE236C Spring 2013-14) 10. Proximal point method proximal point method augmented Lagrangian method Moreau-Yosida smoothing 10-1 Proximal point method a conceptual algorithm for minimizing
Interior Point Methods and Linear Programming
Interior Point Methods and Linear Programming Robert Robere University of Toronto December 13, 2012 Abstract The linear programming problem is usually solved through the use of one of two algorithms: either
1 Introduction. Linear Programming. Questions. A general optimization problem is of the form: choose x to. max f(x) subject to x S. where.
Introduction Linear Programming Neil Laws TT 00 A general optimization problem is of the form: choose x to maximise f(x) subject to x S where x = (x,..., x n ) T, f : R n R is the objective function, S
Discrete Optimization
Discrete Optimization [Chen, Batson, Dang: Applied integer Programming] Chapter 3 and 4.1-4.3 by Johan Högdahl and Victoria Svedberg Seminar 2, 2015-03-31 Todays presentation Chapter 3 Transforms using
Duality in Linear Programming
Duality in Linear Programming 4 In the preceding chapter on sensitivity analysis, we saw that the shadow-price interpretation of the optimal simplex multipliers is a very useful concept. First, these shadow
constraint. Let us penalize ourselves for making the constraint too big. We end up with a
Chapter 4 Constrained Optimization 4.1 Equality Constraints (Lagrangians) Suppose we have a problem: Maximize 5, (x 1, 2) 2, 2(x 2, 1) 2 subject to x 1 +4x 2 =3 If we ignore the constraint, we get the
Date: April 12, 2001. Contents
2 Lagrange Multipliers Date: April 12, 2001 Contents 2.1. Introduction to Lagrange Multipliers......... p. 2 2.2. Enhanced Fritz John Optimality Conditions...... p. 12 2.3. Informative Lagrange Multipliers...........
Two-Stage Stochastic Linear Programs
Two-Stage Stochastic Linear Programs Operations Research Anthony Papavasiliou 1 / 27 Two-Stage Stochastic Linear Programs 1 Short Reviews Probability Spaces and Random Variables Convex Analysis 2 Deterministic
The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method
The Steepest Descent Algorithm for Unconstrained Optimization and a Bisection Line-search Method Robert M. Freund February, 004 004 Massachusetts Institute of Technology. 1 1 The Algorithm The problem
Practical Guide to the Simplex Method of Linear Programming
Practical Guide to the Simplex Method of Linear Programming Marcel Oliver Revised: April, 0 The basic steps of the simplex algorithm Step : Write the linear programming problem in standard form Linear
Integer Programming: Algorithms - 3
Week 9 Integer Programming: Algorithms - 3 OPR 992 Applied Mathematical Programming OPR 992 - Applied Mathematical Programming - p. 1/12 Dantzig-Wolfe Reformulation Example Strength of the Linear Programming
Big Data - Lecture 1 Optimization reminders
Big Data - Lecture 1 Optimization reminders S. Gadat Toulouse, Octobre 2014 Big Data - Lecture 1 Optimization reminders S. Gadat Toulouse, Octobre 2014 Schedule Introduction Major issues Examples Mathematics
Introduction to Support Vector Machines. Colin Campbell, Bristol University
Introduction to Support Vector Machines Colin Campbell, Bristol University 1 Outline of talk. Part 1. An Introduction to SVMs 1.1. SVMs for binary classification. 1.2. Soft margins and multi-class classification.
Dual Methods for Total Variation-Based Image Restoration
Dual Methods for Total Variation-Based Image Restoration Jamylle Carter Institute for Mathematics and its Applications University of Minnesota, Twin Cities Ph.D. (Mathematics), UCLA, 2001 Advisor: Tony
An Overview Of Software For Convex Optimization. Brian Borchers Department of Mathematics New Mexico Tech Socorro, NM 87801 borchers@nmt.
An Overview Of Software For Convex Optimization Brian Borchers Department of Mathematics New Mexico Tech Socorro, NM 87801 [email protected] In fact, the great watershed in optimization isn t between linearity
Linear Programming for Optimization. Mark A. Schulze, Ph.D. Perceptive Scientific Instruments, Inc.
1. Introduction Linear Programming for Optimization Mark A. Schulze, Ph.D. Perceptive Scientific Instruments, Inc. 1.1 Definition Linear programming is the name of a branch of applied mathematics that
Solutions of Equations in One Variable. Fixed-Point Iteration II
Solutions of Equations in One Variable Fixed-Point Iteration II Numerical Analysis (9th Edition) R L Burden & J D Faires Beamer Presentation Slides prepared by John Carroll Dublin City University c 2011
24. The Branch and Bound Method
24. The Branch and Bound Method It has serious practical consequences if it is known that a combinatorial problem is NP-complete. Then one can conclude according to the present state of science that no
A NEW LOOK AT CONVEX ANALYSIS AND OPTIMIZATION
1 A NEW LOOK AT CONVEX ANALYSIS AND OPTIMIZATION Dimitri Bertsekas M.I.T. FEBRUARY 2003 2 OUTLINE Convexity issues in optimization Historical remarks Our treatment of the subject Three unifying lines of
Distributed Machine Learning and Big Data
Distributed Machine Learning and Big Data Sourangshu Bhattacharya Dept. of Computer Science and Engineering, IIT Kharagpur. http://cse.iitkgp.ac.in/~sourangshu/ August 21, 2015 Sourangshu Bhattacharya
Interior-Point Algorithms for Quadratic Programming
Interior-Point Algorithms for Quadratic Programming Thomas Reslow Krüth Kongens Lyngby 2008 IMM-M.Sc-2008-19 Technical University of Denmark Informatics and Mathematical Modelling Building 321, DK-2800
Lecture 2. Marginal Functions, Average Functions, Elasticity, the Marginal Principle, and Constrained Optimization
Lecture 2. Marginal Functions, Average Functions, Elasticity, the Marginal Principle, and Constrained Optimization 2.1. Introduction Suppose that an economic relationship can be described by a real-valued
INTERIOR-POINT METHODS FOR NONCONVEX NONLINEAR PROGRAMMING: FILTER METHODS AND MERIT FUNCTIONS
INTERIOR-POINT METHODS FOR NONCONVEX NONLINEAR PROGRAMMING: FILTER METHODS AND MERIT FUNCTIONS HANDE Y. BENSON, DAVID F. SHANNO, AND ROBERT J. VANDERBEI Operations Research and Financial Engineering Princeton
Some representability and duality results for convex mixed-integer programs.
Some representability and duality results for convex mixed-integer programs. Santanu S. Dey Joint work with Diego Morán and Juan Pablo Vielma December 17, 2012. Introduction About Motivation Mixed integer
Optimization in R n Introduction
Optimization in R n Introduction Rudi Pendavingh Eindhoven Technical University Optimization in R n, lecture Rudi Pendavingh (TUE) Optimization in R n Introduction ORN / 4 Some optimization problems designing
Duality of linear conic problems
Duality of linear conic problems Alexander Shapiro and Arkadi Nemirovski Abstract It is well known that the optimal values of a linear programming problem and its dual are equal to each other if at least
The Multiplicative Weights Update method
Chapter 2 The Multiplicative Weights Update method The Multiplicative Weights method is a simple idea which has been repeatedly discovered in fields as diverse as Machine Learning, Optimization, and Game
Support Vector Machines Explained
March 1, 2009 Support Vector Machines Explained Tristan Fletcher www.cs.ucl.ac.uk/staff/t.fletcher/ Introduction This document has been written in an attempt to make the Support Vector Machines (SVM),
CONSTRAINED NONLINEAR PROGRAMMING
149 CONSTRAINED NONLINEAR PROGRAMMING We now turn to methods for general constrained nonlinear programming. These may be broadly classified into two categories: 1. TRANSFORMATION METHODS: In this approach
Solutions Of Some Non-Linear Programming Problems BIJAN KUMAR PATEL. Master of Science in Mathematics. Prof. ANIL KUMAR
Solutions Of Some Non-Linear Programming Problems A PROJECT REPORT submitted by BIJAN KUMAR PATEL for the partial fulfilment for the award of the degree of Master of Science in Mathematics under the supervision
Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh
Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh Peter Richtárik Week 3 Randomized Coordinate Descent With Arbitrary Sampling January 27, 2016 1 / 30 The Problem
Several Views of Support Vector Machines
Several Views of Support Vector Machines Ryan M. Rifkin Honda Research Institute USA, Inc. Human Intention Understanding Group 2007 Tikhonov Regularization We are considering algorithms of the form min
Lecture 2: August 29. Linear Programming (part I)
10-725: Convex Optimization Fall 2013 Lecture 2: August 29 Lecturer: Barnabás Póczos Scribes: Samrachana Adhikari, Mattia Ciollaro, Fabrizio Lecci Note: LaTeX template courtesy of UC Berkeley EECS dept.
On SDP- and CP-relaxations and on connections between SDP-, CP and SIP
On SDP- and CP-relaations and on connections between SDP-, CP and SIP Georg Still and Faizan Ahmed University of Twente p 1/12 1. IP and SDP-, CP-relaations Integer program: IP) : min T Q s.t. a T j =
3. Linear Programming and Polyhedral Combinatorics
Massachusetts Institute of Technology Handout 6 18.433: Combinatorial Optimization February 20th, 2009 Michel X. Goemans 3. Linear Programming and Polyhedral Combinatorics Summary of what was seen in the
A Simple Introduction to Support Vector Machines
A Simple Introduction to Support Vector Machines Martin Law Lecture for CSE 802 Department of Computer Science and Engineering Michigan State University Outline A brief history of SVM Large-margin linear
Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1
Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1 J. Zhang Institute of Applied Mathematics, Chongqing University of Posts and Telecommunications, Chongqing
Linear Programming in Matrix Form
Linear Programming in Matrix Form Appendix B We first introduce matrix concepts in linear programming by developing a variation of the simplex method called the revised simplex method. This algorithm,
LECTURE 5: DUALITY AND SENSITIVITY ANALYSIS. 1. Dual linear program 2. Duality theory 3. Sensitivity analysis 4. Dual simplex method
LECTURE 5: DUALITY AND SENSITIVITY ANALYSIS 1. Dual linear program 2. Duality theory 3. Sensitivity analysis 4. Dual simplex method Introduction to dual linear program Given a constraint matrix A, right
Optimization of Communication Systems Lecture 6: Internet TCP Congestion Control
Optimization of Communication Systems Lecture 6: Internet TCP Congestion Control Professor M. Chiang Electrical Engineering Department, Princeton University ELE539A February 21, 2007 Lecture Outline TCP
Proximal mapping via network optimization
L. Vandenberghe EE236C (Spring 23-4) Proximal mapping via network optimization minimum cut and maximum flow problems parametric minimum cut problem application to proximal mapping Introduction this lecture:
SECOND DERIVATIVE TEST FOR CONSTRAINED EXTREMA
SECOND DERIVATIVE TEST FOR CONSTRAINED EXTREMA This handout presents the second derivative test for a local extrema of a Lagrange multiplier problem. The Section 1 presents a geometric motivation for the
t := maxγ ν subject to ν {0,1,2,...} and f(x c +γ ν d) f(x c )+cγ ν f (x c ;d).
1. Line Search Methods Let f : R n R be given and suppose that x c is our current best estimate of a solution to P min x R nf(x). A standard method for improving the estimate x c is to choose a direction
Chapter 6. Linear Programming: The Simplex Method. Introduction to the Big M Method. Section 4 Maximization and Minimization with Problem Constraints
Chapter 6 Linear Programming: The Simplex Method Introduction to the Big M Method In this section, we will present a generalized version of the simplex method that t will solve both maximization i and
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
Scheduling Home Health Care with Separating Benders Cuts in Decision Diagrams
Scheduling Home Health Care with Separating Benders Cuts in Decision Diagrams André Ciré University of Toronto John Hooker Carnegie Mellon University INFORMS 2014 Home Health Care Home health care delivery
Advanced Lecture on Mathematical Science and Information Science I. Optimization in Finance
Advanced Lecture on Mathematical Science and Information Science I Optimization in Finance Reha H. Tütüncü Visiting Associate Professor Dept. of Mathematical and Computing Sciences Tokyo Institute of Technology
Probabilistic Models for Big Data. Alex Davies and Roger Frigola University of Cambridge 13th February 2014
Probabilistic Models for Big Data Alex Davies and Roger Frigola University of Cambridge 13th February 2014 The State of Big Data Why probabilistic models for Big Data? 1. If you don t have to worry about
Chapter 2 Solving Linear Programs
Chapter 2 Solving Linear Programs Companion slides of Applied Mathematical Programming by Bradley, Hax, and Magnanti (Addison-Wesley, 1977) prepared by José Fernando Oliveira Maria Antónia Carravilla A
TOMLAB - For fast and robust largescale optimization in MATLAB
The TOMLAB Optimization Environment is a powerful optimization and modeling package for solving applied optimization problems in MATLAB. TOMLAB provides a wide range of features, tools and services for
Principles of Scientific Computing Nonlinear Equations and Optimization
Principles of Scientific Computing Nonlinear Equations and Optimization David Bindel and Jonathan Goodman last revised March 6, 2006, printed March 6, 2009 1 1 Introduction This chapter discusses two related
. P. 4.3 Basic feasible solutions and vertices of polyhedra. x 1. x 2
4. Basic feasible solutions and vertices of polyhedra Due to the fundamental theorem of Linear Programming, to solve any LP it suffices to consider the vertices (finitely many) of the polyhedron P of the
These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop
Music and Machine Learning (IFT6080 Winter 08) Prof. Douglas Eck, Université de Montréal These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher
International Doctoral School Algorithmic Decision Theory: MCDA and MOO
International Doctoral School Algorithmic Decision Theory: MCDA and MOO Lecture 2: Multiobjective Linear Programming Department of Engineering Science, The University of Auckland, New Zealand Laboratoire
Optimization Theory for Large Systems
Optimization Theory for Large Systems LEON S. LASDON CASE WESTERN RESERVE UNIVERSITY THE MACMILLAN COMPANY COLLIER-MACMILLAN LIMITED, LONDON Contents 1. Linear and Nonlinear Programming 1 1.1 Unconstrained
1 Portfolio mean and variance
Copyright c 2005 by Karl Sigman Portfolio mean and variance Here we study the performance of a one-period investment X 0 > 0 (dollars) shared among several different assets. Our criterion for measuring
Cost Minimization and the Cost Function
Cost Minimization and the Cost Function Juan Manuel Puerta October 5, 2009 So far we focused on profit maximization, we could look at a different problem, that is the cost minimization problem. This is
IEOR 4404 Homework #2 Intro OR: Deterministic Models February 14, 2011 Prof. Jay Sethuraman Page 1 of 5. Homework #2
IEOR 4404 Homework # Intro OR: Deterministic Models February 14, 011 Prof. Jay Sethuraman Page 1 of 5 Homework #.1 (a) What is the optimal solution of this problem? Let us consider that x 1, x and x 3
Linear Programming. Widget Factory Example. Linear Programming: Standard Form. Widget Factory Example: Continued.
Linear Programming Widget Factory Example Learning Goals. Introduce Linear Programming Problems. Widget Example, Graphical Solution. Basic Theory:, Vertices, Existence of Solutions. Equivalent formulations.
Linear Programming. March 14, 2014
Linear Programming March 1, 01 Parts of this introduction to linear programming were adapted from Chapter 9 of Introduction to Algorithms, Second Edition, by Cormen, Leiserson, Rivest and Stein [1]. 1
Numerical methods for American options
Lecture 9 Numerical methods for American options Lecture Notes by Andrzej Palczewski Computational Finance p. 1 American options The holder of an American option has the right to exercise it at any moment
Walrasian Demand. u(x) where B(p, w) = {x R n + : p x w}.
Walrasian Demand Econ 2100 Fall 2015 Lecture 5, September 16 Outline 1 Walrasian Demand 2 Properties of Walrasian Demand 3 An Optimization Recipe 4 First and Second Order Conditions Definition Walrasian
Reformulating the monitor placement problem: Optimal Network-wide wide Sampling
Reformulating the monitor placement problem: Optimal Network-wide wide Sampling Gianluca Iannaccone Intel Research @ Cambridge Joint work with: G. Cantieni,, P. Thiran (EPFL) C. Barakat (INRIA), C. Diot
Lecture 11: 0-1 Quadratic Program and Lower Bounds
Lecture : - Quadratic Program and Lower Bounds (3 units) Outline Problem formulations Reformulation: Linearization & continuous relaxation Branch & Bound Method framework Simple bounds, LP bound and semidefinite
5 INTEGER LINEAR PROGRAMMING (ILP) E. Amaldi Fondamenti di R.O. Politecnico di Milano 1
5 INTEGER LINEAR PROGRAMMING (ILP) E. Amaldi Fondamenti di R.O. Politecnico di Milano 1 General Integer Linear Program: (ILP) min c T x Ax b x 0 integer Assumption: A, b integer The integrality condition
Pattern Analysis. Logistic Regression. 12. Mai 2009. Joachim Hornegger. Chair of Pattern Recognition Erlangen University
Pattern Analysis Logistic Regression 12. Mai 2009 Joachim Hornegger Chair of Pattern Recognition Erlangen University Pattern Analysis 2 / 43 1 Logistic Regression Posteriors and the Logistic Function Decision
Solving Linear Programs
Solving Linear Programs 2 In this chapter, we present a systematic procedure for solving linear programs. This procedure, called the simplex method, proceeds by moving from one feasible solution to another,
Optimization Modeling for Mining Engineers
Optimization Modeling for Mining Engineers Alexandra M. Newman Division of Economics and Business Slide 1 Colorado School of Mines Seminar Outline Linear Programming Integer Linear Programming Slide 2
Linear Programming I
Linear Programming I November 30, 2003 1 Introduction In the VCR/guns/nuclear bombs/napkins/star wars/professors/butter/mice problem, the benevolent dictator, Bigus Piguinus, of south Antarctica penguins
Agricultural and Environmental Policy Models: Calibration, Estimation and Optimization. Richard E. Howitt
Agricultural and Environmental Policy Models: Calibration, Estimation and Optimization Richard E. Howitt January 18, 2005 ii Contents 1 Introduction 1 1.1 Introduction to Linear Models..................
26 Linear Programming
The greatest flood has the soonest ebb; the sorest tempest the most sudden calm; the hottest love the coldest end; and from the deepest desire oftentimes ensues the deadliest hate. Th extremes of glory
Optimization Methods in Finance
Optimization Methods in Finance Gerard Cornuejols Reha Tütüncü Carnegie Mellon University, Pittsburgh, PA 15213 USA January 2006 2 Foreword Optimization models play an increasingly important role in financial
10.2 ITERATIVE METHODS FOR SOLVING LINEAR SYSTEMS. The Jacobi Method
578 CHAPTER 1 NUMERICAL METHODS 1. ITERATIVE METHODS FOR SOLVING LINEAR SYSTEMS As a numerical technique, Gaussian elimination is rather unusual because it is direct. That is, a solution is obtained after
Special Situations in the Simplex Algorithm
Special Situations in the Simplex Algorithm Degeneracy Consider the linear program: Maximize 2x 1 +x 2 Subject to: 4x 1 +3x 2 12 (1) 4x 1 +x 2 8 (2) 4x 1 +2x 2 8 (3) x 1, x 2 0. We will first apply the
Adaptive Online Gradient Descent
Adaptive Online Gradient Descent Peter L Bartlett Division of Computer Science Department of Statistics UC Berkeley Berkeley, CA 94709 bartlett@csberkeleyedu Elad Hazan IBM Almaden Research Center 650
Linear Programming: Chapter 11 Game Theory
Linear Programming: Chapter 11 Game Theory Robert J. Vanderbei October 17, 2007 Operations Research and Financial Engineering Princeton University Princeton, NJ 08544 http://www.princeton.edu/ rvdb Rock-Paper-Scissors
Lecture 2: The SVM classifier
Lecture 2: The SVM classifier C19 Machine Learning Hilary 2015 A. Zisserman Review of linear classifiers Linear separability Perceptron Support Vector Machine (SVM) classifier Wide margin Cost function
Integrating Benders decomposition within Constraint Programming
Integrating Benders decomposition within Constraint Programming Hadrien Cambazard, Narendra Jussien email: {hcambaza,jussien}@emn.fr École des Mines de Nantes, LINA CNRS FRE 2729 4 rue Alfred Kastler BP
CHAPTER 9. Integer Programming
CHAPTER 9 Integer Programming An integer linear program (ILP) is, by definition, a linear program with the additional constraint that all variables take integer values: (9.1) max c T x s t Ax b and x integral
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. #-approximation algorithm.
Approximation Algorithms 11 Approximation Algorithms Q Suppose I need to solve an NP-hard problem What should I do? A Theory says you're unlikely to find a poly-time algorithm Must sacrifice one of three
Increasing for all. Convex for all. ( ) Increasing for all (remember that the log function is only defined for ). ( ) Concave for all.
1. Differentiation The first derivative of a function measures by how much changes in reaction to an infinitesimal shift in its argument. The largest the derivative (in absolute value), the faster is evolving.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: [email protected] Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
INTEGER PROGRAMMING. Integer Programming. Prototype example. BIP model. BIP models
Integer Programming INTEGER PROGRAMMING In many problems the decision variables must have integer values. Example: assign people, machines, and vehicles to activities in integer quantities. If this is
Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS
Sensitivity Analysis 3 We have already been introduced to sensitivity analysis in Chapter via the geometry of a simple example. We saw that the values of the decision variables and those of the slack and
