Stochastic Gradient Method: Applications

Similar documents

B. Delyon. Stochastic approximation with decreasing gain: convergence and asymptotic theory. Unpublished lecture notes, Université de Rennes, 2000.

Big Data - Lecture 1 Optimization reminders

Spatial Decomposition/Coordination Methods for Stochastic Optimal Control Problems. Practical aspects and theoretical questions

Pricing and calibration in local volatility models via fast quantization

Monte Carlo Methods in Finance

Mathematical Finance

Increasing for all. Convex for all. ( ) Increasing for all (remember that the log function is only defined for ). ( ) Concave for all.

OPTIMAL CONTROL OF A COMMERCIAL LOAN REPAYMENT PLAN. E.V. Grigorieva. E.N. Khailov

COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

When to Refinance Mortgage Loans in a Stochastic Interest Rate Environment

Simulating Stochastic Differential Equations

STA 4273H: Statistical Machine Learning

Applications to Data Smoothing and Image Processing I

Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm

Statistical Machine Learning

Pattern Analysis. Logistic Regression. 12. Mai Joachim Hornegger. Chair of Pattern Recognition Erlangen University

Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering

Nonlinear Optimization: Algorithms 3: Interior-point methods

Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1

24. The Branch and Bound Method

Linear Threshold Units

Discrete mechanics, optimal control and formation flying spacecraft

Lecture. S t = S t δ[s t ].

HPCFinance: New Thinking in Finance. Calculating Variable Annuity Liability Greeks Using Monte Carlo Simulation

Euler: A System for Numerical Optimization of Programs

Lecture 10. Finite difference and finite element methods. Option pricing Sensitivity analysis Numerical examples

From CFD to computational finance (and back again?)

Introduction to General and Generalized Linear Models

Lecture 5: Variants of the LMS algorithm

Duality of linear conic problems

Beyond stochastic gradient descent for large-scale machine learning

A joint control framework for supply chain planning

Optimization under uncertainty: modeling and solution methods

Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh

Sensitivity analysis of European options in jump-diffusion models via the Malliavin calculus on the Wiener space

Pricing complex options using a simple Monte Carlo Simulation

A New Quantitative Behavioral Model for Financial Prediction

Life Settlement Pricing

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE

Dimensioning an inbound call center using constraint programming

Stochastic Inventory Control

Decomposition of life insurance liabilities into risk factors theory and application

Nonparametric adaptive age replacement with a one-cycle criterion

The Black-Scholes Formula

From CFD to computational finance (and back again?)

Using simulation to calculate the NPV of a project

Neuro-Dynamic Programming An Overview

C21 Model Predictive Control

Week 1: Introduction to Online Learning

Practical Guide to the Simplex Method of Linear Programming

Index tracking UNDER TRANSACTION COSTS:

Safe robot motion planning in dynamic, uncertain environments

1 Simulating Brownian motion (BM) and geometric Brownian motion (GBM)

Discussion on the paper Hypotheses testing by convex optimization by A. Goldenschluger, A. Juditsky and A. Nemirovski.

Vector Spaces; the Space R n

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

Computational Statistics for Big Data

Functional Optimization Models for Active Queue Management

Learning Vector Quantization: generalization ability and dynamics of competing prototypes

Load Balancing and Switch Scheduling

Simulation-based optimization methods for urban transportation problems. Carolina Osorio

Date: April 12, Contents

Logistic Regression (1/24/13)

1 Portfolio mean and variance

Abstract. 1. Introduction. Caparica, Portugal b CEG, IST-UTL, Av. Rovisco Pais, Lisboa, Portugal

Black-Scholes Equation for Option Pricing

An example of a computable

Machine learning and optimization for massive data

Formulations of Model Predictive Control. Dipartimento di Elettronica e Informazione

CS 688 Pattern Recognition Lecture 4. Linear Models for Classification

OPTIMAl PREMIUM CONTROl IN A NON-liFE INSURANCE BUSINESS

A Log-Robust Optimization Approach to Portfolio Management

Part 4 fitting with energy loss and multiple scattering non gaussian uncertainties outliers

Adaptive Online Gradient Descent

MCMC Using Hamiltonian Dynamics

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

Master of Mathematical Finance: Course Descriptions

Quant DSL Language Guide

The Black-Scholes-Merton Approach to Pricing Options

Mathieu St-Pierre. Denis Gingras Dr. Ing.

Variance Reduction. Pricing American Options. Monte Carlo Option Pricing. Delta and Common Random Numbers

Nonlinear Programming Methods.S2 Quadratic Programming

Computing Near Optimal Strategies for Stochastic Investment Planning Problems

Lecture 15. Sergei Fedotov Introduction to Financial Mathematics. Sergei Fedotov (University of Manchester) / 6

2.2 Creaseness operator

Stephane Crepey. Financial Modeling. A Backward Stochastic Differential Equations Perspective. 4y Springer

QUANTIZED INTEREST RATE AT THE MONEY FOR AMERICAN OPTIONS

Binary Image Reconstruction

EXIT TIME PROBLEMS AND ESCAPE FROM A POTENTIAL WELL

Solving Method for a Class of Bilevel Linear Programming based on Genetic Algorithms

Transcription:

Stochastic Gradient Method: Applications February 03, 2015 P. Carpentier Master MMMEF Cours MNOS 2014-2015 114 / 267

Lecture Outline 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 115 / 267

Two-Stage Recourse Problem Trade-off Between Investment and Operation 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 116 / 267

Two-Stage Recourse Problem Trade-off Between Investment and Operation 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 117 / 267

A Basic Two-Stage Recourse Problem Two-Stage Recourse Problem Trade-off Between Investment and Operation We consider the management of a water reservoir. Water is drawn from the reservoir by way of random consumers. In order to ensure the reservoir supply, 2 decisions are taken at successive time steps. A first supply decision q 1 is taken without any knowledge of the effective consumption, the associated cost being equal to 1 2 c 1( q1 ) 2, with c1 > 0. Once the consumption d has been observed (realization of a r.v. D defined over a probability space (Ω, A, P)), a second supply decision q 2 is taken in order to maintain the reservoir at its initial level, that is, q 2 = d q 1, the cost associated to this second decision being equal to 1 2 c 2( q2 ) 2, with c2 > c 1 > 0. The problem is to minimize the expected cost of operation. P. Carpentier Master MMMEF Cours MNOS 2014-2015 118 / 267

Mathematical Formulation and Solution Problem Formulation q 1 is a deterministic decision variable, Two-Stage Recourse Problem Trade-off Between Investment and Operation whereas q 2 is the realization of a random variable Q 2. 1 min (q 1,Q 2 ) 2 c ( ) 2 1 ( 1 q1 + 2 E ( ) ) 2 c 2 Q2 s.t. q 1 + Q 2 = D. Equivalent Problem 1 ( min q 1 R 2 E ( ) 2 ( ) ) 2 c 1 q1 + c2 D q1 Analytical solution: q 1 = c 2 c 1 + c 2 E ( D ). P. Carpentier Master MMMEF Cours MNOS 2014-2015 119 / 267

Stochastic Gradient Algorithm Two-Stage Recourse Problem Trade-off Between Investment and Operation Q (k+1) 1 = Q (k) 1 1 ( (c 1 + c 2 )Q (k) 1 c 2 D (k+1)). k Algorithm (initialization) // // Random generator // rand( normal ); rand( seed,123); // // Random consumption // moy = 10.; ect = 5.; // // Criterion // c1 = 3.; c2 = 1.; // // Initialization // x = [ ]; y = [ ]; Algorithm (iterations) // // Algorithm // qk = 0.; for k = 1:100 dk = moy + (ect*rand(1)); gk = ((c1+c2)*qk) - (c2*dk); ek = 1/k; qk = qk - (ek*gk); x = [x ; k]; y = [y ; qk]; end // // Trajectory plot // plot2d(x,y); xtitle( Stochastic Gradient, Iter., Q1 ); P. Carpentier Master MMMEF Cours MNOS 2014-2015 120 / 267

A Realization of the Algorithm Two-Stage Recourse Problem Trade-off Between Investment and Operation 5.0 Stochastic Gradient Algorithm 4.5 4.0 3.5 3.0 Q1 2.5 2.0 1.5 1.0 0.5 0.0 0 10 20 30 40 50 60 70 80 90 100 Iter. P. Carpentier Master MMMEF Cours MNOS 2014-2015 121 / 267

More Realizations... Two-Stage Recourse Problem Trade-off Between Investment and Operation 5.0 Stochastic Gradient Algorithm 4.5 4.0 3.5 3.0 Q1 2.5 2.0 1.5 1.0 0.5 0.0 0 50 100 150 200 250 300 350 400 450 500 Iter. P. Carpentier Master MMMEF Cours MNOS 2014-2015 122 / 267

Two-Stage Recourse Problem Trade-off Between Investment and Operation Slight Modification of the problem As in the basic two-stage recourse problem, a first supply decision q 1 is taken without any knowledge of the effective consumption, the associated cost being equal to 1 2 c 1( q1 ) 2, a second supply decision q 2 is taken once the consumption d has been observed (realization of a r.v. D), the cost of this second decision being equal to 1 2 c 2( q2 ) 2. The difference between supply and demand is penalized thanks to an additional cost term 1 2 c 3( q1 + q 2 d ) 2. The new problem is : 1 ( min (q 1,Q 2 ) 2 E ( ) 2 ( ) 2 ( c 1 q1 + c2 Q2 + c3 q1 + Q 2 D ) ) 2. Question: how to solve it using a stochastic gradient algorithm? P. Carpentier Master MMMEF Cours MNOS 2014-2015 123 / 267

Two-Stage Recourse Problem Trade-off Between Investment and Operation 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 124 / 267

Two-Stage Recourse Problem Trade-off Between Investment and Operation Trade-off Between Investment and Operation (1) A company owns N production units and has to meet a given (non stochastic) demand d. For each unit i, the decision maker first takes an investment decision u i R, the associated cost being I i (u i ). Then a discrete disturbance w i {w i,a, w i,b, w i,c } occurs. Knowing all noises, the decision maker selects for each unit i an operating point v i R, which leads to a cost c i (v i, w i ) and a production e i (v i, w i ). The goal is to minimize the overall expected cost, subject to the following constraints: investment limitation: Θ(u 1,..., u N ) 0, operating limitation: v i ϕ i (u i ), i = 1..., N, demand satisfaction: N i=1 e i(v i, w i ) d = 0. P. Carpentier Master MMMEF Cours MNOS 2014-2015 125 / 267

Two-Stage Recourse Problem Trade-off Between Investment and Operation Trade-off Between Investment and Operation (2) Questions. 1 Write down the optimization problem. Is it possible to apply the stochastic gradient algorithm in a straightforward manner? 2 Extract the optimization subproblem obtained when both the investment u = (u 1,..., u N ) and the noise w = (w 1,..., w N ) are fixed. The value of this subproblem is denoted f (u, w). Discuss the resolution of this subproblem. Give assumptions in order to have a nice function f. Compute the partial derivatives of f w.r.t. u. 3 Reformulate the initial optimization problem using this new function f and apply the stochastic gradient algorithm in the two following cases: the investment limitation reduces to u i [u i, u i ], i = 1,..., N, the investment limitation has the form Θ(u 1,..., u N ) 0. P. Carpentier Master MMMEF Cours MNOS 2014-2015 126 / 267

Financial Problem Modeling Computing Efficiently the Price Two Algorithms 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 127 / 267

Financial Problem Modeling Computing Efficiently the Price Two Algorithms 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 128 / 267

Financial Problem Modeling Computing Efficiently the Price Two Algorithms The Financial Problem The price of an option with payoff ψ(s t, 0 t T ) is given by ( P = E e rt ) ψ(s t, 0 t T ), where the dynamics of the underlying n-dimensional asset S is described by the following stochastic differential equation ds t = S t ( rdt + σ(t, St )dw t ), S0 = x, r being the interest rate and σ(t, y) being the volatility function. P. Carpentier Master MMMEF Cours MNOS 2014-2015 129 / 267

Discretization Financial Problem Modeling Computing Efficiently the Price Two Algorithms Most of the time, the exact solution is not available. To overcome the difficulty, one considers a discretized approximation of S (by using an Euler s scheme), so that the price P is approximated by ( P = E e rt ) ψ(ŝ t 1,..., Ŝ t d ). In such cases, the discretized function can be expressed in terms of the Brownian increments, or equivalently using a random normal vector. A compact form for the discretized price is P = E ( φ(g ) ), where G is a n d-dimensional Gaussian vector with identity covariance matrix and zero-mean. P. Carpentier Master MMMEF Cours MNOS 2014-2015 130 / 267

Financial Problem Modeling Computing Efficiently the Price Two Algorithms 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 131 / 267

A Parameterized Change of Variable Financial Problem Modeling Computing Efficiently the Price Two Algorithms The problem is now to compute P = E ( φ(g ) ) using Monte Carlo simulations. By a change of variables, we obtain that ) θ 2 θ,g P = E (φ(g + θ)e 2, for any θ R n d. Let us denote by V (θ) the associated variance: ( V (θ) = E φ(g + θ) 2 e 2 θ,g θ 2) ( 2 E φ(g )), ) ( = E (φ(g ) 2 θ 2 2 θ,g + e 2 E φ(g )). The last expression shows that function V is strictly convex and differentiable without any specific assumptions on φ. Moreover, V ) (θ) = E ((θ G )φ(g ) 2 θ 2 θ,g + e 2, so that the gradient of V does not involves any derivative of φ. P. Carpentier Master MMMEF Cours MNOS 2014-2015 132 / 267

Variance Minimization Financial Problem Modeling Computing Efficiently the Price Two Algorithms The goal is to compute the parameter θ such that the variance V (θ) related to P is as small as possible, in order to optimize the convergence speed of the Monte Carlo simulations: ( min E φ(g ) 2 e θ R d θ 2 θ,g + 2 ). This optimization problem is well suited for being solved by the stochastic gradient algorithm: θ (k+1) = θ (k) ɛ (k)( θ (k) G (k+1)) φ ( G (k+1)) 2 e θ (k),g (k+1) + θ(k) 2 2, and its unique solution is denoted θ. P. Carpentier Master MMMEF Cours MNOS 2014-2015 133 / 267

Financial Problem Modeling Computing Efficiently the Price Two Algorithms 1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 134 / 267

Financial Problem Modeling Computing Efficiently the Price Two Algorithms Non adaptive Algorithm 1 Using a N-sample of G, obtain an approximation θ (N) of θ by N iterations of the stochastic gradient algorithm. 2 Once θ (N) has been obtained, use the standard Monte Carlo method to compute an approximation of the price P by using another N-sample of G : P (N) = 1 N N k=1 φ(g (N+k) + θ (N) )e θ(n),g (N+k) θ(n) 2 2. This algorithm requires 2N evaluations of the function φ, whereas a crude Monte Carlo method evaluates φ only N times. The non adaptive algorithm is efficient as soon as V (θ ) V (0)/2. P. Carpentier Master MMMEF Cours MNOS 2014-2015 135 / 267

Adaptive Algorithm Financial Problem Modeling Computing Efficiently the Price Two Algorithms Combine the 2 previous algorithms, and compute simultaneously approximations of θ and P by using the same N-sample of G : 10 θ (k+1) = θ (k) ɛ (k)( θ (k) G (k+1)) φ ( G (k+1)) 2 e θ (k),g (k+1) + θ(k) 2 2, P (k+1) = P (k) 1 (P (k) φ(g (k+1) + θ (k) ) e θ(k),g (k+1) θ(k) ) 2 2 k + 1 A Central Limit Theorem is available for this algorithm: ( (N) N P P) D N (0, V ) (θ ).. 10 the last relation being the recursive form of: P (N) = 1 N N k=1 φ(g (k+1) + θ (k) ) e θ(k),g (k+1) θ(k) 2 2. P. Carpentier Master MMMEF Cours MNOS 2014-2015 136 / 267

1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 137 / 267

1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 138 / 267

Mission to Mars P. Carpentier Master MMMEF Cours MNOS 2014-2015 139 / 267

Satellite Model dr dt = v, dm dt = T g 0 I sp δ. dv dt = µ r r 3 + F m κ, (8a) (8b) (8a): 6-dimensional state vector (position r and velocity v). (8b): 1-dimensional state vector (mass m including fuel). κ involves the direction cosines of the thrust and the on-off switch δ of the engine (3 controls), and µ, F, T, g 0, I sp are constants. The deterministic control problem is to drive the satellite from the initial condition at t i to a known final position r f and velocity v f at t f (given) while minimizing fuel consumption m(t i ) m(t f ). P. Carpentier Master MMMEF Cours MNOS 2014-2015 140 / 267

Deterministic Optimization Problem Using equinoctial coordinates for the position and velocity state vector x R 7, and cartesian coordinates for the thrust of the engine control vector u R 3, the deterministic optimization problem writes as follows: subject to: x(t i ) = x i, min K( x(t f ) ) u( ) x (t) = f ( x(t), u(t) ), u(t) 1 t [t i, t f ], C ( x(t f ) ) = 0. P. Carpentier Master MMMEF Cours MNOS 2014-2015 141 / 267

Engine Failure Sometimes, the engine may fail to work when needed: the satellite drifts away from the deterministic optimal trajectory. After the engine control is recovered, it is not always possible to drive the satellite to the final target at t f. By anticipating such possible failures and by modifying the trajectory followed before any such failure occurs, one may increase the possibility of eventually reaching the target. But such a deviation from the deterministic optimal trajectory results in a deterioration of the economic performance. The problem is thus to balance the increased probability of eventually reaching the target despite possible failures against the expected economic performance, that is, to quantify the price of safety one is ready to pay for. P. Carpentier Master MMMEF Cours MNOS 2014-2015 142 / 267

Stochastic Formulation (1) A failure is modeled using two random variables: t p : random initial time of the failure, t d : random duration of the failure. For any realization (tp, ξ t ξ d ) of a failure: 1 u( ) denotes the control used prior to the failure u is defined over [t i, t f ] but implemented over [t i, tp] ξ and corresponds to an open-loop control, 2 the control during the failure is 0 over [tp, ξ tp ξ + t ξ d ], 3 v ξ ( ) denotes the control used after the failure v ξ is defined over [tp ξ + t ξ d, t f] (if nonempty) and corresponds to a closed-loop strategy V. The satellite dynamics in the stochastic formulation writes: x ξ (t i ) = x i, x ξ (t) = f ξ( x ξ (t), u(t), v ξ (t) ). P. Carpentier Master MMMEF Cours MNOS 2014-2015 143 / 267

Stochastic Formulation (2) The problem is to minimize the expected cost (fuel consumption) w.r.t. the open-loop control u and the closed-loop strategy V, the probability to hit the target at t f being at least equal to p. ( min min E K ( x ξ (t f ) )) u( ) V( ) subject to: x ξ (t i ) = x i, ( min min E K ( x ξ (t f ) ) ( C x ξ (t f ) ) ) = 0 u( ) V( ) x ξ (t) = f ξ( x ξ (t), u(t), v ξ (t) ), u(t) 1 t [t i, t f ], v ξ (t) 1 t [tp ξ + t ξ d, t f], ( P C ( x ξ (t f ) ) ) = 0 p. P. Carpentier Master MMMEF Cours MNOS 2014-2015 144 / 267

1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 145 / 267

Indicator Function Consider the real-valued indicator function: { 1 if y = 0, I(y) = 0 otherwise. Then ( P C ( x ξ (t f ) ) ) ( = 0 = E I ( C ( x ξ (t f ) ) )), and ( E K ( x ξ (t f ) ) ( C x ξ (t f ) ) ) E = 0 = (K ( x ξ (t f ) ) I ( C ( x ξ (t f ) ) )) E (I ( C ( x ξ (t f ) ) )). P. Carpentier Master MMMEF Cours MNOS 2014-2015 146 / 267

Problem Reformulation The problem is (shortly) reformulated as s.t. E min min u( ) V( ) (K ( x ξ (t f ) ) I ( C ( x ξ (t f ) ) )) E (I ( C ( x ξ (t f ) ) )) ( E I ( ( C x ξ (t f ) ) )) p. Such a formulation is however not well-suited for a numerical implementation (e.g. Arrow-Hurwicz algorithm), because a ratio of expectations is not an expectation! P. Carpentier Master MMMEF Cours MNOS 2014-2015 147 / 267

An Useful Lemma Using compact notation, the previous problem writes: min u J(u) Θ(u) in which J and Θ assume positive values. s.t. Θ(u) p, (9) 1 If u is a solution of (9) and if Θ(u ) = p, then u is also a solution of min u J(u) s.t. Θ(u) p. (10) 2 Conversely, if u is a solution of (10), and if an optimal Kuhn-Tucker multiplier β satisfies the condition then u is also a solution of (9). β J(u ) Θ(u ), P. Carpentier Master MMMEF Cours MNOS 2014-2015 148 / 267

Back to a Cost in Expectation Using the previous lemma, we aim at solving a problem in which the cost and the constraint functions correspond to expectations: ( min min E K ( x ξ (t f ) ) I ( C ( x ξ (t f ) ) )) u( ) V( ) ( s.t. E I ( ( C x ξ (t f ) ) )) p, or equivalently (Interchange Theorem [R&W, 1998]): s.t. ( min E min K ( x ξ (t f ) ) I ( ( C x ξ (t f ) ) )) u( ) v ξ ( ) ( E I ( C ( x ξ (t f ) ) )) p. P. Carpentier Master MMMEF Cours MNOS 2014-2015 149 / 267

1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 150 / 267

Lagrangian Formulation s.t. ( min E min K ( x ξ (t f ) ) I ( ( C x ξ (t f ) ) )) u( ) v ξ ( ) ( p E I ( C ( x ξ (t f ) ) )) 0 µ Assume there exists a saddle point for the associated Lagrangian. In order to solve { ( ( ( max min µ p + E min K x ξ (t f ) ) µ ) I ( C ( x ξ (t f ) ) ) ) }. µ 0 u( ) v ξ ( ) } {{ } W (u, µ, ξ) that is, { max min µ p + E ( W (u, µ, ξ) )}, µ 0 u( ) we use an adapted Arrow-Hurwicz algorithm ([Culioli, 1994]). P. Carpentier Master MMMEF Cours MNOS 2014-2015 151 / 267

Algorithm Overview Arrow-Hurwicz algorithm At iteration k, 1 draw a failure ξ k = (tp ξk, t ξk ) according to its probability law, 2 compute the gradient of W w.r.t. u and update u( ): ) u k+1 = Π B (u k ε k u W (u k, µ k, ξ k ), d 3 compute the gradient of W w.r.t. µ and update µ: ( µ k+1 = max 0, µ k + ρ k( p + µ W (u k+1, µ k, ξ k ) )). P. Carpentier Master MMMEF Cours MNOS 2014-2015 152 / 267

First Difficulty: I Is Not a Smooth Function At every iteration k, we must evaluate function W as well as its derivatives w.r.t. u(.) and µ. But W is not differentiable! { ( ) 1 if y = 0, 1 y 2 2 if y [ r, r], I(y) = I r (y) = r 2 0 otherwise, 0 otherwise. 1.0 1.0 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 There are specific rules to drive r to 0 as the iteration number k goes to infinity in order to obtain the best asymptotic Mean Quadratic Error of the gradient estimates ([Andrieu et al., 2007]). P. Carpentier Master MMMEF Cours MNOS 2014-2015 153 / 267

Second Difficulty: Solving the Inner Problem The approximated closed-loop problem to solve at each iteration is: { (K W r k (u k, ξ k, µ k ( ) = min x ξ (t f ) ) µ k) ( I r v ξ k C ( x ξ (t f ) ) )}. ( ) In this setting, we have to check if the target is reached up to r k. Different cases have to be considered: 1 the target can be reached accurately, 2 the target can be reached up to r k only, 3 the target cannot be reached up to r k. Note that if reaching the target is possible but too expensive (that is, if K ( x ξ (t f ) ) µ k ), the best thing to do is to stop the engine! In practice, the solution of the approximated problem is derived from the resolution of two standard optimal control problems... P. Carpentier Master MMMEF Cours MNOS 2014-2015 154 / 267

Parameters Tuning Gradient step length: ε k = a b + k, ρ k = c d + k, usual for a stochastic gradient algorithm. Smoothing parameter: r k = α β + k 1 3, MQE reduced by a factor 1000 in about 100.000 iterations. P. Carpentier Master MMMEF Cours MNOS 2014-2015 155 / 267

1 Two Elementary Exercices on the Stochastic Gradient Two-Stage Recourse Problem Trade-off Between Investment and Operation 2 Financial Problem Modeling Computing Efficiently the Price Two Algorithms 3 P. Carpentier Master MMMEF Cours MNOS 2014-2015 156 / 267

Example: Interplanetary Mission t i = 0.70 and t f = 8.70 (normalized units), t p : exponential distribution: P ( t p t f ) 0.58 = πf, t d : exponential distribution: P ( 0.035 t d 0.125 ) 0.80. 1.0 0.8 0.6 The deterministic optimal control has a bang off bang shape. 0.4 0.2 0.0 0.2 Along the optimal trajectory, the probability to recover a failure is: p det 0.94. 0.4 0 1 2 3 4 5 6 7 8 9 Comp. normale w Comp. tangentielle s Comp. radiale q P. Carpentier Master MMMEF Cours MNOS 2014-2015 157 / 267

0.681 0.680 0.679 0.678 0.677 0.676 0.675 0.674 0 10000 Masse 20000 finale / iterations 30000 40000 50000 60000 70000 80000 90000 100000 0.326 0.324 0.322 0.320 1.0 0.8 0.6 0.4 0.2 0.0 0.2 0.318 0.316 0.314 0 10000 Multiplicateur 20000 probabilite 30000 / iterations 40000 50000 60000 70000 80000 90000 100000 0.4 0 1 2 3 4 5 6 7 8 9 Comp. normale w Comp. tangentielle s Comp. radiale q Figure: Probability level p = 0.750 P. Carpentier Master MMMEF Cours MNOS 2014-2015 158 / 267

0.681 0.680 0.679 0.678 0.677 0.676 0.675 0.674 0.673 0 10000 Masse 20000 finale / iterations 30000 40000 50000 60000 70000 80000 90000 100000 0.332 0.330 0.328 0.326 0.324 0.322 0.320 0.318 0 10000 Multiplicateur 20000 probabilite 30000 / iterations 40000 50000 60000 70000 80000 90000 100000 1.0 0.8 0.6 0.4 0.2 0.0 0.2 0.4 0 1 2 3 4 5 6 7 8 9 Comp. normale w Comp. tangentielle s Comp. radiale q Figure: Probability level p = 0.960 P. Carpentier Master MMMEF Cours MNOS 2014-2015 159 / 267

0.681 0.680 0.679 0.678 0.677 0.676 0.675 0.674 0 10000 Masse 20000 finale / iterations 30000 40000 50000 60000 70000 80000 90000 100000 0.365 0.360 0.355 0.350 0.345 0.340 0.335 0.330 0.325 0.320 0 10000 Multiplicateur 20000 probabilite 30000 / iterations 40000 50000 60000 70000 80000 90000 100000 1.0 0.8 0.6 0.4 0.2 0.0 0.2 0.4 0 1 2 3 4 5 6 7 8 9 Comp. normale w Comp. tangentielle s Comp. radiale q Figure: Probability level p = 0.990 P. Carpentier Master MMMEF Cours MNOS 2014-2015 160 / 267

The Price of Safety... 0.3222 0.3220 0.3218 0.3216 0.3214 0.3212 0.3210 0.3208 0.3206 0.3204 0.85 0.90 0.95 1.00 Consommation sans panne / Probabilite P. Carpentier Master MMMEF Cours MNOS 2014-2015 161 / 267

H. Robbins and S. Monro. A stochastic approximation method. Annals of Mathematical Statistics, 22:400 407, 1951. B. T. Polyak and A. B. Juditsky. Acceleration of stochastic approximation by averaging. SIAM Journal on Control and Optimization, 30(4):838 855, 1992. M. Duflo. Algorithmes stochastiques. Springer Verlag, 1996. H. J. Kushner and D. S. Clark. Stochastic Approximation Methods for Constrained and Unconstrained Systems. Springer Verlag, 1978. J.-C. Culioli and G. Cohen. Decomposition-coordination algorithms in stochastic optimization. SIAM Journal on Control and Optimization, 28(6):1372 1403, 1990. J.-C. Culioli and G. Cohen. Optimisation stochastique sous contraintes en espérance. CRAS, t.320, Série I, pp. 75-758, 1995. Laetitia Andrieu, Guy Cohen and Felisa J. Vázquez-Abad. Gradient-based simulation optimization under probability constraints. European Journal of Operational Research, 212, 345-351, 2011. R. T. Rockafellar and R. J.-B. Wets. Variational Analysis, volume 317 of A series of Comprehensive Studies in Mathematics. Springer Verlag, 1998. P. Carpentier Master MMMEF Cours MNOS 2014-2015 161 / 267