Decentralized control of stochastic multi-agent service system. J. George Shanthikumar Purdue University
|
|
|
- Doreen Bates
- 10 years ago
- Views:
Transcription
1 Decentralized control of stochastic multi-agent service system J. George Shanthikumar Purdue University Joint work with Huaning Cai, Peng Li and Andrew Lim 1
2 Many problems can be thought of as stochastic dynamic resource allocation problems: Airline alliances Investment firms with multiple trading desks Large scale service systems, call centers... Objective: operate the integrated system efficiently. 2
3 Centralized decision making is often not possible: different groups specialize in different tasks. relevant information is not/can not be shared integrating IT systems across different groups is hard. decentralized agents 3
4 Decentralized control involves many things: different agents have knowledge and control of different parts of the system different agents can observe different subsets of the state e.g. Witsenhausen (1968), Ho (1980), Rantzer (2009), Bertsekas (2007), Adelman & Mersereau (2008) different agents have different incentives Key issues: Each decision maker maximizes its own objectives conditional on the available information. In general: local optimality global optimality How do we efficiently manage such multi-agent systems? 4
5 Overview Centralized formulation of the pricing-service problem Decentralized model Stochastic models for pricing and service agents Coordination mechanism Pricing agent s problem Service agent s problem Main results Coordinating through transfer prices Computing transfer prices with info. sharing constraints Example 5
6 Problem description arrivals occur randomly at a rate determined by the prices dynamically controlled by (multiple) pricing agents. service times are random and determined by the dynamically controlled service rate net profits = revenue from arrivals - service costs 6
7 Standard approach: centralized model max E p( ),µ( ) T 0 m j=1 p j (t )dn j (T) T 0 c(x(t), µ(t))dt h(x(t)) dx(t) = m j=1 dn j (t) dj(t), x(0) = x 0 0 x(t) C, t [0, T], N i (t) λ i (p i (t )), J(t) µ(t ). Assumes there is and all-knowing dictator jointly controlling all pricing and service decisions. 7
8 It is possible to write down the dynamic programming equations 0 = V t (t, x)+max p,µ { c(x, µ)+µ ( V(t, x 1) V(t, x) )} i λ i (p i ) [ p i ( V(t, x) V(t, x+1) )] V(t, x) = h(x) but this is irrelevant if no single agent knows all demand models λ i (p i ) and the cost structure (c(x, µ), h(x)) and has the power to jointly control all decision variables. 8
9 Decentralized control 9
10 Pricing agent i controls p i (t) and service agent controls µ(t). Integrated system profits & dynamics: E T m 0 j=1 p j (t )dn j (T) T 0 c(x(t), µ(t))dt h(x(t)) dx(t) = m j=1 dn j (t) dj(t), x(0) = x 0 0 x(t) C, t [0, T], N i (t) λ i (p i (t )), J(t) µ(t ). Coupling comes through the state variable x(t). Key issue: what do pricing/service agents know and how do they make decisions? 10
11 Pricing agent controls p i (t) and knows demand function λ i (p i ). may not know the number of other agents, let alone their demand functions λ j (p j ), their pricing policies p j (t, x) or the aggregate arrival rate. may not know cost functions (c(x, µ), h(x)) for service agent or the service policy µ(t). observes x(t) but may not know its true dynamics. 11
12 Pricing agent s model where dx(t) = dn i (t) dñ i (t) d J i (t) N i (t) λ i (p i ) is the arrival process controlled by agent i (accurately specified) Ñ i (t) λ i (t) is agent i s model for arrivals brought by other agents (mis-specified) J i (t) µ i (t) is agent i s model for the service system (mis-specified) 12
13 Service agent Observes x(t) and assumes dynamics where dx(t) = dñ 0 (t) dj 0 (t) Ñ 0 (t) λ 0 (t) is the service agent s aggregate arrival process model; (mis-specified) J 0 (t) µ(t) is the service process (accurately specified). 13
14 Local optimality does not give global optimality Key observation: no choice of λ i (t) or µ i (t) result in a centrally optimal choice of p i (t). e.g. Pricing agent i s problem: max p i E T 0 p i(t )dn i (t) = E T dx(t) = dn i (t)+dñ i (t) d J i (t) 0 x(t) C 0 p i(t )λ i (p i (t ))dt N i (t) λ i (p i (t )), Ñ i (t) λ i (t), J i (t) µ i (t) Trivial case: when C = the pricing agent maximizes revenue rate p i λ i (p i ) which is not optimal for the system. 14
15 Transfer contracts and revenue sharing 15
16 Revenue transfer when pricing agent 2 makes a sale Service agent p2(t,x) S(t,x) Q1(t,x) Q3(t,x) Pricing agents Income for Agent 2 = p 2 (t, x(t )) Q j (t, x(t )) S(t, x(t )) j 2 16
17 Revenue transfer when service agent clears a customer Service agent C1(t,x) C3(t,x) C2(t,x) Pricing agents Income for service agent = m i=1 C i (t, x(t )) 17
18 Revenue sharing/transfer prices When pricing agent i brings in a customer and changes state from x x+1 at time t: pay Q j (t, x) to pricing agent j; pay S(t, x) to service agent. total cost: R i (t, x) = j iq j (t, x)+s(t, x) When service agent completes service and changes state from x x 1 at time t: receive C i (t, x) from pricing agent i. total revenue: R 0 (t, x) = j C j (t, x(t )) 18
19 Pricing agent s problem Given transfer prices C i, Q i and R i (t, x) = j iq j (t, x)+s(t, x) pricing agent i solves T [ E p i 0 T max + p i (t ) R i (t, x(t )) 0 Q i(t, x(t ))dñ i (t) T dx(t) = dn i (t)+dñ i (t) d J i (t) 0 x(t) C ] dn i (t) 0 C i(t, x(t ))d J i (t) N i (t) λ i (p i (t )), Ñ i (t) λ i (t), J i (t) µ i (t) } {{ } accurate } {{ } mis-specified 19
20 Service agent s problem Given transfer prices S and R 0 (t, x) = j C j (t, x(t )) service agent solves min E µ(t ) T T T c(x(t), µ(t))dt S(t, 0 0 x(t ))dñ 0 (t) 0 R 0(t, x(t ))dj 0 (t)+h(x(t)) dx(t) = dñ 0 (t) dj 0 (t), x(0) = x 0 0 x(t) C Ñ 0 (t) λ 0 (t), J 0 (t) µ(t) } {{ } mis-specified } {{ } accurate 20
21 Weak duality For any choice of transfer prices Centralized (opt.) profit {agent j s (max.) profit} service agents (min.) cost j (RHS depends on the choice of transfer contracts) Is it possible to choose transfer prices that achieve equality? What is the impact of model mis-specification by the agents? 21
22 Consider the transfer contracts Main Result: General system Q j (t, x) = V j (t, x) V j (t, x+1) S(t, x) = W(t, x+1) W(t, x) C j (t, x) = V j (t, x 1) V j (t, x) where 0 = d dt V i(t, x)+maxλ p i (p i ) {p i [ V(t, x) V(t, x+1) ]} i and V i (T, x) = 0 0 = d W(t, x)+min {c(x, µ) µ [ V(t, x) V(t, x 1) ]} dt µ W(T, x) = h(x) Note: V(t, x) is the value function for the centralized problem. 22
23 Contracts coordinate the system; Strong duality: Centralized (opt.) profit {agent j s (max.) profit} service agents (min.) cost = j Robustness to mis-specification: p j (t) is independent of agent j s assumptions about the other pricing agents and the service agent µ (t) is independent of the service agent s assumptions about the pricing agents. 23
24 Intuition: Optimal contracts W(t, x) is the value function for the service agent and V j (t, x) as the value function of pricing agent j under optimal TP s. When pricing agent j makes a sale: x x+1 W(t, x+1) W(t, x) = service agent s expected cost increase V i (t, x+1) V i (t, x) = exp. revenue decrease for p. agent i. When service agent clears a customer: x x 1 V i (t, x 1) V i (t, x)= value of extra space to p. agent i. 24
25 Under optimal transfer prices: when pricing agent i makes a sale: compensate other pricing agents their loss in revenue Q j (t, x) = V j (t, x) V j (t, x+1) compensate the service agent the cost of processing another customer S(t, x) = W(t, x+1) W(t, x) When service clears a customer, each pricing agent pays the service agent its valuation of the extra resource (1 spot) C j (t, x) = V j (t, x 1) V j (t, x) 25
26 Intuition: Robustness to mis-specification If an agent is compensated his valuation of the shared resource, he/she is indifferent as to whether it is used by another agent indifference to rate of consumption by others robustness to mis-specification. 26
27 Additional comments TP s summarize what is important about the other agents: Details such as the number of other agents and dynamics are not required. TP s can be interpreted as Lagrange multipliers for a stochastic dynamic resource allocation problem. Convexity was not used to establish strong duality. 27
28 Key gap: How do we compute transfer prices without needing to solve the centralized problem? i.e. how do we compute optimal contracts when there is no mechanism designer. 28
29 Recall: Contracts Computing the transfer prices Q j (t, x) = V j (t, x) V j (t, x+1) S(t, x) = W(t, x+1) W(t, x) C j (t, x) = V j (t, x 1) V j (t, x) where 0 = d dt V i(t, x)+maxλ p i (p i ) {p i [ V(t, x) V(t, x+1) ]} i and V i (T, x) = 0 0 = d W(t, x)+min {c(x, µ) µ [ V(t, x) V(t, x 1) ]} dt µ W(T, x) = h(x) Problem: Requires knowledge of the value function for the centralized problem. 29
30 A fixed point representation Q j (t, x) = V j (t, x) V j (t, x+1) S(t, x) = W(t, x+1) W(t, x) C j (t, x) = V j (t, x 1) V j (t, x) where 0 = d { W(t, x)+min c(x, µ) dt µ µ [ m j=1 + λ 0 (t, x) C j (t, x)+w(t, x) W(t, x 1) ]} { W(t, x+1) W(t, x) S(t, x) } W(T, x) = h(x) 30
31 A fixed point representation (cont ) 0 = d dt V i(t, x)+maxλ p i (p i ) {p i [ Q j (t, x)+s(t, x) ] i j i [ V i (t, x) V i (t, x+1) ]} + λ i (t, x) {Q i (t, x) [ V i (t, x) V i (t, x+1) ]} { } + µ i (t, x) V i (t, x 1) V i (t, x) C i (t, x) V i (T, x) = 0 31
32 Fixed point representation Find Q = [Q 1,, Q m ], C = [C 1,, C m ] and S such that: Q j (t, x) = V j (t, x) V j (t, x+1) S(t, x) = W(t, x+1) W(t, x) (C j (t, x) = V j (t, x 1) V j (t, x) = Q j (t, x 1)) where V 1 (t, x) value function for pricing agent 1. V m (t, x) value function for pricing agent m W(t, x) value function for service agent Natural idea: Iterate of Q, C and S. 32
33 Observations: The ODEs are DP equations for the pricing and service problems under contracts C, Q and S. The fixed point equation relates optimal contracts to value functions of the associated decentralized problems. 33
34 Interpretation as a negotiation current valuation Qi current cost S PAi Central computer SA new contracts new contracts 34
35 Main result Under standard technical assumptions, the algorithm converges to the optimal transfer prices. Comments: Assumes truth telling in reporting of valuations in each round. Iterative algorithm is analogous to updating dual variables. Convexity is not required for convergence or strong duality. (However, we assume each agent solves a DP at each iteration exactly) All that is communicated between agents are contracts constructed by aggregating transfer prices. (The number of agents and their models are not needed) 35
36 Example Single pricing and service agent; pricing agent: demand rate λ(p) = ae bp ; assumes µ 0 (mis-specified); service agent: costs c(x,µ) = kx+c(e dµ 1) and h(x) = sx; assumes λ 0 (mis-specified); a = 10,b = 1,c = 1,d = 1,k = 0.1,s = 1.3,C = 10,T = 10 36
37 Convergence check: V {V p n W n } as n 14 Convergence Check V 1 +V 2 V max iteration 37
38 Optimal revenue sharing functions Tax function for pricing agent x=2 x=6 R 1 (t,x) t When close to the terminal time T = 10 the pricing agent s tax converges to the cost of not serving the customer. In this case, h(x) = 1.3x (each unserved customer costs 1.3). 38
39 0.35 Bonus function for service agent R 2 (t,x) x=6 x= t When close to terminal time T = 10, the service agent receives less reward for clearing a customer. Intuition: The value of an extra space is of diminishing value as the selling time decreases. 39
40 Conclusion Coordination through transfer prices TP s summarize what is important about other agents. Iterative algorithm for computing TP s that only require exchange of valuations Drawbacks: Assumption of truth-telling when exchanging valuations Each agent needs to solve a DP ( approximations) Extensions: heterogeneous risk attitudes multi-agent portfolio choice problems airline alliances 40
Duality in General Programs. Ryan Tibshirani Convex Optimization 10-725/36-725
Duality in General Programs Ryan Tibshirani Convex Optimization 10-725/36-725 1 Last time: duality in linear programs Given c R n, A R m n, b R m, G R r n, h R r : min x R n c T x max u R m, v R r b T
Role of Stochastic Optimization in Revenue Management. Huseyin Topaloglu School of Operations Research and Information Engineering Cornell University
Role of Stochastic Optimization in Revenue Management Huseyin Topaloglu School of Operations Research and Information Engineering Cornell University Revenue Management Revenue management involves making
Nonlinear Optimization: Algorithms 3: Interior-point methods
Nonlinear Optimization: Algorithms 3: Interior-point methods INSEAD, Spring 2006 Jean-Philippe Vert Ecole des Mines de Paris [email protected] Nonlinear optimization c 2006 Jean-Philippe Vert,
Name: ID: Discussion Section:
Math 28 Midterm 3 Spring 2009 Name: ID: Discussion Section: This exam consists of 6 questions: 4 multiple choice questions worth 5 points each 2 hand-graded questions worth a total of 30 points. INSTRUCTIONS:
A Utility Maximization Example
A Utilit Maximization Example Charlie Gibbons Universit of California, Berkele September 17, 2007 Since we couldn t finish the utilit maximization problem in section, here it is solved from the beginning.
Spatial Decomposition/Coordination Methods for Stochastic Optimal Control Problems. Practical aspects and theoretical questions
Spatial Decomposition/Coordination Methods for Stochastic Optimal Control Problems Practical aspects and theoretical questions P. Carpentier, J-Ph. Chancelier, M. De Lara, V. Leclère École des Ponts ParisTech
An Improved Dynamic Programming Decomposition Approach for Network Revenue Management
An Improved Dynamic Programming Decomposition Approach for Network Revenue Management Dan Zhang Leeds School of Business University of Colorado at Boulder May 21, 2012 Outline Background Network revenue
6.231 Dynamic Programming and Stochastic Control Fall 2008
MIT OpenCourseWare http://ocw.mit.edu 6.231 Dynamic Programming and Stochastic Control Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 6.231
Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1
Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1 J. Zhang Institute of Applied Mathematics, Chongqing University of Posts and Telecommunications, Chongqing
Principles of demand management Airline yield management Determining the booking limits. » A simple problem» Stochastic gradients for general problems
Demand Management Principles of demand management Airline yield management Determining the booking limits» A simple problem» Stochastic gradients for general problems Principles of demand management Issues:»
constraint. Let us penalize ourselves for making the constraint too big. We end up with a
Chapter 4 Constrained Optimization 4.1 Equality Constraints (Lagrangians) Suppose we have a problem: Maximize 5, (x 1, 2) 2, 2(x 2, 1) 2 subject to x 1 +4x 2 =3 If we ignore the constraint, we get the
Moral Hazard. Itay Goldstein. Wharton School, University of Pennsylvania
Moral Hazard Itay Goldstein Wharton School, University of Pennsylvania 1 Principal-Agent Problem Basic problem in corporate finance: separation of ownership and control: o The owners of the firm are typically
MAT 274 HW 2 Solutions c Bin Cheng. Due 11:59pm, W 9/07, 2011. 80 Points
MAT 274 HW 2 Solutions Due 11:59pm, W 9/07, 2011. 80 oints 1. (30 ) The last two problems of Webwork Set 03 Modeling. Show all the steps and, also, indicate the equilibrium solutions for each problem.
Linear Programming Notes V Problem Transformations
Linear Programming Notes V Problem Transformations 1 Introduction Any linear programming problem can be rewritten in either of two standard forms. In the first form, the objective is to maximize, the material
On Adaboost and Optimal Betting Strategies
On Adaboost and Optimal Betting Strategies Pasquale Malacaria School of Electronic Engineering and Computer Science Queen Mary, University of London Email: [email protected] Fabrizio Smeraldi School of
Insurance. Michael Peters. December 27, 2013
Insurance Michael Peters December 27, 2013 1 Introduction In this chapter, we study a very simple model of insurance using the ideas and concepts developed in the chapter on risk aversion. You may recall
A New Quantitative Behavioral Model for Financial Prediction
2011 3rd International Conference on Information and Financial Engineering IPEDR vol.12 (2011) (2011) IACSIT Press, Singapore A New Quantitative Behavioral Model for Financial Prediction Thimmaraya Ramesh
10. Proximal point method
L. Vandenberghe EE236C Spring 2013-14) 10. Proximal point method proximal point method augmented Lagrangian method Moreau-Yosida smoothing 10-1 Proximal point method a conceptual algorithm for minimizing
Controlling the Internet in the era of Software Defined and Virtualized Networks. Fernando Paganini Universidad ORT Uruguay
Controlling the Internet in the era of Software Defined and Virtualized Networks Fernando Paganini Universidad ORT Uruguay CDS@20, Caltech 2014 Motivation 1. The Internet grew in its first 30 years with
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
Principle of Data Reduction
Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then
Two-Stage Stochastic Linear Programs
Two-Stage Stochastic Linear Programs Operations Research Anthony Papavasiliou 1 / 27 Two-Stage Stochastic Linear Programs 1 Short Reviews Probability Spaces and Random Variables Convex Analysis 2 Deterministic
14.451 Lecture Notes 10
14.451 Lecture Notes 1 Guido Lorenzoni Fall 29 1 Continuous time: nite horizon Time goes from to T. Instantaneous payo : f (t; x (t) ; y (t)) ; (the time dependence includes discounting), where x (t) 2
Alok Gupta. Dmitry Zhdanov
RESEARCH ARTICLE GROWTH AND SUSTAINABILITY OF MANAGED SECURITY SERVICES NETWORKS: AN ECONOMIC PERSPECTIVE Alok Gupta Department of Information and Decision Sciences, Carlson School of Management, University
A few algorithmic issues in data centers Adam Wierman Caltech
A few algorithmic issues in data centers Adam Wierman Caltech A significant theory literature on green computing has emerged over the last decade BUT theory has yet to have significant impact in practice.
Numerisches Rechnen. (für Informatiker) M. Grepl J. Berger & J.T. Frings. Institut für Geometrie und Praktische Mathematik RWTH Aachen
(für Informatiker) M. Grepl J. Berger & J.T. Frings Institut für Geometrie und Praktische Mathematik RWTH Aachen Wintersemester 2010/11 Problem Statement Unconstrained Optimality Conditions Constrained
A Randomized Linear Programming Method for Network Revenue Management with Product-Specific No-Shows
A Randomized Linear Programming Method for Network Revenue Management with Product-Specific No-Shows Sumit Kunnumkal Indian School of Business, Gachibowli, Hyderabad, 500032, India sumit [email protected]
Economics of Insurance
Economics of Insurance In this last lecture, we cover most topics of Economics of Information within a single application. Through this, you will see how the differential informational assumptions allow
Online Algorithms: Learning & Optimization with No Regret.
Online Algorithms: Learning & Optimization with No Regret. Daniel Golovin 1 The Setup Optimization: Model the problem (objective, constraints) Pick best decision from a feasible set. Learning: Model the
Reinforcement Learning
Reinforcement Learning LU 2 - Markov Decision Problems and Dynamic Programming Dr. Martin Lauer AG Maschinelles Lernen und Natürlichsprachliche Systeme Albert-Ludwigs-Universität Freiburg [email protected]
Recovery of primal solutions from dual subgradient methods for mixed binary linear programming; a branch-and-bound approach
MASTER S THESIS Recovery of primal solutions from dual subgradient methods for mixed binary linear programming; a branch-and-bound approach PAULINE ALDENVIK MIRJAM SCHIERSCHER Department of Mathematical
Envelope Theorem. Kevin Wainwright. Mar 22, 2004
Envelope Theorem Kevin Wainwright Mar 22, 2004 1 Maximum Value Functions A maximum (or minimum) value function is an objective function where the choice variables have been assigned their optimal values.
Lecture 2: Consumer Theory
Lecture 2: Consumer Theory Preferences and Utility Utility Maximization (the primal problem) Expenditure Minimization (the dual) First we explore how consumers preferences give rise to a utility fct which
Simulating Stochastic Differential Equations
Monte Carlo Simulation: IEOR E473 Fall 24 c 24 by Martin Haugh Simulating Stochastic Differential Equations 1 Brief Review of Stochastic Calculus and Itô s Lemma Let S t be the time t price of a particular
Stochastic Models for Inventory Management at Service Facilities
Stochastic Models for Inventory Management at Service Facilities O. Berman, E. Kim Presented by F. Zoghalchi University of Toronto Rotman School of Management Dec, 2012 Agenda 1 Problem description Deterministic
Economics 326: Duality and the Slutsky Decomposition. Ethan Kaplan
Economics 326: Duality and the Slutsky Decomposition Ethan Kaplan September 19, 2011 Outline 1. Convexity and Declining MRS 2. Duality and Hicksian Demand 3. Slutsky Decomposition 4. Net and Gross Substitutes
Optimal shift scheduling with a global service level constraint
Optimal shift scheduling with a global service level constraint Ger Koole & Erik van der Sluis Vrije Universiteit Division of Mathematics and Computer Science De Boelelaan 1081a, 1081 HV Amsterdam The
JANUARY 2016 EXAMINATIONS. Life Insurance I
PAPER CODE NO. MATH 273 EXAMINER: Dr. C. Boado-Penas TEL.NO. 44026 DEPARTMENT: Mathematical Sciences JANUARY 2016 EXAMINATIONS Life Insurance I Time allowed: Two and a half hours INSTRUCTIONS TO CANDIDATES:
Numerical methods for American options
Lecture 9 Numerical methods for American options Lecture Notes by Andrzej Palczewski Computational Finance p. 1 American options The holder of an American option has the right to exercise it at any moment
Scheduling Algorithms for Downlink Services in Wireless Networks: A Markov Decision Process Approach
Scheduling Algorithms for Downlink Services in Wireless Networks: A Markov Decision Process Approach William A. Massey ORFE Department Engineering Quadrangle, Princeton University Princeton, NJ 08544 K.
12.5: CHI-SQUARE GOODNESS OF FIT TESTS
125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability
General Theory of Differential Equations Sections 2.8, 3.1-3.2, 4.1
A B I L E N E C H R I S T I A N U N I V E R S I T Y Department of Mathematics General Theory of Differential Equations Sections 2.8, 3.1-3.2, 4.1 Dr. John Ehrke Department of Mathematics Fall 2012 Questions
GI01/M055 Supervised Learning Proximal Methods
GI01/M055 Supervised Learning Proximal Methods Massimiliano Pontil (based on notes by Luca Baldassarre) (UCL) Proximal Methods 1 / 20 Today s Plan Problem setting Convex analysis concepts Proximal operators
The Optimal Growth Problem
LECTURE NOTES ON ECONOMIC GROWTH The Optimal Growth Problem Todd Keister Centro de Investigación Económica, ITAM [email protected] January 2005 These notes provide an introduction to the study of optimal
High-frequency trading in a limit order book
High-frequency trading in a limit order book Marco Avellaneda & Sasha Stoikov October 5, 006 Abstract We study a stock dealer s strategy for submitting bid and ask quotes in a limit order book. The agent
1 Introduction. Linear Programming. Questions. A general optimization problem is of the form: choose x to. max f(x) subject to x S. where.
Introduction Linear Programming Neil Laws TT 00 A general optimization problem is of the form: choose x to maximise f(x) subject to x S where x = (x,..., x n ) T, f : R n R is the objective function, S
15 Kuhn -Tucker conditions
5 Kuhn -Tucker conditions Consider a version of the consumer problem in which quasilinear utility x 2 + 4 x 2 is maximised subject to x +x 2 =. Mechanically applying the Lagrange multiplier/common slopes
A NEW LOOK AT CONVEX ANALYSIS AND OPTIMIZATION
1 A NEW LOOK AT CONVEX ANALYSIS AND OPTIMIZATION Dimitri Bertsekas M.I.T. FEBRUARY 2003 2 OUTLINE Convexity issues in optimization Historical remarks Our treatment of the subject Three unifying lines of
Course: Model, Learning, and Inference: Lecture 5
Course: Model, Learning, and Inference: Lecture 5 Alan Yuille Department of Statistics, UCLA Los Angeles, CA 90095 [email protected] Abstract Probability distributions on structured representation.
Gambling and Data Compression
Gambling and Data Compression Gambling. Horse Race Definition The wealth relative S(X) = b(x)o(x) is the factor by which the gambler s wealth grows if horse X wins the race, where b(x) is the fraction
A Log-Robust Optimization Approach to Portfolio Management
A Log-Robust Optimization Approach to Portfolio Management Dr. Aurélie Thiele Lehigh University Joint work with Ban Kawas Research partially supported by the National Science Foundation Grant CMMI-0757983
Functional Optimization Models for Active Queue Management
Functional Optimization Models for Active Queue Management Yixin Chen Department of Computer Science and Engineering Washington University in St Louis 1 Brookings Drive St Louis, MO 63130, USA [email protected]
17.6.1 Introduction to Auction Design
CS787: Advanced Algorithms Topic: Sponsored Search Auction Design Presenter(s): Nilay, Srikrishna, Taedong 17.6.1 Introduction to Auction Design The Internet, which started of as a research project in
Super-Agent Based Reputation Management with a Practical Reward Mechanism in Decentralized Systems
Super-Agent Based Reputation Management with a Practical Reward Mechanism in Decentralized Systems Yao Wang, Jie Zhang, and Julita Vassileva Department of Computer Science, University of Saskatchewan,
CHAPTER 9. Integer Programming
CHAPTER 9 Integer Programming An integer linear program (ILP) is, by definition, a linear program with the additional constraint that all variables take integer values: (9.1) max c T x s t Ax b and x integral
Optimal Income Taxation with Multidimensional Types
1 Optimal Income Taxation with Multidimensional Types Kenneth L. Judd Hoover Institution NBER Che-Lin Su Northwestern University (Stanford Ph. D.) PRELIMINARY AND INCOMPLETE November, 2006 2 Introduction
Date: April 12, 2001. Contents
2 Lagrange Multipliers Date: April 12, 2001 Contents 2.1. Introduction to Lagrange Multipliers......... p. 2 2.2. Enhanced Fritz John Optimality Conditions...... p. 12 2.3. Informative Lagrange Multipliers...........
Government Debt Management: the Long and the Short of it
Government Debt Management: the Long and the Short of it E. Faraglia (U. of Cambridge and CEPR) A. Marcet (IAE, UAB, ICREA, BGSE, MOVE and CEPR), R. Oikonomou (U.C. Louvain) A. Scott (LBS and CEPR) ()
Statistics 100A Homework 8 Solutions
Part : Chapter 7 Statistics A Homework 8 Solutions Ryan Rosario. A player throws a fair die and simultaneously flips a fair coin. If the coin lands heads, then she wins twice, and if tails, the one-half
Hydrodynamic Limits of Randomized Load Balancing Networks
Hydrodynamic Limits of Randomized Load Balancing Networks Kavita Ramanan and Mohammadreza Aghajani Brown University Stochastic Networks and Stochastic Geometry a conference in honour of François Baccelli
Integrating Benders decomposition within Constraint Programming
Integrating Benders decomposition within Constraint Programming Hadrien Cambazard, Narendra Jussien email: {hcambaza,jussien}@emn.fr École des Mines de Nantes, LINA CNRS FRE 2729 4 rue Alfred Kastler BP
SINGLE-STAGE MULTI-PRODUCT PRODUCTION AND INVENTORY SYSTEMS: AN ITERATIVE ALGORITHM BASED ON DYNAMIC SCHEDULING AND FIXED PITCH PRODUCTION
SIGLE-STAGE MULTI-PRODUCT PRODUCTIO AD IVETORY SYSTEMS: A ITERATIVE ALGORITHM BASED O DYAMIC SCHEDULIG AD FIXED PITCH PRODUCTIO Euclydes da Cunha eto ational Institute of Technology Rio de Janeiro, RJ
On the Interaction and Competition among Internet Service Providers
On the Interaction and Competition among Internet Service Providers Sam C.M. Lee John C.S. Lui + Abstract The current Internet architecture comprises of different privately owned Internet service providers
Inflation. Chapter 8. 8.1 Money Supply and Demand
Chapter 8 Inflation This chapter examines the causes and consequences of inflation. Sections 8.1 and 8.2 relate inflation to money supply and demand. Although the presentation differs somewhat from that
REVIEW OF MICROECONOMICS
ECO 352 Spring 2010 Precepts Weeks 1, 2 Feb. 1, 8 REVIEW OF MICROECONOMICS Concepts to be reviewed Budget constraint: graphical and algebraic representation Preferences, indifference curves. Utility function
The Multicut L-Shaped Method
1 / 25 The Multicut L-Shaped Method Operations Research Anthony Papavasiliou Contents 2 / 25 1 The Multicut L-Shaped Method [ 5.1 of BL] 2 Example 3 Example: Capacity Expansion Planning Table of Contents
Lecture 11. 3.3.2 Cost functional
Lecture 11 3.3.2 Cost functional Consider functions L(t, x, u) and K(t, x). Here L : R R n R m R, K : R R n R sufficiently regular. (To be consistent with calculus of variations we would have to take L(t,
CHAPTER 14. Z(t; i) + Z(t; 5),
CHAPER 14 FIXED-INCOME PRODUCS AND ANALYSIS: YIELD, DURAION AND CONVEXIY 1. A coupon bond pays out 3% every year, with a principal of $1 and a maturity of five years. Decompose the coupon bond into a set
Manual for SOA Exam MLC.
Chapter 4. Life Insurance. Extract from: Arcones Manual for the SOA Exam MLC. Fall 2009 Edition. available at http://www.actexmadriver.com/ 1/14 Level benefit insurance in the continuous case In this chapter,
Algorithmic Mechanism Design for Load Balancing in Distributed Systems
In Proc. of the 4th IEEE International Conference on Cluster Computing (CLUSTER 2002), September 24 26, 2002, Chicago, Illinois, USA, IEEE Computer Society Press, pp. 445 450. Algorithmic Mechanism Design
Lecture notes on Moral Hazard, i.e. the Hidden Action Principle-Agent Model
Lecture notes on Moral Hazard, i.e. the Hidden Action Principle-Agent Model Allan Collard-Wexler April 19, 2012 Co-Written with John Asker and Vasiliki Skreta 1 Reading for next week: Make Versus Buy in
A Simple Model of Price Dispersion *
Federal Reserve Bank of Dallas Globalization and Monetary Policy Institute Working Paper No. 112 http://www.dallasfed.org/assets/documents/institute/wpapers/2012/0112.pdf A Simple Model of Price Dispersion
Health Insurance and Retirement Incentives
Health Insurance and Retirement Incentives Daniele Marazzina Joint work with Emilio Barucci and Enrico Biffis Emilio Barucci and Daniele Marazzina Dipartimento di Matematica F. Brioschi Politecnico di
Distributed Machine Learning and Big Data
Distributed Machine Learning and Big Data Sourangshu Bhattacharya Dept. of Computer Science and Engineering, IIT Kharagpur. http://cse.iitkgp.ac.in/~sourangshu/ August 21, 2015 Sourangshu Bhattacharya
Volatility, Productivity Correlations and Measures of. International Consumption Risk Sharing.
Volatility, Productivity Correlations and Measures of International Consumption Risk Sharing. Ergys Islamaj June 2014 Abstract This paper investigates how output volatility and productivity correlations
How To Play A Static Bayesian Game
Advanced Microeconomics Static Bayesian games Harald Wiese University of Leipzig Harald Wiese (University of Leipzig) Advanced Microeconomics / 7 Part E. Bayesian games and mechanism design Static Bayesian
Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection
Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.
1 Uncertainty and Preferences
In this chapter, we present the theory of consumer preferences on risky outcomes. The theory is then applied to study the demand for insurance. Consider the following story. John wants to mail a package
Supplement to Call Centers with Delay Information: Models and Insights
Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290
Systems with Persistent Memory: the Observation Inequality Problems and Solutions
Chapter 6 Systems with Persistent Memory: the Observation Inequality Problems and Solutions Facts that are recalled in the problems wt) = ut) + 1 c A 1 s ] R c t s)) hws) + Ks r)wr)dr ds. 6.1) w = w +
Nonlinear Programming Methods.S2 Quadratic Programming
Nonlinear Programming Methods.S2 Quadratic Programming Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard A linearly constrained optimization problem with a quadratic objective
Economics 200B Part 1 UCSD Winter 2015 Prof. R. Starr, Mr. John Rehbeck Final Exam 1
Economics 200B Part 1 UCSD Winter 2015 Prof. R. Starr, Mr. John Rehbeck Final Exam 1 Your Name: SUGGESTED ANSWERS Please answer all questions. Each of the six questions marked with a big number counts
Linear Programming Notes VII Sensitivity Analysis
Linear Programming Notes VII Sensitivity Analysis 1 Introduction When you use a mathematical model to describe reality you must make approximations. The world is more complicated than the kinds of optimization
24. The Branch and Bound Method
24. The Branch and Bound Method It has serious practical consequences if it is known that a combinatorial problem is NP-complete. Then one can conclude according to the present state of science that no
Summer course on Convex Optimization. Fifth Lecture Interior-Point Methods (1) Michel Baes, K.U.Leuven Bharath Rangarajan, U.
Summer course on Convex Optimization Fifth Lecture Interior-Point Methods (1) Michel Baes, K.U.Leuven Bharath Rangarajan, U.Minnesota Interior-Point Methods: the rebirth of an old idea Suppose that f is
Politecnico di Torino, Short Course on: Optimal Control Problems: the Dynamic Programming Approach
Politecnico di Torino, Short Course on: Optimal Control Problems: the Dynamic Programming Approach Fausto Gozzi Dipartimento di Economia e Finanza Università Luiss - Guido Carli, viale Romania 32, 00197
