Computing Near Optimal Strategies for Stochastic Investment Planning Problems
|
|
|
- Annice Walters
- 10 years ago
- Views:
Transcription
1 Computing Near Optimal Strategies for Stochastic Investment Planning Problems Milos Hauskrecfat 1, Gopal Pandurangan 1,2 and Eli Upfal 1,2 Computer Science Department, Box 1910 Brown University Providence, RI02912, U.S.A. Abstract We present efficient techniques for computing near optimal strategies for a class of stochastic commodity trading problems modeled as Markov decision processes (MDPs). The process has a continuous state space and a large action space and cannot be solved efficiently by standard dynamic programming methods. We exploit structural properties of the process, and combine it with Monte- Carlo estimation techniques to obtain novel and efficient algorithms that closely approximate the optimal strategies. 1 Introduction Investment is an act of incurring immediate cost in the expectation of future rewards. Investments options are typically characterized by three parameters: the initial and accumulated costs, the uncertainty over the future rewards, and the leeway in timing the actions. Investors try to optimize their decision with respect to these parameters. Modern economics theory models the uncertainty of future rewards as a stochastic process defining future price curves. The process is typically an Ito process [Dixit and Pindyck, 1994] that is Markovian, thus investment decision can be modeled as a Markov decision process (MDP) where a state of the underlying process needs only to include the current investment portfolio and current prices. While the MDP gives a succinct formalization of the investment decision processes it does not necessarily imply efficient algorithms for computing optimal strategies. The Markov decision process has a continuous state space and a potentially large action space and cannot be solved efficiently by standard dynamic programming methods. A challenging goal in this research area is to characterize special cases of the general investment paradigm that are interesting enough from the application point of view while simple enough to allow efficiently computable analytic solutions. In this work we focus on stochastic planning in the context of commodity trading. Commodity can be bought, stored and eventually sold. In addition to the initial cost of buying the 1 Supported by a research grant from Goldman Sachs. Supported in part by NSF grant CCR commodity, the investment decision must take into account the accumulated cost of storage till the commodity is sold. A standard assumption in mathematical economics is that commodity prices (e.g oil and copper) are best modeled as a mean re verting stochastic process. We study several versions of the commodity trading problem. In the simplest version we assume that there are no restrictions on the amount of commodity that an investor can buy, store or sell at a given time. We give an efficiently computable optimal analytic solution for that case. The problem becomes significantly harder when we turn to a more realistic setting in which there are constrains on the amount of commodity that a trader can buy store and in particular sell at a given time. We present an efficient Monte- Carlo technique that generates an approximate optimal trading policy with volume and storage constrains. There has been extensive research in Ai in recent years on solving MDPs with large state spaces exploiting specific problem structures, in particular through factoring and decompositions [Boutilier et a/., 1995; Dearden and Boutilier, 1997; Dean and Lin, 1995; Meuleau et a/., 1998]. However, all these work assume finite or at least discrete state space. Our solution relies on structure analysis of the problem combined with a Monte-Carlo approximation technique. We show that when prices follow a mean-reverting process, the optimal policy for the constrained commodity trading problem has an elegant compact parametric form. This observation allows for a significant reduction in the space of possible optimal policies. Furthermore, we show that individual parameters, which represent a stack of decision thresholds, can be optimized incrementally thus further reducing the complexity of the problem-solving task. 2 The Model Mathematical economics models commodity price fluctuations as an Ito mean-reverting process (Ornstein-Uhlenbeck process) [Dixit and Pindyck, 1994], where changes in price satisfy a stochastic differential equation such that is the (random) increment of a Wiener process (Brownian motion with normally distributed increments), is the long term average price of the commodity i.e. a value to which the process reverts, is the speed of reversion, and is the standard deviation of the random component UNCERTAINTY AND PROBABILISTIC REASONING
2 Trading occurs at discrete time steps, and price fluctuation between two consecutive steps is modeled by the discrete version {Dixit and Pindyck, 1994]: where is a sequence of independent random variables with normal distribution The agent (investor) can buy and sell the commodity at different time steps. There is a fixed cost for storing a unit of community for a unit time interval. The cost of money (the interest rate) is denoted by We assume that the volume of commodity traded by the agent is small with respect to the total market of that commodity, so that the agent's strategy does not affect the above parameters. In the more general setting there are limits on the number of units a trader can buy and sell in every step, and on the total number of units that can be stored (denoted respectively). Objectives Our goal is to determine the best, profit maximizing, trading strategy for a given initial state, which is determined by the current price and amount of commodity the agent owns. We use the standard valuation method, the expected net present value (NPV) function (see e.g. [Brealey and Myers, 1991; Trigeorgis, 19961)): where s denotes an initial state, is a strategy, is a discount factor, is the decision horizon, and M t is cash flow at time Thus our goal is to find such that (s) is maximized. The objective function corresponds to discounted versions of standard finite or infinite horizon criteria (see [Bellman, 1957; Puterman, 1994]) where rewards correspond to cash flows. In the following we focus on the infinite horizon investment problem. The trading strategy for this case is represented as mapping the current price pt and number of units of commodity we own to the number of units to be held in the next step The results for the infinite trading horizon can be adopted to the finite horizon problem. Example Our model fits a variety of applications. A typical example is oil-trading problem. Oil can be bought and sold in the market. The changes of oil prices on the market follow the mean reverting process (Dixit and Pindyck, 1994] and individual trading activities do not affect the market price. Oil can be stored at a fixed cost per unit, and there are obvious constraints on the amounts that can be stored and released in short time intervals. 3 Standard approach and its shortcomings The value (expected NPV) of a strategy can be expressed using the Bellman equation [Bellman, 1957]: 0) where a state is a cross-product of a commodity price and the amount of commodity held denotes the expected one-step cash flow resulting from action maps the current commodity holdings to the next state commodity holdings under the action dictated by the policy and denotes a distribution of next state prices (a normal distribution defined by Equation 1). In general it is extremely hard to evaluate a given strategy because of the size of a state space we face (in one step we can reach any price) and there is no closed form solution to the integral part. The task of finding the optimal policy is even worse as the space of policies we need to choose from is even larger. Thus, standard dynamic programming approaches cannot be applied directly and we must resort to methods that exploit the problem structure. 4 Trading without constraints To solve the case with no limits on buy, sell and store activities, it is sufficient to study the problem of investing one unit of commodity, since the strategy for one unit can be replicated for any number of units. A strategy for the one-unit problem at any point in time is restricted to only two choices: 0 (do not invest, do not hold the commodity) and 1 (hold, invest in the commodity). Because buy and sell prices are always the same and prices change independently of our trading, the expected net present value of a one-unit strategy can be expressed in terms of step-wise gains: where is the gain from time step is a fixed storage cost, if the strategy chooses to hold the commodity at time and when not. Intuitively, we can replicate payoffs from any strategy by always selling the commodity held at the end of the step and buy it back when the strategy requires us to hold the commodity in the next time step. The term reflects the fact that we always sell (pretend to sell) the initial commodity at price Thus, we can restrict our search to policies of the form mapping prices to actions. The value function for such a policy can be rewritten as: where is the value function for the policystarting from a zero commodity state. satisfies is the expected gain for holding the commodity for one step starting from price denotes the next step price, is a storage cost and is the expected next step price. A nice feature of Equation 2 is that the contribution from following the policy in the future (integral part) is independent of the action choice (2) HAUSKRECHT, PANDURANGAN, AND UPFAL 1311
3 in the first step. This leads to the fact that the greedy one-step lookahead strategy is optimal. 1 Theorem 1 The strategy n maximizingthe expected one-step gain is optimal. Proof Follows directly from Equation 2 and the independence of future values on current actions. The value of the optimal one-unit policy can be written as: with denoting the expected gain for holding zero units of commodity. The optimal policy satisfies: The important thing about this is that we know how to find the optimal policy easily and this despite the fact that its value is hard to compute. 4.1 Decision threshold As the expected gain from not holding the commodity is 0, the choice to hold the commodity is justified only when that is, when the expected gain is positive. Substituting using equation 1, we get the condition for choosing the "hold" action: Thus, we obtain a simple and compact policy: Hold the commodity when the price is lower than Note that the optimal threshold price depends solely on the parameters of the mean reverting process and discount 5 Trading with constraints We consider three types of constraints: buy, sell and store limits. We start with buy and store constraints. 5.1 Buy and store constraints The buy and store constraints are relatively easy to handle when no restriction on the sell is imposed Simply, we always want to take the maximum advantage of a positive gain for investing in the next step. Thus, we always hold the maximum amount of commodity allowed by the buy and store limits, whenever the optimal one-unit policy recommends to hold. That is: (3) (4) 5.2 Sell limit problem The sell limit is tricky, since.trader may not be able to immediately realize all gains if he holds more units than the sell limit A naive solution would be to limit the holding to the sell limit, but such policy is obviously sub-optimal. In studying this case we can assume w.l.o.g. that the sell limit is 1. The solution for an arbitrary sell constraint 1 is then obtained by replicating the optimal strategy with sell limit 1. In the following we first study a special case in which a trader is allowed to hold at most 2 units of commodity (with sell constraint 1) and find the corresponding optimal policy. Later we generalize the idea to any number of units. 5.3 Trading two units under sell limit To solve the two units case we restrict our attention to strategies that always try to trade the first unit using the optimal one-unit strategy. Then our goal is to simply find how and when to invest in the second commodity. We show later that this approach indeed leads to the optimal solution. Let be a set of policies mapping current prices to commodity choices, such that always follows the optimal one-unit strategy for the first unit and chooses to hold the second unit arbitrarily but only when Thus is described as: Note that a strategy may not be directly applicable to the two-unit case and can violate the sell capacity. This happens when recommends to hold two units of the commodity for some price in one step and is forced to recommend 0 units when the next step price jumps above To fix this problem we define a set of policies II 2, such that for every there is a policy mapping current price and current commodity holding to the next step commodity choice and is defined as: (5) The idea behind TI2 is that it replicates when it is consistent with sell constraints, otherwise it recommends to reduce the number of units of commodity held by 1. Because induces we call it a generating policy. To find the optimal investment rule for the second unit we try to quantify its added value. To do this, we create a strategy that recommends to hold two units of commodity when and this only for the first step, otherwise it follows the optimal one-unit policy. The value for such a policy for price and zero units of commodity in terms of gains is: denote buy and store constraints. 1 We note the optimality of a greedy one step lookahead strategy holds not only for the mean reverting process, but also for a more general Ito process. Note that due to the sell limit the second unit of commodity is always held also in the second step. This is captured by the term The key trick now is to rewrite for 1312 UNCERTAINTY AND PROBABILISTIC REASONING
4 where is the expected added gain for investing in the second unit of commodity for represents the difference from investing in the second unit of commodity at compared to the optimal one-unit strategy. It consists of two terms: the expected gain from holding the commodity for one step plus a correction term (a kind of expected loss or negative gain) for holding it one more step in the case we would like to sell it but sell constraint does not allow us to do that. Note that depends solely on the price of a commodity and also The other important feature is that by substituting (p, 0) from Equation 3, the value equals: which means that the integral part of the expression now becomes independent of the commodity held in the first step and disregards any units we were not able to sell. Intuitively, under expectations, the term allows us to pretend that we were able to sell both units at the end of the previous step without restriction, though in reality when the price climbs above we have to keep one unit and sell it later when the sell capacity is available. Using the fact that allows us to disregard any commodity left from previous steps we can express the value function for an arbitrary policy as For zero units of commodity the policy always equals its generating (see equation 5) and therefore (6) The theorem shows how to get the optimal policy from TI for the zero commodity start state. This policy can be applied also to the non-zero commodity state in a straightforward way. Up to this point we know how to find the best (optimal) policy from This policy is no worse than that trades only one unit (simply because However, this does not necessarily imply that the optimal two-unit strategy is also globally optimal. Theorem 3 is the optimal two-unit strategy. Proof To prove this we need to show that the optimal policy always resides in The opposite can happen only when: (1) the optimal strategy is not incremental (the optimal choice for the first unit does not imply the optimal choice for two units case) and (2) the optimal strategy depends on c, such that the relation is not captured by Equation 5 and relation). The incremental property follows from the fact that the maximum gain we can capture by any unit is which equals the expected gain for the first unit. Thus the policy must always trade the first unit optimally. Using the incremental result, the relation in Equation 5 can be violated only when the globally optimal policy at some point recommends to hold 2 units and does not, or vice versa. However, this cannot happen as it would mean that we choose to hold the second unit when is negative or not to hold it when it is positive. Thus must be the optimal policy. Threshold price for the second unit The optimal two-unit strategy says that we want to invest into the second unit only when The question now is if we can come up with a compact representation of this condition, similarly to the threshold price for the one-unit case. Indeed, we can show that if ever becomes positive, there is a unique threshold value such that when then is guaranteed. This follows directly from the monotonicity of which we prove next. Theorem 4 For it holds Proof In order to prove it is sufficient to show, using Equation 6, that for This means that if we want to find the optimal we can do it by simply finding the optimal from from Theorem 2 The optimal two-unit strategy maximizing is defined by a strategy maximizing Proof Using Equation 7, and the fact that the integral part becomes independent of the amount of commodity we actually held in the previous step, we can maximize the value of a policy by maximizing the sum of expected gains. That is: hold. This is trivial and follows from the monotonicity of and the fact that and define normal distributions with means and the same standard deviation. The main consequence of being monotonically decreasing is that there is a unique zero point such that for any 0. Therefore, the optimal generating strategy for trading two units of commodity (with the sell constraint 1) can be defined compactly using a set of threshold prices such that HAUSKRECHT, PANDURANGAN, AND UPFAL 1313
5 The optimal strategy is then compactly represented as: 5.4 Trading -units under sell limit In principle the same ideas as used for the two-unit case can be applied to find the optimal strategy for trading at most units of commodity. Such a policy is guaranteed to be no worse than the policy for trading smaller number of units. We summarize the main results and conclusions, the detailed analysis is deferred to the full paper. The expected added gain from holding unit of commodity is defined recursively as: A nice property of is that it is monotonically decreasing in p and The optimal generating policy equals: can be described compactly using a set of threshold values such that: where thresholds are unique zero points The optimal policy for the sell limit 1 can be then compactly represented as: 5.5 Solution for buy, sell and store limits The optimal policy for at most units of commodity and the sell constraint can be derived directly from the solution for the sell limit 1: Adding buy and store constraints to this result is easy and results in the following policy: 6 Finding optimal thresholds (8) The optimal policy for units can be represented compactly using a set of threshold prices The threshold price for unit is the zero of the the expected added gain for the unit = 0. As the value of depends only on threshold prices the set of threshold prices can be built incrementally. The main problem in this process is that there is no closed form solution for finding the zero point of Figure 1: The optimal generating policy with sell limit 1, 100, 0.6, 10, , 0.2. Stepwise changes reflect positions of thresholds. is the only exception). Thus, in order to find the threshold values for we resort to stochastic (Monte Carlo) approximation techniques (see [Kushner and Yin, 1997]). In particular we use the Robbins-Monro scheme, which finds the zero of the function iteratively using the update where, is the price value at the iteration, is the "noisy" estimate of the function at obtained by Monte Carlo sampling from Equation 8. The sequence e is used to "average out" the "noisy" estimates from the Monte-Carlo sampling. We use the standard sequence, which converges (under reasonable assumptions) to the zero value with probability one. Since is monotonous there is a unique root such that [Kushner and Yin, 1997] also give more details on the rate of convergence. The process for finding new thresholds can be applied incrementally to find the solution for an arbitrary number of units. The issue that remains open is that there can be an infinite number of thresholds to define the complete solution. However, thresholds for larger values of are less likely to be used, as they cover the range of prices with extremely small chance of occurrence. Thus if the initial price is in a reasonable range more complex policies tend to contribute less. To provide for more robustness, we propose an on-line algorithm that keeps building thresholds on the demand basis. That is, only when price encountered in not covered by a current set of thresholds, we start to work on thresholds for higher values of Note that this algorithm can be further refined into an anytime scheme [Dean, 1991], suitable for time critical settings. 7 Experimental results We have tested our approach on several sets of parameters. Figure 1 illustrates a typical policy. To find the zero points of expected added gain functions, we used the basic 1314 UNCERTAINTY AND PROBABILISTIC REASONING
6 Expected gains addressed include the theoretical bound on the number of thresholds necessary to guarantee the near optimal strategy, and exploration of more compact parametric representations of the policy, mapping prices to the number Of units to hold directly (these would eliminate the need to remember all threshold values). Finally, a number of interesting problems will arise if we relax some of the current assumptions of the model. Possible refinements may include price spreads, concurrent trading at multiple interconnected sites, or demand/supply sensitive price models. Acknowledgement We wish to thank Oliver Frankel of Goldman Sachs for introducing us to this problem and for valuable technical discussions. Figure 2: Average NPV for strategies with varying number of thresholds and different initial prices. Robbins-Monro algorithm for 10,000 steps. We use the average of 100 runs. The thresholds were computed incrementally starting from The interesting observation is that for larger values of thresholds become about equally spaced. In such a situation the policy can be more compactly represented using a linear function, directly mapping prices to recommended commodity holdings. The effect of strategy refinement (adding more thresholds and potentially investing more units) on the quality of the solution is illustrated in Figure 2. The results show average profit over 1000 runs for different policies and different starting prices. We see that more complex policies with more thresholds tend to improve the profit performance initially. This effect becomes less significant for larger and expected profits converge to the optimal value Since the prices reach lower values very rarely, the probability of using thresholds for larger is very small and corresponding more complex policies are only rarely utilized. 8 Conclusion We have presented an efficient method for finding the near optimal policies for the commodity planning problem with trading and storage constraints. The problem has continuous state space and large action space and cannot be solved efficiently by standard dynamic programming solutions. Our solution relies on the analysis of the domain and takes advantage of a problem structure and Monte Carlo approximation techniques. We showed that the optimal policy for the commodity trading problem with a mean-reverting price model and trading constraints can be represented compactly using a stack of decision (price) thresholds for investing in additional units of commodity. The thresholds correspond to zero points of expected added gain functions, which we also derived. As these functions do not have analytical solution, we apply Monte Carlo approximation techniques to find their zero points. The properties of the functions guarantee uniqueness of the solution and convergence of the approximation scheme. Interesting questions and opened issues that remain to be References [Bellman, 1957] Richard E. Bellman. Dynamic Programming. Princeton University Press, Princeton, [Boutilier et al., 1995] Craig Boutilier, Richard Dearden, and Moises Goldszmidt. Exploiting structure in policy construction. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pages , Montreal, [Brealey and Myers, 1991] Richard A. Brealey and Stewart C. Myers. Principles of Corporate Finance. McGraw- Hill, [Dean and Lin, 1995] Thomas Dean and Shieu-Hong Lin. Decomposition techniques for planning in stochastic domains. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pages , Montreal, [Dean, 1991] Thomas Dean. Decision-theoretic control of inference for time-critical applications. International journal of Intelligent Systems, 6: ,1991. [Dearden and Boutilier, 1997] Richard Dearden and Craig Boutilier. Abstraction and approximate decision theoretic planning. Artificial Intelligence, 89: ,1997. [Dixit and Pindyck, 1994] Avinash K. Dixit and Robert S. Pindyck. Investment under Uncertainty. Princeton University Press, Princeton, [Kushner and Yin, 1997] Kushner and Yin. Stochastic Approximation Algorithms and Applications. Springer- Verlag, New York, [Meuleau et al., 1998] Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, Leonid Peshkin, Leslie Pack Kaelbling, Thomas Dean, and Craig Boutilier. Solving very large weakly coupled markov decision processes. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages , Madison, [Puterman, 1994] Martin L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, [Trigeorgis, 1996] Lenos Trigeorgis. Real Options. MIT Press, Cambridge, HAUSKRECHT, PANDURANGAN, AND UPFAL 1315
Using Markov Decision Processes to Solve a Portfolio Allocation Problem
Using Markov Decision Processes to Solve a Portfolio Allocation Problem Daniel Bookstaber April 26, 2005 Contents 1 Introduction 3 2 Defining the Model 4 2.1 The Stochastic Model for a Single Asset.........................
Maximum likelihood estimation of mean reverting processes
Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. [email protected] Abstract Mean reverting processes are frequently used models in real options. For
Hedging Illiquid FX Options: An Empirical Analysis of Alternative Hedging Strategies
Hedging Illiquid FX Options: An Empirical Analysis of Alternative Hedging Strategies Drazen Pesjak Supervised by A.A. Tsvetkov 1, D. Posthuma 2 and S.A. Borovkova 3 MSc. Thesis Finance HONOURS TRACK Quantitative
1 Introduction to Option Pricing
ESTM 60202: Financial Mathematics Alex Himonas 03 Lecture Notes 1 October 7, 2009 1 Introduction to Option Pricing We begin by defining the needed finance terms. Stock is a certificate of ownership of
Using simulation to calculate the NPV of a project
Using simulation to calculate the NPV of a project Marius Holtan Onward Inc. 5/31/2002 Monte Carlo simulation is fast becoming the technology of choice for evaluating and analyzing assets, be it pure financial
Monte Carlo Methods in Finance
Author: Yiyang Yang Advisor: Pr. Xiaolin Li, Pr. Zari Rachev Department of Applied Mathematics and Statistics State University of New York at Stony Brook October 2, 2012 Outline Introduction 1 Introduction
An Incomplete Market Approach to Employee Stock Option Valuation
An Incomplete Market Approach to Employee Stock Option Valuation Kamil Kladívko Department of Statistics, University of Economics, Prague Department of Finance, Norwegian School of Economics, Bergen Mihail
A Sarsa based Autonomous Stock Trading Agent
A Sarsa based Autonomous Stock Trading Agent Achal Augustine The University of Texas at Austin Department of Computer Science Austin, TX 78712 USA [email protected] Abstract This paper describes an autonomous
When to Refinance Mortgage Loans in a Stochastic Interest Rate Environment
When to Refinance Mortgage Loans in a Stochastic Interest Rate Environment Siwei Gan, Jin Zheng, Xiaoxia Feng, and Dejun Xie Abstract Refinancing refers to the replacement of an existing debt obligation
MATHEMATICS OF FINANCE AND INVESTMENT
MATHEMATICS OF FINANCE AND INVESTMENT G. I. FALIN Department of Probability Theory Faculty of Mechanics & Mathematics Moscow State Lomonosov University Moscow 119992 [email protected] 2 G.I.Falin. Mathematics
EC824. Financial Economics and Asset Pricing 2013/14
EC824 Financial Economics and Asset Pricing 2013/14 SCHOOL OF ECONOMICS EC824 Financial Economics and Asset Pricing Staff Module convenor Office Keynes B1.02 Dr Katsuyuki Shibayama Email [email protected]
LECTURE 15: AMERICAN OPTIONS
LECTURE 15: AMERICAN OPTIONS 1. Introduction All of the options that we have considered thus far have been of the European variety: exercise is permitted only at the termination of the contract. These
ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE
ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE YUAN TIAN This synopsis is designed merely for keep a record of the materials covered in lectures. Please refer to your own lecture notes for all proofs.
BINOMIAL OPTIONS PRICING MODEL. Mark Ioffe. Abstract
BINOMIAL OPTIONS PRICING MODEL Mark Ioffe Abstract Binomial option pricing model is a widespread numerical method of calculating price of American options. In terms of applied mathematics this is simple
The Trip Scheduling Problem
The Trip Scheduling Problem Claudia Archetti Department of Quantitative Methods, University of Brescia Contrada Santa Chiara 50, 25122 Brescia, Italy Martin Savelsbergh School of Industrial and Systems
Linear Programming I
Linear Programming I November 30, 2003 1 Introduction In the VCR/guns/nuclear bombs/napkins/star wars/professors/butter/mice problem, the benevolent dictator, Bigus Piguinus, of south Antarctica penguins
THE FUNDAMENTAL THEOREM OF ARBITRAGE PRICING
THE FUNDAMENTAL THEOREM OF ARBITRAGE PRICING 1. Introduction The Black-Scholes theory, which is the main subject of this course and its sequel, is based on the Efficient Market Hypothesis, that arbitrages
On Correlating Performance Metrics
On Correlating Performance Metrics Yiping Ding and Chris Thornley BMC Software, Inc. Kenneth Newman BMC Software, Inc. University of Massachusetts, Boston Performance metrics and their measurements are
Practical Guide to the Simplex Method of Linear Programming
Practical Guide to the Simplex Method of Linear Programming Marcel Oliver Revised: April, 0 The basic steps of the simplex algorithm Step : Write the linear programming problem in standard form Linear
Moreover, under the risk neutral measure, it must be the case that (5) r t = µ t.
LECTURE 7: BLACK SCHOLES THEORY 1. Introduction: The Black Scholes Model In 1973 Fisher Black and Myron Scholes ushered in the modern era of derivative securities with a seminal paper 1 on the pricing
Does Black-Scholes framework for Option Pricing use Constant Volatilities and Interest Rates? New Solution for a New Problem
Does Black-Scholes framework for Option Pricing use Constant Volatilities and Interest Rates? New Solution for a New Problem Gagan Deep Singh Assistant Vice President Genpact Smart Decision Services Financial
LECTURE 9: A MODEL FOR FOREIGN EXCHANGE
LECTURE 9: A MODEL FOR FOREIGN EXCHANGE 1. Foreign Exchange Contracts There was a time, not so long ago, when a U. S. dollar would buy you precisely.4 British pounds sterling 1, and a British pound sterling
Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets
Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets Nathaniel Hendren January, 2014 Abstract Both Akerlof (1970) and Rothschild and Stiglitz (1976) show that
An Introduction to Exotic Options
An Introduction to Exotic Options Jeff Casey Jeff Casey is entering his final semester of undergraduate studies at Ball State University. He is majoring in Financial Mathematics and has been a math tutor
The Behavior of Bonds and Interest Rates. An Impossible Bond Pricing Model. 780 w Interest Rate Models
780 w Interest Rate Models The Behavior of Bonds and Interest Rates Before discussing how a bond market-maker would delta-hedge, we first need to specify how bonds behave. Suppose we try to model a zero-coupon
OPTIMAL DESIGN OF A MULTITIER REWARD SCHEME. Amir Gandomi *, Saeed Zolfaghari **
OPTIMAL DESIGN OF A MULTITIER REWARD SCHEME Amir Gandomi *, Saeed Zolfaghari ** Department of Mechanical and Industrial Engineering, Ryerson University, Toronto, Ontario * Tel.: + 46 979 5000x7702, Email:
Markov Decision Processes for Ad Network Optimization
Markov Decision Processes for Ad Network Optimization Flávio Sales Truzzi 1, Valdinei Freire da Silva 2, Anna Helena Reali Costa 1, Fabio Gagliardi Cozman 3 1 Laboratório de Técnicas Inteligentes (LTI)
4: SINGLE-PERIOD MARKET MODELS
4: SINGLE-PERIOD MARKET MODELS Ben Goldys and Marek Rutkowski School of Mathematics and Statistics University of Sydney Semester 2, 2015 B. Goldys and M. Rutkowski (USydney) Slides 4: Single-Period Market
Monte Carlo Simulation
Monte Carlo Simulation Palisade User Conference 2007 Real Options Analysis Dr. Colin Beardsley, Managing Director Risk-Return Solutions Pty Limited www.riskreturnsolutions.com The Plot Today We will review
An Introduction to Markov Decision Processes. MDP Tutorial - 1
An Introduction to Markov Decision Processes Bob Givan Purdue University Ron Parr Duke University MDP Tutorial - 1 Outline Markov Decision Processes defined (Bob) Objective functions Policies Finding Optimal
Cost Model: Work, Span and Parallelism. 1 The RAM model for sequential computation:
CSE341T 08/31/2015 Lecture 3 Cost Model: Work, Span and Parallelism In this lecture, we will look at how one analyze a parallel program written using Cilk Plus. When we analyze the cost of an algorithm
MODELING CUSTOMER RELATIONSHIPS AS MARKOV CHAINS. Journal of Interactive Marketing, 14(2), Spring 2000, 43-55
MODELING CUSTOMER RELATIONSHIPS AS MARKOV CHAINS Phillip E. Pfeifer and Robert L. Carraway Darden School of Business 100 Darden Boulevard Charlottesville, VA 22903 Journal of Interactive Marketing, 14(2),
The Method of Least Squares
The Method of Least Squares Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract The Method of Least Squares is a procedure to determine the best fit line to data; the
Notes V General Equilibrium: Positive Theory. 1 Walrasian Equilibrium and Excess Demand
Notes V General Equilibrium: Positive Theory In this lecture we go on considering a general equilibrium model of a private ownership economy. In contrast to the Notes IV, we focus on positive issues such
Monte Carlo Estimation of Project Volatility for Real Options Analysis
Monte Carlo Estimation of Project Volatility for Real Options Analysis Pedro Manuel Cortesão Godinho Grupo de Estudos Monetários e Financeiros (GEMF) ABSTRACT Volatility is a fundamental parameter for
Nonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: [email protected]
Reinforcement Learning
Reinforcement Learning LU 2 - Markov Decision Problems and Dynamic Programming Dr. Martin Lauer AG Maschinelles Lernen und Natürlichsprachliche Systeme Albert-Ludwigs-Universität Freiburg [email protected]
Linear Programming Notes V Problem Transformations
Linear Programming Notes V Problem Transformations 1 Introduction Any linear programming problem can be rewritten in either of two standard forms. In the first form, the objective is to maximize, the material
Optimization under uncertainty: modeling and solution methods
Optimization under uncertainty: modeling and solution methods Paolo Brandimarte Dipartimento di Scienze Matematiche Politecnico di Torino e-mail: [email protected] URL: http://staff.polito.it/paolo.brandimarte
Chapter 21: The Discounted Utility Model
Chapter 21: The Discounted Utility Model 21.1: Introduction This is an important chapter in that it introduces, and explores the implications of, an empirically relevant utility function representing intertemporal
The Black-Scholes Formula
FIN-40008 FINANCIAL INSTRUMENTS SPRING 2008 The Black-Scholes Formula These notes examine the Black-Scholes formula for European options. The Black-Scholes formula are complex as they are based on the
Review of Basic Options Concepts and Terminology
Review of Basic Options Concepts and Terminology March 24, 2005 1 Introduction The purchase of an options contract gives the buyer the right to buy call options contract or sell put options contract some
On the Existence of a Unique Optimal Threshold Value for the Early Exercise of Call Options
On the Existence of a Unique Optimal Threshold Value for the Early Exercise of Call Options Patrick Jaillet Ehud I. Ronn Stathis Tompaidis July 2003 Abstract In the case of early exercise of an American-style
Stephane Crepey. Financial Modeling. A Backward Stochastic Differential Equations Perspective. 4y Springer
Stephane Crepey Financial Modeling A Backward Stochastic Differential Equations Perspective 4y Springer Part I An Introductory Course in Stochastic Processes 1 Some Classes of Discrete-Time Stochastic
Optimization applications in finance, securities, banking and insurance
IBM Software IBM ILOG Optimization and Analytical Decision Support Solutions White Paper Optimization applications in finance, securities, banking and insurance 2 Optimization applications in finance,
The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy
BMI Paper The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy Faculty of Sciences VU University Amsterdam De Boelelaan 1081 1081 HV Amsterdam Netherlands Author: R.D.R.
Sharing Online Advertising Revenue with Consumers
Sharing Online Advertising Revenue with Consumers Yiling Chen 2,, Arpita Ghosh 1, Preston McAfee 1, and David Pennock 1 1 Yahoo! Research. Email: arpita, mcafee, [email protected] 2 Harvard University.
Pricing complex options using a simple Monte Carlo Simulation
A subsidiary of Sumitomo Mitsui Banking Corporation Pricing complex options using a simple Monte Carlo Simulation Peter Fink Among the different numerical procedures for valuing options, the Monte Carlo
Analysis of a Production/Inventory System with Multiple Retailers
Analysis of a Production/Inventory System with Multiple Retailers Ann M. Noblesse 1, Robert N. Boute 1,2, Marc R. Lambrecht 1, Benny Van Houdt 3 1 Research Center for Operations Management, University
Stochastic Inventory Control
Chapter 3 Stochastic Inventory Control 1 In this chapter, we consider in much greater details certain dynamic inventory control problems of the type already encountered in section 1.3. In addition to the
QUANTIZED INTEREST RATE AT THE MONEY FOR AMERICAN OPTIONS
QUANTIZED INTEREST RATE AT THE MONEY FOR AMERICAN OPTIONS L. M. Dieng ( Department of Physics, CUNY/BCC, New York, New York) Abstract: In this work, we expand the idea of Samuelson[3] and Shepp[,5,6] for
Chapter 7. Sealed-bid Auctions
Chapter 7 Sealed-bid Auctions An auction is a procedure used for selling and buying items by offering them up for bid. Auctions are often used to sell objects that have a variable price (for example oil)
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering Department of Industrial Engineering and Management Sciences Northwestern University September 15th, 2014
Coordinated Pricing and Inventory in A System with Minimum and Maximum Production Constraints
The 7th International Symposium on Operations Research and Its Applications (ISORA 08) Lijiang, China, October 31 Novemver 3, 2008 Copyright 2008 ORSC & APORC, pp. 160 165 Coordinated Pricing and Inventory
Mathematical Finance
Mathematical Finance Option Pricing under the Risk-Neutral Measure Cory Barnes Department of Mathematics University of Washington June 11, 2013 Outline 1 Probability Background 2 Black Scholes for European
Moral Hazard. Itay Goldstein. Wharton School, University of Pennsylvania
Moral Hazard Itay Goldstein Wharton School, University of Pennsylvania 1 Principal-Agent Problem Basic problem in corporate finance: separation of ownership and control: o The owners of the firm are typically
How To Calculate Bond Price And Yield To Maturity
CHAPTER 10 Bond Prices and Yields Interest rates go up and bond prices go down. But which bonds go up the most and which go up the least? Interest rates go down and bond prices go up. But which bonds go
Real Estate Investments with Stochastic Cash Flows
Real Estate Investments with Stochastic Cash Flows Riaz Hussain Kania School of Management University of Scranton Scranton, PA 18510 [email protected] 570-941-7497 April 2006 JEL classification: G12
Example 1. Consider the following two portfolios: 2. Buy one c(s(t), 20, τ, r) and sell one c(s(t), 10, τ, r).
Chapter 4 Put-Call Parity 1 Bull and Bear Financial analysts use words such as bull and bear to describe the trend in stock markets. Generally speaking, a bull market is characterized by rising prices.
Factoring & Primality
Factoring & Primality Lecturer: Dimitris Papadopoulos In this lecture we will discuss the problem of integer factorization and primality testing, two problems that have been the focus of a great amount
On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information
Finance 400 A. Penati - G. Pennacchi Notes on On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information by Sanford Grossman This model shows how the heterogeneous information
Lecture Notes: Basic Concepts in Option Pricing - The Black and Scholes Model
Brunel University Msc., EC5504, Financial Engineering Prof Menelaos Karanasos Lecture Notes: Basic Concepts in Option Pricing - The Black and Scholes Model Recall that the price of an option is equal to
BROWNIAN MOTION DEVELOPMENT FOR MONTE CARLO METHOD APPLIED ON EUROPEAN STYLE OPTION PRICE FORECASTING
International journal of economics & law Vol. 1 (2011), No. 1 (1-170) BROWNIAN MOTION DEVELOPMENT FOR MONTE CARLO METHOD APPLIED ON EUROPEAN STYLE OPTION PRICE FORECASTING Petar Koĉović, Fakultet za obrazovanje
EXIT TIME PROBLEMS AND ESCAPE FROM A POTENTIAL WELL
EXIT TIME PROBLEMS AND ESCAPE FROM A POTENTIAL WELL Exit Time problems and Escape from a Potential Well Escape From a Potential Well There are many systems in physics, chemistry and biology that exist
Black-Scholes-Merton approach merits and shortcomings
Black-Scholes-Merton approach merits and shortcomings Emilia Matei 1005056 EC372 Term Paper. Topic 3 1. Introduction The Black-Scholes and Merton method of modelling derivatives prices was first introduced
Approximate Solutions to Factored Markov Decision Processes via Greedy Search in the Space of Finite State Controllers
pproximate Solutions to Factored Markov Decision Processes via Greedy Search in the Space of Finite State Controllers Kee-Eung Kim, Thomas L. Dean Department of Computer Science Brown University Providence,
A Simple Model of Price Dispersion *
Federal Reserve Bank of Dallas Globalization and Monetary Policy Institute Working Paper No. 112 http://www.dallasfed.org/assets/documents/institute/wpapers/2012/0112.pdf A Simple Model of Price Dispersion
A Confidence Interval Triggering Method for Stock Trading Via Feedback Control
A Confidence Interval Triggering Method for Stock Trading Via Feedback Control S. Iwarere and B. Ross Barmish Abstract This paper builds upon the robust control paradigm for stock trading established in
MODELING CUSTOMER RELATIONSHIPS AS MARKOV CHAINS
AS MARKOV CHAINS Phillip E. Pfeifer Robert L. Carraway f PHILLIP E. PFEIFER AND ROBERT L. CARRAWAY are with the Darden School of Business, Charlottesville, Virginia. INTRODUCTION The lifetime value of
PRICING OF GAS SWING OPTIONS. Andrea Pokorná. UNIVERZITA KARLOVA V PRAZE Fakulta sociálních věd Institut ekonomických studií
9 PRICING OF GAS SWING OPTIONS Andrea Pokorná UNIVERZITA KARLOVA V PRAZE Fakulta sociálních věd Institut ekonomických studií 1 Introduction Contracts for the purchase and sale of natural gas providing
Neuro-Dynamic Programming An Overview
1 Neuro-Dynamic Programming An Overview Dimitri Bertsekas Dept. of Electrical Engineering and Computer Science M.I.T. September 2006 2 BELLMAN AND THE DUAL CURSES Dynamic Programming (DP) is very broadly
Reinforcement Learning
Reinforcement Learning LU 2 - Markov Decision Problems and Dynamic Programming Dr. Joschka Bödecker AG Maschinelles Lernen und Natürlichsprachliche Systeme Albert-Ludwigs-Universität Freiburg [email protected]
INTRODUCTION TO OPTIONS MARKETS QUESTIONS
INTRODUCTION TO OPTIONS MARKETS QUESTIONS 1. What is the difference between a put option and a call option? 2. What is the difference between an American option and a European option? 3. Why does an option
Predictive Representations of State
Predictive Representations of State Michael L. Littman Richard S. Sutton AT&T Labs Research, Florham Park, New Jersey {sutton,mlittman}@research.att.com Satinder Singh Syntek Capital, New York, New York
Choice under Uncertainty
Choice under Uncertainty Part 1: Expected Utility Function, Attitudes towards Risk, Demand for Insurance Slide 1 Choice under Uncertainty We ll analyze the underlying assumptions of expected utility theory
Eligibility Traces. Suggested reading: Contents: Chapter 7 in R. S. Sutton, A. G. Barto: Reinforcement Learning: An Introduction MIT Press, 1998.
Eligibility Traces 0 Eligibility Traces Suggested reading: Chapter 7 in R. S. Sutton, A. G. Barto: Reinforcement Learning: An Introduction MIT Press, 1998. Eligibility Traces Eligibility Traces 1 Contents:
CS 522 Computational Tools and Methods in Finance Robert Jarrow Lecture 1: Equity Options
CS 5 Computational Tools and Methods in Finance Robert Jarrow Lecture 1: Equity Options 1. Definitions Equity. The common stock of a corporation. Traded on organized exchanges (NYSE, AMEX, NASDAQ). A common
An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment
An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment Hideki Asoh 1, Masanori Shiro 1 Shotaro Akaho 1, Toshihiro Kamishima 1, Koiti Hasida 1, Eiji Aramaki 2, and Takahide
Online and Offline Selling in Limit Order Markets
Online and Offline Selling in Limit Order Markets Kevin L. Chang 1 and Aaron Johnson 2 1 Yahoo Inc. [email protected] 2 Yale University [email protected] Abstract. Completely automated electronic
A Programme Implementation of Several Inventory Control Algorithms
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume, No Sofia 20 A Programme Implementation of Several Inventory Control Algorithms Vladimir Monov, Tasho Tashev Institute of Information
Computational Finance Options
1 Options 1 1 Options Computational Finance Options An option gives the holder of the option the right, but not the obligation to do something. Conversely, if you sell an option, you may be obliged to
1 if 1 x 0 1 if 0 x 1
Chapter 3 Continuity In this chapter we begin by defining the fundamental notion of continuity for real valued functions of a single real variable. When trying to decide whether a given function is or
Uncertainty and complexity are business as usual in
OR PRACTICE OPTIONS IN THE REAL WORLD: LESSONS LEARNED IN EVALUATING OIL AND GAS INVESTMENTS JAMES E. SMITH and KEVIN F. MCCARDLE Duke University, Durham, North Carolina (Received August 1996; revisions
Retirement Financial Planning: A State/Preference Approach. William F. Sharpe 1 February, 2006
Retirement Financial Planning: A State/Preference Approach William F. Sharpe 1 February, 2006 Introduction This paper provides a framework for analyzing a prototypical set of decisions concerning spending,
A Confidence Interval Triggering Method for Stock Trading Via Feedback Control
A Confidence Interval Triggering Method for Stock Trading Via Feedback Control S. Iwarere and B. Ross Barmish Abstract This paper builds upon the robust control paradigm for stock trading established in
Proposal 1: Model-Based Control Method for Discrete-Parts machining processes
Proposal 1: Model-Based Control Method for Discrete-Parts machining processes Proposed Objective: The proposed objective is to apply and extend the techniques from continuousprocessing industries to create
ALGORITHMIC TRADING WITH MARKOV CHAINS
June 16, 2010 ALGORITHMIC TRADING WITH MARKOV CHAINS HENRIK HULT AND JONAS KIESSLING Abstract. An order book consists of a list of all buy and sell offers, represented by price and quantity, available
