McMaster University. Advanced Optimization Laboratory. Advanced Optimization Laboratory. Title: Title:



Similar documents
Part I. Gambling and Information Theory. Information Theory and Networks. Section 1. Horse Racing. Lecture 16: Gambling and Information Theory

How to Win at the Track

Step 1. Step 2. Pick Your Horse. Decide How Much to Bet.

Lock! Risk-Free Arbitrage in the Japanese Racetrack Betting. Market

Market efficiency in greyhound racing: empirical evidence of absence of favorite-longshot bias

A New Interpretation of Information Rate

The Kelly criterion for spread bets

BETTING MARKET EFFICIENCY AT PREMIERE RACETRACKS

Racetrack Betting and Consensus of Subjective Probabilities

Direct test of Harville's multi-entry competitions model on race-track betting data

Gambling with Information Theory

SOME ASPECTS OF GAMBLING WITH THE KELLY CRITERION. School of Mathematical Sciences. Monash University, Clayton, Victoria, Australia 3168

Breakdown of the Handle

Betting with the Kelly Criterion

BETTING REMEMBER MADE EASY 3 TIPS TO. WIN BETS on the FAVOURITE pay 32% of the time. PLACE BETS on the FAVOURITE pay 53% of the time

Chapter 7: Proportional Play and the Kelly Betting System

EFFICIENCY OF RACETRACK BETTING MARKETS Edition. Efficiency of Racetrack Betting Markets Downloaded from

Lecture 25: Money Management Steven Skiena. skiena

Enriching Tradition Through Technology

Choice Under Uncertainty

Louisiana State Racing Commission

The Impact of Publicly Available Information on Betting Markets: Implications for Bettors, Betting Operators and Regulators

Understanding pricing anomalies in prediction and betting markets with informed traders

Applying the Kelly criterion to lawsuits

פרויקט מסכם לתואר בוגר במדעים )B.Sc( במתמטיקה שימושית

An Economic Analysis of Pari-mutuel Race Competitiveness

Discrete Optimization

A Simple Parrondo Paradox. Michael Stutzer, Professor of Finance. 419 UCB, University of Colorado, Boulder, CO

H&SS Senior Honors Research Thesis Paper. Probabilistic Analysis for Economic Models Applying Game Theory to Horse Racing

Research on information propagation analyzing odds in horse racing

Risk, Return, and Gambling Market Efficiency. William H. Dare Oklahoma State University September 5, 2006

Solving convex MINLP problems with AIMMS

National Sun Yat-Sen University CSE Course: Information Theory. Gambling And Entropy

Good and bad properties of the Kelly criterion

Chance and Uncertainty: Probability Theory

Testing Market Efficiency in a Fixed Odds Betting Market

"TWO DOLLARS TO SHOW ON NUMBER

Gambling and Data Compression

Late Money and Betting Market Efficiency: Evidence from Australia

Efficiency of football betting markets: the economic significance of trading strategies

BEGINNER S GUIDE TO BETTING THE RACES

Volume 30, Issue 4. Market Efficiency and the NHL totals betting market: Is there an under bias?

Picking Winners is For Losers: A Strategy for Optimizing Investment Outcomes

How Efficient is the European Football Betting Market? Evidence from Arbitrage and Trading Strategies

What are the Odds You'll Bet on A Race? Determinants of Wagering Demand at a Thoroughbred Racetrack

Frandsen Publishing Presents Favorite ALL-Ways TM Newsletter Articles. Hedging Your Bets

Annotated Additional Bibliography

The Very Best Way We Know to Play the Daily Double

Determinants of betting market efficiency

January 2012, Number 64 ALL-WAYS TM NEWSLETTER

On Adaboost and Optimal Betting Strategies

May 20, Re: Request for Comments on Pari-Mutuel Gambling Winnings in REG

Authorized By: New Jersey Racing Commission, Frank Zanzuccki, Executive Director

Taking Handle into Account: An Economic Analysis of Account Betting

Wagering. Welcome to the Coeur d Alene Casino! D Alene Casino Resort Hotel

Winning is just the beginning...

How To Bet On An Nfl Football Game With A Machine Learning Program

Algorithms for optimal allocation of bets on many simultaneous events

The Mathematics of Gambling

KNOWLEDGE IS POWER The BEST first-timers guide to betting on and winning at the races you ll EVER encounter

A Note on Proebsting s Paradox

MARKET EFFICIENCY IN FINNISH HARNESS HORSE RACING*


Fixed odds bookmaking with stochastic betting demands

THE FAVOURITE-LONGSHOT BIAS AND MARKET EFFICIENCY IN UK FOOTBALL BETTING

ARCI Simulcast Wagering

Determinants of Simulcast Wagering: The Demand for Harness and Thoroughbred Horse Races

CIRCUS CIRCUS LAS VEGAS RACE AND SPORTS BOOK HOUSE RULES

arxiv: v1 [math.pr] 5 Dec 2011

Gambling and Portfolio Selection using Information theory

Predicting the World Cup. Dr Christopher Watts Centre for Research in Social Simulation University of Surrey

Review Horse Race Gambling and Side Information Dependent horse races and the entropy rate. Gambling. Besma Smida. ES250: Lecture 9.

New Jersey Racing Commission, Frank Zanzuccki, Executive Director. See Summary below for explanation of exception to calendar requirement.

HORSE RACING TAX. RATES Pari-mutuel daily wagering (total rates): Amount Wagered Daily. Next 100, Next 100, Over 400,000 4.

The Fibonacci Strategy Revisited: Can You Really Make Money by Betting on Soccer Draws?

Do Bookmakers Predict Outcomes Better than Betters?

ALIANTE RACE AND SPORTS BOOK HOUSE RULES

Optimal f :Calculating the expected growth-optimal fraction for discretely-distributed outcomes

Decision Theory Rational prospecting

Financial Market Microstructure Theory

Betting on Excel to enliven the teaching of probability

Using SVM Regression to Predict Harness Races: A One Year Study of Northfield Park

Betting Terms Explained

ARCI Calculation Of Payouts And Distribution Of Pools. Part I

Woodbine Entertainment Group. International Simulcast Racing. Racing Symposium Tucson, AZ

Lecture 3: Linear methods for classification

Soccer Analytics. Predicting the outcome of soccer matches. Research Paper Business Analytics. December Nivard van Wijk

The Economics of Gamblin; and National Lotteries

Introduction to Logistic Regression

You Are What You Bet: Eliciting Risk Attitudes from Horse Races

Calculated Bets: Computers, Gambling, and Mathematical Modeling to Win Steven Skiena

But if you want your pulse to race, try putting down a wager on your favorite filly. This little book is designed to help you improve your odds.

A Programme Implementation of Several Inventory Control Algorithms

The Very Best Way We Know to Play the Pick 3

Enhancing the Teaching of Statistics: Portfolio Theory, an Application of Statistics in Finance

Transcription:

McMaster University Advanced Optimization Laboratory Advanced Optimization Laboratory Title: Title: A computational framework for determining square-maximal Chance constrained optimization strings for parimutuel horse race betting Authors: Authors: Antoine Deza, Frantisek Franek, and Mei Jiang Antoine Deza, Kai Huang, and Michael R. Metel AdvOL-Report No. 2011/5 December 2011, Hamilton, Ontario, Canada AdvOL-Report No. 2015/2 March 2015, Hamilton, Ontario, Canada

Chance constrained optimization for parimutuel horse race betting Antoine Deza 1, Kai Huang 2, and Michael R. Metel 3 1 Advanced Optimization Laboratory, Department of Computing and Software, McMaster University, Hamilton, Ontario, Canada, deza@ mcmaster. ca 2 DeGroote School of Business, McMaster University, Hamilton, Ontario, Canada, khuang@ mcmaster. ca 3 School of Computational Science and Engineering, McMaster University, Hamilton, Ontario, Canada, metelm@ mcmaster. ca March 23, 2015 Abstract We consider the time horizon of a gambler in the optimization of horse race betting through the use of chance constrained programming. The optimization problem is formulated as a mixed integer nonlinear program for which global optimal solutions are found using optimization tools. A novel approach to estimating superfecta payouts is presented using maximum likelihood estimation. A computational substantiation with historical race data found an increase in return of over 10% using the chance constrained model. 1 Introduction Beginning in the mid 1980 s, horse racing has witnessed the rise of betting syndicates akin to hedge funds profiting from statistical techniques similar to high frequency traders on stock exchanges [12]. This is possible as parimutuel wagering is employed at racetracks, where money is pooled for each bet type, the racetrack takes a percentage, and the remainder is disbursed to the winners in proportion to the amount wagered. Optimization in the horse racing literature can be traced back to Isaacs deriving a closed form solution for the optimal win bets when maximizing expected profit in 1953 [9]. Hausch et al. [8] utilized an optimization framework to show inefficiencies in the place and show betting pools using win bet odds to estimate race outcomes. In particular, they used the Kelly criterion [13], maximizing the expected log utility of wealth and found profitability when limiting the betting to when the expected return was greater than a fixed percentage. More recently, Smoczynski and Tomkins derived a simple procedure for the optimal win bets under the Kelly criterion using the KKT conditions [16]. Although the Kelly criterion maximizes the asymptotic rate of asset growth, the 1

2 volatility of wealth through time is too large for most, resulting in many professional investors employing a fractional Kelly criterion [17], which has been shown to possess favourable risk-return properties by MacLean et al. [15]. We investigate a further manner of risk management in the form of a chance constraint, taking into account the time horizon of the bettor, which can be employed in conjunction with the Kelly criterion. There are several different types of wagers one can place on horses, but in order to best display the effect of the chance constraint, we concentrate on the riskiest of bets on a single race, the superfecta, which requires the bettor to pick, in order, the first 4 finishers. 2 Optimization Model 2.1 Time Horizon To motivate the discussion, we examine the 4 horse outcome probabilities of race 5 on March 20, 2014 at Flamboro Downs, Hamilton, Ontario, Canada. Information about the race dataset and how these probabilities are estimated can be found in Section 3. Let S represent the set of top 4 horse finishes with each s S corresponding to a sequence of 4 horses. If we bet on this race an infinite number of times, then the average number of races before a superfecta bet on outcome s pays off would be 1 π s, where π s is the outcome s probability. Summary statistics for the average wait time is in the following table. Statistic Races min 141 max 566, 225 median 13, 600 mean 38, 192 Table 1: Average wait time statistics The median wait time for a superfecta bet to payoff is then over 11 seasons with roughly 1,200 races per season. Assuming the horseplayer requires some form of regular income or desires to at least turn a profit every season, consideration of the likelihood of receiving a payoff is warranted. In particular, we can limit betting strategies to those which pay out with high probability over a number of races equal to the desired time horizon, τ. Let x = {x s } be our decision variables dictating how much to wager on each outcome s. For a betting decision ˆx, let Bˆx binomial(τ, πˆx ), where πˆx is the probability of a payout. In order to enforce the gambler s time horizon, we require that P(Bˆx 1) 1 α, where α is our error tolerance, which is chosen arbitrarily small. Rearranging, we require πˆx 1 α 1 τ. Assuming independence between races, limiting

3 betting decisions to having a payout probability of at least 1 α 1 τ will occur with probability at least 1 α over τ races. ensures that a payout 2.2 Optimization Program The objective is to maximize the exponential rate of return. Let P ρ (x) be the random payout given our decision vector x. The payout uncertainty stems from the result of the race, ρ, with S as its sample space. Let w be the current wealth of the gambler. Incorporating the gambler s time horizon through the use of a chance constraint, the optimization problem is below. max E log(p ρ (x) + w s.t. x s w P(P ρ (x) > 0 x s 0 x s ) x s > 0) 1 α 1 τ The chance constraint is conditional on there being favourable bets to be placed, as we do not want to decrease our expected utility below log(w) to satisfy it. We assume that the frequency with which we are forced to abstain from gambling is sufficiently small so as not to significantly alter our effective time horizon. 3 Computational Substantiation The optimization model was tested using historical race data from the 2013-2014 season at Flamboro Downs. This amounted to a total of 1,168 races. Race results, including the payouts, pool sizes, and final win bet odds were collected from TrackIT [3]. Handicapping data, generated by CompuBet [4], was collected from HorsePlayer Interactive [6]. The first 70% of the race dataset was used to calibrate the race outcome probabilities and payout models, with the remaining 30% of races used for out of sample testing. 3.1 Estimating Outcome Probabilities and Payouts The multinomial logistic model, first proposed by Bolton and Chapman [1], was used to estimate win probabilities. Given a vector of handicapping data on each horse h, v h, the horses are given a value V h = β T v h, and assigned winning probabilities π h = ev h. n i=1 ev i A three factor model was used, including the log of the public s implied win probabilities from the win bet odds, log π p h, and the log of two CompuBet factors, which were all found to be statistically significant. The analysis was performed using the mlogit package [5] in R. The discount model, derived by Lo and Bacon-Shone [14], was used

4 π λ 1 j π to estimate the order probabilities, π ijkl = π λ 2 k i s i πλ 1 s λ i s were determined using multinomial logistic regression. s i,j πλ 2 s π λ 3 l s i,j,k πλ 3 s, where optimal Let Q and Q s be the superfecta pool size, and the total amount wagered on sequence s. The only information available to bettors is the value of Q. The approach taken to estimate Q s is motivated by the work of Kanto and Renqvist [11] who fit the win probabilities of the Harville model [7] to the money wagered on Quinella bets using multinomial maximum likelihood estimation. The amount wagered on sequence s is Q s = Q(1 t) P s, where t = 24.7% is the track take at Flamboro Downs and P s is the $1 payout. The minimum superfecta bet allowed in practice is $0.2 with $0.2 increments, so let n = 5Q s be the number of bets placed on s out of N = 5Q, which we assume follows a binomial distribution. We model the public s estimate of outcome probabilities using the discount model with their implied win probabilities, so for s = {i, j, k, l}, πs p = (πp i )θ 1 (π p h )θ 1 (π p j )θ 2 h i (πp h )θ 2 (pi p k )θ 3 h i,j (πp h )θ 3 (π p l )θ 4 = (π h i,j,k (πp h )θ 4. The likelihood function, using (πs p ) l data from R historical races assumed to be independent, with w r being the winning sequence in race r, is L(θ) Π R r=1(πw p r ) nr (1 πw p r ) Nr nr. The negative log-likelihood is a difference of convex functions, log L(θ) R r=1 N r log((π p w r ) l ) (n r log((π p w r ) u )+(N r n r ) log((π p w r ) l (π p w r ) u )). This function was minimized twice using fminunc in Matlab, the first with an initial guess that the public uses the Harville model, θ i = 1, the second assuming that the public believes superfecta outcomes are purely random, θ i = 0, with both resulting in the same solution. The payout function is P s (x) = x s (Q+ u S xu)(1 t) Q s+x s, where we take Q s = π p sq, the expected amount wagered on s. p s ) u 3.2 Optimization Program Formulation Our optimization program now has the following form. When testing the model we round down the optimal solution to the nearest 0.2 to avoid overbetting. The z s variables are used to indicate when x s 0.2, implying P s,ξs (x) > 0 and z nullifies the chance constraint when x s = 0. max (Q + u S π s log(x x u)(1 t) s + w x u ) Q s + x s u S s.t. x s w z π s z s (1 α 1 τ ) z ( ) zs Qs + 0.2 Q s + x s Q s z, z s {0, 1} x s 0 Q s We use the 1 to 1 mapping proposed by Kallberg and Ziemba [10], y s = log(x s + Q s ),

5 which results in the following program whose linear relaxation is convex. max π s log(q + w (t + (1 t)q s e ys ) u s.t. e ys w z + Q π s z s (1 α 1 τ ) z ( ) Qs + 0.2 z s ln y s log Q s Q s z, z s {0, 1} y s log(q s ) e yu ) 3.3 Results The model was tested on a total of 350 races. Given our optimal betting solution, the realized payout was calculated by adjusting the published payout to account for our wagers and breakage. The gambler s wealth over the course of the races was calculated using the optimization program with and without the chance constraint, Opt + and Opt respectively. Initial wealth was set to $1000, with the time horizon set to τ = 350 and α = 0.01. All testing was conducted on a Windows 7 Home Premium 64-bit, Intel Core i5-2320 3GHz processor with 8 GB of RAM. The implementation was done in Matlab R2012a with the OPTI toolbox, using the IPOPT[18] and Bonmin[2] solvers. 1,600 Opt + Opt Wealth 1,400 1,200 1,000 0 100 200 300 Race Figure 1: Wealth over the course of 350 races at Flamboro Downs. The result in Figure 1 is intuitive, as Opt + attempts to mimic Opt, while generally having to take on extra bets to satisfy the chance constraint. This extra cost results in a lower wealth until one of these extra wagers does in fact payout, which occured at

6 approximately race 280, resulting in a superior return of 28.8% compared to 17.8% for Opt. 4 Conclusion We presented a chance constrained optimization model for parimutuel horse race betting, as well as a method for estimating superfecta bet payouts. Profitability was achieved when employing the Kelly criterion, with a superior return when taking into consideration the gambler s time horizon. References [1] R. N. Bolton and R. G. Chapman. Searching for positive returns at the track: A multinomial logic model for handicapping horse races. Management Science, 32(8):1040 1060, 1986. [2] P. Bonami, L. T. Biegler, A. R. Conn, G. Cornuéjols, I. E. Grossmann, C. D. Laird, J. Lee, A. Lodi, F. Margot, N. Sawaya, and A. Wächter. An algorithmic framework for convex mixed integer nonlinear programs. Discrete Optimization, 5(2):186 204, 2008. [3] Standardbred Canada. TrackIT. https://trackit.standardbredcanada.ca. Accessed: 2014-06-02. [4] CompuBet. Compubet. https://compubet.com. Accessed: 2014-06-02. [5] Y. Croissant. Estimation of multinomial logit models in R: The mlogit packages. R package version 0.2.4, 2012. [6] Woodbine Entertainment Group. Horseplayer Interactive. http://www. horseplayerinteractive.com. Accessed: 2014-06-02. [7] D. A. Harville. Assigning probabilities to the outcomes of multi-entry competitions. Journal of the American Statistical Association, 68(342):312 316, 1973. [8] D. B. Hausch, W. T. Ziemba, and M. Rubinstein. Efficiency of the market for racetrack betting. Management Science, 27(12):1435 1452, 1981. [9] R. Isaacs. Optimal horse race bets. American Mathematical Monthly, 60(5):310 315, 1953. [10] J. G. Kallberg and W. T. Ziemba. Concavity properties of racetrack betting models. In D. B. Hausch, V. S. Y. Lo, and W. T. Ziemba, editors, Efficiency of Racetrack Betting Markets, pages 99 107. World Scientific, 2008.

7 [11] A. Kanto and G. Rosenqvist. On the efficiency of the market for double (quinella) bets at a Finnish racetrack. In D. B. Hausch, V. S. Y. Lo, and W. T. Ziemba, editors, Efficiency of Racetrack Betting Markets, pages 485 498. World Scientific, 2008. [12] M. Kaplan. The high tech trifecta. Wired Magazine, 10:10 13, 2002. [13] J. L. Kelly. A new interpretation of information rate. Information Theory, IRE Transactions on, 2(3):185 189, 1956. [14] V. S. Y. Lo and J. Bacon-Shone. Approximating the ordering probabilities of multi-entry competitions by a simple method. Handbook of Sports and Lottery Markets, pages 51 65, 2008. [15] L. C. MacLean, W. T. Ziemba, and G. Blazenko. Growth versus security in dynamic investment analysis. Management Science, 38(11):1562 1585, 1992. [16] P. Smoczynski and D. Tomkins. An explicit solution to the problem of optimizing the allocations of a bettor s wealth when wagering on horse races. Mathematical Scientist, 35(1):10 17, 2010. [17] E. Thorp. Understanding the Kelly criterion. Wilmott Magazine (May 2008). [18] A. Wächter and L. T. Biegler. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Mathematical Programming, 106(1):25 57, 2006.