Efficiency of sports betting markets

Similar documents

Analyzing Information Efficiency in the Betting Market for Association Football League Winners

A Test for Inherent Characteristic Bias in Betting Markets ABSTRACT. Keywords: Betting, Market, NFL, Efficiency, Bias, Home, Underdog

The NCAA Basketball Betting Market: Tests of the Balanced Book and Levitt Hypotheses

Point Spreads. Sports Data Mining. Changneng Xu. Topic : Point Spreads

THE DETERMINANTS OF SCORING IN NFL GAMES AND BEATING THE SPREAD

POINT SPREAD SHADING AND BEHAVIORAL BIASES IN NBA BETTING MARKETS. by Brad R. Humphreys *

Betting Terms Explained

Testing Market Efficiency in a Fixed Odds Betting Market

Testing Efficiency in the Major League of Baseball Sports Betting Market.

Pick Me a Winner An Examination of the Accuracy of the Point-Spread in Predicting the Winner of an NFL Game

The Determinants of Scoring in NFL Games and Beating the Over/Under Line. C. Barry Pfitzner*, Steven D. Lang*, and Tracy D.

Fair Bets and Profitability in College Football Gambling

Volume 30, Issue 4. Market Efficiency and the NHL totals betting market: Is there an under bias?

Using ELO ratings for match result prediction in association football

Herd Behavior and Underdogs in the NFL

The Fibonacci Strategy Revisited: Can You Really Make Money by Betting on Soccer Draws?

A Multinomial Logit-based Statistical Test of Association Football Betting Market Efficiency

Modelling the Scores of Premier League Football Matches

Predicting the World Cup. Dr Christopher Watts Centre for Research in Social Simulation University of Surrey

Forecasting Accuracy and Line Changes in the NFL and College Football Betting Markets

IS THE SOCCER BETTING MARKET EFFICIENT? A CROSS-COUNTRY INVESTIGATION USING THE FIBONACCI STRATEGY

Sports Forecasting. H.O. Stekler. RPF Working Paper No August 13, 2007

Chapter 13 Gambling and the NFL

UZH Business Working Paper Series (ISSN )

Numerical Algorithms for Predicting Sports Results

Fragiskos Archontakis. Institute of Innovation and Knowledge Management (INGENIO) Universidad Politécnica de Valencia-CSIC

Football Bets Explained:

Applied Economics Publication details, including instructions for authors and subscription information:

Prices, Point Spreads and Profits: Evidence from the National Football League

How To Bet On An Nfl Football Game With A Machine Learning Program

EFFICIENCY IN BETTING MARKETS: EVIDENCE FROM ENGLISH FOOTBALL

Home Bias in the NFL Pointspread Market. Matt Cundith Department of Economics California State University, Sacramento

Profiting from arbitrage and odds biases of the European football gambling market

The Market for English Premier League (EPL) Odds

DEVELOPING A MODEL THAT REFLECTS OUTCOMES OF TENNIS MATCHES

Sin City. In poker, the facility to buy additional chips in tournaments. Total payout liability of a casino during any one game.

A Statistical Test of Association Football Betting Market Efficiency

Can Punters win? Are UK betting markets on sporting events. efficiently aggregating information?

How Efficient is the European Football Betting Market? Evidence from Arbitrage and Trading Strategies

Statistical techniques for betting on football markets.

Does NFL Spread Betting Obey the E cient Market Hypothesis?

The Favourite-Longshot Bias in English Football

The Impact of Publicly Available Information on Betting Markets: Implications for Bettors, Betting Operators and Regulators

A Contrarian Approach to the Sports Betting Marketplace

Forecasting Football Results and the Efficiency of Fixed-odds Betting

Testing the Efficiency of the NFL Point Spread Betting Market

EXPECTED VALUES AND VARIANCES IN BOOKMAKER PAYOUTS: A THEORETICAL APPROACH TOWARDS SETTING LIMITS ON ODDS

THE EFFICIENT MARKET HYPOTHESIS AND GAMBLING ON NATIONAL FOOTBALL LEAGUE GAMES YOON TAE SUNG THESIS

SPORTS FORECASTING. There have been an enormous number of studies involving various aspects of sports. We

Sport Hedge Millionaire s Guide to a growing portfolio. Sports Hedge

Fixed odds bookmaking with stochastic betting demands

Late Money and Betting Market Efficiency: Evidence from Australia

What can econometrics tell us about World Cup performance? May 2010

Testing the Efficiency of Sports Betting Markets: An Examination of National Football League and English Premier League Betting

epub WU Institutional Repository

Understanding Price Movements in Point Spread Betting Markets: Evidence from NCAA Basketball

REGULATING INSIDER TRADING IN BETTING MARKETS

EXAMINING NCAA/NFL MARKET EFFICIENCY

Does bettor sentiment affect bookmaker pricing?

Behavioural Biases in the European Football Betting Market

Soccer Analytics. Predicting the outcome of soccer matches. Research Paper Business Analytics. December Nivard van Wijk

THE FAVOURITE-LONGSHOT BIAS AND MARKET EFFICIENCY IN UK FOOTBALL BETTING

Modelling football match results and testing the efficiency of the betting market

ONLINE SPORTS GAMBLING: A LOOK INTO THE EFFICIENCY OF BOOKMAKERS ODDS AS FORECASTS IN THE CASE OF ENGLISH PREMIER LEAGUE

LIVE BETTING ULTRA RULES

Information and e ciency: an empirical study of a xed odds betting market

Prediction accuracy of different market structures bookmakers versus a betting exchange

Multinomial and Ordinal Logistic Regression

Live Betting Extra Rules

Institute for Strategy and Business Economics University of Zurich

Beating the NCAA Football Point Spread

ISSUES IN SPORTS FORECASTING

Testing the Efficiency of Sports Betting Markets

FOOTBALL INVESTOR. Member s Guide 2014/15

THE ULTIMATE FOOTBALL BETTING GUIDE

BETTING MARKET EFFICIENCY AT PREMIERE RACETRACKS

How to cash-in on ALL football matches without ever picking a winning team

Rating Systems for Fixed Odds Football Match Prediction

Data mining and football results prediction

International Statistical Institute, 56th Session, 2007: Phil Everson

Efficiency of football betting markets: the economic significance of trading strategies

Soccer Bot Draw Betting

Do Gamblers Correctly Price Momentum in NBA Betting Markets?

Betting and Gaming Stefan Lüttgen

Predicting sports events from past results

Betting Markets and Social Media

An Analysis of Sportsbook Behavior and How to Profit. Chris Ludwiczak. Advisor: Dr. John Clark

The degree of inefficiency in the. football betting market. Statistical tests

ON ARBITRAGE AND MARKET EFFICIENCY: AN EXAMINATION OF NFL WAGERING. Mark Burkey*

Testing semi-strong efficiency in a fixed odds betting market: Evidence from principal European football leagues.

Transcription:

Efficiency of sports betting markets Ruud H. Koning Department of Economics and Econometrics Faculty of Economics and Business University of Groningen IASE, Gijon (may 2008)

Abstract Sports and betting have grown together to become multi billion dollar industries. The advent of the internet has globalized the betting market even more, with a few clicks it is possible to put a wager on events far away. Betting odds are also routinely used as indicators of winning probabilities or indicators of relative strengths. This assumes that betting markets process information efficiently. In this talk, we test this hypothesis for two types of soccer betting markets, and the market for tennis bets.

Plan Where do we stand (literature review)? NFL Economic theory Soccer Regression-based tests Three applications (soccer, tennis, and again, soccer)

Information efficiency (1) A market in which prices always fully reflect available information is called efficient. (Fama, 1970) A capital market is said to be efficient if it fully and correctly reflects all relevant information in determining prices. Formally, the market is said to be efficient with respect to some information set... if security prices would be unaffected by revealing that information to all participants. Moreover, efficiency with respect to an information set... implies that it is impossible to make economic profits by trading on the basis [of that information set]. (Malkiel, 1992)

Information efficiency (2) A perfect market for a stock is one in which there are no profits to be made by people who have no special information about the company, and in which it is difficult even for people who do have special information to make profits, because the price adjusts so rapidly as the information becomes available... Thus we would like to see randomness in the prices of successive transactions, rather than great continuity... If the price is going to move up, it should move up all at once, rather than in a series of small steps. (Black, 1971)

Terminology (1) (Wikipedia) In probability theory and statistics the odds in favour of an event or a proposition are the quantity p/(1 p), where p is the probability of the event or proposition... In gambling, the odds on display are not the chances that the event will occur, but are the amounts that the bookmaker will pay. However, in a free market under perfect information, the odds of paying tend to drift to the true probability odds.

Terminology (2) (Wikipedia) Taking an event with a 1 in 5 probability of occurring (i.e. 0.2 or 20%), then the odds are 0.2/(1 0.2) = 0.2/0.8 = 0.25. If you bet 1 at fair odds and the event occurred, you would receive back 4 plus your original 1 stake. This would be presented in fractional odds of 4 to 1 against (written as 4 : 1 or 4/1). Decimal odds (Odds): total payout by bookmaker (including stake), so 5.0 in this case. Fair odds (no expected profit, no expected loss): Pr(event) Odds = 1.

Terminology (3) (Wikipedia) Vigorish, or simply vig, or juice, is the amount charged by a bookmaker for his services. The term is Yiddish slang originating from the Russian word for winnings, vyigrysh. The concept is also known as the overround. Example: Final Roland Garros (tennis), Bet365.com paid 1.16 if Henin-Hardene were to win, and 5 if Ivanovic would win. 0.862 vs. 0.20, overround 0.062. Example: RKC Waalwijk-Ajax, Bet365.com paid 5, 3.2, 1.66 for a home win, draw, away win, corresponding to probabilities 0.2, 0.31, 0.60, overround 0.11.

Overround England by season 1.06 1.08 1.10 1.12 2004 2005 2005 2006 150 100 50 density 150 2002 2003 2003 2004 0 100 50 0 1.06 1.08 1.10 1.12 overround

Zuber et al., JPE 1985 Point spread betting, 11 for 10 rule PS it = β 0 + β 1 VL it + ɛ it VL initially posted by bookmaker, later adjusted EMH: β 0 = 0, β 1 = 1 1983 NFL, 13 out of 16 weeks EHM is not rejected 15 out of 16 weeks β 0 = 0, β 1 = 0 not rejected

Zuber et al., JPE 1985 (cont d) PS it = β x it + ɛ it Each variable: HT AT yards rushed, yards passed, number of wins, fumbles, interceptions, number of rookies The results are gratifyingly neat and clear. (p. 803) R 2 = 0.733 (first eight weeks) or R 2 = 0.727 (entire 18 weeks)

Zuber et al., JPE 1985 (cont d) Estimate model up to t 1, PPS it Value bet: PPS it VL it > λ, λ = 0.5, 1, 2, 3 λ = 0.5, bet U$ 110, grand total U$ 2920 Gains not increasing in λ Single NFL season Expoitable opportunity?

Sauer et al., JPE 1988 In particular, there is a stronger weak test. (p. 207) Not all implications of efficiency are used Pooling all 16 weeks: β 0 = 0, β 1 = 1 F = 0.25, β 0 = 0, β 1 = 0, F = 12.2 PS it VL it = β (x h it x a it ) + ɛ it β = 0, F = 0.55

Sauer et al., JPE 1988 (cont d) Estimate model up to t 1, PPS it Value bet: PPS it VL it > λ, λ = 0.5, 1, 2, 3 Strategy is highly profitable in 1983 (U$ 1380), but unsuccessful in 1984: U$ 2920.

Dare and MacDonald, JFE 1996 PS it = β 0 + β 1 VL it + ɛ it, home team minus away team Alternatively: favorite first, so that VL it 0 Only five possibilities: FH-UA, UH-FA, HP-AP, FN-UN, PN Restrictions on regression 1980-1985: β 0 = 0, β 1 = 1 not rejected, only pick-em games and Superbowl

Gray and Gray, JFE 1996 Model Pr(PS it > VL it ) = Φ(β 0 + β 1 HOME i + β 2 FAV i ), β 2 < 0, favorites are less likely to beat the spread Recent past performance: momentum/contrarian investment literature, and applied psychology and economics literature on streaks Team that is doing well is more likely to beat the spread Bet on home team underdog, 4% net return

Boulier et al, App. Ec. 2006 PS it = β 0 + β 1 VL it + ɛ it PS it = β 0 + β 1 VL it + γ z it + ɛ it NYT Power Score, different surface, dome NFL 1994-2000 β 0 > 0, not exploitable, all other tests nonsignificant

Shin, Ec. J.. 1992 Betting as a vehicle to study asset pricing with insider trading Contingent claims market, Arrow-Debreu securities Market exists for a short time, outcome is acknowledged Anecdotal evidence of insider trading Favorite-longshot bias: favorites win too often (compared to odds), and longshots don t win often enough.

Odds Fair odds: Pr(event) Odds = 1. or Pr(event) = 1/Odds. Do these odds reflect all available information? Model based tests and betting based tests

Pope and Peel, Economica 1989 Football matches in the UK, 1981/82 season, 1290 games, four bookmakers Development sample 1055 games, 225 holdout Some bets have positive return HW = β 0 + β 1 (1/Odds) + ɛ ij β 1 < 1, not significant

Pope and Peel, Economica 1989 (cont d) S1: bet if logit probability exceeds implied probability, returns on holdout sample lower than on development sample S2: use info from experts Some bets have positive return Draws: odds do not have all information

Goddard and Asimakopoulos, J. Int. Forecasting 2004 Look for profitable betting strategies Forecast football results Poisson type models vs. discrete choice y ij = θ i θ j + ɛ ij

Goddard and Asimakopoulos, J. Int. Forecasting 2004 (cont d) θ i parametrized: Form Significance of a game Elimination of FA Cup Distance Model development: 1989-1998, forecast 1999, etc

Goddard and Asimakopoulos, J. Int. Forecasting 2004 (cont d) HW = β 0 + β 1 (1/Odds) + β 2 (ˆπ (1/Odds)) + ɛ ij β 0 = 0, β 1 = 1, β 2 = 0 not rejected Bet on highest probability event: positive return at start and end of season HW = β 0 + β 1 (1/Odds) + β 2 (ˆπ (1/Odds)) + ɛ ij + β 2 (π ˆπ)

Goddard and Thomas, J. Int. Sp. Fin. 2006 Results modeled by ordered probit model, based on FIFA rankings and match results from previous tournaments Evaluation of EURO 2004 by simulation Prospects of Greece were not underestimated Home advantage of Portugal is underestimated Individual games: not profitable

Bettings based tests (1) Rating measures difference of quality between two teams Stefani, Norman and Clark, Koning Step 1: estimate rating Step 2: estimate relation between rating and result Step 3: identify value bets

Betting based tests (2) Quality Q it is number of goals scored minus number of goals conceded during the last k games Varies by team and over time Quality difference: Q it Q jt Estimate Pr(HW ) = G(Q it Q jt ) Bet if Pr(HW ) > 1/Odds, expected value Pr(HW ) Odds > 1

Model based tests Estimate Pr(HW ) = 1 Pr(HW ) = 1/Odds = 1 + Odds 1 1 = 1 + exp(log(odds 1)) 1 = 1 + exp( β 0 β 1 log(odds 1)). 1 1 + exp( β 0 β 1 log(odds 1) γ z ijt ) and test β 0 = 0, β 1 = 1 and γ = 0.

Simulation experiment posted odds, Netherlands, 2005-2006, 306 games, 76 affected additive: 1/Odds + 4ψU multiplicative: 1/Odds(1 + ψ) 10000 seasons simulated, test statistics

t test regresion ext. regresion 0.0 0.1 0.2 0.3 0.4 0.5 additive perturbations multiplicative perturbations 0.6 power 0.4 0.2 0.0 0.1 0.2 0.3 0.4 0.5 psi

Soccer Results on soccer competitions in different countries Key variables: home goals, away goals, date, average home win (since 2001-2002) Additional variables: match characteristics as number of bookings, attendance, referee (not for all countries) Data used: from 2002-2003 onwards Odds are posted two days before the game

home win odds draw odds away win odds turkey 0.8 0.7 0.6 0.5 netherlands portugal spain 0.8 0.7 Coefficient of concordance 0.6 0.5 germany greece italy 0.8 0.7 0.6 0.5 belgium england france 0.8 0.7 0.6 0.5 02/03 04/05 06/07 Season 02/03 04/05 06/07

Home win 0.8 0.6 outcome 0.4 0.2 0.2 0.4 0.6 0.8 implied probability by fair odds

Win probability and odds (3) lrm(formula = ftr == "H" ~ log(av.odds.h - 1), data = soccer.data3) Frequencies of Responses FALSE TRUE 8540 7597 Obs C Dxy Gamma R2 Brier 16137 0.698 0.397 0.398 0.165 0.218 Coef S.E. Wald Z P Intercept -0.1640 0.01689-9.71 0 av.odds.h -1.0838 0.02634-41.15 0

home win odds 0.2 0.4 0.6 0.8 away win odds true spain turkey false greece italy netherlands portugal true Outcome false belgium england france germany true false 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 Standardized price

Win probability and odds (4) lrm(formula = ftr == "H" ~ log(b365h.f - 1), data = england) Frequencies of Responses FALSE TRUE 801 719 Coef S.E. Wald Z P Intercept 0.1399 0.05766 2.43 0.0153 b365h.f -1.1019 0.09020-12.22 0.0000

spain spain portugal portugal netherlands netherlands italy italy germany germany france france england england belgium belgium 0.1 0.0 0.1 0.2 0.3 1.4 1.2 1.0 estimated beta_0 estimated beta_1

2005 2006 2005 2006 2004 2005 2004 2005 2003 2004 2003 2004 2002 2003 2002 2003 0.1 0.0 0.1 0.2 0.3 1.4 1.2 1.0 estimated beta_0 estimated beta_1

0.2 0.0 0.2 0.4 0.2 0.0 0.2 0.4 italy netherlands portugal spain 2005 2006 2004 2005 2003 2004 2002 2003 belgium england france germany 2005 2006 2004 2005 2003 2004 2002 2003 0.2 0.0 0.2 0.4 estimated beta_0 0.2 0.0 0.2 0.4

estimated beta_1 2002 2003 2003 2004 2004 2005 2005 2006 1.5 1.0 belgium england 1.5 1.0 france germany italy 1.5 1.0 netherlands portugal 1.5 1.0 2002 2003 2003 2004 2004 2005 2005 2006 spain

Additional variables Financial variables Match characteristics points goals scored moment in season promotion/relegation last season position

Tennis (1) Men s tennis: 2002-2007, women s tennis: 2007 Ranking is important Other information: court (indoor/outdoor), surface, ranking, round, best-of-three or best-of-five Λ(β x) = 1 Λ( β x)

Tennis (2) lrm(formula = Result ~ log(b365.f - 1), data = men.tennis) Coef S.E. Wald Z P Intercept -0.005686 0.02060-0.28 0.7825 B365.F -1.147360 0.02550-44.99 0.0000 lrm(formula = Result ~ log(b365.f - 1), data = women.tennis) Coef S.E. Wald Z P Intercept -0.01855 0.06717-0.28 0.7824 B365.F -1.20002 0.07460-16.09 0.0000

Odds and outcome, men 0.8 0.6 outcome 0.4 0.2 0.2 0.4 0.6 0.8 implied probability by fair odds

Odds and outcome, women 0.8 0.6 outcome 0.4 0.2 0.2 0.4 0.6 0.8 implied probability by fair odds

Tennis (3) > anova(lrm(result~rcs(log(b365.f-1),4),data=men.tennis)) Wald Statistics Response: Result Factor Chi-Square d.f. P B365.F 1952.22 3 <.0001 Nonlinear 8.45 2 0.0146 TOTAL 1952.22 3 <.0001

Odds, logit, and logit with nonlinear terms (men) 0.8 0.6 outcome 0.4 0.2 fair odds logit logit with nonlinear terms 0.2 0.4 0.6 0.8 model probability

Tennis (4) Court: not significant Rank: not significant > anova(lrm(result~best.of*log(b365.f-1),data=men.tennis)) Wald Statistics Response: Result Factor Chi-Square d.f. P Best.of (Factor+Higher 8.87 2 0.0119 All Interactions 8.56 1 0.0034 B365.F (Factor+Higher 2004.80 2 <.0001 All Interactions 8.56 1 0.0034 Best.of * B365.F (Factor 8.56 1 0.0034 TOTAL 2004.83 3 <.0001

Odds and outcome, men 0.2 0.4 0.6 0.8 Best of 3 Best of 5 0.8 0.6 outcome 0.4 0.2 0.2 0.4 0.6 0.8 implied probability by fair odds

Betting exchange (1) Continuous trading 1% commission 1 minute trading stop data set

Newcastle Middlesborough, 3 March 2007, 0 0 home win draw away win 40 30 odds 20 10 0 0 15 30 45 60 75 90 105 time

West Ham United Tottenham Hotspur, 4 March 2007, 3 4 15 odds home win 10 5 B365 (fair) 0 0 1 0 2 0 2 1 2 2 3 23 3 3 4 time

Information efficiency (2) A perfect market for a stock is one in which there are no profits to be made by people who have no special information about the company, and in which it is difficult even for people who do have special information to make profits, because the price adjusts so rapidly as the information becomes available... Thus we would like to see randomness in the prices of successive transactions, rather than great continuity... If the price is going to move up, it should move up all at once, rather than in a series of small steps. (Black, 1971)

West Ham United Tottenham Hotspur, 4 March 2007, 3 4 15 odds home win 10 5 0 0 1 0 2 0 2 1 2 2 3 23 3 3 4 time

West Ham United Tottenham Hotspur, 4 March 2007, 3 4 25 20 odds away win 15 10 5 B365 (fair) 0 0 1 0 2 0 2 1 2 2 3 23 3 3 4 time

West Ham United Tottenham Hotspur, 4 March 2007, 3 4 15 odds draw 10 5 B365 (fair) 0 0 1 0 2 0 2 1 2 2 3 23 3 3 4 time

Betting exchange (2) First minute odds: lrm(formula = team1w ~ log(team1lpm - 1), data = be.data.inplay[min1,]) Frequencies of Responses 0 1 56 49 C 0.738 Coef S.E. Wald Z P Intercept 0.05319 0.2205 0.24 0.8094 team1lpm -1.17135 0.3041-3.85 0.0001

Betting exchange (3) 46th minute odds: lrm(formula = team1w ~ log(team1lpm - 1), data = be.data.inplay[min46,]) Frequencies of Responses 0 1 56 49 Obs C 105 0.885 Coef S.E. Wald Z P Intercept -0.086 0.2627-0.33 0.7434 team1lpm -1.337 0.2729-4.90 0.0000

Betting exchange (4) After first goal in game: lrm(formula = team1.wins ~ log(av.new.odds - 1), data = goals[e, ]) Coef S.E. Wald Z P Intercept -0.1927 0.2106-0.91 0.3602 av.new.odds -0.9907 0.1425-6.95 0.0000 lrm(formula = team1.wins ~ log(first.new.odds - 1), data = goals[e, ]) Coef S.E. Wald Z P Intercept -0.2079 0.2051-1.01 0.3108 first.new.odds -0.9541 0.1389-6.87 0.0000

Betting exchange (5) Interaction with goals being scored by home team or away team are not significant Not enough information to assess effect of red card (Ridder et al, 1991) Trading volumes

Conclusions Logit based tests are useful Not all markets efficient Increasing efficiency over time? Extensions of efficiency research to smaller markets useful Betting exchanges offer new insights