Predicting the World Cup. Dr Christopher Watts Centre for Research in Social Simulation University of Surrey



Similar documents
Modelling the Scores of Premier League Football Matches

Analyzing Information Efficiency in the Betting Market for Association Football League Winners

THE FORM LAB 2.5 GOAL STRATEGY

Soccer Analytics. Predicting the outcome of soccer matches. Research Paper Business Analytics. December Nivard van Wijk

Fragiskos Archontakis. Institute of Innovation and Knowledge Management (INGENIO) Universidad Politécnica de Valencia-CSIC

Equotion. Working with Equotion

The 7 Premier League trends the bookies don t want YOU to know...

HOW TO BET ON FOOTBALL. Gambling can be addictive. Please play responsibly.

The Betting Machine. Martin Belgau Ellefsrød

IS THE SOCCER BETTING MARKET EFFICIENT? A CROSS-COUNTRY INVESTIGATION USING THE FIBONACCI STRATEGY

Testing Efficiency in the Major League of Baseball Sports Betting Market.

Football Match Winner Prediction

A random point process model for the score in sport matches

Efficiency of sports betting markets

Betting Terms Explained

BF Bot Manager V3 and Multiple Strategies bot manual

Using Monte Carlo simulation to calculate match importance: the case of English Premier League

FOOTBALL INVESTOR. Member s Guide 2014/15

Man Vs Bookie. The 3 ways to make profit betting on Football. Man Vs Bookie Sport Betting

Predicting Soccer Match Results in the English Premier League

Football Bets Explained:

Statistical techniques for betting on football markets.

More details >>> HERE <<<

Using ELO ratings for match result prediction in association football

The Market for English Premier League (EPL) Odds

What can econometrics tell us about World Cup performance? May 2010

Full version is >>> HERE <<<

How to bet and win: and why not to trust a winner. Niall MacKay. Department of Mathematics

The Ultimate Professional Guide to Winning at Sports Betting

Euro Team Stats Report

Your essential guide to the bookies best offers

Statistical Football Predictions and Betting Tips

Jordan Miller s Betting Profit Blitz

Commercialisation. Globalisation. Governance. Integrity

Combining player statistics to predict outcomes of tennis matches

FOOTBALL AND CASH HOW TO MAKE MONEY BETTING ON NAIRABET

THE FAVOURITE-LONGSHOT BIAS AND MARKET EFFICIENCY IN UK FOOTBALL BETTING

Beating the bookie: A look at statistical models for prediction of football matches

DEVELOPING A MODEL THAT REFLECTS OUTCOMES OF TENNIS MATCHES

Behavioural Biases in the European Football Betting Market

A Decision Support System for the Professional Soccer Referee in Time-Sensitive Operations

Betting Guide. BET ONline AND MOBI: Betting in running on all matches Spreads in Running on all Matches Exotics available

Dynamic bayesian forecasting models of football match outcomes with estimation of the evolution variance parameter

Betting on Excel to enliven the teaching of probability

How To Avoid Bookmaker Restrictions. Some very useful tips for serious sports investor who uses Racing Profit Booster Daily

BetXchange Rugby World Cup & Betting Guide

Profiting from an Inefficient Association Football Gambling Market: Prediction, Risk and Uncertainty Using Bayesian Networks

MATH Assignment in Mathematics. Modeling and Simulating Football Results

FACT BOOK Use these interesting facts and figures from around the world and the world of soccer to help inspire your soccer season.

HOW TO PLAY JustBet. What is Fixed Odds Betting?

free soccer bet prediction

Betting and Gaming Stefan Lüttgen

UEFA Champions League

SECRET BETTING CLUB FINK TANK FREE SYSTEM GUIDE

Correlation Analysis Between SoccerGame World Ranking and Player League Distribution

A Two-Stage Bayesian Model for Predicting Winners in Major League Baseball

Full version is >>> HERE <<<

EFFICIENCY IN BETTING MARKETS: EVIDENCE FROM ENGLISH FOOTBALL

Joint modelling of goals and bookings in association football matches

Football Bets Explained

pi-football: A Bayesian network model for forecasting Association Football match outcomes

Predicting Market Value of Soccer Players Using Linear Modeling Techniques

2013 SCHOLARSHIP EXAMINATION PRACTICAL SECTION

The most reliable cards bets in the Premier League. bets that paid out in 84%, 89% and 95% of games last season

UZH Business Working Paper Series (ISSN )

Sport Hedge Millionaire s Guide to a growing portfolio. Sports Hedge

Additional information >>> HERE <<< Getting Start biggest win on online roulette User Review

Comparing & Contrasting. - mathematically. Ways of comparing mathematical and scientific quantities...

Paul Ruffy 2015 Cautionary Note/Disclaimer: You are solely responsible for any money that you bet, win or lose. No reproduction or distribution of

A Multinomial Logit-based Statistical Test of Association Football Betting Market Efficiency

How To Make Big Money Betting In Naira On Football Matches!

Profiting from arbitrage and odds biases of the European football gambling market

Re-inventing the Game - Driving Online Gaming Growth. September 2010

bet365 spread betting best odds

Private Members Club Soccer Betting System

The Kelly criterion for spread bets

MODULE 1 INTRODUCTION TO THE BETTING EXCHANGE CONCEPT

The Fibonacci Strategy Revisited: Can You Really Make Money by Betting on Soccer Draws?

Glossary of bet types

Sample Size Designs to Assess Controls

The Global Business of Online Sports Betting: A Market Assessment and Outlook

OBJECTIVE ASSESSMENT OF FORECASTING ASSIGNMENTS USING SOME FUNCTION OF PREDICTION ERRORS

Additional details >>> HERE <<<

International Statistical Institute, 56th Session, 2007: Phil Everson

executive summary permanent tsb

Football Match Result Predictor Website

Texas Hold em. From highest to lowest, the possible five card hands in poker are ranked as follows:

Björn Bäckström, Lars Olsén and Simon Renström Copyright 2011 ClaroBet AB. Updated: May. 12, 11 You may not distribute or copy any of the text or use

Soccer Bot Draw Betting

Simbabet. Tip Code League Time MBS Home 1 X 2 Away 1234 UEFA 16:45 1 Barcelona Arsenal

Virtual Sports Betting Secrets

A Statistical Test of Association Football Betting Market Efficiency

Rating Systems for Fixed Odds Football Match Prediction

Bet Types: Locating. This tutorial serves as a guide to locating the various bet types on the relevant bookmaker websites.

SECRET BETTING CLUB FINK TANK FREE SYSTEM GUIDE

STOCHASTIC MODELLING OF SOCCER MATCH RESULTS. Stephen Dobson. University of the West Indies. John Goddard * University of Wales Swansea

PDF Created with deskpdf PDF Writer - Trial ::

Transcription:

Predicting the World Cup Dr Christopher Watts Centre for Research in Social Simulation University of Surrey

Possible Techniques Tactics / Formation (4-4-2, 3-5-1 etc.) Space, movement and constraints Data on passes attempted and received Agent-based simulation? Robo soccer? Computer games? Picking a team Data on who was playing whenever Rooney scored Combinatorial optimisation Statistical modelling of matches Data on goals scored in each match Poisson model, Markov Chain Monte Carlo (MCMC) Data on win/draw/lose Probit model Prediction distinct from Explanation 2

Why MCMC? Data readily available BBC Sport website, FIFA website, etc. Answers interesting questions Who is likely to win this match? What odds of it ending 5-1? Answers these questions on a large scale Dozens of matches from one model 3

Procedure Get dataset Fit mathematical model (training) Don t overfit model (validation) Predict outcomes or estimate odds (test) Go to William Hill, Ladbrokes etc. 4

Some Reading Dixon & Coles (1997) Karlis (2003) Graham & Stott (2008) Spiegelhalter & Ng (2009) Greenhough et al. (2002) Denis Campbell, The Observer, Sunday 28 May 2006 5

The model Let # goals scored by i against j be Poisson-distributed with parameter lambda = ( A i / D j ) where A i is Attacking strength of i D j is Defensive strength of j 6

Premier League 20 teams in division so 20 attack + 20 defence = 40 unknowns But every team will play every other home and away 20 x 19 = 380 matches per season Use some of this as training data, some as validation and predict the rest Network of known results constrains the unknown parameters 7

Questionable assumptions (1) Poisson distribution Scoring one goal is no more likely after scoring three than after scoring none No confidence / morale effects, no learning 9:0 shouldn t appear every other season (nor every other century?) Alternatives Weibull function (Discretised) Two parameters (alpha, beta) in place of lambda Negative Binomial 8

Questionable assumptions (2) Same parameters all season? New teams members in August and January Rain-soaked pitches lead to defensive mistakes (esp. in November) Fatigue (African Cup of Nations, Europe) Injuries Managerial tinkering, rotation Extra parameters for seasonality? 9

Can we gamble? Bookmakers odds reflect: their need to make a profit so implied probabilities will not sum up to 1 their need to hedge bets 1 million patriots bet on England more information than just past results e.g. Rio Ferdinand is out! (8 to 1, from 7 to 1) Identify undervalued outcomes E.g. bet against the favourite Operate on a large scale (Expensive!) 10

MCMC Simulation Each combination of 20x2 parameters represents a possible system state During simulation system jumps from state to (more likely) state Over time system tends to something close to the most likely state (hopefully) The parameter values that best fit the data 11

Max Likelihood Likelihood Ratio P( Results data Theory1 ) P( Results data Theory2 ) P(X=x) = lambda x * e -lambda / x! Algorithm options: Always adopt the larger (Ascent) Random choice stratified using odds ratio (Gibbs sampling) 12

Log Likelihood Likelihood of the theory parameters: P ( Goals scored X ij = x X ij ~ Pois( A i / D j ) ) Multiply corresponding probability for each goal score (home, away) for each match in data set Equivalently: Sum the log likelihoods Assumptions! Every match result is independent of every other Goals scored is independent of goals conceded 13

Validation data Use separate validation data to demonstrate when model is over-fit to training data Likelihood given validation data peaks Around 13000 iterations in this example 14

Premiership 2009-10 4 th April, 2-3 matches to go 15

Prediction reliability? 2009-10 saw a tight contest at top and bottom! Even with 3 games to go prediction was inaccurate 16

The World Cup 32 nations, selected from 207, 6 continents Fit FIFA data for last 5 years World & Continental competitions Qualifiers (Home + Away) Finals (Usually only one Home team) Friendlies (Home or Away) Few inter-continental matches Longer time scale 2-3 matches, then long breaks Finals: 7 matches in 5 weeks 17

Monte Carlo Simulation Given model of teams simulate the tournament Sample scores for each match Calculate points, winners Repeat 10000 times Estimate odds for: Particular teams reaching the Last 16, Quarter Finals etc. and Winning the competition 18

Beat the bookies Estimate odds If bookmakers offer longer odds England (rows) vs. USA (columns) None of these are tempting 19

Parameters fit and estimated chances 20

Any tips? Model says Brazil have odds of 2.1 to 1 William Hill offer 9 to 2 (=4.5:1) England bad bet at 18 to 1 (WH: 8 to 1) Germany best bet: Model says 11 to 2 (WH: 14 to 1!) Denmark, Serbia also undervalued Forget Italy, Portugal It s not going to be USA, Chile or Greece either 21

Surprised? Germany again?!? Had Home advantage 4 years ago Ballack is out this time Bundesliga uses balls from Adidas Why are Spain not higher? 22

Easy group? Ranked by Chance of getting at least this far Spain could face Brazil, Portugal or Ivory Coast in the Last 16 Things get tougher for England after the Group stage 23

Extensions Reweighted data by age Let importance of result decay exponentially over time Focus on last 12 months Spain now become favourite England still only 5% chance! 24

Any lessons? We model (adaptive!) human social behaviour Use MCMC to fit network data As in Siena / stocnet (ERGM) Energy models (my PhD topic) Individuals energise/de-energise each other when they interact This affects future interactions interaction ritual chains theory (Collins) Stratification: success breeds success (as in science) Learning models (Learning to beat x? To fear x?) 25