TY - JOUR
T1 - Forecasting football match results in national league competitions using score-driven time series models
AU - Koopman, Siem Jan
AU - Lit, Rutger
PY - 2019/4
Y1 - 2019/4
N2 - We develop a new dynamic multivariate model for the analysis and forecasting of football match results in national league competitions. The proposed dynamic model is based on the score of the predictive observation mass function for a high-dimensional panel of weekly match results. Our main interest is in forecasting whether the match result is a win, a loss or a draw for each team. The dynamic model for delivering such forecasts can be based on three different dependent variables: the pairwise count of the number of goals, the difference between the numbers of goals, or the category of the match result (win, loss, draw). The different dependent variables require different distributional assumptions. Furthermore, different dynamic model specifications can be considered for generating the forecasts. We investigate empirically which dependent variable and which dynamic model specification yield the best forecasting results. We validate the precision of the resulting forecasts and the success of the forecasts in a betting simulation in an extensive forecasting study for match results from six large European football competitions. Finally, we conclude that the dynamic model for pairwise counts delivers the most precise forecasts while the dynamic model for the difference between counts is most successful for betting, but that both outperform benchmark and other competing models.
AB - We develop a new dynamic multivariate model for the analysis and forecasting of football match results in national league competitions. The proposed dynamic model is based on the score of the predictive observation mass function for a high-dimensional panel of weekly match results. Our main interest is in forecasting whether the match result is a win, a loss or a draw for each team. The dynamic model for delivering such forecasts can be based on three different dependent variables: the pairwise count of the number of goals, the difference between the numbers of goals, or the category of the match result (win, loss, draw). The different dependent variables require different distributional assumptions. Furthermore, different dynamic model specifications can be considered for generating the forecasts. We investigate empirically which dependent variable and which dynamic model specification yield the best forecasting results. We validate the precision of the resulting forecasts and the success of the forecasts in a betting simulation in an extensive forecasting study for match results from six large European football competitions. Finally, we conclude that the dynamic model for pairwise counts delivers the most precise forecasts while the dynamic model for the difference between counts is most successful for betting, but that both outperform benchmark and other competing models.
KW - Bivariate Poisson
KW - Ordered probit
KW - Probabilistic loss function
KW - Skellam
UR - http://www.scopus.com/inward/record.url?scp=85060302194&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060302194&partnerID=8YFLogxK
U2 - 10.1016/j.ijforecast.2018.10.011
DO - 10.1016/j.ijforecast.2018.10.011
M3 - Article
AN - SCOPUS:85060302194
SN - 0169-2070
VL - 35
SP - 797
EP - 809
JO - International Journal of Forecasting
JF - International Journal of Forecasting
IS - 2
ER -