Towards smart-data: Improving predictive accuracy in long-term football team performance

被引:32
作者
Constantinou, Anthony [1 ]
Fenton, Norman [1 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Risk & Informat Management RIM Res Grp, London E1 4NS, England
基金
欧洲研究理事会;
关键词
Data engineering; Dynamic Bayesian networks; Expert systems; Favourite-longshot bias; Football predictions; Knowledge engineering; Smart data; Soccer predictions; ODDS BETTING MARKET; BAYESIAN NETWORKS; ENGLISH FOOTBALL; UK FOOTBALL; EFFICIENCY; UNCERTAINTY; INFORMATION; LEAGUE; MODEL;
D O I
10.1016/j.knosys.2017.03.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite recent promising developments with large datasets and machine learning, the idea that automation alone can discover all key relationships between factors of interest remains a challenging task. Indeed, in many real-world domains, experts can often understand and identify key relationships that data alone may fail to discover, no matter how large the dataset. Hence, while pure machine learning provides obvious benefits, these benefits may come at a cost of accuracy. Here we focus on what we call smart-data; a method which supports data engineering and knowledge engineering approaches that put greater emphasis on applying causal knowledge and real-world 'facts' to the process of model development, driven by what data are really required for prediction, rather than by what data are available. We demonstrate how we exploited knowledge to develop a model that generates accurate predictions of the evolving performance of football teams based on limited data. The model enables us to predict, before a season starts, the total league points a team is expected to accumulate throughout the season. The results compare favourably against a number of other relevant and different types of models, and are on par with some other models which use far more data. The model results also provide a novel and comprehensive attribution study of the factors most influencing change in team performance, and partly address the cause of the widely accepted favourite-longshot bias observed in bookies odds. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:93 / 104
页数:12
相关论文
共 49 条
[1]  
Agena, 2016, BAYESIAN NETWORK SIM
[2]  
[Anonymous], 2009, The world economic outlook: crisis and recovery
[3]  
[Anonymous], 2016, TELEGRAPH GRAPHIC PR
[4]  
[Anonymous], 2001, GLOBALISATION EFFICI
[5]   Bayesian hierarchical model for the prediction of football results [J].
Baio, Gianluca ;
Blangiardo, Marta .
JOURNAL OF APPLIED STATISTICS, 2010, 37 (02) :253-264
[6]   The favourite-longshot bias and market efficiency in UK football betting [J].
Cain, M ;
Law, D ;
Peel, D .
SCOTTISH JOURNAL OF POLITICAL ECONOMY, 2000, 47 (01) :25-36
[7]  
Constantinou A., J PSYCHOL, V15
[8]  
Constantinou A. C., 2013, Journal of Gambling Business and Economics, V7, P41
[9]   pi-football: A Bayesian network model for forecasting Association Football match outcomes [J].
Constantinou, Anthony C. ;
Fenton, Norman E. ;
Neil, Martin .
KNOWLEDGE-BASED SYSTEMS, 2012, 36 :322-339
[10]   Integrating expert knowledge with data in Bayesian networks: Preserving data-driven expectations when the expert variables remain unobserved [J].
Constantinou, Anthony Costa ;
Fenton, Norman ;
Neil, Martin .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 56 :197-208