Predictive analysis and modelling football results using machine learning approach for English Premier League

被引:91
|
作者
Baboota, Rahul [1 ]
Kaur, Harleen [2 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, New Delhi, India
[2] Jamia Hamdard, Sch Engn Sci & Technol, Dept Comp Sci & Engn, New Delhi, India
关键词
Machine learning; Feature engineering; Data mining; Predictive analysis; Random forest; Support vector machines (SVM); Ranked probability score (RPS); Gradient boosting; MATCH; PROBABILITY; SCORES;
D O I
10.1016/j.ijforecast.2018.01.003
中图分类号
F [经济];
学科分类号
02 ;
摘要
The introduction of artificial intelligence has given us the ability to build predictive systems with unprecedented accuracy. Machine learning is being used in virtually all areas in one way or another, due to its extreme effectiveness. One such area where predictive systems have gained a lot of popularity is the prediction of football match results. This paper demonstrates our work on the building of a generalized predictive model for predicting the results of the English Premier League. Using feature engineering and exploratory data analysis, we create a feature set for determining the most important factors for predicting the results of a football match, and consequently create a highly accurate predictive system using machine learning. We demonstrate the strong dependence of our models' performances on important features. Our best model using gradient boosting achieved a performance of 0.2156 on the ranked probability score (RPS) metric for game weeks 6 to 38 for the English Premier League aggregated over two seasons (2014-2015 and 2015-2016), whereas the betting organizations that we consider (Bet365 and Pinnacle Sports) obtained an RPS value of 0.2012 for the same period. Since a lower RPS value represents a higher predictive accuracy, our model was not able to outperform the bookmaker's predictions, despite obtaining promising results. (C) 2018 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:741 / 755
页数:15
相关论文
共 50 条
  • [41] Predictive Analysis Of Breast Cancer Using Machine Learning Techniques
    Agrawal, Rashmi
    INGENIERIA SOLIDARIA, 2019, 15 (29):
  • [42] Modelling wetting angle of solder on substrate using machine learning approach
    Lee, Bing-Xi
    Liu, Yu-chen
    SCIENCE AND TECHNOLOGY OF WELDING AND JOINING, 2025,
  • [43] On the Generalizability of Machine-Learning-Assisted Anisotropy Mappings for Predictive Turbulence Modelling
    McConkey, Ryley
    Yee, Eugene
    Lien, Fue-Sang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL FLUID DYNAMICS, 2022, 36 (07) : 555 - 577
  • [44] Predictive modelling of flexural behaviour of polymer composites: a machine learning approach through material extrusion
    Jain, Akash
    Upadhyay, Saloni
    Pathik, Kanishka
    Raj, Tapish
    Sahai, Ankit
    Sharma, Rahul Swarup
    PROGRESS IN ADDITIVE MANUFACTURING, 2024,
  • [45] Predictive modelling of MapReduce job performance in cloud environments using machine learning techniques
    Bergui, Mohammed
    Hourri, Soufiane
    Najah, Said
    Nikolov, Nikola S.
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [46] Predictive modelling of sustainable concrete compressive strength using advanced machine learning algorithms
    Joshi, Tejas
    Mathur, Pulkit
    Oza, Parita
    Agrawal, Smita
    Narmawala, Husen
    ADVANCES IN CIVIL AND ARCHITECTURAL ENGINEERING, 2024, 15 (29): : 168 - 192
  • [47] A machine learning approach for predictive warehouse design
    Alessandro Tufano
    Riccardo Accorsi
    Riccardo Manzini
    The International Journal of Advanced Manufacturing Technology, 2022, 119 : 2369 - 2392
  • [48] Predictive modelling of turbofan engine components condition using machine and deep learning methods
    Matuszczak, Michal
    Zbikowski, Mateusz
    Teodorczyk, Andrzej
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2021, 23 (02): : 359 - 370
  • [49] Predictive modelling of sweep's specific draft using machine learning regression approaches
    Gautam, Prem Veer
    Agrawal, Kamal Nayan
    Roul, Ajay Kumar
    Mansuri, Shekh Mukhtar
    Subeesh, A.
    SOIL USE AND MANAGEMENT, 2024, 40 (01)
  • [50] Predictive Sleep Disorder Modelling: Using Machine Learning with Lifestyle and Sleep Health Data
    Kumar, Mukesh
    Ahmed, Bilal
    Mishra, Hari Mohan
    Jha, Amit Kumar
    Sikarwal, Pushpendra Kumar
    Rampal, Sahil
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,