Predictive analysis and modelling football results using machine learning approach for English Premier League

被引:91
|
作者
Baboota, Rahul [1 ]
Kaur, Harleen [2 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, New Delhi, India
[2] Jamia Hamdard, Sch Engn Sci & Technol, Dept Comp Sci & Engn, New Delhi, India
关键词
Machine learning; Feature engineering; Data mining; Predictive analysis; Random forest; Support vector machines (SVM); Ranked probability score (RPS); Gradient boosting; MATCH; PROBABILITY; SCORES;
D O I
10.1016/j.ijforecast.2018.01.003
中图分类号
F [经济];
学科分类号
02 ;
摘要
The introduction of artificial intelligence has given us the ability to build predictive systems with unprecedented accuracy. Machine learning is being used in virtually all areas in one way or another, due to its extreme effectiveness. One such area where predictive systems have gained a lot of popularity is the prediction of football match results. This paper demonstrates our work on the building of a generalized predictive model for predicting the results of the English Premier League. Using feature engineering and exploratory data analysis, we create a feature set for determining the most important factors for predicting the results of a football match, and consequently create a highly accurate predictive system using machine learning. We demonstrate the strong dependence of our models' performances on important features. Our best model using gradient boosting achieved a performance of 0.2156 on the ranked probability score (RPS) metric for game weeks 6 to 38 for the English Premier League aggregated over two seasons (2014-2015 and 2015-2016), whereas the betting organizations that we consider (Bet365 and Pinnacle Sports) obtained an RPS value of 0.2012 for the same period. Since a lower RPS value represents a higher predictive accuracy, our model was not able to outperform the bookmaker's predictions, despite obtaining promising results. (C) 2018 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:741 / 755
页数:15
相关论文
共 50 条
  • [21] MACHINE LEARNING-BASED PREDICTIVE MODELLING OF LAMINATED COMPOSITES
    Kalita, Kanak
    Chakraborty, Shankar
    Gautam, Preeti
    Petru, Jana
    Samal, S. P.
    MM SCIENCE JOURNAL, 2025, 2025 : 8169 - 8175
  • [22] Predictive Modelling for Concrete Failure at Anchorages Using Machine Learning Techniques
    Spyridis, Panagiotis
    Olalusi, Oladimeji B.
    MATERIALS, 2021, 14 (01) : 1 - 22
  • [23] Chess Results Analysis Using Elo Measure with Machine Learning
    Thabtah, Fadi
    Padmavathy, Arun J.
    Pritchard, Andrew
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2020, 19 (02)
  • [24] Machine Learning for Predictive Modelling of Ambulance Calls
    Yu, Miao
    Kollias, Dimitrios
    Wingate, James
    Siriwardena, Niro
    Kollias, Stefanos
    ELECTRONICS, 2021, 10 (04) : 1 - 12
  • [25] Predictive modeling for wine authenticity using a machine learning approach
    Costa, Nattane Luiza da
    Valentin, Leonardo A.
    Castro, Inar Alves
    Barbosa, Rommel Melgaco
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2021, 5 : 157 - 162
  • [26] Predictive Analysis of Postpartum Depression Using Machine Learning
    Kim, Hyunkyoung
    HEALTHCARE, 2025, 13 (08)
  • [27] Predictive Analysis for Personal Loans by Using Machine Learning
    Huang, Hui-I.
    Wang, Chou-Wen
    Wu, Chin-Wen
    HCI IN BUSINESS, GOVERNMENT AND ORGANIZATIONS, PT I, HCIBGO 2024, 2024, 14720 : 187 - 199
  • [28] Predictive Modelling of Financial Distress of Slovak Companies Using Machine Learning Techniques
    Durica, Marek
    Svabova, Lucia
    Kramarova, Katarina
    ECONOMICS, MANAGEMENT & BUSINESS 2023: CONTEMPORARY ISSUES, INSIGHTS AND NEW CHALLENGES, 2023, : 780 - 786
  • [29] Predictive modelling of metabolic syndrome in Ghanaian diabetic patients: an ensemble machine learning approach
    Acheampong, Emmanuel
    Adua, Eric
    Obirikorang, Christian
    Anto, Enoch Odame
    Peprah-Yamoah, Emmanuel
    Obirikorang, Yaa
    Asamoah, Evans Adu
    Opoku-Yamoah, Victor
    Nyantakyi, Michael
    Taylor, John
    Buckman, Tonnies Abeku
    Yakubu, Maryam
    Afrifa-Yamoah, Ebenezer
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2024, 23 (02) : 2233 - 2249
  • [30] Predictive modelling of stress, anxiety and depression: A network analysis and machine learning study
    Ganai, Umer Jon
    Sachdev, Shivani
    Bhushan, Braj
    BRITISH JOURNAL OF CLINICAL PSYCHOLOGY, 2024, 63 (04) : 522 - 542