Predictive analysis and modelling football results using machine learning approach for English Premier League

Cited by: 91
Authors
Baboota, Rahul [1]
Kaur, Harleen [2]
Affiliations
[1] Guru Gobind Singh Indraprastha Univ, New Delhi, India
[2] Jamia Hamdard, Sch Engn Sci & Technol, Dept Comp Sci & Engn, New Delhi, India
Keywords
Machine learning; Feature engineering; Data mining; Predictive analysis; Random forest; Support vector machines (SVM); Ranked probability score (RPS); Gradient boosting; MATCH; PROBABILITY; SCORES
DOI
10.1016/j.ijforecast.2018.01.003
Chinese Library Classification
F [Economics]
Discipline classification code
02
Abstract
The introduction of artificial intelligence has given us the ability to build predictive systems with unprecedented accuracy. Machine learning is being used in virtually all areas in one way or another, due to its extreme effectiveness. One such area where predictive systems have gained a lot of popularity is the prediction of football match results. This paper demonstrates our work on the building of a generalized predictive model for predicting the results of the English Premier League. Using feature engineering and exploratory data analysis, we create a feature set for determining the most important factors for predicting the results of a football match, and consequently create a highly accurate predictive system using machine learning. We demonstrate the strong dependence of our models' performances on important features. Our best model using gradient boosting achieved a performance of 0.2156 on the ranked probability score (RPS) metric for game weeks 6 to 38 for the English Premier League aggregated over two seasons (2014-2015 and 2015-2016), whereas the betting organizations that we consider (Bet365 and Pinnacle Sports) obtained an RPS value of 0.2012 for the same period. Since a lower RPS value represents a higher predictive accuracy, our model was not able to outperform the bookmakers' predictions, despite obtaining promising results. © 2018 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
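The abstract compares models by ranked probability score without giving the formula. As a rough illustration only, the sketch below uses the standard definition for an ordered set of r outcomes, RPS = (1/(r-1)) · Σ_{i=1}^{r-1} (Σ_{j=1}^{i} (p_j − e_j))², where p_j is the forecast probability and e_j is 1 for the observed outcome and 0 otherwise. The function names, the (home win, draw, away win) ordering, and the simple averaging over matches are assumptions made for this example, not details confirmed by the record.

```python
from itertools import accumulate


def ranked_probability_score(probs, outcome_index):
    """RPS for one match; outcomes must be in a fixed order.

    Assumed ordering for football: (home win, draw, away win).
    Lower is better; 0.0 means a perfect, fully confident forecast.
    """
    r = len(probs)
    if r < 2:
        raise ValueError("need at least two ordered outcomes")
    if not 0 <= outcome_index < r:
        raise ValueError("outcome_index out of range")
    # One-hot vector of what actually happened.
    observed = [1.0 if i == outcome_index else 0.0 for i in range(r)]
    # Cumulative forecast and cumulative observation.
    cum_p = list(accumulate(probs))
    cum_o = list(accumulate(observed))
    # The final cumulative term is always 1 - 1 = 0, so it is skipped.
    squared = sum((p - o) ** 2 for p, o in zip(cum_p[:-1], cum_o[:-1]))
    return squared / (r - 1)


def mean_rps(forecasts, outcomes):
    """Average RPS over several matches (a hypothetical aggregation)."""
    scores = [ranked_probability_score(p, o) for p, o in zip(forecasts, outcomes)]
    return sum(scores) / len(scores)


# Illustrative numbers only, not data from the paper.
example = ranked_probability_score((0.5, 0.3, 0.2), 0)  # home win observed
```

Because the score is built from cumulative probabilities, a forecast that puts weight on the draw when the home side wins is penalised less than one that puts the same weight on an away win, which is why the metric suits ordered outcomes.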
Pages: 741-755
Number of pages: 15