Predictive analysis and modelling football results using machine learning approach for English Premier League

被引:91
|
作者
Baboota, Rahul [1 ]
Kaur, Harleen [2 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, New Delhi, India
[2] Jamia Hamdard, Sch Engn Sci & Technol, Dept Comp Sci & Engn, New Delhi, India
关键词
Machine learning; Feature engineering; Data mining; Predictive analysis; Random forest; Support vector machines (SVM); Ranked probability score (RPS); Gradient boosting; MATCH; PROBABILITY; SCORES;
D O I
10.1016/j.ijforecast.2018.01.003
中图分类号
F [经济];
学科分类号
02 ;
摘要
The introduction of artificial intelligence has given us the ability to build predictive systems with unprecedented accuracy. Machine learning is being used in virtually all areas in one way or another, due to its extreme effectiveness. One such area where predictive systems have gained a lot of popularity is the prediction of football match results. This paper demonstrates our work on the building of a generalized predictive model for predicting the results of the English Premier League. Using feature engineering and exploratory data analysis, we create a feature set for determining the most important factors for predicting the results of a football match, and consequently create a highly accurate predictive system using machine learning. We demonstrate the strong dependence of our models' performances on important features. Our best model using gradient boosting achieved a performance of 0.2156 on the ranked probability score (RPS) metric for game weeks 6 to 38 for the English Premier League aggregated over two seasons (2014-2015 and 2015-2016), whereas the betting organizations that we consider (Bet365 and Pinnacle Sports) obtained an RPS value of 0.2012 for the same period. Since a lower RPS value represents a higher predictive accuracy, our model was not able to outperform the bookmaker's predictions, despite obtaining promising results. (C) 2018 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:741 / 755
页数:15
相关论文
共 50 条
  • [1] Predicting the Outcome of English Premier League Matches using Machine Learning
    Raju, Muntaqim Ahmed
    Mia, Md Solaiman
    Abu Sayed, Md
    Uddin, Md Riaz
    2020 2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2020,
  • [2] Player Recommendation System for Fantasy Premier League using Machine Learning
    Rajesh, Vimal
    Arjun, P.
    Jagtap, Kunal Ravikumar
    Suneera, C. M.
    Prakash, Jay
    2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022), 2022,
  • [3] Predictive modelling and analytics for diabetes using a machine learning approach
    Kaur, Harleen
    Kumari, Vinita
    APPLIED COMPUTING AND INFORMATICS, 2022, 18 (1/2) : 90 - 100
  • [4] THE EFFECTS OF DECOMPOSITION OF THE GOALS SCORED IN CLASSIFYING THE OUTCOMES OF FIVE ENGLISH PREMIER LEAGUE SEASONS USING MACHINE LEARNING MODELS
    Iyiola, Tomilayo P.
    Okagbue, Hilary I.
    Adedotun, Adedayo F.
    Akingbade, Toluwalase J.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2023, 87 (01) : 13 - 27
  • [5] Predictive modelling and analytics of students' grades using machine learning algorithms
    Badal, Yudish Teshal
    Sungkur, Roopesh Kevin
    EDUCATION AND INFORMATION TECHNOLOGIES, 2023, 28 (03) : 3027 - 3057
  • [6] Predictive modelling and analytics of students’ grades using machine learning algorithms
    Yudish Teshal Badal
    Roopesh Kevin Sungkur
    Education and Information Technologies, 2023, 28 : 3027 - 3057
  • [7] Predictive Modelling of Diseases Based on a Network and Machine Learning Approach
    Tuan-Truong Quang
    Nghia Le
    Bac Le
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, 2022, 1716 : 641 - 654
  • [8] The Football Matches Outcome Prediction for English Premier League (EPL): A Comparative Analysis of Multi-class Models
    Adnan, Nur Amirah
    Asri, Luqman Al Hakim Mohd
    Mustapha, Aida
    Razali, Muhammad Nazim
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, SCDM 2024, 2024, 1078 : 411 - 420
  • [9] USE OF THE FIRST AND SECOND HALVES RESULTS TO CLASSIFY THE FINAL OUTCOME OF ENGLISH PREMIER LEAGUE MATCHES
    Iyiola, Tomilayo P.
    Okagbue, Hilary I.
    Odetunmibi, Oluwole A.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 82 : 53 - 64
  • [10] Predictive modelling of sustainable lightweight foamed concrete using machine learning novel approach
    Ullah, Haji Sami
    Khushnood, Rao Arsalan
    Ahmad, Junaid
    Farooq, Furqan
    JOURNAL OF BUILDING ENGINEERING, 2022, 56