Evaluating the effectiveness of machine learning models for performance forecasting in basketball: a comparative study

被引:14
作者
Papageorgiou, George [1 ]
Sarlis, Vangelis [1 ]
Tjortjis, Christos [1 ]
机构
[1] Int Hellen Univ, Sch Sci & Technol, 14th Km Thessaloniki Moudania, Thermi 57001, Greece
关键词
Data mining (DM); Data science; Forecasting; Machine learning (ML); Sports analytics (SA); BIG-DATA; REGRESSION; PREDICTION; ANALYTICS; ERROR;
D O I
10.1007/s10115-024-02092-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sports analytics (SA) incorporate machine learning (ML) techniques and models for performance prediction. Researchers have previously evaluated ML models applied on a variety of basketball statistics. This paper aims to benchmark the forecasting performance of 14 ML models, based on 18 advanced basketball statistics and key performance indicators (KPIs). The models were applied on a filtered pool of 90 high-performance players. This study developed individual forecasting scenarios per player and experimented using all 14 models. The models' performance ranking was developed using a bespoke evaluation metric, called weighted average percentage error (WAPE), formulated from the weighted mean absolute percentage error (MAPE) evaluation results of each forecasted statistic and model. Moreover, we employed a comprehensive forecasting approach to improve KPI's results. Results showed that Tree-based models, namely Extra Trees, Random Forest, and Decision Tree, are the best performers in most of the forecasted performance indicators, with the best performance achieved by Extra Trees with a WAPE of 34.14%. In conclusion, we achieved a 3.6% MAPE improvement for the selected KPI with our approach on unseen data.
引用
收藏
页码:4333 / 4375
页数:43
相关论文
共 102 条
[1]   Basketball lineup performance prediction using edge-centric multi-view network analysis [J].
Ahmadalinezhad, Mahboubeh ;
Makrehchi, Masoud .
SOCIAL NETWORK ANALYSIS AND MINING, 2020, 10 (01)
[2]   Using a case-based reasoning approach for trading in sports betting markets [J].
Alberola, Juan M. ;
Garcia-Fornes, Ana .
APPLIED INTELLIGENCE, 2013, 38 (03) :465-477
[3]   AI Meta-Learners and Extra-Trees Algorithm for the Detection of Phishing Websites [J].
Alsariera, Yazan Ahmad ;
Adeyemo, Victor Elijah ;
Balogun, Abdullateef Oluwagbemiga ;
Alazzawi, Ammar Kareem .
IEEE ACCESS, 2020, 8 :142532-142542
[4]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[5]  
Ambesange S, 2020, PROCEEDINGS OF THE 2020 FOURTH WORLD CONFERENCE ON SMART TRENDS IN SYSTEMS, SECURITY AND SUSTAINABILITY (WORLDS4 2020), P827, DOI [10.1109/worlds450073.2020.9210404, 10.1109/WorldS450073.2020.9210404]
[6]   Evaluation of Tree-Based Ensemble Machine Learning Models in Predicting Stock Price Direction of Movement [J].
Ampomah, Ernest Kwame ;
Qin, Zhiguang ;
Nyame, Gabriel .
INFORMATION, 2020, 11 (06)
[7]   Luck is Hard to Beat: The Difficulty of Sports Prediction [J].
Aoki, Raquel Y. S. ;
Assuncao, Renato M. ;
Vaz de Melo, Pedro O. S. .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1367-1375
[8]   The Box-Cox Transformation: Review and Extensions [J].
Atkinson, Anthony C. ;
Riani, Marco ;
Corbellini, Aldo .
STATISTICAL SCIENCE, 2021, 36 (02) :239-255
[9]   Sports Big Data: Management, Analysis, Applications, and Challenges [J].
Bai, Zhongbo ;
Bai, Xiaomei .
COMPLEXITY, 2021, 2021 (2021)
[10]  
Belega Daniel, 2019, 2019 IEEE 5th International forum on Research and Technology for Society and Industry (RTSI). Proceedings, P1, DOI 10.1109/RTSI.2019.8895576