Evaluating the effectiveness of machine learning models for performance forecasting in basketball: a comparative study

Cited by: 14
Authors
Papageorgiou, George [1]
Sarlis, Vangelis [1]
Tjortjis, Christos [1]
Affiliations
[1] Int Hellen Univ, Sch Sci & Technol, 14th Km Thessaloniki Moudania, Thermi 57001, Greece
Keywords
Data mining (DM); Data science; Forecasting; Machine learning (ML); Sports analytics (SA); BIG-DATA; REGRESSION; PREDICTION; ANALYTICS; ERROR;
DOI
10.1007/s10115-024-02092-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sports analytics (SA) incorporates machine learning (ML) techniques and models for performance prediction. Researchers have previously evaluated ML models applied to a variety of basketball statistics. This paper benchmarks the forecasting performance of 14 ML models on 18 advanced basketball statistics and key performance indicators (KPIs). The models were applied to a filtered pool of 90 high-performance players. We developed an individual forecasting scenario for each player and experimented with all 14 models. The models were ranked using a bespoke evaluation metric, called weighted average percentage error (WAPE), formulated from the weighted mean absolute percentage error (MAPE) results of each forecasted statistic and model. Moreover, we employed a comprehensive forecasting approach to improve the KPI results. Results showed that tree-based models, namely Extra Trees, Random Forest, and Decision Tree, are the best performers for most of the forecasted performance indicators, with the best performance achieved by Extra Trees with a WAPE of 34.14%. In conclusion, our approach achieved a 3.6% MAPE improvement for the selected KPI on unseen data.
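The abstract describes WAPE only as being formulated from weighted per-statistic MAPE results, without giving the weights or the exact formula. The following is a minimal sketch under that reading, not the authors' implementation: the function names, the statistic labels, and the weights are all hypothetical, and the weighting scheme is assumed to be a plain weighted mean.

```python
from typing import Mapping, Sequence


def mape(actual: Sequence[float], forecast: Sequence[float]) -> float:
    """Mean absolute percentage error, in percent.

    Assumes no actual value is zero (MAPE is undefined there).
    """
    if len(actual) != len(forecast) or not actual:
        raise ValueError("actual and forecast must be non-empty and of equal length")
    total = sum(abs((a - f) / a) for a, f in zip(actual, forecast))
    return 100.0 * total / len(actual)


def weighted_average_percentage_error(
    mape_by_stat: Mapping[str, float],
    weight_by_stat: Mapping[str, float],
) -> float:
    """Weighted mean of per-statistic MAPE values.

    Assumption: the weights are illustrative; the abstract does not state them.
    """
    total_weight = sum(weight_by_stat[stat] for stat in mape_by_stat)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(mape_by_stat[stat] * weight_by_stat[stat] for stat in mape_by_stat)
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical per-statistic errors for one model and one player.
    errors = {"PTS": mape([10.0, 20.0], [9.0, 22.0]), "AST": 30.0}
    weights = {"PTS": 3.0, "AST": 1.0}  # hypothetical weights
    print(weighted_average_percentage_error(errors, weights))
```

Under this reading, ranking the 14 models would amount to computing one such score per model and sorting in ascending order, since a lower percentage error indicates a better forecast.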
Pages: 4333-4375
Page count: 43