Performance prediction in major league baseball by long short-term memory networks

被引:12
作者
Sun, Hsuan-Cheng [1 ]
Lin, Tse-Yu [2 ]
Tsai, Yen-Lung [3 ]
机构
[1] Claremont Grad Univ, Ctr Informat Syst & Technol, Claremont, CA 91711 USA
[2] Natl Taiwan Univ, Data Sci Degree Program, Taipei, Taiwan
[3] Natl Chengchi Univ, Dept Math Sci, Taipei, Taiwan
关键词
Deep learning; Long short-term memory; Player performance prediction; Baseball projection; NEURAL-NETWORKS;
D O I
10.1007/s41060-022-00313-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Player performance prediction is a serious problem in every sport since it brings valuable future information for managers to make important decisions. In baseball industries, there already existed variable prediction systems and many types of researches that attempt to provide accurate predictions and help domain users. However, it is a lack of studies about the predicting method or systems based on deep learning. Deep learning models had proven to be the greatest solutions in different fields nowadays, so we believe they could be tried and applied to the prediction problem in baseball. Hence, the predicting abilities of deep learning models are set to be our research problem in this paper. As a beginning, we select numbers of home runs as the target because it is one of the most critical indexes to understand the power and the talent of baseball hitters. Moreover, we use the sequential model Long Short-Term Memory as our main method to solve the home run prediction problem in Major League Baseball. We compare models' ability with several machine learning models and a widely used baseball projection system, sZymborski Projection System. Our results show that Long Short-Term Memory has better performance than others and has the ability to make more exact predictions. We conclude that Long Short-Term Memory is a feasible way for performance prediction problems in baseball and could bring valuable information to fit users' needs.
引用
收藏
页码:93 / 104
页数:12
相关论文
共 45 条
[1]  
Al-Asadi M., 2018, Xinyang Teachers College, V10, P1, DOI 10.13140/RG.2.2.17433.75367
[2]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[3]  
Baumer B, 2014, SABERMETRIC REVOLUTION: ASSESSING THE GROWTH OF ANALYTICS IN BASEBALL, P1
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   IN-SEASON PREDICTION OF BATTING AVERAGES: A FIELD TEST OF EMPIRICAL BAYES AND BAYES METHODOLOGIES [J].
Brown, Lawrence D. .
ANNALS OF APPLIED STATISTICS, 2008, 2 (01) :113-152
[6]  
Cao LB, 2007, IEEE INTELL SYST, V22, P78, DOI 10.1109/MIS.2007.67
[7]   Domain-Driven Data Mining: Challenges and Prospects [J].
Cao, Longbing .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (06) :755-769
[8]  
Chung J., 2014, Empirical evaluation of gated recurrent neural networks on sequence modeling
[9]  
Cross, 2009, STEAMER PROJECTIONS
[10]   DRIVE THEORY AND HOME RUN MILESTONES IN BASEBALL: AN HISTORICAL ANALYSIS [J].
Goldschmied, Nadav ;
Harris, Michael ;
Vira, Damien ;
Kowalczyk, Jason .
PERCEPTUAL AND MOTOR SKILLS, 2014, 118 (01) :1-11