Shot-ViT: Cricket Batting Shots Classification with Vision Transformer Network

被引:6
作者
Dey, A. [1 ]
Biswas, S. [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur, Howrah, India
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2024年 / 37卷 / 12期
关键词
CricketBatting Shots; Shots Classification; Vision Transformer Network; Computer Vision; Action Recognition; RECOGNITION; ATTENTION;
D O I
10.5829/ije.2024.37.12c.04
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the realm of computer vision applied to cricket analysis, classifying batting shots poses a formidable challenge, demanding nuanced comprehension and categorization. The classification of cricket shots is crucial as it empowers the players to strategically assess, adapt, and execute their game plans effectively, shaping the outcome of matches. This article introduces the Cricket Batting Shots Image dataset (CBSId), a newbenchmark dataset comprising 2160 meticulously annotatedcricket shot images across sevendistinct categories. The core objective of this research is to develop a robust system capable of effectively classifying cricket batting shots from images. To addressthis, we present a fine-tuned Vision Transformer-based model specifically adapted for cricket shot classification, termed Cricket Batting Shot Vision Transformer (Shot-ViT). Our proposed methodology demonstrates exceptional performance, achieving 92.58% validation accuracy on the CBSId. Shot-ViT notably outperforms established models such as VGG19, ResNet50, I-AlexNet, and ViT_B32 in cricket shot classification accuracy, showcasing the remarkable capabilities of Vision transformers in surpassing existing deep learning architectures for complex visual tasks. Vision transformers have the capacity to capture global context and long-range dependencies within images through self-attention mechanisms, enabling effective feature extraction and representation, which traditional models may struggle to achieve. The accurate classification of cricket batting shots holds profound implications for cricket coaching, player development, and match analysis. It has the potential to revolutionize training methodologies, providing players and coaches with precise insights into batting techniques and strategies and therebycontributingto the overall advancement of the sport.
引用
收藏
页码:2463 / 2472
页数:10
相关论文
共 33 条
[1]   Optimized deep learning-based cricket activity focused network and medium scale benchmark [J].
Ahmad, Waqas ;
Munsif, Muhammad ;
Ullah, Habib ;
Ullah, Mohib ;
Alsuwailem, Alhanouf Abdulrahman ;
Saudagar, Abdul Khader Jilani ;
Muhammad, Khan ;
Sajjad, Muhammad .
ALEXANDRIA ENGINEERING JOURNAL, 2023, 73 :771-779
[2]  
Ahmed MS, 2023, 2023 26 INT C COMP I, DOI [10.1109/ICCIT60459.2023.10441507, DOI 10.1109/ICCIT60459.2023.10441507]
[3]  
Al Islam Md Nafee, 2019, 2019 IEEE International Conference on Signal Processing, Information, Communication & Systems (SPICSCON), P130, DOI 10.1109/SPICSCON48833.2019.9065090
[4]  
Azhar M, 2023, 2023 INT S IM SIGN P, DOI [10.1109/ISPA58351.2023.10279281, DOI 10.1109/ISPA58351.2023.10279281]
[5]  
Devanandan M, 2021, 2021 3 INT C ADV COM, DOI [10.1109/ICAC54203.2021.9671109, DOI 10.1109/ICAC54203.2021.9671109]
[6]   Umpire's Signal Recognition in Cricket Using an Attention based DC-GRU Network [J].
Dey, A. ;
Biswas, S. ;
Abualigah, L. .
INTERNATIONAL JOURNAL OF ENGINEERING, 2024, 37 (04) :662-674
[7]  
Dey A, 2024, MAT CONTINUA, V79, DOI [10.32604/cmc.2024.049512, DOI 10.32604/CMC.2024.049512]
[8]   Recognition of Human Interactions in Still Images using AdaptiveDRNet with Multi-level Attention [J].
Dey, Arnab ;
Biswas, Samit ;
Le, Dac-Nhuong .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) :984-994
[9]  
Fogt JS, 2023, VISION, V7, P57, DOI [10.3390/vision7030057, DOI 10.3390/VISION7030057]
[10]  
Foysal M.F., 2018, Recent Trends in Image Processing and Pattern Recognition, DOI [10.1007/978-981-13-9181-1_10, DOI 10.1007/978-981-13-9181-1_10]