Analysis of Movement and Activities of Handball Players Using Deep Neural Networks

被引:12
作者
Host, Kristina [1 ,2 ]
Pobar, Miran [1 ,2 ]
Ivasic-Kos, Marina [1 ,2 ]
机构
[1] Univ Rijeka, Fac Informat & Digital Technol, Rijeka 51000, Croatia
[2] Univ Rijeka, Ctr Artificial Intelligence & Cybersecur, Rijeka 51000, Croatia
关键词
sports; object detector; object tracking; action recognition; video analysis; YOLO; Mask R-CNN; DeepSORT; BoT SORT; I3D; RECOGNITION;
D O I
10.3390/jimaging9040080
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
This paper focuses on image and video content analysis of handball scenes and applying deep learning methods for detecting and tracking the players and recognizing their activities. Handball is a team sport of two teams played indoors with the ball with well-defined goals and rules. The game is dynamic, with fourteen players moving quickly throughout the field in different directions, changing positions and roles from defensive to offensive, and performing different techniques and actions. Such dynamic team sports present challenging and demanding scenarios for both the object detector and the tracking algorithms and other computer vision tasks, such as action recognition and localization, with much room for improvement of existing algorithms. The aim of the paper is to explore the computer vision-based solutions for recognizing player actions that can be applied in unconstrained handball scenes with no additional sensors and with modest requirements, allowing a broader adoption of computer vision applications in both professional and amateur settings. This paper presents semi-manual creation of custom handball action dataset based on automatic player detection and tracking, and models for handball action recognition and localization using Inflated 3D Networks (I3D). For the task of player and ball detection, different configurations of You Only Look Once (YOLO) and Mask Region-Based Convolutional Neural Network (Mask R-CNN) models fine-tuned on custom handball datasets are compared to original YOLOv7 model to select the best detector that will be used for tracking-by-detection algorithms. For the player tracking, DeepSORT and Bag of tricks for SORT (BoT SORT) algorithms with Mask R-CNN and YOLO detectors were tested and compared. For the task of action recognition, I3D multi-class model and ensemble of binary I3D models are trained with different input frame lengths and frame selection strategies, and the best solution is proposed for handball action recognition. The obtained action recognition models perform well on the test set with nine handball action classes, with average F1 measures of 0.69 and 0.75 for ensemble and multi-class classifiers, respectively. They can be used to index handball videos to facilitate retrieval automatically. Finally, some open issues, challenges in applying deep learning methods in such a dynamic sports environment, and direction for future development will be discussed.
引用
收藏
页数:18
相关论文
共 39 条
[1]  
Acuna D., 2017, P 31 C NEUR INF PROC
[2]   Soccer Video Summarization using Deep Learning [J].
Agyeman, Rockson ;
Muhammad, Rafiq ;
Choi, Gyu Sang .
2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, :270-273
[3]  
Aharon N, 2022, Arxiv, DOI [arXiv:2206.14651, 10.48550/arXiv.2206.14651, DOI 10.48550/ARXIV.2206.14651]
[4]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[5]   Computer vision and deep learning techniques for pedestrian detection and tracking: A survey [J].
Brunetti, Antonio ;
Buongiorno, Domenico ;
Trotta, Gianpaolo Francesco ;
Bevilacqua, Vitoantonio .
NEUROCOMPUTING, 2018, 300 :17-33
[6]  
Buric Matija, 2018, 2018 5th International Conference on Computational Science and Computational Intelligence (CSCI), P319, DOI 10.1109/CSCI46756.2018.00068
[7]   Player Tracking in Sports Videos [J].
Buric, Matija ;
Ivasic-Kos, Marina ;
Pobar, Miran .
11TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2019), 2019, :334-340
[8]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[9]   Temporal segmentation and recognition of team activities in sports [J].
Direkoglu, Cem ;
O'Connor, Noel E. .
MACHINE VISION AND APPLICATIONS, 2018, 29 (05) :891-913
[10]   Skeleton-based comparison of throwing motion for handball players [J].
Elaoud, Amani ;
Barhoumi, Walid ;
Zagrouba, Ezzeddine ;
Agrebi, Brahim .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (01) :419-431