Action recognition for sports video analysis using part-attention spatio-temporal graph convolutional network

被引：6

作者：

Liu, Jiatong ^{[1
]}

Che, Yanli ^{[2
]}

机构：

[1] Zhejiang Wanli Univ, Phys Educ Dept, Ningbo, Zhejiang, Peoples R China

[2] Ludong Univ, Phys Educ Coll, Yantai, Shandong, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2021年 / 30卷 / 03期

关键词：

sports video analysis; action recognition; part-attention; spatio-temporal; graph convolutional network;

D O I：

10.1117/1.JEI.30.3.033017

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Action recognition makes significant contributions to sports video analysis, especially for athletes' training evaluations. For sports video analysis, the action information is mainly conveyed by human body parts' temporal movement, and each of the parts has a unique importance to the action representation. Aiming to involve this point in action recognition, we propose a part-attention spatio-temporal graph convolutional network (PSGCN) to exploit the dynamic spatio-temporal information in a sports video; it learns the importance of different parts to emphasize the contribution on the task of action recognition. Specifically, PSGCN first divides the human body into six parts and extracts their convolutional neural network (CNN) features, as well as concatenating the global feature of the whole frame; it then utilizes a crosspart and cross-frame graph building module to formulate the graph correlation of the parts from different frames. Inspired by the larger temporal variation of the same part containing more action information, we further propose a part-attention (PA) learning module to estimate the importance of each part, which can strengthen the graph correlation to support a PA graph. Finally, PSGCN conducts a graph convolutional network on the learned PA spatio-temporal graph with the learned part CNN features, which can obtain the action representation for the given sports video. In addition, the whole network is optimized by two losses of PA and action classification. To perform the superiority of PSGCN, we carry out extensive experiments of our model compared with several state-of-the-art methods on widely used action recognition datasets, especially for sports action. The results reflect the advantages of the proposed PSGCN on sports video analysis. (C) 2021 SPIE and IS&T

引用

页数：16

共 50 条

[1] Interpretable Spatio-temporal Attention for Video Action Recognition
Meng, Lili
Zhao, Bo
Chang, Bo
Huang, Gao
Sun, Wei
Tung, Frederich
Sigal, Leonid
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1513 - 1522
[2] Spatio-Temporal Dynamic Attention Graph Convolutional Network Based on Skeleton Gesture Recognition
Han, Xiaowei
Cui, Ying
Chen, Xingyu
Lu, Yunjing
Hu, Wen
ELECTRONICS, 2024, 13 (18)
[3] Skeleton Action Recognition Based on Spatio-temporal Feature Enhanced Graph Convolutional Network
Cao, Yi
Wu, Weiguan
Li, Ping
Xia, Yu
Gao, Qingyuan
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (08) : 3022 - 3031
[4] STAN: Spatio-Temporal Analysis Network for efficient video action recognition
Chen, Shilin
Wang, Xingwang
Sun, Yafeng
Yan, Kun
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
[5] Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatio-Temporal Graph Convolutional Network for Action Recognition
Papadopoulos, Konstantinos
Ghorbel, Enjie
Aouada, Djamila
Ottersten, Bjoern
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 452 - 458
[6] A Spatio-Temporal Convolutional Neural Network for Skeletal Action Recognition
Hu, Lizhang
Xu, Jinhua
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 377 - 385
[7] Lightweight Multiscale Spatio-Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Zheng, Zhiyun
Yuan, Qilong
Zhang, Huaizhu
Wang, Yizhou
Wang, Junfeng
BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 310 - 325
[8] PROGRESSIVE SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION
Heidari, Negar
Iosifidis, Alexandros
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3220 - 3224
[9] STCA: an action recognition network with spatio-temporal convolution and attention
Tian, Qiuhong
Miao, Weilun
Zhang, Lizao
Yang, Ziyu
Yu, Yang
Zhao, Yanying
Yao, Lan
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (01)
[10] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
Heidari, Negar
Iosifidis, Alexandros
Proceedings of the International Joint Conference on Neural Networks, 2021, 2021-July

← 1 2 3 4 5 →