Action recognition for sports video analysis using part-attention spatio-temporal graph convolutional network

Cited by: 6
Authors
Liu, Jiatong [1]
Che, Yanli [2]
Affiliations
[1] Zhejiang Wanli Univ, Phys Educ Dept, Ningbo, Zhejiang, Peoples R China
[2] Ludong Univ, Phys Educ Coll, Yantai, Shandong, Peoples R China
Keywords
sports video analysis; action recognition; part-attention; spatio-temporal; graph convolutional network;
DOI
10.1117/1.JEI.30.3.033017
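Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809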
Abstract
Action recognition contributes significantly to sports video analysis, especially to the evaluation of athletes' training. In sports video, action information is conveyed mainly by the temporal movement of human body parts, and each part has its own importance to the action representation. To build this into action recognition, we propose a part-attention spatio-temporal graph convolutional network (PSGCN) that exploits the dynamic spatio-temporal information in a sports video and learns the importance of different parts to emphasize their contribution to the recognition task. Specifically, PSGCN first divides the human body into six parts and extracts their convolutional neural network (CNN) features, concatenating them with the global feature of the whole frame; it then uses a cross-part and cross-frame graph building module to formulate the graph correlation of the parts from different frames. Motivated by the observation that a larger temporal variation of the same part carries more action information, we further propose a part-attention (PA) learning module that estimates the importance of each part and strengthens the graph correlation to yield a PA graph. Finally, PSGCN applies a graph convolutional network to the learned PA spatio-temporal graph with the learned part CNN features to obtain the action representation for the given sports video. The whole network is optimized with two losses, one for PA and one for action classification. To demonstrate the superiority of PSGCN, we carry out extensive experiments comparing our model with several state-of-the-art methods on widely used action recognition datasets, especially sports action datasets. The results reflect the advantages of the proposed PSGCN for sports video analysis. © 2021 SPIE and IS&T
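The pipeline outlined in the abstract can be illustrated with a small, dependency-free Python sketch. It is not the authors' implementation: the abstract gives no formulas, so everything below is an assumption made for illustration. That covers the function names (`part_attention`, `build_pa_graph`, `gcn_layer`), the use of mean frame-to-frame feature distance plus a softmax as the attention score, the edge-scaling and row-normalisation scheme, and the final mean pooling. Part features are taken as given (CNN extraction and the global frame feature are not modelled), and the two training losses are omitted.

```python
import math

def part_attention(feats):
    """feats[t][p] is the feature vector of part p in frame t.
    Assumed proxy for the PA module: parts whose features change more
    between consecutive frames receive a larger softmax weight."""
    T, P = len(feats), len(feats[0])
    variation = []
    for p in range(P):
        total = sum(math.dist(feats[t][p], feats[t - 1][p]) for t in range(1, T))
        variation.append(total / max(T - 1, 1))
    top = max(variation)
    exps = [math.exp(v - top) for v in variation]  # shifted for numerical stability
    norm = sum(exps)
    return [e / norm for e in exps]

def build_pa_graph(T, P, attn):
    """Adjacency over T*P nodes (node index = t*P + p).
    Cross-part edges link all parts within a frame (self-loops included);
    cross-frame edges link the same part in adjacent frames.
    Each edge is scaled by the attention of its target part, then every
    row is normalised to sum to 1 (an assumed normalisation)."""
    N = T * P
    A = [[0.0] * N for _ in range(N)]
    for t in range(T):
        for p in range(P):
            i = t * P + p
            for q in range(P):
                A[i][t * P + q] = attn[q]
            for dt in (-1, 1):
                if 0 <= t + dt < T:
                    A[i][(t + dt) * P + p] = attn[p]
    for row in A:
        s = sum(row)
        for j in range(N):
            row[j] /= s
    return A

def gcn_layer(A, X, W):
    """One graph convolution step: ReLU(A @ X @ W)."""
    N, D, O = len(X), len(W), len(W[0])
    AX = [[sum(A[i][k] * X[k][d] for k in range(N)) for d in range(D)] for i in range(N)]
    return [[max(0.0, sum(AX[i][d] * W[d][o] for d in range(D))) for o in range(O)]
            for i in range(N)]

# Toy input: 3 frames, 6 body parts, 2-dimensional features.
# Part 0 moves over time; the other five parts stay still.
T, P, D = 3, 6, 2
feats = [[[float(t), 0.0] if p == 0 else [1.0, 1.0] for p in range(P)] for t in range(T)]

attn = part_attention(feats)
A = build_pa_graph(T, P, attn)
X = [feats[t][p] for t in range(T) for p in range(P)]  # same node order as the graph
W = [[1.0, 0.0], [0.0, 1.0]]  # identity weights standing in for learned parameters
H = gcn_layer(A, X, W)
# Mean pooling over nodes gives one vector per video (an assumed readout).
video_repr = [sum(h[d] for h in H) / len(H) for d in range(len(W[0]))]
```

In this toy input only part 0 changes between frames, so it receives the largest attention weight and its edges dominate the row-normalised graph. A trained model would learn both the attention and the weights `W` rather than fixing them as done here.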
Pages: 16