Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition

Cited by: 0
Authors
Zhu, Shasha [1]
Sun, Lu [1]
Ma, Zeyuan [1]
Li, Chenxi [1]
He, Dongzhi [1]
Affiliations
[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
Keywords
Skeleton-based action recognition; Graph convolutional network; Attention mechanism; Dynamic convolution; Prompt learning;
DOI
10.1016/j.neucom.2024.128623
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Skeleton-based action recognition is a core task in the field of video understanding. Skeleton sequences are characterized by high information density, low redundancy, and clear structural information, which makes complex relationships among human behaviors easier to analyze than in other modalities. Although existing studies have encoded skeleton data and achieved positive outcomes, they have often overlooked the precise high-level semantic information inherent in the action descriptions. To address this issue, this paper proposes a prompt-supervised dynamic attention graph convolutional network (PDA-GCN). Specifically, the PDA-GCN incorporates a prompt supervision (PS) module that leverages a pre-trained large-scale language model (LLM) as a knowledge engine and retains the generated text features as prompts to provide additional supervision during model training, enhancing the model's ability to distinguish similar actions at negligible computational cost. In addition, to strengthen the learning of discriminative features, a dynamic attention graph convolution (DA-GC) module is presented. This module uses a self-attention mechanism to adaptively infer intrinsic relationships between joints and integrates dynamic convolution to strengthen the emphasis on local information. This dual focus on global context and local details further improves the efficiency and effectiveness of the model. Extensive experiments on the widely used skeleton-based action recognition datasets NTU RGB+D 60 and NTU RGB+D 120 demonstrate that the PDA-GCN surpasses known state-of-the-art methods, achieving accuracies of 93.4% on the NTU RGB+D 60 cross-subject split and 90.7% on the NTU RGB+D 120 cross-subject split.
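The abstract's idea of using self-attention to infer relationships between joints can be illustrated with a minimal sketch. This is not the PDA-GCN implementation: the function names (`attention_adjacency`, `graph_conv`), the single attention head, the identity projection weights, and the 3-joint toy input are all assumptions made for illustration, and the dynamic convolution branch, the temporal dimension, and the prompt supervision loss are omitted.

```python
import math

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def attention_adjacency(x, w_q, w_k):
    """Infer a joint-to-joint weight matrix from the features themselves.

    x is (joints, channels); the result is (joints, joints), each row
    summing to 1. Hypothetical helper, not taken from the paper.
    """
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    k_t = [list(col) for col in zip(*k)]      # transpose of k
    scale = math.sqrt(len(q[0]))
    scores = matmul(q, k_t)                   # scores[i][j] = q_i . k_j
    return [softmax([s / scale for s in row]) for row in scores]

def graph_conv(x, adj, w_out):
    """Aggregate joint features with the inferred weights, then project."""
    return matmul(matmul(adj, x), w_out)

# Toy input: 3 joints with 2 feature channels each (illustrative values).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]           # assumed projection weights

adj = attention_adjacency(x, identity, identity)
out = graph_conv(x, adj, identity)
```

Here the joint weights are computed from the input rather than fixed by the skeleton's bone connections, which is the sense in which such a graph is adaptive; a fixed-topology graph convolution would instead pass a predefined adjacency matrix as `adj`.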
Pages: 11
Related papers
50 in total
  • [31] A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition
    Miao, Shuangyan
    Hou, Yonghong
    Gao, Zhimin
    Xu, Mingliang
    Li, Wanqing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4893 - 4899
  • [32] Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition
    Wang, Shengqin
    Zhang, Yongji
    Qi, Hong
    Zhao, Minghao
    Jiang, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2147 - 2152
  • [33] Skeleton-Based Action Recognition with Improved Graph Convolution Network
    Yang, Xuqi
    Zhang, Jia
    Qin, Rong
    Su, Yunyu
    Qiu, Shuting
    Yu, Jintian
    Ge, Yongxin
    BIOMETRIC RECOGNITION (CCBR 2021), 2021, 12878 : 31 - 38
  • [34] Multiple Input Branches Shift Graph Convolutional Network with DropEdge for Skeleton-Based Action Recognition
    Liu, Yan
    Deng, Yuelin
    Su, Jinping
    Wang, Ruonan
    Li, Chi
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 584 - 596
  • [35] Skeleton-based action recognition based on multidimensional adaptive convolutional network
    Xia, Yu
    Gao, Qingyuan
    Wu, Weiguan
    Cao, Yi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [36] Cross-Scale Spatial Refinement Graph Convolutional Network for Skeleton-Based Action Recognition
    Chengyuan Ke
    Sheng Liu
    Zhenghao Ke
    Yuan Feng
    Shengyong Chen
    International Journal of Computational Intelligence Systems, 18 (1)
  • [37] Sequence Segmentation Attention Network for Skeleton-Based Action Recognition
    Zhang, Yujie
    Cai, Haibin
    ELECTRONICS, 2023, 12 (07)
  • [38] Part-Wise Adaptive Topology Graph Convolutional Network for Skeleton-Based Action Recognition
    Wang, Jiale
    Zou, Lian
    Fan, Cien
    Chi, Ruan
    ELECTRONICS, 2023, 12 (09)
  • [39] Lightweight Multiscale Spatio-Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Yuan, Qilong
    Zhang, Huaizhu
    Wang, Yizhou
    Wang, Junfeng
    BIG DATA MINING AND ANALYTICS, 2025, 8 (02) : 310 - 325
  • [40] PROGRESSIVE SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION
    Heidari, Negar
    Iosifidis, Alexandros
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3220 - 3224