Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition

被引：0

作者：

Zhu, Shasha ^{[1
]}

Sun, Lu ^{[1
]}

Ma, Zeyuan ^{[1
]}

Li, Chenxi ^{[1
]}

He, Dongzhi ^{[1
]}

机构：

[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China

来源：

NEUROCOMPUTING | 2025年 / 611卷

关键词：

Skeleton-based action recognition; Graph convolutional network; Attention mechanism; Dynamic convolution; Prompt learning;

D O I：

10.1016/j.neucom.2024.128623

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Skeleton-based action recognition is a core task in the field of video understanding. Skeleton sequences are characterized by high information density, low redundancy, and clear structural information, thereby facilitating the analysis of complex relationships among human behaviors more readily than other modalities. Although existing studies have encoded skeleton data and achieved positive outcomes, they have often overlooked the precise high-level semantic information inherent in the action descriptions. To address this issue, this paper proposes a prompt-supervised dynamic attention graph convolutional network (PDA-GCN). Specifically, the PDA-GCN incorporates a prompt supervision (PS) module that leverages a pre-trained large-scale language model (LLM) as a knowledge engine and retains the generated text features as prompts to provide additional supervision during model training, enhancing the model's ability to discern analogous actions with negligible computational cost. In addition, for the purpose of bolstering the learning of discriminative features, a dynamic attention graph convolution (DA-GC) module is presented. This module utilizes self-attention mechanism to adaptively infer intrinsic relationships between joints and integrates dynamic convolution to strengthen the emphasis on local information. This dual focus on both global context and local details further amplifies the efficiency and effectiveness of the model. Extensive experiments, conducted on the widely-used skeleton-based action recognition datasets NTU RGB+D 60 and NTU RGB+D 120, demonstrate that the PDA-GCN surpasses known state-of-the-art methods, achieving accuracies of 93.4% on the NTU RGB+D 60 cross-subject split and 90.7% on the NTU RGB+D 120 cross-subject split.

引用

页数：11

共 50 条

[1] Graph convolutional network with STC attention and adaptive normalization for skeleton-based action recognition
Zhou, Haiyun
Xiang, Xuezhi
Qiu, Yujian
Liu, Xuzhao
IMAGING SCIENCE JOURNAL, 2023, 71 (07) : 636 - 646
[2] Ghost Graph Convolutional Network for Skeleton-based Action Recognition
Jang, Sungjun
Lee, Heansung
Cho, Suhwan
Woo, Sungmin
Lee, Sangyoun
2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,
[3] Two Stream Multi-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
Zhou, Huijian
Tian, Zhiqiang
Du, Shaoyi
ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 112 - 120
[4] Feedback Graph Convolutional Network for Skeleton-Based Action Recognition
Yang, Hao
Yan, Dan
Zhang, Li
Sun, Yunda
Li, Dong
Maybank, Stephen J.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 164 - 175
[5] Attention adjacency matrix based graph convolutional networks for skeleton-based action recognition
Xie, Jun
Miao, Qiguang
Liu, Ruyi
Xin, Wentian
Tang, Lei
Zhong, Sheng
Gao, Xuesong
NEUROCOMPUTING, 2021, 440 (440) : 230 - 239
[6] Selective directed graph convolutional network for skeleton-based action recognition
Ke, Chengyuan
Liu, Sheng
Feng, Yuan
Chen, Shengyong
PATTERN RECOGNITION LETTERS, 2025, 190 : 141 - 146
[7] Feature reconstruction graph convolutional network for skeleton-based action recognition
Huang, Junhao
Wang, Ziming
Peng, Jian
Huang, Feihu
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[8] Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition
Tian, Haoyu
Zhang, Yipeng
Wu, Hanbo
Ma, Xin
Li, Yibin
NEUROCOMPUTING, 2024, 597
[9] Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition
Jang, Sungjun
Lee, Heansung
Kim, Woo Jin
Lee, Jungho
Woo, Sungmin
Lee, Sangyoun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7244 - 7258
[10] Local and global self-attention enhanced graph convolutional network for skeleton-based action recognition
Wu, Zhize
Ding, Yue
Wan, Long
Li, Teng
Nian, Fudong
PATTERN RECOGNITION, 2025, 159

← 1 2 3 4 5 →