Skeleton-Based Action Recognition Using Multibranch Adaptive Graph Convolutional Network With Pose Refinement

Cited by: 0

Authors:

Chen, Luefeng ^{[1,2,3]}

Li, Jiazhuo ^{[1,2,3]}

Li, Min ^{[1,2,3]}

Wu, Min ^{[1,2,3]}

Pedrycz, Witold ^{[4,5,6]}

Hirota, Kaoru ^{[7]}

Affiliations:

[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China

[2] Hubei Key Lab Adv Control & Intelligent Automat Co, Wuhan 430074, Peoples R China

[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China

[4] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2R3, Canada

[5] Macau Univ Sci & Technol, Inst Syst Engn, Taipa 999078, Macau, Peoples R China

[6] Istinye Univ, Res Ctr Performance & Prod Anal, TR-34010 Istanbul, Turkiye

[7] Tokyo Inst Technol, Yokohama 2268502, Japan

Source:

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS | 2025

Funding:

National Natural Science Foundation of China;

Keywords:

Skeleton; Feature extraction; Adaptation models; Vectors; Convolution; Human activity recognition; Graph convolutional networks; Attention mechanisms; Adaptive systems; Accuracy; Action recognition; adaptive; pose refinement; skeleton based;

DOI:

10.1109/TCSS.2025.3566733

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

A multibranch adaptive graph convolutional network is proposed for human action recognition by combining graph convolutional networks (GCNs), adaptive learning, and multibranch feature extraction. Through the adaptive graph convolution module, the method adapts its parameters during training, which makes the model more flexible. Furthermore, integrating shallow-level features (skeleton joints) with deep-level features, including skeleton information, motion information, and motion difference information, allows the model to capture both the spatial and temporal dynamics of human actions, leading to a more comprehensive representation of human action features. The spatio-temporal attention mechanism enables the model to focus on key frames and skeleton joints. The pose refinement module makes the input data to the network more reasonable and reduces the interference of noise. The adaptive mechanism frees the network from the inherent physical connections of the skeleton, which further enhances its flexibility. The addition of second-order features allows the skeletal data to be fully exploited. The attention mechanism enhances the discriminative ability of the model and improves its ability to recognize subtle variations and important cues in human actions. In experiments on the benchmark datasets NTU-RGB-D and Kinetics-400, the method significantly improves action recognition performance over existing approaches. On the Kinetics-400 dataset, it achieves recognition rates of 36.5% and 59.6% under the Top-1 and Top-5 evaluation metrics, respectively, an improvement of about 1% over the state-of-the-art method. On the NTU-RGB-D dataset, it achieves recognition rates of 95.8% and 89.4% under the X-view and X-subject modes, respectively. These results validate the effectiveness of the multibranch adaptive graph convolutional network for human action recognition tasks.
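
For readers unfamiliar with the terms in the abstract, the following is a minimal illustrative sketch, not the authors' implementation. It assumes the four input branches are joint positions, second-order bone vectors, frame-to-frame motion, and the difference between consecutive motions, and it assumes the adaptive graph convolution adds a learnable offset matrix `B` to the normalized physical adjacency, which is one common formulation; the abstract does not specify either detail. All function names and array layouts here are hypothetical.

```python
import numpy as np


def multibranch_inputs(joints, bones):
    """Build four per-joint feature streams from raw skeleton data.

    joints: array of shape (T, V, C) = (frames, joints, coordinates).
    bones:  list of (child, parent) joint-index pairs (assumed layout).
    """
    joints = np.asarray(joints, dtype=float)

    # Second-order (bone) feature: vector from the parent joint to the child.
    bone = np.zeros_like(joints)
    for child, parent in bones:
        bone[:, child] = joints[:, child] - joints[:, parent]

    # Motion: displacement between consecutive frames (first frame is zero).
    motion = np.zeros_like(joints)
    motion[1:] = joints[1:] - joints[:-1]

    # Motion difference: change in motion between consecutive frames
    # (first two frames are zero because no earlier motion exists).
    motion_diff = np.zeros_like(joints)
    motion_diff[2:] = motion[2:] - motion[1:-1]

    return {"joint": joints, "bone": bone,
            "motion": motion, "motion_diff": motion_diff}


def normalize_adjacency(A):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of a skeleton graph."""
    A_hat = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def adaptive_graph_conv(x, A_norm, B, W):
    """One spatial graph convolution with an adaptive adjacency (assumed form).

    x:      (T, V, C_in) input features.
    A_norm: (V, V) fixed, normalized physical adjacency.
    B:      (V, V) offset that would be learned during training, so the
            effective graph is not limited to physical connections.
    W:      (C_in, C_out) channel-mixing weights.
    """
    return np.einsum("uv,tvc,cd->tud", A_norm + B, x, W)
```

In a trained model, `B` and `W` would be updated by gradient descent; here they are plain arrays so that the data flow can be inspected directly.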

Pages: 11

References: 41 in total

[1] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].