Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition

被引:0
作者
Lubin Yu
Lianfang Tian
Qiliang Du
Jameel Ahmed Bhutto
机构
[1] South China University of Technology,School of Automation Science and Engineering
[2] The Fifth Electronics Research Institute of Ministry of Industry and Information Technology,School of Computer
[3] Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai),undefined
[4] Sino-Singapore International Joint Research Institute,undefined
[5] Key Laboratory of Autonomous Systems and Network Control of Ministry of Education,undefined
[6] Huanggang Normal University,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Graph convolution; Convolutional Neural Network; Adaptive; Attention module; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition methods based on spatial-temporal skeleton graphs have been applied extensively. The spatial and temporal graphs are generally modeled individually in previous approaches. Recently, many researchers capture the correlation information of temporal and spatial dimensions in spatial-temporal graphs. However, the existing methods have several issues such as 1. The existing modal graphs are defined based on the human body structure which is not flexible enough; 2. The approach to extracting non-local neighborhood features is insufficiently powerful; 3. Attention modules are limited to a single scale; 4. The fusion of multiple data streams is not sufficiently effective. This work proposes a novel multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition that improves the aforementioned issues. The method utilizes an adaptive topology graph with an adaptive connection coefficient to adaptively optimize the topology of the graph during the training process according to the input data. An optimal high-order adjacency matrix is constructed in our work to balance the weight bias, which captures non-local neighborhood features precisely. Moreover, we design a multi-scale attention mechanism to aggregate information from multiple ranges, which makes the graph convolution focus on more efficient nodes, frames, and channels. To further improve the performance of the model, a novel multi-stream framework is proposed to aggregate the high-order information of the skeleton. The experiment results on the NTU-RGBD and Kinetics-Skeleton prove that our proposed method reveals better results than existing methods.
引用
收藏
页码:14838 / 14854
页数:16
相关论文
共 50 条
[41]   Skeleton-based action recognition for manufacturing assembly task through graph convolution network [J].
Soleymani, Maryam ;
Bonyani, Mahdi ;
Wang, Chao .
JOURNAL OF MANUFACTURING SYSTEMS, 2025, 82 :362-375
[42]   Semantics-Assisted Training Graph Convolution Network for Skeleton-Based Action Recognition [J].
Hu, Huangshui ;
Cao, Yu ;
Fang, Yue ;
Meng, Zhiqiang .
SENSORS, 2025, 25 (06)
[43]   Dual-Excitation SpatialTemporal Graph Convolution Network for Skeleton-Based Action Recognition [J].
Lu, Jian ;
Huang, Tingting ;
Zhao, Bo ;
Chen, Xiaogai ;
Zhou, Jian ;
Zhang, Kaibing .
IEEE SENSORS JOURNAL, 2024, 24 (06) :8184-8196
[44]   Auxiliary Task Graph Convolution Network: A Skeleton-Based Action Recognition for Practical Use [J].
Cho, Junsu ;
Kim, Seungwon ;
Oh, Chi-Min ;
Park, Jeong-Min .
APPLIED SCIENCES-BASEL, 2025, 15 (01)
[45]   Interactive two-stream graph neural network for skeleton-based action recognition [J].
Yang, Dun ;
Zhou, Qing ;
Wen, Ju .
JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
[46]   Lighter and faster: A multi-scale adaptive graph convolutional network for skeleton-based action recognition [J].
Jiang, Yuanjian ;
Deng, Hongmin .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
[47]   Multi-Modality Adaptive Feature Fusion Graph Convolutional Network for Skeleton-Based Action Recognition [J].
Zhang, Haiping ;
Zhang, Xinhao ;
Yu, Dongjin ;
Guan, Liming ;
Wang, Dongjing ;
Zhou, Fuxing ;
Zhang, Wanjun .
SENSORS, 2023, 23 (12)
[48]   Skeleton-Based Action Recognition Using Multibranch Adaptive Graph Convolutional Network With Pose Refinement [J].
Chen, Luefeng ;
Li, Jiazhuo ;
Li, Min ;
Wu, Min ;
Pedrycz, Witold ;
Hirota, Kaoru .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
[49]   Spatiotemporal decoupling attention transformer for 3D skeleton-based driver action recognition [J].
Xu, Zhuoyan ;
Xu, Jingke .
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
[50]   Hierarchical graph attention network with pseudo-metapath for skeleton-based action recognition [J].
Wang, Mingdao ;
Li, XueMing ;
Zhang, Xianlin ;
Zhang, Yue .
NEUROCOMPUTING, 2022, 501 :822-833