Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition

被引：0

作者：

Lubin Yu

Lianfang Tian

Qiliang Du

Jameel Ahmed Bhutto

机构：

[1] South China University of Technology,School of Automation Science and Engineering

[2] The Fifth Electronics Research Institute of Ministry of Industry and Information Technology,School of Computer

[3] Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai),undefined

[4] Sino-Singapore International Joint Research Institute,undefined

[5] Key Laboratory of Autonomous Systems and Network Control of Ministry of Education,undefined

[6] Huanggang Normal University,undefined

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Graph convolution; Convolutional Neural Network; Adaptive; Attention module; Action recognition;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Action recognition methods based on spatial-temporal skeleton graphs have been applied extensively. The spatial and temporal graphs are generally modeled individually in previous approaches. Recently, many researchers capture the correlation information of temporal and spatial dimensions in spatial-temporal graphs. However, the existing methods have several issues such as 1. The existing modal graphs are defined based on the human body structure which is not flexible enough; 2. The approach to extracting non-local neighborhood features is insufficiently powerful; 3. Attention modules are limited to a single scale; 4. The fusion of multiple data streams is not sufficiently effective. This work proposes a novel multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition that improves the aforementioned issues. The method utilizes an adaptive topology graph with an adaptive connection coefficient to adaptively optimize the topology of the graph during the training process according to the input data. An optimal high-order adjacency matrix is constructed in our work to balance the weight bias, which captures non-local neighborhood features precisely. Moreover, we design a multi-scale attention mechanism to aggregate information from multiple ranges, which makes the graph convolution focus on more efficient nodes, frames, and channels. To further improve the performance of the model, a novel multi-stream framework is proposed to aggregate the high-order information of the skeleton. The experiment results on the NTU-RGBD and Kinetics-Skeleton prove that our proposed method reveals better results than existing methods.

引用

页码：14838 / 14854

页数：16

共 50 条

[21] Combining Adaptive Graph Convolution and Temporal Modeling for Skeleton-Based Action Recognition [J].

Zhen, Haoyu ;

Zhang, De .

Computer Engineering and Applications, 2023, 59 (18) :137-144

[22] Adaptive spatiotemporal graph convolutional network with intermediate aggregation of multi-stream skeleton features for action recognition [J].

Zhao, Yukai ;

Wang, Jingwei ;

Wang, Han ;

Liu, Min ;

Ma, Yunlong .

NEUROCOMPUTING, 2022, 505 :116-124

[23] Multi-Part Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition [J].

Wang, Wei ;

Xie, Wei ;

Tu, Zhigang ;

Li, Wanxin ;

Jin, Lianghao .

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,

[24] Mixed graph convolution and residual transformation network for skeleton-based action recognition [J].

Shuhua Liu ;

Xiaoying Bai ;

Ming Fang ;

Lanting Li ;

Chih-Cheng Hung .

Applied Intelligence, 2022, 52 :1544-1555

[25] Mixed graph convolution and residual transformation network for skeleton-based action recognition [J].

Liu, Shuhua ;

Bai, Xiaoying ;

Fang, Ming ;

Li, Lanting ;

Hung, Chih-Cheng .

APPLIED INTELLIGENCE, 2022, 52 (02) :1544-1555

[26] Spatial adaptive graph convolutional network for skeleton-based action recognition [J].

Zhu, Qilin ;

Deng, Hongmin .

APPLIED INTELLIGENCE, 2023, 53 (14) :17796-17808

[27] Scale Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition [J].

Wang X. ;

Zhong Y. ;

Jin L. ;

Xiao Y. .

Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2022, 55 (03) :306-312

[28] Spatial adaptive graph convolutional network for skeleton-based action recognition [J].

Qilin Zhu ;

Hongmin Deng .

Applied Intelligence, 2023, 53 :17796-17808

[29] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition [J].

Shu, Yang ;

Li, Wanggen ;

Li, Doudou ;

Gao, Kun ;

Jie, Biao .

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 :16-28

[30] An Investigation of Skeleton-Based Optical Flow-Guided Features for 3D Action Recognition Using a Multi-Stream CNN Model [J].

Ren, J. ;

Reyes, N. H. ;

Barczak, A. L. C. ;

Scogings, C. ;

Liu, M. .

2018 IEEE 3RD INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC), 2018, :199-203

← 1 2 3 4 5 →