Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition

被引:0
作者
Lubin Yu
Lianfang Tian
Qiliang Du
Jameel Ahmed Bhutto
机构
[1] South China University of Technology,School of Automation Science and Engineering
[2] The Fifth Electronics Research Institute of Ministry of Industry and Information Technology,School of Computer
[3] Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai),undefined
[4] Sino-Singapore International Joint Research Institute,undefined
[5] Key Laboratory of Autonomous Systems and Network Control of Ministry of Education,undefined
[6] Huanggang Normal University,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Graph convolution; Convolutional Neural Network; Adaptive; Attention module; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Action recognition methods based on spatial-temporal skeleton graphs have been applied extensively. The spatial and temporal graphs are generally modeled individually in previous approaches. Recently, many researchers capture the correlation information of temporal and spatial dimensions in spatial-temporal graphs. However, the existing methods have several issues such as 1. The existing modal graphs are defined based on the human body structure which is not flexible enough; 2. The approach to extracting non-local neighborhood features is insufficiently powerful; 3. Attention modules are limited to a single scale; 4. The fusion of multiple data streams is not sufficiently effective. This work proposes a novel multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition that improves the aforementioned issues. The method utilizes an adaptive topology graph with an adaptive connection coefficient to adaptively optimize the topology of the graph during the training process according to the input data. An optimal high-order adjacency matrix is constructed in our work to balance the weight bias, which captures non-local neighborhood features precisely. Moreover, we design a multi-scale attention mechanism to aggregate information from multiple ranges, which makes the graph convolution focus on more efficient nodes, frames, and channels. To further improve the performance of the model, a novel multi-stream framework is proposed to aggregate the high-order information of the skeleton. The experiment results on the NTU-RGBD and Kinetics-Skeleton prove that our proposed method reveals better results than existing methods.
引用
收藏
页码:14838 / 14854
页数:16
相关论文
共 50 条
  • [21] Multi-Part Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition
    Wang, Wei
    Xie, Wei
    Tu, Zhigang
    Li, Wanxin
    Jin, Lianghao
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [22] Mixed graph convolution and residual transformation network for skeleton-based action recognition
    Shuhua Liu
    Xiaoying Bai
    Ming Fang
    Lanting Li
    Chih-Cheng Hung
    Applied Intelligence, 2022, 52 : 1544 - 1555
  • [23] Mixed graph convolution and residual transformation network for skeleton-based action recognition
    Liu, Shuhua
    Bai, Xiaoying
    Fang, Ming
    Li, Lanting
    Hung, Chih-Cheng
    APPLIED INTELLIGENCE, 2022, 52 (02) : 1544 - 1555
  • [24] Spatial adaptive graph convolutional network for skeleton-based action recognition
    Zhu, Qilin
    Deng, Hongmin
    APPLIED INTELLIGENCE, 2023, 53 (14) : 17796 - 17808
  • [25] Scale Adaptive Graph Convolutional Network for Skeleton-Based Action Recognition
    Wang X.
    Zhong Y.
    Jin L.
    Xiao Y.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2022, 55 (03): : 306 - 312
  • [26] Spatial adaptive graph convolutional network for skeleton-based action recognition
    Qilin Zhu
    Hongmin Deng
    Applied Intelligence, 2023, 53 : 17796 - 17808
  • [27] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Shu, Yang
    Li, Wanggen
    Li, Doudou
    Gao, Kun
    Jie, Biao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 16 - 28
  • [28] An Investigation of Skeleton-Based Optical Flow-Guided Features for 3D Action Recognition Using a Multi-Stream CNN Model
    Ren, J.
    Reyes, N. H.
    Barczak, A. L. C.
    Scogings, C.
    Liu, M.
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC), 2018, : 199 - 203
  • [29] Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition
    Liu, Di
    Xu, Hui
    Wang, Jianzhong
    Lu, Yinghua
    Kong, Jun
    Qi, Miao
    SENSORS, 2021, 21 (20)
  • [30] Multi-Dimensional Dynamic Topology Learning Graph Convolution for Skeleton-Based Action Recognition
    Luo H.-L.
    Cao L.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (03): : 991 - 1001