An improved spatial temporal graph convolutional network for robust skeleton-based action recognition

被引:15
|
作者
Xing, Yuling [1 ]
Zhu, Jia [2 ]
Li, Yu [1 ]
Huang, Jin [1 ]
Song, Jinlong [1 ]
机构
[1] South China Normal Univ, 55 Zhongshan Ave West, Guangzhou, Peoples R China
[2] Zhejiang Normal Univ, Key Lab Intelligent Educ Technol & Applicat Zheji, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Adaptive graph; Multi-scale; Occlusion and noise;
D O I
10.1007/s10489-022-03589-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based action recognition methods using complete human skeletons have achieved remarkable performance, but the performance of these methods could significantly deteriorate when critical joints or frames of the skeleton sequence are occluded or disrupted. However, the acquisition of incomplete and noisy human skeletons is inevitable in realistic environments. In order to strengthen the robustness of action recognition model, we propose an Improved Spatial Temporal Graph Convolutional Network (IST-GCN) model, including three modules, namely Multi-dimension Adaptive Graph Convolutional Network (Md-AGCN), Enhanced Attention Mechanism (EAM) and Multi-Scale Temporal Convolutional Network (MS-TCN). Specifically, the Md-AGCN module can first adaptively adjust the graph structure according to different layers and the spatial dimension, temporal dimension, and channel dimension of different action samples to establish corresponding connections for long-range joints with dependencies. Then, the EAM module can focus on important information based on spatial domain, temporal domain and channel to further strengthen the dependencies between important joints. Finally, the MS-TCN module is used to enlarge the receptive field to extract more latent temporal dependencies. The comprehensive experiments on NTU-RGB+D and NTU-RGB+D 120 datasets demonstrate that our approach possesses outstanding performance in terms of both accuracy and robustness when skeleton samples are incomplete and noisy compared with the state-of-the-art (SOTA) approach. Moreover, the parameters and computational complexity of our model are far less than those of the existing approaches.
引用
收藏
页码:4592 / 4608
页数:17
相关论文
共 50 条
  • [41] Attention-Based Generative Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Yang, Kai
    Ding, Xiaolu
    Chen, Wai
    ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 1 - 6
  • [42] Improved Shift Graph Convolutional Network for Action Recognition With Skeleton
    Li, Chuankun
    Li, Shuai
    Gao, Yanbo
    Guo, Lina
    Li, Wanqing
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 438 - 442
  • [43] Graph Edge Convolutional Neural Networks for Skeleton-Based Action Recognition
    Zhang, Xikun
    Xu, Chang
    Tian, Xinmei
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 3047 - 3060
  • [44] A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition
    Miao, Shuangyan
    Hou, Yonghong
    Gao, Zhimin
    Xu, Mingliang
    Li, Wanqing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4893 - 4899
  • [45] Information Enhanced Graph Convolutional Networks for Skeleton-based Action Recognition
    Sun, Dengdi
    Zeng, Fanchen
    Luo, Bin
    Tang, Jin
    Ding, Zhuanlian
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [46] Mini-TKAGCN: a lightweight Graph Convolutional Network via Temporal Kernel Attention for Skeleton-based Action Recognition
    Liu, Yanan
    Dong, Shiqi
    Zhang, Hao
    Xu, Dan
    Li, Haipeng
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [47] Skeleton-Based Action Recognition Using Multi-Scale and Multi-Stream Improved Graph Convolutional Network
    Li, Wang
    Liu, Xu
    Liu, Zheng
    Du, Feixiang
    Zou, Qiang
    IEEE ACCESS, 2020, 8 (08): : 144529 - 144542
  • [48] Two-Steam Fully Connected Graph Convolutional Network for Skeleton-Based Action Recognition
    Bai, Zhongyu
    Ding, Qichuan
    Tan, Jiawei
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 1056 - 1061
  • [49] Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Wang, Yizhou
    Zhang, Xingjin
    Wang, Junfeng
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [50] Lightweight channel-topology based adaptive graph convolutional network for skeleton-based action recognition
    Wang K.
    Deng H.
    Zhu Q.
    Neurocomputing, 2023, 560