MSA-GCN: Exploiting Multi-Scale Temporal Dynamics With Adaptive Graph Convolution for Skeleton-Based Action Recognition

Cited by: 0
Authors
Alowonou, Kowovi Comivi [1]
Han, Ji-Hyeong [1]
Affiliations
[1] Seoul National University of Science and Technology, Department of Computer Science and Engineering, Seoul 01811, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Topology; Convolution; Adaptation models; Feature extraction; Joints; Correlation; Logic gates; Bones; Transformers; Solid modeling; Skeleton-based action recognition; GCN; dynamic graph topology; multi-scale temporal processing; FUSION
DOI
10.1109/ACCESS.2024.3520172
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Graph convolutional networks (GCNs) have been widely used and have achieved remarkable results in skeleton-based action recognition. We note that existing GCN-based approaches rely on local context information of the skeleton joints to construct adaptive graphs for feature aggregation, which limits their ability to understand actions that involve coordinated movements across various parts of the body. An adaptive graph built upon the global context information of the joints can help move beyond this limitation. Therefore, in this paper, we propose a novel approach to skeleton-based action recognition named Multi-stage Adaptive Graph Convolution Network (MSA-GCN). It consists of two modules: Multi-stage Adaptive Graph Convolution (MSA-GC) and Temporal Multi-Scale Transformer (TMST). These two modules work together to capture complex spatial and temporal patterns within skeleton data effectively. Specifically, MSA-GC explores both local and global context information of the joints across all sequences to construct the adaptive graph, which facilitates the understanding of complex and nuanced relationships between joints. The TMST module, in turn, integrates a Gated Multi-stage Temporal Convolution (GMSTC) with a Temporal Multi-Head Self-Attention (TMHSA) to capture global temporal features and accommodate both long-term and short-term dependencies within action sequences. Extensive experiments on multiple benchmark datasets, including NTU RGB+D 60, NTU RGB+D 120, and Northwestern-UCLA, show that MSA-GCN achieves state-of-the-art performance, confirming its effectiveness in skeleton-based action recognition.
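The abstract gives no equations, so the following is only a generic illustration of the idea of building an adaptive graph from both local and global joint context. It is not the authors' MSA-GC module. The function name `adaptive_graph_conv`, the mean-pooled global context, and the dot-product similarity are all assumptions chosen for brevity.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_graph_conv(x):
    """Hypothetical sketch: x is a list of V joint feature vectors (C floats each).

    Returns (adj, out), where adj is a V x V data-dependent adjacency matrix
    and out holds the aggregated V x C joint features.
    """
    num_joints, channels = len(x), len(x[0])
    # Global context (assumption): mean feature vector over all joints.
    g = [sum(x[i][c] for i in range(num_joints)) / num_joints
         for c in range(channels)]
    # Joint embedding (assumption): local feature plus global context.
    e = [[x[i][c] + g[c] for c in range(channels)] for i in range(num_joints)]
    # Adaptive adjacency: row-wise softmax over dot-product similarity.
    adj = [
        softmax([sum(a * b for a, b in zip(e[i], e[j]))
                 for j in range(num_joints)])
        for i in range(num_joints)
    ]
    # Aggregation: each joint becomes a weighted mix of all joints' features.
    out = [
        [sum(adj[i][j] * x[j][c] for j in range(num_joints))
         for c in range(channels)]
        for i in range(num_joints)
    ]
    return adj, out
```

Calling `adaptive_graph_conv([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])` yields a 3 x 3 adjacency whose rows each sum to 1, so every joint can draw on every other joint rather than only on its skeletal neighbors. A trained model would normally use learned projections in place of the raw features used here.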
Pages: 193552 - 193563
Page count: 12
Related papers
  • [1] Multi-Scale Adaptive Graph Convolution Network for Skeleton-Based Action Recognition
    Hu, Huangshui
    Fang, Yue
    Han, Mei
    Qi, Xingshuo
    IEEE ACCESS, 2024, 12 : 16868 - 16880
  • [2] Adaptive Multi-Scale Difference Graph Convolution Network for Skeleton-Based Action Recognition
    Wang, Xiaojuan
    Gan, Ziliang
    Jin, Lei
    Xiao, Yabo
    He, Mingshu
    ELECTRONICS, 2023, 12 (13)
  • [3] Multi-scale skeleton adaptive weighted GCN for skeleton-based human action recognition in IoT
    Xu, Weiyao
    Wu, Muqing
    Zhu, Jie
    Zhao, Min
    APPLIED SOFT COMPUTING, 2021, 104
  • [4] Skeleton-based action recognition with temporal action graph and temporal adaptive graph convolution structure
    Cao, Yi
    Liu, Chen
    Huang, Zilong
    Sheng, Yongjian
    Ju, Yongjian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29139 - 29162
  • [5] Skeleton-based action recognition with temporal action graph and temporal adaptive graph convolution structure
    Cao, Yi
    Liu, Chen
    Huang, Zilong
    Sheng, Yongjian
    Ju, Yongjian
    Multimedia Tools and Applications, 2021, 80 : 29139 - 29162
  • [6] Multi-Scale Mixed Dense Graph Convolution Network for Skeleton-Based Action Recognition
    Xia, Hailun
    Gao, Xinkai
    IEEE ACCESS, 2021, 9 : 36475 - 36484
  • [7] MSA-GCN: Multiscale Adaptive Graph Convolution Network for gait emotion recognition
    Yin, Yunfei
    Jing, Li
    Huang, Faliang
    Yang, Guangchao
    Wang, Zhuowei
    PATTERN RECOGNITION, 2024, 147
  • [8] MSA-GCN: Multiscale Adaptive Graph Convolution Network for Gait Emotion Recognition
    Yin, Yunfei
    Jing, Li
    Huang, Faliang
    Yang, Guangchao
    Wang, Zhuowei
    arXiv, 2022
  • [9] Combining Adaptive Graph Convolution and Temporal Modeling for Skeleton-Based Action Recognition
    Zhen, Haoyu
    Zhang, De
    Computer Engineering and Applications, 2023, 59 (18) : 137 - 144
  • [10] Multi-scale and attention enhanced graph convolution network for skeleton-based violence action recognition
    Yang, Huaigang
    Ren, Ziliang
    Yuan, Huaqiang
    Wei, Wenhong
    Zhang, Qieshi
    Zhang, Zhaolong
    FRONTIERS IN NEUROROBOTICS, 2022, 16