Self-Adaptive Graph With Nonlocal Attention Network for Skeleton-Based Action Recognition

被引:9
作者
Pang, Chen [1 ,2 ]
Gao, Xingyu [3 ]
Chen, Zhenyu [4 ,5 ]
Lyu, Lei [1 ,2 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan 250358, Peoples R China
[3] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
[4] State Grid Corp China, Big Data Ctr, Beijing 100031, Peoples R China
[5] China Elect Power Res Inst, Beijing 100192, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; global attention; graph convolutional network (GCN); self-adaptive graph;
D O I
10.1109/TNNLS.2023.3298950
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional networks (GCNs) have achieved encouraging progress in modeling human body skeletons as spatial-temporal graphs. However, existing methods still suffer from two inherent drawbacks. Firstly, these models process the input data based on the physical structure of the human body, which leads to some latent correlations among joints being ignored. Furthermore, the key temporal relationships between nonadjacent frames are overlooked, preventing to fully learn the changes of the body joints along the temporal dimension. To address these issues, we propose an innovative spatial-temporal model by introducing a self-adaptive GCN (SAGCN) with global attention network, collectively termed SAG-GAN. Specifically, the SAGCN module is proposed to construct two additional dynamic topological graphs to learn the common characteristics of all data and represent a unique pattern for each sample, respectively. Meanwhile, the global attention module (spatial attention (SA) and temporal attention (TA) modules) is designed to extract the global connections between different joints in a single frame and model temporal relationships between adjacent and nonadjacent frames in temporal sequences. In this manner, our network can capture richer features of actions for accurate action recognition and overcome the defect of the standard graph convolution. Extensive experiments on three benchmark datasets (NTU-60, NTU-120, and Kinetics) have demonstrated the superiority of our proposed method.
引用
收藏
页码:17057 / 17069
页数:13
相关论文
共 50 条
  • [21] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
    Lubin Yu
    Lianfang Tian
    Qiliang Du
    Jameel Ahmed Bhutto
    Applied Intelligence, 2023, 53 : 14838 - 14854
  • [22] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Shu, Yang
    Li, Wanggen
    Li, Doudou
    Gao, Kun
    Jie, Biao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 16 - 28
  • [23] EARLY FUSION GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
    Zhao, Xiaoxue
    Liu, Cuiwei
    Shi, Xiangbin
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [24] Relation Selective Graph Convolutional Network for Skeleton-Based Action Recognition
    Yang, Wenjie
    Zhang, Jianlin
    Cai, Jingju
    Xu, Zhiyong
    SYMMETRY-BASEL, 2021, 13 (12):
  • [25] Selective directed graph convolutional network for skeleton-based action recognition
    Ke, Chengyuan
    Liu, Sheng
    Feng, Yuan
    Chen, Shengyong
    PATTERN RECOGNITION LETTERS, 2025, 190 : 141 - 146
  • [26] Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Wang, Yizhou
    Zhang, Xingjin
    Wang, Junfeng
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [27] Feature reconstruction graph convolutional network for skeleton-based action recognition
    Huang, Junhao
    Wang, Ziming
    Peng, Jian
    Huang, Feihu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [28] Spatial-temporal graph attention networks for skeleton-based action recognition
    Huang, Qingqing
    Zhou, Fengyu
    He, Jiakai
    Zhao, Yang
    Qin, Runze
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (05)
  • [29] Lightweight channel-topology based adaptive graph convolutional network for skeleton-based action recognition
    Wang K.
    Deng H.
    Zhu Q.
    Neurocomputing, 2023, 560
  • [30] Pyramidal Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Liu, Zhongyu
    Huo, Yu
    Xu, Yonggang
    Hua, Gang
    IEEE SENSORS JOURNAL, 2021, 21 (14) : 16183 - 16191