Self-Adaptive Graph With Nonlocal Attention Network for Skeleton-Based Action Recognition

被引：9

作者：

Pang, Chen ^{[1
,2
]}

Gao, Xingyu ^{[3
]}

Chen, Zhenyu ^{[4
,5
]}

Lyu, Lei ^{[1
,2
]}

机构：

[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China

[2] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan 250358, Peoples R China

[3] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China

[4] State Grid Corp China, Big Data Ctr, Beijing 100031, Peoples R China

[5] China Elect Power Res Inst, Beijing 100192, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Action recognition; global attention; graph convolutional network (GCN); self-adaptive graph;

D O I：

10.1109/TNNLS.2023.3298950

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Graph convolutional networks (GCNs) have achieved encouraging progress in modeling human body skeletons as spatial-temporal graphs. However, existing methods still suffer from two inherent drawbacks. Firstly, these models process the input data based on the physical structure of the human body, which leads to some latent correlations among joints being ignored. Furthermore, the key temporal relationships between nonadjacent frames are overlooked, preventing to fully learn the changes of the body joints along the temporal dimension. To address these issues, we propose an innovative spatial-temporal model by introducing a self-adaptive GCN (SAGCN) with global attention network, collectively termed SAG-GAN. Specifically, the SAGCN module is proposed to construct two additional dynamic topological graphs to learn the common characteristics of all data and represent a unique pattern for each sample, respectively. Meanwhile, the global attention module (spatial attention (SA) and temporal attention (TA) modules) is designed to extract the global connections between different joints in a single frame and model temporal relationships between adjacent and nonadjacent frames in temporal sequences. In this manner, our network can capture richer features of actions for accurate action recognition and overcome the defect of the standard graph convolution. Extensive experiments on three benchmark datasets (NTU-60, NTU-120, and Kinetics) have demonstrated the superiority of our proposed method.

引用

页码：17057 / 17069

页数：13

共 50 条

[21] Multi-stream adaptive 3D attention graph convolution network for skeleton-based action recognition
Lubin Yu
Lianfang Tian
Qiliang Du
Jameel Ahmed Bhutto
Applied Intelligence, 2023, 53 : 14838 - 14854
[22] Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition
Shu, Yang
Li, Wanggen
Li, Doudou
Gao, Kun
Jie, Biao
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 16 - 28
[23] EARLY FUSION GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
Zhao, Xiaoxue
Liu, Cuiwei
Shi, Xiangbin
2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
[24] Relation Selective Graph Convolutional Network for Skeleton-Based Action Recognition
Yang, Wenjie
Zhang, Jianlin
Cai, Jingju
Xu, Zhiyong
SYMMETRY-BASEL, 2021, 13 (12):
[25] Selective directed graph convolutional network for skeleton-based action recognition
Ke, Chengyuan
Liu, Sheng
Feng, Yuan
Chen, Shengyong
PATTERN RECOGNITION LETTERS, 2025, 190 : 141 - 146
[26] Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition
Zheng, Zhiyun
Wang, Yizhou
Zhang, Xingjin
Wang, Junfeng
APPLIED SCIENCES-BASEL, 2022, 12 (03):
[27] Feature reconstruction graph convolutional network for skeleton-based action recognition
Huang, Junhao
Wang, Ziming
Peng, Jian
Huang, Feihu
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[28] Spatial-temporal graph attention networks for skeleton-based action recognition
Huang, Qingqing
Zhou, Fengyu
He, Jiakai
Zhao, Yang
Qin, Runze
JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (05)
[29] Lightweight channel-topology based adaptive graph convolutional network for skeleton-based action recognition
Wang K.
Deng H.
Zhu Q.
Neurocomputing, 2023, 560
[30] Pyramidal Graph Convolutional Network for Skeleton-Based Human Action Recognition
Li, Fanjia
Zhu, Aichun
Liu, Zhongyu
Huo, Yu
Xu, Yonggang
Hua, Gang
IEEE SENSORS JOURNAL, 2021, 21 (14) : 16183 - 16191

← 1 2 3 4 5 →