Self-Adaptive Graph With Nonlocal Attention Network for Skeleton-Based Action Recognition

被引:9
作者
Pang, Chen [1 ,2 ]
Gao, Xingyu [3 ]
Chen, Zhenyu [4 ,5 ]
Lyu, Lei [1 ,2 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan 250358, Peoples R China
[3] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
[4] State Grid Corp China, Big Data Ctr, Beijing 100031, Peoples R China
[5] China Elect Power Res Inst, Beijing 100192, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; global attention; graph convolutional network (GCN); self-adaptive graph;
D O I
10.1109/TNNLS.2023.3298950
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional networks (GCNs) have achieved encouraging progress in modeling human body skeletons as spatial-temporal graphs. However, existing methods still suffer from two inherent drawbacks. Firstly, these models process the input data based on the physical structure of the human body, which leads to some latent correlations among joints being ignored. Furthermore, the key temporal relationships between nonadjacent frames are overlooked, preventing to fully learn the changes of the body joints along the temporal dimension. To address these issues, we propose an innovative spatial-temporal model by introducing a self-adaptive GCN (SAGCN) with global attention network, collectively termed SAG-GAN. Specifically, the SAGCN module is proposed to construct two additional dynamic topological graphs to learn the common characteristics of all data and represent a unique pattern for each sample, respectively. Meanwhile, the global attention module (spatial attention (SA) and temporal attention (TA) modules) is designed to extract the global connections between different joints in a single frame and model temporal relationships between adjacent and nonadjacent frames in temporal sequences. In this manner, our network can capture richer features of actions for accurate action recognition and overcome the defect of the standard graph convolution. Extensive experiments on three benchmark datasets (NTU-60, NTU-120, and Kinetics) have demonstrated the superiority of our proposed method.
引用
收藏
页码:17057 / 17069
页数:13
相关论文
共 50 条
[41]   Insight on Attention Modules for Skeleton-Based Action Recognition [J].
Jiang, Quanyan ;
Wu, Xiaojun ;
Kittler, Josef .
PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 :242-255
[42]   Spatial Graph Convolutional and Temporal Involution Network for Skeleton-based Action Recognition [J].
Wan, Huifan ;
Pan, Guanghui ;
Chen, Yu ;
Ding, Danni ;
Zou, Maoyang .
PROCEEDINGS OF ACM TURING AWARD CELEBRATION CONFERENCE, ACM TURC 2021, 2021, :204-209
[43]   Mixed graph convolution and residual transformation network for skeleton-based action recognition [J].
Shuhua Liu ;
Xiaoying Bai ;
Ming Fang ;
Lanting Li ;
Chih-Cheng Hung .
Applied Intelligence, 2022, 52 :1544-1555
[44]   Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition [J].
Song, Yi-Fan ;
Zhang, Zhang ;
Shan, Caifeng ;
Wang, Liang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) :1915-1925
[45]   Mixed graph convolution and residual transformation network for skeleton-based action recognition [J].
Liu, Shuhua ;
Bai, Xiaoying ;
Fang, Ming ;
Li, Lanting ;
Hung, Chih-Cheng .
APPLIED INTELLIGENCE, 2022, 52 (02) :1544-1555
[46]   Temporal Receptive Field Graph Convolutional Network for Skeleton-based Action Recognition [J].
Zhang, Qingqi ;
Wu, Ren ;
Nakata, Mitsuru ;
Ge, Qi-Wei .
2024 INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS, AND COMMUNICATIONS, ITC-CSCC 2024, 2024,
[47]   Adaptive Graph Convolutional Network With Adversarial Learning for Skeleton-Based Action Prediction [J].
Li, Guangxin ;
Li, Nanjun ;
Chang, Faliang ;
Liu, Chunsheng .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (03) :1258-1269
[48]   Improved Graph Convolutional Network with Enriched Graph Topology Representation for Skeleton-Based Action Recognition [J].
Alsarhan, Tamam ;
Harfoushi, Osama ;
Shdefat, Ahmed Younes ;
Mostafa, Nour ;
Alshinwan, Mohammad ;
Ali, Ahmad .
ELECTRONICS, 2023, 12 (04)
[49]   Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition [J].
Li, Xuanfeng ;
Lu, Jian ;
Zhou, Jian ;
Liu, Wei ;
Zhang, Kaibing .
COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
[50]   Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-Based Human Action Recognition [J].
Liu, Yanan ;
Li, Yanqiu ;
Zhang, Hao ;
Zhang, Xuejie ;
Xu, Dan .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) :9445-9457