ICE-GCN: An interactional channel excitation-enhanced graph convolutional network for skeleton-based action recognition

Cited by: 2
Authors
Wang, Shuxi [1]
Pan, Jiahui [1,3]
Huang, Binyuan [1]
Liu, Pingzhi [1]
Li, Zina [2]
Zhou, Chengju [1]
Affiliations
[1] South China Normal Univ, Sch Software, Foshan 528225, Peoples R China
[2] South China Normal Univ, Sch Psychol, Guangzhou 510631, Peoples R China
[3] Pazhou Lab, Guangzhou 510330, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Skeleton-based action recognition; Graph convolutional network; Channel-wise attention; Cross-dimensional interaction;
DOI
10.1007/s00138-023-01386-2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Thanks to the development of depth sensors and pose estimation algorithms, skeleton-based action recognition has become prevalent in the computer vision community. Most existing works are based on spatio-temporal graph convolutional network frameworks, which learn and treat all spatial or temporal features equally. They ignore the interaction with the channel dimension, which would reveal the different contributions of different spatio-temporal patterns along the channel direction, and thus lose the ability to distinguish confusing actions with subtle differences. In this paper, an interactional channel excitation (ICE) module is proposed to explore discriminative spatio-temporal features of actions by adaptively recalibrating channel-wise pattern maps. More specifically, a channel-wise spatial excitation (CSE) is incorporated to capture the crucial global body-structure patterns and excite the spatial-sensitive channels, and a channel-wise temporal excitation (CTE) is designed to learn temporal inter-frame dynamics and excite the temporal-sensitive channels. ICE enhances different backbones as a plug-and-play module. Furthermore, we systematically investigate graph topology strategies and argue that complementary information is necessary for sophisticated action description. Finally, equipped with ICE, an interactional channel excited graph convolutional network with complementary topology (ICE-GCN) is proposed and evaluated on three large-scale datasets: NTU RGB+D 60, NTU RGB+D 120, and Kinetics-Skeleton. Extensive experimental results and ablation studies demonstrate that our method outperforms other state-of-the-art methods and confirm the effectiveness of the individual sub-modules. The code will be published at .
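The abstract's idea of recalibrating channels with a spatial gate and a temporal gate can be illustrated with a toy sketch. This is not the authors' ICE module: the function names (`channel_gates`, `excite`), the choice of descriptors (a global mean for the spatial gate, the mean absolute inter-frame difference for the temporal gate), and the absence of learned weights are all assumptions made purely for illustration. Features are stored as nested lists indexed `x[channel][frame][joint]`.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def channel_gates(x):
    """Return one (spatial_gate, temporal_gate) pair per channel.

    Hypothetical descriptors, not taken from the paper:
    - spatial: mean activation over all frames and joints
    - temporal: mean absolute difference between consecutive frames
    """
    gates = []
    for channel in x:
        num_frames = len(channel)
        num_joints = len(channel[0])
        spatial = sum(sum(frame) for frame in channel) / (num_frames * num_joints)
        if num_frames > 1:
            temporal = sum(
                abs(channel[t + 1][v] - channel[t][v])
                for t in range(num_frames - 1)
                for v in range(num_joints)
            ) / ((num_frames - 1) * num_joints)
        else:
            temporal = 0.0
        gates.append((sigmoid(spatial), sigmoid(temporal)))
    return gates


def excite(x):
    """Scale every value of a channel by the product of its two gates."""
    gates = channel_gates(x)
    return [
        [[value * gs * gt for value in frame] for frame in channel]
        for channel, (gs, gt) in zip(x, gates)
    ]


# Toy input: 2 channels, 3 frames, 2 joints.
x = [
    [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]],  # static channel
    [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]],  # channel with inter-frame motion
]
gates = channel_gates(x)
y = excite(x)
```

In this toy input both channels have the same mean activation, so their spatial gates match, while the channel with inter-frame motion receives the larger temporal gate. A trained module would presumably derive its gates from learned layers rather than from fixed statistics.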
Pages: 13
Related papers
50 in total
[21]   Temporal-enhanced graph convolution network for skeleton-based action recognition [J].
Xie, Yulai ;
Zhang, Yang ;
Ren, Fang .
IET COMPUTER VISION, 2022, 16 (03) :266-279
[22]   Multi-channel network: Constructing efficient GCN baselines for skeleton-based action recognition [J].
Hou, Ruijie ;
Wang, Zhihao ;
Ren, Ruimin ;
Cao, Yang ;
Wang, Zhao .
COMPUTERS & GRAPHICS-UK, 2023, 110 :111-117
[23]   Multi-Stage Attention-Enhanced Sparse Graph Convolutional Network for Skeleton-Based Action Recognition [J].
Li, Chaoyue ;
Zou, Lian ;
Fan, Cien ;
Jiang, Hao ;
Liu, Yifeng .
ELECTRONICS, 2021, 10 (18)
[24]   Dual-Excitation Spatial-Temporal Graph Convolution Network for Skeleton-Based Action Recognition [J].
Lu, Jian ;
Huang, Tingting ;
Zhao, Bo ;
Chen, Xiaogai ;
Zhou, Jian ;
Zhang, Kaibing .
IEEE SENSORS JOURNAL, 2024, 24 (06) :8184-8196
[25]   Skeleton-based Action Recognition Using Two-stream Graph Convolutional Network with Pose Refinement [J].
Zheng, Biao ;
Chen, Luefeng ;
Wu, Min ;
Pedrycz, Witold ;
Hirota, Kaoru .
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, :6353-6356
[26]   Fast Temporal Graph Convolutional Model for Skeleton-Based Action Recognition [J].
Nan, Mihai ;
Florea, Adina Magda .
SENSORS, 2022, 22 (19)
[27]   A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition [J].
Miao, Shuangyan ;
Hou, Yonghong ;
Gao, Zhimin ;
Xu, Mingliang ;
Li, Wanqing .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) :4893-4899
[28]   Skeleton-Based Action Recognition with Improved Graph Convolution Network [J].
Yang, Xuqi ;
Zhang, Jia ;
Qin, Rong ;
Su, Yunyu ;
Qiu, Shuting ;
Yu, Jintian ;
Ge, Yongxin .
BIOMETRIC RECOGNITION (CCBR 2021), 2021, 12878 :31-38
[29]   Multiple Input Branches Shift Graph Convolutional Network with DropEdge for Skeleton-Based Action Recognition [J].
Liu, Yan ;
Deng, Yuelin ;
Su, Jinping ;
Wang, Ruonan ;
Li, Chi .
IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 :584-596
[30]   Skeleton-based action recognition based on multidimensional adaptive convolutional network [J].
Xia, Yu ;
Gao, Qingyuan ;
Wu, Weiguan ;
Cao, Yi .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127