Multi-stream P&U adaptive graph convolutional networks for skeleton-based action recognition

被引:3
作者
Chen, Minglong [1 ]
Liang, Jiuzhen [1 ]
Liu, Hao [1 ]
机构
[1] Changzhou Univ, Sch Informat & Engn, Changzhou 213164, Peoples R China
关键词
Action recognition; Skeleton-based; Graph convolution network; Graph pooling; Multi-stream;
D O I
10.1007/s11227-024-05900-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, action recognition has been an essential branch of video understanding and a hot research direction. Among them, the graph convolutional network (GCN) is widely used in skeleton-based action recognition and has achieved remarkable performance. However, in practical situations, recognizing human action often depends on the movement of a part of the joints. In the existing GCN-based methods, the size of a single frame of the skeleton graph is fixed, and all joints of the human body will participate in the whole operation process, so the critical joints in the moving process cannot be flexibly selected. Therefore, this paper takes the adaptive graph convolutional network (AGCN) as the baseline and uses the graph-pooling method to select the critical joints in the human moving process. We design two new networks: Pooling-AGCN and U-AGCN and use them to form the multi-stream P&U AGCNs for action recognition. Extensive experiments show the complementarity between the two networks and that the method proposed in this paper outperforms the recent work on the three large-scale public datasets (NTU-RGB+D 60, NTU-RGB+D 120, Kinetics-Skeleton).
引用
收藏
页码:11614 / 11639
页数:26
相关论文
共 44 条
[1]   STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition [J].
Ahn, Dasom ;
Kim, Sangwon ;
Hong, Hyunsu ;
Ko, Byoung Chul .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :3319-3328
[2]  
Baek J, 2021, Arxiv, DOI [arXiv:2102.11533, 10.48550/arXiv.2102.11533]
[3]  
Bai R., 2022, 2022 IEEE INT C MULT, P01
[4]   SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition [J].
Caetano, Carlos ;
Sena, Jessica ;
Bremond, Francois ;
dos Santos, Jefersson A. ;
Schwartz, William Robson .
2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
[5]   Deep historical long short-term memory network for action recognition [J].
Cai, Jiaxin ;
Hu, Junlin ;
Tang, Xin ;
Hung, Tzu-Yi ;
Tan, Yap-Peng .
NEUROCOMPUTING, 2020, 407 (407) :428-438
[6]  
Gao HY, 2019, PR MACH LEARN RES, V97
[7]   Optimized Skeleton-based Action Recognition via Sparsified Graph Regression [J].
Gao, Xiang ;
Hu, Wei ;
Tang, Jiaxiang ;
Liu, Jiaying ;
Guo, Zongming .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :601-610
[8]   Space-time representation of people based on 3D skeletal data: A review [J].
Han, Fei ;
Reily, Brian ;
Hoff, William ;
Zhang, Hao .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 158 :85-105
[9]  
Huang LJ, 2020, AAAI CONF ARTIF INTE, V34, P11045
[10]  
Huang XH, 2023, Arxiv, DOI arXiv:2301.10900