Skeleton-based action recognition using sparse spatio-temporal GCN with edge effective resistance

被引:27
作者
Ahmad, Tasweer [1 ]
Jin, Lianwen [1 ]
Lin, Luojun [1 ]
Tang, GuoZhi [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510000, Peoples R China
关键词
Graph convolutional neural networks; Graph sparsification; Self-attention graph pooling; SPARSIFICATION; NETWORK;
D O I
10.1016/j.neucom.2020.10.096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional neural networks have established significant success in solving various machine learning and computer vision problems. For skeleton-based action recognition, graph convolutional neural networks are the most suitable choice since human skeleton resembles to a graph. Stacking body skeletons over the length of video sequence results in a very complex spatio-temporal graph of many nodes and edges. Modeling the graph convolutional network directly with such a complex graph curtails the performance due to the redundancy of insignificant nodes and edges in the graph. Also for skeleton based action recognition, the long-term contextual information is of central importance and many current architectures may fail to capture such contextual information. Therefore in order to alleviate these problems, we propose graph sparsification technique using edge effective resistance to better model the global context information and to eliminate redundant nodes and edges in the graph. Furthermore, we incorporate self-attention graph pooling to retain local properties and graph structures while pooling operation. To the best of our knowledge, we are the first to apply graph sparsification using edge effective resistance for skeleton-based action recognition and our proposed method is confirmed to be effective on action recognition, which achieves state-of-the-art results on publicly available datasets: UTD-MHAD, J-HMDB, NTU-RGB + D-60, NTU-RGB + D-120 and Kinetics dataset. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:389 / 398
页数:10
相关论文
共 52 条
[11]   Optimized Skeleton-based Action Recognition via Sparsified Graph Regression [J].
Gao, Xiang ;
Hu, Wei ;
Tang, Jiaxiang ;
Liu, Jiaying ;
Guo, Zongming .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :601-610
[12]   Neural Graph Matching Networks for Fewshot 3D Action Recognition [J].
Guo, Michelle ;
Chou, Edward ;
Huang, De-An ;
Song, Shuran ;
Yeung, Serena ;
Li Fei-Fei .
COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 :673-689
[13]   Towards understanding action recognition [J].
Jhuang, Hueihan ;
Gall, Juergen ;
Zuffi, Silvia ;
Schmid, Cordelia ;
Black, Michael J. .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3192-3199
[14]   A New Representation of Skeleton Sequences for 3D Action Recognition [J].
Ke, Qiuhong ;
Bennamoun, Mohammed ;
An, Senjian ;
Sohel, Ferdous ;
Boussaid, Farid .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4570-4579
[15]  
Khalid M.U., ARXIV190903466V1
[16]  
Kipf TN., ARXIV160902907
[17]  
Kyng R., 2015, C LEARNING THEORY, P1190
[18]   Sparsified Cholesky and Multigrid Solvers for Connection Laplacians [J].
Kyng, Rasmus ;
Lee, Yin Tat ;
Peng, Richard ;
Sachdeva, Sushant ;
Spielman, Daniel A. .
STOC'16: PROCEEDINGS OF THE 48TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2016, :842-850
[19]  
Lee J., ARXIV190408082
[20]  
Li B, 2019, AAAI CONF ARTIF INTE, P8561