A Novel Skeleton Spatial Pyramid Model for Skeleton-based Action Recognition

被引:0
作者
Li, Yanshan [1 ]
Guo, Tianyu [1 ]
Xia, Rongjie [1 ]
Liu, Xing [1 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China
来源
2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019) | 2019年
基金
中国国家自然科学基金;
关键词
Spatial Pyramid Model; Convolutional Neural Networks; Action Recognition;
D O I
10.1109/siprocess.2019.8868666
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the development of computer science and the rise of deep learning technologies, the skeleton-based action recognition dataset has become larger and larger, which has pushed experts and scholars in the field of action recognition to seek more efficient and accurate algorithms. Considering the most critical factors in the task of action recognition arc the intra-frame representation of the joints of a skeleton and the inter-frame representation of the skeleton sequence, we propose a novel skeleton spatial pyramid model (S-SPM). The spatial information of different levels is gradually weighted and aggregated, which effectively models the spatial features of the skeleton sequence. Then the spatio-temporal feature representation based on the skeleton spatial pyramid model is proposed to model the temporal information to obtain deep spatio-temporal feature. Finally, the spatio-temporal feature is fed into the convolutional neural network (CNN) to effectively recognize the actions. The experimental results of the proposed algorithm in the NTU RGB+D dataset show that the S-SPM can improve the accuracies for skeleton-based action recognition.
引用
收藏
页码:16 / 20
页数:5
相关论文
共 17 条
[1]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714
[2]   Deep Learning on Lie Groups for Skeleton-based Action Recognition [J].
Huang, Zhiwu ;
Wan, Chengde ;
Probst, Thomas ;
Van Gool, Luc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1243-1252
[3]  
Jin S.-Y., 2012, P AS C COMP VIS
[4]   Deformable Spatial Pyramid Matching for Fast Dense Correspondences [J].
Kim, Jaechul ;
Liu, Ce ;
Sha, Fei ;
Grauman, Kristen .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2307-2314
[5]   Tensor Representations via Kernel Linearization for Action Recognition from 3D Skeletons [J].
Koniusz, Piotr ;
Cherian, Anoop ;
Porikli, Fatih .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :37-53
[6]  
Lazebnik S., 2006, 2006 IEEE COMP SOC C, VVolume 2, P2169, DOI DOI 10.1109/CVPR.2006.68
[7]   Intrinsic spatial pyramid matching for deformable 3D shape retrieval [J].
Li, Chunyuan ;
Ben Hamza, A. .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2013, 2 (04) :261-271
[8]   Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition [J].
Liu, Jun ;
Shahroudy, Amir ;
Xu, Dong ;
Wang, Gang .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :816-833
[9]  
Lv F., 2006, EUR C COMP VIS
[10]  
Shahri Alimohammad, 2016, 2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS), P1, DOI 10.1109/RCIS.2016.7549312