On Geometric Features for Skeleton-Based Action Recognition using Multilayer LSTM Networks

被引:211
作者
Zhang, Songyang [1 ]
Liu, Xiaoming [2 ]
Xiao, Jun [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou, Zhejiang, Peoples R China
[2] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
来源
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017) | 2017年
关键词
JOINTS;
D O I
10.1109/WACV.2017.24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RNN-based approaches have achieved outstanding performance on action recognition with skeleton inputs. Currently these methods limit their inputs to coordinates of joints and improve the accuracy mainly by extending RNN models to spatial domains in various ways. While such models explore relations between different parts directly from joint coordinates, we provide a simple universal spatial modeling method perpendicular to the RNN model enhancement. Specifically, we select a set of simple geometric features, motivated by the evolution of previous work. With experiments on a 3-layer LSTM framework, we observe that the geometric relational features based on distances between joints and selected lines outperform other features and achieve state-of-art results on four datasets. Further, we show the sparsity of input gate weights in the first LSTM layer trained by geometric features and demonstrate that utilizing joint-line distances as input require less data for training.
引用
收藏
页码:148 / 157
页数:10
相关论文
共 37 条
[1]  
Anirudh R, 2015, PROC CVPR IEEE, P3147, DOI 10.1109/CVPR.2015.7298934
[2]  
[Anonymous], 2015, PROC CVPR IEEE
[3]  
[Anonymous], 2012, IEEE COMP SOC C COMP, DOI DOI 10.1109/CVPRW.2012.6239234
[4]  
Aydin R, 2014, IN C IND ENG ENG MAN, P1, DOI 10.1109/IEEM.2014.7058588
[5]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[6]  
Breuel T. M., 2015, ARXIV150802774 LSTM
[7]   Bio-inspired Dynamic 3D Discriminative Skeletal Features for Human Action Recognition [J].
Chaudhry, Rizwan ;
Ofli, Ferda ;
Kurillo, Gregorij ;
Bajcsy, Ruzena ;
Vidal, Rene .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :471-478
[8]   Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor [J].
Chen, Cheng ;
Zhuang, Yueting ;
Nie, Feiping ;
Yang, Yi ;
Wu, Fei ;
Xiao, Jun .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (11) :1676-1689
[9]  
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[10]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714