Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition

被引:962
作者
Liu, Jun [1 ]
Shahroudy, Amir [1 ]
Xu, Dong [2 ]
Wang, Gang [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
来源
COMPUTER VISION - ECCV 2016, PT III | 2016年 / 9907卷
关键词
3D action recognition; Recurrent neural networks; Long short-term memory; Trust gate; Spatio-temporal analysis; SEQUENCE;
D O I
10.1007/978-3-319-46487-9_50
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D action recognition - analysis of human actions based on 3D skeleton data - becomes popular recently due to its succinctness, robustness, and view-invariant representation. Recent attempts on this problem suggested to develop RNN-based learning methods to model the contextual dependency in the temporal domain. In this paper, we extend this idea to spatio-temporal domains to analyze the hidden sources of action-related information within the input data over both domains concurrently. Inspired by the graphical structure of the human skeleton, we further propose a more powerful tree-structure based traversal method. To handle the noise and occlusion in 3D skeleton data, we introduce new gating mechanism within LSTM to learn the reliability of the sequential input data and accordingly adjust its effect on updating the long-term context information stored in the memory cell. Our method achieves state-of-the-art performance on 4 challenging benchmark datasets for 3D human action analysis.
引用
收藏
页码:816 / 833
页数:18
相关论文
共 72 条
[1]  
Al Alwani A. S., 2016, PR LETT IN PRESS
[2]  
[Anonymous], 2016, CVPR
[3]  
[Anonymous], CVPR
[4]  
[Anonymous], SPACE TIME REPRESENT
[5]  
[Anonymous], 2012, INTERSPEECH
[6]  
[Anonymous], 2010, CVPRW
[7]  
[Anonymous], TPAMI
[8]  
[Anonymous], 2013, ICASSP
[9]  
[Anonymous], 2016, CVPR
[10]  
[Anonymous], 2014, CVPR