Adaptive RNN Tree for Large-Scale Human Action Recognition

被引:83
作者
Li, Wenbo [1 ]
Wen, Longyin [2 ]
Chang, Ming-Ching [1 ]
Lim, Ser Nam [2 ,3 ]
Lyu, Siwei [1 ]
机构
[1] SUNY Albany, Albany, NY 12222 USA
[2] GE Global Res, Niskayuna, NY USA
[3] Avitas Syst, Boston, MA USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
基金
美国国家科学基金会;
关键词
REAL-TIME; POSE;
D O I
10.1109/ICCV.2017.161
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present the RNN Tree (RNN-T), an adaptive learning framework for skeleton based human action recognition. Our method categorizes action classes and uses multiple Recurrent Neural Networks (RNNs) in a treelike hierarchy. The RNNs in RNN-T are co-trained with the action category hierarchy, which determines the structure of RNN-T. Actions in skeletal representations are recognized via a hierarchical inference process, during which individual RNNs differentiate finer-grained action classes with increasing confidence. Inference in RNN-T ends when any RNN in the tree recognizes the action with high confidence, or a leaf node is reached. RNN-T effectively addresses two main challenges of large-scale action recognition: (i) able to distinguish fine-grained action classes that are intractable using a single network, and (ii) adaptive to new action classes by augmenting an existing model. We demonstrate the effectiveness of RNN-T/ACH method and compare it with the state-of-the-art methods on a large-scale dataset and several existing benchmarks.
引用
收藏
页码:1453 / 1461
页数:9
相关论文
共 37 条
[1]   Human activity recognition from 3D data: A review [J].
Aggarwal, J. K. ;
Xia, Lu .
PATTERN RECOGNITION LETTERS, 2014, 48 :70-80
[2]  
[Anonymous], 2013, CMU GRAPH LAB MOT CA
[3]  
[Anonymous], 2013, P 23 INT JOINT C ART
[4]  
[Anonymous], 2012, P SIGCHI C HUM FACT
[5]  
[Anonymous], 1996, NIPS
[6]  
[Anonymous], 2014, Asian Conference on Computer Vision
[7]  
[Anonymous], 2007, Computer Graphics Technical Report CG-2007-2
[8]   Parallel Spectral Clustering in Distributed Systems [J].
Chen, Wen-Yen ;
Song, Yangqiu ;
Bai, Hongjie ;
Lin, Chih-Jen ;
Chang, Edward Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :568-586
[9]   Online Deformable Object Tracking Based on Structure-Aware Hyper-Graph [J].
Du, Dawei ;
Qi, Honggang ;
Li, Wenbo ;
Wen, Longyin ;
Huang, Qingming ;
Lyu, Siwei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (08) :3572-3584
[10]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714