A generative restricted Boltzmann machine based method for high-dimensional motion data modeling

Citations: 32
Authors
Nie, Siqi [1]
Wang, Ziheng [1]
Ji, Qiang [1]
Affiliation
[1] Rensselaer Polytech Inst, Dept Elect Comp & Syst Engn, Troy, NY 12180 USA
Funding
National Science Foundation (USA)
Keywords
Restricted Boltzmann machine; Generative model; High-dimensional motion data; Facial expression recognition; Human action recognition; RECOGNITION;
DOI
10.1016/j.cviu.2014.12.005
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Many computer vision applications involve modeling complex spatio-temporal patterns in high-dimensional motion data. Recently, restricted Boltzmann machines (RBMs) have been widely used to capture and represent spatial patterns in a single image or temporal patterns in several time slices. To model global dynamics and local spatial interactions, we propose to theoretically extend the conventional RBMs by introducing another term in the energy function to explicitly model the local spatial interactions in the input data. A learning method is then proposed to perform efficient learning for the proposed model. We further introduce a new method for multi-class classification that can effectively estimate the infeasible partition functions of different RBMs, so that the RBM can be treated as a generative model for classification purposes. The improved RBM model is evaluated on two computer vision applications: facial expression recognition and human action recognition. Experimental results on benchmark databases demonstrate the effectiveness of the proposed algorithm. Published by Elsevier Inc.
Pages: 14-22
Number of pages: 9