Action recognition by learning temporal slowness invariant features

被引:9
作者
Pei, Lishen [1 ]
Ye, Mao [1 ]
Zhao, Xuezhuan [2 ]
Dou, Yumin [1 ]
Bao, Jiao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Ctr Robot, Key Lab NeuroInformat,Minist Educ, Chengdu 611731, Peoples R China
[2] Chinese Acad Sci, Chengdu Inst Comp Applicat, Chengdu 610041, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Temporal slowness regularization; Spatio-temporal features; Independent subspace analysis; Support vector machine;
D O I
10.1007/s00371-015-1090-2
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning approaches emphasized on learning spatio-temporal features for action recognition. Different to previous works, we separate the spatio-temporal feature learning unity into the spatial feature learning and the spatial/temporal feature pooling procedures. Using the temporal slowness regularized independent subspace analysis network, we learn invariant spatial features from sampled video cubes. To be robust to the cluttered backgrounds, we incorporate the denoising criterion to our network. The local spatio-temporal features are obtained by pooling features from the spatial and the temporal aspects. The key points are that we learn spatial features from video cubes and pool features from spatial feature sequences. We evaluate the learned local spatio-temporal features on three benchmark action datasets. Extensive experiments demonstrate the effectiveness of the novel feature learning architecture.
引用
收藏
页码:1395 / 1404
页数:10
相关论文
共 40 条
[1]  
ANDREW G., 2007, P 24 INT C MACHINE L, P33
[2]  
[Anonymous], 2009, NIPS
[3]  
[Anonymous], IEEE INT C COMP VIS
[4]  
[Anonymous], IEEE C COMP VIS PATT
[5]  
[Anonymous], P ACM INT C MULT
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]  
Chen B, 2010, THESIS U BRIT COLUMB
[8]   'Breaking' position-invariant object recognition [J].
Cox, DD ;
Meier, P ;
Oertelt, N ;
DiCarlo, JJ .
NATURE NEUROSCIENCE, 2005, 8 (09) :1145-1147
[9]   A comprehensive survey of human action recognition with spatio-temporal interest point (STIP) detector [J].
Das Dawn, Debapratim ;
Shaikh, Soharab Hossain .
VISUAL COMPUTER, 2016, 32 (03) :289-306
[10]  
Delaitre V., 2010, Proceedings of the British Machine Vision Conference, DOI [DOI 10.5244/C.24.97, 10.5244/C.24.97]