Histogram of Oriented Principal Components for Cross-View Action Recognition

被引:145
作者
Rahmani, Hossein [1 ]
Mahmood, Arif [1 ]
Du Huynh [1 ]
Mian, Ajmal [1 ]
机构
[1] Univ Western Australia, Sch Comp Sci & Software Engn, 35 Stirling Highway, Crawley, WA 6009, Australia
关键词
Spatio-temporal keypoint; pointcloud; view invariance; ENSEMBLE;
D O I
10.1109/TPAMI.2016.2533389
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing techniques for 3D action recognition are sensitive to viewpoint variations because they extract features from depth images which are viewpoint dependent. In contrast, we directly process pointclouds for cross-view action recognition from unknown and unseen views. We propose the histogram of oriented principal components (HOPC) descriptor that is robust to noise, viewpoint, scale and action speed variations. At a 3D point, HOPC is computed by projecting the three scaled eigenvectors of the pointcloud within its local spatio-temporal support volume onto the vertices of a regular dodecahedron. HOPC is also used for the detection of spatio-temporal keypoints (STK) in 3D pointcloud sequences so that view-invariant STK descriptors (or Local HOPC descriptors) at these key locations only are used for action recognition. We also propose a global descriptor computed from the normalized spatio-temporal distribution of STKs in 4-D, which we refer to as STK-D. We have evaluated the performance of our proposed descriptors against nine existing techniques on two cross-view and three single-view human action recognition datasets. The experimental results show that our techniques provide significant improvement over state-of-the-art methods.
引用
收藏
页码:2430 / 2443
页数:14
相关论文
共 60 条
[1]   Human Activity Analysis: A Review [J].
Aggarwal, J. K. ;
Ryoo, M. S. .
ACM COMPUTING SURVEYS, 2011, 43 (03)
[2]  
[Anonymous], 2007, 2007 IEEE C COMP VIS
[3]  
[Anonymous], COMP VIS ACCV 2012
[4]  
[Anonymous], 2012, P ACM INT C MULT NAR, DOI DOI 10.1145/2393347.2396382
[5]  
Blank M, 2005, IEEE I CONF COMP VIS, P1395
[6]  
CAMPBELL LW, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P624, DOI 10.1109/ICCV.1995.466880
[7]  
Chen YW, 2006, STUDIES FUZZINESS SO, P207
[8]  
Cheng ZW, 2012, LECT NOTES COMPUT SC, V7584, P52, DOI 10.1007/978-3-642-33868-7_6
[9]  
Coxeter H. S. M., 1973, Regular polytopes, V3rd
[10]   Task-specific gesture analysis in real-time using interpolated views [J].
Darrell, TJ ;
Essa, IA ;
Pentland, AP .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (12) :1236-1242