Multi-scale affined-HOF and dimension selection for view-unconstrained action recognition

被引:5
作者
Dinh Tuan Tran [1 ]
Yamazoe, Hirotake [2 ]
Lee, Joo-Ho [2 ]
机构
[1] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kyoto, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kyoto, Japan
基金
日本学术振兴会;
关键词
Action recognition; View-invariant; Affine transform; Histogram of optical flow; Dimension selection; Voting algorithm;
D O I
10.1007/s10489-019-01572-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper an action recognition method that can adaptively handle the problems of variations in camera viewpoint is introduced. Our contribution is three-fold. First, a space-sampling algorithm based on affine transform in multiple scales is proposed to yield a series of different viewpoints from a single one. A histogram of dense optical flow is then extracted over each fixed-size patch for a given generated viewpoint as a local feature descriptor. Second, a dimension selection procedure is also proposed to retain only the dimensions that have distinctive information and discard the unnecessary ones in the feature vector space. Third, to adapt to a situation in which video data in multiple viewpoints are used for training; an extended method with a voting algorithm is also introduced to increase the recognition accuracy. By conducting experiments using both simulated and realistic datasets (), the proposed method is validated. The method is found to be accurate and capable of maintaining its accuracy under a wide range of viewpoint changes. In addition, the method is less sensitive to variations in subject scale, subject position, action speed, partial occlusion, and background. The method is also validated by comparing with state-of-the-art view-invariant action recognition methods using well-known i3DPost and MuHAVi public datasets.
引用
收藏
页码:1468 / 1486
页数:19
相关论文
共 48 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]   Silhouette-based human action recognition using sequences of key poses [J].
Andre Chaaraoui, Alexandros ;
Climent-Perez, Pau ;
Florez-Revuelta, Francisco .
PATTERN RECOGNITION LETTERS, 2013, 34 (15) :1799-1807
[3]  
Angelini F., 2018, ARXIV181012126
[4]  
[Anonymous], NIANDR LAK RAC TRACK
[5]  
[Anonymous], 1997, Neural Computation
[6]  
[Anonymous], PATTERN RECOGNITION
[7]  
[Anonymous], EXPL RPG CHAR MEC AN
[8]  
[Anonymous], ROCKVR VID CAPT
[9]  
Azary S, 2012, 2012 WESTERN NEW YORK IMAGE PROCESSING WORKSHOP (WNYIPW), P5, DOI 10.1109/WNYIPW.2012.6466646
[10]   A general tensor representation framework for cross-view gait recognition [J].
Ben, Xianye ;
Zhang, Peng ;
Lai, Zhihui ;
Yan, Rui ;
Zhai, Xinliang ;
Meng, Weixiao .
PATTERN RECOGNITION, 2019, 90 :87-98