Learning view-invariant features using stacked autoencoder for skeleton-based gait recognition

Cited by: 6
Authors
Hasan, Md Mahedi [1 ]
Mustafa, Hossen Asiful [1 ]
Affiliations
[1] Bangladesh Univ Engn & Technol BUET, Inst Informat & Commun Technol, Dhaka, Bangladesh
DOI
10.1049/cvi2.12050
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Human gait recognition in a multicamera environment is a challenging task in biometrics because of large variations in pose and illumination across different views. In this work, to address the problem of view variation, we present a novel stacked autoencoder for learning discriminant view-invariant gait representations. Our autoencoder can efficiently and progressively translate skeleton joint coordinates from any arbitrary view to a common canonical view without requiring prior estimation of the view angle or covariate type and without losing temporal information. We then construct a discriminative gait feature vector by fusing the encoded features with two other spatiotemporal gait features and feed it into the main recurrent neural network. Experimental evaluations on the challenging CASIA A and CASIA B gait datasets demonstrate that the proposed approach outperforms other state-of-the-art methods on single-view gait recognition. In particular, the proposed method achieved 46.31% and 33.86% average correct class recognition on the probe sets ProbeBG and ProbeCL, respectively, of the CASIA B dataset while considering the view variation; this is 0.3% and 30.68% higher than the previous best-performing methods. Furthermore, in cross-view recognition, our method shows better results than other state-of-the-art methods when the view-angle variation is larger than 36 degrees.
Pages: 527-545
Number of pages: 19
Related papers
43 in total
[1]  
[Anonymous], 2013, ARXIV, DOI 10.48550/ARXIV.1308.0850
[2]  
Ariyanto G., 2011, P INT JOINT C BIOM, P1, DOI 10.1109/IJCB.2011.6117582
[3]   View-Invariant Gait Representation Using Joint Bayesian Regularized Non-negative Matrix Factorization [J].
Babaee, Maryam ;
Rigoll, Gerhard .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2583-2589
[4]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[5]   Multi-Gait Recognition Based on Attribute Discovery [J].
Chen, Xin ;
Weng, Jian ;
Lu, Wei ;
Xu, Jiaming .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (07) :1697-1710
[6]  
Chung J., 2014, NIPS 2014 WORKSH DEE, DOI 10.48550/ARXIV.1412.3555
[7]  
Goffredo M, 2008, 2008 IEEE SECOND INTERNATIONAL CONFERENCE ON BIOMETRICS: THEORY, APPLICATIONS AND SYSTEMS (BTAS), P154
[8]   Framewise phoneme classification with bidirectional LSTM and other neural network architectures [J].
Graves, A ;
Schmidhuber, J .
NEURAL NETWORKS, 2005, 18 (5-6) :602-610
[9]   Individual recognition using Gait Energy Image [J].
Han, J ;
Bhanu, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (02) :316-322
[10]   View-Invariant Discriminative Projection for Multi-View Gait-Based Human Identification [J].
Hu, Maodi ;
Wang, Yunhong ;
Zhang, Zhaoxiang ;
Little, James J. ;
Huang, Di .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2013, 8 (12) :2034-2045