Cross-View Gait Recognition by Discriminative Feature Learning

被引:106
作者
Zhang, Yuqi [1 ]
Huang, Yongzhen [1 ]
Yu, Shiqi [2 ]
Wang, Liang [1 ]
机构
[1] Univ Chinese Acad Sci, Inst Automat, Ctr Res Intelligent Percept & Comp, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Gait recognition; Feature extraction; Three-dimensional displays; Generative adversarial networks; Face recognition; Deep learning; Clothing; discriminative feature learning; angle center loss; spatial-temporal features; DEEP; MOTION; MODEL;
D O I
10.1109/TIP.2019.2926208
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep learning-based cross-view gait recognition has become popular owing to the strong capacity of convolutional neural networks (CNNs). Current deep learning methods often rely on loss functions used widely in the task of face recognition, e.g., contrastive loss and triplet loss. These loss functions have the problem of hard negative mining. In this paper, a robust, effective, and gait-related loss function, called angle center loss (ACL), is proposed to learn discriminative gait features. The proposed loss function is robust to different local parts and temporal window sizes. Different from center loss which learns a center for each identity, the proposed loss function learns multiple sub-centers for each angle of the same identity. Only the largest distance between the anchor feature and the corresponding cross-view sub-centers is penalized, which achieves better intra-subject compactness. We also propose to extract discriminative spatialtemporal features by local feature extractors and a temporal attention model. A simplified spatial transformer network is proposed to localize the suitable horizontal parts of the human body. Local gait features for each horizontal part are extracted and then concatenated as the descriptor. We introduce long short-term memory (LSTM) units as the temporal attention model to learn the attention score for each frame, e.g., focusing more on discriminative frames and less on frames with bad quality. The temporal attention model shows better performance than the temporal average pooling or gait energy images (GEI). By combing the three aspects, we achieve state-of-the-art results on several cross-view gait recognition benchmarks.
引用
收藏
页码:1001 / 1015
页数:15
相关论文
共 70 条
[1]   Improving Gait Recognition with 3D Pose Estimation [J].
An, Weizhi ;
Liao, Rijun ;
Yu, Shiqi ;
Huang, Yongzhen ;
Yuen, Pong C. .
BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 :137-147
[2]  
[Anonymous], 2017, Regularizing neural networks by penalizing confident output distributions
[3]  
[Anonymous], 2016, IEEE T PATTERN ANAL
[4]  
Ariyanto G., 2011, P INT JOINT C BIOM, P1, DOI [10.1109/IJCB.2011.6117582, DOI 10.1109/IJCB.2011.6117582]
[5]   Gait-Based Human Identification by Combining Shallow Convolutional Neural Network-Stacked Long Short-Term Memory and Deep Convolutional Neural Network [J].
Batchuluun, Ganbayar ;
Yoon, Hyo Sik ;
Kang, Jin Kyu ;
Park, Kang Ryoung .
IEEE ACCESS, 2018, 6 :63164-63186
[6]  
Battistone F., PATTERN RECOGNIT LET
[7]   Coupled Bilinear Discriminant Projection for Cross-View Gait Recognition [J].
Ben, Xianye ;
Gong, Chen ;
Zhang, Peng ;
Yan, Rui ;
Wu, Qiang ;
Meng, Weixiao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (03) :734-747
[8]   Coupled Patch Alignment for Matching Cross-View Gaits [J].
Ben, Xianye ;
Gong, Chen ;
Zhang, Peng ;
Jia, Xitong ;
Wu, Qiang ;
Meng, Weixiao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) :3142-3157
[9]   Cross-View Discriminative Feature Learning for Person Re-Identification [J].
Borgia, Alessandro ;
Hua, Yang ;
Kodirov, Elyor ;
Robertson, Neil M. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) :5338-5349
[10]   On Using Gait in Forensic Biometrics [J].
Bouchrika, Imed ;
Goffredo, Michaela ;
Carter, John ;
Nixon, Mark .
JOURNAL OF FORENSIC SCIENCES, 2011, 56 (04) :882-889