Gait metric learning siamese network exploiting dual of spatio-temporal 3D-CNN intra and LSTM based inter gait-cycle-segment features

被引：16

作者：

Thapar, Daksh ^{[1
]}

Jaswal, Gaurav ^{[1
]}

Nigam, Aditya ^{[1
]}

Arora, Chetan ^{[2
]}

机构：

[1] Indian Inst Technol, Mandi, Himachal Prades, India

[2] Indian Inst Technol, Delhi, India

来源：

PATTERN RECOGNITION LETTERS | 2019年 / 125卷

关键词：

Gait biometrics; Deep learning; 3-D Convolutional neural network; LSTM; Siamese; RECOGNITION;

D O I：

10.1016/j.patrec.2019.07.008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Gait recognition is a non-invasive biometric technology that can be used to identify humans in surveillance systems. It is based on the style or manner in which a person walk and can be realized with minimal amount of individual cooperation for its acquisition. However, it may causes many challenges in the form of varying viewpoints, carrying conditions and clothing variations. To tackle such limitations, we present a view-invariant gait recognition network that divide the gait cycle into five segments (GCS). The intra gait-cycle-segment (GCS) convolutional spatio-temporal relationships has been obtained by employing a 3D-CNN via. transfer learning mechanism. Later, a stacked LSTM has been trained over spatio-temporal features to learn the long and short relationship between inter gait-cycle-segment. The first step in our work is data pre-processing, in which we create silhouette stereo map (SSM) from the binary silhouettes of the gait video frame and sampled each video into a fixed 80 frames. These 80 frames SSM have been divided into 5 gait-cycle-segments (GCS) of 16 frames each. From each of these GCS, we extract spatio-temporal features using a pre-trained 3-D CNN. These features have been concatenated temporally, and an LSTM cell is used to learn the long-term dependencies between each GCS. Finally, the required class scores are computed by averaging (to handle noise) the output generated by LSTM. The network is trained in an end-to-end fashion using triplet loss function so as to learn the gait metric well using only the hard triplets. All the experiments are carried out on publicly available CASIA-B and OU-ISIR gait dataset. From the experimental results, it has been indicated that the proposed network performs better than the current state-of-the-art gait recognition systems. (C) 2019 Elsevier B.V. All rights reserved.

引用

页码：646 / 653

页数：8

共 25 条

[1] Improved gait recognition based on specialized deep convolutional neural network
Alotaibi, Munif
Mahmood, Ausif
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 164 : 103 - 110
[2] [Anonymous], ARXIV181206271
[3] [Anonymous], 2016, P 2016 ICB HALMST, DOI DOI 10.1109/ICB.2016.7550060
[4] [Anonymous], PATTERN RECOGNIT
[5] [Anonymous], GAITGAN INVARIANT GA
[6] Gait recognition without subject cooperation
Bashir, Khalid
Xiang, Tao
Gong, Shaogang
[J]. PATTERN RECOGNITION LETTERS, 2010, 31 (13) : 2052 - 2060
[7] Feng Y, 2016, INT C PATT RECOG, P325, DOI 10.1109/ICPR.2016.7899654
[8] Glorot X., 2010, P 13 INT C ART INT S, P249
[9] Individual recognition using Gait Energy Image
Han, J
Bhanu, B
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (02) : 316 - 322
[10] Heng Wang, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3169, DOI 10.1109/CVPR.2011.5995407

← 1 2 3 →