Video Person Re-Identification by Temporal Residual Learning

被引:95
作者
Dai, Ju [1 ]
Zhang, Pingping [1 ]
Wang, Dong [1 ]
Lu, Huchuan [1 ]
Wang, Hongyu [1 ]
机构
[1] Dalian Univ Technol, Fac Elect Informat & Elect Engn, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
关键词
Person re-identification; spatial-temporal transformation; temporal residual learning;
D O I
10.1109/TIP.2018.2878505
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel feature learning framework for video person re-identification (re-ID). The proposed framework largely aims to exploit the adequate temporal information of video sequences and tackle the poor spatial alignment of moving pedestrians. More specifically, for exploiting the temporal information, we design a temporal residual learning (TRL) module to simultaneously extract the generic and specific features of consecutive frames. The TRL module is equipped with two bi-directional LSTM (BiLSTM), which are, respectively, responsible to describe a moving person in different aspects, providing complementary information for better feature representations. To deal with the poor spatial alignment in video re-ID data sets, we propose a spatial-temporal transformer network ((STN)-N-2) module. Transformation parameters in the (STN)-N-2 module are learned by leveraging the high-level semantic information of the current frame as well as the temporal context knowledge from other frames. The proposed (STN)-N-2 module with less learnable parameters allows effective person alignments under significant appearance changes. Extensive experimental results on the large-scale MARS, PRID2011, ILIDS-VID, and SDU-VID data sets demonstrate that the proposed method achieves consistently superior performance and outperforms most of the very recent state-of-the-art methods.
引用
收藏
页码:1366 / 1377
页数:12
相关论文
共 46 条
[1]  
[Anonymous], 2017, CVPR
[2]  
[Anonymous], 2018, P 27 INT JOINT C ART
[3]  
[Anonymous], 2015, RECURRENT SPATIAL TR
[4]  
[Anonymous], 2017, ARXIV170307737
[5]  
[Anonymous], 2016, DEEP RECURRENT CONVO
[6]  
[Anonymous], 2015, PROC 28 INT C NEURAL
[7]   The relation between the ROC curve and the CMC [J].
Bolle, RM ;
Connell, JH ;
Pankanti, S ;
Ratha, NK ;
Senior, AW .
FOURTH IEEE WORKSHOP ON AUTOMATIC IDENTIFICATION ADVANCED TECHNOLOGIES, PROCEEDINGS, 2005, :15-20
[8]   Person Re-Identification by Camera Correlation Aware Feature Augmentation [J].
Chen, Ying-Cong ;
Zhu, Xiatian ;
Zheng, Wei-Shi ;
Lai, Jian-Huang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (02) :392-408
[9]   Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function [J].
Cheng, De ;
Gong, Yihong ;
Zhou, Sanping ;
Wang, Jinjun ;
Zheng, Nanning .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1335-1344
[10]   Improving Person Re-identification via Pose-aware Multi-shot Matching [J].
Cho, Yeong-Jun ;
Yoon, Kuk-Jin .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1354-1362