A Two Stream Siamese Convolutional Neural Network For Person Re-Identification

被引:200
作者
Chung, Dahjung [1 ]
Tahboub, Khalid [1 ]
Delp, Edward J. [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, Video & Image Proc Lab VIPER, W Lafayette, IN 47907 USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification is an important task in video surveillance systems. It can be formally defined as establishing the correspondence between images of a person taken from different cameras at different times. In this paper, we present a two stream convolutional neural network where each stream is a Siamese network. This architecture can learn spatial and temporal information separately. We also propose a weighted two stream training objective function which combines the Siamese cost of the spatial and temporal streams with the objective of predicting a person's identity. We evaluate our proposed method on the publicly available PRID2011 and iLIDS-VID datasets and demonstrate the efficacy of our proposed method. On average, the top rank matching accuracy is 4% higher than the accuracy achieved by the cross-view quadratic discriminant analysis used in combination with the hierarchical Gaussian descriptor (GOG+XQDA), and 5% higher than the recurrent neural network method.
引用
收藏
页码:1992 / 2000
页数:9
相关论文
共 41 条
[1]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299016
[2]   A survey of approaches and trends in person re-identification [J].
Bedagkar-Gala, Apurva ;
Shah, Shishir K. .
IMAGE AND VISION COMPUTING, 2014, 32 (04) :270-286
[3]  
Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
[4]   Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function [J].
Cheng, De ;
Gong, Yihong ;
Zhou, Sanping ;
Wang, Jinjun ;
Zheng, Nanning .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1335-1344
[5]  
Cisco, 2016, White paper
[6]  
Davis J.V., 2007, P 24 INT C MACHINE L, P209, DOI DOI 10.1145/1273496.1273523
[7]  
Delgado B., 2016, P IEEE WINT C APPL C, P1
[8]  
Gong SG, 2014, ADV COMPUT VIS PATT, P1, DOI 10.1007/978-1-4471-6296-4_1
[9]   Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features [J].
Gray, Douglas ;
Tao, Hai .
COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 :262-275
[10]  
Hirzer M, 2012, LECT NOTES COMPUT SC, V7577, P780, DOI 10.1007/978-3-642-33783-3_56