A Discriminatively Learned CNN Embedding for Person Reidentification

被引:619
作者
Zheng, Zhedong [1 ]
Zheng, Liang [1 ]
Yang, Yi [2 ]
机构
[1] Univ Technol Sydney, 15 Broadway, Ultimo, NSW 2007, Australia
[2] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, 4 South Fourth St, Beijing 100190, Peoples R China
关键词
Person reidentification; convolutional neural networks;
D O I
10.1145/3159171
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we revisit two popular convolutional neural networks in person re-identification (re-ID): verification and identification models. The two models have their respective advantages and limitations due to different loss functions. Here, we shed light on how to combine the two models to learn more discriminative pedestrian descriptors. Specifically, we propose a Siamese network that simultaneously computes the identification loss and verification loss. Given a pair of training images, the network predicts the identities of the two input images and whether they belong to the same identity. Our network learns a discriminative embedding and a similarity measurement at the same time, thus taking full usage of the re-ID annotations. Our method can be easily applied on different pretrained networks. Albeit simple, the learned embedding improves the state-of-the-art performance on two public person re-ID benchmarks. Further, we show that our architecture can also be applied to image retrieval. The code is available at https://github.com/layumi/2016_person_re-ID.
引用
收藏
页数:20
相关论文
共 59 条
[1]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299016
[2]   Neural Codes for Image Retrieval [J].
Babenko, Artem ;
Slesarev, Anton ;
Chigorin, Alexandr ;
Lempitsky, Victor .
COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599
[3]  
Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
[4]   Semisupervised Feature Analysis by Mining Correlations Among Multiple Tasks [J].
Chang, Xiaojun ;
Yang, Yi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) :2294-2305
[5]   Semantic Pooling for Complex Event Analysis in Untrimmed Videos [J].
Chang, Xiaojun ;
Yu, Yao-Liang ;
Yang, Yi ;
Xing, Eric P. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) :1617-1632
[6]   Similarity Learning with Spatial Constraints for Person Re-identification [J].
Chen, Dapeng ;
Yuan, Zejian ;
Chen, Badong ;
Zheng, Nanning .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1268-1277
[7]   Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function [J].
Cheng, De ;
Gong, Yihong ;
Zhou, Sanping ;
Wang, Jinjun ;
Zheng, Nanning .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1335-1344
[8]   Total recall: Automatic query expansion with a generative feature model for object retrieval [J].
Chum, Ondrej ;
Philbin, James ;
Sivic, Josef ;
Isard, Michael ;
Zisserman, Andrew .
2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :496-+
[9]   Discriminative Dictionary Learning With Common Label Alignment for Cross-Modal Retrieval [J].
Deng, Cheng ;
Tang, Xu ;
Yan, Junchi ;
Liu, Wei ;
Gao, Xinbo .
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (02) :208-218
[10]   Deep feature learning with relative distance comparison for person re-identification [J].
Ding, Shengyong ;
Lin, Liang ;
Wang, Guangrun ;
Chao, Hongyang .
PATTERN RECOGNITION, 2015, 48 (10) :2993-3003