Embedding Deep Metric for Person Re-identification: A Study Against Large Variations

被引:226
作者
Shi, Hailin [1 ,2 ,3 ]
Yang, Yang [1 ,2 ,3 ]
Zhu, Xiangyu [1 ,2 ,3 ]
Liao, Shengcai [1 ,2 ,3 ]
Lei, Zhen [1 ,2 ,3 ]
Zheng, Weishi [4 ]
Li, Stan Z. [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Ctr Biometr & Secur Res, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
来源
COMPUTER VISION - ECCV 2016, PT I | 2016年 / 9905卷
关键词
Person re-identification; Deep learning; CNN; DIMENSIONALITY REDUCTION;
D O I
10.1007/978-3-319-46448-0_44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification is challenging due to the large variations of pose, illumination, occlusion and camera view. Owing to these variations, the pedestrian data is distributed as highly-curved manifolds in the feature space, despite the current convolutional neural networks (CNN)'s capability of feature extraction. However, the distribution is unknown, so it is difficult to use the geodesic distance when comparing two samples. In practice, the current deep embedding methods use the Euclidean distance for the training and test. On the other hand, the manifold learning methods suggest to use the Euclidean distance in the local range, combining with the graphical relationship between samples, for approximating the geodesic distance. From this point of view, selecting suitable positive (i.e. intra-class) training samples within a local range is critical for training the CNN embedding, especially when the data has large intra-class variations. In this paper, we propose a novel moderate positive sample mining method to train robust CNN for person re-identification, dealing with the problem of large variation. In addition, we improve the learning by a metric weight constraint, so that the learned metric has a better generalization ability. Experiments show that these two strategies are effective in learning robust deep metrics for person re-identification, and accordingly our deep model significantly outperforms the state-of-the-art methods on several benchmarks of person re-identification. Therefore, the study presented in this paper may be useful in inspiring new designs of deep models for person re-identification.
引用
收藏
页码:732 / 748
页数:17
相关论文
共 43 条
[1]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299016
[2]  
[Anonymous], 2013, COMPUTER VISION ACCV, DOI 10.1007/978-3-642-37331-23
[3]  
[Anonymous], ARXIV14064444
[4]  
[Anonymous], 2007, INT WORKSH PERF EV T
[5]  
[Anonymous], P BRIT MACH VIS
[6]  
Bak S., 2011, Proceedings of the 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2011), P179, DOI 10.1109/AVSS.2011.6027316
[7]   Multiple-shot person re-identification by chromatic and epitomic analyses [J].
Bazzani, Loris ;
Cristani, Marco ;
Perina, Alessandro ;
Murino, Vittorio .
PATTERN RECOGNITION LETTERS, 2012, 33 (07) :898-903
[8]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[9]  
Davis J.V., 2007, P 24 INT C MACHINE L, P209, DOI DOI 10.1145/1273496.1273523
[10]   Deep feature learning with relative distance comparison for person re-identification [J].
Ding, Shengyong ;
Lin, Liang ;
Wang, Guangrun ;
Chao, Hongyang .
PATTERN RECOGNITION, 2015, 48 (10) :2993-3003