Human Semantic Parsing for Person Re-identification

被引:491
作者
Kalayeh, Mahdi M. [1 ]
Basaran, Emrah [2 ]
Gokmen, Muhittin [3 ]
Kamasak, Mustafa E. [2 ]
Shah, Mubarak [1 ]
机构
[1] Univ Cent Florida, Ctr Res Comp Vis, Orlando, FL 32816 USA
[2] Istanbul Tech Univ, Dept Comp Engn, Istanbul, Turkey
[3] MEF Univ, Dept Comp Engn, Istanbul, Turkey
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification is a challenging task mainly due to factors such as background clutter, pose, illumination and camera point of view variations. These elements hinder the process of extracting robust and discriminative representations, hence preventing different identities from being successfully distinguished. To improve the representation learning, usually local features from human body parts are extracted. However, the common practice for such a process has been based on bounding box part detection. In this paper, we propose to adopt human semantic parsing which, due to its pixel-level accuracy and capability of modeling arbitrary contours, is naturally a better alternative. Our proposed SPReID integrates human semantic parsing in person re-identification and not only considerably outperforms its counter baseline, but achieves state-of-the-art performance. We also show that, by employing a simple yet effective training strategy, standard popular deep convolutional architectures such as Inception-V3 and ResNet-152, with no modification, while operating solely on full image, can dramatically outperform current state-of-the-art. Our proposed methods improve state-of-the-art person re-identification on: Market-1501 [48] by similar to 17% in mAP and similar to 6% in rank-1, CUHK03 [24] by similar to 4% in rank-1 and DukeMTMC-reID [50] by similar to 24% in mAP and similar to 10% in rank-1.
引用
收藏
页码:1062 / 1071
页数:10
相关论文
共 52 条
[1]  
[Anonymous], 2014, SHINPUHKAN 2014 MULT
[2]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299016
[3]  
[Anonymous], 2013, COMPUTER VISION ACCV, DOI 10.1007/978-3-642-37331-23
[4]  
[Anonymous], 2017, IEEE transactions on pattern analysis and machine intelligence, DOI [10.1109/TPAMI.2016.2644615, DOI 10.1109/TPAMI.2016.2644615]
[5]  
Bai S, 2017, AAAI CONF ARTIF INTE, P1281
[6]  
Baltieri D., 2011, P 2011 JOINT ACM WOR, P59
[7]  
Bengio Y, 2013, INT CONF ACOUST SPEE, P8624, DOI 10.1109/ICASSP.2013.6639349
[8]  
Cai Y, 2017, PROCEEDINGS OF THE ASME POWER CONFERENCE JOINT WITH ICOPE-17, 2017, VOL 2
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]   Attention to Scale: Scale-aware Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Yang, Yi ;
Wang, Jiang ;
Xu, Wei ;
Yuille, Alan L. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3640-3649