Identity-Guided Human Semantic Parsing for Person Re-identification

被引:253
作者
Zhu, Kuan [1 ,2 ]
Guo, Haiyun [1 ]
Liu, Zhiwei [1 ,2 ]
Tang, Ming [1 ,3 ]
Wang, Jinqiao [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] Shenzhen Infinova Ltd, Shenzhen, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT III | 2020年 / 12348卷
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Person re-ID; Weakly-supervised human parsing; Aligned representation learning; NETWORK;
D O I
10.1007/978-3-030-58580-8_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing alignment-based methods have to employ the pre-trained human parsing models to achieve the pixel-level alignment, and cannot identify the personal belongings (e.g., backpacks and reticule) which are crucial to person re-ID. In this paper, we propose the identity-guided human semantic parsing approach (ISP) to locate both the human body parts and personal belongings at pixel-level for aligned person re-ID only with person identity labels. We design the cascaded clustering on feature maps to generate the pseudo-labels of human parts. Specifically, for the pixels of all images of a person, we first group them to foreground or background and then group the foreground pixels to human parts. The cluster assignments are subsequently used as pseudo-labels of human parts to supervise the part estimation and ISP iteratively learns the feature maps and groups them. Finally, local features of both human body parts and personal belongings are obtained according to the self-learned part estimation, and only features of visible parts are utilized for the retrieval. Extensive experiments on three widely used datasets validate the superiority of ISP over lots of state-of-the-art methods. Our code is available at https://github.com/CASIA-IVA-Lab/ISP-reID.
引用
收藏
页码:346 / 363
页数:18
相关论文
共 60 条
[1]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[2]   Mixed High-Order Attention Network for Person Re-Identification [J].
Chen, Binghui ;
Deng, Weihong ;
Hu, Jiani .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :371-381
[3]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[4]  
Ge YX, 2018, ADV NEUR IN, V31
[5]   DensePose: Dense Human Pose Estimation In The Wild [J].
Guler, Riza Alp ;
Neverova, Natalia ;
Kokkinos, Lasonas .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7297-7306
[6]   Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification [J].
Guo, Jianyuan ;
Yuan, Yuhui ;
Huang, Lang ;
Zhang, Chao ;
Yao, Jin-Ge ;
Han, Kai .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3641-3650
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
He LX, 2018, Arxiv, DOI arXiv:1810.07399
[9]   Deep Spatial Feature Reconstruction for Partial Person Re-identification: Alignment-free Approach [J].
He, Lingxiao ;
Liang, Jian ;
Li, Haiqing ;
Sun, Zhenan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7073-7082
[10]  
Hermans A, 2017, Arxiv, DOI arXiv:1703.07737