Learning deep part-aware emb e dding for person retrieval

被引:20
作者
Zhao, Yang [1 ,2 ]
Shen, Chunhua [2 ]
Yu, Xiaohan [1 ]
Chen, Hao [2 ]
Gao, Yongsheng [1 ]
Xiong, Shengwu [3 ]
机构
[1] Griffith Univ, Sch Engn, Nathan, Qld, Australia
[2] Univ Adelaide, Adelaide, SA, Australia
[3] Wuhan Univ Technol, Sch Comp Sci Technol, Wuhan, Hubei, Peoples R China
基金
澳大利亚研究理事会;
关键词
Person retrieval; Part-aware embedding; Improved triplet loss;
D O I
10.1016/j.patcog.2021.107938
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person retrieval is an important vision task, aiming at matching the images of the same person under various camera views. The key challenge of person retrieval lies in the large intra-class variations among the person images. Therefore, how to learn discriminative feature representations becomes the core problem. In this paper, we propose a deep part-aware representation learning method for person retrieval. First, an improved triplet loss is introduced such that the global feature representations from the same identity are closely clustered. Meanwhile, a localization branch is proposed to automatically localize those discriminative person-wise parts or regions, only using identity labels in a weakly supervised manner. Via the learning simultaneously guided by the global branch and the localization branch, the proposed method can further improve the performance for person retrieval. Through an extensive set of ablation studies, we verify that the localization branch and the improved triplet loss each contributes to the performance boosts of the proposed method. Our model obtains superior (or comparable) performance compared to state-of-the-art methods for person retrieval on the four public person retrieval datasets. On the CUHK03-labeled dataset, for instance, the performance increases from 73.0% mAP and 77.9% rank-1 accuracy to 80.8% (+7.8%) mAP and 83.9% (+6.0%) rank-1 accuracy. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 43 条
  • [1] Deep-Person: Learning discriminative deep features for person Re-Identification
    Bai, Xiang
    Yang, Mingkun
    Huang, Tengteng
    Dou, Zhiyong
    Yu, Rui
    Xu, Yongchao
    [J]. PATTERN RECOGNITION, 2020, 98
  • [2] Multi-Level Factorisation Net for Person Re-Identification
    Chang, Xiaobin
    Hospedales, Timothy M.
    Xiang, Tao
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2109 - 2118
  • [3] Beyond triplet loss: a deep quadruplet network for person re-identification
    Chen, Weihua
    Chen, Xiaotang
    Zhang, Jianguo
    Huang, Kaiqi
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1320 - 1329
  • [4] Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function
    Cheng, De
    Gong, Yihong
    Zhou, Sanping
    Wang, Jinjun
    Zheng, Nanning
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1335 - 1344
  • [5] Batch DropBlock Network for Person Re-identification and Beyond
    Dai, Zuozhuo
    Chen, Mingqiang
    Gu, Xiaodong
    Zhu, Siyu
    Tan, Ping
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3690 - 3700
  • [6] SphereRelD: Deep hypersphere manifold embedding for person re-identification
    Fan, Xing
    Jiang, Wei
    Luo, Hao
    Fei, Mengjuan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 51 - 58
  • [7] Felzenszwalb P, 2008, PROC CVPR IEEE, P1
  • [8] Hadsell R., 2006, IEEE C COMP VIS PATT, P1735, DOI DOI 10.1109/CVPR.2006.100
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Hermans A., ARXIV PREPRINT ARXIV