Prototype-Guided Attention Distillation for Discriminative Person Search

被引:0
作者
Kim, Hanjae [1 ]
Lee, Jiyoung [2 ]
Sohn, Kwanghoon [1 ,3 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul 03722, South Korea
[2] NAVER AI Lab, Seongnam 13561, South Korea
[3] Korea Inst Sci & Technol KIST, Seoul 02792, South Korea
基金
新加坡国家研究基金会;
关键词
Prototypes; Transformers; Proposals; Detectors; Training; Noise; Head; Person search; person re-identification; attention distillation; NETWORK;
D O I
10.1109/TPAMI.2024.3461778
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search aims to localize a person of interest in a large image gallery captured by multiple, non-overlapping cameras. Prevalent unified methods have suffered from (1) noisy proposals with mis-detection and occlusion, and (2) large appearance variation within a class, which deteriorates the prototype-based metric learning. To address these problems, we introduce a Prototype-guided Attention Distillation, shortly PAD, which exploits a prototype (a typical representation of an identity) as a guidance to the attention module to consistently highlight identity-inherent regions across different poses. To utilize the knowledge encoded in prototypes for matching unseen IDs, PAD conducts attention distillation to guide student Re-ID queries by deeply mimicking attention maps from the prototype query. Additionally, to address large intra-class variation induced by pose or camera views, we extend PAD with multiple part prototypes representing consistent local regions across different instances. Furthermore, we exploit an adaptive momentum strategy for robust attention distillation in PAD to update more distinct prototypes. Extensive experiments conducted on CUHK-SYSU and PRW demonstrate the effectiveness of PAD, showcasing state-of-the-art performance. Moreover, our distilled attention surprisingly highlights distinguished multiple regions for person search.
引用
收藏
页码:99 / 115
页数:17
相关论文
共 121 条
  • [51] Diverse Part Discovery: Occluded Person Re-identification with Part-Aware Transformer
    Li, Yulin
    He, Jianfeng
    Zhang, Tianzhu
    Liu, Xiang
    Zhang, Yongdong
    Wu, Feng
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2897 - 2906
  • [52] Online Knowledge Distillation for Efficient Pose Estimation
    Li, Zheng
    Ye, Jingwen
    Song, Mingli
    Huang, Ying
    Pan, Zhigeng
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11720 - 11730
  • [53] Li ZJ, 2021, AAAI CONF ARTIF INTE, V35, P2011
  • [54] Few-shot Learning with Noisy Labels
    Liang, Kevin J.
    Rangrej, Samrudhdhi B.
    Petrovic, Vladan
    Hassner, Tal
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9079 - 9088
  • [55] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 318 - 327
  • [56] Neural Person Search Machines
    Liu, Hao
    Feng, Jiashi
    Jie, Zequn
    Jayashree, Karlekar
    Zhao, Bo
    Qi, Meibin
    Jiang, Jianguo
    Yan, Shuicheng
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 493 - 501
  • [57] A ConvNet for the 2020s
    Liu, Zhuang
    Mao, Hanzi
    Wu, Chao-Yuan
    Feichtenhofer, Christoph
    Darrell, Trevor
    Xie, Saining
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11966 - 11976
  • [58] Loshchilov I, 2019, Arxiv, DOI [arXiv:1711.05101, DOI 10.48550/ARXIV.1711.05101]
  • [59] Bag of Tricks and A Strong Baseline for Deep Person Re-identification
    Luo, Hao
    Gu, Youzhi
    Liao, Xingyu
    Lai, Shenqi
    Jiang, Wei
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1487 - 1495
  • [60] Miyato T, 2018, Arxiv, DOI arXiv:1802.05957