Proxy-Based Embedding Alignment for RGB-Infrared Person Re-Identification

被引:0
作者
Dou, Zhaopeng [1 ]
Sun, Yifan [2 ]
Li, Yali [1 ]
Wang, Shengjin [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Baidu Inc, Beijing 100084, Peoples R China
来源
TSINGHUA SCIENCE AND TECHNOLOGY | 2025年 / 30卷 / 03期
关键词
Measurement; Training; Feature extraction; Vectors; Task analysis; Protocols; Optimization; cross-modality person re-identification; feature alignment; cycle consistency; metric learning;
D O I
10.26599/TST.2023.9010113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RGB-Infrared person re-IDentification (re-ID) aims to match RGB and infrared (IR) images of the same person. However, the modality discrepancy between RGB and IR images poses a significant challenge for re-ID. To address this issue, this paper proposes a Proxy-based Embedding Alignment (PEA) method to align the RGB and IR modalities in the embedding space. PEA introduces modality-specific identity proxies and leverages the sample-to-proxy relations to learn the model. Specifically, PEA focuses on three types of alignments: intra-modality alignment, inter-modality alignment, and cycle alignment. Intra-modality alignment aims to align sample features and proxies of the same identity within a modality. Inter-modality alignment aims to align sample features and proxies of the same identity across different modalities. Cycle alignment requires that a proxy is aligned with itself after tracing it along a cross-modality cycle (e.g., IR -> RGB -> IR). By integrating these alignments into the training process, PEA effectively mitigates the impact of modality discrepancy and learns discriminative features across modalities. We conduct extensive experiments on several RGB-IR re-ID datasets, and the results show that PEA outperforms current state-of-the-art methods. Notably, on SYSU-MM01 dataset, PEA achieves 71.0% mAP under the multi-shot setting of the indoor-search protocol, surpassing the best-performing method by 7.2%.
引用
收藏
页码:1112 / 1124
页数:13
相关论文
共 55 条
[1]   Ensemble Deep Manifold Similarity Learning using Hard Proxies [J].
Aziere, Nicolas ;
Todorovic, Sinisa .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7291-7299
[2]  
Cao Y, 2017, AAAI CONF ARTIF INTE, P3974
[3]   Neural Feature Search for RGB-Infrared Person Re-Identification [J].
Chen, Yehansen ;
Wan, Lin ;
Li, Zhihang ;
Jing, Qianyan ;
Sun, Zongyuan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :587-597
[4]   Part-based Pseudo Label Refinement for Unsupervised Person Re-identification [J].
Cho, Yoonki ;
Kim, Woo Jae ;
Hong, Seunghoon ;
Yoon, Sung-Eui .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :7298-7308
[5]  
Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677
[6]  
Dou Z., 2022, P IEEE INT C AC SPEE, P2718
[7]   Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval [J].
Dou, Zhaopeng ;
Wang, Zhongdao ;
Chen, Weihua ;
Li, Yali ;
Wang, Shengjin .
COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 :588-605
[8]   Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification [J].
Feng, Zhanxiang ;
Lai, Jianhuang ;
Xie, Xiaohua .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :579-590
[9]   Unsupervised Monocular Depth Estimation with Left-Right Consistency [J].
Godard, Clement ;
Mac Aodha, Oisin ;
Brostow, Gabriel J. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6602-6611
[10]  
Goldberger J., 2004, NIPS