Proxy-Based Embedding Alignment for RGB-Infrared Person Re-Identification

被引：0

作者：

Dou, Zhaopeng ^{[1
]}

Sun, Yifan ^{[2
]}

Li, Yali ^{[1
]}

Wang, Shengjin ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

[2] Baidu Inc, Beijing 100084, Peoples R China

来源：

TSINGHUA SCIENCE AND TECHNOLOGY | 2025年 / 30卷 / 03期

关键词：

Measurement; Training; Feature extraction; Vectors; Task analysis; Protocols; Optimization; cross-modality person re-identification; feature alignment; cycle consistency; metric learning;

D O I：

10.26599/TST.2023.9010113

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

RGB-Infrared person re-IDentification (re-ID) aims to match RGB and infrared (IR) images of the same person. However, the modality discrepancy between RGB and IR images poses a significant challenge for re-ID. To address this issue, this paper proposes a Proxy-based Embedding Alignment (PEA) method to align the RGB and IR modalities in the embedding space. PEA introduces modality-specific identity proxies and leverages the sample-to-proxy relations to learn the model. Specifically, PEA focuses on three types of alignments: intra-modality alignment, inter-modality alignment, and cycle alignment. Intra-modality alignment aims to align sample features and proxies of the same identity within a modality. Inter-modality alignment aims to align sample features and proxies of the same identity across different modalities. Cycle alignment requires that a proxy is aligned with itself after tracing it along a cross-modality cycle (e.g., IR -> RGB -> IR). By integrating these alignments into the training process, PEA effectively mitigates the impact of modality discrepancy and learns discriminative features across modalities. We conduct extensive experiments on several RGB-IR re-ID datasets, and the results show that PEA outperforms current state-of-the-art methods. Notably, on SYSU-MM01 dataset, PEA achieves 71.0% mAP under the multi-shot setting of the indoor-search protocol, surpassing the best-performing method by 7.2%.

引用

页码：1112 / 1124

页数：13

共 55 条

[1] Ensemble Deep Manifold Similarity Learning using Hard Proxies [J].

Aziere, Nicolas ;

Todorovic, Sinisa .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7291-7299

[2]

Cao Y, 2017, AAAI CONF ARTIF INTE, P3974

[3] Neural Feature Search for RGB-Infrared Person Re-Identification [J].

Chen, Yehansen ;

Wan, Lin ;

Li, Zhihang ;

Jing, Qianyan ;

Sun, Zongyuan .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :587-597

[4] Part-based Pseudo Label Refinement for Unsupervised Person Re-identification [J].

Cho, Yoonki ;

Kim, Woo Jae ;

Hong, Seunghoon ;

Yoon, Sung-Eui .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :7298-7308

[5]

Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677

[6]

Dou Z., 2022, P IEEE INT C AC SPEE, P2718

[7] Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval [J].

Dou, Zhaopeng ;

Wang, Zhongdao ;

Chen, Weihua ;

Li, Yali ;

Wang, Shengjin .

COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 :588-605

[8] Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification [J].

Feng, Zhanxiang ;

Lai, Jianhuang ;

Xie, Xiaohua .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :579-590

[9] Unsupervised Monocular Depth Estimation with Left-Right Consistency [J].

Godard, Clement ;

Mac Aodha, Oisin ;

Brostow, Gabriel J. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6602-6611

[10]

Goldberger J., 2004, NIPS

← 1 2 3 4 5 6 →