Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification

被引：2

作者：

Li, Zhiyong ^{[1
]}

Liu, Haojie ^{[1
]}

Peng, Xiantao ^{[2
]}

Jiang, Wei ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China

[2] Zhejiang Univ, Hengyi Global Innovat Res Ctr, Hangzhou 310027, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2024年 / 36卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Cameras; Noise measurement; Task analysis; Training; Surveillance; Lighting; Unsupervised learning; visible-infrared person re-identification; label refinement;

D O I：

10.1109/TKDE.2024.3367304

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Unsupervised visible-infrared person re-identification (USVI-ReID) is a valuable yet under-explored task that addresses the challenge of person retrieval from different modalities without annotations. USVI-ReID is faced with large intra and inter modality discrepancy as well as pseudo label noises. In this paper, we propose a novel learning approach termed Spectrum-Cam Aware and Refined Cross-Pair learning strategy (SCA-RCP), crafted to concurrently tackle these issues. The Dual-Spectrum Augmentation (DSA) method is first presented to mine inter-modality knowledge between the visible and infrared modalities by expanding the diversity of spectra for both modalities. Concurrently, to learn intra-modality knowledge, we divide the clusters into camera-based proxies and introduce the Cross-modal Cam-Aware Proxy learning method (CCAP) to pull together camera-based proxies that correspond to the same identity across modalities. Finally, to mitigate clustering noise, the innovative Refined Cross-Pair learning strategy (RCP) is devised comprising the Intra-modality Label Refinement (ILR) and Bi-directional Cross Pairing (BCP) method. ILR calculates maximum-likely label for each instance by employing the intra-modal cluster memories as a classifier, while BCP establish dependable cross-paired labels in a bidirectional manner. Extensive experiments on the cross-modality datasets demonstrate the superior performance of our model over the state-of-art method.

引用

页码：3934 / 3947

页数：14

共 50 条

[21] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
Jiang, Kongzhu
Zhang, Tianzhu
Liu, Xiang
Qian, Bingqiao
Zhang, Yongdong
Wu, Feng
COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
[22] Shallow-Deep Collaborative Learning for Unsupervised Visible-Infrared Person Re-Identification
Yang, Bin
Chen, Jun
Ye, Mang
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16870 - 16879
[23] Dual Consistency-Constrained Learning for Unsupervised Visible-Infrared Person Re-Identification
Yang, Bin
Chen, Jun
Chen, Cuiqun
Ye, Mang
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 1767 - 1779
[24] Image-text feature learning for unsupervised visible-infrared person re-identification
Guo, Jifeng
Pang, Zhiqi
IMAGE AND VISION COMPUTING, 2025, 158
[25] Bidirectional modality information interaction for Visible-Infrared Person Re-identification
Yang, Xi
Liu, Huanling
Wang, Nannan
Gao, Xinbo
PATTERN RECOGNITION, 2025, 161
[26] Unveiling the Power of CLIP in Unsupervised Visible-Infrared Person Re-Identification
Chen, Zhong
Zhang, Zhizhong
Tan, Xin
Qu, Yanyun
Xie, Yuan
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3667 - 3675
[27] Knowledge self-distillation for visible-infrared cross-modality person re-identification
Yu Zhou
Rui Li
Yanjing Sun
Kaiwen Dong
Song Li
Applied Intelligence, 2022, 52 : 10617 - 10631
[28] Knowledge self-distillation for visible-infrared cross-modality person re-identification
Zhou, Yu
Li, Rui
Sun, Yanjing
Dong, Kaiwen
Li, Song
APPLIED INTELLIGENCE, 2022, 52 (09) : 10617 - 10631
[29] Dual-attentive cascade clustering learning for visible-infrared person re-identification
Wang, Xianju
Chen, Cuiqun
Zhu, Yong
Chen, Shuguang
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19729 - 19746
[30] Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
Liang, Tengfei
Jin, Yi
Liu, Wu
Li, Yidong
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8432 - 8444

← 1 2 3 4 5 →