Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification

Cited by: 2
Authors
Li, Zhiyong [1]
Liu, Haojie [1]
Peng, Xiantao [2]
Jiang, Wei [1]
Affiliations
[1] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Hengyi Global Innovat Res Ctr, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Cameras; Noise measurement; Task analysis; Training; Surveillance; Lighting; Unsupervised learning; Visible-infrared person re-identification; Label refinement
DOI
10.1109/TKDE.2024.3367304
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Unsupervised visible-infrared person re-identification (USVI-ReID) is a valuable yet under-explored task that addresses person retrieval across modalities without annotations. USVI-ReID faces large intra- and inter-modality discrepancies as well as pseudo-label noise. In this paper, we propose a novel learning approach termed the Spectrum-Cam Aware and Refined Cross-Pair learning strategy (SCA-RCP), designed to tackle these issues concurrently. The Dual-Spectrum Augmentation (DSA) method is first presented to mine inter-modality knowledge between the visible and infrared modalities by expanding the diversity of spectra for both modalities. Concurrently, to learn intra-modality knowledge, we divide the clusters into camera-based proxies and introduce the Cross-modal Cam-Aware Proxy learning method (CCAP) to pull together camera-based proxies that correspond to the same identity across modalities. Finally, to mitigate clustering noise, the innovative Refined Cross-Pair learning strategy (RCP) is devised, comprising the Intra-modality Label Refinement (ILR) and Bi-directional Cross Pairing (BCP) methods. ILR calculates the most likely label for each instance by employing the intra-modal cluster memories as a classifier, while BCP establishes dependable cross-paired labels in a bidirectional manner. Extensive experiments on cross-modality datasets demonstrate the superior performance of our model over state-of-the-art methods.
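The label-refinement and cross-pairing ideas in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the sketch assumes that cluster memories are plain feature vectors, that "classifier" means picking the memory with the highest cosine similarity, and that "bidirectional" pairing can be approximated by keeping mutual best matches between visible and infrared clusters. The function names `refine_labels` and `bidirectional_pairs` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def refine_labels(features, memories):
    """ILR-style step (assumed form): relabel each instance with the index
    of the most similar cluster memory in its own modality."""
    return [
        max(range(len(memories)), key=lambda k: cosine(f, memories[k]))
        for f in features
    ]


def bidirectional_pairs(vis_memories, ir_memories):
    """BCP-style step (assumed form): keep a (visible, infrared) cluster pair
    only if each cluster is the other's best match."""
    vis_to_ir = [
        max(range(len(ir_memories)), key=lambda j: cosine(v, ir_memories[j]))
        for v in vis_memories
    ]
    ir_to_vis = [
        max(range(len(vis_memories)), key=lambda i: cosine(r, vis_memories[i]))
        for r in ir_memories
    ]
    return [(i, j) for i, j in enumerate(vis_to_ir) if ir_to_vis[j] == i]


# Toy data: two visible clusters, three infrared clusters.
vis = [[1.0, 0.0], [0.0, 1.0]]
ir = [[0.9, 0.1], [0.1, 0.9], [0.8, 0.2]]
pairs = bidirectional_pairs(vis, ir)
labels = refine_labels([[0.9, 0.2], [0.1, 1.0]], vis)
```

In the toy data the third infrared cluster prefers the first visible cluster, but that preference is not returned, so it stays unpaired. Dropping such one-sided matches is one simple way to keep unreliable cross-modality labels out of training.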
Pages: 3934-3947
Page count: 14