Frequency domain adaptive framework for visible-infrared person re-identification

被引:0
|
作者
Wang, Jiangcheng [1 ]
Li, Yize [2 ]
Tao, Xuefeng [3 ]
Kong, Jun [3 ]
机构
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Peoples R China
[2] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[3] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
Visible-infrared person re-identification; Cross modality; Frequency domain; Clustering;
D O I
10.1007/s13042-024-02408-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The visible-infrared person re-identification task aims to achieve mutual retrieval between infrared images and visible images. The primary challenge is to learn the mapping of these two modalities into a common latent space. Prior works have mainly focused on network feature extraction, but have overlooked the local information of high-frequency channel features, the global information of low-frequency channel features, and the interaction effects between them, all of which are crucial for effectively aligning feature spaces and enhancing cross-modal recognition accuracy, robustness, and overall performance. To address this issue, we propose a frequency domain adaptive framework. Specifically, we designed the frequency domain adaptive encoder to achieve frequency domain adaptation. And the diverse wise embedding was designed to efficiently extract multi-scale features with fewer parameters. Additionally, we proposed the similarity distance clustering strategy, which reduces the large gaps between different modalities by minimizing the KL divergence between visible-infrared similarity distributions images and the normalized label clustering distributions. Our method has been proven superior on two public datasets and achieves state-of-the-art performance on the RegDB dataset.
引用
收藏
页码:2553 / 2566
页数:14
相关论文
共 50 条
  • [11] Margin-Based Modal Adaptive Learning for Visible-Infrared Person Re-Identification
    Zhao, Qianqian
    Wu, Hanxiao
    Zhu, Jianqing
    SENSORS, 2023, 23 (03)
  • [12] Unbiased Feature Learning with Causal Intervention for Visible-Infrared Person Re-Identification
    Yuan, Bo wen
    Lu, Jiahao
    You, Sisi
    Bao, Bing-kun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
  • [13] Visible-infrared person re-identification using query related cluster
    Zhao Q.
    Wu H.
    Huang L.
    Zhu J.
    Zeng H.
    High Technology Letters, 2023, 29 (02) : 194 - 205
  • [14] Stronger Heterogeneous Feature Learning for Visible-Infrared Person Re-Identification
    Hao Wang
    Xiaojun Bi
    Changdong Yu
    Neural Processing Letters, 56
  • [15] Homogeneous and heterogeneous relational graph for visible-infrared person re-identification
    Feng, Yujian
    Chen, Feng
    Yu, Jian
    Ji, Yimu
    Wu, Fei
    Liu, Shangdon
    Jing, Xiao-Yuan
    PATTERN RECOGNITION, 2025, 158
  • [16] Stronger Heterogeneous Feature Learning for Visible-Infrared Person Re-Identification
    Wang, Hao
    Bi, Xiaojun
    Yu, Changdong
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [17] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
    Jiang, Kongzhu
    Zhang, Tianzhu
    Liu, Xiang
    Qian, Bingqiao
    Zhang, Yongdong
    Wu, Feng
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
  • [18] Unveiling the Power of CLIP in Unsupervised Visible-Infrared Person Re-Identification
    Chen, Zhong
    Zhang, Zhizhong
    Tan, Xin
    Qu, Yanyun
    Xie, Yuan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3667 - 3675
  • [19] Unified Conditional Image Generation for Visible-Infrared Person Re-Identification
    Pan, Honghu
    Pei, Wenjie
    Li, Xin
    He, Zhenyu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 9026 - 9038
  • [20] A guidance and alignment transformer model for visible-infrared person re-identification
    Huang, Linyu
    Xue, Zijie
    Ning, Qian
    Guo, Yong
    Li, Yongsheng
    MULTIMEDIA SYSTEMS, 2025, 31 (02)