Frequency domain adaptive framework for visible-infrared person re-identification

被引:0
作者
Wang, Jiangcheng [1 ]
Li, Yize [2 ]
Tao, Xuefeng [3 ]
Kong, Jun [3 ]
机构
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Peoples R China
[2] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[3] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
Visible-infrared person re-identification; Cross modality; Frequency domain; Clustering;
D O I
10.1007/s13042-024-02408-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The visible-infrared person re-identification task aims to achieve mutual retrieval between infrared images and visible images. The primary challenge is to learn the mapping of these two modalities into a common latent space. Prior works have mainly focused on network feature extraction, but have overlooked the local information of high-frequency channel features, the global information of low-frequency channel features, and the interaction effects between them, all of which are crucial for effectively aligning feature spaces and enhancing cross-modal recognition accuracy, robustness, and overall performance. To address this issue, we propose a frequency domain adaptive framework. Specifically, we designed the frequency domain adaptive encoder to achieve frequency domain adaptation. And the diverse wise embedding was designed to efficiently extract multi-scale features with fewer parameters. Additionally, we proposed the similarity distance clustering strategy, which reduces the large gaps between different modalities by minimizing the KL divergence between visible-infrared similarity distributions images and the normalized label clustering distributions. Our method has been proven superior on two public datasets and achieves state-of-the-art performance on the RegDB dataset.
引用
收藏
页码:2553 / 2566
页数:14
相关论文
共 50 条
[41]   Video-Based Visible-Infrared Person Re-Identification With Auxiliary Samples [J].
Du, Yunhao ;
Lei, Cheng ;
Zhao, Zhicheng ;
Dong, Yuan ;
Su, Fei .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :1313-1325
[42]   Modality-perceptive harmonization network for visible-infrared person re-identification [J].
Zuo, Xutao ;
Peng, Jinjia ;
Cheng, Tianhang ;
Wang, Huibing .
INFORMATION FUSION, 2025, 118
[43]   Exploring modality enhancement and compensation spaces for visible-infrared person re-identification [J].
Cheng, Xu ;
Deng, Shuya ;
Yu, Hao .
IMAGE AND VISION COMPUTING, 2024, 146
[44]   Dual-Semantic Consistency Learning for Visible-Infrared Person Re-Identification [J].
Zhang, Yiyuan ;
Kang, Yuhao ;
Zhao, Sanyuan ;
Shen, Jianbing .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 :1554-1565
[45]   A cross-modality person re-identification method for visible-infrared images [J].
Sun Y. ;
Wang R. ;
Zhang Q. ;
Lin R. .
Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (06) :2018-2025
[46]   Identity Consistency Construction for Visible-Infrared Person Re-identification in Cloud Environment [J].
Wang, Yiming ;
Xu, Kaixiong ;
Chai, Yi ;
Li, Shuo ;
Jiang, Yutao ;
Liu, Bowen .
PROCEEDINGS OF 2023 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL III, 2023, :799-807
[47]   Cascaded Cross-modal Alignment for Visible-Infrared Person Re-Identification [J].
Li, Zhaohui ;
Wang, Qiangchang ;
Chen, Lu ;
Zhang, Xinxin ;
Yin, Yilong .
KNOWLEDGE-BASED SYSTEMS, 2024, 305
[48]   Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification [J].
Chen, Cuiqun ;
Ye, Mang ;
Qi, Meibin ;
Wu, Jingjing ;
Jiang, Jianguo ;
Lin, Chia-Wen .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :2352-2364
[49]   Learning dual attention enhancement feature for visible-infrared person re-identification [J].
Zhang, Guoqing ;
Zhang, Yinyin ;
Zhang, Hongwei ;
Chen, Yuhao ;
Zheng, Yuhui .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99
[50]   Multi-Stage Auxiliary Learning for Visible-Infrared Person Re-Identification [J].
Zhang, Huadong ;
Cheng, Shuli ;
Du, Anyu .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) :12032-12047