Frequency domain adaptive framework for visible-infrared person re-identification

被引：0

作者：

Wang, Jiangcheng ^{[1
]}

Li, Yize ^{[2
]}

Tao, Xuefeng ^{[3
]}

Kong, Jun ^{[3
]}

机构：

[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Peoples R China

[2] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China

[3] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2025年 / 16卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Visible-infrared person re-identification; Cross modality; Frequency domain; Clustering;

D O I：

10.1007/s13042-024-02408-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The visible-infrared person re-identification task aims to achieve mutual retrieval between infrared images and visible images. The primary challenge is to learn the mapping of these two modalities into a common latent space. Prior works have mainly focused on network feature extraction, but have overlooked the local information of high-frequency channel features, the global information of low-frequency channel features, and the interaction effects between them, all of which are crucial for effectively aligning feature spaces and enhancing cross-modal recognition accuracy, robustness, and overall performance. To address this issue, we propose a frequency domain adaptive framework. Specifically, we designed the frequency domain adaptive encoder to achieve frequency domain adaptation. And the diverse wise embedding was designed to efficiently extract multi-scale features with fewer parameters. Additionally, we proposed the similarity distance clustering strategy, which reduces the large gaps between different modalities by minimizing the KL divergence between visible-infrared similarity distributions images and the normalized label clustering distributions. Our method has been proven superior on two public datasets and achieves state-of-the-art performance on the RegDB dataset.

引用

页码：2553 / 2566

页数：14

共 50 条

[41] Video-Based Visible-Infrared Person Re-Identification With Auxiliary Samples [J].

Du, Yunhao ;

Lei, Cheng ;

Zhao, Zhicheng ;

Dong, Yuan ;

Su, Fei .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :1313-1325

[42] Modality-perceptive harmonization network for visible-infrared person re-identification [J].

Zuo, Xutao ;

Peng, Jinjia ;

Cheng, Tianhang ;

Wang, Huibing .

INFORMATION FUSION, 2025, 118

[43] Exploring modality enhancement and compensation spaces for visible-infrared person re-identification [J].

Cheng, Xu ;

Deng, Shuya ;

Yu, Hao .

IMAGE AND VISION COMPUTING, 2024, 146

[44] Dual-Semantic Consistency Learning for Visible-Infrared Person Re-Identification [J].

Zhang, Yiyuan ;

Kang, Yuhao ;

Zhao, Sanyuan ;

Shen, Jianbing .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 :1554-1565

[45] A cross-modality person re-identification method for visible-infrared images [J].

Sun Y. ;

Wang R. ;

Zhang Q. ;

Lin R. .

Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (06) :2018-2025

[46] Identity Consistency Construction for Visible-Infrared Person Re-identification in Cloud Environment [J].

Wang, Yiming ;

Xu, Kaixiong ;

Chai, Yi ;

Li, Shuo ;

Jiang, Yutao ;

Liu, Bowen .

PROCEEDINGS OF 2023 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL III, 2023, :799-807

[47] Cascaded Cross-modal Alignment for Visible-Infrared Person Re-Identification [J].

Li, Zhaohui ;

Wang, Qiangchang ;

Chen, Lu ;

Zhang, Xinxin ;

Yin, Yilong .

KNOWLEDGE-BASED SYSTEMS, 2024, 305

[48] Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification [J].

Chen, Cuiqun ;

Ye, Mang ;

Qi, Meibin ;

Wu, Jingjing ;

Jiang, Jianguo ;

Lin, Chia-Wen .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :2352-2364

[49] Learning dual attention enhancement feature for visible-infrared person re-identification [J].

Zhang, Guoqing ;

Zhang, Yinyin ;

Zhang, Hongwei ;

Chen, Yuhao ;

Zheng, Yuhui .

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99

[50] Multi-Stage Auxiliary Learning for Visible-Infrared Person Re-Identification [J].

Zhang, Huadong ;

Cheng, Shuli ;

Du, Anyu .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) :12032-12047

← 1 2 3 4 5 →