Frequency domain adaptive framework for visible-infrared person re-identification

被引：0

作者：

Wang, Jiangcheng ^{[1
]}

Li, Yize ^{[2
]}

Tao, Xuefeng ^{[3
]}

Kong, Jun ^{[3
]}

机构：

[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Peoples R China

[2] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China

[3] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2024年

基金：

中国国家自然科学基金;

关键词：

Visible-infrared person re-identification; Cross modality; Frequency domain; Clustering;

D O I：

10.1007/s13042-024-02408-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The visible-infrared person re-identification task aims to achieve mutual retrieval between infrared images and visible images. The primary challenge is to learn the mapping of these two modalities into a common latent space. Prior works have mainly focused on network feature extraction, but have overlooked the local information of high-frequency channel features, the global information of low-frequency channel features, and the interaction effects between them, all of which are crucial for effectively aligning feature spaces and enhancing cross-modal recognition accuracy, robustness, and overall performance. To address this issue, we propose a frequency domain adaptive framework. Specifically, we designed the frequency domain adaptive encoder to achieve frequency domain adaptation. And the diverse wise embedding was designed to efficiently extract multi-scale features with fewer parameters. Additionally, we proposed the similarity distance clustering strategy, which reduces the large gaps between different modalities by minimizing the KL divergence between visible-infrared similarity distributions images and the normalized label clustering distributions. Our method has been proven superior on two public datasets and achieves state-of-the-art performance on the RegDB dataset.

引用

页码：2553 / 2566

页数：14

共 50 条

[11] Margin-Based Modal Adaptive Learning for Visible-Infrared Person Re-Identification
Zhao, Qianqian
Wu, Hanxiao
Zhu, Jianqing
SENSORS, 2023, 23 (03)
[12] Unbiased Feature Learning with Causal Intervention for Visible-Infrared Person Re-Identification
Yuan, Bo wen
Lu, Jiahao
You, Sisi
Bao, Bing-kun
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
[13] Visible-infrared person re-identification using query related cluster
Zhao Q.
Wu H.
Huang L.
Zhu J.
Zeng H.
High Technology Letters, 2023, 29 (02) : 194 - 205
[14] Stronger Heterogeneous Feature Learning for Visible-Infrared Person Re-Identification
Hao Wang
Xiaojun Bi
Changdong Yu
Neural Processing Letters, 56
[15] Homogeneous and heterogeneous relational graph for visible-infrared person re-identification
Feng, Yujian
Chen, Feng
Yu, Jian
Ji, Yimu
Wu, Fei
Liu, Shangdon
Jing, Xiao-Yuan
PATTERN RECOGNITION, 2025, 158
[16] Stronger Heterogeneous Feature Learning for Visible-Infrared Person Re-Identification
Wang, Hao
Bi, Xiaojun
Yu, Changdong
NEURAL PROCESSING LETTERS, 2024, 56 (02)
[17] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
Jiang, Kongzhu
Zhang, Tianzhu
Liu, Xiang
Qian, Bingqiao
Zhang, Yongdong
Wu, Feng
COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
[18] Unveiling the Power of CLIP in Unsupervised Visible-Infrared Person Re-Identification
Chen, Zhong
Zhang, Zhizhong
Tan, Xin
Qu, Yanyun
Xie, Yuan
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3667 - 3675
[19] Unified Conditional Image Generation for Visible-Infrared Person Re-Identification
Pan, Honghu
Pei, Wenjie
Li, Xin
He, Zhenyu
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 9026 - 9038
[20] A guidance and alignment transformer model for visible-infrared person re-identification
Huang, Linyu
Xue, Zijie
Ning, Qian
Guo, Yong
Li, Yongsheng
MULTIMEDIA SYSTEMS, 2025, 31 (02)

← 1 2 3 4 5 →