Frequency Domain Nuances Mining for Visible-Infrared Person Re-Identification

Times cited: 0
Authors
Zhang, Yukang [1, 2]
Wang, Hanzi [1, 2]
Lu, Yang [1, 2]
Yan, Yan [1, 2]
Li, Xuelong [3, 4]
Affiliations
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Minist Educ China, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Key Lab Multimedia Trusted Percept & Efficient Com, Minist Educ China, Xiamen, Peoples R China
[3] Inst Artificial Intelligence TeleAI, Beijing 100033, Peoples R China
[4] Inst Artificial Intelligence TeleAI, Shanghai 200030, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Frequency-domain analysis; Fast Fourier transforms; Face recognition; Image reconstruction; Feature extraction; Data mining; Training; Surveillance; Generators; Artificial intelligence; VIReID; frequency domain; nuances mining; VIS-IR image; face recognition; HETEROGENEOUS FACE RECOGNITION;
DOI
10.1109/TIFS.2025.3569176
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper focuses on the visible-infrared person re-identification (VIReID) task, which is essential for information forensics and security because it enables accurate person re-identification under low-light or nighttime conditions. The primary challenge in VIReID is to reduce the modality discrepancy between visible and infrared images. Current methods mainly rely on spatial information and often neglect the discriminative potential of frequency information. To address this issue, this paper aims to mitigate the modality discrepancy from a frequency domain perspective. Specifically, we propose a novel Frequency Domain Nuances Mining (FDNM) method, which mainly includes a Salience-guided Phase Enhancement (SPE) module and an Amplitude Nuances Mining (ANM) module, to effectively explore cross-modality frequency domain information. The two modules complement each other in jointly exploring frequency-domain visible-infrared nuances, thereby significantly reducing the modality discrepancy in the frequency domain. Additionally, we propose a Center-guided Nuances Mining (CNM) loss to ensure that the ANM module retains discriminative identity information while discovering diverse cross-modality nuances. Extensive experiments show that the proposed FDNM has significant advantages in improving VIReID performance. For instance, our method outperforms the second-best method by 5.2% in Rank-1 accuracy and 5.8% in mAP on the SYSU-MM01 dataset under the indoor search mode. Furthermore, we demonstrate the effectiveness and generalization of the proposed FDNM method on the challenging visible-infrared face recognition task.
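The abstract refers to the amplitude and phase components of an image's frequency representation. As a rough illustration only, the sketch below splits a 2-D array into amplitude and phase with NumPy's FFT and recombines them. It is not the FDNM, SPE, or ANM implementation, which this record does not describe in detail. The function names `split_amplitude_phase` and `merge_amplitude_phase` and the random stand-in arrays are assumptions made for this example.

```python
import numpy as np


def split_amplitude_phase(image):
    """Return the amplitude and phase spectra of a 2-D array (hypothetical helper)."""
    spectrum = np.fft.fft2(image)
    return np.abs(spectrum), np.angle(spectrum)


def merge_amplitude_phase(amplitude, phase):
    """Rebuild a spatial-domain array from amplitude and phase spectra (hypothetical helper)."""
    spectrum = amplitude * np.exp(1j * phase)
    # The inputs come from real-valued images, so the imaginary part is numerical noise.
    return np.real(np.fft.ifft2(spectrum))


# Random arrays stand in for a visible and an infrared image (assumption: no real data here).
rng = np.random.default_rng(0)
visible = rng.random((8, 8))
infrared = rng.random((8, 8))

amp_v, pha_v = split_amplitude_phase(visible)
amp_i, pha_i = split_amplitude_phase(infrared)

# Recombining an image's own amplitude and phase recovers the original array.
reconstructed = merge_amplitude_phase(amp_v, pha_v)

# Pairing the infrared amplitude with the visible phase yields a different array.
mixed = merge_amplitude_phase(amp_i, pha_v)
```

Swapping one component while keeping the other is a common way to inspect what each spectrum contributes; how FDNM itself processes phase and amplitude is specified in the paper, not here.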
Pages: 5411-5424
Page count: 14