Frequency Domain Nuances Mining for Visible-Infrared Person Re-Identification

被引:0
作者
Zhang, Yukang [1 ,2 ]
Wang, Hanzi [1 ,2 ]
Lu, Yang [1 ,2 ]
Yan, Yan [1 ,2 ]
Li, Xuelong [3 ,4 ]
机构
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Minist Educ China, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Key Lab Multimedia Trusted Percept & Efficient Com, Minist Educ China, Xiamen, Peoples R China
[3] Inst Artificial Intelligence TeleAI, Beijing 100033, Peoples R China
[4] Inst Artificial Intelligence TeleAI, Shanghai 200030, Peoples R China
基金
中国国家自然科学基金;
关键词
Frequency-domain analysis; Fast Fourier transforms; Face recognition; Image reconstruction; Feature extraction; Data mining; Training; Surveillance; Generators; Artificial intelligence; VIReID; frequency domain; nuances mining; VIS-IR image; face recognition; HETEROGENEOUS FACE RECOGNITION;
D O I
10.1109/TIFS.2025.3569176
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper focuses on the visible-infrared person re-identification (VIReID) task, which is essential for information forensics and security as it enables accurate person re-identification across low-light or nighttime conditions. The primary challenge in the VIReID task is to reduce the modality discrepancy between visible and infrared images. Current methods mainly utilize the spatial information, often neglecting the discriminative potential of frequency information. To address this issue, this paper aims to mitigate the modality discrepancy from a frequency domain perspective. Specifically, we propose a novel Frequency Domain Nuances Mining (FDNM) method, which mainly includes a Salience-guided Phase Enhancement (SPE) module and an Amplitude Nuances Mining (ANM) module, to effectively explore the cross-modality frequency domain information. These two modules are mutually beneficial to jointly explore frequency-domain visible-infrared nuances, thereby significantly reducing the modality discrepancy in the frequency domain. Additionally, we propose a Center-guided Nuances Mining (CNM) loss to ensure that the ANM module retains discriminative identity information while discovering diverse cross-modality nuances. Extensive experiments show that the proposed FDNM has significant advantages in improving the performance of VIReID. For instance, our method respectively outperforms the second-best method by 5.2% in Rank-1 accuracy and 5.8% in mAP on the SYSU-MM01 dataset under the indoor search mode. Furthermore, we also demonstrate the effectiveness and generalization of the proposed FDNM method in the challenging visible-infrared face recognition task.
引用
收藏
页码:5411 / 5424
页数:14
相关论文
共 79 条
[41]  
Tan L, 2023, Arxiv, DOI [arXiv:2302.00884, DOI 10.48550/ARXIV.2302.00884]
[42]   A Fourier-Based Semantic Augmentation for Visible-Thermal Person Re-Identification [J].
Tan, Xiaoheng ;
Chai, Yanxia ;
Chen, Fenglei ;
Liu, Haijun .
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :1684-1688
[43]  
Ulyanov D, 2017, Arxiv, DOI arXiv:1607.08022
[44]  
van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
[45]   Spatial-Frequency Mutual Learning for Face Super-Resolution [J].
Wang, Chenyang ;
Jiang, Junjun ;
Zhong, Zhiwei ;
Liu, Xianming .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22356-22366
[46]  
Wang GA, 2020, AAAI CONF ARTIF INTE, V34, P12144
[47]   RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment [J].
Wang, Guan'an ;
Zhang, Tianzhu ;
Cheng, Jian ;
Liu, Si ;
Yang, Yang ;
Hou, Zengguang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3622-3631
[48]   Learning to Reduce Dual-level Discrepancy for Infrared-Visible Person Re-identification [J].
Wang, Zhixiang ;
Wang, Zheng ;
Zheng, Yinqiang ;
Chuang, Yung-Yu ;
Satoh, Shin'ichi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :618-626
[49]   Co-Attentive Lifting for Infrared-Visible Person Re-Identification [J].
Wei, Xing ;
Li, Diangang ;
Hong, Xiaopeng ;
Ke, Wei ;
Gong, Yihong .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :1028-1037
[50]   Dual-Adversarial Representation Disentanglement for Visible Infrared Person Re-Identification [J].
Wei, Ziyu ;
Yang, Xi ;
Wang, Nannan ;
Gao, Xinbo .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :2186-2200