SFCFusion: Spatial-Frequency Collaborative Infrared and Visible Image Fusion

被引:13
作者
Chen, Hanrui [1 ,2 ,3 ]
Deng, Lei [1 ,2 ,3 ]
Chen, Zhixiang [4 ]
Liu, Chenhua [1 ,2 ,3 ]
Zhu, Lianqing [1 ,2 ,3 ]
Dong, Mingli [1 ,2 ,3 ]
Lu, Xitian [1 ,2 ,3 ]
Guo, Chentong [1 ,2 ,3 ]
机构
[1] Beijing Informat Sci & Technol Univ, Minist Educ Optoelect Measurement Technol & Instru, Key Lab, Beijing 100192, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Beijing Lab Opt Fiber Sensing & Syst, Beijing 100192, Peoples R China
[3] Guangzhou Nansha Intelligent Photon Sensing Res In, Guangzhou 511462, Guangdong, Peoples R China
[4] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, England
关键词
Deep learning; image fusion; multiscale transformation (MST); spatial-frequency; visible-infrared image; NONSUBSAMPLED CONTOURLET TRANSFORM; WAVELET; NETWORK;
D O I
10.1109/TIM.2024.3370752
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Infrared images can provide prominent targets based on the radiation difference, making them suitable for use in all day and night conditions. On the other hand, visible images can offer texture details with high spatial resolution. Infrared and visible image fusion is promising to achieve the best of both. Conventional frequency or spatial multiscale transformation (MST) methods are good at preserving image details. Deep-learning-based methods become more and more popular in image fusion because they can preserve high-level semantic features. To tackle the challenge in extracting and fusing cross-modality and cross-domain information, we propose a spatial-frequency collaborative fusion (SFCFusion) framework that effectively fuses spatial and frequency information in the feature space. In the frequency domain, source images are decomposed into base and detail layers with existing frequency decomposition methods. In the spatial domain, a kernel-based saliency generation module is designed to preserve spatial region-level structural information. A deep-learning-based encoder is used to extract features from the source images, decomposed images, and saliency maps. In the shared feature space, we achieve cross-modality SFCFusion through our proposed adaptive fusion scheme. We have conducted experiments to compare our SFCFusion with both the conventional and deep learning approaches on the TNO, LLVIP, and M3FD datasets. The qualitative and quantitative evaluation results demonstrate the effectiveness of our SFCFusion. We have further demonstrated the superiority of our SFCFusion in the downstream detection task. Our code will be available at https://github.com/ChenHanrui430/SFCFusion.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
[41]   Visible and Infrared Image Fusion Using Deep Learning [J].
Zhang, Xingchen ;
Demiris, Yiannis .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) :10535-10554
[42]   A MODEL FOR PERCEIVED SPATIAL-FREQUENCY AND SPATIAL-FREQUENCY DISCRIMINATION [J].
YAGER, D ;
KRAMER, P .
VISION RESEARCH, 1991, 31 (06) :1067-1072
[43]   A multi-weight fusion framework for infrared and visible image fusion [J].
Zhou, Yiqiao ;
He, Kangjian ;
Xu, Dan ;
Shi, Hongzhen ;
Zhang, Hao .
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (27) :68931-68957
[44]   L2FUSION: LOW-LIGHT ORIENTED INFRARED AND VISIBLE IMAGE FUSION [J].
Gao, Xiang ;
Lv, Guohua ;
Dong, Aimei ;
Wei, Zhonghe ;
Cheng, Jinyong .
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, :2405-2409
[45]   Infrared and Visible Image Fusion Based on Image Enhancement and Target Extraction [J].
Zhu, Haoran ;
Zhang, Wenying .
IEEE ACCESS, 2025, 13 :61862-61875
[46]   FOCUS FUSION NETWORK FOR VISIBLE AND INFRARED IMAGE FUSION [J].
Zhang, Yihan ;
Fang, Yichu ;
Zhang, Qian .
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, :3850-3854
[47]   TCCFusion: An infrared and visible image fusion method based on transformer and cross correlation [J].
Tang, Wei ;
He, Fazhi ;
Liu, Yu .
PATTERN RECOGNITION, 2023, 137
[48]   Infrared and Visible Image Fusion Method via Interactive Self-attention [J].
Yang Fan ;
Wang Zhishe ;
Sun Jing ;
Yu Zhaofa .
ACTA PHOTONICA SINICA, 2024, 53 (06)
[49]   Infrared and visible image fusion via dual encoder based on dense connection [J].
Lu, Quan ;
Zhang, Hongbin ;
Yin, Linfei .
PATTERN RECOGNITION, 2025, 163
[50]   BTSFusion: Fusion of infrared and visible image via a mechanism of balancing texture and salience [J].
Qian, Yao ;
Liu, Gang ;
Tang, Haojie ;
Xing, Mengliang ;
Chang, Rui .
OPTICS AND LASERS IN ENGINEERING, 2024, 173