SIGFusion: Semantic Information-Guided Infrared and Visible Image Fusion

被引:6
作者
Lv, Guohua [1 ,2 ,3 ]
Sima, Chaoqun [2 ]
Gao, Yongbiao [1 ,2 ,3 ]
Dong, Aimei [1 ,2 ,3 ]
Ma, Guangxiao [4 ]
Cheng, Jinyong [1 ,2 ,3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan,Minist Educ,Key Lab Comp, Jinan 250316, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250316, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Serv Comp, Shandong Prov Key Lab Comp Power Internet, Jinan 250000, Peoples R China
[4] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
关键词
Image fusion; Semantics; Visualization; Training; Data mining; Computer science; Object detection; Deep learning; high-level vision task; image fusion; infrared image; mask; self-attention; visible image; NETWORK; ENSEMBLE;
D O I
10.1109/TIM.2024.3457951
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the field of image fusion, the fusion of infrared and visible images emerges as a crucial work, retaining the inherent characteristics of each source image for the creation of high-quality fusion images. However, existing fusion algorithms primarily prioritize the visual performance and statistical metrics, often neglecting the requisites of high-level vision tasks. To bridge this gap, this article proposes SIGFusion, an end-to-end method for infrared and visible image fusion. SIGFusion not only considers the visual performance and statistical metrics of the fused image but also focuses on meeting the demands of subsequent high-level vision tasks, thereby enhancing performance in fusion and high-level vision tasks. Concretely, we define significant semantic information as salient targets presenting in both infrared and visible images. By utilizing targets, our fusion network is guided to enhance these salient targets. In contrast to the isolated employ of either infrared or visible images, our target-based method comprehensively utilizes the semantic information within the image. In addition, the utilization of target-based self-attention loss enhances the extraction of vital information from the source image. It is worth noting that masks are only necessary during the training stage and are automatically generated through detection algorithms. We conduct experiments in both image fusion and high-level vision tasks. Experimental results demonstrate the effectiveness of SIGFusion in both infrared and visible image fusion tasks, as well as in subsequent high-level vision tasks. The source code is available at: https://github.com/poison-pig/SIGFusion.
引用
收藏
页数:18
相关论文
共 63 条
[31]   STDFusionNet: An Infrared and Visible Image Fusion Network Based on Salient Target Detection [J].
Ma, Jiayi ;
Tang, Linfeng ;
Xu, Meilong ;
Zhang, Hao ;
Xiao, Guobao .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[32]   DDcGAN: A Dual-Discriminator Conditional Generative Adversarial Network for Multi-Resolution Image Fusion [J].
Ma, Jiayi ;
Xu, Han ;
Jiang, Junjun ;
Mei, Xiaoguang ;
Zhang, Xiao-Ping .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :4980-4995
[33]   Infrared and visible image fusion via detail preserving adversarial learning [J].
Ma, Jiayi ;
Liang, Pengwei ;
Yu, Wei ;
Chen, Chen ;
Guo, Xiaojie ;
Wu, Jia ;
Jiang, Junjun .
INFORMATION FUSION, 2020, 54 :85-98
[34]   FusionGAN: A generative adversarial network for infrared and visible image fusion [J].
Ma, Jiayi ;
Yu, Wei ;
Liang, Pengwei ;
Li, Chang ;
Jiang, Junjun .
INFORMATION FUSION, 2019, 48 :11-26
[35]   Infrared and visible image fusion methods and applications: A survey [J].
Ma, Jiayi ;
Ma, Yong ;
Li, Chang .
INFORMATION FUSION, 2019, 45 :153-178
[36]   Airborne Infrared and Visible Image Fusion for Target Perception Based on Target Region Segmentation and Discrete Wavelet Transform [J].
Niu, Yifeng ;
Xu, Shengtao ;
Wu, Lizhen ;
Hu, Weidong .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2012, 2012
[37]  
Paszke A, 2019, ADV NEUR IN, V32
[38]   DeepFuse: A Deep Unsupervised Approach for Exposure Fusion with Extreme Exposure Image Pairs [J].
Prabhakar, K. Ram ;
Srikar, V. Sai ;
Babu, R. Venkatesh .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4724-4732
[39]   BTSFusion: Fusion of infrared and visible image via a mechanism of balancing texture and salience [J].
Qian, Yao ;
Liu, Gang ;
Tang, Haojie ;
Xing, Mengliang ;
Chang, Rui .
OPTICS AND LASERS IN ENGINEERING, 2024, 173
[40]   U2-Net: Going deeper with nested U-structure for salient object detection [J].
Qin, Xuebin ;
Zhang, Zichen ;
Huang, Chenyang ;
Dehghan, Masood ;
Zaiane, Osmar R. ;
Jagersand, Martin .
PATTERN RECOGNITION, 2020, 106