SIGFusion: Semantic Information-Guided Infrared and Visible Image Fusion

被引:6
作者
Lv, Guohua [1 ,2 ,3 ]
Sima, Chaoqun [2 ]
Gao, Yongbiao [1 ,2 ,3 ]
Dong, Aimei [1 ,2 ,3 ]
Ma, Guangxiao [4 ]
Cheng, Jinyong [1 ,2 ,3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan,Minist Educ,Key Lab Comp, Jinan 250316, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250316, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Serv Comp, Shandong Prov Key Lab Comp Power Internet, Jinan 250000, Peoples R China
[4] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
关键词
Image fusion; Semantics; Visualization; Training; Data mining; Computer science; Object detection; Deep learning; high-level vision task; image fusion; infrared image; mask; self-attention; visible image; NETWORK; ENSEMBLE;
D O I
10.1109/TIM.2024.3457951
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the field of image fusion, the fusion of infrared and visible images emerges as a crucial work, retaining the inherent characteristics of each source image for the creation of high-quality fusion images. However, existing fusion algorithms primarily prioritize the visual performance and statistical metrics, often neglecting the requisites of high-level vision tasks. To bridge this gap, this article proposes SIGFusion, an end-to-end method for infrared and visible image fusion. SIGFusion not only considers the visual performance and statistical metrics of the fused image but also focuses on meeting the demands of subsequent high-level vision tasks, thereby enhancing performance in fusion and high-level vision tasks. Concretely, we define significant semantic information as salient targets presenting in both infrared and visible images. By utilizing targets, our fusion network is guided to enhance these salient targets. In contrast to the isolated employ of either infrared or visible images, our target-based method comprehensively utilizes the semantic information within the image. In addition, the utilization of target-based self-attention loss enhances the extraction of vital information from the source image. It is worth noting that masks are only necessary during the training stage and are automatically generated through detection algorithms. We conduct experiments in both image fusion and high-level vision tasks. Experimental results demonstrate the effectiveness of SIGFusion in both infrared and visible image fusion tasks, as well as in subsequent high-level vision tasks. The source code is available at: https://github.com/poison-pig/SIGFusion.
引用
收藏
页数:18
相关论文
共 63 条
[1]   Image fusion based on nonsubsampled contourlet transform for infrared and visible light image [J].
Adu, Jianhua ;
Gan, Jianhong ;
Wang, Yan ;
Huang, Jian .
INFRARED PHYSICS & TECHNOLOGY, 2013, 61 :94-100
[2]   A new image quality metric for image fusion: The sum of the correlations of differences [J].
Aslantas, V. ;
Bendes, E. .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2015, 69 (12) :160-166
[3]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]  
Deshmukh M., 2010, Int. J. Image Process., V4, P484
[6]   Automatic Contrast Enhancement Technology With Saliency Preservation [J].
Gu, Ke ;
Zhai, Guangtao ;
Yang, Xiaokang ;
Zhang, Wenjun ;
Chen, Chang Wen .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (09) :1480-1494
[7]  
Ha Q, 2017, IEEE INT C INT ROBOT, P5108, DOI 10.1109/IROS.2017.8206396
[8]   A non-reference image fusion metric based on mutual information of image features [J].
Haghighat, Mohammad Bagher Akbari ;
Aghagolzadeh, Ali ;
Seyedarabi, Hadi .
COMPUTERS & ELECTRICAL ENGINEERING, 2011, 37 (05) :744-756
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269