SIGFusion: Semantic Information-Guided Infrared and Visible Image Fusion

被引：6

作者：

Lv, Guohua ^{[1
,2
,3
]}

Sima, Chaoqun ^{[2
]}

Gao, Yongbiao ^{[1
,2
,3
]}

Dong, Aimei ^{[1
,2
,3
]}

Ma, Guangxiao ^{[4
]}

Cheng, Jinyong ^{[1
,2
,3
]}

机构：

[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan,Minist Educ,Key Lab Comp, Jinan 250316, Peoples R China

[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250316, Peoples R China

[3] Shandong Fundamental Res Ctr Comp Sci, Serv Comp, Shandong Prov Key Lab Comp Power Internet, Jinan 250000, Peoples R China

[4] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024年 / 73卷

关键词：

Image fusion; Semantics; Visualization; Training; Data mining; Computer science; Object detection; Deep learning; high-level vision task; image fusion; infrared image; mask; self-attention; visible image; NETWORK; ENSEMBLE;

D O I：

10.1109/TIM.2024.3457951

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the field of image fusion, the fusion of infrared and visible images emerges as a crucial work, retaining the inherent characteristics of each source image for the creation of high-quality fusion images. However, existing fusion algorithms primarily prioritize the visual performance and statistical metrics, often neglecting the requisites of high-level vision tasks. To bridge this gap, this article proposes SIGFusion, an end-to-end method for infrared and visible image fusion. SIGFusion not only considers the visual performance and statistical metrics of the fused image but also focuses on meeting the demands of subsequent high-level vision tasks, thereby enhancing performance in fusion and high-level vision tasks. Concretely, we define significant semantic information as salient targets presenting in both infrared and visible images. By utilizing targets, our fusion network is guided to enhance these salient targets. In contrast to the isolated employ of either infrared or visible images, our target-based method comprehensively utilizes the semantic information within the image. In addition, the utilization of target-based self-attention loss enhances the extraction of vital information from the source image. It is worth noting that masks are only necessary during the training stage and are automatically generated through detection algorithms. We conduct experiments in both image fusion and high-level vision tasks. Experimental results demonstrate the effectiveness of SIGFusion in both infrared and visible image fusion tasks, as well as in subsequent high-level vision tasks. The source code is available at: https://github.com/poison-pig/SIGFusion.

引用

页数：18

共 63 条

[1] Image fusion based on nonsubsampled contourlet transform for infrared and visible light image [J].