BDMFuse: Multi-scale network fusion for infrared and visible images based on base and detail features

被引:0
作者
Si, Hai-Ping [1 ]
Zhao, Wen-Rui [1 ]
Li, Ting-Ting [1 ]
Li, Fei-Tao [1 ]
Fernado, Bacao [2 ]
Sun, Chang-Xia [1 ]
Li, Yan-Ling [1 ]
机构
[1] Henan Agr Univ, Coll Informat & Management Sci, Zhengzhou 450046, Peoples R China
[2] Univ Nova Lisboa, NOVA Informat Management Sch, P-1070312 Lisbon, Portugal
关键词
infrared image; visible image; image fusion; encoder-decoder; multi-scale features; PERFORMANCE;
D O I
10.11972/j.issn.1001-9014.2025.02.016
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
The fusion of infrared and visible images should emphasize the salient targets in the infrared image while preserving the textural details of the visible images. To meet these requirements, an autoencoder-based method for infrared and visible image fusion is proposed. The encoder designed according to the optimization objective consists of a base encoder and a detail encoder, which is used to extract low-frequency and high-frequency information from the image. This extraction may lead to some information not being captured, so a compensation encoder is proposed to supplement the missing information. Multi-scale decomposition is also employed to extract image features more comprehensively. The decoder combines low-frequency, high-frequency and supplementary information to obtain multi-scale features. Subsequently, the attention strategy and fusion module are introduced to perform multi-scale fusion for image reconstruction. Experimental results on three datasets show that the fused images generated by this network effectively retain salient targets while being more consistent with human visual perception.
引用
收藏
页码:275 / 284
页数:10
相关论文
共 43 条
[1]  
13Li B, 2024, IEEE T GEOSCI REMOTE, P62
[2]  
14Liu T, 2023, IEEE T GEOSCI REMOTE, V61
[3]  
21Sadjadi F, 2005, 2005 IEEE COMP SOC C, P8
[4]   A new image quality metric for image fusion: The sum of the correlations of differences [J].
Aslantas, V. ;
Bendes, E. .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2015, 69 (12) :160-166
[5]   Infrared and visible image fusion based on target-enhanced multiscale transform decomposition [J].
Chen, Jun ;
Li, Xuejiao ;
Luo, Linbo ;
Mei, Xiaoguang ;
Ma, Jiayi .
INFORMATION SCIENCES, 2020, 508 :64-78
[6]   Detail preserved fusion of visible and infrared images using regional saliency extraction and multi-scale image decomposition [J].
Cui, Guangmang ;
Feng, Huajun ;
Xu, Zhihai ;
Li, Qi ;
Chen, Yueting .
OPTICS COMMUNICATIONS, 2015, 341 :199-209
[7]   The nonsubsampled contourlet transform: Theory, design, and applications [J].
da Cunha, Arthur L. ;
Zhou, Jianping ;
Do, Minh N. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (10) :3089-3101
[8]   A new image fusion performance metric based on visual information fidelity [J].
Han, Yu ;
Cai, Yunze ;
Cao, Yin ;
Xu, Xiaoming .
INFORMATION FUSION, 2013, 14 (02) :127-135
[9]   LLVIP: A Visible-infrared Paired Dataset for Low-light Vision [J].
Jia, Xinyu ;
Zhu, Chuang ;
Li, Minzhen ;
Tang, Wenqi ;
Zhou, Wenli .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :3489-3497
[10]   SEDRFuse: A Symmetric Encoder-Decoder With Residual Block Network for Infrared and Visible Image Fusion [J].
Jian, Lihua ;
Yang, Xiaomin ;
Liu, Zheng ;
Jeon, Gwanggil ;
Gao, Mingliang ;
Chisholm, David .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70