Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion

被引:0
|
作者
Fuquan Li
Yonghui Zhou
YanLi Chen
Jie Li
ZhiCheng Dong
Mian Tan
机构
[1] Guizhou Normal University,School of Big Data and Computer Science
[2] Chongqing University of Science and Technology,School of Intelligent Technology and Engineering
[3] Tibet University,School of Information Science and Technology
[4] Guizhou Minzu University,Guizhou Key Laboratory of Pattern Recognition and Intelligent System
来源
Complex & Intelligent Systems | 2024年 / 10卷
关键词
Image fusion; Attention model; Dilated convolution; Infrared image; Visible image;
D O I
暂无
中图分类号
学科分类号
摘要
Infrared and visible image fusion aims to generate synthetic images including salient targets and abundant texture details. However, traditional techniques and recent deep learning-based approaches have faced challenges in preserving prominent structures and fine-grained features. In this study, we propose a lightweight infrared and visible image fusion network utilizing multi-scale attention modules and hybrid dilated convolutional blocks to preserve significant structural features and fine-grained textural details. First, we design a hybrid dilated convolutional block with different dilation rates that enable the extraction of prominent structure features by enlarging the receptive field in the fusion network. Compared with other deep learning methods, our method can obtain more high-level semantic information without piling up a large number of convolutional blocks, effectively improving the ability of feature representation. Second, distinct attention modules are designed to integrate into different layers of the network to fully exploit contextual information of the source images, and we leverage the total loss to guide the fusion process to focus on vital regions and compensate for missing information. Extensive qualitative and quantitative experiments demonstrate the superiority of our proposed method over state-of-the-art methods in both visual effects and evaluation metrics. The experimental results on public datasets show that our method can improve the entropy (EN) by 4.80%, standard deviation (SD) by 3.97%, correlation coefficient (CC) by 1.86%, correlations of differences (SCD) by 9.98%, and multi-scale structural similarity (MS_SSIM) by 5.64%, respectively. In addition, experiments with the VIFB dataset further indicate that our approach outperforms other comparable models.
引用
收藏
页码:705 / 719
页数:14
相关论文
共 50 条
  • [1] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
    Li, Fuquan
    Zhou, Yonghui
    Chen, YanLi
    Li, Jie
    Dong, ZhiCheng
    Tan, Mian
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 705 - 719
  • [2] Infrared and visible image fusion based on multi-scale dense attention connection network
    Chen Y.
    Zhang J.
    Wang Z.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (18): : 2253 - 2266
  • [3] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    Infrared Physics and Technology, 2022, 125
  • [4] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    INFRARED PHYSICS & TECHNOLOGY, 2022, 125
  • [5] Infrared and visible image fusion based on dilated residual attention network
    Mustafa, Hafiz Tayyab
    Yang, Jie
    Mustafa, Hamza
    Zareapoor, Masoumeh
    OPTIK, 2020, 224 (224):
  • [6] Prompt learning and multi-scale attention for infrared and visible image fusion
    Li, Yanan
    Ji, Qingtao
    Jiao, Shaokang
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [7] A Multi-Scale Infrared and Visible Image Fusion Network Based on Context Perception
    Zhao, Huixuan
    Cheng, Jinyong
    Du, Rundong
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 395 - 400
  • [8] Multi-Scale Neural Network With Dilated Convolutions for Image Deblurring
    Ople, Jose Jaena Mari
    Yeh, Pin-Yi
    Sun, Shih-Wei
    Tsai, I-Te
    Hua, Kai-Lung
    IEEE ACCESS, 2020, 8 : 53942 - 53952
  • [9] GridDehazeNet: Attention-Based Multi-Scale Network for Image Dehazing
    Liu, Xiaohong
    Ma, Yongrui
    Shi, Zhihao
    Chen, Jun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7313 - 7322
  • [10] An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention
    Xu, Jing
    Liu, Zhenjin
    Fang, Ming
    IET IMAGE PROCESSING, 2024, 18 (08) : 2114 - 2125