An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention

被引:1
|
作者
Xu, Jing [1 ,3 ]
Liu, Zhenjin [1 ,3 ]
Fang, Ming [2 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun, Peoples R China
[2] Changchun Univ Sci & Technol, Sch Artificial Intelligence, Changchun, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Machine Vis & Unmanned Syst Lab, Zhongshan, Peoples R China
关键词
convolutional neural nets; feature extraction; image fusion; image reconstruction; QUALITY ASSESSMENT; NEST;
D O I
10.1049/ipr2.13088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, research on infrared and visible image fusion has mainly focused on deep learning-based approaches, particularly deep neural networks with auto-encoder architectures. However, these approaches suffer from problems such as insufficient feature extraction capability and inefficient fusion strategies. Therefore, this paper introduces a novel image fusion network to address the limitations of infrared and visible image fusion networks with auto-encoder architectures. In the designed network, the encoder employs a multi-branch cascade structure, and these convolution branches with different kernel sizes provide the encoder with an adaptive receptive field to extract multi-scale features. In addition, the fusion layer incorporates a non-local attention module that is inspired by the self-attention mechanism. With its global receptive field, this module is used to build a non-local attention fusion network, which works together with the l1${l}_1$-norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. This paper introduces a novel infrared and visible image fusion network to address the limitations of auto-encoder fusion networks. In the designed network, the encoder employs a multi-branch cascade structure with convolution kernels of different sizes to extract multi-scale features, and the fusion layer incorporates a non-local attention module alongside a spatial feature fusion strategy for both global and local feature fusion. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. image
引用
收藏
页码:2114 / 2125
页数:12
相关论文
共 50 条
  • [1] Infrared and visible image fusion based on multi-scale dense attention connection network
    Chen Y.
    Zhang J.
    Wang Z.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (18): : 2253 - 2266
  • [2] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
    Li, Fuquan
    Zhou, Yonghui
    Chen, YanLi
    Li, Jie
    Dong, ZhiCheng
    Tan, Mian
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 705 - 719
  • [3] A Multi-Scale Infrared and Visible Image Fusion Network Based on Context Perception
    Zhao, Huixuan
    Cheng, Jinyong
    Du, Rundong
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 395 - 400
  • [4] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    INFRARED PHYSICS & TECHNOLOGY, 2022, 125
  • [5] An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion
    Liu, Hongzhe
    Yan, Hua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20139 - 20156
  • [6] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
    Fuquan Li
    Yonghui Zhou
    YanLi Chen
    Jie Li
    ZhiCheng Dong
    Mian Tan
    Complex & Intelligent Systems, 2024, 10 : 705 - 719
  • [7] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [8] Deep Neural Network for Infrared and Visible Image Fusion Based on Multi-scale Decomposition and Interactive Residual Coordinate Attention
    Zong, Sha
    Xie, Zhihua
    Li, Qiang
    Liu, Guodong
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 254 - 262
  • [9] Prompt learning and multi-scale attention for infrared and visible image fusion
    Li, Yanan
    Ji, Qingtao
    Jiao, Shaokang
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [10] AEFusion: A multi-scale fusion network combining Axial attention and Entropy feature Aggregation for infrared and visible images
    Li, Bicao
    Lu, Jiaxi
    Liu, Zhoufeng
    Shao, Zhuhong
    Li, Chunlei
    Du, Yifan
    Huang, Jie
    APPLIED SOFT COMPUTING, 2023, 132