An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention

被引:1
|
作者
Xu, Jing [1 ,3 ]
Liu, Zhenjin [1 ,3 ]
Fang, Ming [2 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun, Peoples R China
[2] Changchun Univ Sci & Technol, Sch Artificial Intelligence, Changchun, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Machine Vis & Unmanned Syst Lab, Zhongshan, Peoples R China
关键词
convolutional neural nets; feature extraction; image fusion; image reconstruction; QUALITY ASSESSMENT; NEST;
D O I
10.1049/ipr2.13088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, research on infrared and visible image fusion has mainly focused on deep learning-based approaches, particularly deep neural networks with auto-encoder architectures. However, these approaches suffer from problems such as insufficient feature extraction capability and inefficient fusion strategies. Therefore, this paper introduces a novel image fusion network to address the limitations of infrared and visible image fusion networks with auto-encoder architectures. In the designed network, the encoder employs a multi-branch cascade structure, and these convolution branches with different kernel sizes provide the encoder with an adaptive receptive field to extract multi-scale features. In addition, the fusion layer incorporates a non-local attention module that is inspired by the self-attention mechanism. With its global receptive field, this module is used to build a non-local attention fusion network, which works together with the l1${l}_1$-norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. This paper introduces a novel infrared and visible image fusion network to address the limitations of auto-encoder fusion networks. In the designed network, the encoder employs a multi-branch cascade structure with convolution kernels of different sizes to extract multi-scale features, and the fusion layer incorporates a non-local attention module alongside a spatial feature fusion strategy for both global and local feature fusion. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. image
引用
收藏
页码:2114 / 2125
页数:12
相关论文
共 50 条
  • [21] AEFusion: A multi-scale fusion network combining Axial attention and Entropy feature Aggregation for infrared and visible images
    Li, Bicao
    Lu, Jiaxi
    Liu, Zhoufeng
    Shao, Zhuhong
    Li, Chunlei
    Du, Yifan
    Huang, Jie
    APPLIED SOFT COMPUTING, 2023, 132
  • [22] Multi-scale feature fusion pyramid attention network for single image dehazing
    Liu, Jianlei
    Liu, Peng
    Zhang, Yuanke
    IET IMAGE PROCESSING, 2023, 17 (09) : 2726 - 2735
  • [23] Fusion of visible and infrared images based on multi-scale image enhancement
    Sun, Ming-Chao
    Zhang, Chong
    Liu, Jing-Hong
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2012, 42 (03): : 738 - 742
  • [24] Image super-resolution using multi-scale non-local attention
    Kim, Sowon
    Park, Hanhoon
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [25] Integrating Parallel Attention Mechanisms and Multi-Scale Features for Infrared and Visible Image Fusion
    Xu, Qian
    Zheng, Yuan
    IEEE ACCESS, 2024, 12 : 8359 - 8372
  • [26] An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion
    Liu, Hongzhe
    Yan, Hua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20139 - 20156
  • [27] An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion
    Hongzhe Liu
    Hua Yan
    Multimedia Tools and Applications, 2023, 82 : 20139 - 20156
  • [28] Infrared and visual image fusion based on multi-scale feature decomposition
    Yan, Huibin
    Li, Zhongmin
    OPTIK, 2020, 203
  • [29] MGRCFusion: An infrared and visible image fusion network based on multi-scale group residual convolution
    Zhu, Pan
    Yin, Yufei
    Zhou, Xinglin
    OPTICS AND LASER TECHNOLOGY, 2025, 180
  • [30] Image compression sensing based on non-local feature fusion network
    Zhang, Yongping
    JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2024, 17 (02)