An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention

Cited by: 1
Authors
Xu, Jing [1, 3]
Liu, Zhenjin [1, 3]
Fang, Ming [2, 3]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun, Peoples R China
[2] Changchun Univ Sci & Technol, Sch Artificial Intelligence, Changchun, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Machine Vis & Unmanned Syst Lab, Zhongshan, Peoples R China
Keywords
convolutional neural nets; feature extraction; image fusion; image reconstruction; quality assessment; NEST
DOI
10.1049/ipr2.13088
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, research on infrared and visible image fusion has focused mainly on deep learning-based approaches, particularly deep neural networks with auto-encoder architectures. However, these approaches suffer from insufficient feature extraction capability and inefficient fusion strategies. This paper therefore introduces a novel image fusion network to address these limitations. In the designed network, the encoder employs a multi-branch cascade structure whose convolution branches, with different kernel sizes, give the encoder an adaptive receptive field for extracting multi-scale features. In addition, the fusion layer incorporates a non-local attention module inspired by the self-attention mechanism. With its global receptive field, this module is used to build a non-local attention fusion network, which works together with the $l_1$-norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches.
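The abstract names an l1-norm spatial fusion strategy without giving its formula. The sketch below illustrates one common formulation used in auto-encoder fusion networks; it is an assumption, not the authors' implementation. Each pixel's activity level is taken as the l1 norm of its feature vector across channels, and the two sources are blended with weights proportional to those activity levels. The function names, the nested-list feature layout (channels x height x width), and the `eps` stabilizer are hypothetical, and any neighborhood averaging of the activity map is omitted for brevity.

```python
def l1_activity(features):
    """Return an H x W map holding the channel-wise l1 norm at each pixel.

    `features` is a list of C channels, each an H x W nested list (assumed layout).
    """
    height, width = len(features[0]), len(features[0][0])
    return [
        [sum(abs(channel[y][x]) for channel in features) for x in range(width)]
        for y in range(height)
    ]


def l1_spatial_fusion(feat_ir, feat_vis, eps=1e-8):
    """Blend two feature stacks with per-pixel weights from their l1 activity."""
    act_ir = l1_activity(feat_ir)
    act_vis = l1_activity(feat_vis)
    height, width = len(act_ir), len(act_ir[0])

    fused = []
    for ch_ir, ch_vis in zip(feat_ir, feat_vis):
        fused_channel = []
        for y in range(height):
            row = []
            for x in range(width):
                # eps avoids division by zero where both activities vanish.
                total = act_ir[y][x] + act_vis[y][x] + eps
                w_ir = act_ir[y][x] / total
                w_vis = act_vis[y][x] / total
                row.append(w_ir * ch_ir[y][x] + w_vis * ch_vis[y][x])
            fused_channel.append(row)
        fused.append(fused_channel)
    return fused


# Toy example: 2 channels, 1 x 2 pixels. The infrared features are active only
# at the first pixel and the visible features only at the second.
feat_ir = [[[1.0, 0.0]], [[-1.0, 0.0]]]
feat_vis = [[[0.0, 2.0]], [[0.0, 2.0]]]
fused = l1_spatial_fusion(feat_ir, feat_vis)
```

In this toy input each pixel is dominated by a single source, so the fused features follow that source almost exactly. Where both sources are active, the weights interpolate between them in proportion to their activity.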
Pages: 2114-2125
Page count: 12