An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention

被引：1

作者：

Xu, Jing ^{[1
,3
]}

Liu, Zhenjin ^{[1
,3
]}

Fang, Ming ^{[2
,3
]}

机构：

[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun, Peoples R China

[2] Changchun Univ Sci & Technol, Sch Artificial Intelligence, Changchun, Peoples R China

[3] Changchun Univ Sci & Technol, Zhongshan Inst, Machine Vis & Unmanned Syst Lab, Zhongshan, Peoples R China

来源：

IET IMAGE PROCESSING | 2024年 / 18卷 / 08期

关键词：

convolutional neural nets; feature extraction; image fusion; image reconstruction; QUALITY ASSESSMENT; NEST;

D O I：

10.1049/ipr2.13088

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, research on infrared and visible image fusion has mainly focused on deep learning-based approaches, particularly deep neural networks with auto-encoder architectures. However, these approaches suffer from problems such as insufficient feature extraction capability and inefficient fusion strategies. Therefore, this paper introduces a novel image fusion network to address the limitations of infrared and visible image fusion networks with auto-encoder architectures. In the designed network, the encoder employs a multi-branch cascade structure, and these convolution branches with different kernel sizes provide the encoder with an adaptive receptive field to extract multi-scale features. In addition, the fusion layer incorporates a non-local attention module that is inspired by the self-attention mechanism. With its global receptive field, this module is used to build a non-local attention fusion network, which works together with the l1${l}_1$-norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. This paper introduces a novel infrared and visible image fusion network to address the limitations of auto-encoder fusion networks. In the designed network, the encoder employs a multi-branch cascade structure with convolution kernels of different sizes to extract multi-scale features, and the fusion layer incorporates a non-local attention module alongside a spatial feature fusion strategy for both global and local feature fusion. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. image

引用

页码：2114 / 2125

页数：12

共 50 条

[1] Infrared and visible image fusion based on multi-scale dense attention connection network
Chen Y.
Zhang J.
Wang Z.
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (18): : 2253 - 2266
[2] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
Li, Fuquan
Zhou, Yonghui
Chen, YanLi
Li, Jie
Dong, ZhiCheng
Tan, Mian
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 705 - 719
[3] A Multi-Scale Infrared and Visible Image Fusion Network Based on Context Perception
Zhao, Huixuan
Cheng, Jinyong
Du, Rundong
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 395 - 400
[4] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
Xu, Dongdong
Zhang, Ning
Zhang, Yuxi
Li, Zheng
Zhao, Zhikang
Wang, Yongcheng
INFRARED PHYSICS & TECHNOLOGY, 2022, 125
[5] An end-to-end multi-scale network based on autoencoder for infrared and visible image fusion
Liu, Hongzhe
Yan, Hua
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20139 - 20156
[6] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
Fuquan Li
Yonghui Zhou
YanLi Chen
Jie Li
ZhiCheng Dong
Mian Tan
Complex & Intelligent Systems, 2024, 10 : 705 - 719
[7] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
Liu Y.
Liu M.
Lin S.
Tao Z.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
[8] Deep Neural Network for Infrared and Visible Image Fusion Based on Multi-scale Decomposition and Interactive Residual Coordinate Attention
Zong, Sha
Xie, Zhihua
Li, Qiang
Liu, Guodong
ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 254 - 262
[9] Prompt learning and multi-scale attention for infrared and visible image fusion
Li, Yanan
Ji, Qingtao
Jiao, Shaokang
INFRARED PHYSICS & TECHNOLOGY, 2025, 145
[10] AEFusion: A multi-scale fusion network combining Axial attention and Entropy feature Aggregation for infrared and visible images
Li, Bicao
Lu, Jiaxi
Liu, Zhoufeng
Shao, Zhuhong
Li, Chunlei
Du, Yifan
Huang, Jie
APPLIED SOFT COMPUTING, 2023, 132

← 1 2 3 4 5 →