MACCNet: Multiscale Attention and Cross- Convolutional Network for Infrared and Visible Image Fusion

被引:0
|
作者
Yang, Yong [1 ]
Zhou, Na [2 ]
Wan, Weiguo [3 ]
Huang, Shuying [4 ]
机构
[1] Tiangong Univ, Sch Comp Sci & Technol, Tianjin 300387, Peoples R China
[2] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330032, Peoples R China
[3] Jiangxi Univ Finance & Econ, Sch Software & Internet Things Engn, Nanchang 330032, Peoples R China
[4] Tiangong Univ, Sch Software, Tianjin 300387, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Convolution; Sensors; Image fusion; Task analysis; Kernel; Image reconstruction; Attention mechanism; cross convolution; infrared and visible image fusion (IVIF); multiscale network;
D O I
10.1109/JSEN.2024.3385638
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Infrared and visible image fusion (IVIF) aims to fully preserve the target and detail information of the infrared and visible images in the fusion image. Although deep learning-based methods have been widely used in IVIF, they usually use the same network structure to extract features without considering the differences between different image modalities, leading to insufficient feature extraction and unsatisfactory fusion results. To overcome these problems, we propose a multiscale attention and cross-convolutional network (MACCNet) to obtain competitive fusion results. The method includes a new two-branched structure-based encoder network for extracting features from two different modality images. In one branch, a new multiscale attention module (MAM) extracts the target features at different scales of the input infrared images. In the other branch, the new cross-convolution feature extraction module (CFEM) extracts the detail features of visible images in different directions. We also introduce a local saliency attention fusion network (LSAFN) to obtain two weight maps to improve the fusion of extracted target and detail features of the different modality images. In addition, the two weight maps are averaged as coefficients of the pixel loss terms to adaptively train the network. Finally, we obtain the final fusion result by reconstructing the fused features via a decoder network. Experimental results show that the proposed MACCNet outperforms several state-of-the-art IVIF methods in terms of visual perception and objective evaluation.
引用
收藏
页码:16587 / 16600
页数:14
相关论文
共 50 条
  • [21] Fully convolutional network-based infrared and visible image fusion
    Feng, Yufang
    Lu, Houqing
    Bai, Jingbo
    Cao, Lin
    Yin, Hong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15001 - 15014
  • [22] Attention Multihop Graph and Multiscale Convolutional Fusion Network for Hyperspectral Image Classification
    Zhou, Hao
    Luo, Fulin
    Zhuang, Huiping
    Weng, Zhenyu
    Gong, Xiuwen
    Lin, Zhiping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [23] Fully convolutional network-based infrared and visible image fusion
    Yufang Feng
    Houqing Lu
    Jingbo Bai
    Lin Cao
    Hong Yin
    Multimedia Tools and Applications, 2020, 79 : 15001 - 15014
  • [24] Infrared and Visible Image Fusion via Multiscale Receptive Field Amplification Fusion Network
    Ji, Chuanming
    Zhou, Wujie
    Lei, Jingsheng
    Ye, Lv
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 493 - 497
  • [25] Infrared and visible image fusion algorithm based on a cross-layer densely connected convolutional network
    Yu, Ruixing
    Chen, Weiyu
    Zhu, Bing
    APPLIED OPTICS, 2022, 61 (11) : 3107 - 3114
  • [26] Self-Attention Progressive Network for Infrared and Visible Image Fusion
    Li, Shuying
    Han, Muyi
    Qin, Yuemei
    Li, Qiang
    REMOTE SENSING, 2024, 16 (18)
  • [27] Infrared and visible image fusion based on dilated residual attention network
    Mustafa, Hafiz Tayyab
    Yang, Jie
    Mustafa, Hamza
    Zareapoor, Masoumeh
    OPTIK, 2020, 224 (224):
  • [28] Attention based dual UNET network for infrared and visible image fusion
    Wang, Xuejiao
    Hua, Zhen
    Li, Jinjiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (25) : 66959 - 66980
  • [29] MATCNN: Infrared and Visible Image Fusion Method Based on Multiscale CNN With Attention Transformer
    Liu, Jingjing
    Zhang, Li
    Zeng, Xiaoyang
    Liu, Wanquan
    Zhang, Jianhua
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [30] EMA-GAN: A Generative Adversarial Network for Infrared and Visible Image Fusion with Multiscale Attention Network and Expectation Maximization Algorithm
    Xi, Xiuliang
    Jin, Xin
    Jiang, Qian
    Lin, Yu
    Zhou, Wei
    Guo, Lei
    ADVANCED INTELLIGENT SYSTEMS, 2023, 5 (11)