MAFusion: Multiscale Attention Network for Infrared and Visible Image Fusion

被引:30
作者
Li, Xiaoling [1 ]
Chen, Houjin [1 ]
Li, Yanfeng [1 ]
Peng, Yahui [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
基金
美国国家科学基金会;
关键词
Image fusion; Feature extraction; Task analysis; Transforms; Decoding; Semantics; Generative adversarial networks; Attention-based model; image fusion; infrared image; multiscale network; skip connection; visible image; FRAMEWORK;
D O I
10.1109/TIM.2022.3181898
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The infrared and visible image fusion aims to generate one image with rich information by integrating thermal regions from the infrared image and texture details from the visible image, which is beneficial to facilitate the capacity of video surveillance and object detection in complex environments. Although there is great progress in image fusion algorithms, artifacts and inconsistencies are still challenging tasks. To alleviate these problems, a multiscale attention network for infrared and visible image fusion (MAFusion) is proposed. The network consists of an encoder, a fusion strategy, and a decoder. Specifically, the encoder is adopted to extract multiscale features by feeding the source images. An attention-based model is then designed as the fusion strategy to integrate different features in the infrared and visible images. The attention-based model can highlight the thermal targets in the infrared image and maintain details in the visible image, so as to avoid the generation of artifacts. The decoder is based on a multiscale skip connection to incorporate low-level details with high-level semantics at different scales. The vital features of infrared and visible images can be fully preserved by the multiscale skip connection network to restrict the introduction of inconsistencies. Furthermore, we develop a feature-preserving loss function to train the proposed network. Experimental results demonstrate that the proposed network delivers advantages and effectiveness compared with the state-of-the-art fusion methods in qualitative and quantitative assessments. Besides, we apply the fused image generated by MAFusion to crowd counting (CC), which can effectively improve the CC performance in low-illumination conditions.
引用
收藏
页数:16
相关论文
共 50 条
  • [11] A Dual Cross Attention Transformer Network for Infrared and Visible Image Fusion
    Zhou, Zhuozhi
    Lan, Jinhui
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 494 - 499
  • [12] Contrast Saliency Information Guided Infrared and Visible Image Fusion
    Wang, Xue
    Guan, Zheng
    Qian, Wenhua
    Cao, Jinde
    Wang, Chengchao
    Yang, Chao
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2023, 9 : 769 - 780
  • [13] EMA-GAN: A Generative Adversarial Network for Infrared and Visible Image Fusion with Multiscale Attention Network and Expectation Maximization Algorithm
    Xi, Xiuliang
    Jin, Xin
    Jiang, Qian
    Lin, Yu
    Zhou, Wei
    Guo, Lei
    ADVANCED INTELLIGENT SYSTEMS, 2023, 5 (11)
  • [14] Multiscale Progressive Fusion of Infrared and Visible Images
    Park, Seonghyun
    Lee, Chul
    IEEE ACCESS, 2022, 10 : 126117 - 126132
  • [15] MCnet: Multiscale visible image and infrared image fusion network
    Sun, Le
    Li, Yuhang
    Zheng, Min
    Zhong, Zhaoyi
    Zhang, Yanchun
    SIGNAL PROCESSING, 2023, 208
  • [16] Multiscale aggregation and illumination-aware attention network for infrared and visible image fusion
    Song, Wenhao
    Zhai, Wenzhe
    Gao, Mingliang
    Li, Qilei
    Chehri, Abdellah
    Jeon, Gwanggil
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (10)
  • [17] MSFNet: MultiStage Fusion Network for infrared and visible image fusion
    Wang, Chenwu
    Wu, Junsheng
    Zhu, Zhixiang
    Chen, Hao
    NEUROCOMPUTING, 2022, 507 : 26 - 39
  • [18] Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network
    Zhou, Huabing
    Wu, Wei
    Zhang, Yanduo
    Ma, Jiayi
    Ling, Haibin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 635 - 648
  • [19] RAN: Infrared and Visible Image Fusion Network Based on Residual Attention Decomposition
    Yu, Jia
    Lu, Gehao
    Zhang, Jie
    ELECTRONICS, 2024, 13 (14)
  • [20] NestFuse: An Infrared and Visible Image Fusion Architecture Based on Nest Connection and Spatial/Channel Attention Models
    Li, Hui
    Wu, Xiao-Jun
    Durrani, Tariq
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (12) : 9645 - 9656