Res2Fusion: Infrared and Visible Image Fusion Based on Dense Res2net and Double Nonlocal Attention Models

Cited by: 94
Authors
Wang, Zhishe [1]
Wu, Yuanyuan [1]
Wang, Junyao [1]
Xu, Jiawei [2, 3]
Shao, Wenyu [1]
Affiliations
[1] Taiyuan Univ Sci & Technol, Sch Appl Sci, Taiyuan 030024, Peoples R China
[2] Wenzhou Univ, Inst Big Data & Informat Technol, Wenzhou 303205, Peoples R China
[3] Wenzhou Univ, Coll Comp Sci & Artificial Intelligence, Wenzhou 303205, Peoples R China
Keywords
Feature extraction; Convolution; Image fusion; Task analysis; Generative adversarial networks; Image reconstruction; Transforms; Deep learning; image fusion; infrared image; nonlocal attention; visible image; MULTISCALE TRANSFORM; PERFORMANCE; FRAMEWORK; NETWORK;
DOI
10.1109/TIM.2021.3139654
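Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809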
Abstract
Infrared and visible image fusion aims to generate a synthetic image with superior scene representation and better visual perception. Existing deep learning-based fusion methods rely solely on convolution to extract features with a local receptive field and do not fully consider multiscale and long-range dependency characteristics, so they may fail to preserve essential global context from the source images. To this end, we develop a novel and efficient fusion network based on dense Res2net and double nonlocal attention models, termed Res2Fusion. We introduce Res2net and dense connections into the encoder network, which provides multiple receptive fields for extracting multiscale features and retains as much meaningful information as possible for the fusion task. In addition, we develop double nonlocal attention models as a fusion layer to model long-range dependencies on the local features. Specifically, these attention models refine the feature maps obtained by the encoder network to focus more on prominent infrared targets and distinct visible details. Finally, the comprehensive attention maps are used to generate a fused result through a simple decoder network. Extensive experiments demonstrate that the proposed method simultaneously retains highlighted infrared targets and rich visible details and outperforms other state-of-the-art fusion methods in both subjective and objective evaluation. The corresponding code is publicly available at https://github.com/Zhishe-Wang/Res2Fusion.
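As an illustration only, the idea of modeling long-range dependencies on local features can be sketched as a parameter-free nonlocal attention step in plain Python. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names `nonlocal_attention` and `fuse`, the dot-product affinity, the residual connection, and the element-wise addition of the two refined feature sets are all hypothetical choices made for this example. The actual model is in the repository linked above.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def nonlocal_attention(features):
    """Refine N position vectors (each a list of C floats).

    Assumption: affinity is a plain dot product and the output is
    x_i + sum_j softmax_j(x_i . x_j) * x_j, so every position
    aggregates context from all other positions.
    """
    refined = []
    for xi in features:
        # Similarity of position i to every position j (global context).
        scores = [sum(a * b for a, b in zip(xi, xj)) for xj in features]
        weights = softmax(scores)
        # Attention-weighted sum of all position vectors.
        context = [
            sum(w * xj[c] for w, xj in zip(weights, features))
            for c in range(len(xi))
        ]
        # Residual connection keeps the original local feature.
        refined.append([a + b for a, b in zip(xi, context)])
    return refined


def fuse(ir_features, vis_features):
    """Hypothetical fusion: refine each modality, then add element-wise."""
    ir_ref = nonlocal_attention(ir_features)
    vis_ref = nonlocal_attention(vis_features)
    return [[a + b for a, b in zip(ri, vi)] for ri, vi in zip(ir_ref, vis_ref)]
```

Here each modality gets its own attention pass, loosely mirroring the "double" attention wording in the abstract. A real network would use learned projections and operate on convolutional feature maps, not on flat lists.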
Pages: 12