Res2Fusion: Infrared and Visible Image Fusion Based on Dense Res2net and Double Nonlocal Attention Models

被引：110

作者：

Wang, Zhishe ^{[1
]}

Wu, Yuanyuan ^{[1
]}

Wang, Junyao ^{[1
]}

Xu, Jiawei ^{[2
,3
]}

Shao, Wenyu ^{[1
]}

机构：

[1] Taiyuan Univ Sci & Technol, Sch Appl Sci, Taiyuan 030024, Peoples R China

[2] Wenzhou Univ, Inst Big Data & Informat Technol, Wenzhou 303205, Peoples R China

[3] Wenzhou Univ, Coll Comp Sci & Artificial Intelligence, Wenzhou 303205, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2022年 / 71卷

关键词：

Feature extraction; Convolution; Image fusion; Task analysis; Generative adversarial networks; Image reconstruction; Transforms; Deep learning; image fusion; infrared image; nonlocal attention; visible image; MULTISCALE TRANSFORM; PERFORMANCE; FRAMEWORK; NETWORK;

D O I：

10.1109/TIM.2021.3139654

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Infrared and visible image fusion intends to generate a synthetic image with superior scene representation and better visual perception. The existing deep learning-based fusion methods merely make use of the convolution operation to extract features with a local receptive field, without fully considering their multiscale and long-range dependency characteristics, which may fail to preserve some essential global context from source images. To this end, we develop a novel and efficient fusion network based on dense Res2net and double nonlocal attention models, termed Res2Fusion. We introduce Res2net and dense connections into the encoder network with multiple available receptive fields, which is used to extract the multiscale features, and can retain as much meaningful information as possible for fusion tasks. In addition, we develop double nonlocal attention models as a fusion layer to model long-range dependencies on the local features. Specifically, these attention models can refine feature maps obtained by the encoder network to more focus on prominent infrared targets and distinct visible details. Finally, the comprehensive attention maps are used to generate a fused result through the simple decoder network. Extensive experiments demonstrate that the proposed method can simultaneously retain highlighted infrared targets and rich visible details and transcends other state-of-the-art fusion methods in terms of subjective and objective evaluation. The corresponding code is publicly available at <uri>https://github.com/Zhishe-Wang/Res2Fusion</uri>.

引用

页数：12

共 43 条

[1] A new image quality metric for image fusion: The sum of the correlations of differences [J].

Aslantas, V. ;

Bendes, E. .

AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2015, 69 (12) :160-166

[2] Global Context Networks [J].

Cao, Yue ;

Xu, Jiarui ;

Lin, Stephen ;

Wei, Fangyun ;

Hu, Han .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) :6881-6895

[3] Infrared and visible image fusion based on target-enhanced multiscale transform decomposition [J].

Chen, Jun ;

Li, Xuejiao ;

Luo, Linbo ;

Mei, Xiaoguang ;

Ma, Jiayi .

INFORMATION SCIENCES, 2020, 508 :64-78

[4] Res2Net: A New Multi-Scale Backbone Architecture [J].

Gao, Shang-Hua ;

Cheng, Ming-Ming ;

Zhao, Kai ;

Zhang, Xin-Yu ;

Yang, Ming-Hsuan ;

Torr, Philip .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) :652-662

[5] Fast saliency-aware multi-modality image fusion [J].

Han, Jungong ;

Pauwels, Eric J. ;

de Zeeuw, Paul .

NEUROCOMPUTING, 2013, 111 :70-80

[6] A new image fusion performance metric based on visual information fidelity [J].

Han, Yu ;

Cai, Yunze ;

Cao, Yin ;

Xu, Xiaoming .

INFORMATION FUSION, 2013, 14 (02) :127-135

[7] CCNet: Criss-Cross Attention for Semantic Segmentation [J].

Huang, Zilong ;

Wang, Xinggang ;

Huang, Lichao ;

Huang, Chang ;

Wei, Yunchao ;

Liu, Wenyu .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :603-612

[8] SEDRFuse: A Symmetric Encoder-Decoder With Residual Block Network for Infrared and Visible Image Fusion [J].

Jian, Lihua ;

Yang, Xiaomin ;

Liu, Zheng ;

Jeon, Gwanggil ;

Gao, Mingliang ;

Chisholm, David .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70

[9]

John Vijay, 2021, Pattern Recognition. ICPR International Workshops and Challenges. Proceedings. Lecture Notes in Computer Science (LNCS 12666), P277, DOI 10.1007/978-3-030-68780-9_24

[10] RFN-Nest: An end-to-end residual fusion network for infrared and visible images [J].

Li, Hui ;

Wu, Xiao-Jun ;

Kittler, Josef .

INFORMATION FUSION, 2021, 73 :72-86

← 1 2 3 4 5 →