MPCFusion: Multi-scale parallel cross fusion for infrared and visible images via convolution and vision Transformer

被引：7

作者：

Tang, Haojie ^{[1
]}

Qian, Yao ^{[1
]}

Xing, Mengliang ^{[1
]}

Cao, Yisheng ^{[1
]}

Liu, Gang ^{[1
]}

机构：

[1] Shanghai Univ Elect Power, Sch Automat Engn, Shanghai 200090, Peoples R China

来源：

OPTICS AND LASERS IN ENGINEERING | 2024年 / 176卷

基金：

中国国家自然科学基金;

关键词：

Image fusion; Vision Transformer; Convolution; Multi-scale feature; Infrared; NETWORK;

D O I：

10.1016/j.optlaseng.2024.108094

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

The image fusion community is thriving with the wave of deep learning, and the most popular fusion methods are usually built upon well -designed network structures. However, most of the current methods do not fully exploit deeper features while ignore the importance of long-range dependencies. In this paper, a convolution and vision Transformer -based multi -scale parallel cross fusion network for infrared and visible images is proposed (MPCFusion). To exploit deeper texture details, a feature extraction module based on convolution and vision Transformer is designed. With a view to correlating the shallow features between different modalities, a parallel cross -attention module is proposed, in which a parallel -channel model efficiently preserves the proprietary modal features, followed by a cross -spatial model that ensures the information interactions between the different modalities. Moreover, a cross -domain attention module based on convolution and vision Transformer is proposed to capturing long-range dependencies between in-depth features and effectively solves the problem of global context loss. Finally, a nest -connection based decoder is used for implementing feature reconstruction. In particular, we design a new texture -guided structural similarity loss function to drive the network to preserve more complete texture details. Extensive experimental results illustrate that MPCFusion shows excellent fusion performance and generalization capabilities. The source code will be released at https:// github .com /YQ -097 /MPCFusion.

引用

页数：13

共 50 条

[31] A Multi-Scale Infrared and Visible Image Fusion Network Based on Context Perception
Zhao, Huixuan
Cheng, Jinyong
Du, Rundong
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 395 - 400
[32] MMF: A Multi-scale MobileNet based fusion method for infrared and visible image
Liu, Yi
Miao, Changyun
Ji, Jianhua
Li, Xianguo
INFRARED PHYSICS & TECHNOLOGY, 2021, 119
[33] Multi-scale saliency measure and orthogonal space for visible and infrared image fusion
Liu, Yaochen
Dong, Lili
Ren, Wei
Xu, Wenhai
INFRARED PHYSICS & TECHNOLOGY, 2021, 118
[34] TFIV: Multigrained Token Fusion for Infrared and Visible Image via Transformer
Li, Jing
Yang, Bin
Bai, Lu
Dou, Hao
Li, Chang
Ma, Lingfei
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[35] DATFuse: Infrared and Visible Image Fusion via Dual Attention Transformer
Tang, Wei
He, Fazhi
Liu, Yu
Duan, Yansong
Si, Tongzhen
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (07) : 3159 - 3172
[36] Fusion of infrared intensity and polarization images using embedded multi-scale transform
Lin, Su-zhen
Wang, Dong-juan
Zhu, Xiao-hong
Zhang, Shang-min
OPTIK, 2015, 126 (24): : 5127 - 5133
[37] Fusion of near-infrared and visible images based on saliency-map-guided multi-scale transformation decomposition
Chen Jun
Cai Lei
Liu Wei
Yu Yang
Multimedia Tools and Applications, 2023, 82 : 34631 - 34651
[38] Infrared and visible image fusion based on double fluid pyramids and multi-scale gradient residual block
Pang, Shan
Huo, Hongtao
Yang, Xin
Li, Jing
Liu, Xiaowen
INFRARED PHYSICS & TECHNOLOGY, 2023, 131
[39] MEEAFusion: Multi-Scale Edge Enhancement and Joint Attention Mechanism Based Infrared and Visible Image Fusion
Xie, Yingjiang
Fei, Zhennan
Deng, Da
Meng, Lingshuai
Niu, Fu
Sun, Jinggong
SENSORS, 2024, 24 (17)
[40] Infrared and Visible Image Fusion using Multi-Scale Decomposition and Visual Saliency Map
Chen, Yunfan
Xie, Han
Yeo, Donghoon
Shin, Hyunchul
2018 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2018, : 243 - 244

← 1 2 3 4 5 →