MSFNet: MultiStage Fusion Network for infrared and visible image fusion

Cited by: 17
Authors
Wang, Chenwu [1]
Wu, Junsheng [2]
Zhu, Zhixiang [3]
Chen, Hao [4]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, 127 West Youyi Rd, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Software, 127 West Youyi Rd, Xian 710072, Shaanxi, Peoples R China
[3] Xian Univ Posts & Telecommun, Sch Modern Posts, 563 South Changan Rd, Xian 710061, Shaanxi, Peoples R China
[4] Xian Univ Posts & Telecommun, Sch Comp Sci, 563 South Changan Rd, Xian 710061, Shaanxi, Peoples R China
Keywords
Deep learning; Image fusion; Visible image; Infrared image; Multistage network; ENHANCEMENT; PERFORMANCE; FRAMEWORK;
DOI
10.1016/j.neucom.2022.07.048
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a multistage fusion network to solve the infrared and visible image fusion (IVIF) problem. Unlike other deep learning methods, our network architecture is designed to address the complex balance between spatial details and contextualized information. To balance these competing goals, our main proposal is a multistage architecture that progressively learns IVIF functions for the source images (infrared and visible images), thereby breaking down the overall recovery process into more manageable steps. Specifically, our model first learns contextual features using an encoder-decoder architecture with downsampling operations, and later combines them with a full-resolution branch that retains local details. Between stages, we introduce a cross-stage fusion module (CSFM) to propagate multiscale contextual features from an earlier stage to a later stage. In addition, we introduce an upsampling module that mitigates both checkerboard artifacts and blurring by applying a bilinear interpolation operation followed by a deformable convolution. The resulting tightly interlinked multistage fusion network, named MSFNet, outperforms state-of-the-art methods on publicly available datasets. (c) 2022 Published by Elsevier B.V.
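The upsampling idea in the abstract (interpolate first, then convolve) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names `bilinear_upsample`, `conv3x3`, and `upsample_block` are hypothetical, the input is a single-channel image stored as nested lists, and an ordinary fixed-kernel 3x3 convolution stands in for the deformable convolution, whose learned sampling offsets are omitted here.

```python
def bilinear_upsample(img, scale=2):
    """Upsample a 2-D list of floats by an integer factor (corner-aligned bilinear)."""
    h, w = len(img), len(img[0])
    out_h, out_w = h * scale, w * scale
    out = []
    for i in range(out_h):
        # Map the output row back to a fractional source row.
        y = i * (h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, h - 1)
        dy = y - y0
        row = []
        for j in range(out_w):
            # Map the output column back to a fractional source column.
            x = j * (w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, w - 1)
            dx = x - x0
            # Blend the four neighbouring source pixels.
            top = img[y0][x0] * (1 - dx) + img[y0][x1] * dx
            bottom = img[y1][x0] * (1 - dx) + img[y1][x1] * dx
            row.append(top * (1 - dy) + bottom * dy)
        out.append(row)
    return out


def conv3x3(img, kernel):
    """Zero-padded 3x3 cross-correlation (assumed stand-in for a deformable convolution)."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii, jj = i + di, j + dj
                    if 0 <= ii < h and 0 <= jj < w:
                        acc += img[ii][jj] * kernel[di + 1][dj + 1]
            out[i][j] = acc
    return out


def upsample_block(img, kernel, scale=2):
    """Interpolate first, then convolve at the target resolution."""
    return conv3x3(bilinear_upsample(img, scale), kernel)
```

Because every output pixel is produced by smooth interpolation before any filtering, the uneven kernel overlap that is commonly blamed for checkerboard patterns in transposed convolutions does not arise; the convolution that follows then has a chance to sharpen what interpolation blurred.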
Pages: 26-39
Number of pages: 14