IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning

被引:37
作者
Luo, Xiaoqing [1 ]
Gao, Yuanhao [1 ]
Wang, Anqi [1 ]
Zhang, Zhancheng [2 ]
Wu, Xiao-Jun [1 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Peoples R China
[2] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
基金
中国国家自然科学基金;
关键词
Image fusion; Feature extraction; Task analysis; Image reconstruction; Decoding; Transforms; Knowledge engineering; Autoencoder; contrastive learning; disentangled feature learning; fusion rule; image fusion; MULTISCALE TRANSFORM; NSCT; EXTRACTION;
D O I
10.1109/TMM.2021.3129354
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an image fusion framework based on separate representation learning, called IFSepR. We believe that both the co-modal image and the multi-modal image have common and private features based on prior knowledge, exploiting this disentangled representation can help to image fusion, especially to fusion rule design. Inspired by the autoencoder network and contrastive learning, a multi-branch encoder with contrastive constraints is built to learn the common and private features of paired images. In the fusion stage, based on the disentangled features, a general fusion rule is designed to integrate the private features, then combining the fused private features and the common feature are fed into the decoder, reconstructing the fused image. We perform a series of evaluations on three typical image fusion tasks, including multi-focus image fusion, infrared and visible image fusion, medical image fusion. Quantitative and qualitative comparison with five state-of-art image fusion methods demonstrates the advantages of our proposed model.
引用
收藏
页码:608 / 623
页数:16
相关论文
共 48 条
[1]  
Aanlib, AANLIB
[2]  
[Anonymous], About us
[3]   A new image quality metric for image fusion: The sum of the correlations of differences [J].
Aslantas, V. ;
Bendes, E. .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2015, 69 (12) :160-166
[4]   Quadtree-based multi-focus image fusion using a weighted focus-measure [J].
Bai, Xiangzhi ;
Zhang, Yu ;
Zhou, Fugen ;
Xue, Bindang .
INFORMATION FUSION, 2015, 22 :105-118
[5]   Fusion of infrared and visual images through region extraction by using multi scale center-surround top-hat transform [J].
Bai, Xiangzhi ;
Zhou, Fugen ;
Xue, Bindang .
OPTICS EXPRESS, 2011, 19 (09) :8444-8457
[6]   Directive Contrast Based Multimodal Medical Image Fusion in NSCT Domain [J].
Bhatnagar, Gaurav ;
Wu, Q. M. Jonathan ;
Liu, Zheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (05) :1014-1024
[7]  
Bousmalis K, 2016, ADV NEUR IN, V29
[8]  
Chen T, 2020, PR MACH LEARN RES, V119
[9]   The contourlet transform: An efficient directional multiresolution image representation [J].
Do, MN ;
Vetterli, M .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (12) :2091-2106
[10]   Image fusion based on wavelet transform with genetic algorithms and human visual system [J].
Dou, Jianfang ;
Qin, Qin ;
Tu, Zimei .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (09) :12491-12517