A general image fusion framework using multi-task semi-supervised learning

被引:20
作者
Wang, Wu [1 ]
Deng, Liang-Jian [1 ]
Vivone, Gemine [2 ,3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Math Sci, Chengdu 611731, Sichuan, Peoples R China
[2] Natl Council Res, Inst Methodol Environm Anal, I-85050 Tito, Italy
[3] NBFC Natl Biodivers Future Ctr, I-90133 Palermo, Italy
基金
中国国家自然科学基金;
关键词
Image fusion; Multi-task; Semi-supervised learning; Laplacian pyramid; Fusion rule; Remote sensing; Medical images; ADVERSARIAL NETWORK; QUALITY ASSESSMENT; ALGORITHM; GAN; MODEL; NEST; MFF;
D O I
10.1016/j.inffus.2024.102414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing image fusion methods primarily focus on solving single -task fusion problems, overlooking the potential information complementarity among multiple fusion tasks. Additionally, there has been no prior research in the field of image fusion that explores the mixed training of labeled and unlabeled data for different fusion tasks. To address these gaps, this paper introduces a novel multi -task semi-supervised learning approach to construct a general image fusion framework. This framework not only facilitates collaborative training for multiple fusion tasks, thereby achieving effective information complementarity among datasets from different fusion tasks, but also promotes the (unsupervised) learning of unlabeled data via the (supervised) learning of labeled data. Regarding the specific network module, we propose a so-called pseudo-siamese Laplacian pyramid transformer (PSLPT), which can effectively distinguish information at different frequencies in source images and discriminatively fuse features from distinct frequencies. More specifically, we take datasets of four typical image fusion tasks into the same PSLPT for weight updates, yielding the final general fusion model. Extensive experiments demonstrate that the obtained general fusion model exhibits promising outcomes for all four image fusion tasks, both visually and quantitatively. Moreover, comprehensive ablation and discussion experiments corroborate the effectiveness of the proposed method. The code is available at https: //github.com/wwhappylife/A-general-image-fusion-framework-using-multi-task-semi-supervised-learning.
引用
收藏
页数:16
相关论文
共 72 条
[1]   Image fusion based on nonsubsampled contourlet transform for infrared and visible light image [J].
Adu, Jianhua ;
Gan, Jianhong ;
Wang, Yan ;
Huang, Jian .
INFRARED PHYSICS & TECHNOLOGY, 2013, 61 :94-100
[2]   Learning a Deep Single Image Contrast Enhancer from Multi-Exposure Images [J].
Cai, Jianrui ;
Gu, Shuhang ;
Zhang, Lei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (04) :2049-2062
[3]  
Cai MR, 2015, INT CONF SOFTW ENG, P237, DOI 10.1109/ICSESS.2015.7339045
[4]   Zero-shot semi-supervised learning for pansharpening [J].
Cao, Qi ;
Deng, Liang-Jian ;
Wang, Wu ;
Hou, Junming ;
Vivone, Gemine .
INFORMATION FUSION, 2024, 101
[5]   A new automated quality assessment algorithm for image fusion [J].
Chen, Yin ;
Blum, Rick S. .
IMAGE AND VISION COMPUTING, 2009, 27 (10) :1421-1432
[6]   PSRT: Pyramid Shuffle-and-Reshuffle Transformer for Multispectral and Hyperspectral Image Fusion [J].
Deng, Shang-Qi ;
Deng, Liang-Jian ;
Wu, Xiao ;
Ran, Ran ;
Hong, Danfeng ;
Vivone, Gemine .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[7]   Zero-Shot Hyperspectral Sharpening [J].
Dian, Renwei ;
Guo, Anjing ;
Li, Shutao .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) :12650-12666
[8]   Spectral Super-Resolution via Model-Guided Cross-Fusion Network [J].
Dian, Renwei ;
Shan, Tianci ;
He, Wei ;
Liu, Haibo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) :10059-10070
[9]   Laplacian Pyramid Dense Network for Hyperspectral Pansharpening [J].
Dong, Wenqian ;
Zhang, Tongzhen ;
Qu, Jiahui ;
Xiao, Song ;
Liang, Jie ;
Li, Yunsong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   Union Laplacian pyramid with multiple features for medical image fusion [J].
Du, Jiao ;
Li, Weisheng ;
Xiao, Bin ;
Nawaz, Qamar .
NEUROCOMPUTING, 2016, 194 :326-339