Infrared and visible image fusion based on a two-stage class conditioned auto-encoder network

被引:7
|
作者
Cao, Yanpeng [1 ,2 ]
Luo, Xing [1 ,2 ]
Tong, Xi [1 ,2 ]
Yang, Jiangxin [1 ,2 ]
Cao, Yanlong [1 ,2 ]
机构
[1] Zhejiang Univ, Sch Mech Engn, State Key Lab Fluid Power Transmiss & Control, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Sch Mech Engn, Key Lab Adv Mfg Technol Zhejiang Prov, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Infrared imaging; Image fusion; Conditional learning; Auto; -encoder; PERFORMANCE; ALGORITHM; NEST;
D O I
10.1016/j.neucom.2023.126248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing auto-encoder based infrared and visible image fusion methods typically utilize a shared encoder to extract features from different modalities and adopt a handcrafted fusion strategy to fuse the extracted features into intermediate representation before the decoder part. In this paper, we present a novel two -stage class conditioned auto-encoder framework for high-quality multispectral fusion tasks. In the first training stage, we introduce a class embedding sub-branch to the encoder network for modeling the characteristics of different modalities and adaptively scaling the intermediate features based on the input modality. Moreover, we design a cross-transfer residual block to promote the content and texture infor-mation flow in the encoder for generating more representative features. In the second training stage, we insert a learnable fusion module between the pre-trained class conditioned encoder and decoder parts to replace the handcrafted fusion strategy. Specific intensity and gradient loss functions are utilized to tune the model for the fusion of distinctive deep features in a data-driven manner. With the important designs including the class conditioned auto-encoder and the two-stage training strategy, our proposed TS-ClassFuse can better preserve distinctive information/features from the source images and decrease the training difficulty for simultaneously extracting informative features and determining the optimal fusion scheme. Experimental results verify the effectiveness of our method in terms of both qualitative and quantitative evaluations.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] FECFusion: Infrared and visible image fusion network based on fast edge convolution
    Chen, Zhaoyu
    Fan, Hongbo
    Ma, Meiyan
    Shao, Dangguo
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 16060 - 16082
  • [22] Infrared and visible image fusion based on infrared background suppression
    Yang, Yang
    Ren, Zhennan
    Li, Beichen
    Lang, Yue
    Pan, Xiaoru
    Li, Ruihai
    Ge, Ming
    OPTICS AND LASERS IN ENGINEERING, 2023, 164
  • [23] Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain
    Nahed Tawfik
    Heba A. Elnemr
    Mahmoud Fakhr
    Moawad I. Dessouky
    Fathi E. Abd El-Samie
    Journal of Digital Imaging, 2022, 35 : 1308 - 1325
  • [24] CAMF: An Interpretable Infrared and Visible Image Fusion Network Based on Class Activation Mapping
    Tang, Linfeng
    Chen, Ziang
    Huang, Jun
    Ma, Jiayi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4776 - 4791
  • [25] Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain
    Tawfik, Nahed
    Elnemr, Heba A.
    Fakhr, Mahmoud
    Dessouky, Moawad I.
    Abd El-Samie, Fathi E.
    JOURNAL OF DIGITAL IMAGING, 2022, 35 (05) : 1308 - 1325
  • [26] RADFNet: An infrared and visible image fusion framework based on distributed network
    Feng, Siling
    Wu, Can
    Lin, Cong
    Huang, Mengxing
    FRONTIERS IN PLANT SCIENCE, 2023, 13
  • [27] Fully convolutional network-based infrared and visible image fusion
    Feng, Yufang
    Lu, Houqing
    Bai, Jingbo
    Cao, Lin
    Yin, Hong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15001 - 15014
  • [28] MSFNet: MultiStage Fusion Network for infrared and visible image fusion
    Wang, Chenwu
    Wu, Junsheng
    Zhu, Zhixiang
    Chen, Hao
    NEUROCOMPUTING, 2022, 507 : 26 - 39
  • [29] MEFuse: end-to-end infrared and visible image fusion method based on multibranch encoder
    Hong, Yulu
    Wu, Xiao-Jun
    Xu, Tianyang
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)
  • [30] Infrared and visible image fusion via dual encoder based on dense connection
    Lu, Quan
    Zhang, Hongbin
    Yin, Linfei
    PATTERN RECOGNITION, 2025, 163