AFDFusion: An adaptive frequency decoupling fusion network for multi-modality image

被引：2

作者：

Wang, Chengchao ^{[1
]}

Zhao, Zhengpeng ^{[1
]}

Yang, Qiuxia ^{[1
]}

Nie, Rencan ^{[1
]}

Cao, Jinde ^{[2
,3
]}

Pu, Yuanyuan ^{[1
,4
]}

机构：

[1] Yunnan Univ, Coll Informat Sci & Engn, Kunming 650500, Peoples R China

[2] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China

[3] Ahlia Univ, Manama, Bahrain

[4] Univ Key Lab Internet Things Technol & Applicat Yu, Kunming 650500, Yunnan, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 263卷

基金：

中国国家自然科学基金;

关键词：

Image fusion; Adaptive frequency decoupling; Contrastive learning; Associative invariant; Intrinsic specific; ARCHITECTURE; PERFORMANCE; FRAMEWORK; NEST;

D O I：

10.1016/j.eswa.2024.125694

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The multi-modality image fusion goal is to create a single image that provides a comprehensive scene description and conforms to visual perception by integrating complementary information about the merits of the different modalities, e.g ., salient intensities of infrared images and detail textures of visible images. Although some works explore decoupled representations of multi-modality images, they struggle with complex nonlinear relationships, fine modal decoupling, and noise handling. To cope with this issue, we propose an adaptive frequency decoupling module to perceive the associative invariant and inherent specific among cross- modality by dynamically adjusting the learnable low frequency weight of the kernel. Specifically, we utilize a contrastive learning loss for restricting the solution space of feature decoupling to learn representations of both the invariant and specific in the multi-modality images. The underlying idea is that: in decoupling, low frequency features, which are similar in the representation space, should be pulled closer to each other, signifying the associative invariant, while high frequencies are pushed farther away, also indicating the intrinsic specific. Additionally, a multi-stage training manner is introduced into our framework to achieve decoupling and fusion. Stage I, MixEncoder and MixDecoder with the same architecture but different parameters are trained to perform decoupling and reconstruction supervised by the contrastive self-supervised mechanism. Stage II, two feature fusion modules are added to integrate the invariant and specific features and output the fused image. Extensive experiments demonstrated the proposed method superiority over the state-of-the-art methods in both qualitative and quantitative evaluation on two multi-modal image fusion tasks.

引用

页数：16

共 50 条

[1] Equivariant Multi-Modality Image Fusion
Zhao, Zixiang
Hai, Haowen
Zhang, Jiangshe
Zhang, Yulun
Zhane, Kai
Xu, Shuang
Chen, Dongdong
Timofte, Radu
Van Gool, Luc
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 25912 - 25921
[2] Multi-modality Fusion Network for Action Recognition
Huang, Kai
Qin, Zheng
Xu, Kaiping
Ye, Shuxiong
Wang, Guolong
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 139 - 149
[3] ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion
Huang, Zhanbo
Liu, Jinyuan
Fan, Xin
Liu, Risheng
Zhong, Wei
Luo, Zhongxuan
COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 539 - 555
[4] Multi-Modality Image Fusion in Adaptive-Parameters SPCNN Based on Inherent Characteristics of Image
Zhang, Lixia
Zeng, Guangping
Wei, Jinjin
Xuan, Zhaocheng
IEEE SENSORS JOURNAL, 2020, 20 (20) : 11820 - 11827
[5] Multi-modality image fusion for image-guided neurosurgery
Haller, JW
Ryken, T
Madsen, M
Edwards, A
Bolinger, L
Vannier, MW
CARS '99: COMPUTER ASSISTED RADIOLOGY AND SURGERY, 1999, 1191 : 681 - 685
[6] STAFuse: A Feature Decomposition Network with Super Token Attention for Multi-modality Image Fusion
Chen, Peng
Chen, Aiguo
Wang, Chuang
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 324 - 335
[7] An Interpretable Fusion Siamese Network for Multi-Modality Remote Sensing Ship Image Retrieval
Xiong, Wei
Xiong, Zhenyu
Cui, Yaqi
Huang, Linzhou
Yang, Ruining
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2696 - 2712
[8] Multi-Modality Medical Image Fusion Using Convolutional Neural Network and Contrast Pyramid
Wang, Kunpeng
Zheng, Mingyao
Wei, Hongyan
Qi, Guanqiu
Li, Yuanyuan
SENSORS, 2020, 20 (08)
[9] DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion
Zhao, Zixiang
Bai, Haowen
Zhu, Yuanzhi
Zhang, Jiangshe
Xu, Shuang
Zhang, Yulun
Zhang, Kai
Meng, Deyu
Timofte, Radu
Van Gool, Luc
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8048 - 8059
[10] Fast saliency-aware multi-modality image fusion
Han, Jungong
Pauwels, Eric J.
de Zeeuw, Paul
NEUROCOMPUTING, 2013, 111 : 70 - 80

← 1 2 3 4 5 →