CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion

被引:259
作者
Zhao, Zixiang [1 ,2 ]
Bai, Haowen [1 ]
Zhang, Jiangshe [1 ]
Zhang, Yulun [2 ]
Xu, Shuang [3 ,4 ]
Lin, Zudi [5 ]
Timofte, Radu [2 ,6 ]
Van Gool, Luc [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[3] Northwestern Polytech Univ Shenzhen, Inst Res & Dev, Shenzhen, Peoples R China
[4] Northwestern Polytech Univ, Xian, Peoples R China
[5] Harvard Univ, Cambridge, England
[6] Univ Wurzburg, Wurzburg, Germany
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
基金
中国国家自然科学基金;
关键词
NETWORK;
D O I
10.1109/CVPR52729.2023.00572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modality (MM) image fusion aims to render fused images that maintain the merits of different modalities, e.g., functional highlight and detailed textures. To tackle the challenge in modeling cross-modality features and decomposing desirable modality-specific and modality-shared features, we propose a novel Correlation-Driven feature Decomposition Fusion (CDDFuse) network. Firstly, CDDFuse uses Restormer blocks to extract cross-modality shallow features. We then introduce a dual-branch Transformer-CNN feature extractor with Lite Transformer (LT) blocks leveraging long-range attention to handle low-frequency global features and Invertible Neural Networks (INN) blocks focusing on extracting high-frequency local information. A correlation-driven loss is further proposed to make the low-frequency features correlated while the high-frequency features un-correlated based on the embedded information. Then, the LT-based global fusion and INN-based local fusion layers output the fused image. Extensive experiments demonstrate that our CDDFuse achieves promising results in multiple fusion tasks, including infrared-visible image fusion and medical image fusion. We also show that CDDFuse can boost the performance in downstream infrared-visible semantic segmentation and object detection in a unified benchmark. The code is available at https://github.com/Zhaozixiang1228/MMIF-CDDFuse.
引用
收藏
页码:5906 / 5916
页数:11
相关论文
共 96 条
  • [51] Mirza Mehdi, 2014, CONDITIONAL GENERATI
  • [52] Qin H., 2022, ICLR
  • [53] Decision-making under uncertainty for buildings exposed to environmental hazards
    Qin, Hao
    [J]. JOURNAL OF SAFETY SCIENCE AND RESILIENCE, 2022, 3 (01): : 1 - 14
  • [54] Forward and Backward Information Retention for Accurate Binary Neural Networks
    Qin, Haotong
    Gong, Ruihao
    Liu, Xianglong
    Shen, Mingzhu
    Wei, Ziran
    Yu, Fengwei
    Song, Jingkuan
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2247 - 2256
  • [55] BASNet: Boundary-Aware Salient Object Detection
    Qin, Xuebin
    Zhang, Zichen
    Huang, Chenyang
    Gao, Chao
    Dehghan, Masood
    Jagersand, Martin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7471 - 7481
  • [56] MobileNetV2: Inverted Residuals and Linear Bottlenecks
    Sandler, Mark
    Howard, Andrew
    Zhu, Menglong
    Zhmoginov, Andrey
    Chen, Liang-Chieh
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4510 - 4520
  • [57] DetFusion: A Detection-driven Infrared and Visible Image Fusion Network
    Sun, Yiming
    Cao, Bing
    Zhu, Pengfei
    Hu, Qinghua
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4003 - 4011
  • [58] SuperFusion: A Versatile Image Registration and Fusion Network with Semantic Awareness
    Tang, Linfeng
    Deng, Yuxin
    Ma, Yong
    Huang, Jun
    Ma, Jiayi
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (12) : 2121 - 2137
  • [59] PIAFusion: A progressive infrared and visible image fusion network based on illumination aware
    Tang, Linfeng
    Yuan, Jiteng
    Zhang, Hao
    Jiang, Xingyu
    Ma, Jiayi
    [J]. INFORMATION FUSION, 2022, 83 : 79 - 92
  • [60] Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network
    Tang, Linfeng
    Yuan, Jiteng
    Ma, Jiayi
    [J]. INFORMATION FUSION, 2022, 82 : 28 - 42