BMFNet: Bifurcated multi-modal fusion network for RGB-D salient object detection

Cited by: 2
Authors
Sun, Chenwang [1]
Zhang, Qing [1]
Zhuang, Chenyu [1]
Zhang, Mingqian [2]
Affiliations
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
[2] Shanghai Inst Technol, Sch Mech Engn, Shanghai 201418, Peoples R China
Funding
Natural Science Foundation of Shanghai;
Keywords
RGB-D salient object detection; Cross-modal fusion; Multi-modal integration; Multi-level aggregation; IMAGE;
DOI
10.1016/j.imavis.2024.105048
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Although deep learning-based RGB-D salient object detection methods have achieved impressive results in recent years, some issues still need to be addressed, including multi-modal fusion and multi-level aggregation. In this paper, we propose a bifurcated multi-modal fusion network (BMFNet) to address these two issues jointly. First, we design a multi-modal feature interaction (MFI) module that captures the complementary information between the RGB and depth features by leveraging channel attention and spatial attention. Second, unlike the widely used layer-by-layer progressive fusion, we adopt a bifurcated fusion strategy for all the multi-level unimodal and cross-modal features to reduce the gaps between features at different levels. For intra-group feature aggregation, a multi-modal feature fusion (MFF) module integrates the intra-group multi-modal features to produce a low-level or high-level saliency feature. For inter-group aggregation, a multi-scale feature learning (MFL) module exploits the contextual interactions between different scales to boost fusion performance. Experimental results on five public RGB-D datasets demonstrate the effectiveness and superiority of the proposed network. The code and prediction maps will be available at https://github.com/ZhangQing0329/BMFNet
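The abstract states only that the MFI module leverages channel attention and spatial attention to exchange information between the RGB and depth features; it does not give the module's layers. The sketch below is therefore not the authors' MFI module. It is a minimal, parameter-free illustration of the general idea, assuming that each modality is gated by attention weights computed from the other modality and then added back as a residual. All function names are hypothetical, features are plain nested lists shaped channels x height x width, and the learned convolutions a real network would use are omitted.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat):
    """One weight per channel: sigmoid of the channel's global average."""
    weights = []
    for channel in feat:
        values = [v for row in channel for v in row]
        weights.append(sigmoid(sum(values) / len(values)))
    return weights


def spatial_attention(feat):
    """One H x W map: sigmoid of the mean over channels at each location."""
    n = len(feat)
    h, w = len(feat[0]), len(feat[0][0])
    return [
        [sigmoid(sum(feat[c][i][j] for c in range(n)) / n) for j in range(w)]
        for i in range(h)
    ]


def gate_with_residual(target, guide):
    """Reweight `target` with attention derived from `guide`, then add `target` back."""
    cw = channel_attention(guide)
    sw = spatial_attention(guide)
    return [
        [
            [v + v * cw[c] * sw[i][j] for j, v in enumerate(row)]
            for i, row in enumerate(channel)
        ]
        for c, channel in enumerate(target)
    ]


def cross_modal_interaction(rgb, depth):
    """Assumed symmetric exchange: RGB is guided by depth and depth by RGB."""
    return gate_with_residual(rgb, depth), gate_with_residual(depth, rgb)


if __name__ == "__main__":
    rgb = [[[1.0, 2.0], [3.0, 4.0]], [[0.0, 0.0], [0.0, 0.0]]]
    depth = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    rgb_out, depth_out = cross_modal_interaction(rgb, depth)
    print(rgb_out[0])
```

With an all-zero depth map, every attention weight is sigmoid(0) = 0.5, so each RGB value is scaled by 1 + 0.5 * 0.5 = 1.25. The residual term keeps the original signal intact when the guiding modality carries little information, which is one common reason to pair attention gating with a skip connection.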
Pages: 15