Bidirectional feature learning network for RGB-D salient object detection

被引:1
作者
Niu, Ye
Zhou, Sanping [1 ]
Dong, Yonghao
Wang, Le
Wang, Jinjun
Zheng, Nanning
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
基金
国家重点研发计划; 中国博士后科学基金;
关键词
RGB-D salient object detection; Bidirectional feature fusion; Dual consistency loss; IMAGE; FUSION;
D O I
10.1016/j.patcog.2024.110304
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB-D salient object detection aims to perform the pixel-wise localization of salient objects from both RGB and depth images, whose challenge mainly comes from how to learn complementary features from each modality. Existing works often use increasingly large models for performance enhancement, which need large memory and time consumption in practice. In this paper, we propose a simple yet effective Bidirectional Feature Learning Network (BFLNet) for RGB-D salient object detection under limited memory and time conditions. To achieve accurate performance with lightweight backbone networks, an effective Bidirectional Feature Fusion (BFF) module is designed to merge features from both RGB and depth streams, in which the crossmodal fusions and cross-scale fusions are jointly conducted to fuse the immediate features in multiple scales and multiple modals. What is more, a simple Dual Consistency Loss (DCL) function is designed to prompt cross -modal fusion by keeping the consistency between cross -modal target predictions. Extensive experiments on four benchmark datasets demonstrate that our method has achieved the state-of-the-art performance with high efficiency in RGB-D salient object detection. Code will be available at https://github.com/nightskynostar/BFLNet.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] LIANet: Layer Interactive Attention Network for RGB-D Salient Object Detection
    Han, Yibo
    Wang, Liejun
    Du, Anyu
    Jiang, Shaochen
    IEEE ACCESS, 2022, 10 : 25435 - 25447
  • [22] Salient Object Detection in RGB-D Videos
    Mou, Ao
    Lu, Yukang
    He, Jiahao
    Min, Dingyao
    Fu, Keren
    Zhao, Qijun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6660 - 6675
  • [23] Scenario potentiality-constrain network for RGB-D salient object detection
    Zong, Guanyu
    Li, Xu
    Xu, Qimin
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [24] Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection
    Chen, Baian
    Chen, Zhilei
    Hu, Xiaowei
    Xu, Jun
    Xie, Haoran
    Qin, Jing
    Wei, Mingqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (01)
  • [25] Depth cue enhancement and guidance network for RGB-D salient object detection
    Li, Xiang
    Zhang, Qing
    Yan, Weiqi
    Dai, Meng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [26] FCDHNet: A feature cross-dimensional hybrid network for RGB-D salient object detection
    Wang, Feifei
    Zheng, Panpan
    Li, Yongming
    Wang, Liejun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [27] TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network
    Liu, Zhengyi
    Wang, Yuan
    Tu, Zhengzheng
    Xiao, Yun
    Tang, Bin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4481 - 4490
  • [28] Depth Enhanced Cross-Modal Cascaded Network for RGB-D Salient Object Detection
    Zhao, Zhengyun
    Huang, Ziqing
    Chai, Xiuli
    Wang, Jun
    NEURAL PROCESSING LETTERS, 2023, 55 (01) : 361 - 384
  • [29] Cross-modal and multi-level feature refinement network for RGB-D salient object detection
    Gao, Yue
    Dai, Meng
    Zhang, Qing
    VISUAL COMPUTER, 2023, 39 (09) : 3979 - 3994
  • [30] EFGNet: Encoder steered multi-modality feature guidance network for RGB-D salient object detection
    Xia, Chenxing
    Duan, Songsong
    Fang, Xianjin
    Gao, Xiuju
    Sun, Yanguang
    Ge, Bin
    Zhang, Hanling
    Li, Kuan-Ching
    DIGITAL SIGNAL PROCESSING, 2022, 131