Bidirectional feature learning network for RGB-D salient object detection

被引：1

作者：

Niu, Ye

Zhou, Sanping ^{[1
]}

Dong, Yonghao

Wang, Le

Wang, Jinjun

Zheng, Nanning

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 150卷

基金：

国家重点研发计划; 中国博士后科学基金;

关键词：

RGB-D salient object detection; Bidirectional feature fusion; Dual consistency loss; IMAGE; FUSION;

D O I：

10.1016/j.patcog.2024.110304

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

RGB-D salient object detection aims to perform the pixel-wise localization of salient objects from both RGB and depth images, whose challenge mainly comes from how to learn complementary features from each modality. Existing works often use increasingly large models for performance enhancement, which need large memory and time consumption in practice. In this paper, we propose a simple yet effective Bidirectional Feature Learning Network (BFLNet) for RGB-D salient object detection under limited memory and time conditions. To achieve accurate performance with lightweight backbone networks, an effective Bidirectional Feature Fusion (BFF) module is designed to merge features from both RGB and depth streams, in which the crossmodal fusions and cross-scale fusions are jointly conducted to fuse the immediate features in multiple scales and multiple modals. What is more, a simple Dual Consistency Loss (DCL) function is designed to prompt cross -modal fusion by keeping the consistency between cross -modal target predictions. Extensive experiments on four benchmark datasets demonstrate that our method has achieved the state-of-the-art performance with high efficiency in RGB-D salient object detection. Code will be available at https://github.com/nightskynostar/BFLNet.

引用

页数：9

共 50 条

[21] LIANet: Layer Interactive Attention Network for RGB-D Salient Object Detection
Han, Yibo
Wang, Liejun
Du, Anyu
Jiang, Shaochen
IEEE ACCESS, 2022, 10 : 25435 - 25447
[22] Salient Object Detection in RGB-D Videos
Mou, Ao
Lu, Yukang
He, Jiahao
Min, Dingyao
Fu, Keren
Zhao, Qijun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6660 - 6675
[23] Scenario potentiality-constrain network for RGB-D salient object detection
Zong, Guanyu
Li, Xu
Xu, Qimin
KNOWLEDGE-BASED SYSTEMS, 2025, 310
[24] Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection
Chen, Baian
Chen, Zhilei
Hu, Xiaowei
Xu, Jun
Xie, Haoran
Qin, Jing
Wei, Mingqiang
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (01)
[25] Depth cue enhancement and guidance network for RGB-D salient object detection
Li, Xiang
Zhang, Qing
Yan, Weiqi
Dai, Meng
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
[26] FCDHNet: A feature cross-dimensional hybrid network for RGB-D salient object detection
Wang, Feifei
Zheng, Panpan
Li, Yongming
Wang, Liejun
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
[27] TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network
Liu, Zhengyi
Wang, Yuan
Tu, Zhengzheng
Xiao, Yun
Tang, Bin
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4481 - 4490
[28] Depth Enhanced Cross-Modal Cascaded Network for RGB-D Salient Object Detection
Zhao, Zhengyun
Huang, Ziqing
Chai, Xiuli
Wang, Jun
NEURAL PROCESSING LETTERS, 2023, 55 (01) : 361 - 384
[29] Cross-modal and multi-level feature refinement network for RGB-D salient object detection
Gao, Yue
Dai, Meng
Zhang, Qing
VISUAL COMPUTER, 2023, 39 (09) : 3979 - 3994
[30] EFGNet: Encoder steered multi-modality feature guidance network for RGB-D salient object detection
Xia, Chenxing
Duan, Songsong
Fang, Xianjin
Gao, Xiuju
Sun, Yanguang
Ge, Bin
Zhang, Hanling
Li, Kuan-Ching
DIGITAL SIGNAL PROCESSING, 2022, 131

← 1 2 3 4 5 →