Cross-modal and multi-level feature refinement network for RGB-D salient object detection

被引:0
作者
Yue Gao
Meng Dai
Qing Zhang
机构
[1] Shanghai Institute of Technology,School of Computer Science and Information Engineering
来源
The Visual Computer | 2023年 / 39卷
关键词
RGB-D salient object detection; Cross-modal feature interaction; Multi-level feature fusion; Skip connection;
D O I
暂无
中图分类号
学科分类号
摘要
RGB-D salient object detection (SOD) methods adopt depth maps as important supplementary information in order to identify salient objects more accurately. However, there are still two main challenges in the existing RGB-D SOD methods. One typical issue is how to obtain effective cross-modal features, and another issue is how to optimize the integration of multi-level features. To tackle these two issues, we propose a novel cross-modal and multi-level feature refinement network which equips with a cross-modal feature interaction module and a multi-level feature fusion module. Specifically, a cross-modal feature interaction module is designed to enhance depth features from both channel and spatial perspectives and then effectively integrate cross-modal features. Moreover, considering the characteristics of different levels of features, we propose a multi-level feature fusion module which combines contextual information from multi-level features by means of skip connection. Extensive experiments on five benchmark datasets demonstrate that our proposed model outperforms other 17 state-of-the-art RGB-D SOD methods.
引用
收藏
页码:3979 / 3994
页数:15
相关论文
共 89 条
[1]  
Tsai C-C(2018)Image co-saliency detection and co-segmentation via progressive joint optimization IEEE Trans. Image Process. 28 56-71
[2]  
Li W(2016)Saliency detection for stereoscopic images based on depth confidence analysis and multiple cues fusion IEEE Signal Process. Lett. 23 819-823
[3]  
Hsu K-J(2021)Multi-level progressive parallel attention guided salient object detection for RGB-D images Vis. Comput. 37 529-540
[4]  
Qian X(2017)Cnns-based RGB-D saliency detection via cross-view transfer and multiview fusion IEEE Trans. Cybern. 48 3171-3183
[5]  
Lin Y-Y(2017)RGBD salient object detection via deep fusion IEEE Trans. Image Process. 26 2274-2285
[6]  
Cong R(2019)Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection Pattern Recogn. 86 376-385
[7]  
Lei J(2019)Three-stream attention-aware network for RGB-D salient object detection IEEE Trans. Image Process. 28 2825-2835
[8]  
Zhang C(2020)Rethinking RGB-D salient object detection: models, data sets, and large-scale benchmarks IEEE Trans. Neural Netw. Learn. Syst. 32 2075-2089
[9]  
Huang Q(2019)Salient object detection for RGB-D image by single stream recurrent convolution neural network Neurocomputing 363 46-57
[10]  
Cao X(2021)Hierarchical alternate interaction network for RGB-D salient object detection IEEE Trans. Image Process. 30 3528-3542