Accurate RGB-D Salient Object Detection via Collaborative Learning

被引:157
作者
Ji, Wei [1 ]
Li, Jingjing [1 ]
Zhang, Miao [1 ]
Piao, Yongri [1 ]
Lu, Huchuan [1 ,2 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
[2] Pengcheng Lab, Shenzhen, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT XVIII | 2020年 / 12363卷
基金
中国国家自然科学基金;
关键词
ATTENTION; NETWORK;
D O I
10.1007/978-3-030-58523-5_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Benefiting from the spatial cues embedded in depth images, recent progress on RGB-D saliency detection shows impressive ability on some challenge scenarios. However, there are still two limitations. One hand is that the pooling and upsampling operations in FCNs might cause blur object boundaries. On the other hand, using an additional depth-network to extract depth features might lead to high computation and storage cost. The reliance on depth inputs during testing also limits the practical applications of current RGB-D models. In this paper, we propose a novel collaborative learning framework where edge, depth and saliency are leveraged in a more efficient way, which solves those problems tactfully. The explicitly extracted edge information goes together with saliency to give more emphasis to the salient regions and object boundaries. Depth and saliency learning is innovatively integrated into the high-level feature learning process in a mutual-benefit manner. This strategy enables the network to be free of using extra depth networks and depth inputs to make inference. To this end, it makes our model more lightweight, faster and more versatile. Experiment results on seven benchmark datasets show its superior performance.
引用
收藏
页码:52 / 69
页数:18
相关论文
共 70 条
[51]   Visual Tracking: An Experimental Survey [J].
Smeulders, Arnold W. M. ;
Chu, Dung M. ;
Cucchiara, Rita ;
Calderara, Simone ;
Dehghan, Afshin ;
Shah, Mubarak .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) :1442-1468
[52]  
Wang LJ, 2015, PROC CVPR IEEE, P3183, DOI 10.1109/CVPR.2015.7298938
[53]  
Wang P, 2015, PROC CVPR IEEE, P2800, DOI 10.1109/CVPR.2015.7298897
[54]  
Wang W., 2019, TPAMI, V1
[55]  
Wang WG, 2021, Arxiv, DOI arXiv:1904.09146
[56]   Salient Object Detection Driven by Fixation Prediction [J].
Wang, Wenguan ;
Shen, Jianbing ;
Dong, Xingping ;
Borji, Ali .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1711-1720
[57]   Deep Visual Attention Prediction [J].
Wang, Wenguan ;
Shen, Jianbing .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) :2368-2378
[58]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19
[59]   Cascaded Partial Decoder for Fast and Accurate Salient Object Detection [J].
Wu, Zhe ;
Su, Li ;
Huang, Qingming .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3902-3911
[60]   Hierarchical Saliency Detection [J].
Yan, Qiong ;
Xu, Li ;
Shi, Jianping ;
Jia, Jiaya .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :1155-1162