Research Progress of RGB-D Salient Object Detection in Deep Learning Era

被引:0
|
作者
Cong R.-M. [1 ,2 ]
Zhang C. [1 ,2 ]
Xu M. [3 ]
Liu H.-Y. [1 ,2 ]
Zhao Y. [1 ,2 ]
机构
[1] Institute of Information Science, Beijing Jiaotong University, Beijing
[2] Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing
[3] School of Electronic and Information Engineering, Beihang University, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2023年 / 34卷 / 04期
关键词
cross-modality information interaction; depth quality perception; RGB-D images; salient object detection;
D O I
10.13328/j.cnki.jos.006700
中图分类号
学科分类号
摘要
Inspired by the human visual attention mechanism, salient object detection (SOD) aims to detect the most attractive and interesting object or region in a given scene. In recent years, with the development and popularization of depth cameras, depth map has been successfully applied to various computer vision tasks, which also provides new ideas for the salient object detection task at the same time. The introduction of depth map not only enables the computer to simulate the human visual system more comprehensively, but also provides new solutions for the detection of some difficult scenes, such as low contrast and complex backgrounds by utilizing the structure information and location information of the depth map. In view of the rapid development of RGB-D SOD task in the era of deep learning, this studyaims to sort out and summarize the existing related research outputs from the perspective of key scientific problem solutions, and conduct the quantitative analysis and qualitative comparison of different methods on the commonly used RGB-D SOD datasets. Finally, the challenges and prospects are summarized for the future development trends. © 2023 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:1711 / 1731
页数:20
相关论文
共 113 条
  • [1] Wang WG, Shen JB, Jia YD., Review of visual attention detection, Ruan Jian Xue Bao/Journal of Software, 30, 2, pp. 416-439, (2019)
  • [2] Katsuki F, Constantinidis C., Bottom-up and top-down attention: Different processes and overlapping neural systems, The Neuroscientist, 20, 5, pp. 509-521, (2014)
  • [3] Itti L, Koch C, Niebur E., A model of saliency-based visual attention for rapid scene analysis, IEEE Trans. on Pattern Analysis and Machine Intelligence, 20, 11, pp. 1254-1259, (1998)
  • [4] Cong R, Lei J, Fu H, Et al., Review of visual saliency detection with comprehensive information, IEEE Trans. on Circuits and Systems for Video Technology, 29, 10, pp. 2941-2959, (2018)
  • [5] Zeng Y, Zhuge Y, Lu H, Et al., Joint learning of saliency detection and weakly supervised semantic segmentation, Proc. of the IEEE/CVF Int’l Conf. on Computer Vision, pp. 7223-7233, (2019)
  • [6] Guo C, Zhang L., A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression, IEEE Trans. on Image Processing, 19, 1, pp. 185-198, (2009)
  • [7] Xiao DG, Xin C, Zhang T, Et al., Saliency texture structure descriptor and its application in pedestrian detection, Ruan Jian Xue Bao/Journal of Software, 25, 3, pp. 675-689, (2014)
  • [8] Fan DP, Zhang J, Xu G, Et al., Salient objects in clutter, (2021)
  • [9] Chen Z, Xu Q, Cong R, Et al., Global context-aware progressive aggregation network for salient object detection, Proc. of the AAAI Conf. on Artificial Intelligence, 34, 7, pp. 10599-10606, (2020)
  • [10] Zeng Y, Zhang P, Zhang J, Et al., Towards high-resolution salient object detection, Proc. of the IEEE/CVF Int’l Conf. on Computer Vision, pp. 7234-7243, (2019)