Improved Saliency Detection in RGB-D Images Using Two-Phase Depth Estimation and Selective Deep Fusion

Cited by: 90

Authors:

Chen, Chenglizhao [1]

Wei, Jipeng [1]

Peng, Chong [1]

Zhang, Weizhong [1]

Qin, Hong [2]

Affiliations:

[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Peoples R China

[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020 / Vol. 29

Funding:

National Natural Science Foundation of China; U.S. National Science Foundation

Keywords:

RGB-D saliency detection; inter-image correspondences; low-level saliency; selective deep fusion; OBJECT DETECTION; VIDEO;

DOI:

10.1109/TIP.2020.2968250

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

To solve the saliency detection problem in RGB-D images, the depth information plays a critical role in distinguishing salient objects or foregrounds from cluttered backgrounds. As the complementary component to color information, the depth quality directly dictates the subsequent saliency detection performance. However, due to artifacts and the limitation of depth acquisition devices, the quality of the obtained depth varies tremendously across different scenarios. Consequently, conventional selective fusion-based RGB-D saliency detection methods may result in a degraded detection performance in cases containing salient objects with low color contrast coupled with a low depth quality. To solve this problem, we make our initial attempt to estimate additional high-quality depth information, which is denoted by Depth(+). Serving as a complement to the original depth, Depth(+) will be fed into our newly designed selective fusion network to boost the detection performance. To achieve this aim, we first retrieve a small group of images that are similar to the given input, and then the inter-image, nonlocal correspondences are built accordingly. Thus, by using these inter-image correspondences, the overall depth can be coarsely estimated by utilizing our newly designed depth-transferring strategy. Next, we build fine-grained, object-level correspondences coupled with a saliency prior to further improve the depth quality of the previous estimation. Compared to the original depth, our newly estimated Depth(+) is potentially more informative for detection improvement. Finally, we feed both the original depth and the newly estimated Depth(+) into our selective deep fusion network, whose key novelty is to achieve an optimal complementary balance to make better decisions toward improving saliency boundaries.
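The coarse estimation step described in the abstract (retrieve a few similar images, then transfer their depth to the input) can be illustrated with a minimal sketch. This is not the authors' depth-transferring strategy: it assumes each image is already summarized by a global feature vector, uses cosine similarity for retrieval, and replaces the inter-image correspondences with a plain similarity-weighted average of the retrieved depth maps. The function names `cosine_similarity`, `retrieve_similar`, and `transfer_depth` are hypothetical, and all depth maps are assumed to share the same size.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def retrieve_similar(query_feat, gallery_feats, k):
    """Return (index, similarity) pairs for the k gallery images closest to the query."""
    scored = [(i, cosine_similarity(query_feat, f)) for i, f in enumerate(gallery_feats)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


def transfer_depth(neighbors, gallery_depths):
    """Coarse depth estimate: similarity-weighted average of the neighbors' depth maps.

    Assumption: this averaging stands in for the correspondence-based transfer
    in the paper. Requires at least one neighbor.
    """
    weights = [max(sim, 0.0) for _, sim in neighbors]
    total = sum(weights)
    if total == 0:
        # No positively similar neighbor: fall back to a uniform average.
        weights = [1.0] * len(neighbors)
        total = float(len(neighbors))
    first = gallery_depths[neighbors[0][0]]
    height, width = len(first), len(first[0])
    coarse = [[0.0] * width for _ in range(height)]
    for (idx, _), w in zip(neighbors, weights):
        depth = gallery_depths[idx]
        for r in range(height):
            for c in range(width):
                coarse[r][c] += w * depth[r][c] / total
    return coarse
```

In this sketch a call such as `transfer_depth(retrieve_similar(query, feats, 3), depths)` yields a coarse depth map for the query image. The object-level refinement and the selective deep fusion network mentioned in the abstract are outside its scope.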


Pages: 4296-4307

Page count: 12
