Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection

被引:98
作者
Zhang, Miao [1 ]
Fei, Sun Xiao [1 ]
Liu, Jie [1 ]
Xu, Shuang [1 ]
Piao, Yongri [1 ]
Lu, Huchuan [1 ,2 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
[2] Pengcheng Lab, Shenzhen, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT XXVIII | 2020年 / 12373卷
基金
中国国家自然科学基金;
关键词
Saliency detection; Flow ladder; Depth attention; OBJECT DETECTION; NETWORK;
D O I
10.1007/978-3-030-58604-1_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing RGB-D saliency detection methods adopt symmetric two-stream architectures for learning discriminative RGB and depth representations. In fact, there is another level of ambiguity that is often overlooked: if RGB and depth data are necessary to fit into the same network. In this paper, we propose an asymmetric two-stream architecture taking account of the inherent differences between RGB and depth data for saliency detection. First, we design a flow ladder module (FLM) for the RGB stream to fully extract global and local information while maintaining the saliency details. This is achieved by constructing four detail-transfer branches, each of which preserves the detail information and receives global location information from representations of other vertical parallel branches in an evolutionary way. Second, we propose a novel depth attention module (DAM) to ensure depth features with high discriminative power in location and spatial structure being effectively utilized when combined with RGB features in challenging scenes. The depth features can also discriminatively guide the RGB features via our proposed DAM to precisely locate the salient objects. Extensive experiments demonstrate that our method achieves superior performance over 13 state-of-the-art RGB-D approaches on the 7 datasets. Our code will be publicly available.
引用
收藏
页码:374 / 390
页数:17
相关论文
共 49 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]  
Borji A., 2012, 2012 IEEE COMP SOC C, P23, DOI DOI 10.1109/CVPRW.2012.6239191
[3]   Salient Object Detection: A Benchmark [J].
Borji, Ali ;
Sihite, Dicky N. ;
Itti, Laurent .
COMPUTER VISION - ECCV 2012, PT II, 2012, 7573 :414-429
[4]   Three-Stream Attention-Aware Network for RGB-D Salient Object Detection [J].
Chen, Hao ;
Li, Youfu .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) :2825-2835
[5]   Progressively Complementarity-aware Fusion Network for RGB-D Salient Object Detection [J].
Chen, Hao ;
Li, Youfu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3051-3060
[6]   Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection [J].
Chen, Hao ;
Li, Youfu ;
Su, Dan .
PATTERN RECOGNITION, 2019, 86 :376-385
[7]   Intelligent Visual Media Processing: When Graphics Meets Vision [J].
Cheng, Ming-Ming ;
Hou, Qi-Bin ;
Zhang, Song-Hai ;
Rosin, Paul L. .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (01) :110-121
[8]  
Cheng Y, 2014, IEEE INT CON MULTI
[9]   Saliency Detection for Stereoscopic Images Based on Depth Confidence Analysis and Multiple Cues Fusion [J].
Cong, Runmin ;
Lei, Jianjun ;
Zhang, Changqing ;
Huang, Qingming ;
Cao, Xiaochun ;
Hou, Chunping .
IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (06) :819-823
[10]  
Dai JF, 2016, ADV NEUR IN, V29