EF-Net: A novel enhancement and fusion network for RGB-D saliency detection

被引:46
作者
Chen, Qian [1 ]
Fu, Keren [2 ]
Liu, Ze [1 ]
Chen, Geng [3 ]
Du, Hongwei [1 ]
Qiu, Bensheng [1 ]
Shao, Ling [3 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Anhui, Peoples R China
[2] Sichuan Univ, Coll Comp Sci, Chengdu, Sichuan, Peoples R China
[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Salient object detection; RGB-D image; Depth enhancement; Feature fusion; OBJECT DETECTION; ATTENTION;
D O I
10.1016/j.patcog.2020.107740
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) has gained tremendous attention in the field of computer vision. Multimodal SOD based on the complementary information from RGB images and depth maps has shown remarkable success, making RGB-D saliency detection an active research topic. In this paper, we propose a novel multi-modal enhancement and fusion network (EF-Net) for effective RGB-D saliency detection. Specifically, we first utilize a color hint map module with RGB images to predict a hint map, which encodes the coarse information of salient objects. The resulting hint map is then utilized to enhance the depth map with our depth enhancement module, which suppresses the noise and sharpens the object boundary. Finally, we propose an effective layer-wise aggregation module to fuse the features extracted from the enhanced depth maps and RGB images for the accurate detection of salient objects. Our EF-Net utilizes an enhancement-and-fusion framework for saliency detection, which makes full use of the information from RGB images and depth maps. In addition, our depth enhancement module effectively resolves the low-quality issue of depth maps, which boosts the saliency detection performance remarkably. Extensive experiments on five widely-used benchmark datasets demonstrate that our method outperforms 12 state-of-the-art RGB-D saliency detection approaches in terms of five key evaluation metrics. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 72 条
[31]   A Novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression [J].
Guo, Chenlei ;
Zhang, Liming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (01) :185-198
[32]  
Guo J., 2016, Int. Conf. Multimedia and Expo, P1
[33]   CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion [J].
Han, Junwei ;
Chen, Hao ;
Liu, Nian ;
Yan, Chenggang ;
Li, Xuelong .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (11) :3171-3183
[34]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[35]   Deeply Supervised Salient Object Detection with Short Connections [J].
Hou, Qibin ;
Cheng, Ming-Ming ;
Hu, Xiaowei ;
Borji, Ali ;
Tu, Zhuowen ;
Torr, Philip .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5300-5309
[36]  
Huang Z., 2020, ARXIV200714352
[37]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[38]   Automatic Salient Object Segmentation Based on Context and Shape Prior [J].
Jiang, Huaizu ;
Wang, Jingdong ;
Yuan, Zejian ;
Liu, Tie ;
Zheng, Nanning ;
Li, Shipeng .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[39]  
Jiang Y, 2020, ARXIVABS201004968
[40]  
Ju R, 2014, IEEE IMAGE PROC, P1115, DOI 10.1109/ICIP.2014.7025222