JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection

被引：268

作者：

Fu, Keren ^{[1
]}

Fan, Deng-Ping ^{[2
,3
]}

Ji, Ge-Peng ^{[4
]}

Zhao, Qijun ^{[1
]}

机构：

[1] Sichuan Univ, Coll Comp Sci, Chengdu, Sichuan, Peoples R China

[2] Nankai Univ, Coll CS, Tianjin, Peoples R China

[3] Wuhan Univ, Incept Inst Artificial Intelligence, Wuhan, Peoples R China

[4] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年

关键词：

IMAGE; SEGMENTATION; MODEL; DEEP; CONTRAST; NETWORK;

D O I：

10.1109/CVPR42600.2020.00312

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a novel joint learning and densely-cooperative fusion (JL-DCF) architecture for RGB-D salient object detection. Existing models usually treat RGB and depth as independent information and design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately-designed training process. In contrast, our JL-DCF learns from both RGB and depth inputs through a Siamese network. To this end, we propose two effective components: joint learning (JL), and densely-cooperative fusion (DCF). The JL module provides robust saliency feature learning, while the latter is introduced for complementary feature discovery. Comprehensive experiments on four popular metrics show that the designed framework yields a robust RGB-D saliency detector with good generalization. As a result, JL-DCF significantly advances the top-1 D3Net model by an average of similar to 1.9% (S-measure) across six challenging datasets, showing that the proposed framework offers a potential solution for real-world applications and could provide more insight into the cross-modality complementarity task The code will be available at https://github.com/kerenfu/JLDCF/.

引用

页码：3049 / 3059

页数：11

共 80 条

[1]

Al Azzeh J., 2016, Int. J. Comput. Appl, V153, P31

[2]

[Anonymous], 2015, ICLR

[3] Salient Object Detection: A Benchmark [J].

Borji, Ali ;

Cheng, Ming-Ming ;

Jiang, Huaizu ;

Li, Jia .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5706-5722

[4]

Borji Ali, 2019, [Computational Visual Media, 计算可视媒体], V5, P117

[5] Three-Stream Attention-Aware Network for RGB-D Salient Object Detection [J].

Chen, Hao ;

Li, Youfu .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) :2825-2835

[6] Progressively Complementarity-aware Fusion Network for RGB-D Salient Object Detection [J].

Chen, Hao ;

Li, Youfu .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3051-3060

[7] Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection [J].

Chen, Hao ;

Li, Youfu ;

Su, Dan .

PATTERN RECOGNITION, 2019, 86 :376-385

[8] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[9] Sketch2Photo: Internet Image Montage [J].

Chen, Tao ;

Cheng, Ming-Ming ;

Tan, Ping ;

Shamir, Ariel ;

Hu, Shi-Min .

ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (05) :1-10

[10] Global Contrast based Salient Region Detection [J].

Cheng, Ming-Ming ;

Zhang, Guo-Xin ;

Mitra, Niloy J. ;

Huang, Xiaolei ;

Hu, Shi-Min .

2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :409-416

← 1 2 3 4 5 6 7 8 →