Existing state-of-the-art RGB-D saliency detection models mainly utilize depth information as complementary cues to enhance the RGB information. However, depth maps are easily corrupted by environmental factors and hence often contain heavy noise. Indiscriminately integrating multi-modality (i.e., RGB and depth) features may therefore yield noise-degraded saliency maps. In this paper, we propose a novel Adaptive Fusion Network (AFNet) to address this problem. Specifically, we design a triplet encoder network consisting of three subnetworks that process RGB, depth, and fused features, respectively. The three subnetworks are interlinked and form a grid net to facilitate mutual refinement of these multi-modality features. Moreover, we propose a Multi-modality Feature Interaction (MFI) module to exploit the complementary cues between the depth and RGB modalities and adaptively fuse the multi-modality features. Finally, we design a Cascaded Feature Interweaved Decoder (CFID) that exploits the complementary information between multi-level features and refines them iteratively to achieve accurate saliency detection. Experimental results on six commonly used benchmark datasets verify that the proposed AFNet outperforms 20 state-of-the-art counterparts in terms of six widely adopted evaluation metrics. The source code will be publicly available at https://github.com/clelouch/AFNet upon paper acceptance.
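The core idea of adaptively fusing RGB and depth features can be illustrated with a minimal gating sketch. Note that this is not the authors' MFI implementation; the layer layout, channel sizes, and gating formulation below are assumptions for illustration only.

```python
# Minimal sketch of adaptive RGB-D feature fusion (illustrative; not the paper's MFI module).
import torch
import torch.nn as nn


class AdaptiveFusion(nn.Module):
    """Fuses RGB and depth features with a learned, per-pixel gate so that
    noisy depth cues can be down-weighted instead of being merged blindly."""

    def __init__(self, channels: int):
        super().__init__()
        # Predict a spatial gate from the concatenated modalities (assumed design choice).
        self.gate = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, 1, kernel_size=1),
            nn.Sigmoid(),
        )
        self.refine = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, rgb_feat: torch.Tensor, depth_feat: torch.Tensor) -> torch.Tensor:
        # w is in [0, 1]; large values trust depth, small values fall back to RGB.
        w = self.gate(torch.cat([rgb_feat, depth_feat], dim=1))  # (B, 1, H, W)
        fused = w * depth_feat + (1.0 - w) * rgb_feat
        return self.refine(fused)


if __name__ == "__main__":
    m = AdaptiveFusion(channels=64)
    rgb = torch.randn(2, 64, 56, 56)
    depth = torch.randn(2, 64, 56, 56)
    print(m(rgb, depth).shape)  # torch.Size([2, 64, 56, 56])
```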