Adaptive fusion network for RGB-D salient object detection

Cited by: 34

Authors:

Chen, Tianyou [1]

Xiao, Jin [1]

Hu, Xiaoguang [1]

Zhang, Guofeng [1]

Wang, Shaojie [1]

Affiliation:

[1] Beihang Univ, 37 Xueyuan Rd, Beijing 100191, Peoples R China

Source:

NEUROCOMPUTING | 2023 / Volume 522

Keywords:

RGB-D salient object detection; Multi-modality feature interaction; Adaptive fusion; Deep learning;

DOI:

10.1016/j.neucom.2022.12.004

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Existing state-of-the-art RGB-D saliency detection models mainly use depth information as a complementary cue to enhance the RGB information. However, depth maps are easily influenced by the environment and are therefore often noisy. Thus, indiscriminately integrating multi-modality (i.e., RGB and depth) features may induce noise-degraded saliency maps. In this paper, we propose a novel Adaptive Fusion Network (AFNet) to solve this problem. Specifically, we design a triplet encoder network consisting of three subnetworks to process RGB, depth, and fused features, respectively. The three subnetworks are interlinked and form a grid net to facilitate mutual refinement of these multi-modality features. Moreover, we propose a Multi-modality Feature Interaction (MFI) module to exploit complementary cues between the depth and RGB modalities and adaptively fuse the multi-modality features. Finally, we design the Cascaded Feature Interweaved Decoder (CFID) to exploit complementary information between multi-level features and refine them iteratively to achieve accurate saliency detection. Experimental results on six commonly used benchmark datasets verify that the proposed AFNet outperforms 20 state-of-the-art counterparts in terms of six widely adopted evaluation metrics. Source code will be publicly available at https://github.com/clelouch/AFNet upon paper acceptance. © 2022 Elsevier B.V. All rights reserved.
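
The abstract does not specify how the MFI module weighs the two modalities, so the following is only a minimal sketch of the general idea of gated (adaptive) fusion, not the authors' method. It assumes that features are flat lists of floats and that a single scalar gate, derived from the mean activation of each stream, decides how much the depth features contribute. The function name `adaptive_fuse` and the parameters `w_rgb`, `w_depth`, and `bias` are hypothetical.

```python
import math
from typing import List

Vector = List[float]


def sigmoid(x: float) -> float:
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def adaptive_fuse(
    rgb: Vector,
    depth: Vector,
    w_rgb: float = 1.0,     # hypothetical weight on the RGB descriptor
    w_depth: float = 1.0,   # hypothetical weight on the depth descriptor
    bias: float = 0.0,      # hypothetical gate bias
) -> Vector:
    """Blend two equally sized feature vectors with one scalar gate.

    This is an illustrative stand-in for adaptive fusion in general;
    it is NOT the MFI module described in the abstract.
    """
    if len(rgb) != len(depth):
        raise ValueError("rgb and depth must have the same length")
    if not rgb:
        raise ValueError("feature vectors must not be empty")

    # Global descriptors: the mean activation of each modality.
    mean_rgb = sum(rgb) / len(rgb)
    mean_depth = sum(depth) / len(depth)

    # Gate in (0, 1): the share of the output taken from the depth stream.
    gate = sigmoid(w_depth * mean_depth - w_rgb * mean_rgb + bias)

    # Convex combination, so each output lies between its two inputs.
    return [(1.0 - gate) * r + gate * d for r, d in zip(rgb, depth)]


if __name__ == "__main__":
    fused = adaptive_fuse([0.9, 0.8, 0.7], [0.1, 0.2, 0.0])
    print(fused)
```

In a trained network the gate parameters would be learned, typically per channel or per pixel, so that unreliable depth maps receive a low weight; the scalar gate here only keeps the example short.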


Pages: 152-164

Page count: 13

