Adaptive fusion network for RGB-D salient object detection

被引:34
作者
Chen, Tianyou [1 ]
Xiao, Jin [1 ]
Hu, Xiaoguang [1 ]
Zhang, Guofeng [1 ]
Wang, Shaojie [1 ]
机构
[1] Beihang Univ, 37 Xueyuan Rd, Beijing 100191, Peoples R China
关键词
RGB-D salient object detection; Multi-modality feature interaction; Adaptive fusion; Deep learning;
D O I
10.1016/j.neucom.2022.12.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing state-of-the-art RGB-D saliency detection models mainly utilize the depth information as com-plementary cues to enhance the RGB information. However, depth maps can be easily influenced by envi-ronment and hence are full of noises. Thus, indiscriminately integrating multi-modality (i.e., RGB and depth) features may induce noise-degraded saliency maps. In this paper, we propose a novel Adaptive Fusion Network (AFNet) to solve this problem. Specifically, we design a triplet encoder network consist-ing of three subnetworks to process RGB, depth, and fused features, respectively. The three subnetworks are interlinked and form a grid net to facilitate mutual refinement of these multi-modality features. Moreover, we propose a Multi-modality Feature Interaction (MFI) module to exploit complementary cues between depth and RGB modalities and adaptively fuse the multi-modality features. Finally, we design the Cascaded Feature Interweaved Decoder (CFID) to exploit complementary information between multi-level features and refine them iteratively to achieve accurate saliency detection. Experimental results on six commonly used benchmark datasets verify that the proposed AFNet outperforms 20 state-of-the-art counterparts in terms of six widely adopted evaluation metrics. Source code will be pub-licly available athttps://github.com/clelouch/AFNet upon paper acceptance. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:152 / 164
页数:13
相关论文
共 74 条
[11]   Progressively Guided Alternate Refinement Network for RGB-D Salient Object Detection [J].
Chen, Shuhan ;
Fu, Yun .
COMPUTER VISION - ECCV 2020, PT VIII, 2020, 12353 :520-538
[12]   Boundary-guided network for camouflaged object detection [J].
Chen, Tianyou ;
Xiao, Jin ;
Hu, Xiaoguang ;
Zhang, Guofeng ;
Wang, Shaojie .
KNOWLEDGE-BASED SYSTEMS, 2022, 248
[13]   BINet: Bidirectional interactive network for salient object detection [J].
Chen, Tianyou ;
Hu, Xiaoguang ;
Xiao, Jin ;
Zhang, Guofeng ;
Wang, Shaojie .
NEUROCOMPUTING, 2021, 465 :490-502
[14]   DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection [J].
Chen, Zuyao ;
Cong, Runmin ;
Xu, Qianqian ;
Huang, Qingming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :7012-7024
[15]   Global Contrast Based Salient Region Detection [J].
Cheng, Ming-Ming ;
Mitra, Niloy J. ;
Huang, Xiaolei ;
Torr, Philip H. S. ;
Hu, Shi-Min .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) :569-582
[16]   RepFinder: Finding Approximately Repeated Scene Elements for Image Editing [J].
Cheng, Ming-Ming ;
Zhang, Fang-Lue ;
Mitra, Niloy J. ;
Huang, Xiaolei ;
Hu, Shi-Min .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)
[17]  
Cheng Y., 2014, ICIMCS, P23
[18]  
Dosovitskiy Alexey, 2021, P ICLR
[19]   Structure-measure: A New Way to Evaluate Foreground Maps [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Yun ;
Li, Tao ;
Borji, Ali .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567
[20]  
Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698