Specificity-preserving RGB-D saliency detection

被引:0
作者
Tao Zhou
Deng-Ping Fan
Geng Chen
Yi Zhou
Huazhu Fu
机构
[1] Nanjing University of Science and Technology,School of Computer Science and Engineering
[2] Ministry of Education,Key Laboratory of System Control and Information Processing
[3] ETH Zürich,Computer Vision Lab
[4] Northwestern Polytechnical University,School of Computer Science and Engineering
[5] Southeast University,School of Computer Science and Engineering
[6] Inception Institute of Artificial Intelligence,undefined
来源
Computational Visual Media | 2023年 / 9卷
关键词
salient object detection (SOD); RGB-D; cross-enhanced integration module (CIM); multi-modal feature aggregation (MFA);
D O I
暂无
中图分类号
学科分类号
摘要
Salient object detection (SOD) in RGB and depth images has attracted increasing research interest. Existing RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities, while few methods explicitly consider how to preserve modality-specific characteristics. In this study, we propose a novel framework, the specificity-preserving network (SPNet), which improves SOD performance by exploring both the shared information and modality-specific properties. Specifically, we use two modality-specific networks and a shared learning network to generate individual and shared saliency prediction maps. To effectively fuse cross-modal features in the shared learning network, we propose a cross-enhanced integration module (CIM) and propagate the fused feature to the next layer to integrate cross-level information. Moreover, to capture rich complementary multi-modal information to boost SOD performance, we use a multi-modal feature aggregation (MFA) module to integrate the modality-specific features from each individual decoder into the shared decoder. By using skip connections between encoder and decoder layers, hierarchical features can be fully combined. Extensive experiments demonstrate that our SPNet outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection benchmarks. The project is publicly available at https://github.com/taozh2017/SPNet. [graphic not available: see fulltext]
引用
收藏
页码:297 / 317
页数:20
相关论文
共 50 条
  • [31] RGB-D mutual guidance for semi-supervised defocus blur detection
    Li, Huaguang
    Qian, Wenhua
    Nie, Rencan
    Cao, Jinde
    Liu, Peng
    Xu, Dan
    KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [32] DMGNet: Depth mask guiding network for RGB-D salient object detection
    Tang, Yinggan
    Li, Mengyao
    NEURAL NETWORKS, 2024, 180
  • [33] Multiscale multilevel context and multimodal fusion for RGB-D salient object detection
    Wu, Junwei
    Zhou, Wujie
    Luo, Ting
    Yu, Lu
    Lei, Jingsheng
    SIGNAL PROCESSING, 2021, 178
  • [34] Implementaion of 3D Collaborative Object Detection Systems using RGB-D Sensors
    Jun, Sungwoo
    Baek, Jaeuk
    Do, Seungwon
    Lee, ChangEun
    2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 366 - 368
  • [35] 3-D Mapping With an RGB-D Camera
    Endres, Felix
    Hess, Juergen
    Sturm, Juergen
    Cremers, Daniel
    Burgard, Wolfram
    IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (01) : 177 - 187
  • [36] Channel-overcomplete convolutional architectures for RGB-D salient object detection
    Cheng, Longqi
    Wu, Decheng
    Li, Rui
    Cai, Jun
    Yu, Meng
    Li, Yu
    Liu, Sheng
    DIGITAL SIGNAL PROCESSING, 2023, 140
  • [37] Dynamic Knowledge Distillation with Noise Elimination for RGB-D Salient Object Detection
    Ren, Guangyu
    Yu, Yinxiao
    Liu, Hengyan
    Stathaki, Tania
    SENSORS, 2022, 22 (16)
  • [38] Effective Keyframe Extraction from RGB and RGB-D Video Sequences
    Dastjerdi, Niloufar Salehi
    Valognes, Julien
    Amer, Maria A.
    PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [39] Segmentation of Shipping Bags in RGB-D Images
    Vasileva, Elena
    Ivanovski, Zoran
    2022 IEEE 5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING APPLICATIONS AND SYSTEMS, IPAS, 2022,
  • [40] Visual Saliency Prediction Using Attention-based Cross-modal Integration Network in RGB-D Images
    Zhang, Xinyue
    Jin, Ting
    Han, Mingjie
    Lei, Jingsheng
    Cao, Zhichao
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 30 (02) : 439 - 452