CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection

被引:0
|
作者
Yunhua Zhang
Hangxu Wang
Gang Yang
Jianhao Zhang
Congjin Gong
Yutao Wang
机构
[1] Northeastern University,
[2] DUT Artificial Intelligence Institute,undefined
来源
The Visual Computer | 2024年 / 40卷
关键词
Salient object detection; Siamese network; ConvNeXt; RGB-D SOD; Multi-modality;
D O I
暂无
中图分类号
学科分类号
摘要
Global contexts are critical to locating salient objects for salient object detection (SOD). However, the convolution operation in CNNs has a local receptive field, which cannot capture long-distance global information. Recent studies have shown that modernized CNN models with large kernel convolution, such as ConvNeXt, can effectively extend the receptive fields. Based on it, this paper explores the potential of large kernel CNN for SOD task. Inspired by the common information between RGB and depth images in salient objects, we propose a ConvNeXt-based Siamese network with shared weight parameters. This structural design can effectively reduce the number of parameters without sacrificing performance. Furthermore, a depth information preprocessing module is proposed to minimize the impact of low-quality depth images on predicted saliency maps. For cross-modal feature interaction, a dynamic fusion module is designed to enhance cross-modal complementarity dynamically. Extensive experiments and evaluation results on six benchmark datasets demonstrate the outstanding performance of the proposed method against 14 state-of-the-art RGB-D methods. Our code will be released at https://github.com/zyh5119232/CSNet.
引用
收藏
页码:1805 / 1823
页数:18
相关论文
共 50 条
  • [41] Advancing in RGB-D Salient Object Detection: A Survey
    Chen, Ai
    Li, Xin
    He, Tianxiang
    Zhou, Junlin
    Chen, Duanbing
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [42] Adaptive Fusion for RGB-D Salient Object Detection
    Wang, Ningning
    Gong, Xiaojin
    IEEE ACCESS, 2019, 7 : 55277 - 55284
  • [43] RGB-D Point Cloud Registration Based on Salient Object Detection
    Wan, Teng
    Du, Shaoyi
    Cui, Wenting
    Yao, Runzhao
    Ge, Yuyan
    Li, Ce
    Gao, Yue
    Zheng, Nanning
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3547 - 3559
  • [44] RGB-D Fusion Based on Fuzzy Optimization for Salient Object Detection
    Bhuyan, Sudipta
    Sen, Debashis
    Deb, Sankha
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 523 - 531
  • [45] SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection
    Lee, Minhyeok
    Park, Chaewon
    Cho, Suhwan
    Lee, Sangyoun
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 630 - 647
  • [46] Heterogeneous Fusion and Integrity Learning Network for RGB-D Salient Object Detection
    Gao, Haorao
    Su, Yiming
    Wang, Fasheng
    Li, Haojie
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [47] AFLNet: Adversarial focal loss network for RGB-D salient object detection
    Zhao, Xiaoli
    Chen, Zheng
    Hwang, Jenq-Neng
    Shang, Xiwu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 94
  • [48] Perceptual localization and focus refinement network for RGB-D salient object detection
    Han, Jinyu
    Wang, Mengyin
    Wu, Weiyi
    Jia, Xu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [49] TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network
    Liu, Zhengyi
    Wang, Yuan
    Tu, Zhengzheng
    Xiao, Yun
    Tang, Bin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4481 - 4490
  • [50] A multiple-attention refinement network for RGB-D salient object detection
    Jiang, Zijian
    Yu, Ling
    Li, Junru
    Niu, Fanglin
    IET IMAGE PROCESSING, 2024, 18 (14) : 4551 - 4562