CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection

被引:0
|
作者
Yunhua Zhang
Hangxu Wang
Gang Yang
Jianhao Zhang
Congjin Gong
Yutao Wang
机构
[1] Northeastern University,
[2] DUT Artificial Intelligence Institute,undefined
来源
The Visual Computer | 2024年 / 40卷
关键词
Salient object detection; Siamese network; ConvNeXt; RGB-D SOD; Multi-modality;
D O I
暂无
中图分类号
学科分类号
摘要
Global contexts are critical to locating salient objects for salient object detection (SOD). However, the convolution operation in CNNs has a local receptive field, which cannot capture long-distance global information. Recent studies have shown that modernized CNN models with large kernel convolution, such as ConvNeXt, can effectively extend the receptive fields. Based on it, this paper explores the potential of large kernel CNN for SOD task. Inspired by the common information between RGB and depth images in salient objects, we propose a ConvNeXt-based Siamese network with shared weight parameters. This structural design can effectively reduce the number of parameters without sacrificing performance. Furthermore, a depth information preprocessing module is proposed to minimize the impact of low-quality depth images on predicted saliency maps. For cross-modal feature interaction, a dynamic fusion module is designed to enhance cross-modal complementarity dynamically. Extensive experiments and evaluation results on six benchmark datasets demonstrate the outstanding performance of the proposed method against 14 state-of-the-art RGB-D methods. Our code will be released at https://github.com/zyh5119232/CSNet.
引用
收藏
页码:1805 / 1823
页数:18
相关论文
共 50 条
  • [31] Scale Adaptive Fusion Network for RGB-D Salient Object Detection
    Kong, Yuqiu
    Zheng, Yushuo
    Yao, Cuili
    Liu, Yang
    Wang, He
    COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 608 - 625
  • [32] Salient object detection for RGB-D images by generative adversarial network
    Liu, Zhengyi
    Tang, Jiting
    Xiang, Qian
    Zhao, Peng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25403 - 25425
  • [33] An adaptive guidance fusion network for RGB-D salient object detection
    Haodong Sun
    Yu Wang
    Xinpeng Ma
    Signal, Image and Video Processing, 2024, 18 : 1683 - 1693
  • [34] Adaptive Depth Enhancement Network for RGB-D Salient Object Detection
    Yi, Kang
    Li, Yumeng
    Tang, Haoran
    Xu, Jing
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 176 - 180
  • [35] Salient object detection for RGB-D images by generative adversarial network
    Zhengyi Liu
    Jiting Tang
    Qian Xiang
    Peng Zhao
    Multimedia Tools and Applications, 2020, 79 : 25403 - 25425
  • [36] TANet: Transformer-based asymmetric network for RGB-D salient object detection
    Liu, Chang
    Yang, Gang
    Wang, Shuo
    Wang, Hangxu
    Zhang, Yunhua
    Wang, Yutao
    IET COMPUTER VISION, 2023, 17 (04) : 415 - 430
  • [37] Transformer-based difference fusion network for RGB-D salient object detection
    Cui, Zhi-Qiang
    Wang, Feng
    Feng, Zheng-Yong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [38] A cascaded refined rgb-d salient object detection network based on the attention mechanism
    Zong, Guanyu
    Wei, Longsheng
    Guo, Siyuan
    Wang, Yongtao
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13527 - 13548
  • [39] A cascaded refined rgb-d salient object detection network based on the attention mechanism
    Guanyu Zong
    Longsheng Wei
    Siyuan Guo
    Yongtao Wang
    Applied Intelligence, 2023, 53 : 13527 - 13548
  • [40] DVSOD: RGB-D Video Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Wang, Size
    Li, Wenbo
    Cheng, Li
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,