CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection

被引：0

作者：

Yunhua Zhang

Hangxu Wang

Gang Yang

Jianhao Zhang

Congjin Gong

Yutao Wang

机构：

[1] Northeastern University,

[2] DUT Artificial Intelligence Institute,undefined

来源：

The Visual Computer | 2024年 / 40卷

关键词：

Salient object detection; Siamese network; ConvNeXt; RGB-D SOD; Multi-modality;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Global contexts are critical to locating salient objects for salient object detection (SOD). However, the convolution operation in CNNs has a local receptive field, which cannot capture long-distance global information. Recent studies have shown that modernized CNN models with large kernel convolution, such as ConvNeXt, can effectively extend the receptive fields. Based on it, this paper explores the potential of large kernel CNN for SOD task. Inspired by the common information between RGB and depth images in salient objects, we propose a ConvNeXt-based Siamese network with shared weight parameters. This structural design can effectively reduce the number of parameters without sacrificing performance. Furthermore, a depth information preprocessing module is proposed to minimize the impact of low-quality depth images on predicted saliency maps. For cross-modal feature interaction, a dynamic fusion module is designed to enhance cross-modal complementarity dynamically. Extensive experiments and evaluation results on six benchmark datasets demonstrate the outstanding performance of the proposed method against 14 state-of-the-art RGB-D methods. Our code will be released at https://github.com/zyh5119232/CSNet.

引用

页码：1805 / 1823

页数：18

共 50 条

[31] Scale Adaptive Fusion Network for RGB-D Salient Object Detection
Kong, Yuqiu
Zheng, Yushuo
Yao, Cuili
Liu, Yang
Wang, He
COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 608 - 625
[32] Salient object detection for RGB-D images by generative adversarial network
Liu, Zhengyi
Tang, Jiting
Xiang, Qian
Zhao, Peng
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25403 - 25425
[33] An adaptive guidance fusion network for RGB-D salient object detection
Haodong Sun
Yu Wang
Xinpeng Ma
Signal, Image and Video Processing, 2024, 18 : 1683 - 1693
[34] Adaptive Depth Enhancement Network for RGB-D Salient Object Detection
Yi, Kang
Li, Yumeng
Tang, Haoran
Xu, Jing
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 176 - 180
[35] Salient object detection for RGB-D images by generative adversarial network
Zhengyi Liu
Jiting Tang
Qian Xiang
Peng Zhao
Multimedia Tools and Applications, 2020, 79 : 25403 - 25425
[36] TANet: Transformer-based asymmetric network for RGB-D salient object detection
Liu, Chang
Yang, Gang
Wang, Shuo
Wang, Hangxu
Zhang, Yunhua
Wang, Yutao
IET COMPUTER VISION, 2023, 17 (04) : 415 - 430
[37] Transformer-based difference fusion network for RGB-D salient object detection
Cui, Zhi-Qiang
Wang, Feng
Feng, Zheng-Yong
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
[38] A cascaded refined rgb-d salient object detection network based on the attention mechanism
Zong, Guanyu
Wei, Longsheng
Guo, Siyuan
Wang, Yongtao
APPLIED INTELLIGENCE, 2023, 53 (11) : 13527 - 13548
[39] A cascaded refined rgb-d salient object detection network based on the attention mechanism
Guanyu Zong
Longsheng Wei
Siyuan Guo
Yongtao Wang
Applied Intelligence, 2023, 53 : 13527 - 13548
[40] DVSOD: RGB-D Video Salient Object Detection
Li, Jingjing
Ji, Wei
Wang, Size
Li, Wenbo
Cheng, Li
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →