Specificity-preserving RGB-D saliency detection

被引：0

作者：

Tao Zhou

Deng-Ping Fan

Geng Chen

Yi Zhou

Huazhu Fu

机构：

[1] Nanjing University of Science and Technology,School of Computer Science and Engineering

[2] Ministry of Education,Key Laboratory of System Control and Information Processing

[3] ETH Zürich,Computer Vision Lab

[4] Northwestern Polytechnical University,School of Computer Science and Engineering

[5] Southeast University,School of Computer Science and Engineering

[6] Inception Institute of Artificial Intelligence,undefined

来源：

Computational Visual Media | 2023年 / 9卷

关键词：

salient object detection (SOD); RGB-D; cross-enhanced integration module (CIM); multi-modal feature aggregation (MFA);

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Salient object detection (SOD) in RGB and depth images has attracted increasing research interest. Existing RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities, while few methods explicitly consider how to preserve modality-specific characteristics. In this study, we propose a novel framework, the specificity-preserving network (SPNet), which improves SOD performance by exploring both the shared information and modality-specific properties. Specifically, we use two modality-specific networks and a shared learning network to generate individual and shared saliency prediction maps. To effectively fuse cross-modal features in the shared learning network, we propose a cross-enhanced integration module (CIM) and propagate the fused feature to the next layer to integrate cross-level information. Moreover, to capture rich complementary multi-modal information to boost SOD performance, we use a multi-modal feature aggregation (MFA) module to integrate the modality-specific features from each individual decoder into the shared decoder. By using skip connections between encoder and decoder layers, hierarchical features can be fully combined. Extensive experiments demonstrate that our SPNet outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection benchmarks. The project is publicly available at https://github.com/taozh2017/SPNet. [graphic not available: see fulltext]

引用

页码：297 / 317

页数：20

共 50 条

[31] RGB-D mutual guidance for semi-supervised defocus blur detection
Li, Huaguang
Qian, Wenhua
Nie, Rencan
Cao, Jinde
Liu, Peng
Xu, Dan
KNOWLEDGE-BASED SYSTEMS, 2022, 255
[32] DMGNet: Depth mask guiding network for RGB-D salient object detection
Tang, Yinggan
Li, Mengyao
NEURAL NETWORKS, 2024, 180
[33] Multiscale multilevel context and multimodal fusion for RGB-D salient object detection
Wu, Junwei
Zhou, Wujie
Luo, Ting
Yu, Lu
Lei, Jingsheng
SIGNAL PROCESSING, 2021, 178
[34] Implementaion of 3D Collaborative Object Detection Systems using RGB-D Sensors
Jun, Sungwoo
Baek, Jaeuk
Do, Seungwon
Lee, ChangEun
2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 366 - 368
[35] 3-D Mapping With an RGB-D Camera
Endres, Felix
Hess, Juergen
Sturm, Juergen
Cremers, Daniel
Burgard, Wolfram
IEEE TRANSACTIONS ON ROBOTICS, 2014, 30 (01) : 177 - 187
[36] Channel-overcomplete convolutional architectures for RGB-D salient object detection
Cheng, Longqi
Wu, Decheng
Li, Rui
Cai, Jun
Yu, Meng
Li, Yu
Liu, Sheng
DIGITAL SIGNAL PROCESSING, 2023, 140
[37] Dynamic Knowledge Distillation with Noise Elimination for RGB-D Salient Object Detection
Ren, Guangyu
Yu, Yinxiao
Liu, Hengyan
Stathaki, Tania
SENSORS, 2022, 22 (16)
[38] Effective Keyframe Extraction from RGB and RGB-D Video Sequences
Dastjerdi, Niloufar Salehi
Valognes, Julien
Amer, Maria A.
PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
[39] Segmentation of Shipping Bags in RGB-D Images
Vasileva, Elena
Ivanovski, Zoran
2022 IEEE 5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING APPLICATIONS AND SYSTEMS, IPAS, 2022,
[40] Visual Saliency Prediction Using Attention-based Cross-modal Integration Network in RGB-D Images
Zhang, Xinyue
Jin, Ting
Han, Mingjie
Lei, Jingsheng
Cao, Zhichao
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 30 (02) : 439 - 452

← 1 2 3 4 5 →