Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection

被引：39

作者：

Cong, Runmin ^{[1
,2
,5
]}

Liu, Hongyu ^{[1
,6
,7
]}

Zhang, Chen ^{[1
,6
,7
]}

Zhang, Wei ^{[2
,5
]}

Zheng, Feng ^{[3
]}

Song, Ran ^{[2
,5
]}

Kwong, Sam ^{[4
]}

机构：

[1] Beijing Jiaotong Univ, Beijing, Peoples R China

[2] Shandong Univ, Jinan, Shandong, Peoples R China

[3] Southern Univ Sci & Technol, Shenzhen, Guangdong, Peoples R China

[4] City Univ Hong Kong, Hong Kong, Peoples R China

[5] Minist Educ, Key Lab Machine Intelligence & Syst Control, Jinan, Shandong, Peoples R China

[6] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China

[7] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

中国国家自然科学基金;

关键词：

salient object detection; RGB-D images; CNNs-assisted Transformer architecture; point-aware interaction; FUSION;

D O I：

10.1145/3581783.3611982

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

By integrating complementary information from RGB image and depth map, the ability of salient object detection (SOD) for complex and challenging scenes can be improved. In recent years, the important role of Convolutional Neural Networks (CNNs) in feature extraction and cross-modality interaction has been fully explored, but it is still insufficient in modeling global long-range dependencies of self-modality and cross-modality. To this end, we introduce CNNs-assisted Transformer architecture and propose a novel RGB-D SOD network with Point-aware Interaction and CNN-induced Refinement (PICR-Net). On the one hand, considering the prior correlation between RGB modality and depth modality, an attention-triggered cross-modality point-aware interaction (CmPI) module is designed to explore the feature interaction of different modalities with positional constraints. On the other hand, in order to alleviate the block effect and detail destruction problems brought by the Transformer naturally, we design a CNN-induced refinement (CNNR) unit for content refinement and supplementation. Extensive experiments on five RGB-D SOD datasets show that the proposed network achieves competitive results in both quantitative and qualitative comparisons. Our code is publicly available at: https://github.com/rmcong/PICR-Net_ACMMM23.

引用

页码：406 / 416

页数：11

共 57 条

[21] Middle-Level Feature Fusion for Lightweight RGB-D Salient Object Detection [J].

Huang, Nianchang ;

Jiao, Qiang ;

Zhang, Qiang ;

Han, Jungong .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :6621-6634

[22] Calibrated RGB-D Salient Object Detection [J].

Ji, Wei ;

Li, Jingjing ;

Yu, Shuang ;

Zhang, Miao ;

Piao, Yongri ;

Yao, Shunyu ;

Bi, Qi ;

Ma, Kai ;

Zheng, Yefeng ;

Lu, Huchuan ;

Cheng, Li .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :9466-9476

[23]

Jiang YD, 2019, ADV NEUR IN, V32

[24] CDNet: Complementary Depth Network for RGB-D Salient Object Detection [J].

Jin, Wen-Da ;

Xu, Jun ;

Han, Qi ;

Zhang, Yi ;

Cheng, Ming-Ming .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3376-3390

[25] Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection [J].

Jing, Dong ;

Zhang, Shuo ;

Cong, Runmin ;

Lin, Youfang .

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :1692-1701

[26] SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection [J].

Lee, Minhyeok ;

Park, Chaewon ;

Cho, Suhwan ;

Lee, Sangyoun .

COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 :630-647

[27] ASIF-Net: Attention Steered Interweave Fusion Network for RGB-D Salient Object Detection [J].

Li, Chongyi ;

Cong, Runmin ;

Kwong, Sam ;

Hou, Junhui ;

Fu, Huazhu ;

Zhu, Guopu ;

Zhang, Dingwen ;

Huang, Qingming .

IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) :88-100

[28] RGB-D Salient Object Detection with Cross-Modality Modulation and Selection [J].

Li, Chongyi ;

Cong, Runmin ;

Piao, Yongri ;

Xu, Qianqian ;

Loy, Chen Change .

COMPUTER VISION - ECCV 2020, PT VIII, 2020, 12353 :225-241

[29] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection [J].

Li, Gongyang ;

Liu, Zhi ;

Chen, Minyu ;

Bai, Zhen ;

Lin, Weisi ;

Ling, Haibin .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3528-3542

[30] Saliency Detection on Light Field [J].

Li, Nianyi ;

Ye, Jinwei ;

Ji, Yu ;

Ling, Haibin ;

Yu, Jingyi .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2806-2813

← 1 2 3 4 5 6 →