DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection

被引:48
|
作者
Ji, Wei [1 ,2 ]
Yan, Ge [2 ]
Li, Jingjing [1 ,2 ]
Piao, Yongri [3 ]
Yao, Shunyu [2 ]
Zhang, Miao [4 ]
Cheng, Li [1 ]
Lu, Huchuan [3 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T5V 1A4, Canada
[2] Dalian Univ Technol, Sch Software Technol, Dalian 116024, Peoples R China
[3] Dalian Univ Technol, Sch Informat & Commun Engn, Fac Elect Informat & Elect Engn, Dalian 116024, Peoples R China
[4] Dalian Univ Technol, DUT RU Int Sch Informat & Software Engn, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian 116024, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Feature extraction; Saliency detection; Semantics; Random access memory; Cameras; Analytical models; Visualization; RGB-D saliency detection; salient object detection; convolutional neural networks; cross-modal fusion; OBJECT DETECTION; FUSION; SEGMENTATION;
D O I
10.1109/TIP.2022.3154931
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a novel depth-induced multi-scale recurrent attention network for RGB-D saliency detection, named as DMRA. It achieves dramatic performance especially in complex scenarios. There are four main contributions of our network that are experimentally demonstrated to have significant practical merits. First, we design an effective depth refinement block using residual connections to fully extract and fuse cross-modal complementary cues from RGB and depth streams. Second, depth cues with abundant spatial information are innovatively combined with multi-scale contextual features for accurately locating salient objects. Third, a novel recurrent attention module inspired by Internal Generative Mechanism of human brain is designed to generate more accurate saliency results via comprehensively learning the internal semantic relation of the fused feature and progressively optimizing local details with memory-oriented scene understanding. Finally, a cascaded hierarchical feature fusion strategy is designed to promote efficient information interaction of multi-level contextual features and further improve the contextual representability of model. In addition, we introduce a new real-life RGB-D saliency dataset containing a variety of complex scenarios that has been widely used as a benchmark dataset in recent RGB-D saliency detection research. Extensive empirical experiments demonstrate that our method can accurately identify salient objects and achieve appealing performance against 18 state-of-the-art RGB-D saliency models on nine benchmark datasets.
引用
收藏
页码:2321 / 2336
页数:16
相关论文
共 50 条
  • [1] Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection
    Piao, Yongri
    Ji, Wei
    Li, Jingjing
    Zhang, Miao
    Lu, Huchuan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7253 - 7262
  • [2] Deep RGB-D Saliency Detection Without Depth
    Zhang, Yuan-fang
    Zheng, Jiangbin
    Jia, Wenjing
    Huang, Wenfeng
    Li, Long
    Liu, Nian
    Li, Fei
    He, Xiangjian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 755 - 767
  • [3] Progressive Guided Fusion Network With Multi-Modal and Multi-Scale Attention for RGB-D Salient Object Detection
    Wu, Jiajia
    Han, Guangliang
    Wang, Haining
    Yang, Hang
    Li, Qingqing
    Liu, Dongxu
    Ye, Fangjian
    Liu, Peixun
    IEEE ACCESS, 2021, 9 : 150608 - 150622
  • [4] RGB-D Saliency Detection based on Cross-Modal and Multi-scale Feature Fusion
    Zhu, Xuxing
    Wu, Jin
    Zhu, Lei
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 6154 - 6160
  • [5] Cross-Stage Multi-Scale Interaction Network for RGB-D Salient Object Detection
    Yi, Kang
    Zhu, Jinchao
    Guo, Fu
    Xu, Jing
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2402 - 2406
  • [6] RGB-D Saliency Detection Based on Attention Mechanism and Multi-Scale Cross-Modal Fusion
    Cui Z.
    Feng Z.
    Wang F.
    Liu Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (06): : 893 - 902
  • [7] Learning Selective Mutual Attention and Contrast for RGB-D Saliency Detection
    Liu, Nian
    Zhang, Ni
    Shao, Ling
    Han, Junwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9026 - 9042
  • [8] Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection
    Liu, Di
    Zhang, Kao
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 967 - 981
  • [9] RGB-D Saliency Detection by Multi-stream Late Fusion Network
    Chen, Hao
    Li, Youfu
    Su, Dan
    COMPUTER VISION SYSTEMS, ICVS 2017, 2017, 10528 : 459 - 468
  • [10] Progressive multi-scale fusion network for RGB-D salient object detection
    Ren, Guangyu
    Xie, Yanchun
    Dai, Tianhong
    Stathaki, Tania
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 223