Boosting Broader Receptive Fields for Salient Object Detection

被引:49
作者
Ma, Mingcan [1 ]
Xia, Changqun [2 ]
Xie, Chenxi [1 ]
Chen, Xiaowu [1 ,2 ]
Li, Jia [1 ,2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Transformers; Object detection; Decoding; Boosting; Switches; Salient object detection; receptive field; bilateral extreme stripping; loop compensation; NETWORK; MODEL;
D O I
10.1109/TIP.2022.3232209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient Object Detection has boomed in recent years and achieved impressive performance on regular-scale targets. However, existing methods encounter performance bottlenecks in processing objects with scale variation, especially extremely large- or small-scale objects with asymmetric segmentation requirements, since they are inefficient in obtaining more comprehensive receptive fields. With this issue in mind, this paper proposes a framework named BBRF for Boosting Broader Receptive Fields, which includes a Bilateral Extreme Stripping (BES) encoder, a Dynamic Complementary Attention Module (DCAM) and a Switch-Path Decoder (SPD) with a new boosting loss under the guidance of Loop Compensation Strategy (LCS). Specifically, we rethink the characteristics of the bilateral networks, and construct a BES encoder that separates semantics and details in an extreme way so as to get the broader receptive fields and obtain the ability to perceive extreme large- or small-scale objects. Then, the bilateral features generated by the proposed BES encoder can be dynamically filtered by the newly proposed DCAM. This module interactively provides spacial-wise and channel-wise dynamic attention weights for the semantic and detail branches of our BES encoder. Furthermore, we subsequently propose a Loop Compensation Strategy to boost the scale-specific features of multiple decision paths in SPD. These decision paths form a feature loop chain, which creates mutually compensating features under the supervision of boosting loss. Experiments on five benchmark datasets demonstrate that the proposed BBRF has a great advantage to cope with scale variation and can reduce the Mean Absolute Error over 20% compared with the state-of-the-art methods.
引用
收藏
页码:1026 / 1038
页数:13
相关论文
共 71 条
  • [61] A Bi-directional Message Passing Model for Salient Object Detection
    Zhang, Lu
    Dai, Ju
    Lu, Huchuan
    He, You
    Wang, Gang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1741 - 1750
  • [62] Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detectionn
    Zhang, Pingping
    Wang, Dong
    Lu, Huchuan
    Wang, Hongyu
    Ruan, Xiang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 202 - 211
  • [63] Progressive Attention Guided Recurrent Network for Salient Object Detection
    Zhang, Xiaoning
    Wang, Tiantian
    Qi, Jinqing
    Lu, Huchuan
    Wang, Gang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 714 - 722
  • [64] Pyramid Scene Parsing Network
    Zhao, Hengshuang
    Shi, Jianping
    Qi, Xiaojuan
    Wang, Xiaogang
    Jia, Jiaya
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6230 - 6239
  • [65] EGNet: Edge Guidance Network for Salient Object Detection
    Zhao, Jia-Xing
    Liu, Jiang-Jiang
    Fan, Deng-Ping
    Cao, Yang
    Yang, Ju-Feng
    Cheng, Ming-Ming
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8778 - 8787
  • [66] Pyramid Feature Attention Network for Saliency detection
    Zhao, Ting
    Wu, Xiangqian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3080 - 3089
  • [67] Zhao X., 2020, P EUR C COMP VIS, P35
  • [68] RGB-D Salient Object Detection With Ubiquitous Target Awareness
    Zhao, Yifan
    Zhao, Jiawei
    Li, Jia
    Chen, Xiaowu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7717 - 7731
  • [69] Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection
    Zhao, Zhirui
    Xia, Changqun
    Xie, Chenxi
    Li, Jia
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4967 - 4975
  • [70] Interactive Two-Stream Decoder for Accurate and Fast Saliency Detection
    Zhou, Huajun
    Xie, Xiaohua
    Lai, Jian-Huang
    Chen, Zixuan
    Yang, Lingxiao
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9138 - 9147