Boosting Broader Receptive Fields for Salient Object Detection

被引:49
作者
Ma, Mingcan [1 ]
Xia, Changqun [2 ]
Xie, Chenxi [1 ]
Chen, Xiaowu [1 ,2 ]
Li, Jia [1 ,2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Transformers; Object detection; Decoding; Boosting; Switches; Salient object detection; receptive field; bilateral extreme stripping; loop compensation; NETWORK; MODEL;
D O I
10.1109/TIP.2022.3232209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient Object Detection has boomed in recent years and achieved impressive performance on regular-scale targets. However, existing methods encounter performance bottlenecks in processing objects with scale variation, especially extremely large- or small-scale objects with asymmetric segmentation requirements, since they are inefficient in obtaining more comprehensive receptive fields. With this issue in mind, this paper proposes a framework named BBRF for Boosting Broader Receptive Fields, which includes a Bilateral Extreme Stripping (BES) encoder, a Dynamic Complementary Attention Module (DCAM) and a Switch-Path Decoder (SPD) with a new boosting loss under the guidance of Loop Compensation Strategy (LCS). Specifically, we rethink the characteristics of the bilateral networks, and construct a BES encoder that separates semantics and details in an extreme way so as to get the broader receptive fields and obtain the ability to perceive extreme large- or small-scale objects. Then, the bilateral features generated by the proposed BES encoder can be dynamically filtered by the newly proposed DCAM. This module interactively provides spacial-wise and channel-wise dynamic attention weights for the semantic and detail branches of our BES encoder. Furthermore, we subsequently propose a Loop Compensation Strategy to boost the scale-specific features of multiple decision paths in SPD. These decision paths form a feature loop chain, which creates mutually compensating features under the supervision of boosting loss. Experiments on five benchmark datasets demonstrate that the proposed BBRF has a great advantage to cope with scale variation and can reduce the Mean Absolute Error over 20% compared with the state-of-the-art methods.
引用
收藏
页码:1026 / 1038
页数:13
相关论文
共 71 条
  • [1] MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution
    Chai, Xiongli
    Shao, Feng
    Jiang, Qiuping
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1208 - 1219
  • [2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [3] Reverse Attention for Salient Object Detection
    Chen, Shuhan
    Tan, Xiuli
    Wang, Ben
    Hu, Xuelong
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 236 - 252
  • [4] Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
  • [5] A Highly Efficient Model to Study the Semantics of Salient Object Detection
    Cheng, Ming-Ming
    Gao, Shang-Hua
    Borji, Ali
    Tan, Yong-Qiang
    Lin, Zheng
    Wang, Meng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8006 - 8021
  • [6] Cong RM, 2022, Arxiv, DOI arXiv:2204.08917
  • [7] RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images
    Cong, Runmin
    Zhang, Yumo
    Fang, Leyuan
    Li, Jun
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] A tutorial on the cross-entropy method
    De Boer, PT
    Kroese, DP
    Mannor, S
    Rubinstein, RY
    [J]. ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) : 19 - 67
  • [9] Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698
  • [10] Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground
    Fan, Deng-Ping
    Cheng, Ming-Ming
    Liu, Jiang-Jiang
    Gao, Shang-Hua
    Hou, Qibin
    Borji, Ali
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 196 - 212