Boosting Broader Receptive Fields for Salient Object Detection

被引:66
作者
Ma, Mingcan [1 ]
Xia, Changqun [2 ]
Xie, Chenxi [1 ]
Chen, Xiaowu [1 ,2 ]
Li, Jia [1 ,2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Transformers; Object detection; Decoding; Boosting; Switches; Salient object detection; receptive field; bilateral extreme stripping; loop compensation; NETWORK; MODEL;
D O I
10.1109/TIP.2022.3232209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient Object Detection has boomed in recent years and achieved impressive performance on regular-scale targets. However, existing methods encounter performance bottlenecks in processing objects with scale variation, especially extremely large- or small-scale objects with asymmetric segmentation requirements, since they are inefficient in obtaining more comprehensive receptive fields. With this issue in mind, this paper proposes a framework named BBRF for Boosting Broader Receptive Fields, which includes a Bilateral Extreme Stripping (BES) encoder, a Dynamic Complementary Attention Module (DCAM) and a Switch-Path Decoder (SPD) with a new boosting loss under the guidance of Loop Compensation Strategy (LCS). Specifically, we rethink the characteristics of the bilateral networks, and construct a BES encoder that separates semantics and details in an extreme way so as to get the broader receptive fields and obtain the ability to perceive extreme large- or small-scale objects. Then, the bilateral features generated by the proposed BES encoder can be dynamically filtered by the newly proposed DCAM. This module interactively provides spacial-wise and channel-wise dynamic attention weights for the semantic and detail branches of our BES encoder. Furthermore, we subsequently propose a Loop Compensation Strategy to boost the scale-specific features of multiple decision paths in SPD. These decision paths form a feature loop chain, which creates mutually compensating features under the supervision of boosting loss. Experiments on five benchmark datasets demonstrate that the proposed BBRF has a great advantage to cope with scale variation and can reduce the Mean Absolute Error over 20% compared with the state-of-the-art methods.
引用
收藏
页码:1026 / 1038
页数:13
相关论文
共 71 条
[1]   MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution [J].
Chai, Xiongli ;
Shao, Feng ;
Jiang, Qiuping ;
Ho, Yo-Sung .
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) :1208-1219
[2]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[3]   Reverse Attention for Salient Object Detection [J].
Chen, Shuhan ;
Tan, Xiuli ;
Wang, Ben ;
Hu, Xuelong .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :236-252
[4]  
Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
[5]   A Highly Efficient Model to Study the Semantics of Salient Object Detection [J].
Cheng, Ming-Ming ;
Gao, Shang-Hua ;
Borji, Ali ;
Tan, Yong-Qiang ;
Lin, Zheng ;
Wang, Meng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :8006-8021
[6]  
Cong RM, 2022, Arxiv, DOI arXiv:2204.08917
[7]   RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images [J].
Cong, Runmin ;
Zhang, Yumo ;
Fang, Leyuan ;
Li, Jun ;
Zhao, Yao ;
Kwong, Sam .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[8]   A tutorial on the cross-entropy method [J].
De Boer, PT ;
Kroese, DP ;
Mannor, S ;
Rubinstein, RY .
ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) :19-67
[9]  
Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698
[10]   Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Jiang-Jiang ;
Gao, Shang-Hua ;
Hou, Qibin ;
Borji, Ali .
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :196-212