Boosting Broader Receptive Fields for Salient Object Detection

被引：66

作者：

Ma, Mingcan ^{[1
]}

Xia, Changqun ^{[2
]}

Xie, Chenxi ^{[1
]}

Chen, Xiaowu ^{[1
,2
]}

Li, Jia ^{[1
,2
]}

机构：

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Semantics; Transformers; Object detection; Decoding; Boosting; Switches; Salient object detection; receptive field; bilateral extreme stripping; loop compensation; NETWORK; MODEL;

D O I：

10.1109/TIP.2022.3232209

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Salient Object Detection has boomed in recent years and achieved impressive performance on regular-scale targets. However, existing methods encounter performance bottlenecks in processing objects with scale variation, especially extremely large- or small-scale objects with asymmetric segmentation requirements, since they are inefficient in obtaining more comprehensive receptive fields. With this issue in mind, this paper proposes a framework named BBRF for Boosting Broader Receptive Fields, which includes a Bilateral Extreme Stripping (BES) encoder, a Dynamic Complementary Attention Module (DCAM) and a Switch-Path Decoder (SPD) with a new boosting loss under the guidance of Loop Compensation Strategy (LCS). Specifically, we rethink the characteristics of the bilateral networks, and construct a BES encoder that separates semantics and details in an extreme way so as to get the broader receptive fields and obtain the ability to perceive extreme large- or small-scale objects. Then, the bilateral features generated by the proposed BES encoder can be dynamically filtered by the newly proposed DCAM. This module interactively provides spacial-wise and channel-wise dynamic attention weights for the semantic and detail branches of our BES encoder. Furthermore, we subsequently propose a Loop Compensation Strategy to boost the scale-specific features of multiple decision paths in SPD. These decision paths form a feature loop chain, which creates mutually compensating features under the supervision of boosting loss. Experiments on five benchmark datasets demonstrate that the proposed BBRF has a great advantage to cope with scale variation and can reduce the Mean Absolute Error over 20% compared with the state-of-the-art methods.

引用

页码：1026 / 1038

页数：13

共 71 条

[1] MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution [J].

Chai, Xiongli ;

Shao, Feng ;

Jiang, Qiuping ;

Ho, Yo-Sung .

IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) :1208-1219

[2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[3] Reverse Attention for Salient Object Detection [J].

Chen, Shuhan ;

Tan, Xiuli ;

Wang, Ben ;

Hu, Xuelong .

COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :236-252

[4]

Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599

[5] A Highly Efficient Model to Study the Semantics of Salient Object Detection [J].

Cheng, Ming-Ming ;

Gao, Shang-Hua ;

Borji, Ali ;

Tan, Yong-Qiang ;

Lin, Zheng ;

Wang, Meng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :8006-8021

[6]

Cong RM, 2022, Arxiv, DOI arXiv:2204.08917

[7] RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images [J].

Cong, Runmin ;

Zhang, Yumo ;

Fang, Leyuan ;

Li, Jun ;

Zhao, Yao ;

Kwong, Sam .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[8] A tutorial on the cross-entropy method [J].

De Boer, PT ;

Kroese, DP ;

Mannor, S ;

Rubinstein, RY .

ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) :19-67

[9]

Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698

[10] Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground [J].

Fan, Deng-Ping ;

Cheng, Ming-Ming ;

Liu, Jiang-Jiang ;

Gao, Shang-Hua ;

Hou, Qibin ;

Borji, Ali .

COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :196-212

← 1 2 3 4 5 6 7 8 →