Boosting Broader Receptive Fields for Salient Object Detection

被引：49

作者：

Ma, Mingcan ^{[1
]}

Xia, Changqun ^{[2
]}

Xie, Chenxi ^{[1
]}

Chen, Xiaowu ^{[1
,2
]}

Li, Jia ^{[1
,2
]}

机构：

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Semantics; Transformers; Object detection; Decoding; Boosting; Switches; Salient object detection; receptive field; bilateral extreme stripping; loop compensation; NETWORK; MODEL;

D O I：

10.1109/TIP.2022.3232209

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Salient Object Detection has boomed in recent years and achieved impressive performance on regular-scale targets. However, existing methods encounter performance bottlenecks in processing objects with scale variation, especially extremely large- or small-scale objects with asymmetric segmentation requirements, since they are inefficient in obtaining more comprehensive receptive fields. With this issue in mind, this paper proposes a framework named BBRF for Boosting Broader Receptive Fields, which includes a Bilateral Extreme Stripping (BES) encoder, a Dynamic Complementary Attention Module (DCAM) and a Switch-Path Decoder (SPD) with a new boosting loss under the guidance of Loop Compensation Strategy (LCS). Specifically, we rethink the characteristics of the bilateral networks, and construct a BES encoder that separates semantics and details in an extreme way so as to get the broader receptive fields and obtain the ability to perceive extreme large- or small-scale objects. Then, the bilateral features generated by the proposed BES encoder can be dynamically filtered by the newly proposed DCAM. This module interactively provides spacial-wise and channel-wise dynamic attention weights for the semantic and detail branches of our BES encoder. Furthermore, we subsequently propose a Loop Compensation Strategy to boost the scale-specific features of multiple decision paths in SPD. These decision paths form a feature loop chain, which creates mutually compensating features under the supervision of boosting loss. Experiments on five benchmark datasets demonstrate that the proposed BBRF has a great advantage to cope with scale variation and can reduce the Mean Absolute Error over 20% compared with the state-of-the-art methods.

引用

页码：1026 / 1038

页数：13

共 71 条

[61] A Bi-directional Message Passing Model for Salient Object Detection
Zhang, Lu
Dai, Ju
Lu, Huchuan
He, You
Wang, Gang
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1741 - 1750
[62] Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detectionn
Zhang, Pingping
Wang, Dong
Lu, Huchuan
Wang, Hongyu
Ruan, Xiang
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 202 - 211
[63] Progressive Attention Guided Recurrent Network for Salient Object Detection
Zhang, Xiaoning
Wang, Tiantian
Qi, Jinqing
Lu, Huchuan
Wang, Gang
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 714 - 722
[64] Pyramid Scene Parsing Network
Zhao, Hengshuang
Shi, Jianping
Qi, Xiaojuan
Wang, Xiaogang
Jia, Jiaya
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6230 - 6239
[65] EGNet: Edge Guidance Network for Salient Object Detection
Zhao, Jia-Xing
Liu, Jiang-Jiang
Fan, Deng-Ping
Cao, Yang
Yang, Ju-Feng
Cheng, Ming-Ming
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8778 - 8787
[66] Pyramid Feature Attention Network for Saliency detection
Zhao, Ting
Wu, Xiangqian
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3080 - 3089
[67] Zhao X., 2020, P EUR C COMP VIS, P35
[68] RGB-D Salient Object Detection With Ubiquitous Target Awareness
Zhao, Yifan
Zhao, Jiawei
Li, Jia
Chen, Xiaowu
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7717 - 7731
[69] Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection
Zhao, Zhirui
Xia, Changqun
Xie, Chenxi
Li, Jia
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4967 - 4975
[70] Interactive Two-Stream Decoder for Accurate and Fast Saliency Detection
Zhou, Huajun
Xie, Xiaohua
Lai, Jian-Huang
Chen, Zixuan
Yang, Lingxiao
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9138 - 9147

← 1 2 3 4 5 6 7 8 →