Semantic-Guided Attention Refinement Network for Salient Object Detection in Optical Remote Sensing Images

Cited by: 68

Authors:

Huang, Zhou [1]

Chen, Huaixin [1]

Liu, Biyuan [1]

Wang, Zhixi [2]

Affiliations:

[1] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China

[2] Truly Optoelect Co Ltd, Novel Prod R&D Dept, Shanwei 516600, Peoples R China

Source:

REMOTE SENSING | 2021 / Volume 13 / Issue 11

Keywords:

salient object detection; semantic guidance integration; attention fusion; multi-scale object analysis; edge refinement; optical remote sensing image; AIRPORT DETECTION; REGION DETECTION; VISUAL SALIENCY; MODEL;

DOI:

10.3390/rs13112163

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Although salient object detection (SOD) in natural scene images (NSI) has made remarkable progress, SOD in optical remote sensing images (RSI) remains challenging because of varying spatial resolutions, cluttered backgrounds, and complex imaging conditions. Two difficulties stand out: (1) accurately locating salient objects; and (2) delineating their subtle boundaries. This paper explores the inherent properties of multi-level features to develop a novel semantic-guided attention refinement network (SARNet) for SOD in RSI. Specifically, the proposed semantic guided decoder (SGD) coarsely but accurately locates multi-scale objects by aggregating multiple high-level features; this global semantic information then guides the integration of subsequent features in a step-by-step feedback manner, making full use of deep multi-level features. Simultaneously, the proposed parallel attention fusion (PAF) module combines cross-level features with semantic-guided information to refine object boundaries and gradually highlight the entire object area. Finally, the network is trained end to end in a fully supervised manner. Quantitative and qualitative evaluations on two public RSI datasets and additional NSI datasets, across five metrics, show that SARNet outperforms 14 state-of-the-art (SOTA) methods without any post-processing.
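The abstract reports results across five metrics but does not name them. As a rough illustration of how a predicted saliency map is typically scored against a ground-truth mask, the sketch below computes two measures widely used in the SOD literature: mean absolute error (MAE) and a thresholded F-measure with the conventional weighting beta^2 = 0.3. The choice of these two metrics, the function names, the fixed threshold of 0.5, and the toy 2x2 maps are all assumptions made for this example; they are not taken from the paper.

```python
from typing import List

Map = List[List[float]]  # 2-D saliency map, values assumed to lie in [0, 1]


def mean_absolute_error(pred: Map, gt: Map) -> float:
    """Average per-pixel absolute difference between prediction and ground truth."""
    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            total += abs(p - g)
            count += 1
    return total / count


def f_measure(pred: Map, gt: Map, threshold: float = 0.5, beta_sq: float = 0.3) -> float:
    """F-measure after binarizing the prediction at a fixed threshold (an assumption)."""
    tp = fp = fn = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            predicted = p >= threshold
            actual = g >= 0.5
            if predicted and actual:
                tp += 1
            elif predicted:
                fp += 1
            elif actual:
                fn += 1
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Toy example: one salient pixel, one false positive in the prediction.
pred = [[0.9, 0.2], [0.6, 0.1]]
gt = [[1.0, 0.0], [0.0, 0.0]]
mae = mean_absolute_error(pred, gt)
fm = f_measure(pred, gt)
```

Lower MAE and higher F-measure indicate a better match; published evaluations usually sweep the threshold or use an adaptive one rather than fixing it at 0.5.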

Pages: 19

References: 66 in total
