One-stop multiscale reconciliation attention network with scribble supervision for salient object detection in optical remote sensing images

Cited by: 5

Authors:

Yan, Ruixiang [1]

Yan, Longquan [2]

Cao, Yufei [1]

Geng, Guohua [2]

Zhou, Pengbo [3]

Affiliations:

[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Peoples R China

[2] Northwest Univ, Sch Informat Sci & Technol, Xian 710119, Peoples R China

[3] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China

Source:

APPLIED INTELLIGENCE | 2024 / Vol. 54 / No. 05

Funding:

National Natural Science Foundation of China;

Keywords:

Salient object detection; Optical remote sensing images; Weakly supervised learning; Scribble supervision; Attention mechanism;

DOI:

10.1007/s10489-024-05359-4

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Salient object detection in optical remote sensing images (RSI-SOD) faces significant challenges due to the unique characteristics of RSI imaging. Existing methods heavily rely on labor-intensive pixel-level annotations and overlook the potential of low-cost sparse annotations. Moreover, weakly supervised RSI-SOD methods introduce multiple sparse annotations and training processes, leading to a multistage SOD task and considerable performance gaps compared to fully supervised approaches. To address these issues, we propose a one-stop end-to-end RSI-SOD method that relies solely on scribble annotations. Our framework, named the one-stop multiscale reconciliation attention network (OMRA-Net), features encoding, reconciliation, polishing, and convergence layers for effective feature extraction, reconciliation, polishing, and object structure restoration. Evaluation on publicly available datasets demonstrates that OMRA-Net outperforms existing weakly supervised and unsupervised SOD methods, achieving comparable or superior performance to fully supervised models. Ablation studies further validate the effectiveness of our proposed model design.


Pages: 3737-3755

Page count: 19
