One-stop multiscale reconciliation attention network with scribble supervision for salient object detection in optical remote sensing images

被引:5
作者
Yan, Ruixiang [1 ]
Yan, Longquan [2 ]
Cao, Yufei [1 ]
Geng, Guohua [2 ]
Zhou, Pengbo [3 ]
机构
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Peoples R China
[2] Northwest Univ, Sch Informat Sci & Technol, Xian 710119, Peoples R China
[3] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
Salient object detection; Optical remote sensing images; Weakly supervised learning; Scribble supervision; Attention mechanism;
D O I
10.1007/s10489-024-05359-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection in optical remote sensing images (RSI-SOD) faces significant challenges due to the unique characteristics of RSI imaging. Existing methods heavily rely on labor-intensive pixel-level annotations and overlook the potential of low-cost sparse annotations. Moreover, weakly supervised RSI-SOD methods introduce multiple sparse annotations and training processes, leading to a multistaged SOD task and considerable performance gaps compared to fully supervised approaches. To address these issues, we propose a one-stop end-to-end RSI-SOD method that solely relies on scribble annotations. Our framework, named the one-stop multiscale reconciliation attention network (OMRA-Net), features encoding, reconciliation, polishing, and convergence layers for effective feature extraction, reconciliation, polishing, and object structure restoration. Evaluation on publicly available datasets demonstrates that OMRA-Net outperforms existing weakly supervised and unsupervised SOD methods, achieving comparable or superior performance to fully supervised models. Ablation studies further validate the effectiveness of our proposed model design.
引用
收藏
页码:3737 / 3755
页数:19
相关论文
共 67 条
[31]  
Perazzi F, 2012, PROC CVPR IEEE, P733, DOI 10.1109/CVPR.2012.6247743
[32]   U2-Net: Going deeper with nested U-structure for salient object detection [J].
Qin, Xuebin ;
Zhang, Zichen ;
Huang, Chenyang ;
Dehghan, Masood ;
Zaiane, Osmar R. ;
Jagersand, Martin .
PATTERN RECOGNITION, 2020, 106
[33]  
Qiu Yajun, 2023, IEEE T CIRCUITS SYST
[34]   Fully Squeezed Multiscale Inference Network for Fast and Accurate Saliency Detection in Optical Remote-Sensing Images [J].
Shen, Kunye ;
Zhou, Xiaofei ;
Wan, Bin ;
Shi, Ran ;
Zhang, Jiyong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[35]  
Song D, 2023, IEEE T GEOSCI ELECT
[36]   ORSI Salient Object Detection via Multiscale Joint Region and Boundary Model [J].
Tu, Zhengzheng ;
Wang, Chao ;
Li, Chenglong ;
Fan, Minghao ;
Zhao, Haifeng ;
Luo, Bin .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[37]   Learning to Detect Salient Objects with Image-level Supervision [J].
Wang, Lijun ;
Lu, Huchuan ;
Wang, Yifan ;
Feng, Mengyang ;
Wang, Dong ;
Yin, Baocai ;
Ruan, Xiang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3796-3805
[38]   Salient Object Detection in the Deep Learning Era: An In-Depth Survey [J].
Wang, Wenguan ;
Lai, Qiuxia ;
Fu, Huazhu ;
Shen, Jianbing ;
Ling, Haibin ;
Yang, Ruigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) :3239-3259
[39]   A New Potential-Field Downward Continuation Iteration Method Based on Adaptive Filtering [J].
Wang, Ze ;
Zhang, Qi ;
Chen, Dixiang ;
Liu, Zhongyan ;
Pan, Mengchun ;
Hu, Jiafei ;
Chen, Zhuo ;
Xu, Yujing ;
Wang, Zhenxiong ;
Ren, Xintian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[40]   Scribble-Based Weakly Supervised Deep Learning for Road Surface Extraction From Remote Sensing Images [J].
Wei, Yao ;
Ji, Shunping .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60