One-stop multiscale reconciliation attention network with scribble supervision for salient object detection in optical remote sensing images

被引:5
作者
Yan, Ruixiang [1 ]
Yan, Longquan [2 ]
Cao, Yufei [1 ]
Geng, Guohua [2 ]
Zhou, Pengbo [3 ]
机构
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Peoples R China
[2] Northwest Univ, Sch Informat Sci & Technol, Xian 710119, Peoples R China
[3] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
Salient object detection; Optical remote sensing images; Weakly supervised learning; Scribble supervision; Attention mechanism;
D O I
10.1007/s10489-024-05359-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection in optical remote sensing images (RSI-SOD) faces significant challenges due to the unique characteristics of RSI imaging. Existing methods heavily rely on labor-intensive pixel-level annotations and overlook the potential of low-cost sparse annotations. Moreover, weakly supervised RSI-SOD methods introduce multiple sparse annotations and training processes, leading to a multistaged SOD task and considerable performance gaps compared to fully supervised approaches. To address these issues, we propose a one-stop end-to-end RSI-SOD method that solely relies on scribble annotations. Our framework, named the one-stop multiscale reconciliation attention network (OMRA-Net), features encoding, reconciliation, polishing, and convergence layers for effective feature extraction, reconciliation, polishing, and object structure restoration. Evaluation on publicly available datasets demonstrates that OMRA-Net outperforms existing weakly supervised and unsupervised SOD methods, achieving comparable or superior performance to fully supervised models. Ablation studies further validate the effectiveness of our proposed model design.
引用
收藏
页码:3737 / 3755
页数:19
相关论文
共 67 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]   Weakly supervised pavement crack semantic segmentation based on multi-scale object localization and incremental annotation refinement [J].
Al-Huda, Zaid ;
Peng, Bo ;
Algburi, Riyadh Nazar Ali ;
Alfasly, Saghir ;
Li, Tianrui .
APPLIED INTELLIGENCE, 2023, 53 (11) :14527-14546
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
[5]  
Cheng X, 2022, IEEE T CIRCUITS SYST
[6]   Global-and-Local Collaborative Learning for Co-Salient Object Detection [J].
Cong, Runmin ;
Yang, Ning ;
Li, Chongyi ;
Fu, Huazhu ;
Zhao, Yao ;
Huang, Qingming ;
Kwong, Sam .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) :1920-1931
[7]  
Fan D.D., 2018, arXiv
[8]   Structure-measure: A New Way to Evaluate Foreground Maps [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Yun ;
Li, Tao ;
Borji, Ali .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567
[9]  
Feng Dejun, 2023, ARXIV
[10]  
Gong A, 2023, IEEE T CIRCUITS SYST