SAM-Induced Pseudo Fully Supervised Learning for Weakly Supervised Object Detection in Remote Sensing Images

被引:2
作者
Qian, Xiaoliang [1 ]
Lin, Chenyang [1 ]
Chen, Zhiwu [1 ]
Wang, Wei [1 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Elect & Informat Engn, Zhengzhou 450002, Peoples R China
基金
中国国家自然科学基金;
关键词
SAM-induced seed instance mining (SSIM); SAM-based pseudo-ground truth mining (SPGTM); pseudo-fully supervised training; weakly supervised object detection (WSOD); remote sensing image (RSI);
D O I
10.3390/rs16091532
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Weakly supervised object detection (WSOD) in remote sensing images (RSIs) aims to detect high-value targets by solely utilizing image-level category labels; however, two problems have not been well addressed by existing methods. Firstly, the seed instances (SIs) are mined solely relying on the category score (CS) of each proposal, which is inclined to concentrate on the most salient parts of the object; furthermore, they are unreliable because the robustness of the CS is not sufficient due to the fact that the inter-category similarity and intra-category diversity are more serious in RSIs. Secondly, the localization accuracy is limited by the proposals generated by the selective search or edge box algorithm. To address the first problem, a segment anything model (SAM)-induced seed instance-mining (SSIM) module is proposed, which mines the SIs according to the object quality score, which indicates the comprehensive characteristic of the category and the completeness of the object. To handle the second problem, a SAM-based pseudo-ground truth-mining (SPGTM) module is proposed to mine the pseudo-ground truth (PGT) instances, for which the localization is more accurate than traditional proposals by fully making use of the advantages of SAM, and the object-detection heads are trained by the PGT instances in a fully supervised manner. The ablation studies show the effectiveness of the SSIM and SPGTM modules. Comprehensive comparisons with 15 WSOD methods demonstrate the superiority of our method on two RSI datasets.
引用
收藏
页数:19
相关论文
共 70 条
  • [1] Weakly Supervised Deep Detection Networks
    Bilen, Hakan
    Vedaldi, Andrea
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2846 - 2854
  • [2] Online Progressive Instance-Balanced Sampling for Weakly Supervised Vibration Damper Detection
    Chen, Minghao
    Tian, Yunong
    Li, Zhishuo
    Li, En
    Liang, Zize
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [3] GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification
    Chen, Weitao
    Ouyang, Shubing
    Tong, Wei
    Li, Xianju
    Zheng, Xiongwei
    Wang, Lizhe
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 1150 - 1162
  • [4] Chen Z, 2020, PROC CVPR IEEE, P12992, DOI 10.1109/CVPR42600.2020.01301
  • [5] SFRNet: Fine-Grained Oriented Object Recognition via Separate Feature Refinement
    Cheng, Gong
    Li, Qingyang
    Wang, Guangxing
    Xie, Xingxing
    Min, Lingtong
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] Self-Guided Proposal Generation for Weakly Supervised Object Detection
    Cheng, Gong
    Xie, Xuan
    Chen, Weining
    Feng, Xiaoxu
    Yao, Xiwen
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
  • [8] Cheng X, 2024, IEEE T INSTRUM MEAS, V73, DOI [10.1109/TIM.2024.3373087, 10.1109/TIM.2023.3330225]
  • [9] Two-Stream Isolation Forest Based on Deep Features for Hyperspectral Anomaly Detection
    Cheng, Xi
    Zhang, Min
    Lin, Sheng
    Zhou, Kexue
    Zhao, Shaobo
    Wang, Hai
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [10] Weakly Supervised Localization and Learning with Generic Knowledge
    Deselaers, Thomas
    Alexe, Bogdan
    Ferrari, Vittorio
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (03) : 275 - 293