Semantic Assistance in SAR Object Detection: A Mask-Guided Approach

被引:0
|
作者
Liu, Wei [1 ]
Zhou, Lifan [1 ]
Zhong, Shan [1 ]
Gong, Shengrong [1 ]
机构
[1] Changshu Inst Technol, Suzhou 215500, Peoples R China
基金
中国国家自然科学基金;
关键词
DEtection TRansformer (DETR); object detection; segment anything model (SAM); synthetic aperture radar (SAR); PYRAMID NETWORK; FOCAL LOSS;
D O I
10.1109/JSTARS.2024.3481368
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The unique challenge in SAR object detection is the strong speckle noise inherent in SAR imagery. Existing learning-based works mainly focus on architectural enhancements, and fail to consider the valuable semantic information that can mitigate the effects of speckle noise. Large pretrained segment anything model (SAM) is a powerful foundational model with general semantic knowledge. However, SAM is not fully exploited for SAR object detection. This study paves the way for applying SAM for SAR object detection. Rather than fine-tuning the SAM network, we propose three mask-guided learning strategies by simply utilizing the semantic masks generated by SAM. Built upon the advanced RealTime DEtection TRansformer (RT-DETR) framework, the Semantic Assisted DETR, deemed as SA-DETR, integrates prior semantics from SAM into the SAR detection task. To be specific, first, we propose the mask-guided feature denoising module in the encoder stage, to enhance the network's discrimination of positives and negatives. Second, we propose the mask-guided query selection for initial query generation, which is beneficial for the decoder refinement. Finally, the mask-guided instance segmentation is proposed to achieve more accurate localization. To validate the superiority of the proposed SA-DETR, extensive experiments are conducted on two benchmark datasets, i.e., the SAR ship detection dataset (SSDD) and the recently published COCO-level large-scale multiclass SAR object detection dataset (SARDet-100K). Experimental results on both datasets outperform previous advanced detectors, achieving a new state-of-the-art with 99.0 $AP_{50}$ and 88.4 $mAP_{50}$ on SSDD and SARDet-100 K, respectively.
引用
收藏
页码:19395 / 19407
页数:13
相关论文
共 50 条
  • [11] Distilling object detectors with efficient logit mimicking and mask-guided feature imitation
    Lu, Xin
    Cao, Yichao
    Chen, Shikun
    Li, Weixuan
    Zhou, Xin
    Lu, Xiaobo
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [12] Distilling object detectors with mask-guided feature and relation-based knowledge
    Zeng, Liang
    Ma, Liyan
    Luo, Xiangfeng
    Guo, Yinsai
    Chen, Xue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2024, 27 (02) : 195 - 203
  • [13] Mask-Guided Joint Single Image Specular Highlight Detection and Removal
    Chen, Hao
    Li, Li
    Yu, Neng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 457 - 468
  • [14] A Feature Prefusion and Mask-Guided Network for Camera Decoration Defect Detection
    Wang, Hui
    Zhao, Yuqian
    Zhang, Fan
    Gui, Gui
    Luo, Qiwu
    Yang, Chunhua
    Gui, Weihua
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [15] Mask-guided network for image captioning
    Lim, Jian Han
    Chan, Chee Seng
    PATTERN RECOGNITION LETTERS, 2023, 173 : 79 - 86
  • [16] Mask-guided modality difference reduction network for RGB-T semantic segmentation
    Liang, Wenli
    Yang, Yuanjian
    Li, Fangyu
    Long, Xi
    Shan, Caifeng
    NEUROCOMPUTING, 2023, 523 : 9 - 17
  • [17] An optimized mask-guided mobile pedestrian detection network with millisecond scale
    Bai, Qiong
    Xin, Jinming
    Yan, Ming
    Wang, Yu
    Li, Erpeng
    Zhao, Sanjun
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4975 - 4980
  • [18] SMG-Diff: Adversarial Attack Method Based on Semantic Mask-Guided Diffusion
    Zhang, Yongliang
    Liu, Jing
    MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 44 - 57
  • [19] GENERATING FUTURE FRAMES WITH MASK-GUIDED PREDICTION
    Wu, Qian
    Chen, Xiongtao
    Huang, Zhongyi
    Wang, Wenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [20] Mask-guided Image Classification with Siamese Networks
    Alqasir, Hiba
    Muselet, Damien
    Ducottet, Christophe
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 536 - 543