Semantic Assistance in SAR Object Detection: A Mask-Guided Approach

Cited by: 0
Authors
Liu, Wei [1]
Zhou, Lifan [1]
Zhong, Shan [1]
Gong, Shengrong [1]
Affiliation
[1] Changshu Institute of Technology, Suzhou 215500, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
DEtection TRansformer (DETR); object detection; segment anything model (SAM); synthetic aperture radar (SAR); PYRAMID NETWORK; FOCAL LOSS
DOI
10.1109/JSTARS.2024.3481368
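Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract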
A distinctive challenge in SAR object detection is the strong speckle noise inherent in SAR imagery. Existing learning-based works mainly focus on architectural enhancements and fail to consider the valuable semantic information that can mitigate the effects of speckle noise. The large pretrained segment anything model (SAM) is a powerful foundation model with general semantic knowledge. However, SAM has not been fully exploited for SAR object detection. This study paves the way for applying SAM to SAR object detection. Rather than fine-tuning the SAM network, we propose three mask-guided learning strategies that simply utilize the semantic masks generated by SAM. Built upon the advanced Real-Time DEtection TRansformer (RT-DETR) framework, the Semantic Assisted DETR, termed SA-DETR, integrates prior semantics from SAM into the SAR detection task. Specifically, first, we propose a mask-guided feature denoising module in the encoder stage to enhance the network's discrimination between positives and negatives. Second, we propose mask-guided query selection for initial query generation, which benefits decoder refinement. Finally, mask-guided instance segmentation is proposed to achieve more accurate localization. To validate the proposed SA-DETR, extensive experiments are conducted on two benchmark datasets, i.e., the SAR ship detection dataset (SSDD) and the recently published COCO-level large-scale multiclass SAR object detection dataset (SARDet-100K). SA-DETR outperforms previous advanced detectors on both datasets, achieving a new state of the art with 99.0 $AP_{50}$ on SSDD and 88.4 $mAP_{50}$ on SARDet-100K.
Pages: 19395-19407
Page count: 13