Semantic Assistance in SAR Object Detection: A Mask-Guided Approach

被引:0
|
作者
Liu, Wei [1 ]
Zhou, Lifan [1 ]
Zhong, Shan [1 ]
Gong, Shengrong [1 ]
机构
[1] Changshu Inst Technol, Suzhou 215500, Peoples R China
基金
中国国家自然科学基金;
关键词
DEtection TRansformer (DETR); object detection; segment anything model (SAM); synthetic aperture radar (SAR); PYRAMID NETWORK; FOCAL LOSS;
D O I
10.1109/JSTARS.2024.3481368
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The unique challenge in SAR object detection is the strong speckle noise inherent in SAR imagery. Existing learning-based works mainly focus on architectural enhancements, and fail to consider the valuable semantic information that can mitigate the effects of speckle noise. Large pretrained segment anything model (SAM) is a powerful foundational model with general semantic knowledge. However, SAM is not fully exploited for SAR object detection. This study paves the way for applying SAM for SAR object detection. Rather than fine-tuning the SAM network, we propose three mask-guided learning strategies by simply utilizing the semantic masks generated by SAM. Built upon the advanced RealTime DEtection TRansformer (RT-DETR) framework, the Semantic Assisted DETR, deemed as SA-DETR, integrates prior semantics from SAM into the SAR detection task. To be specific, first, we propose the mask-guided feature denoising module in the encoder stage, to enhance the network's discrimination of positives and negatives. Second, we propose the mask-guided query selection for initial query generation, which is beneficial for the decoder refinement. Finally, the mask-guided instance segmentation is proposed to achieve more accurate localization. To validate the superiority of the proposed SA-DETR, extensive experiments are conducted on two benchmark datasets, i.e., the SAR ship detection dataset (SSDD) and the recently published COCO-level large-scale multiclass SAR object detection dataset (SARDet-100K). Experimental results on both datasets outperform previous advanced detectors, achieving a new state-of-the-art with 99.0 $AP_{50}$ and 88.4 $mAP_{50}$ on SSDD and SARDet-100 K, respectively.
引用
收藏
页码:19395 / 19407
页数:13
相关论文
共 50 条
  • [31] AMGmal: Adaptive mask-guided adversarial attack against malware detection with minimal perturbation
    Zhan, Dazhi
    Duan, Yexin
    Hu, Yue
    Yin, Lujia
    Pan, Zhisong
    Guo, Shize
    COMPUTERS & SECURITY, 2023, 127
  • [32] Mask-guided image person removal with data synthesis
    Jiang, Yunliang
    Gu, Chenyang
    Xue, Zhenfeng
    Zhang, Xiongtao
    Liu, Yong
    IET IMAGE PROCESSING, 2023, 17 (07) : 2214 - 2224
  • [33] Mask-Guided Font Style Transfer in Natural Scene
    Yang, Jing
    Xia, Siyu
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6286 - 6291
  • [34] Mask-guided GAN for robust text editing in the scene
    Yu, Boxi
    Xu, Yong
    Huang, Yan
    Yang, Shuai
    Liu, Jiaying
    NEUROCOMPUTING, 2021, 441 : 192 - 201
  • [35] SAR Image Denoising and Semantic Enhancement for Object Detection
    Qu Haicheng
    Shen Lei
    ACTA PHOTONICA SINICA, 2022, 51 (04) : 321 - 335
  • [36] The Segmentation Tracker With Mask-Guided Background Suppression Strategy
    Tian, Erlin
    Lei, Yunpeng
    Sun, Junfeng
    Zhou, Keyan
    Zhou, Bin
    Li, Hanfei
    IEEE ACCESS, 2024, 12 : 124032 - 124044
  • [37] Mask-Guided Deformation Adaptive Network for Human Parsing
    Mao, Aihua
    Liang, Yuan
    Jiao, Jianbo
    Liu, Yongtuo
    He, Shengfeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [38] Mask-guided BERT for few-shot text classification
    Liao, Wenxiong
    Liu, Zhengliang
    Dai, Haixing
    Wu, Zihao
    Zhang, Yiyang
    Huang, Xiaoke
    Chen, Yuzhong
    Jiang, Xi
    Liu, David
    Zhu, Dajiang
    Li, Sheng
    Liu, Wei
    Liu, Tianming
    Li, Quanzheng
    Cai, Hongmin
    Li, Xiang
    NEUROCOMPUTING, 2024, 610
  • [39] Mask-guided noise restriction adversarial attacks for image classification
    Duan, Yexin
    Zhou, Xingyu
    Zou, Junhua
    Qiu, Junyang
    Zhang, Jin
    Pan, Zhisong
    COMPUTERS & SECURITY, 2021, 100
  • [40] MASK-GUIDED STYLE TRANSFER NETWORK FOR PURIFYING REAL IMAGES
    Zhao, Tongtong
    Yan, Yuxiao
    Peng, Jinjia
    Wang, Huibing
    Fu, Xianping
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 429 - 434