CCDN-DETR: A Detection Transformer Based on Constrained Contrast Denoising for Multi-Class Synthetic Aperture Radar Object Detection

被引:6
|
作者
Zhang, Lei [1 ]
Zheng, Jiachun [1 ]
Li, Chaopeng [1 ]
Xu, Zhiping [1 ]
Yang, Jiawen [1 ]
Wei, Qiuxin [2 ]
Wu, Xinyi [1 ]
机构
[1] Jimei Univ, Sch Ocean Informat Engn, Xiamen 361021, Peoples R China
[2] Fujlan Elect Port Co Ltd, Xiamen 361000, Peoples R China
关键词
detection transformer; SAR; object detection; deep learning;
D O I
10.3390/s24061793
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The effectiveness of the SAR object detection technique based on Convolutional Neural Networks (CNNs) has been widely proven, and it is increasingly used in the recognition of ship targets. Recently, efforts have been made to integrate transformer structures into SAR detectors to achieve improved target localization. However, existing methods rarely design the transformer itself as a detector, failing to fully leverage the long-range modeling advantages of self-attention. Furthermore, there has been limited research into multi-class SAR target detection. To address these limitations, this study proposes a SAR detector named CCDN-DETR, which builds upon the framework of the detection transformer (DETR). To adapt to the multiscale characteristics of SAR data, cross-scale encoders were introduced to facilitate comprehensive information modeling and fusion across different scales. Simultaneously, we optimized the query selection scheme for the input decoder layers, employing IOU loss to assist in initializing object queries more effectively. Additionally, we introduced constrained contrastive denoising training at the decoder layers to enhance the model's convergence speed and improve the detection of different categories of SAR targets. In the benchmark evaluation on a joint dataset composed of SSDD, HRSID, and SAR-AIRcraft datasets, CCDN-DETR achieves a mean Average Precision (mAP) of 91.9%. Furthermore, it demonstrates significant competitiveness with 83.7% mAP on the multi-class MSAR dataset compared to CNN-based models.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Scalable Multi-class Object Detection
    Razavi, Nima
    Gall, Juergen
    Van Gool, Luc
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1505 - 1512
  • [2] FOREIGN OBJECT DETECTION BASED ON CIRCULAR BISTATIC SYNTHETIC APERTURE RADAR
    Mohammadpoor, Mojtaba
    Abdullah, Raja S. A.
    Ismail, Alyani
    Abas, Ahmad F.
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2013, 134 : 301 - 322
  • [3] Vision Transformer Based Multi-class Lesion Detection in IVOCT
    Wang, Zixuan
    Shao, Yifan
    Sun, Jingyi
    Huang, Zhili
    Wang, Su
    Li, Qiyong
    Li, Jinsong
    Yu, Qian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 327 - 336
  • [4] The object detection efficiency in synthetic aperture radar systems
    Chernoyarov O.V.
    Dobrucky B.
    Ivanov V.A.
    Faulgaber A.N.
    International Journal of Engineering, Transactions B: Applications, 2020, 33 (02): : 337 - 343
  • [5] The Object Detection Efficiency in Synthetic Aperture Radar Systems
    Chernoyarov, O., V
    Dobrucky, B.
    Ivanov, V. A.
    Faulgaber, A. N.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2020, 33 (02): : 337 - 343
  • [6] Segmentation-based multi-class semantic object detection
    Vieux, Remi
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Braquelaire, Achille
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
  • [7] Improving multi-class Boosting-based object detection
    Miguel Buenaposada, Jose
    Baumela, Luis
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2021, 28 (01) : 81 - 96
  • [8] Segmentation-based multi-class semantic object detection
    Remi Vieux
    Jenny Benois-Pineau
    Jean-Philippe Domenger
    Achille Braquelaire
    Multimedia Tools and Applications, 2012, 60 : 305 - 326
  • [9] Joint Learning for Multi-class Object Detection
    Fard, Hamidreza Odabai
    Chaouch, Mohamed
    Quoc-cuong Pham
    Vacavant, Antoine
    Chateau, Thierry
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 104 - 112
  • [10] Layered Object Detection for Multi-Class Segmentation
    Yang, Yi
    Hallman, Sam
    Ramanan, Deva
    Fowlkes, Charless
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3113 - 3120