RS-DETR: An Improved Remote Sensing Object Detection Model Based on RT-DETR

被引:1
|
作者
Zhang, Hao [1 ]
Ma, Zheng [2 ]
Li, Xiang [1 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Informatizat Off, Wuhan 430074, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 22期
关键词
cascaded group attention; Focaler-GIoU; EBiFPN; object detection;
D O I
10.3390/app142210331
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Object detection is a fundamental task in computer vision. Recently, deep-learning-based object detection has made significant progress. However, due to large variations in target scale, the predominance of small targets, and complex backgrounds in remote sensing imagery, remote sensing object detection still faces challenges, including low detection accuracy, poor real-time performance, high missed detection rates, and high false detection rates in practical applications. To enhance remote sensing target detection performance, this study proposes a new model, the remote sensing detection transformer (RS-DETR). First, we incorporate cascaded group attention (CGA) into the attention-driven feature interaction module. By capturing features at different levels, it enhances the interaction between features through cascading and improves computational efficiency. Additionally, we propose an enhanced bidirectional feature pyramid network (EBiFPN) to facilitate multi-scale feature fusion. By integrating features across multiple scales, it improves object detection accuracy and robustness. Finally, we propose a novel bounding box regression loss function, Focaler-GIoU, which makes the model focus more on difficult samples, improving detection performance for small and overlapping targets. Experimental results on the satellite imagery multi-vehicles dataset (SIMD) and the high-resolution remote sensing object detection (TGRS-HRRSD) dataset show that the improved algorithm achieved mean average precision (mAP) of 78.2% and 91.6% at an intersection over union threshold of 0.5, respectively, which is an improvement of 2.0% and 1.5% over the baseline model. This result demonstrates the effectiveness and robustness of our proposed method for remote sensing image object detection.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] MSRT-DETR: A novel RT-DETR model with multi-scale feature sequence for cell detection
    Zhou, Chuncheng
    He, Haiyang
    Zhou, Hao
    Ge, Fuhua
    Yu, Pengfei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 103
  • [22] DST-DETR: Image Dehazing RT-DETR for Safety Helmet Detection in Foggy Weather
    Liu, Ziyuan
    Sun, Chunxia
    Wang, Xiaopeng
    SENSORS, 2024, 24 (14)
  • [23] An Efficient Real-time Metal Crack Detection Model Based on RT-DETR
    Cheng, Zhang
    Yang, Fan
    Deng, Shusen
    Chen, Fenglin
    Huo, Junzhou
    2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 220 - 224
  • [24] DETR-ORD: An Improved DETR Detector for Oriented Remote Sensing Object Detection with Feature Reconstruction and Dynamic Query
    He, Xiaohai
    Liang, Kaiwen
    Zhang, Weimin
    Li, Fangxing
    Jiang, Zhou
    Zuo, Zhengqing
    Tan, Xinyan
    REMOTE SENSING, 2024, 16 (18)
  • [25] Eye Landmarks Detection using RT-DETR with Rules
    Boonnithititikul, Chatree
    Jaknamon, Teetouch
    Chawuthai, Rathachai
    2024 21ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, ECTI-CON 2024, 2024,
  • [26] An Improved DETR Based on Angle Denoising and Oriented Boxes Refinement for Remote Sensing Object Detection
    Wang, Hongmei
    Li, Chenkai
    Wu, Qiaorong
    Wang, Jingyu
    REMOTE SENSING, 2024, 16 (23)
  • [27] Detecting surface defects of pine wood using an improved RT-DETR model
    Hu J.
    Zhang G.
    Shen M.
    Li W.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2024, 40 (07): : 210 - 218
  • [28] OHEH-RTDETR: an improved RT-DETR detection model based on frequency layered processing and advanced feature selection
    Wang, Haochun
    Zhang, Yungui
    Wu, Weihang
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (04)
  • [29] FedsNet: the real-time network for pedestrian detection based on RT-DETR
    Peng, Hao
    Chen, Shiqiang
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (04)
  • [30] A Comparative Analysis of RT-DETR and YOLOv8 for Urban Zone Aerial Object Detection
    Jun, Eddy Lai Thin
    Tham, Mau-Luen
    Kwan, Ban-Hoe
    2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024, 2024, : 340 - 345