Oriented Object Detection in Aerial Images Based on the Scaled Smooth L1 Loss Function

被引:6
作者
Wei, Linhai [1 ]
Zheng, Chen [2 ]
Hu, Yijun [1 ]
机构
[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Peoples R China
[2] Henan Univ, Sch Math & Stat, Kaifeng 475001, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolution network; loss function; remote sensing image; aerial image; DETECTION FRAMEWORK; VEHICLE DETECTION;
D O I
10.3390/rs15051350
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Although many state-of-the-art object detectors have been developed, detecting small and densely packed objects with complicated orientations in remote sensing aerial images remains challenging. For object detection in remote sensing aerial images, different scales, sizes, appearances, and orientations of objects from different categories could most likely enlarge the variance in the detection error. Undoubtedly, the variance in the detection error should have a non-negligible impact on the detection performance. Motivated by the above consideration, in this paper, we tackled this issue, so that we could improve the detection performance and reduce the impact of this variance on the detection performance as much as possible. By proposing a scaled smooth L1 loss function, we developed a new two-stage object detector for remote sensing aerial images, named Faster R-CNN-NeXt with RoI-Transformer. The proposed scaled smooth L1 loss function is used for bounding box regression and makes regression invariant to scale. This property ensures that the bounding box regression is more reliable in detecting small and densely packed objects with complicated orientations and backgrounds, leading to improved detection performance. To learn rotated bounding boxes and produce more accurate object locations, a RoI-Transformer module is employed. This is necessary because horizontal bounding boxes are inadequate for aerial image detection. The ResNeXt backbone is also adopted for the proposed object detector. Experimental results on two popular datasets, DOTA and HRSC2016, show that the variance in the detection error significantly affects detection performance. The proposed object detector is effective and robust, with the optimal scale factor for the scaled smooth L1 loss function being around 2.0. Compared to other promising two-stage oriented methods, our method achieves a mAP of 70.82 on DOTA, with an improvement of at least 1.26 and up to 16.49. On HRSC2016, our method achieves an mAP of 87.1, with an improvement of at least 0.9 and up to 1.4.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Arbitrary-Oriented Object Detection in Remote Sensing Images Based on Polar Coordinates
    Zhou, Lin
    Wei, Haoran
    Li, Hao
    Zhao, Wenzhe
    Zhang, Yi
    Zhang, Yue
    IEEE ACCESS, 2020, 8 (08): : 223373 - 223384
  • [42] Sinextnet: A New Small Object Detection Model for Aerial Images Based on PP-Yoloe
    Zhang, Wenkang
    Hong, Zhiyong
    Xiong, Liping
    Zeng, Zhiqiang
    Cai, Zhishun
    Tan, Kunyu
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2024, 14 (03) : 251 - 265
  • [43] Probability Differential-Based Class Label Noise Purification for Object Detection in Aerial Images
    Hu, Zibo
    Gao, Kun
    Zhang, Xiaodian
    Wang, Junwei
    Wang, Hong
    Han, Jiawei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [44] Rotated-DETR: an End-to-End Transformer-based Oriented Object Detector for Aerial Images
    Kim, Jinbeom
    Lee, Giljun
    Kim, Taejune
    Woo, Simon S.
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1248 - 1255
  • [45] Universal Synchronization Loss Optimization in DETR-Based Oriented and Rotated Object Detection
    Liu, Gang
    Yu, Qingchen
    Gong, Mingming
    Yang, Hao
    IEEE ACCESS, 2025, 13 : 45669 - 45681
  • [46] Small Object Detection in Hyperspectral Images Based on Radial Basis Activation Function
    Wang Bofan
    Zhao Haitao
    ACTA OPTICA SINICA, 2021, 41 (23)
  • [47] Rotated Object Detection of Remote Sensing Image Based on Binary Smooth Encoding and Ellipse-Like Focus Loss
    Geng, Jie
    Xu, Zhe
    Zhao, Zihao
    Jiang, Wen
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [48] Aerial images object detection method based on cross-scale multi- feature fusion
    Pan, Yang
    Yang, Jinhua
    Zhu, Lei
    Yao, Lina
    Zhang, Bo
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 16148 - 16168
  • [49] Multi-Stream Fusion Network With Generalized Smooth L1 Loss for Single Image Dehazing
    Zhu, Xinshan
    Li, Shuoshi
    Gan, Yongdong
    Zhang, Yun
    Sun, Biao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7620 - 7635
  • [50] Maritime Small Object Detection Algorithm in Drone Aerial Images Based on Improved YOLOv8
    Ling, Peng
    Zhang, Yihong
    Ma, Shuai
    IEEE ACCESS, 2024, 12 : 176527 - 176538