Oriented Object Detection in Aerial Images Based on the Scaled Smooth L1 Loss Function

被引:6
|
作者
Wei, Linhai [1 ]
Zheng, Chen [2 ]
Hu, Yijun [1 ]
机构
[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Peoples R China
[2] Henan Univ, Sch Math & Stat, Kaifeng 475001, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolution network; loss function; remote sensing image; aerial image; DETECTION FRAMEWORK; VEHICLE DETECTION;
D O I
10.3390/rs15051350
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Although many state-of-the-art object detectors have been developed, detecting small and densely packed objects with complicated orientations in remote sensing aerial images remains challenging. For object detection in remote sensing aerial images, different scales, sizes, appearances, and orientations of objects from different categories could most likely enlarge the variance in the detection error. Undoubtedly, the variance in the detection error should have a non-negligible impact on the detection performance. Motivated by the above consideration, in this paper, we tackled this issue, so that we could improve the detection performance and reduce the impact of this variance on the detection performance as much as possible. By proposing a scaled smooth L1 loss function, we developed a new two-stage object detector for remote sensing aerial images, named Faster R-CNN-NeXt with RoI-Transformer. The proposed scaled smooth L1 loss function is used for bounding box regression and makes regression invariant to scale. This property ensures that the bounding box regression is more reliable in detecting small and densely packed objects with complicated orientations and backgrounds, leading to improved detection performance. To learn rotated bounding boxes and produce more accurate object locations, a RoI-Transformer module is employed. This is necessary because horizontal bounding boxes are inadequate for aerial image detection. The ResNeXt backbone is also adopted for the proposed object detector. Experimental results on two popular datasets, DOTA and HRSC2016, show that the variance in the detection error significantly affects detection performance. The proposed object detector is effective and robust, with the optimal scale factor for the scaled smooth L1 loss function being around 2.0. Compared to other promising two-stage oriented methods, our method achieves a mAP of 70.82 on DOTA, with an improvement of at least 1.26 and up to 16.49. On HRSC2016, our method achieves an mAP of 87.1, with an improvement of at least 0.9 and up to 1.4.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Adaptive Saliency Biased Loss for Object Detection in Aerial Images
    Sun, Peng
    Chen, Guang
    Shang, Yi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (10): : 7154 - 7165
  • [2] Object Detection in Aerial Images Based on Cascaded CNN
    Zhang, Wei
    Li, Jiaojie
    Qi, Shengxiang
    2018 INTERNATIONAL CONFERENCE ON SENSOR NETWORKS AND SIGNAL PROCESSING (SNSP 2018), 2018, : 434 - 439
  • [3] Sparse Label Assignment for Oriented Object Detection in Aerial Images
    Ming, Qi
    Miao, Lingjuan
    Zhou, Zhiqiang
    Song, Junjie
    Yang, Xue
    REMOTE SENSING, 2021, 13 (14)
  • [4] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [5] Multi-Oriented Object Detection in Aerial Images With Double Horizontal Rectangles
    Nie, Guangtao
    Huang, Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4932 - 4944
  • [6] Deep ensemble based object detection from aerial images
    Park J.-C.
    Son S.-B.
    Lee S.-H.
    Jung J.-U.
    Park Y.-J.
    Oh H.-S.
    Journal of Institute of Control, Robotics and Systems, 2021, 27 (12) : 944 - 952
  • [7] EFN: Field-Based Object Detection for Aerial Images
    Liu, Jin
    Zheng, Haokun
    REMOTE SENSING, 2020, 12 (21) : 1 - 16
  • [8] GSDet: Object Detection in Aerial Images Based on Scale Reasoning
    Li, Wei
    Wei, Wei
    Zhang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4599 - 4609
  • [9] THE ELLIPTIC ENERGY LOSS FOR ROTATED OBJECT DETECTION IN AERIAL IMAGES
    Zhang, Cong
    Luo, Kunming
    Meng, Fanman
    Wu, Qingbo
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3384 - 3388
  • [10] Multi-scale oriented object detection in aerial images based on convolutional neural networks with global attention
    Fei, Jingjing
    Wang, Zhicheng
    Yu, Zhaohui
    Gu, Xi
    Wei, Gang
    MIPPR 2019: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2020, 11432