Oriented Object Detection in Aerial Images Based on the Scaled Smooth L1 Loss Function

被引:6
作者
Wei, Linhai [1 ]
Zheng, Chen [2 ]
Hu, Yijun [1 ]
机构
[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Peoples R China
[2] Henan Univ, Sch Math & Stat, Kaifeng 475001, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolution network; loss function; remote sensing image; aerial image; DETECTION FRAMEWORK; VEHICLE DETECTION;
D O I
10.3390/rs15051350
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Although many state-of-the-art object detectors have been developed, detecting small and densely packed objects with complicated orientations in remote sensing aerial images remains challenging. For object detection in remote sensing aerial images, different scales, sizes, appearances, and orientations of objects from different categories could most likely enlarge the variance in the detection error. Undoubtedly, the variance in the detection error should have a non-negligible impact on the detection performance. Motivated by the above consideration, in this paper, we tackled this issue, so that we could improve the detection performance and reduce the impact of this variance on the detection performance as much as possible. By proposing a scaled smooth L1 loss function, we developed a new two-stage object detector for remote sensing aerial images, named Faster R-CNN-NeXt with RoI-Transformer. The proposed scaled smooth L1 loss function is used for bounding box regression and makes regression invariant to scale. This property ensures that the bounding box regression is more reliable in detecting small and densely packed objects with complicated orientations and backgrounds, leading to improved detection performance. To learn rotated bounding boxes and produce more accurate object locations, a RoI-Transformer module is employed. This is necessary because horizontal bounding boxes are inadequate for aerial image detection. The ResNeXt backbone is also adopted for the proposed object detector. Experimental results on two popular datasets, DOTA and HRSC2016, show that the variance in the detection error significantly affects detection performance. The proposed object detector is effective and robust, with the optimal scale factor for the scaled smooth L1 loss function being around 2.0. Compared to other promising two-stage oriented methods, our method achieves a mAP of 70.82 on DOTA, with an improvement of at least 1.26 and up to 16.49. On HRSC2016, our method achieves an mAP of 87.1, with an improvement of at least 0.9 and up to 1.4.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Transformer-based End-to-End Object Detection in Aerial Images
    Vo, Nguyen D.
    Le, Nguyen
    Ngo, Giang
    Doan, Du
    Le, Do
    Nguyen, Khang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 1072 - 1079
  • [32] Lightweight small object detection network for aerial images based on cross-attention and information injection
    Dong Wang
    Junnan Liu
    Shengyi Jin
    Journal of Real-Time Image Processing, 2025, 22 (3)
  • [33] Object detection method based on CIoU improved bounding box loss function
    Liu Xiong-biao
    Yang Xian-zhao
    Chen Yang
    Zhao Shuai-tong
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (05) : 656 - 665
  • [34] Gaussian Focal Loss: Learning Distribution Polarized Angle Prediction for Rotated Object Detection in Aerial Images
    Wang, Jian
    Li, Fan
    Bi, Haixia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [35] A Lightweight Keypoint-Based Oriented Object Detection of Remote Sensing Images
    Li, Yangyang
    Mao, Heting
    Liu, Ruijiao
    Pei, Xuan
    Jiao, Licheng
    Shang, Ronghua
    REMOTE SENSING, 2021, 13 (13)
  • [36] A Self-Adaptive Object Detection Network for Aerial Images Based on Feature Enhancement
    Zhao, Ming
    Zhao, Kai
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [37] Improved YOLOv7-Tiny for Object Detection Based on UAV Aerial Images
    Zhang, Zitong
    Xie, Xiaolan
    Guo, Qiang
    Xu, Jinfan
    ELECTRONICS, 2024, 13 (15)
  • [38] Improved Faster RCNN Based on Feature Amplification and Oversampling Data Augmentation for Oriented Vehicle Detection in Aerial Images
    Mo, Nan
    Yan, Li
    REMOTE SENSING, 2020, 12 (16)
  • [39] Mask OBB: A Semantic Attention-Based Mask Oriented Bounding Box Representation for Multi-Category Object Detection in Aerial Images
    Wang, Jinwang
    Ding, Jian
    Guo, Haowen
    Cheng, Wensheng
    Pan, Ting
    Yang, Wen
    REMOTE SENSING, 2019, 11 (24)
  • [40] Oriented object detection in satellite images using convolutional neural network based on ResNeXt
    Haryono, Asep
    Jati, Grafika
    Jatmiko, Wisnu
    ETRI JOURNAL, 2024, 46 (02) : 307 - 322