Oriented Object Detection in Aerial Images Based on the Scaled Smooth L1 Loss Function

被引：6

作者：

Wei, Linhai ^{[1
]}

Zheng, Chen ^{[2
]}

Hu, Yijun ^{[1
]}

机构：

[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Peoples R China

[2] Henan Univ, Sch Math & Stat, Kaifeng 475001, Peoples R China

来源：

REMOTE SENSING | 2023年 / 15卷 / 05期

基金：

中国国家自然科学基金;

关键词：

object detection; convolution network; loss function; remote sensing image; aerial image; DETECTION FRAMEWORK; VEHICLE DETECTION;

D O I：

10.3390/rs15051350

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Although many state-of-the-art object detectors have been developed, detecting small and densely packed objects with complicated orientations in remote sensing aerial images remains challenging. For object detection in remote sensing aerial images, different scales, sizes, appearances, and orientations of objects from different categories could most likely enlarge the variance in the detection error. Undoubtedly, the variance in the detection error should have a non-negligible impact on the detection performance. Motivated by the above consideration, in this paper, we tackled this issue, so that we could improve the detection performance and reduce the impact of this variance on the detection performance as much as possible. By proposing a scaled smooth L1 loss function, we developed a new two-stage object detector for remote sensing aerial images, named Faster R-CNN-NeXt with RoI-Transformer. The proposed scaled smooth L1 loss function is used for bounding box regression and makes regression invariant to scale. This property ensures that the bounding box regression is more reliable in detecting small and densely packed objects with complicated orientations and backgrounds, leading to improved detection performance. To learn rotated bounding boxes and produce more accurate object locations, a RoI-Transformer module is employed. This is necessary because horizontal bounding boxes are inadequate for aerial image detection. The ResNeXt backbone is also adopted for the proposed object detector. Experimental results on two popular datasets, DOTA and HRSC2016, show that the variance in the detection error significantly affects detection performance. The proposed object detector is effective and robust, with the optimal scale factor for the scaled smooth L1 loss function being around 2.0. Compared to other promising two-stage oriented methods, our method achieves a mAP of 70.82 on DOTA, with an improvement of at least 1.26 and up to 16.49. On HRSC2016, our method achieves an mAP of 87.1, with an improvement of at least 0.9 and up to 1.4.

引用

页数：23

共 50 条

[31] Transformer-based End-to-End Object Detection in Aerial Images
Vo, Nguyen D.
Le, Nguyen
Ngo, Giang
Doan, Du
Le, Do
Nguyen, Khang
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 1072 - 1079
[32] Lightweight small object detection network for aerial images based on cross-attention and information injection
Dong Wang
Junnan Liu
Shengyi Jin
Journal of Real-Time Image Processing, 2025, 22 (3)
[33] Object detection method based on CIoU improved bounding box loss function
Liu Xiong-biao
Yang Xian-zhao
Chen Yang
Zhao Shuai-tong
CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (05) : 656 - 665
[34] Gaussian Focal Loss: Learning Distribution Polarized Angle Prediction for Rotated Object Detection in Aerial Images
Wang, Jian
Li, Fan
Bi, Haixia
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[35] A Lightweight Keypoint-Based Oriented Object Detection of Remote Sensing Images
Li, Yangyang
Mao, Heting
Liu, Ruijiao
Pei, Xuan
Jiao, Licheng
Shang, Ronghua
REMOTE SENSING, 2021, 13 (13)
[36] A Self-Adaptive Object Detection Network for Aerial Images Based on Feature Enhancement
Zhao, Ming
Zhao, Kai
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[37] Improved YOLOv7-Tiny for Object Detection Based on UAV Aerial Images
Zhang, Zitong
Xie, Xiaolan
Guo, Qiang
Xu, Jinfan
ELECTRONICS, 2024, 13 (15)
[38] Improved Faster RCNN Based on Feature Amplification and Oversampling Data Augmentation for Oriented Vehicle Detection in Aerial Images
Mo, Nan
Yan, Li
REMOTE SENSING, 2020, 12 (16)
[39] Mask OBB: A Semantic Attention-Based Mask Oriented Bounding Box Representation for Multi-Category Object Detection in Aerial Images
Wang, Jinwang
Ding, Jian
Guo, Haowen
Cheng, Wensheng
Pan, Ting
Yang, Wen
REMOTE SENSING, 2019, 11 (24)
[40] Oriented object detection in satellite images using convolutional neural network based on ResNeXt
Haryono, Asep
Jati, Grafika
Jatmiko, Wisnu
ETRI JOURNAL, 2024, 46 (02) : 307 - 322

← 1 2 3 4 5 →