RS-YOLOX: A High-Precision Detector for Object Detection in Satellite Remote Sensing Images

被引:27
作者
Yang, Lei [1 ]
Yuan, Guowu [1 ,2 ]
Zhou, Hao [1 ,2 ]
Liu, Hongyu [1 ]
Chen, Jian [1 ]
Wu, Hao [1 ,2 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China
[2] Yunnan Key Lab Intelligent Syst & Comp, Kunming 650504, Yunnan, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 17期
关键词
object detection; remote sensing image; attention mechanisms; feature fusion; varifocal loss; Slicing Aided Hyper Inference (SAHI); NETWORK;
D O I
10.3390/app12178707
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Automatic object detection by satellite remote sensing images is of great significance for resource exploration and natural disaster assessment. To solve existing problems in remote sensing image detection, this article proposes an improved YOLOX model for satellite remote sensing image automatic detection. This model is named RS-YOLOX. To strengthen the feature learning ability of the network, we used Efficient Channel Attention (ECA) in the backbone network of YOLOX and combined the Adaptively Spatial Feature Fusion (ASFF) with the neck network of YOLOX. To balance the numbers of positive and negative samples in training, we used the Varifocal Loss function. Finally, to obtain a high-performance remote sensing object detector, we combined the trained model with an open-source framework called Slicing Aided Hyper Inference (SAHI). This work evaluated models on three aerial remote sensing datasets (DOTA-v1.5, TGRS-HRRSD, and RSOD). Our comparative experiments demonstrate that our model has the highest accuracy in detecting objects in remote sensing image datasets.
引用
收藏
页数:22
相关论文
共 58 条
[1]  
Akyon F. C., 2022, ARXIV
[2]   Characterizing the Patterns and Trends of Urban Growth in Saudi Arabia's 13 Capital Cities Using a Landsat Time Series [J].
Aljaddani, Amal H. ;
Song, Xiao-Peng ;
Zhu, Zhe .
REMOTE SENSING, 2022, 14 (10)
[3]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4]  
Ding J., 2021, ARXIV, DOI [10.1109/TPAMI.2021.3117983, DOI 10.1109/TPAMI.2021.3117983]
[5]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[6]  
Fu C., 2017, DSSD: deconvolutional single shot detector
[7]  
Ge Z, 2021, Arxiv, DOI arXiv:2107.08430
[8]  
Gevorgyan Z., 2022, ARXIV
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587