RS-YOLOX: A High-Precision Detector for Object Detection in Satellite Remote Sensing Images

被引:25
作者
Yang, Lei [1 ]
Yuan, Guowu [1 ,2 ]
Zhou, Hao [1 ,2 ]
Liu, Hongyu [1 ]
Chen, Jian [1 ]
Wu, Hao [1 ,2 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China
[2] Yunnan Key Lab Intelligent Syst & Comp, Kunming 650504, Yunnan, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 17期
关键词
object detection; remote sensing image; attention mechanisms; feature fusion; varifocal loss; Slicing Aided Hyper Inference (SAHI); NETWORK;
D O I
10.3390/app12178707
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Automatic object detection by satellite remote sensing images is of great significance for resource exploration and natural disaster assessment. To solve existing problems in remote sensing image detection, this article proposes an improved YOLOX model for satellite remote sensing image automatic detection. This model is named RS-YOLOX. To strengthen the feature learning ability of the network, we used Efficient Channel Attention (ECA) in the backbone network of YOLOX and combined the Adaptively Spatial Feature Fusion (ASFF) with the neck network of YOLOX. To balance the numbers of positive and negative samples in training, we used the Varifocal Loss function. Finally, to obtain a high-performance remote sensing object detector, we combined the trained model with an open-source framework called Slicing Aided Hyper Inference (SAHI). This work evaluated models on three aerial remote sensing datasets (DOTA-v1.5, TGRS-HRRSD, and RSOD). Our comparative experiments demonstrate that our model has the highest accuracy in detecting objects in remote sensing image datasets.
引用
收藏
页数:22
相关论文
共 58 条
  • [1] Akyon F.C., 2022, arXiv
  • [2] Characterizing the Patterns and Trends of Urban Growth in Saudi Arabia's 13 Capital Cities Using a Landsat Time Series
    Aljaddani, Amal H.
    Song, Xiao-Peng
    Zhu, Zhe
    [J]. REMOTE SENSING, 2022, 14 (10)
  • [3] [Anonymous], 2017, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2017.690
  • [4] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [5] Ding J., 2021, ARXIV, DOI [10.1109/TPAMI.2021.3117983, DOI 10.1109/TPAMI.2021.3117983]
  • [6] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [7] Fu C.-Y., 2017, ARXIV170106659
  • [8] Ge Z, 2021, Arxiv, DOI arXiv:2107.08430
  • [9] Gevorgyan Z., 2022, ARXIV
  • [10] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448