Enhancing Object Detection in Remote Sensing: A Hybrid YOLOv7 and Transformer Approach with Automatic Model Selection

被引:3
作者
Ahmed, Mahmoud [1 ]
El-Sheimy, Naser [2 ]
Leung, Henry [1 ]
Moussa, Adel [2 ,3 ]
机构
[1] Univ Calgary, Dept Elect & Software Engn, Calgary, AB T2N 1N4, Canada
[2] Univ Calgary, Dept Geomat Engn, Calgary, AB T2N 1N4, Canada
[3] Port Said Univ, Dept Elect & Comp Engn, Port Said 42523, Egypt
关键词
object detection; detection transformer; YOLOv7; multimodalities; NETWORKS;
D O I
10.3390/rs16010051
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the remote sensing field, object detection holds immense value for applications such as land use classification, disaster monitoring, and infrastructure planning, where accurate and efficient identification of objects within images is essential for informed decision making. However, achieving object localization with high precision can be challenging even if minor errors exist at the pixel level, which can significantly impact the ground distance measurements. To address this critical challenge, our research introduces an innovative hybrid approach that combines the capabilities of the You Only Look Once version 7 (YOLOv7) and DEtection TRansformer (DETR) algorithms. By bridging the gap between local receptive field and global context, our approach not only enhances overall object detection accuracy, but also promotes precise object localization, a key requirement in the field of remote sensing. Furthermore, a key advantage of our approach is the introduction of an automatic selection module which serves as an intelligent decision-making component. This module optimizes the selection process between YOLOv7 and DETR, and further improves object detection accuracy. Finally, we validate the improved performance of our new hybrid approach through empirical experimentation, and thus confirm its contribution to the field of target recognition and detection in remote sensing images.
引用
收藏
页数:17
相关论文
共 52 条
  • [1] Transformers in Remote Sensing: A Survey
    Aleissaee, Abdulaziz Amer
    Kumar, Amandeep
    Anwer, Rao Muhammad
    Khan, Salman
    Cholakkal, Hisham
    Xia, Gui-Song
    Khan, Fahad Shahbaz
    [J]. REMOTE SENSING, 2023, 15 (07)
  • [2] Comparative Research on Deep Learning Approaches for Airplane Detection from Very High-Resolution Satellite Images
    Alganci, Ugur
    Soydas, Mehmet
    Sertel, Elif
    [J]. REMOTE SENSING, 2020, 12 (03)
  • [3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [4] Cesar L.B., 2023, Engineering Proceedings, V39, DOI [10.3390/engproc2023039026, DOI 10.3390/ENGPROC2023039026]
  • [5] Zeiler MD, 2012, Arxiv, DOI arXiv:1212.5701
  • [6] Dosovitskiy A., 2021, ICLR
  • [7] Feng H.-Z., 2023, J. Electron. Sci. Technol, V21, P100215, DOI [10.1016/j.jnlest.2023.100215, DOI 10.1016/J.JNLEST.2023.100215]
  • [8] Gao Y., 2021, 2021 18 INT COMPUTER, P304, DOI DOI 10.1109/ICCWAMTIP53232.2021.9674150
  • [9] LocNet: Improving Localization Accuracy for Object Detection
    Gidaris, Spyros
    Komodakis, Nikos
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 789 - 798
  • [10] Gupta K., 2022, arXiv