YOLO-ViT-Based Method for Unmanned Aerial Vehicle Infrared Vehicle Target Detection

Cited by: 29
Authors
Zhao, Xiaofeng [1]
Xia, Yuting [1]
Zhang, Wenwen [1]
Zheng, Chao [1]
Zhang, Zhili [1]
Affiliations
[1] Xian Res Inst High Tech, Xian 710025, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
unmanned aerial vehicle target detection; vehicle detection; infrared small target; deep learning; YOLOv7; network; UAV
DOI
10.3390/rs15153778
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Detecting infrared vehicle targets from UAVs is challenging because complex ground backgrounds, high target density, and a large proportion of small targets lead to high false alarm rates. To address these problems, a YOLOv7-based multi-scale detection method for infrared vehicle targets, termed YOLO-ViT, is proposed. First, within the YOLOv7 framework, the lightweight MobileViT network is adopted as the feature extraction backbone to extract both the local and global features of the object while reducing model complexity. Second, a C3-PANet neural network structure is designed. It adopts the CARAFE upsampling method to exploit the semantic information in the feature map and improve the model's recognition accuracy in the target region and, combined with the C3 structure, enlarges the receptive field to improve small-target recognition accuracy and model generalization. Finally, the K-means++ clustering method is used to optimize the anchor box sizes so that they better suit small infrared targets seen from UAVs, thereby improving detection efficiency. Experiments are reported on the public HIT-UAV dataset. Compared with the original method, YOLO-ViT reduces the number of parameters by 49.9% and the floating-point operations by 67.9%. The mean average precision (mAP) improves by 0.9% over the existing algorithm, reaching 94.5%, which supports the effectiveness of the method for UAV infrared vehicle target detection.
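The abstract names K-means++ for anchor box optimization but gives no details. The following is a minimal sketch of how such clustering is commonly done for YOLO-style detectors, not the authors' implementation. It assumes a distance of 1 - IoU between (width, height) pairs, which the abstract does not confirm. The function names and the box sizes are hypothetical and chosen only for illustration.

```python
import random


def iou_wh(a, b):
    """IoU of two boxes given as (width, height), aligned at a shared corner."""
    inter = min(a[0], b[0]) * min(a[1], b[1])
    return inter / (a[0] * a[1] + b[0] * b[1] - inter)


def kmeanspp_seed(boxes, k, rng):
    """K-means++ seeding: sample each new center with probability
    proportional to its squared distance to the nearest chosen center.
    Distance is assumed to be 1 - IoU (an assumption, not from the paper)."""
    centers = [rng.choice(boxes)]
    while len(centers) < k:
        weights = [min(1.0 - iou_wh(b, c) for c in centers) ** 2 for b in boxes]
        if sum(weights) == 0:
            # All boxes coincide with existing centers; fall back to uniform.
            centers.append(rng.choice(boxes))
        else:
            centers.append(rng.choices(boxes, weights=weights, k=1)[0])
    return centers


def cluster_anchors(boxes, k, iters=50, seed=0):
    """Lloyd iterations on (width, height) pairs, started from k-means++ seeds."""
    rng = random.Random(seed)
    centers = kmeanspp_seed(boxes, k, rng)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for b in boxes:
            # Assign each box to the center it overlaps most (smallest 1 - IoU).
            best = max(range(k), key=lambda i: iou_wh(b, centers[i]))
            groups[best].append(b)
        # Move each center to the mean width and height of its group;
        # an empty group keeps its previous center.
        updated = [
            (sum(w for w, _ in g) / len(g), sum(h for _, h in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
        if updated == centers:
            break
        centers = updated
    # Return anchors ordered from smallest to largest area.
    return sorted(centers, key=lambda c: c[0] * c[1])


# Hypothetical ground-truth box sizes in pixels (made up for illustration).
boxes = [(8, 6), (10, 8), (12, 9), (11, 7),
         (30, 18), (34, 20), (28, 16), (32, 22),
         (80, 60), (76, 56), (84, 64), (90, 58)]
anchors = cluster_anchors(boxes, k=3)
print(anchors)
```

In practice the input would be the labeled box sizes of the training set, and k would match the number of anchors the detector expects. Compared with uniform random initialization, the k-means++ seeding tends to spread the starting centers across small and large boxes.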
Pages: 16