YOLO-ViT-Based Method for Unmanned Aerial Vehicle Infrared Vehicle Target Detection

被引:29
作者
Zhao, Xiaofeng [1 ]
Xia, Yuting [1 ]
Zhang, Wenwen [1 ]
Zheng, Chao [1 ]
Zhang, Zhili [1 ]
机构
[1] Xian Res Inst High Tech, Xian 710025, Peoples R China
基金
中国国家自然科学基金;
关键词
unmanned aerial vehicle target detection; vehicle detection; infrared small target; deep learning; Yolov7; NETWORK; UAV;
D O I
10.3390/rs15153778
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The detection of infrared vehicle targets by UAVs poses significant challenges in the presence of complex ground backgrounds, high target density, and a large proportion of small targets, which result in high false alarm rates. To alleviate these deficiencies, a novel YOLOv7-based, multi-scale target detection method for infrared vehicle targets is proposed, which is termed YOLO-ViT. Firstly, within the YOLOV7-based framework, the lightweight MobileViT network is incorporated as the feature extraction backbone network to fully extract the local and global features of the object and reduce the complexity of the model. Secondly, an innovative C3-PANet neural network structure is delicately designed, which adopts the CARAFE upsampling method to utilize the semantic information in the feature map and improve the model's recognition accuracy of the target region. In conjunction with the C3 structure, the receptive field will be increased to enhance the network's accuracy in recognizing small targets and model generalization ability. Finally, the K-means++ clustering method is utilized to optimize the anchor box size, leading to the design of anchor boxes better suited for detecting small infrared targets from UAVs, thereby improving detection efficiency. The present article showcases experimental findings attained through the use of the HIT-UAV public dataset. The results demonstrate that the enhanced YOLO-ViT approach, in comparison to the original method, achieves a reduction in the number of parameters by 49.9% and floating-point operations by 67.9%. Furthermore, the mean average precision (mAP) exhibits an improvement of 0.9% over the existing algorithm, reaching a value of 94.5%, which validates the effectiveness of the method for UAV infrared vehicle target detection.
引用
收藏
页数:16
相关论文
共 53 条
  • [41] CARAFE: Content-Aware ReAssembly of FEatures
    Wang, Jiaqi
    Chen, Kai
    Xu, Rui
    Liu, Ziwei
    Loy, Chen Change
    Lin, Dahua
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3007 - 3016
  • [42] Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
    Wang, Wenhai
    Xie, Enze
    Li, Xiang
    Fan, Deng-Ping
    Song, Kaitao
    Liang, Ding
    Lu, Tong
    Luo, Ping
    Shao, Ling
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 548 - 558
  • [43] Small vehicles detection based on UAV
    Wu Chen
    Zhao Baojun
    Tang Linbo
    Zhao Boya
    [J]. JOURNAL OF ENGINEERING-JOE, 2019, 2019 (21): : 7894 - 7897
  • [44] Deep Learning for Unmanned Aerial Vehicle-Based Object Detection and Tracking: A Survey
    Wu, Xin
    Li, Wei
    Hong, Danfeng
    Tao, Ran
    Du, Qian
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (01) : 91 - 124
  • [45] Infrared Small-Target Detection Under a Complex Background Based on a Local Gradient Contrast Method
    Yang, Linna
    Xie, Tao
    Liu, Mingxing
    Zhang, Mingjiang
    Qi, Shuaihui
    Yang, Jungang
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2023, 33 (01) : 33 - 43
  • [46] Infrared Image Small-Target Detection Based on Improved FCOS and Spatio-Temporal Features
    Yao, Shengbo
    Zhu, Qiuyu
    Zhang, Tao
    Cui, Wennan
    Yan, Peimin
    [J]. ELECTRONICS, 2022, 11 (06)
  • [47] Yuan L, 2021, Arxiv, DOI [arXiv:2101.11986, DOI 10.48550/ARXIV.2101.11986]
  • [48] CHFNet: Curvature Half-Level Fusion Network for Single-Frame Infrared Small Target Detection
    Zhang, Mingjin
    Li, Bate
    Wang, Tianyu
    Bai, Haichen
    Yue, Ke
    Li, Yunsong
    [J]. REMOTE SENSING, 2023, 15 (06)
  • [49] ISNet: Shape Matters for Infrared Small Target Detection
    Zhang, Mingjin
    Zhang, Rui
    Yang, Yuxiang
    Bai, Haichen
    Zhang, Jing
    Guo, Jie
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 867 - 876
  • [50] Multi-scale adversarial network for vehicle detection in UAV imagery
    Zhang, Ruiqian
    Newsam, Shawn
    Shao, Zhenfeng
    Huang, Xiao
    Wang, Jiaming
    Li, Deren
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 180 : 283 - 295