Vis-YOLO: a lightweight and efficient image detector for unmanned aerial vehicle small objects

被引:0
作者
Deng, Xiangyu [1 ]
Du, Jiangyong [1 ]
机构
[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Peoples R China
关键词
small objects; YOLOv8s; lightweight and efficient; unmanned aerial vehicle;
D O I
10.1117/1.JEI.33.5.053003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Yolo series models are extensive within the domain of object detection. Aiming at the challenge of small object detection, we analyze the limitations of existing detection models and propose a Vis-YOLO object detection algorithm based on YOLOv8s. First, the down-sampling times are reduced to retain more features, and the detection head is replaced to adapt to the small object. Then, deformable convolutional networks are used to improve the C2f module, improving its feature extraction ability. Finally, the separation and enhancement attention module is introduced to the model to give more weight to the useful information. Experiments show that the improved Vis-YOLO model outperforms the YOLOv8s model on the visdrone2019 dataset. The precision improved by 5.4%, the recall by 6.3%, and the mAP50 by 6.8%. Moreover, Vis-YOLO models are smaller and suitable for mobile deployment. This research provides a new method and idea for small object detection, which has excellent potential application value. (c) 2024 SPIE and IS&T
引用
收藏
页数:15
相关论文
共 32 条
  • [1] Benjumea A, 2021, Arxiv, DOI [arXiv:2112.11798, 10.48550/arXiv.2112.11798]
  • [2] YOLO-S: A Lightweight and Accurate YOLO-like Network for Small Target Selection in Aerial Imagery
    Betti, Alessandro
    Tucci, Mauro
    [J]. SENSORS, 2023, 23 (04)
  • [3] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [4] Carion N., 2020, EUR C COMP VIS, P213, DOI [10.1007/978-3-030-58452-8, 10., 10.1007/978-3-030-58452-813]
  • [5] Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges
    Ding, Jian
    Xue, Nan
    Xia, Gui-Song
    Bai, Xiang
    Yang, Wen
    Yang, Michael Ying
    Belongie, Serge
    Luo, Jiebo
    Datcu, Mihai
    Pelillo, Marcello
    Zhang, Liangpei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7778 - 7796
  • [6] Dosovitskiy A., 2020, Comput. Vis. Pattern Recognit
  • [7] Du D., 2019, P IEEE CVF INT C COM, DOI [10.1109/ICCVW.2019.00031, DOI 10.1109/ICCVW.2019.00031]
  • [8] Cas-VSwin transformer: A variant swin transformer for surface-defect detection
    Gao, Linfeng
    Zhang, Jianxun
    Yang, Changhui
    Zhou, Yuechuan
    [J]. COMPUTERS IN INDUSTRY, 2022, 140
  • [9] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, 10.48550/arXiv.2107.08430, DOI 10.48550/ARXIV.2107.08430]
  • [10] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587