Vis-YOLO: a lightweight and efficient image detector for unmanned aerial vehicle small objects

被引：0

作者：

Deng, Xiangyu ^{[1
]}

Du, Jiangyong ^{[1
]}

机构：

[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2024年 / 33卷 / 05期

关键词：

small objects; YOLOv8s; lightweight and efficient; unmanned aerial vehicle;

D O I：

10.1117/1.JEI.33.5.053003

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Yolo series models are extensive within the domain of object detection. Aiming at the challenge of small object detection, we analyze the limitations of existing detection models and propose a Vis-YOLO object detection algorithm based on YOLOv8s. First, the down-sampling times are reduced to retain more features, and the detection head is replaced to adapt to the small object. Then, deformable convolutional networks are used to improve the C2f module, improving its feature extraction ability. Finally, the separation and enhancement attention module is introduced to the model to give more weight to the useful information. Experiments show that the improved Vis-YOLO model outperforms the YOLOv8s model on the visdrone2019 dataset. The precision improved by 5.4%, the recall by 6.3%, and the mAP50 by 6.8%. Moreover, Vis-YOLO models are smaller and suitable for mobile deployment. This research provides a new method and idea for small object detection, which has excellent potential application value. (c) 2024 SPIE and IS&T

引用

页数：15

共 32 条

[1] Benjumea A, 2021, Arxiv, DOI [arXiv:2112.11798, 10.48550/arXiv.2112.11798]
[2] YOLO-S: A Lightweight and Accurate YOLO-like Network for Small Target Selection in Aerial Imagery
Betti, Alessandro
Tucci, Mauro
[J]. SENSORS, 2023, 23 (04)
[3] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[4] Carion N., 2020, EUR C COMP VIS, P213, DOI [10.1007/978-3-030-58452-8, 10., 10.1007/978-3-030-58452-813]
[5] Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges
Ding, Jian
Xue, Nan
Xia, Gui-Song
Bai, Xiang
Yang, Wen
Yang, Michael Ying
Belongie, Serge
Luo, Jiebo
Datcu, Mihai
Pelillo, Marcello
Zhang, Liangpei
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7778 - 7796
[6] Dosovitskiy A., 2020, Comput. Vis. Pattern Recognit
[7] Du D., 2019, P IEEE CVF INT C COM, DOI [10.1109/ICCVW.2019.00031, DOI 10.1109/ICCVW.2019.00031]
[8] Cas-VSwin transformer: A variant swin transformer for surface-defect detection
Gao, Linfeng
Zhang, Jianxun
Yang, Changhui
Zhou, Yuechuan
[J]. COMPUTERS IN INDUSTRY, 2022, 140
[9] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, 10.48550/arXiv.2107.08430, DOI 10.48550/ARXIV.2107.08430]
[10] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587

← 1 2 3 4 →