DFTD-YOLO: Lightweight Multi-Target Detection From Unmanned Aerial Vehicle Viewpoints

被引:1
作者
Chen, Yuteng [1 ]
Liu, Zhaoguang [1 ]
机构
[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan 250200, Peoples R China
关键词
UAV multi-target detection; YOLO; feature fusion; detection head;
D O I
10.1109/ACCESS.2025.3535624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the low detection accuracy of small and dense target objects in multi-target detection tasks from the unmanned aerial vehicle (UAV) perspective and the deployment of deep learning models for UAVs as embedded devices, these models must be lightweight. In this study, we propose an improved algorithm, DFTD-YOLO, based on YOLOv8n. We designed a new neck feature fusion network. The network better balances information transfer between shallow and deep layers through a detailed information extraction module and an abstract feature information aggregation module, effectively reducing the loss of detail information with gradient flow and improving detection performance. In addition, we designed a new detection head called the TDD-Head. This module enhances the feature interaction between the classification and regression tasks through the task alignment mechanism and shared convolution, which reduces model parameters and computation and improves model performance. To validate the model, we conducted validation experiments on the VisDrone2021 dataset. The experimental results showed a 33.67% reduction in the number of parameters, 17.3% reduction in the amount of computation, 10.74% improvement in mAP@0.5, and 13.2% improvement in mAP@0.5:0.95 compared with the existing YOLOv8n. The results demonstrate the considerable potential of the model for multitarget detection tasks from the UAV perspective.
引用
收藏
页码:24672 / 24680
页数:9
相关论文
共 33 条
[1]  
Adaimi G., 2020, arXiv, DOI DOI 10.48550/ARXIV.2009.07611
[2]  
Amit SNKB, 2017, 2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), P239, DOI 10.1109/KCIC.2017.8228593
[3]   Utilizing YOLOv8 for enhanced traffic monitoring in intelligent transportation systems (ITS) applications [J].
Bakirci, Murat .
DIGITAL SIGNAL PROCESSING, 2024, 152
[4]   Vehicle Detection From UAV Imagery With Deep Learning: A Review [J].
Bouguettaya, Abdelmalek ;
Zarzour, Hafed ;
Kechida, Ahmed ;
Taberkit, Amine Mohammed .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) :6047-6067
[5]   VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results [J].
Cao, Yaru ;
He, Zhijian ;
Wang, Lujia ;
Wang, Wenguan ;
Yuan, Yixuan ;
Zhang, Dingwen ;
Zhang, Jinglin ;
Zhu, Pengfei ;
Van Gool, Luc ;
Han, Junwei ;
Hoi, Steven ;
Hu, Qinghua ;
Liu, Ming ;
Cheng, Chong ;
Liu, Fanfan ;
Cao, Guojin ;
Li, Guozhen ;
Wang, Hongkai ;
He, Jianye ;
Wan, Junfeng ;
Wan, Qi ;
Zhao, Qi ;
Lyu, Shuchang ;
Zhao, Wenzhe ;
Lu, Xiaoqiang ;
Zhu, Xingkui ;
Liu, Yingjie ;
Lv, Yixuan ;
Ma, Yujing ;
Yang, Yuting ;
Wang, Zhe ;
Xu, Zhenyu ;
Luo, Zhipeng ;
Zhang, Zhimin ;
Zhang, Zhiguang ;
Li, Zihao ;
Zhang, Zixiao .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :2847-2854
[6]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[7]   TOOD: Task-aligned One-stage Object Detection [J].
Feng, Chengjian ;
Zhong, Yujie ;
Gao, Yu ;
Scott, Matthew R. ;
Huang, Weilin .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3490-3499
[8]  
Flores A., 2022, P IEEE INT C AUT 25, P1
[9]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[10]  
Ibork Y., 2024, P INT C INT SYST COM, P1