EDR-YOLOv8: a lightweight target detection model for UAV aerial photography using advanced feature fusion methods

被引:0
作者
Hao, Yongchang [1 ]
Guo, Chenxia [1 ]
Yang, Ruifeng [1 ]
Zhao, Yuhui [1 ]
机构
[1] North Univ China, Sch Instrument & Elect, Taiyuan 030051, Shanxi, Peoples R China
关键词
YOLOv8; aerial photograph by drone; target detection; C2f_RepGhost; lightweighting;
D O I
10.1088/1361-6501/ada0d1
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Target detection from the aerial perspective of drones plays a crucial role in various fields. However, due to its unique high-altitude overhead view, images captured often exhibit a high proportion of small-sized targets amidst complex backgrounds and varying scales, posing significant challenges for detection. To address these issues, the EDR-YOLOv8 model has been proposed for drone-based aerial target detection. Firstly, the backbone of YOLOv8l is replaced with the high-resolution visual module EfficientViT, reducing the parameter count while maintaining the model's capability to express important features. Secondly, the feature fusion network is redesigned with a four-level prediction layer to enhance the detection accuracy of small-sized targets. Additionally, the lightweight dynamic upsampler DySample is introduced to preserve more detailed target information. Finally, we design the feature fusion module C2f_RepGhost, which integrates the RepGhost bottleneck structure with YOLOv8's C2f, thereby reducing computational complexity. Experimental results demonstrate that EDR-YOLOv8 achieves a 4.1% higher mAP@0.5 compared to the baseline YOLOv8l on the VisDrone2019-DET dataset, with a reduction of 40.5% in model size and 42.0% in parameter count. This illustrates that EDR-YOLOv8 achieves both lightweight modeling and improved detection accuracy.
引用
收藏
页数:16
相关论文
共 35 条
  • [1] Applications, databases and open computer vision research from drone videos and images: a survey
    Akbari, Younes
    Almaadeed, Noor
    Al-maadeed, Somaya
    Elharrouss, Omar
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3887 - 3938
  • [2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [3] Cai H., 2022, arXiv, DOI arXiv:2205.14756
  • [4] Chen CP, 2024, Arxiv, DOI [arXiv:2211.06088, DOI 10.48550/ARXIV.2211.06088]
  • [5] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [6] VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results
    Du, Dawei
    Zhu, Pengfei
    Wen, Longyin
    Bian, Xiao
    Ling, Haibin
    Hu, Qinghua
    Peng, Tao
    Zheng, Jiayu
    Wang, Xinyao
    Zhang, Yue
    Bo, Liefeng
    Shi, Hailin
    Zhu, Rui
    Kumar, Aashish
    Li, Aijin
    Zinollayev, Almaz
    Askergaliyev, Anuar
    Schumann, Arne
    Mao, Binjie
    Lee, Byeongwon
    Liu, Chang
    Chen, Changrui
    Pan, Chunhong
    Huo, Chunlei
    Yu, Da
    Cong, Dechun
    Zeng, Dening
    Pailla, Dheeraj Reddy
    Li, Di
    Wang, Dong
    Cho, Donghyeon
    Zhang, Dongyu
    Bai, Furui
    Jose, George
    Gao, Guangyu
    Liu, Guizhong
    Xiong, Haitao
    Qi, Hao
    Wang, Haoran
    Qiu, Heqian
    Li, Hongliang
    Lu, Huchuan
    Kim, Ildoo
    Kim, Jaekyum
    Shen, Jane
    Lee, Jihoon
    Ge, Jing
    Xu, Jingjing
    Zhou, Jingkai
    Meier, Jonas
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 213 - 226
  • [7] A Real-Time Small Target Vehicle Detection Algorithm with an Improved YOLOv5m Network Model
    Du, Yaoyao
    Jiang, Xiangkui
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (01): : 303 - 327
  • [8] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [9] Fine-tuned YOLOv5 for real-time vehicle detection in UAV imagery: Architectural improvements and performance boost
    Hamzenejadi, Mohammad Hossein
    Mohseni, Hadis
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [10] Kim J.-H., 2023, ICASSP 2023 2023 IEE, P1