The Research of Small Object Detection based on YOLOX in UAV

被引:0
作者
Liu, Xinli [1 ]
Yang, Ming [1 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
来源
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024 | 2024年
关键词
UAV; YOLOX; lightweight; small object detection;
D O I
10.1109/CSCWD61410.2024.10580555
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Object detection on Unmanned Aerial Vehicles (UAVs) is a challenging problem due to the limited computing resources of the edge GPU of the Internet of Things (IoT) nodes and the presence of a large number of small objects in aerial images. Therefore, this paper proposes a lightweight deep learning architecture based on YOLOX model. Firstly, we design a lightweight backbone network to replace the backbone network in YOLOX. Then, we use four different sizes of neck feature maps for detection, which can improve the accuracy of small object detection much better. At the same time, we reduce the number of parameters by removing one convolution from the header and adding a max-pooling layer to obtain local information for classification. Compared to YOLOX-s, our model has improved the mAP@50 and mAP@0.5:0.95 by 4.6% and 2.5% respectively on the Visdrone2023 validation set. It is worth noting that our model has only 6.61M parameters, and we also provide a tiny version with only 2.59M parameters. A series of experimental results illustrates that our enhanced algorithm outperforms YOLOX-s, YOLOV7-tiny, and the latest YOLOV8-s.
引用
收藏
页码:507 / 512
页数:6
相关论文
共 24 条
  • [1] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [2] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
  • [3] Parallel Residual Bi-Fusion Feature Pyramid Network for Accurate Single-Shot Object Detection
    Chen, Ping-Yang
    Chang, Ming-Ching
    Hsieh, Jun-Wei
    Chen, Yong-Sheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9099 - 9111
  • [4] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]
  • [5] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [6] Jocher G., 2023, YOLO ULTRALYTICS
  • [7] Robust Obstacle Detection and Recognition for Driver Assistance Systems
    Leng, Jiaxu
    Liu, Ying
    Du, Dawei
    Zhang, Tianlin
    Quan, Pei
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1560 - 1571
  • [8] Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
    Li, Feng
    Zhang, Hao
    Xu, Huaizhe
    Liu, Shilong
    Zhang, Lei
    Ni, Lionel M.
    Shum, Heimg-Yeung
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3041 - 3050
  • [9] Perceptual Generative Adversarial Networks for Small Object Detection
    Li, Jianan
    Liang, Xiaodan
    Wei, Yunchao
    Xu, Tingfa
    Feng, Jiashi
    Yan, Shuicheng
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1951 - 1959
  • [10] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944