The Research of Small Object Detection based on YOLOX in UAV

被引：0

作者：

Liu, Xinli ^{[1
]}

Yang, Ming ^{[1
]}

机构：

[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China

来源：

PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024 | 2024年

关键词：

UAV; YOLOX; lightweight; small object detection;

D O I：

10.1109/CSCWD61410.2024.10580555

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Object detection on Unmanned Aerial Vehicles (UAVs) is a challenging problem due to the limited computing resources of the edge GPU of the Internet of Things (IoT) nodes and the presence of a large number of small objects in aerial images. Therefore, this paper proposes a lightweight deep learning architecture based on YOLOX model. Firstly, we design a lightweight backbone network to replace the backbone network in YOLOX. Then, we use four different sizes of neck feature maps for detection, which can improve the accuracy of small object detection much better. At the same time, we reduce the number of parameters by removing one convolution from the header and adding a max-pooling layer to obtain local information for classification. Compared to YOLOX-s, our model has improved the mAP@50 and mAP@0.5:0.95 by 4.6% and 2.5% respectively on the Visdrone2023 validation set. It is worth noting that our model has only 6.61M parameters, and we also provide a tiny version with only 2.59M parameters. A series of experimental results illustrates that our enhanced algorithm outperforms YOLOX-s, YOLOV7-tiny, and the latest YOLOV8-s.

引用

页码：507 / 512

页数：6

共 24 条

[1] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[2] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Chen, Jierun
Kao, Shiu-Hong
He, Hao
Zhuo, Weipeng
Wen, Song
Lee, Chul-Ho
Chan, S. -H. Gary
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
[3] Parallel Residual Bi-Fusion Feature Pyramid Network for Accurate Single-Shot Object Detection
Chen, Ping-Yang
Chang, Ming-Ching
Hsieh, Jun-Wei
Chen, Yong-Sheng
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9099 - 9111
[4] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]
[5] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[6] Jocher G., 2023, YOLO ULTRALYTICS
[7] Robust Obstacle Detection and Recognition for Driver Assistance Systems
Leng, Jiaxu
Liu, Ying
Du, Dawei
Zhang, Tianlin
Quan, Pei
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1560 - 1571
[8] Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Li, Feng
Zhang, Hao
Xu, Huaizhe
Liu, Shilong
Zhang, Lei
Ni, Lionel M.
Shum, Heimg-Yeung
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3041 - 3050
[9] Perceptual Generative Adversarial Networks for Small Object Detection
Li, Jianan
Liang, Xiaodan
Wei, Yunchao
Xu, Tingfa
Feng, Jiashi
Yan, Shuicheng
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1951 - 1959
[10] Feature Pyramid Networks for Object Detection
Lin, Tsung-Yi
Dollar, Piotr
Girshick, Ross
He, Kaiming
Hariharan, Bharath
Belongie, Serge
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944

← 1 2 3 →