LODNU: lightweight object detection network in UAV vision

被引：0

作者：

Naiyuan Chen

Yan Li

Zhuomin Yang

Zhensong Lu

Sai Wang

Junang Wang

机构：

[1] Nanjing University of Information Science and Technology,School of Automation

[2] Wuxi University,School of Internet of Things Engineering

[3] Ministry of Public Security,Research Institute of Traffic Management

来源：

The Journal of Supercomputing | 2023年 / 79卷

关键词：

Computer vision; Object detection; UAV; Lightweight; CA attention mechanism;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

With the development of unmanned aerial vehicle (UAV) technology, using UAV to detect objects has become a research focus. However, most of the current object detection algorithms based on deep learning are resource consuming, which is difficult to deploy on embedded devices lacking memory and computing power, such as UAV. To meet these challenges, this paper proposes a lightweight object detection network in UAV vision (LODNU) based on YOLOv4, which can meet the application requirements of resource-constrained devices while ensuring the detection accuracy. Based on YOLOv4, LODNU uses depth-wise separable convolution to reconstruct the backbone network to reduce the parameters of the model, and embeds improved coordinate attention in the backbone network to improve the extraction ability of key object features. Meanwhile, the adaptive scale weighted feature fusion module is added to the path aggregation network to improve the accuracy of multi-scale object detection. In addition, in order to balance the proportion of sample size, we propose a patching data augmentation. LODNU achieves the performance near to YOLOv4 with fewer parameters. Compared with YOLOv4, LODNU achieves more balanced results between model size and accuracy in the VisDrone2019 dataset. The experimental show that the parameters of LODNU are only 13.1% of YOLOv4, and the floating-point operations are only 13.6% of YOLOv4. The mean average precision of LODNU in the VisDrone2019 dataset is 31.4%, which is 77% of YOLOv4.

引用

页码：10117 / 10138

页数：21

共 37 条

[1] Gupta H(2022)monitoring and surveillance of urban road traffic using low altitude drone images: a deep learning approach Multimed Tools Appl 81 19683-19703
[2] Verma OP(2021)A real-time deep learning forest fire monitoring algorithm based on an improved pruned+ kd model J Real Time Image Proc 18 2319-2329
[3] Wang S(2014)Object detection in optical remote sensing images based on weakly supervised learning and high-level feature learning IEEE Trans Geosci Remote Sens 53 3325-3337
[4] Zhao J(2013)Ship detection in high-resolution optical imagery based on anomaly detector and local shape feature IEEE Trans Geosci Remote Sens 52 4511-4523
[5] Ta N(2015)The pascal visual object classes challenge: a retrospective Int J Comput Vis 111 98-136
[6] Zhao X(2015)Imagenet large scale visual recognition challenge Int J Comput Vis 115 211-252
[7] Xiao M(2020)Deep learning for generic object detection: a survey Int J Comput Vis 128 261-318
[8] Wei H(2015)Region-based convolutional networks for accurate object detection and segmentation IEEE Trans Pattern Anal Mach Intell 38 142-158
[9] Han J(2017)Faster r-cnn: Towards real-time object detection with region proposal networks IEEE Trans Pattern Anal Mach Intell 39 1137-1149
[10] Zhang D(2016)R-fcn: object detection via region-based fully convolutional networks Adv Neural Inf Process Syst 29 379-387

← 1 2 3 4 →