A Lightweight and Accurate UAV Detection Method Based on YOLOv4

被引:7
作者
Cai, Hao [1 ]
Xie, Yuanquan [1 ]
Xu, Jianlong [1 ]
Xiong, Zhi [1 ]
机构
[1] Shantou Univ, Dept Comp Sci, Shantou 515041, Peoples R China
关键词
object detection; UAV detection; deep learning; depth-wise separable convolution; NETWORKS;
D O I
10.3390/s22186874
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
At present, the UAV (Unmanned Aerial Vehicle) has been widely used both in civilian and military fields. Most of the current object detection algorithms used to detect UAVs require more parameters, and it is difficult to achieve real-time performance. In order to solve this problem while ensuring a high accuracy rate, we further lighten the model and reduce the number of parameters of the model. This paper proposes an accurate and lightweight UAV detection model based on YOLOv4. To verify the effectiveness of this model, we made a UAV dataset, which contains four types of UAVs and 20,365 images. Through comparative experiments and optimization of existing deep learning and object detection algorithms, we found a lightweight model to achieve an efficient and accurate rapid detection of UAVs. First, from the comparison of the one-stage method and the two-stage method, it is concluded that the one-stage method has better real-time performance and considerable accuracy in detecting UAVs. Then, we further compared the one-stage methods. In particular, for YOLOv4, we replaced MobileNet with its backbone network, modified the feature extraction network, and replaced standard convolution with depth-wise separable convolution, which greatly reduced the parameters and realized 82 FPS and 93.52% mAP while ensuring high accuracy and taking into account the real-time performance.
引用
收藏
页数:18
相关论文
共 37 条
[1]  
Bochkovskiy A., 2020, ARXIV 200410934
[2]  
Bosquet B., 2018, P BRIT MACH VIS C, P253
[3]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[4]   Nonparametric Variational Auto-encoders for Hierarchical Representation Learning [J].
Goyal, Prasoon ;
Hu, Zhiting ;
Liang, Xiaodan ;
Wang, Chenyu ;
Xing, Eric P. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5104-5112
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]   Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916
[7]   High-Speed Tracking with Kernelized Correlation Filters [J].
Henriques, Joao F. ;
Caseiro, Rui ;
Martins, Pedro ;
Batista, Jorge .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) :583-596
[8]  
Hoffer E, 2018, Arxiv, DOI [arXiv:1705.08741, 10.48550/arXiv.1705.08741]
[9]  
Howard AG., 2017, ARXIV, DOI DOI 10.48550/ARXIV.1704.04861
[10]   Searching for MobileNetV3 [J].
Howard, Andrew ;
Sandler, Mark ;
Chu, Grace ;
Chen, Liang-Chieh ;
Chen, Bo ;
Tan, Mingxing ;
Wang, Weijun ;
Zhu, Yukun ;
Pang, Ruoming ;
Vasudevan, Vijay ;
Le, Quoc V. ;
Adam, Hartwig .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324