Lightweight Object Detection Networks for UAV Aerial Images Based on YOLO

被引:1
作者
Li, Yanshan [1 ,2 ]
Wang, Jiarong [1 ,2 ]
Zhang, Kunhua [1 ,2 ]
Yi, Jiawei [1 ,2 ]
Wei, Miaomiao [1 ,2 ]
Zheng, Lirong [1 ,2 ]
Xie, Weixin [1 ,2 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen 518000, Peoples R China
[2] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Aerial images; Deep learning; You only look once; Lightweight network; Object detection;
D O I
10.23919/cje.2022.00.300
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing high-precision object detection algorithms for UAV (unmanned aerial vehicle) aerial images often have a large number of parameters and heavy weight, which makes it difficult to be applied to mobile devices. We propose three YOLO-based lightweight object detection networks for UAVs, named YOLO-L, YOLO-S, and YOLO-M, respectively. In YOLO-L, we adopt a deconvolution approach to explore suitable upsampling rules during training to improve the detection accuracy. The convolution-batch normalization-SiLU activation function (CBS) structure is replaced with Ghost CBS to reduce the number of parameters and weight, meanwhile Maxpool maximum pooling operation is proposed to replace the CBS structure to avoid generating parameters and weight. YOLO-S greatly reduces the weight of the network by directly introducing CSPGhostNeck residual structures, so that the parameters and weight are respectively decreased by about 15% at the expense of 2.4% mAP. And YOLO-M adopts the CSPGhostNeck residual structure and deconvolution to reduce parameters by 5.6% and weight by 5.7%, while mAP only by 1.8%. The results show that the three lightweight detection networks proposed in this paper have good performance in UAV aerial image object detection task.
引用
收藏
页码:997 / 1009
页数:13
相关论文
共 21 条
[11]   Action Status Based Novel Relative Feature Representations for Interaction Recognition [J].
Li Yanshan ;
Guo Tianyu ;
Liu Xing ;
Luo Wenhan ;
Xie Weixin .
CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (01) :168-180
[12]   Multidimensional Local Binary Pattern for Hyperspectral Image Classification [J].
Li, Yanshan ;
Tang, Haojin ;
Xie, Weixin ;
Luo, Wenhan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[13]   A lightweight multi-scale aggregated model for detecting aerial images captured by UAVs [J].
Li, Zhaokun ;
Liu, Xueliang ;
Zhao, Ye ;
Liu, Bo ;
Huang, Zhen ;
Hong, Richang .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 77
[14]   ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design [J].
Ma, Ningning ;
Zhang, Xiangyu ;
Zheng, Hai-Tao ;
Sun, Jian .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :122-138
[15]   Deep learning-based object detection in low-altitude UAV datasets: A survey [J].
Mittal, Payal ;
Singh, Raman ;
Sharma, Akashdeep .
IMAGE AND VISION COMPUTING, 2020, 104 (104)
[16]   MobileNetV2: Inverted Residuals and Linear Bottlenecks [J].
Sandler, Mark ;
Howard, Andrew ;
Zhu, Menglong ;
Zhmoginov, Andrey ;
Chen, Liang-Chieh .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4510-4520
[17]   YOLOrs: Object Detection in Multimodal Remote Sensing Imagery [J].
Sharma, Manish ;
Dhanaraj, Mayur ;
Karnam, Srivallabha ;
Chachlakis, Dimitris G. ;
Ptucha, Raymond ;
Markopoulos, Panos P. ;
Saber, Eli .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :1497-1508
[18]   Small object detection via dual inspection mechanism for UAV visual images [J].
Tian, Gangyi ;
Liu, Jianran ;
Zhao, Hong ;
Yang, Wenyuan .
APPLIED INTELLIGENCE, 2022, 52 (04) :4244-4257
[19]   Multiscale Object Detection from Drone Imagery Using Ensemble Transfer Learning [J].
Walambe, Rahee ;
Marathe, Aboli ;
Kotecha, Ketan .
DRONES, 2021, 5 (03)
[20]   Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry, and Fusion [J].
Wang, Yang .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)