Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images

被引:0
作者
Zhang R. [1 ]
Shao Z. [2 ]
Aleksei P. [3 ]
Wang J. [2 ]
机构
[1] School of Remote Sensing and Information Engineering, Wuhan University, Wuhan
[2] State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan
[3] Moscow State University of Geodesy and Cartography, Moscow
来源
Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University | 2020年 / 45卷 / 06期
基金
中国国家自然科学基金;
关键词
Dilated convolutional neural network; Multi-scale network; Object detection; UAV images;
D O I
10.13203/j.whugis20200253
中图分类号
学科分类号
摘要
As a new type of remote sensing sensor, unmanned aerial vehicle(UAV)has been used in various fields such as medical treatment, transportation, environmental monitoring, disaster warning, animal protection and military increasingly. Since UAV images are acquired from multiple flying altitudes, perspectives with high speed, objects in UAV images have various scales and perspectives with different distributions, which brings a series of problems to object detection in UAV images.To address these problems, we propose an object detection method based on multi-scale dilated convolutional neural network. It improves existing detection methods by a creative multi-scale dilated convolutional module which facilitates the whole network to learn deep features with increased field of view perception and further improves the performance of object detection in UAV images.We adopt three comparative experiments on base network and our proposed method. And experimental results show that our proposed network has a high precision and recall for object detection in UAV images. Moreover, objects are detected with high performance in multiple perspectives, various scales and complex backgrounds, which indicates the effectiveness and robustness of our method.Object detection in UAV image is significant in both civil and military fields. However, existing methods are limited with objects in multiple perspectives, scales and backgrounds.Our proposed method improves the performance of existing networks by dilated convolutional operator. Experimental results demonstrate the effectiveness and robustness of the proposed method. © 2020, Editorial Board of Geomatics and Information Science of Wuhan University. All right reserved.
引用
收藏
页码:895 / 903
页数:8
相关论文
共 38 条
[31]  
Bazi Y, Melgani F., Convolutional SVM Networks for Object Detection in UAV Imagery[J], IEEE Transactions on Geoscience and Remote Sensing, 56, 6, pp. 3107-3118, (2018)
[32]  
Liang Dong, Gao Sai, Sun Han, Et al., UAV Detection in a Motion Camera Combining Kernelized Correlation Filters and Deep Learning, Acta Aeronautica et Astronautica Sinica, (2020)
[33]  
Kapania S, Saini D, Goyal S, Et al., Multi Object Tracking with UAVs Using Deep SORT and YOLOv3 RetinaNet Detection Framework, The 1st ACM Workshop on Autonomous and Intelligent Mobile Systems, (2020)
[34]  
Aguilar W G, Quisaguano F J, Rodriguez G A, Et al., Convolutional Neuronal Networks Based Monocular Object Detection and Depth Perception for Micro UAVs, International Conference on Intelligent Science and Big Data Engineering, (2018)
[35]  
Yu F, Koltun V., Multi-Scale Context Aggregation by Dilated Convolutions, The International Conference on Learning Representations, (2016)
[36]  
Qu Changbo, Jiang Siyao, Wu Deyang, Multiscale Semantic Segmentation Network Based on Cavity Convolution, Computer Engineering and Applications, 55, 24, pp. 91-95, (2019)
[37]  
He Kaiming, Zhang Xiangyu, Ren Shaoqing, Et al., Deep Residual Learning for Image Recognition, IEEE Conference on Computer Vision and Pattern Recognition, (2016)
[38]  
Pang Jiangmiao, Chen Kai, Shi Jianping, Et al., Libra R-CNN: Towards Balanced Learning for Object Detection, IEEE Conference on Computer Vision and Pattern Recognition, (2019)