Detection of small target in aerial photography based on deep learning

被引:14
作者
Liang Hua [1 ,2 ]
Song Yu-long [1 ]
Qian Feng [1 ]
Song Ce [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Jilin, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
machine vision; target detection; convolutional neural network; convolution feature;
D O I
10.3788/YJYXS20183309.0793
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
In order to solve the problem of low recognition rate and poor positioning in aerial images, a target detection method based on deep learning is proposed. This algorithm uses VGG16 network as a fine tuning network and adds some deep network in it. Joint training is carried out by extracting the features of the shallow layers and the deep features of the target to overcome the contradiction between location and recognition in the process of detection. The singular value decomposition technology is used to compress the convolution features to reduce the computing and storage requirements of the model, and Multi scale training method is adopted to adapt to the change of aerial target scale. The experimental results show that 0.76 mAP can be implemented on the general data set PASCAL, and the detection speed is 16 fps. The 0.63 mAP can be achieved on the special aviation target data set UCAS-AOD, and the detection speed is 18 fps. It can satisfy the requirements for small target detection accuracy, and the detection speed can be close to the real-time detection effect.
引用
收藏
页码:793 / 800
页数:8
相关论文
共 19 条
[1]  
Andrew Zisserman, 2015, Arxiv, DOI arXiv:1409.1556
[2]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[3]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[4]  
Felzenszwalb P, 2008, PROC CVPR IEEE, P1984
[5]   Vehicle type recognition algorithm based on improved HOG feature [J].
Geng Qing-tian ;
Zhao Hao-yu ;
Yu Fan-hua ;
Wang Yu-ting ;
Zhao Hong-weig .
CHINESE OPTICS, 2018, 11 (02) :174-181
[6]  
Girshick R., 2014, P IEEE C COMP VIS PA, DOI DOI 10.1109/CVPR.2014.81
[7]  
Jian Sun, 2015, Arxiv, DOI arXiv:1406.4729
[8]  
Krizhevsky A., 2017, ADV NEURAL INFORM PR, V60, DOI [10.1145/3065386, DOI 10.1145/3065386]
[9]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[10]   Optical remote sensing image retrieval based on convolutional neural networks [J].
Li, Yu ;
Liu, Xue-Ying ;
Zhang, Hong-Qun ;
Li, Xiang-Juan ;
Sun, Xiao-Yao .
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2018, 26 (01) :200-207