Improved SSD Algorithm and Its Performance Analysis of Small Target Detection in Remote Sensing Images

被引:21
作者
Wang Junqiang [1 ,2 ]
Li Jiansheng [1 ]
Zhou Xuewen [2 ]
Zhang Xu [1 ]
机构
[1] Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450000, Henan, Peoples R China
[2] 78123 Troops, Chengdu 610000, Sichuan, Peoples R China
关键词
remote sensing; small target detection; deep learning; multi-scale prediction; feature pyramid; mean average precision;
D O I
10.3788/AOS201939.0628005
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
An improved single shot multibox detector (SSD) algorithm is proposed aiming at the problems of slow detection speed of the target proposal based remote sensing image target detection method represented by faster regions with convolutional neural network (R-CNN) and the low performance in small target detection by the SSD algorithm. The algorithm can combine the advantages of the existing detection methods based on target proposal and one-stage target detection to improve the target detection performance. Furthermore, the algorithm replaces the original visual geometry group net with a densely connected network as the backbone network and constructs a feature pyramid between the densely connected modules instead of the original multi-scale feature map. A sample data online acquisition system is designed to verify the accuracy and performance of the proposed algorithm. A sample set of aircraft and playground target is collected as the experimental sample. The network structure stability is verified by training the improved SSD algorithm. Consequently, good results can be achieved without the support of transfer learning. Moreover, the training process is not easy to diverge. By comparing the Faster R-CNN algorithm using ResNet101 as the backbone network and the R-FCN (region-based fully convolutional networks) algorithm, we find that the mean average precision (MAP) of the improved SSD algorithm is 9. 13% and 8. 48% higher than that of the faster R-CNN and R-FCN algorithms in the test set, respectively. The proposed SSD algorithm improves the MAP in the small target detection by 14.46% and 13.92% compared to the faster R-CNN and R-FCN algorithms, respectively. Detecting a single image takes 71.8 ms, which is 45. 7 ms and 7. 5 ms less than that of the faster R-CNN and R-FCN algorithms, respectively.
引用
收藏
页数:10
相关论文
共 26 条
[1]  
[曹家梓 Cao Jiazi], 2015, [仪器仪表学报, Chinese Journal of Scientific Instrument], V36, P776
[2]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[3]   A tutorial on the cross-entropy method [J].
De Boer, PT ;
Kroese, DP ;
Mannor, S ;
Rubinstein, RY .
ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) :19-67
[4]   Aerial Target Detection Based on Improved Faster R-CNN [J].
Feng Xiaoyu ;
Mei Wei ;
Hu Dashuai .
ACTA OPTICA SINICA, 2018, 38 (06)
[5]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[8]   Speed/accuracy trade-offs for modern convolutional object detectors [J].
Huang, Jonathan ;
Rathod, Vivek ;
Sun, Chen ;
Zhu, Menglong ;
Korattikara, Anoop ;
Fathi, Alireza ;
Fischer, Ian ;
Wojna, Zbigniew ;
Song, Yang ;
Guadarrama, Sergio ;
Murphy, Kevin .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3296-+
[9]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[10]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944