Comparative Research on Deep Learning Approaches for Airplane Detection from Very High-Resolution Satellite Images

被引:85
作者
Alganci, Ugur [1 ]
Soydas, Mehmet [2 ]
Sertel, Elif [1 ]
机构
[1] Istanbul Tech Univ, Civil Engn Fac, Geomat Engn Dept, ITU Ayazaga Campus, TR-34469 Istanbul, Turkey
[2] Istanbul Tech Univ, Inst Informat, Satellite Commun & Remote Sensing Program, ITU Ayazaga Campus,Inst Informat Bldg, TR-34469 Istanbul, Turkey
关键词
convolutional neural networks (CNNs); end-to-end detection; transfer learning; remote sensing; single shot multi-box detector (SSD); You Look Only Once-v3 (YOLO-v3); Faster RCNN; SCENE CLASSIFICATION; OBJECT DETECTION; TARGET DETECTION; SHIP DETECTION; NETWORKS;
D O I
10.3390/rs12030458
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Object detection from satellite images has been a challenging problem for many years. With the development of effective deep learning algorithms and advancement in hardware systems, higher accuracies have been achieved in the detection of various objects from very high-resolution (VHR) satellite images. This article provides a comparative evaluation of the state-of-the-art convolutional neural network (CNN)-based object detection models, which are Faster R-CNN, Single Shot Multi-box Detector (SSD), and You Look Only Once-v3 (YOLO-v3), to cope with the limited number of labeled data and to automatically detect airplanes in VHR satellite images. Data augmentation with rotation, rescaling, and cropping was applied on the test images to artificially increase the number of training data from satellite images. Moreover, a non-maximum suppression algorithm (NMS) was introduced at the end of the SSD and YOLO-v3 flows to get rid of the multiple detection occurrences near each detected object in the overlapping areas. The trained networks were applied to five independent VHR test images that cover airports and their surroundings to evaluate their performance objectively. Accuracy assessment results of the test regions proved that Faster R-CNN architecture provided the highest accuracy according to the F1 scores, average precision (AP) metrics, and visual inspection of the results. The YOLO-v3 ranked as second, with a slightly lower performance but providing a balanced trade-off between accuracy and speed. The SSD provided the lowest detection performance, but it was better in object localization. The results were also evaluated in terms of the object size and detection accuracy manner, which proved that large- and medium-sized airplanes were detected with higher accuracy.
引用
收藏
页数:28
相关论文
共 46 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2014, IEEE T GEOSCIENCE RE
[3]   Learning Oriented Region-based Convolutional Neural Networks for Building Detection in Satellite Remote Sensing Images [J].
Chen, Chaoyue ;
Gong, Weiguo ;
Hu, Yan ;
Chen, Yongliang ;
Ding, Yi .
ISPRS HANNOVER WORKSHOP: HRIGI 17 - CMRT 17 - ISA 17 - EUROCOW 17, 2017, 42-1 (W1) :461-464
[4]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[5]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[6]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28
[7]   Object detection in remote sensing imagery using a discriminatively trained mixture model [J].
Cheng, Gong ;
Han, Junwei ;
Guo, Lei ;
Qian, Xiaoliang ;
Zhou, Peicheng ;
Yao, Xiwen ;
Hu, Xintao .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 85 :32-43
[8]   Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA [J].
Cheng, Gong ;
Guo, Lei ;
Zhao, Tianyun ;
Han, Junwei ;
Li, Huihui ;
Fang, Jun .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (01) :45-59
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338