Geospatial Object Detection in High Resolution Satellite Images Based on Multi-Scale Convolutional Neural Network

被引:146
作者
Guo, Wei [1 ,2 ]
Yang, Wen [1 ,2 ]
Zhang, Haijian [1 ]
Hua, Guang [1 ]
机构
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China
[2] CETC Key Lab Aerosp Informat Applicat, Shijiazhuang 050081, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
high resolution satellite images; geospatial object detection; object proposal network; object detection network; SCENE CLASSIFICATION; AUTOMATED DETECTION; BUILDINGS; HISTOGRAMS; AERIAL;
D O I
10.3390/rs10010131
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Daily acquisition of large amounts of aerial and satellite images has facilitated subsequent automatic interpretations of these images. One such interpretation is object detection. Despite the great progress made in this domain, the detection of multi-scale objects, especially small objects in high resolution satellite (HRS) images, has not been adequately explored. As a result, the detection performance turns out to be poor. To address this problem, we first propose a unified multi-scale convolutional neural network (CNN) for geospatial object detection in HRS images. It consists of a multi-scale object proposal network and a multi-scale object detection network, both of which share a multi-scale base network. The base network can produce feature maps with different receptive fields to be responsible for objects with different scales. Then, we use the multi-scale object proposal network to generate high quality object proposals from the feature maps. Finally, we use these object proposals with the multi-scale object detection network to train a good object detector. Comprehensive evaluations on a publicly available remote sensing object detection dataset and comparisons with several state-of-the-art approaches demonstrate the effectiveness of the presented method. The proposed method achieves the best mean average precision (mAP) value of 89.6%, runs at 10 frames per second (FPS) on a GTX 1080Ti GPU.
引用
收藏
页数:21
相关论文
共 58 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]   An automated airplane detection system for large panchromatic image with high spatial resolution [J].
An, Zhenyu ;
Shi, Zhenwei ;
Teng, Xichao ;
Yu, Xinran ;
Tang, Wei .
OPTIK, 2014, 125 (12) :2768-2775
[3]  
[Anonymous], P IEEE C COMP VIS PA
[4]  
[Anonymous], 2017, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2017.557
[5]  
[Anonymous], 2005, P INT C SPAC INF TEC
[6]   Geographic Object-Based Image Analysis - Towards a new paradigm [J].
Blaschke, Thomas ;
Hay, Geoffrey J. ;
Kelly, Maggi ;
Lang, Stefan ;
Hofmann, Peter ;
Addink, Elisabeth ;
Feitosa, Raul Queiroz ;
van der Meer, Freek ;
van der Werff, Harald ;
van Coillie, Frieke ;
Tiede, Dirk .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 87 :180-191
[7]  
Bo S., 2010, 2010 3 INT C IM SIGN, V4, P1923
[8]  
Cai Zhaowei, 2016, LECT NOTES COMPUT SC, P354, DOI DOI 10.1007/978-3-319-46493-0_22
[9]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[10]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28