Geospatial Object Detection in High Resolution Satellite Images Based on Multi-Scale Convolutional Neural Network

Cited by: 141
Authors
Guo, Wei [1 ,2 ]
Yang, Wen [1 ,2 ]
Zhang, Haijian [1 ]
Hua, Guang [1 ]
Affiliations
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China
[2] CETC Key Lab Aerosp Informat Applicat, Shijiazhuang 050081, Hebei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
high resolution satellite images; geospatial object detection; object proposal network; object detection network; scene classification; automated detection; buildings; histograms; aerial;
DOI
10.3390/rs10010131
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Subject classification code
08 ; 0830 ;
Abstract
The daily acquisition of large amounts of aerial and satellite images has facilitated the subsequent automatic interpretation of these images. One such interpretation task is object detection. Despite the great progress made in this domain, the detection of multi-scale objects, especially small objects in high resolution satellite (HRS) images, has not been adequately explored, and detection performance on such objects remains poor. To address this problem, we first propose a unified multi-scale convolutional neural network (CNN) for geospatial object detection in HRS images. It consists of a multi-scale object proposal network and a multi-scale object detection network, both of which share a multi-scale base network. The base network produces feature maps with different receptive fields, each responsible for objects at a different scale. Then, we use the multi-scale object proposal network to generate high quality object proposals from the feature maps. Finally, we use these object proposals with the multi-scale object detection network to train the object detector. Comprehensive evaluations on a publicly available remote sensing object detection dataset and comparisons with several state-of-the-art approaches demonstrate the effectiveness of the presented method. The proposed method achieves the best mean average precision (mAP) value of 89.6% and runs at 10 frames per second (FPS) on a GTX 1080Ti GPU.
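The link between network depth and receptive field that the abstract relies on can be illustrated with a minimal sketch. The layer stack below is a hypothetical VGG-style configuration, not the base network from the paper, and `pick_branch` is an illustrative nearest-size heuristic rather than the authors' assignment rule. The receptive-field recurrence itself is the standard one for stacked convolution and pooling layers.

```python
from typing import List, Tuple

# Hypothetical VGG-style stack: (name, kernel_size, stride).
# These values are assumptions for illustration, not the paper's architecture.
LAYERS: List[Tuple[str, int, int]] = [
    ("conv1", 3, 1), ("pool1", 2, 2),
    ("conv2", 3, 1), ("pool2", 2, 2),
    ("conv3", 3, 1), ("pool3", 2, 2),
    ("conv4", 3, 1),
]


def receptive_fields(layers: List[Tuple[str, int, int]]) -> List[Tuple[str, int, int]]:
    """Return (name, receptive_field, cumulative_stride) after each layer."""
    rf, jump = 1, 1  # a single input pixel sees itself; the step between pixels is 1
    out = []
    for name, kernel, stride in layers:
        rf += (kernel - 1) * jump  # each extra kernel tap widens the field by one jump
        jump *= stride             # striding spreads neighbouring outputs further apart
        out.append((name, rf, jump))
    return out


def pick_branch(object_size: int, fields: List[Tuple[str, int, int]]) -> str:
    """Illustrative heuristic: choose the conv layer whose receptive field
    is closest to the object size in pixels (not the paper's rule)."""
    convs = [f for f in fields if f[0].startswith("conv")]
    return min(convs, key=lambda f: abs(f[1] - object_size))[0]


if __name__ == "__main__":
    fields = receptive_fields(LAYERS)
    for name, rf, jump in fields:
        print(f"{name}: receptive field {rf}px, stride {jump}")
    print(pick_branch(20, fields))
```

Under these assumed layers, shallow feature maps see only a few pixels while deeper ones cover a much larger window, which is why reading proposals from several depths can help with small and large objects at once.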
Pages: 21