A Unified Deep Learning Framework of Multi-scale Detectors for Geo-spatial Object Detection in High-Resolution Satellite Images

被引:47
作者
Khan, Sultan Daud [1 ]
Alarabi, Louai [2 ]
Basalamah, Saleh [3 ]
机构
[1] Natl Univ Technol, Dept Comp Sci, Islamabad, Pakistan
[2] Umm Al Qura Univ, Dept Comp Sci, Mecca, Saudi Arabia
[3] Umm Al Qura Univ, Dept Comp Engn, Mecca, Saudi Arabia
关键词
Geo-spatial object detection; Region proposal networks; Multi-scale object proposals; REMOTE-SENSING IMAGES; NEAREST-NEIGHBOR; NEURAL-NETWORK; ORIENTED GRADIENTS; AIRPORT DETECTION; TARGET DETECTION; VISUAL SALIENCY; SHIP DETECTION; CLASSIFICATION; AERIAL;
D O I
10.1007/s13369-021-06288-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Geo-spatial object detection in high-resolution satellite images has many applications in urban planning, military applications, maritime surveillance, environment control and management. Despite the success of convolutional neural networks in object detection tasks in natural images, the current deep learning models face challenges in geo-spatial object detection in satellite images due to complex background, arbitrary views and large variations in object sizes. In this paper, we propose a framework that tackles these problems in efficient and effective way. The framework consists of two stages. The first stage generates multi-scale object proposals and the second stage classifies each proposal into different classes. The first stage utilizes feature pyramid network to obtain multi-scale feature maps and then convert each level of the pyramid into an independent multi-scale proposal generator by appending multiple region proposal networks (RPNs). We define scale range for each RPN to capture different scales of the target. The multi-scale object proposals are provided as input to the detection sub-network. We evaluate proposed framework on publicly available benchmark dataset, and from the experiment results, we demonstrate that proposed framework outperformed other reference methods
引用
收藏
页码:9489 / 9504
页数:16
相关论文
共 100 条
[1]  
Ahonen T, 2004, LECT NOTES COMPUT SC, V3021, P469
[2]   Automatic Mapping of Linear Woody Vegetation Features in Agricultural Landscapes Using Very High Resolution Imagery [J].
Aksoy, Selim ;
Akcay, H. Goekhan ;
Wassenaar, Tom .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2010, 48 (01) :511-522
[3]  
[Anonymous], 2006, ADV NEURAL INFORM PR
[4]  
[Anonymous], 2017, Obstacle avoidance through deep networks based intermediate perception
[5]  
[Anonymous], 2013, ARXIV PREPRINT ARXIV
[6]   Multiscale Combinatorial Grouping [J].
Arbelaez, Pablo ;
Pont-Tuset, Jordi ;
Barron, Jonathan T. ;
Marques, Ferran ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :328-335
[7]  
Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection
[8]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[9]   Geospatial Object Detection in Remote Sensing Imagery Based on Multiscale Single-Shot Detector with Activated Semantics [J].
Chen, Shiqi ;
Zhan, Ronghui ;
Zhang, Jun .
REMOTE SENSING, 2018, 10 (06)
[10]  
Chen X., 2021, IEEE SENS J, V6, P66