Context-Aware Convolutional Neural Network for Object Detection in VHR Remote Sensing Imagery

被引:75
作者
Gong, Yiping [1 ]
Xiao, Zhifeng [1 ]
Tan, Xiaowei [1 ]
Sui, Haigang [1 ]
Xu, Chuan [1 ]
Duan, Haiwang [2 ]
Li, Deren [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] SZ DJI Technol Co Ltd, Shenzhen 518048, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2020年 / 58卷 / 01期
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Proposals; Microsoft Windows; Semantics; Context modeling; Convolutional codes; Contextual information mining; convolutional neural network (CNN); object detection; FEATURES;
D O I
10.1109/TGRS.2019.2930246
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object detection in very-high-resolution (VHR) remote sensing imagery remains a challenge. Environmental factors, such as illumination intensity and weather, reduce image quality, resulting in poor feature representation and limited detection accuracy. To enrich the feature representation and mine the underlying context information among objects, this article proposes a context-aware convolutional neural network (CA-CNN) model for object detection that includes proposal generation, context feature extraction, feature fusion, and classification. During feature extraction, we propose integrating a context-regions-of-interests (Context-RoIs) mining layer into the CNN model and extracting context features by mapping Context-RoIs mined from the foreground proposals to multilevel feature maps. Finally, the context features extracted from multilevel layers are fused into a single layer, and the proposals represented by the fused features are classified by a softmax classifier. In this article, through numerous experiments, we thoroughly explore the influence of key factors, such as Context-RoIs, different feature scales, and different spatial context window sizes. Because of the end-to-end network design approach, our proposed model simultaneously maintains high efficiency and effectiveness. We conducted all model testing on the public NWPU VHR-10 data set. The experimental results demonstrate that our proposed CA-CNN model achieves significantly improved model performance and better detection results compared with the state-of-the-art methods.
引用
收藏
页码:34 / 44
页数:11
相关论文
共 45 条
[1]   Texture-Based Airport Runway Detection [J].
Aytekin, O. ;
Zongur, U. ;
Halici, U. .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2013, 10 (03) :471-475
[2]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[3]   SCENE PERCEPTION - DETECTING AND JUDGING OBJECTS UNDERGOING RELATIONAL VIOLATIONS [J].
BIEDERMAN, I ;
MEZZANOTTE, RJ ;
RABINOWITZ, JC .
COGNITIVE PSYCHOLOGY, 1982, 14 (02) :143-177
[4]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[5]   Land-use scene classification using multi-scale completed local binary patterns [J].
Chen, Chen ;
Zhang, Baochang ;
Su, Hongjun ;
Li, Wei ;
Wang, Lu .
SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (04) :745-752
[6]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[7]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[8]   Object detection in remote sensing imagery using a discriminatively trained mixture model [J].
Cheng, Gong ;
Han, Junwei ;
Guo, Lei ;
Qian, Xiaoliang ;
Zhou, Peicheng ;
Yao, Xiwen ;
Hu, Xintao .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 85 :32-43
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848