Spatial hierarchy perception and hard samples metric learning for high-resolution remote sensing image object detection

Cited by: 0
Authors
Dongjun Zhu
Shixiong Xia
Jiaqi Zhao
Yong Zhou
Qiang Niu
Rui Yao
Ying Chen
Affiliations
[1] China University of Mining and Technology, School of Computer Science and Technology
[2] Ministry of Education of the People's Republic of China, Engineering Research Center of Mine Digitization
Source
Applied Intelligence | 2022 / Vol. 52
Keywords
Remote sensing; Object detection; Spatial hierarchy perception; Metric learning
DOI
Not available
Abstract
Because of varying shooting angles, altitudes, and scenes, remote sensing images contain complex backgrounds and multi-scale objects. Moreover, objects in remote sensing images are much smaller relative to the background and are easily occluded by buildings and trees. These factors make feature extraction difficult and increase the intra-class diversity of objects, making object detection in remote sensing images more challenging. In this paper, we propose a novel remote sensing image object detection method (SHDet) based on a spatial hierarchy perception component (SHPC) and hard samples metric learning (HSML). We design the SHPC to extract features at different spatial hierarchies and to learn contribution weights across feature channels, enhancing the feature representation. HSML narrows the feature differences among hard samples of the same category, reducing detection errors caused by intra-class diversity. In addition, we decouple the complex background to build pre-training datasets for the object detection model, strengthening object feature learning. Experiments on two widely used remote sensing datasets (NWPU VHR-10 and DOTA-v1.5) show that the proposed method achieves better detection performance than several state-of-the-art object detection methods.
Pages: 3193-3208
Page count: 15