Remote sensing image target detection combining multi-scale and attention mechanism

被引:2
|
作者
Zhang Y.-Z. [1 ,2 ]
Guo W. [1 ]
Cai Z.-Q. [3 ]
Li W.-B. [1 ]
机构
[1] School of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang
[2] Hebei Key Laboratory of Electromagnetic Environmental Effects and Information Processing, Shijiazhuang Tiedao University, Shijiazhuang
[3] Shanwei Institute of Technology, Shanwei
关键词
attention module; feature fusion; multi-scale feature; non-maximum suppression; remote sensing image; target detection; YOLOv5s algorithm;
D O I
10.3785/j.issn.1008-973X.2022.11.012
中图分类号
学科分类号
摘要
Remote sensing images have deficiencies such as complex backgrounds, significant differences in target scales, and dense distribution, resulting in poor detection of existing algorithms. A remote sensing image object detection algorithm that combined multi-scale and attention mechanisms was proposed. The receptive field of images of different sizes improved the atrous spatial pyramid pooling module. An attention module was proposed to improve the feature extraction ability for target regions of remote sensing images under complex backgrounds by learning the feature map channel information and the spatial location information. A weighted bidirectional feature pyramid network structure was introduced to combine with the backbone network to improve the fusion of multi-level features. A distance-based non-maximum suppression method was used for postprocessing, which improved the problem of easy overlapping of detection frames. Experimental results on DIOR and NWPU VHR-10 datasets showed that the mean average precision (mAP) of the proposed algorithm reached 71.6% and 91.6%, which were 2.9% and 1.5% higher than those of the mainstream YOLOv5s algorithm respectively. The algorithm achieved good detection results for complex remote sensing images. © 2022 Zhejiang University. All rights reserved.
引用
收藏
页码:2215 / 2223
页数:8
相关论文
共 26 条
  • [1] JIANG Xin, CHEN Wu-xiong, NIE Hai-tao, Et al., Real-time ships target detection based on aerial remote sensing images [J], Optics and Precision Engineering, 28, 10, pp. 2360-2369, (2020)
  • [2] NIE Guang-tao, HUANG Hua, A survey of object detection in optical remote sensing images [J], Acta Automatica Sinica, 47, 8, pp. 1749-1768, (2021)
  • [3] WANG Chang, ZHANG Yong-sheng, WANG Xu, Et al., Remote sensing image change detection method based on deep neural networks [J], Journal of Zhejiang University: Engineering Science, 54, 11, pp. 2138-2148, (2020)
  • [4] YANG X, SUN H, SUN X, Et al., Position detection and direction prediction for arbitrary-oriented ships via multitask rotation region convolutional neural network [J], IEEE Access, 6, pp. 50839-50849, (2018)
  • [5] FENG J, LIANG Y P, YE Z W, Et al., Small object detection in optical remote sensing video with motion guided R-CNN [C], IEEE International Geoscience and Remote Sensing Symposium, pp. 272-275, (2020)
  • [6] GUAN H Y, YU Y T, LI D L, Et al., Road Caps FPN: capsule feature pyramid network for road extraction from VHR optical remote sensing imagery [J], IEEE Transactions on Intelligent Transportation Systems, pp. 1-11, (2021)
  • [7] COURTRAI L, PHAM M T, LEFEVRE S., Small object detection in remote sensing images based on super-resolution with auxiliary generative adversarial networks, Remote Sensing, 12, 19, (2020)
  • [8] ZHANG X D, ZHU K, CHEN G Z, Et al., Geospatial object detection on high resolution remote sensing imagery based on double multi-scale feature pyramid network, Remote Sensing, 11, 7, (2019)
  • [9] LI L L, CHENG L, GUO X H, Et al., Deep adaptive proposal network in optical remote sensing images objective detection [C], IEEE International Geoscience and Remote Sensing Symposium, pp. 2651-2654, (2020)
  • [10] CHEN C Y, GONG W G, CHEN Y L, Et al., Object detection in remote sensing images based on a scene-contextual feature pyramid network, Remote Sensing, 11, 3, (2019)