Object Detection in Remote Sensing Images Based on Adaptive Multi-Scale Feature Fusion Method

Cited by: 15
Authors
Liu, Chun [1]
Zhang, Sixuan [1]
Hu, Mengjie [1]
Song, Qing [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Pattern Recognit & Intelligent Vis PRIV, Beijing 100876, Peoples R China
Keywords
feature fusion; remote sensing; object detection; attention mechanism
DOI
10.3390/rs16050907
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Multi-scale object detection is critical for analyzing remote sensing images. Traditional feature pyramid networks, which aim to accommodate objects of varying sizes through multi-level feature extraction, face significant challenges due to the diverse scale variations present in remote sensing images. This situation often forces single-level features to span a broad spectrum of object sizes, complicating accurate localization and classification. To tackle these challenges, this paper proposes an algorithm that incorporates an adaptive multi-scale feature enhancement and fusion module (ASEM), which improves remote sensing image object detection through multi-scale feature fusion. Our method begins by employing a feature pyramid to gather coarse multi-scale features. Subsequently, it integrates a fine-grained feature extraction module at each level, utilizing atrous convolutions with varied dilation rates to refine multi-scale features, which markedly improves the information captured from widely varied object scales. Furthermore, an adaptive enhancement module is applied to the features of each level, employing an attention mechanism for feature fusion. This strategy concentrates on the features at critical scales, which makes the capture of essential feature information more effective. Compared with the baseline method, namely, Rotated FasterRCNN, our method achieved an mAP of 74.21% (+0.81%) on the DOTA-v1.0 dataset and an mAP of 84.90% (+9.2%) on the HRSC2016 dataset. These results validate the effectiveness and practicality of our method and demonstrate its application value in multi-scale remote sensing object detection tasks.
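
The two ideas named in the abstract, atrous convolutions with several dilation rates followed by attention-weighted fusion, can be illustrated with a minimal NumPy sketch. This is not the authors' ASEM implementation: the function names (`dilated_conv2d`, `adaptive_fuse`), the single-channel input, the dilation rates (1, 2, 3), and the use of globally average-pooled responses as attention scores are all assumptions made for illustration. A real module would presumably use learned multi-channel layers.

```python
import numpy as np

def dilated_conv2d(x, kernel, dilation):
    """'Same'-padded 2D atrous convolution of a single-channel map.

    Hypothetical helper: odd square kernel, zero padding, stride 1.
    """
    k = kernel.shape[0]
    pad = dilation * (k // 2)
    xp = np.pad(x, pad)  # zero-pad every side by `pad`
    h, w = x.shape
    out = np.zeros_like(x, dtype=float)
    # Each kernel tap reads the input at an offset scaled by the dilation rate.
    for i in range(k):
        for j in range(k):
            out += kernel[i, j] * xp[i * dilation:i * dilation + h,
                                     j * dilation:j * dilation + w]
    return out

def softmax(z):
    e = np.exp(z - z.max())  # subtract the max for numerical stability
    return e / e.sum()

def adaptive_fuse(x, kernels, dilations):
    """Run one atrous branch per dilation rate, then fuse with softmax weights.

    Assumption: the attention score of a branch is its global average
    response; this stands in for a learned attention layer.
    """
    branches = np.stack([dilated_conv2d(x, k, d)
                         for k, d in zip(kernels, dilations)])
    scores = branches.mean(axis=(1, 2))      # one scalar per branch
    weights = softmax(scores)                # weights sum to 1
    fused = np.tensordot(weights, branches, axes=1)  # weighted sum over branches
    return fused, weights

# Usage on a random 16x16 feature map with three 3x3 branches.
rng = np.random.default_rng(0)
feature_map = rng.standard_normal((16, 16))
kernels = [rng.standard_normal((3, 3)) for _ in range(3)]
fused, weights = adaptive_fuse(feature_map, kernels, dilations=(1, 2, 3))
```

With a 3x3 kernel, dilation rates 1, 2, and 3 give receptive fields of 3x3, 5x5, and 7x7 at the same parameter count, which is why atrous branches are a common way to cover several object scales within one pyramid level.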
Pages: 15
Related papers
39 in total
  • [1] Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery
    Azimi, Seyed Majid
    Vig, Eleonora
    Bahmanyar, Reza
    Koerner, Marco
    Reinartz, Peter
    [J]. COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 150 - 165
  • [2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]
  • [3] Info-FPN: An Informative Feature Pyramid Network for object detection in remote sensing images
    Chen, Silin
    Zhao, Jiaqi
    Zhou, Yong
    Wang, Hanzheng
    Yao, Rui
    Zhang, Lixu
    Xue, Yong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [4] Multi-scale object detection in remote sensing imagery with convolutional neural networks
    Deng, Zhipeng
    Sun, Hao
    Zhou, Shilin
    Zhao, Juanping
    Lei, Lin
    Zou, Huanxin
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 3 - 22
  • [5] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [6] Global to Local: A Scale-Aware Network for Remote Sensing Object Detection
    Gao, Tao
    Niu, Qianqian
    Zhang, Jing
    Chen, Ting
    Mei, Shaohui
    Jubair, Ahmad
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [7] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
    Ghiasi, Golnaz
    Lin, Tsung-Yi
    Le, Quoc V.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7029 - 7038
  • [8] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [9] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [10] Jaderberg M, 2015, ADV NEUR IN, V28