A multi-scale pyramid feature fusion-based object detection method for remote sensing images

被引:1
作者
Huangfu, Panpan [1 ]
Dang, Lanxue [2 ]
机构
[1] Shangqiu Inst Technol, Sch Informat & Elect Engn, Shangqiu, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Henan Key Lab Big Data Anal & Proc, Kaifeng, Peoples R China
关键词
Object detection; remote sensing image; feature fusion; neural network; YOLO; SHIP DETECTION;
D O I
10.1080/01431161.2023.2288947
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Object detection is a basic and challenging task in remote sensing image analysis that has received extensive attention in recent years. Feature fusion is one of the key steps in object detection. Most existing methods of feature fusion first complete the preliminary fusion of feature maps of different scales through 'add' or 'concat' operations, followed by using a single-scale convolution to further improve the fusion effect. However, due to the fact that multi-level features exhibit multi-scale representations, the fusion effect of existing methods is limited. To improve the efficiency of feature fusion, we propose a multi-scale pyramid feature fusion network, which performs multi-scale learning through multi-scale convolution kernels to complete multi-level feature fusion more effectively. Then we propose a lightweight decoupled head, which alleviates the conflict between the classification task and the localization task. We conducted experiments on the dataset of object detection in aerial images (DOTA) dataset and the HRSC2016 dataset to verify our proposed methods. The results show that the performance of our proposed methods is better than other existing methods, with an mAP of 73.3%, 67.6%, 65.0%, and 96.7% on the DOTA1.0, DOTA1.5, DOTA2.0, and HRSC2016 datasets, respectively. Meanwhile, the parameter quantity of the proposed model is 10.3 M, and the inference time is 5.1 ms, which meets the requirement of lightweight and ensures the timeliness of detection.
引用
收藏
页码:7790 / 7807
页数:18
相关论文
共 42 条
  • [1] A Path Aggregation Network Based on Residual Feature Enhancement for Object Detection in Remote Sensing Imagery
    Dang, Lanxue
    Huangfu, Panpan
    Hou, Yan-e
    Liu, Yang
    Han, Hongyu
    [J]. REMOTE SENSING LETTERS, 2023, 14 (06) : 598 - 608
  • [2] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [3] TOOD: Task-aligned One-stage Object Detection
    Feng, Chengjian
    Zhong, Yujie
    Gao, Yu
    Scott, Matthew R.
    Huang, Weilin
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3490 - 3499
  • [4] Hyperspectral and Multispectral Classification for Coastal Wetland Using Depthwise Feature Interaction Network
    Gao, Yunhao
    Li, Wei
    Zhang, Mengmeng
    Wang, Jianbu
    Sun, Weiwei
    Tao, Ran
    Du, Qian
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]
  • [6] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [7] Effective Fusion Factor in FPN for Tiny Object Detection
    Gong, Yuqi
    Yu, Xuehui
    Ding, Yao
    Peng, Xiaoke
    Zhao, Jian
    Han, Zhenjun
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1159 - 1167
  • [8] Revisiting the Sibling Head in Object Detector
    Song, Guanglu
    Liu, Yu
    Wang, Xiaogang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11560 - 11569
  • [9] SAFF-SSD: Self-Attention Combined Feature Fusion-Based SSD for Small Object Detection in Remote Sensing
    Huo, Bihan
    Li, Chenglong
    Zhang, Jianwei
    Xue, Yingjian
    Lin, Zhoujin
    [J]. REMOTE SENSING, 2023, 15 (12)
  • [10] You Should Look at All Objects
    Jin, Zhenchao
    Yu, Dongdong
    Song, Luchuan
    Yuan, Zehuan
    Yu, Lequan
    [J]. COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 332 - 349