A multi-scale pyramid feature fusion-based object detection method for remote sensing images

Cited by: 1
Authors
Huangfu, Panpan [1]
Dang, Lanxue [2]
Affiliations
[1] Shangqiu Inst Technol, Sch Informat & Elect Engn, Shangqiu, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Henan Key Lab Big Data Anal & Proc, Kaifeng, Peoples R China
Keywords
Object detection; remote sensing image; feature fusion; neural network; YOLO; SHIP DETECTION;
DOI
10.1080/01431161.2023.2288947
Chinese Library Classification
TP7 [Remote sensing technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Object detection is a fundamental and challenging task in remote sensing image analysis and has received extensive attention in recent years. Feature fusion is one of its key steps. Most existing feature fusion methods first perform a preliminary fusion of feature maps at different scales through 'add' or 'concat' operations, and then apply a single-scale convolution to refine the result. However, because multi-level features exhibit multi-scale representations, the fusion achieved by these methods is limited. To improve the efficiency of feature fusion, we propose a multi-scale pyramid feature fusion network that performs multi-scale learning with multi-scale convolution kernels and thereby fuses multi-level features more effectively. We also propose a lightweight decoupled head that alleviates the conflict between the classification and localization tasks. We conducted experiments on the Dataset for Object Detection in Aerial Images (DOTA) and the HRSC2016 dataset to verify the proposed methods. The results show that the proposed methods outperform existing methods, achieving an mAP of 73.3%, 67.6%, 65.0%, and 96.7% on the DOTA1.0, DOTA1.5, DOTA2.0, and HRSC2016 datasets, respectively. The proposed model has 10.3 M parameters and an inference time of 5.1 ms, which satisfies the lightweight requirement and supports timely detection.
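The contrast the abstract draws, a preliminary 'add' fusion refined by one kernel size versus by several kernel sizes in parallel, can be illustrated with a toy one-dimensional sketch. This is not the authors' network: the function names, the box-filter weights, the kernel sizes (1, 3, 5), and the averaging of branches are all assumptions chosen only to show the idea on plain Python lists.

```python
# Toy 1-D illustration of 'add' fusion followed by multi-scale refinement.
# All names, kernel sizes, and weights are hypothetical, not from the paper.

def upsample_nearest(x, factor):
    """Repeat each element `factor` times (nearest-neighbour upsampling)."""
    return [v for v in x for _ in range(factor)]

def conv1d_same(x, kernel):
    """1-D cross-correlation with zero padding; output length equals input length.
    Assumes an odd-length kernel."""
    k = len(kernel)
    pad = k // 2
    padded = [0.0] * pad + list(x) + [0.0] * pad
    return [sum(padded[i + j] * kernel[j] for j in range(k))
            for i in range(len(x))]

def fuse_add(fine, coarse):
    """Preliminary 'add' fusion: upsample the coarse map, then sum element-wise."""
    up = upsample_nearest(coarse, len(fine) // len(coarse))
    return [a + b for a, b in zip(fine, up)]

def multi_scale_refine(x, kernel_sizes=(1, 3, 5)):
    """Apply box filters of several sizes in parallel and average the branches.
    A learned network would use trained weights instead of box filters."""
    branches = [conv1d_same(x, [1.0 / k] * k) for k in kernel_sizes]
    return [sum(vals) / len(branches) for vals in zip(*branches)]

fine = [1.0, 2.0, 3.0, 4.0]    # higher-resolution feature map
coarse = [10.0, 20.0]          # lower-resolution feature map

fused = fuse_add(fine, coarse)                     # [11.0, 12.0, 23.0, 24.0]
single_scale = conv1d_same(fused, [1.0 / 3] * 3)   # one kernel size only
multi_scale = multi_scale_refine(fused)            # several kernel sizes combined
```

Here `single_scale` sees each position through one receptive field, while `multi_scale` mixes responses from three receptive fields, which is the intuition behind using multi-scale kernels on features that themselves span several scales.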
Pages: 7790-7807
Page count: 18