A multi-scale pyramid feature fusion-based object detection method for remote sensing images

被引：2

作者：

Huangfu, Panpan ^{[1
]}

Dang, Lanxue ^{[2
]}

机构：

[1] Shangqiu Inst Technol, Sch Informat & Elect Engn, Shangqiu, Peoples R China

[2] Henan Univ, Sch Comp & Informat Engn, Henan Key Lab Big Data Anal & Proc, Kaifeng, Peoples R China

来源：

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2023年 / 44卷 / 24期

关键词：

Object detection; remote sensing image; feature fusion; neural network; YOLO; SHIP DETECTION;

D O I：

10.1080/01431161.2023.2288947

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Object detection is a basic and challenging task in remote sensing image analysis that has received extensive attention in recent years. Feature fusion is one of the key steps in object detection. Most existing methods of feature fusion first complete the preliminary fusion of feature maps of different scales through 'add' or 'concat' operations, followed by using a single-scale convolution to further improve the fusion effect. However, due to the fact that multi-level features exhibit multi-scale representations, the fusion effect of existing methods is limited. To improve the efficiency of feature fusion, we propose a multi-scale pyramid feature fusion network, which performs multi-scale learning through multi-scale convolution kernels to complete multi-level feature fusion more effectively. Then we propose a lightweight decoupled head, which alleviates the conflict between the classification task and the localization task. We conducted experiments on the dataset of object detection in aerial images (DOTA) dataset and the HRSC2016 dataset to verify our proposed methods. The results show that the performance of our proposed methods is better than other existing methods, with an mAP of 73.3%, 67.6%, 65.0%, and 96.7% on the DOTA1.0, DOTA1.5, DOTA2.0, and HRSC2016 datasets, respectively. Meanwhile, the parameter quantity of the proposed model is 10.3 M, and the inference time is 5.1 ms, which meets the requirement of lightweight and ensures the timeliness of detection.

引用

页码：7790 / 7807

页数：18

共 42 条

[1] A Path Aggregation Network Based on Residual Feature Enhancement for Object Detection in Remote Sensing Imagery [J].

Dang, Lanxue ;

Huangfu, Panpan ;

Hou, Yan-e ;

Liu, Yang ;

Han, Hongyu .

REMOTE SENSING LETTERS, 2023, 14 (06) :598-608

[2] Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].

Ding, Jian ;

Xue, Nan ;

Long, Yang ;

Xia, Gui-Song ;

Lu, Qikai .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853

[3] TOOD: Task-aligned One-stage Object Detection [J].

Feng, Chengjian ;

Zhong, Yujie ;

Gao, Yu ;

Scott, Matthew R. ;

Huang, Weilin .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3490-3499

[4] Hyperspectral and Multispectral Classification for Coastal Wetland Using Depthwise Feature Interaction Network [J].

Gao, Yunhao ;

Li, Wei ;

Zhang, Mengmeng ;

Wang, Jianbu ;

Sun, Weiwei ;

Tao, Ran ;

Du, Qian .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[5]

Ge Z, 2021, Arxiv, DOI arXiv:2107.08430

[6] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[7] Effective Fusion Factor in FPN for Tiny Object Detection [J].

Gong, Yuqi ;

Yu, Xuehui ;

Ding, Yao ;

Peng, Xiaoke ;

Zhao, Jian ;

Han, Zhenjun .

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1159-1167

[8] SAFF-SSD: Self-Attention Combined Feature Fusion-Based SSD for Small Object Detection in Remote Sensing [J].

Huo, Bihan ;

Li, Chenglong ;

Zhang, Jianwei ;

Xue, Yingjian ;

Lin, Zhoujin .

REMOTE SENSING, 2023, 15 (12)

[9] You Should Look at All Objects [J].

Jin, Zhenchao ;

Yu, Dongdong ;

Song, Luchuan ;

Yuan, Zehuan ;

Yu, Lequan .

COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 :332-349

[10]

Li CY, 2022, Arxiv, DOI [arXiv:2209.02976, 10.48550/arXiv.2209.02976]

← 1 2 3 4 5 →