A Deep CNN-Based Detection Method for Multi-Scale Fine-Grained Objects in Remote Sensing Images

被引:6
|
作者
Xie, Qiyang [1 ,2 ]
Zhou, Daiying [2 ]
Tang, Rui [1 ]
Feng, Hao [2 ]
机构
[1] China West Normal Univ, Sch Elect Informat Engn, Nanchong 637002, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep CNN; multi-scale fine-grained object detection; multi-scale region generation network; fully convolutional region classification network;
D O I
10.1109/ACCESS.2024.3356716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The detection of fine-grained objects in remote sensing images has been recognized as a challenging issue, which cannot be well addressed by the existing deep learning-based methods due to their inadaptability to multi-scale objects, slow convergence speed, and limitations to scarce datasets. To cope with the above issue, we propose a deep convolutional neural network (CNN)-based detection method for multi-scale fine-grained objects in complex remote sensing scenarios. Specifically, we adopt a deep CNN with a residual structure as the backbone network, to extract deep-level details from the image. Besides, we introduce a multi-scale region generation network to overcome the limitations of fixed receptive field convolution kernels and enable multi-scale object detection. Lastly, we replace fully connected layers in the fully convolutional region classification network with 1 x 1 convolutional layer to enhance detection efficiency and detection speed. To overcome the limitation of scarce datasets, we conducted experiments on the FAIR1M dataset, which is currently the largest fine-grained object detection dataset in the remote sensing field. Simulation results show that the proposed detection method achieves the highest average precision (35.86%) among all benchmarks and outperforms the classic Faster R-CNN-based method by 3.44%. Furthermore, our method demonstrates significantly improved detection speed compared to the Faster R-CNN-based methods.
引用
收藏
页码:15622 / 15630
页数:9
相关论文
共 50 条
  • [31] Based on the multi-scale information sharing network of fine-grained attention for agricultural pest detection
    Wang Linfeng
    Liu Yong
    Liu Jiayao
    Wang Yunsheng
    Xu Shipu
    PLOS ONE, 2023, 18 (10):
  • [32] Hybrid-Gird:An explainable method for fine-grained classification of remote sensing images
    Zhu, Kaiwen
    You, Yanan
    Cao, Jingyi
    Meng, Gang
    Qiao, Yuanyuan
    Yang, Jie
    National Remote Sensing Bulletin, 2024, 28 (07) : 1722 - 1734
  • [33] A multi-scale pyramid feature fusion-based object detection method for remote sensing images
    Huangfu, Panpan
    Dang, Lanxue
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (24) : 7790 - 7807
  • [34] An Object Fine-Grained Change Detection Method Based on Frequency Decoupling Interaction for High-Resolution Remote Sensing Images
    Tang, Yingjie
    Feng, Shou
    Zhao, Chunhui
    Fan, Yuanze
    Shi, Qian
    Li, Wei
    Tao, Ran
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [35] Multi-Scale Salient Features Bilinear Attention Fine-Grained Classification Method
    Liu G.
    Zhan H.
    Meng Y.
    Wang B.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (11): : 1683 - 1691
  • [36] Novel CNN-Based Approach for Burn Severity Assessment and Fine-Grained Boundary Segmentation in Burn Images
    Abdolahnejad, Mahla
    Lee, Justin
    Chan, Hannah
    Morzycki, Alex
    Ethier, Olivier
    Mo, Anthea
    Liu, Peter X.
    Wong, Joshua N.
    Hong, Colin
    Joshi, Rakesh
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [37] Progressive Learning Vision Transformer for Open Set Recognition of Fine-Grained Objects in Remote Sensing Images
    Fu, Yimin
    Liu, Zhunga
    Zhang, Zuowei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [38] A Novel Pyramidal CNN Deep Structure for Multiple Objects Detection in Remote Sensing Images
    Khaled Mohammed Elgamily
    M. A. Mohamed
    Ahmed Mohamed Abou-Taleb
    Mohamed Maher Ata
    Journal of the Indian Society of Remote Sensing, 2024, 52 (1) : 41 - 61
  • [39] A Novel Pyramidal CNN Deep Structure for Multiple Objects Detection in Remote Sensing Images
    Elgamily, Khaled Mohammed
    Mohamed, M. A.
    Abou-Taleb, Ahmed Mohamed
    Ata, Mohamed Maher
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2023, : 41 - 61
  • [40] Multi-Attention Object Detection Model in Remote Sensing Images Based on Multi-Scale
    Ying, Xiang
    Wang, Qiang
    Li, Xuewei
    Yu, Mei
    Jiang, Han
    Gao, Jie
    Liu, Zhiqiang
    Yu, Ruiguo
    IEEE ACCESS, 2019, 7 : 94508 - 94519