Multiple adaptive fusion network with Mittag Leffler IoU loss for aircraft detection in remote sensing images

被引:0
作者
Wang, Fengxian [1 ]
Li, Dailin [1 ]
Zhang, Jie [1 ]
Wang, Xiabing [1 ]
Li, Linwei [1 ]
Shi, Xiaoping [2 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Elect & Informat Engn, 5 Dongfeng Rd, Zhengzhou 450002, Henan, Peoples R China
[2] Harbin Inst Technol, Control & Simulat Ctr, 2 Yikuang St, Harbin 150008, Heilongjiang, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2025年 / 28卷 / 02期
基金
美国国家科学基金会;
关键词
Aircraft detection; Remote sensing; Low sample quality; Adaptively selecting; Automatically learning; OBJECT DETECTION;
D O I
10.1007/s10586-024-04823-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aircraft detection in remote sensing is essential for airport traffic control and other applications. However, due to the influence of small objects, strong clutter, and partial occlusion, the accuracy of the aircraft detection model is severely compromised. This paper proposes a method called Multiple Adaptive Fusion Network with Mittag Leffler IoU Loss (MAML) for aircraft detection. Specifically, we first design a Mittag Leffler IoU Loss (LOSSMLIoU\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$LOSS _{MLIoU}$$\end{document}) to resolve the effects of low sample quality, distance, and aspect ratio on the model through the penalty term QMLIoU\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q_{MLIoU}$$\end{document}. Secondly, we propose Receive Field Adaptive Spatial Pyramid Pooling (RFA-SPP), which further improves the receive field of the network by adaptively selecting the size of the pooled kernels. Finally, we propose Adaptive Feature Efficient Aggregation (AFEA), which solves the problem of information redundancy and increased computational complexity caused by direct feature fusion by automatically learning the relationship between multi-scale features. Our model achieves 96.41% mAP on the RSOD dataset and 42.64 FPS on the RTX4000 GPU, outperforming the advanced model in speed and accuracy. In addition, the results of robustness experiments on the NWPU VHR-10 and TGRS-HRRSD datasets also show the excellent portability of our method, which provides strong technical support for airport traffic control and aviation safety monitoring.
引用
收藏
页数:19
相关论文
共 57 条
  • [1] AeroDetectNet: a lightweight, high-precision network for enhanced detection of small objects in aerial remote sensing imagery
    Bai, Ruihan
    Lu, Jiahui
    Zhang, Zhiping
    Wang, Mingkang
    Wang, Qiang
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [3] Feature-Fused SSD: Fast Detection for Small Objects
    Cao, Guimei
    Xie, Xuemei
    Yang, Wenzhe
    Liao, Quan
    Shi, Guangming
    Wu, Jinjian
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [4] MDCT: Multi-Kernel Dilated Convolution and Transformer for One-Stage Object Detection of Remote Sensing Images
    Chen, Juanjuan
    Hong, Hansheng
    Song, Bin
    Guo, Jie
    Chen, Chen
    Xu, Junjie
    [J]. REMOTE SENSING, 2023, 15 (02)
  • [5] Embedding Attention and Residual Network for Accurate Salient Object Detection
    Chen, Shuhan
    Wang, Ben
    Tan, Xiuli
    Hu, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 2050 - 2062
  • [6] Multi-class geospatial object detection and geographic image classification based on collection of part detectors
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Guo, Lei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 : 119 - 132
  • [7] Control of goal-directed and stimulus-driven attention in the brain
    Corbetta, M
    Shulman, GL
    [J]. NATURE REVIEWS NEUROSCIENCE, 2002, 3 (03) : 201 - 215
  • [8] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [9] Evaluation of Robust Spatial Pyramid Pooling Based on Convolutional Neural Network for Traffic Sign Recognition System
    Dewi, Christine
    Chen, Rung-Ching
    Tai, Shao-Kuo
    [J]. ELECTRONICS, 2020, 9 (06)
  • [10] Di Fan, 2020, Advances in 3D Image and Graphics Representation, Analysis, Computing and Information Technology. Algorithms and Applications. Proceedings of IC3DIT 2019. Smart Innovation, Systems and Technologies (SIST 180), P109, DOI 10.1007/978-981-15-3867-4_14