YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction

被引:1
|
作者
Junos, Mohamad Haniff [1 ]
Khairuddin, Anis Salwa Mohd [2 ]
机构
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Penang, Malaysia
[2] Univ Malaya, Fac Engn, Dept Elect Engn, Kuala Lumpur 50603, Malaysia
来源
关键词
Lightweight YOLO; MixMBConv; Aerial object detection; Deep learning; NETWORK;
D O I
10.1007/s00371-024-03689-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Object detection in aerial images has become an important research subject due to the widespread use of aerial platforms, including satellites and unmanned aerial vehicles. However, the task is challenging because it involves a complex background, a high number of small objects, and densely distributed objects, leading to poor detection accuracy. Moreover, despite their excellent detection accuracy, existing one-stage object detection methods have complex structures that require huge computational power, generate high parameters, and exhibit slow inference speed, which makes them unsuitable for edge device applications. To address these issues, this paper proposes an accurate and lightweight object detection model named the YOLO-MMS model. The developed model incorporates several improvements, notably the hybrid backbone structure, which integrates a novel Mix-Mobile inverted bottleneck module to optimize efficiency by reducing the number of generated parameters. Additionally, the multi-scale prediction employs small efficient layer aggregation network and spatial pyramid pooling modules to improve feature extraction across multiple scales. Finally, the model includes an additional detection head and utilizes the Swish activation function to enhance detection accuracy. The evaluation results on the VisDrone and VEDAI datasets demonstrate that the proposed YOLO-MMS model achieved superior accuracy compared to other lightweight YOLO-based models. Furthermore, the proposed model showed significant improvements in model size with a reduction of 41.77% compared to its original YOLOv4-tiny model. These findings indicate that the proposed model presents optimal trade-offs in terms of accuracy and efficiency, rendering it exceptionally suitable for real-time applications on embedded systems. Our code is available at: https://github.com/hanifjunos/YOLO-MMS.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Salient Object Detection Based on Multi-scale Feature Extraction and Multi-level Feature Fusion
    Li, Lingli
    Meng, Lingbing
    Li, Jinbao
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2021, 53 (01): : 170 - 177
  • [42] Small object detection in unmanned aerial vehicle images using multi-scale hybrid attention
    Song, Gang
    Du, Hongwei
    Zhang, Xinyue
    Bao, Fangxun
    Zhang, Yunfeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
  • [43] SED-YOLO based multi-scale attention for small object detection in remote sensing
    Wei, Xiaotan
    Li, Zhensong
    Wang, Yutong
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [44] Remote Sensing Rotating Object Detection Based on Multi-Scale Feature Extraction
    Wu, Luobing
    Gu, Yuhai
    Wu, Wenhao
    Fan, Shuaixin
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (12)
  • [45] Small Object Detection Based on Bidirectional Feature Fusion and Multi-scale Distillation
    Wang, Lingyu
    Zhou, Zijie
    Shi, Guanqun
    Guo, Junkang
    Liu, Zhigang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 200 - 214
  • [46] MsfNet: a novel small object detection based on multi-scale feature fusion
    Song, Ziying
    Wu, Peiliang
    Yang, Kuihe
    Zhang, Yu
    Liu, Yi
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 700 - 704
  • [47] Substation rotational object detection based on multi-scale feature fusion and refinement
    Li, Bin
    Li, Yalin
    Zhu, Xinshan
    Qu, Luyao
    Wang, Shuai
    Tian, Yangyang
    Xu, Dan
    ENERGY AND AI, 2023, 14
  • [48] Multi-scale object detection in UAV images based on adaptive feature fusion
    Tan, Siqi
    Duan, Zhijian
    Pu, Longzhong
    PLOS ONE, 2024, 19 (03):
  • [49] Electrode defect YOLO detection algorithm based on attention mechanism and multi-scale feature fusion
    Li Y.-W.
    Sun H.-R.
    Hu Y.-M.
    Han Y.-J.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (09): : 2578 - 2586
  • [50] YOLO-GEA: infrared target detection algorithm based on multi-scale feature fusion
    Da, Mei
    Tao, Youfeng
    Jiang, Lin
    Hu, Jue
    Zhang, Zhijian
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (04)