MBAB-YOLO: A Modified Lightweight Architecture for Real-Time Small Target Detection

被引:5
作者
Zhang, Jun [1 ]
Meng, Yizhen [1 ]
Yu, Xiaohui [1 ]
Bi, Hongjing [1 ]
Chen, Zhipeng [1 ]
Li, Huafeng [1 ]
Yang, Runtao [1 ]
Tian, Jingjun [1 ]
机构
[1] Tangshan Normal Univ, Comp Sci Dept, Tangshan 063000, Peoples R China
关键词
Deep learning; target detection; channel-wise attention; space-attention; YOLO; OBJECT DETECTION; FUSION;
D O I
10.1109/ACCESS.2023.3286031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current target detection methods have achieved high accuracy for detecting large and medium-sized targets. However, due to factors such as the small number of pixels and features available for targets in images, the detection performance for small targets is generally unsatisfactory. In addition, the real-time performance of target detection is also critical. In conclusion, a modified lightweight architecture for real-time small target detection, i.e., MBAB-YOLO, is proposed based on You Only Look Once (YOLO) model by combining channel-wise attention block, space-attention block and multi-branch-ConvNet (Convolutional Neural network) structure. Specifically, our method is more suitable for the rich scale information of small targets through proposed adaptive multi-receptive-field focusing, and then combines proposed blended attention block (BAB) to re-calibrate small target information to make it more prominent and improve the discriminability of small target features. Finally, extensive experiments have been conducted on the open source data set for the proposed real-time small target detection method, i.e., MBAB-YOLO. The results of ablation experiment and contrast experiment show that our method has excellent performance, not only with high detection accuracy, but also with fast detection speed. Compared with the various benchmark methods, it achieves a good trade-off between the two aspects mentioned above. In addition, this paper gives a comprehensive and detailed review of the current work about small target detection from different several perspectives, which can be used as a reference for future researchers.
引用
收藏
页码:78384 / 78401
页数:18
相关论文
共 72 条
  • [1] Ali S., 2021, 29 IEEE C SIGN PROC, P1, DOI DOI 10.1109/SIU53274.2021.9478027
  • [2] Multibranch Selective Kernel Networks for Hyperspectral Image Classification
    Alipour-Fard, T.
    Paoletti, M. E.
    Haut, Juan M.
    Arefi, H.
    Plaza, J.
    Plaza, A.
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (06) : 1089 - 1093
  • [3] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
    Bell, Sean
    Zitnick, C. Lawrence
    Bala, Kavita
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
  • [4] MnasFPN : Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices
    Chen, Bo
    Ghiasi, Golnaz
    Liu, Hanxiao
    Lin, Tsung-Yi
    Kalenichenko, Dmitry
    Adam, Hartwig
    Le, Quoc, V
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 13604 - 13613
  • [5] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [6] Micro-expression recognition using 3D DenseNet fused Squeeze-and-Excitation Networks
    Cai, Linqin
    Li, Hao
    Dong, Wei
    Fang, Haodu
    [J]. APPLIED SOFT COMPUTING, 2022, 119
  • [7] NCMS: Towards accurate anchor free object detection through l2 norm calibration and multi-feature selection
    Chen, Fangyi
    Zhu, Chenchen
    Shen, Zhiqiang
    Zhang, Han
    Savvides, Marios
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 200
  • [8] Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery
    Chen, Luchang
    Liu, Chunsheng
    Chang, Faliang
    Li, Shuang
    Nie, Zhaoying
    [J]. NEUROCOMPUTING, 2021, 451 : 67 - 80
  • [9] LIGHT-WEIGHT MIXED STAGE PARTIAL NETWORK FOR SURVEILLANCE OBJECT DETECTION WITH BACKGROUND DATA AUGMENTATION
    Chen, Ping-Yang
    Hsieh, Jun-Wei
    Gochoo, Munkhjargal
    Chen, Yong-Sheng
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3333 - 3337
  • [10] Chen S., 2021, J. Phys., Conf. Ser.