Starting from the structure: A review of small object detection based on deep learning

被引:5
作者
Zheng, Xiuling [1 ]
Wang, Huijuan [1 ]
Shang, Yu [1 ]
Chen, Gang [1 ]
Zou, Suhua [1 ]
Yuan, Quanbo [1 ,2 ]
机构
[1] North China Inst Aerosp Engn, Sch Comp, Langfang 065000, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
关键词
Small object detection; Data augmentation; Feature extraction; Feature fusion; Unsupervised; Transfer learning; Anchor; -free; FEATURE PYRAMID NETWORK;
D O I
10.1016/j.imavis.2024.105054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection, as one of the most fundamental and essential tasks in the field of computer vision, has been the focus of unremitting efforts by researchers, who are committed to modifying the neural network structure in order to improve the accuracy of object detection and expedite task execution. As the application scope continues to expand, small object detection has gradually emerged as a crucial branch in the field of object detection. In this paper, the development history of object detection algorithms is introduced, the concept of small objects is introduced, and the current problems and challenges faced by small object detection are outlined. In this paper, the network structure is disassembled from a macroscopic point of view, and improved algorithms such as enhanced data augmentation, improved feature extraction, superior feature fusion, and refined loss functions are described in detail. Furthermore, the paper explores a series of emerging and improved algorithms for small object detection. It encompasses the introduction of advanced strategies such as unsupervised learning, end-to-end training, density cropping, transfer learning, and anchor-free approaches. The paper provides a comprehensive list of commonly used general-purpose datasets and domain-specific datasets for small object detection tasks, offering performance comparisons of the mentioned improved algorithms. In conclusion, the paper summarizes and provides an outlook on current small object detection algorithms, furnishing the reader with a thorough understanding of the field and insights into future directions.
引用
收藏
页数:20
相关论文
共 113 条
  • [1] SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL OBJECT DETECTION
    Akyon, Fatih Cagatay
    Altinuc, Sinan Onur
    Temizel, Alptekin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 966 - 970
  • [2] SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network
    Bai, Yancheng
    Zhang, Yongqiang
    Ding, Mingli
    Ghanem, Bernard
    [J]. COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 210 - 226
  • [3] A full data augmentation pipeline for small object detection based on generative adversarial networks
    Bosquet, Brais
    Cores, Daniel
    Seidenari, Lorenzo
    Brea, Victor M.
    Mucientes, Manuel
    Del Bimbo, Alberto
    [J]. PATTERN RECOGNITION, 2023, 133
  • [4] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [5] Cao Yefan, 2022, 2022 7th International Conference on Image, Vision and Computing (ICIVC), P100, DOI 10.1109/ICIVC55077.2022.9886277
  • [6] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [7] R-CNN for Small Object Detection
    Chen, Chenyi
    Liu, Ming-Yu
    Tuzel, Oncel
    Xiao, Jianxiong
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 214 - 230
  • [8] Object detection in remote sensing images based on deep transfer learning
    Chen, Jinyong
    Sun, Jianguo
    Li, Yuqian
    Hou, Changbo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (09) : 12093 - 12109
  • [9] Parallel Residual Bi-Fusion Feature Pyramid Network for Accurate Single-Shot Object Detection
    Chen, Ping-Yang
    Chang, Ming-Ching
    Hsieh, Jun-Wei
    Chen, Yong-Sheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9099 - 9111
  • [10] Classification methods of a small sample target object in the sky based on the higher layer visualizing feature and transfer learning deep networks
    Chen, Yu
    Meng, Hongbing
    Wen, Xinling
    Ma, Pengge
    Qin, Yuxin
    Ma, Zhengxiang
    Liu, Zhaoyu
    [J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,