Review of YOLO Methods for Universal Object Detection

被引:1
作者
Mi, Zeng [1 ]
Lian, Zhe [1 ]
机构
[1] School of Computer Science and Technology, Inner Mongolia Normal University, Hohhot
关键词
computer vision; deep learning; object detection; YOLO method;
D O I
10.3778/j.issn.1002-8331.2404-0130
中图分类号
学科分类号
摘要
As the first single-stage object detection algorithm in the era of deep learning, YOLO has sparked a wave of enthusiasm in the field of computer vision with its powerful and unique paradigm, and has become a milestone achievement in object detection algorithms. It is still a typical algorithm that achieves the best balance between speed and accuracy, and is widely used in industrial fields such as autonomous driving and intelligent vision systems. In the past eight years, driven by deep learning technology, YOLO methods have developed rapidly and have profound impact on the entire field of object detection. This paper conducts an in-depth investigation of the YOLO method related work from the perspective of technological evolution, comprehensively summarizing the innovation and contributions of each iteration from the initial YOLO v1 to the latest YOLO v9 and YOLO v10. Based on the significant technological improvements at different time points, the YOLO method is divided into four parts: early basic YOLO, standard version YOLO, standard improvement YOLO, and unique improvement YOLO. The unique perspectives of the improvement methods in each period are introduced in detail. In addition, the dataset and indicators for evaluating the YOLO method are summarized, and detailed experimental results of different versions of YOLO and different models of the same version of YOLO are collected. The development and changes of YOLO are summarized from both macro and micro levels. Through analysis, the differences and inherent connections in the development framework, backbone network architecture, and prior box usage among different versions of YOLO are revealed, emphasizing the importance of balancing speed and accuracy in YOLO. Finally, through systematic review, the future development trends of YOLO method is summarized. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:38 / 54
页数:16
相关论文
共 52 条
[1]  
PAPAGEORGIOU C P, OREN M, POGGIO T., A general framework for object detection, Proceedings of the IEEE Sixth International Conference on Computer Vision, pp. 555-562, (1998)
[2]  
VIOLA P, JONES M., Rapid object detection using a boosted cascade of simple features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), (2001)
[3]  
DALAL N, TRIGGS B., Histograms of oriented gradients for human detection, Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), pp. 886-893, (2005)
[4]  
KRIZHEVSKY A, SUTSKEVER I, HINTON G E., ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, (2012)
[5]  
GIRSHICK R, DONAHUE J, DARRELL T, Et al., Region-based convolutional networks for accurate object detection and segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 38, 1, pp. 142-158, (2015)
[6]  
GIRSHICK R., Fast R- CNN, Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
[7]  
REDMON J, DIVVALA S, GIRSHICK R, Et al., You only look once: unified, real-time object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
[8]  
REDMON J, FARHADI A., YOLO9000: better, faster, stronger, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263-7271, (2017)
[9]  
REDMON J, FARHADI A., YOLOv3: an incremental improvement, (2018)
[10]  
BOCHKOVSKIY A, WANG C Y, LIAO H Y M., YOLOv4: optimal speed and accuracy of object detection, (2020)