Research on small target pedestrian detection based on improved YOLO

被引:1
作者
Xu X. [1 ]
Wang K. [1 ]
Zhao Y. [2 ]
机构
[1] School of Mechanical and Energy Engineering, Zhejiang University of Science and Technology, Hangzhou, Zhejiang
[2] School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, Zhejiang
基金
中国国家自然科学基金;
关键词
Deep learning; Pedestrian detection; Small target; YOLO;
D O I
10.1504/IJWMC.2021.115659
中图分类号
学科分类号
摘要
Aiming at the problems of low detection accuracy and speed for small target pedestrian in traffic scenes, the YOLO-SP based on YOLO-v4 is proposed. Firstly, the KITTI and INRIA data sets are used to make the new data set, and k-means algorithm is used to cluster the anchor box. Secondly, in the feature fusion phase (Neck), we increase the number of fused channels and simplify the number of output channels. Finally, to optimise the loss function, GIOU is used to calculate the coordinate loss, Focal is used to calculate the confidence loss. The test shows that all the improvement measures show better effect on small and overlapping pedestrians, the final detection accuracy (AP) is increased by 4.0%, the detection speed is accelerated by 11.3%. YOLO-SP has a certain practicality in the small target pedestrian detection. Copyright © 2021 Inderscience Enterprises Ltd.
引用
收藏
页码:281 / 289
页数:8
相关论文
共 14 条
[1]  
Bochkovskiy A., Wang C.Y., Liao H.Y.M., YOLOv4: optimal speed and accuracy of object detection, (2020)
[2]  
Ioffe S., Szegedy C., Batch normalization: accelerating deep network training by reducing internal covariate shift, Proceedings of the International Conference on Machine Learning, pp. 448-456, (2015)
[3]  
Ju M., Luo H., Et al., Improved YOLO V3 algorithm and its application in small target detection, Acta Optica Sinica, 39, 7, pp. 253-260, (2019)
[4]  
Lin T.Y., Goyal P., Girshick R., He K., Dollar P., Focal loss for dense object detection, Proceedings of the IEEE International Conference on Computer Vision, pp. 2980-2988, (2017)
[5]  
Liu W., Anguelov D., Erhan D., Szegedy C., Reed S., Fu C.Y., SSD: single shot multibox detector, Proceedings of the European Conference on Computer Vision, pp. 21-37, (2016)
[6]  
Loshchilov I., Hutter F., SGDR: stochastic gradient descent with warm restarts, (2016)
[7]  
Misra D., Mish: a self-regularized nonmonotonic neural activation function, (2019)
[8]  
Redmon J., Farhadi A., YOLO9000: better, faster, stronger, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, (2017)
[9]  
Redmon J., Farhadi A., Yolov3: an incremental improvement, (2018)
[10]  
Redmon J., Divvala S., Girshick R., Farhadi A., You only look once: unified, real-time object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2015)