Electric Tricycle Detection Based on Improved YOLOv5s Model

被引:0
作者
Ou, Xiaofang [1 ]
Han, Fengchun [1 ]
Tian, Jing [1 ]
Tang, Jijie [1 ]
Yang, Zhengtao [1 ]
机构
[1] Peoples Publ Secur Univ China, Sch Traff Management, Beijing 100038, Peoples R China
关键词
machine vision; object detection; YOLOv5; algorithm; Transformer structure; electric tricycle;
D O I
10.3788/LOP241065
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To address the problems related to target detection of electric tricycles in road traffic management in China and the shortcomings of current detection models in small target detection and real-time performance, this study proposes a detection method based on an improved YOLOv5s model. The original YOLOv5s model is first improved by adding a small object detection head and by introducing a Transformer structure that combines an efficient additive attention mechanism, and then a dataset based on urban road scenes is built. The model is improved in terms of accuracy, recall, and mean average precision (mAP@0. 5) by 0. 67%, 2. 68 %, and 5. 78 %, respectively. The model also achieves a frame rate of 92 frame/s and demonstrates good processing capabilities, thus meeting the real-time detection requirements for actual road traffic situations.
引用
收藏
页数:9
相关论文
共 24 条
[1]  
[Anonymous], [15] https://teavm.org/docs/flavour/templates.html.2/12/2021 (cit. on p. 9).
[2]  
Beijing Municipal Public Security Bureau Beijing Municipal Transportation Commission Beijing Municipal Market Supervision and Administration BureauBeijing Municipal Bureau of Commerce, 2022, Gazette of the Beijing Municipal People's Government, P30
[3]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[4]  
[陈世权 Chen Shiquan], 2022, [电光与控制, Electronics Optics & Control], V29, P96
[5]  
Chen X F, 2024, Journal of Harbin University, V29, P107
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[8]   Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916
[9]  
Huangfu J Y, 2023, Computer Systems & Applications, V32, P149
[10]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) :318-327