T-YOLO: Tiny Vehicle Detection Based on YOLO and Multi-Scale Convolutional Neural Networks

被引：41

作者：

Carrasco, Daniel Padilla ^{[1
,2
]}

Rashwan, Hatem A. ^{[1
]}

Garcia, Miguel Angel ^{[3
]}

Puig, Domenec ^{[1
]}

机构：

[1] Univ Rovira & Virgili, Dept Comp Engn & Math, Tarragona 43003, Spain

[2] Quercus Technol, Reus 43203, Spain

[3] Univ Autonoma Madrid, Dept Elect & Commun Technol, Madrid 28049, Spain

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Object detection; Cameras; Feature extraction; Computational modeling; Automobiles; Convolutional neural networks; Detectors; tiny objects; smart parking;

D O I：

10.1109/ACCESS.2021.3137638

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To solve real-life problems for different smart city applications, using deep Neural Network, such as parking occupancy detection, requires fine-tuning of these networks. For large parking, it is desirable to use a cenital-plane camera located at a high distance that allows the monitoring of the entire parking space or a large parking area with only one camera. Today's most popular object detection models, such as YOLO, achieve good precision scores at real-time speed. However, if we use our own data different from that of the general-purpose datasets, such as COCO and ImageNet, we have a large margin for improvisation. In this paper, we propose a modified, yet lightweight, deep object detection model based on the YOLO-v5 architecture. The proposed model can detect large, small, and tiny objects. Specifically, we propose the use of a multi-scale mechanism to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for detecting objects in a scene (i.e., in our case vehicles). The proposed multi-scale module reduces the number of trainable parameters compared to the original YOLO-v5 architecture. The experimental results also demonstrate that precision is improved by a large margin. In fact, as shown in the experiments, the results show a small reduction from 7.28 million parameters of the YOLO-v5-S profile to 7.26 million parameters in our model. In addition, we reduced the detection speed by inferring 30 fps compared to the YOLO-v5-L/X profiles. In addition, the tiny vehicle detection performance was significantly improved by 33% compared to the YOLO-v5-X profile.

引用

页码：22430 / 22440

页数：11

共 28 条

[1] Deep learning for decentralized parking lot occupancy detection
Amato, Giuseppe
Carrara, Fabio
Falchi, Fabrizio
Gennaro, Claudio
Meghini, Carlo
Vairo, Claudio
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 : 327 - 334
[2] Leveraging Deep Learning and IoT big data analytics to support the smart cities development: Review and future directions
Ben Atitallah, Safa
Driss, Maha
Boulila, Wadii
Ben Ghezala, Henda
[J]. COMPUTER SCIENCE REVIEW, 2020, 38
[3] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4] Survey methodology for measuring parking occupancy: Impacts of an on-street parking pricing scheme in an urban center
Cats, Oded
Zhang, Chen
Nissan, Albania
[J]. TRANSPORT POLICY, 2016, 47 : 55 - 63
[5] Chethan Kumar B., 2020, 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), P1316, DOI 10.1109/ICSSIT48917.2020.9214094
[6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7] Everingham M., 2010, INT J COMPUT VISION, V88, P303, DOI DOI 10.1007/s11263-009-0275-4
[8] Farag M., 2020, Indonesian J. Elect.Eng. Comput. Sci., V19, P964, DOI 10.11591/ijeecs.v19i2.pp964-973
[9] Faraji SJ., 2019, J AIR POLLUT HLTH, V4, P53
[10] Real Time IP Camera Parking Occupancy Detection using Deep Learning
Farley, Albertus
Ham, Hanry
Hendra
[J]. 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 606 - 614

← 1 2 3 →