T-YOLO: Tiny Vehicle Detection Based on YOLO and Multi-Scale Convolutional Neural Networks

被引:41
作者
Carrasco, Daniel Padilla [1 ,2 ]
Rashwan, Hatem A. [1 ]
Garcia, Miguel Angel [3 ]
Puig, Domenec [1 ]
机构
[1] Univ Rovira & Virgili, Dept Comp Engn & Math, Tarragona 43003, Spain
[2] Quercus Technol, Reus 43203, Spain
[3] Univ Autonoma Madrid, Dept Elect & Commun Technol, Madrid 28049, Spain
关键词
Object detection; Cameras; Feature extraction; Computational modeling; Automobiles; Convolutional neural networks; Detectors; tiny objects; smart parking;
D O I
10.1109/ACCESS.2021.3137638
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To solve real-life problems for different smart city applications, using deep Neural Network, such as parking occupancy detection, requires fine-tuning of these networks. For large parking, it is desirable to use a cenital-plane camera located at a high distance that allows the monitoring of the entire parking space or a large parking area with only one camera. Today's most popular object detection models, such as YOLO, achieve good precision scores at real-time speed. However, if we use our own data different from that of the general-purpose datasets, such as COCO and ImageNet, we have a large margin for improvisation. In this paper, we propose a modified, yet lightweight, deep object detection model based on the YOLO-v5 architecture. The proposed model can detect large, small, and tiny objects. Specifically, we propose the use of a multi-scale mechanism to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for detecting objects in a scene (i.e., in our case vehicles). The proposed multi-scale module reduces the number of trainable parameters compared to the original YOLO-v5 architecture. The experimental results also demonstrate that precision is improved by a large margin. In fact, as shown in the experiments, the results show a small reduction from 7.28 million parameters of the YOLO-v5-S profile to 7.26 million parameters in our model. In addition, we reduced the detection speed by inferring 30 fps compared to the YOLO-v5-L/X profiles. In addition, the tiny vehicle detection performance was significantly improved by 33% compared to the YOLO-v5-X profile.
引用
收藏
页码:22430 / 22440
页数:11
相关论文
共 28 条
  • [1] Deep learning for decentralized parking lot occupancy detection
    Amato, Giuseppe
    Carrara, Fabio
    Falchi, Fabrizio
    Gennaro, Claudio
    Meghini, Carlo
    Vairo, Claudio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 : 327 - 334
  • [2] Leveraging Deep Learning and IoT big data analytics to support the smart cities development: Review and future directions
    Ben Atitallah, Safa
    Driss, Maha
    Boulila, Wadii
    Ben Ghezala, Henda
    [J]. COMPUTER SCIENCE REVIEW, 2020, 38
  • [3] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [4] Survey methodology for measuring parking occupancy: Impacts of an on-street parking pricing scheme in an urban center
    Cats, Oded
    Zhang, Chen
    Nissan, Albania
    [J]. TRANSPORT POLICY, 2016, 47 : 55 - 63
  • [5] Chethan Kumar B., 2020, 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), P1316, DOI 10.1109/ICSSIT48917.2020.9214094
  • [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [7] Everingham M., 2010, INT J COMPUT VISION, V88, P303, DOI DOI 10.1007/s11263-009-0275-4
  • [8] Farag M., 2020, Indonesian J. Elect.Eng. Comput. Sci., V19, P964, DOI 10.11591/ijeecs.v19i2.pp964-973
  • [9] Faraji SJ., 2019, J AIR POLLUT HLTH, V4, P53
  • [10] Real Time IP Camera Parking Occupancy Detection using Deep Learning
    Farley, Albertus
    Ham, Hanry
    Hendra
    [J]. 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 606 - 614