Road object detection: a comparative study of deep learning-based algorithms

Cited by: 29
Authors
Mahaur, Bharat [1]
Singh, Navjot [2]
Mishra, K. K. [1]
Affiliations
[1] Motilal Nehru National Institute of Technology Allahabad, Department of Computer Science and Engineering, Allahabad, Uttar Pradesh, India
[2] Indian Institute of Information Technology Allahabad, Department of Information Technology, Allahabad, Uttar Pradesh, India
Keywords
Autonomous vehicles; Intelligent transportation system (ITS); Object detection; Deep learning; Vehicle detection
DOI
10.1007/s11042-022-12447-5
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep learning has advanced vision-based surround perception and has become one of the most active areas in Intelligent Transportation Systems (ITS). Deep learning-based algorithms that operate on two-dimensional images have become essential tools for autonomous vehicles, supporting object detection, tracking, and segmentation of road targets, primarily pedestrians, vehicles, traffic lights, and traffic signs. Autonomous vehicles rely heavily on visual data to classify target objects and to generalize across them, so that the safety requirements of pedestrians and other vehicles in their environment are met. Deep learning-based algorithms achieve outstanding object detection results in real time. Although several studies have thoroughly examined different types of deep learning-based object detection methods, few comparative studies exist, and those test either the detection speed or the accuracy of the algorithms. Beyond speed and accuracy, autonomous driving also depends on model size and energy efficiency, yet existing deep learning-based methods have rarely been compared on these metrics. This article provides a detailed and systematic comparative analysis of five mainstream deep learning-based algorithms for road object detection, namely R-FCN, Mask R-CNN, SSD, RetinaNet, and YOLOv4, on the large-scale Berkeley DeepDrive (BDD100K) dataset. The experimental results are analyzed using mean Average Precision (mAP) and inference time. Additionally, practical metrics such as model size, computational complexity, and energy efficiency are computed for each model. Furthermore, the performance of each algorithm is evaluated under different road environmental conditions at various times of day and night.
The comparison presented in this article offers insight into the strengths and limitations of popular deep learning-based algorithms under practical constraints, including their feasibility for real-time deployment. Code is publicly available at: https://github.com/bharatmahaur/ComparativeStudy
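As a rough illustration of the mAP metric named in the abstract, the sketch below computes per-class Average Precision as the area under an interpolated precision-recall curve and then averages over classes. It is not the authors' evaluation code: the abstract does not state the IoU threshold or the interpolation scheme, so all-point interpolation is an assumption here, the function names are hypothetical, and detections are assumed to be already matched to ground-truth boxes.

```python
from typing import Dict, List, Tuple

# One detection: (confidence score, whether it matched a ground-truth box).
Detection = Tuple[float, bool]


def average_precision(detections: List[Detection], num_ground_truth: int) -> float:
    """Area under the precision-recall curve for one class.

    Assumes all-point interpolation and that matching to ground truth
    (for example by an IoU threshold) has already been done.
    """
    if num_ground_truth == 0:
        return 0.0
    # Rank detections from most to least confident.
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)
    precisions: List[float] = []
    recalls: List[float] = []
    true_positives = 0
    for rank, (_, is_true_positive) in enumerate(ranked, start=1):
        true_positives += int(is_true_positive)
        precisions.append(true_positives / rank)
        recalls.append(true_positives / num_ground_truth)
    # Interpolate: precision at each rank becomes the best precision
    # reached at that rank or any later one.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    # Sum the rectangle areas wherever recall increases.
    area = 0.0
    previous_recall = 0.0
    for precision, recall in zip(precisions, recalls):
        area += (recall - previous_recall) * precision
        previous_recall = recall
    return area


def mean_average_precision(per_class: Dict[str, Tuple[List[Detection], int]]) -> float:
    """Unweighted mean of per-class AP values."""
    aps = [average_precision(dets, n_gt) for dets, n_gt in per_class.values()]
    return sum(aps) / len(aps) if aps else 0.0


# Toy input for illustration only; these are not results from the paper.
toy_results = {
    "pedestrian": ([(0.9, True), (0.8, False), (0.7, True)], 2),
    "traffic light": ([(0.6, True)], 1),
}
toy_map = mean_average_precision(toy_results)
```

Benchmark toolkits differ in how they interpolate the curve and which IoU thresholds they average over, so values from a sketch like this are only comparable to published numbers when the protocol matches.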
Pages: 14247-14282
Page count: 36