Evaluation of Robust Spatial Pyramid Pooling Based on Convolutional Neural Network for Traffic Sign Recognition System

被引:42
作者
Dewi, Christine [1 ,2 ]
Chen, Rung-Ching [1 ]
Tai, Shao-Kuo [1 ]
机构
[1] Chaoyang Univ Technol, Dept Informat Management, Taichung 41349, Taiwan
[2] Satya Wacana Christian Univ, Fac Informat Technol, Central Java 50711, Indonesia
关键词
spatial pyramid pooling; Yolo V3; object recognition; convolutional neural network; DEEP; VIDEO;
D O I
10.3390/electronics9060889
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic sign recognition (TSR) is a noteworthy issue for real-world applications such as systems for autonomous driving as it has the main role in guiding the driver. This paper focuses on Taiwan's prohibitory sign due to the lack of a database or research system for Taiwan's traffic sign recognition. This paper investigates the state-of-the-art of various object detection systems (Yolo V3, Resnet 50, Densenet, and Tiny Yolo V3) combined with spatial pyramid pooling (SPP). We adopt the concept of SPP to improve the backbone network of Yolo V3, Resnet 50, Densenet, and Tiny Yolo V3 for building feature extraction. Furthermore, we use a spatial pyramid pooling to study multi-scale object features thoroughly. The observation and evaluation of certain models include vital metrics measurements, such as the mean average precision (mAP), workspace size, detection time, intersection over union (IoU), and the number of billion floating-point operations (BFLOPS). Our findings show that Yolo V3 SPP strikes the best total BFLOPS (65.69), and mAP (98.88%). Besides, the highest average accuracy is Yolo V3 SPP at 99%, followed by Densenet SPP at 87%, Resnet 50 SPP at 70%, and Tiny Yolo V3 SPP at 50%. Hence, SPP can improve the performance of all models in the experiment.
引用
收藏
页数:21
相关论文
共 63 条
[1]  
[Anonymous], 2019, IMAGE VISION COMPUT, DOI DOI 10.1016/j.imavis.2019.04.007
[2]  
[Anonymous], 2016, ADV NEUR INF PROC SY, DOI [DOI 10.2165/00129785-200404040-00005, DOI 10.1145/3065386]
[3]   Evaluation of deep neural networks for traffic sign detection systems [J].
Arcos-Garcia, Alvaro ;
Alvarez-Garcia, Juan A. ;
Soria-Morillo, Luis M. .
NEUROCOMPUTING, 2018, 316 :332-344
[4]   Summary of current radiometric calibration coefficients for Landsat MSS, TM, ETM+, and EO-1 ALI sensors [J].
Chander, Gyanesh ;
Markham, Brian L. ;
Helder, Dennis L. .
REMOTE SENSING OF ENVIRONMENT, 2009, 113 (05) :893-903
[5]   Research on Recognition Method of Electrical Components Based on YOLO V3 [J].
Chen, Haipeng ;
He, Zhentao ;
Shi, Bowen ;
Zhong, Tie .
IEEE ACCESS, 2019, 7 :157818-157829
[6]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[7]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[8]  
Dewi C, 2019, IEEE SYS MAN CYBERN, P2496, DOI 10.1109/SMC.2019.8913868
[9]   Tinier-YOLO: A Real-Time Object Detection Method for Constrained Environments [J].
Fang, Wei ;
Wang, Lin ;
Ren, Peiming .
IEEE ACCESS, 2020, 8 :1935-1944
[10]  
Fang WZ, 2019, IEEE J-STARS, V12, P3168, DOI 10.1109/JSTARS.2019.2929601