Evaluation of Robust Spatial Pyramid Pooling Based on Convolutional Neural Network for Traffic Sign Recognition System

Cited by: 42
Authors
Dewi, Christine [1, 2]
Chen, Rung-Ching [1 ]
Tai, Shao-Kuo [1 ]
Affiliations
[1] Chaoyang Univ Technol, Dept Informat Management, Taichung 41349, Taiwan
[2] Satya Wacana Christian Univ, Fac Informat Technol, Central Java 50711, Indonesia
Keywords
spatial pyramid pooling; Yolo V3; object recognition; convolutional neural network; DEEP; VIDEO;
DOI
10.3390/electronics9060889
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Traffic sign recognition (TSR) is an important problem for real-world applications such as autonomous driving systems, where it plays a central role in guiding the driver. This paper focuses on Taiwan's prohibitory signs because of the lack of a database or research system for Taiwan's traffic sign recognition. This paper investigates state-of-the-art object detection systems (Yolo V3, Resnet 50, Densenet, and Tiny Yolo V3) combined with spatial pyramid pooling (SPP). We adopt the concept of SPP to improve the backbone networks of Yolo V3, Resnet 50, Densenet, and Tiny Yolo V3 for feature extraction. Furthermore, we use spatial pyramid pooling to study multi-scale object features thoroughly. The models are observed and evaluated on key metrics, including mean average precision (mAP), workspace size, detection time, intersection over union (IoU), and the number of billion floating-point operations (BFLOPS). Our findings show that Yolo V3 SPP achieves the best total BFLOPS (65.69) and mAP (98.88%). In addition, Yolo V3 SPP has the highest average accuracy at 99%, followed by Densenet SPP at 87%, Resnet 50 SPP at 70%, and Tiny Yolo V3 SPP at 50%. Hence, SPP can improve the performance of all models in the experiment.
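As a rough illustration of one metric named in the abstract, intersection over union (IoU) for two axis-aligned bounding boxes can be sketched as follows. This is a minimal sketch and not the authors' evaluation code: the function name `iou` and the `(x_min, y_min, x_max, y_max)` box layout are assumptions made for the example.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes.

    Each box is (x_min, y_min, x_max, y_max); this layout is an
    assumption for illustration, not taken from the paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, clamped at zero
    # when the boxes do not intersect.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union = sum of both areas minus the overlap counted twice.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0
```

For example, two 2x2 boxes offset by one unit in each direction overlap in a 1x1 region, so `iou((0, 0, 2, 2), (1, 1, 3, 3))` evaluates to 1/7.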
Pages: 21