Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets

被引:5
作者
Cortes, Andoni [1 ]
Rodriguez, Clemente [1 ]
Velez, Gorka [2 ]
Barandiaran, Javier [2 ]
Nieto, Marcos [2 ]
机构
[1] Univ Pais Vasco UPV, Comp Architecture & Technol Dept, San Sebastian 20018, Spain
[2] Vicomtech Fdn, Basque Res & Technol Alliance BRTA, Intelligent Transportat Syst & Engn Dept, San Sebastian 20009, Spain
基金
欧盟地平线“2020”;
关键词
Training; Data models; Machine learning; Pipelines; Detectors; Vehicles; Synthetic datasets; deep learning; traffic sign recognition; TRAFFIC SIGN RECOGNITION; DEEP NEURAL-NETWORK;
D O I
10.1109/TITS.2020.3009186
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A major challenges of deep learning (DL) is the necessity to collect huge amounts of training data. Often, the lack of a sufficiently large dataset discourages the use of DL in certain applications. Typically, acquiring the required amounts of data costs considerable time, material and effort. To mitigate this problem, the use of synthetic images combined with real data is a popular approach, widely adopted in the scientific community to effectively train various detectors. In this study, we examined the potential of synthetic data-based training in the field of intelligent transportation systems. Our focus is on camera-based traffic sign recognition applications for advanced driver assistance systems and autonomous driving. The proposed augmentation pipeline of synthetic datasets includes novel augmentation processes such as structured shadows and gaussian specular highlights. A well-known DL model was trained with different datasets to compare the performance of synthetic and real image-based trained models. Additionally, a new, detailed method to objectively compare these models is proposed. Synthetic images are generated using a semi-supervised errors-guide method which is also described. Our experiments showed that a synthetic image-based approach outperforms in most cases real image-based training when applied to cross-domain test datasets (+10% precision for GTSRB dataset) and consequently, the generalization of the model is improved decreasing the cost of acquiring images.
引用
收藏
页码:190 / 199
页数:10
相关论文
共 52 条
[1]   What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance [J].
Afifi, Mahmoud ;
Brown, Michael S. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :243-252
[2]   A practical approach for detection and classification of traffic signs using Convolutional Neural Networks [J].
Aghdam, Hamed Habibi ;
Heravi, Elnaz Jahani ;
Puig, Domenec .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 84 :97-112
[3]  
[Anonymous], 2018, ARXIV180307721
[4]  
[Anonymous], 2012, Euro Truck Simulator 2
[5]  
[Anonymous], 2016, FERNBUS SIMULATOR
[6]  
[Anonymous], 2019, PAPER DATA
[7]  
[Anonymous], PRESCAN SIMULATION A
[8]   Deep neural network for traffic sign recognition systems: An analysis of spatial transformers and stochastic optimisation methods [J].
Arcos-Garcia, Alvaro ;
Alvarez-Garcia, Juan A. ;
Soria-Morillo, Luis M. .
NEURAL NETWORKS, 2018, 99 :158-165
[9]  
Baird Henry S., 1992, Structured Document Image Analysis, DOI DOI 10.1007/978-3-642-77281-8
[10]   Biomedical image augmentation using Augmentor [J].
Bloice, Marcus D. ;
Roth, Peter M. ;
Holzinger, Andreas .
BIOINFORMATICS, 2019, 35 (21) :4522-4524