Are We Ready for Accurate and Unbiased Fine-Grained Vehicle Classification in Realistic Environments?

被引:9
作者
Corrales Sanchez, Hector [1 ]
Hernandez Parra, Noelia [1 ]
Parra Alonso, Ignacio [1 ]
Nebot, Eduardo [2 ]
Fernandez-Llorca, David [1 ,3 ]
机构
[1] Univ Alcala, Comp Engn Dept, Alcala De Henares 28801, Spain
[2] Univ Sydney, Australian Ctr Field Robot, Sydney, NSW 2006, Australia
[3] European Commiss, Joint Res Ctr, Seville 41092, Spain
关键词
Feature extraction; Three-dimensional displays; Solid modeling; Automobiles; Annotations; Training; Licenses; Fine-grained classification; vehicle make and model; dataset bias; curriculum learning; weighted loss; cross-datasets; CONVOLUTIONAL NEURAL-NETWORK; RECOGNITION;
D O I
10.1109/ACCESS.2021.3104340
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained vehicle classification from images, also known as Vehicle Make and Model Recognition (VMMR), has become an important research topic in the last years, with a growing number of scientific contributions in multiple application areas, such as autonomous vehicles, surveillance systems, traffic monitoring and management, among others. Recent techniques based on deep learning have proven to be very effective in addressing this problem. So effective that, based on the state-of-the-art results (above 95% accuracy), it would seem that the problem is practically solved. However, our main hypothesis is that the existing datasets to date have limited variability, which precludes good and unbiased generalisation of the models trained with them. In particular, it is observed that the test datasets are very similar in nature to those used for training and validation which makes these benchmarks prone to dataset bias and to overfitting. When these systems are tested with more challenging data or data from different datasets performance degrades considerably. In this paper, on the one hand, we evaluate state-of-the-art deep learning models to perform fine-grained vehicle classification and explore multiple training techniques, such as curriculum learning or weighted losses, to mitigate the bias between different makes and models and to assess the limits of current approaches. On the other hand, we analyse the existing datasets, present an additional dataset from a challenging scenario, and merge all the data into a cross-dataset that includes common samples and classes from the existing datasets. In this way, we can evaluate geographical, make and model biases, and performance and generalisation capabilities from a more realistic perspective. The obtained results suggest that we are still far from accurate and unbiased vehicle make and model recognition in realistic traffic and driving scenarios.
引用
收藏
页码:116338 / 116355
页数:18
相关论文
共 60 条
[1]  
[Anonymous], 2000, Seventeenth International Conference on Machine Learning
[2]  
[Anonymous], 2003, WORKSHOP LEARNING IM
[3]  
[Anonymous], 2016, ARXIV161101714
[4]  
Barua S, 2011, LECT NOTES COMPUT SC, V7063, P735, DOI 10.1007/978-3-642-24958-7_85
[5]  
Bengio Y., 2009, P 26 ANN INT C MACH, P41
[6]   Benchmark Analysis of Representative Deep Neural Network Architectures [J].
Bianco, Simone ;
Cadene, Remi ;
Celona, Luigi ;
Napoletano, Paolo .
IEEE ACCESS, 2018, 6 :64270-64277
[7]  
Buzzelli M., SENSORS-BASEL, V21, P596
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]  
Corrales H., 2020, Computer Aided Systems Theory - EUROCAST 2019. 17th International Conference. Revised Selected Papers. Lecture Notes in Computer Science (LNCS 12014), P104, DOI 10.1007/978-3-030-45096-0_13
[10]   Simple Baseline for Vehicle Pose Estimation: Experimental Validation [J].
Corrales Sanchez, Hector ;
Hernandez Martinez, Antonio ;
Izquierdo Gonzalo, Ruben ;
Hernandez Parra, Noelia ;
Parra Alonso, Ignacio ;
Fernandez-Llorca, David .
IEEE ACCESS, 2020, 8 :132539-132550