DeepCar 5.0: Vehicle Make and Model Recognition Under Challenging Conditions

被引:25
作者
Amirkhani, Abdollah [1 ]
Barshooi, Amir Hossein [1 ]
机构
[1] Iran Univ Sci & Technol, Sch Automot Engn, Tehran 1684613114, Iran
关键词
Attention mechanism; DeepCar; 5.0; intelligent transportation systems; multi-agent system; vehicle make and model recognition (VMMR); TRACKING; IMAGES; SYSTEM; SURF;
D O I
10.1109/TITS.2022.3212921
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Inspired by multi-agent systems (MASs) and ensemble models, a novel method of vehicle make and model recognition (VMMR) based on a vehicle's front-view images is presented in this paper. By exploiting the attention mechanism in this work, it is demonstrated that most of the features used for classifying a vehicle are extracted from its headlight, grill, scoop, and bumper sections. These areas are designated as the regions of interest (ROIs) in our approach. Contrary to the other methods in which a whole ROI is fed to a deep convolutional neural network, in this scheme, different ROIs are extracted from each image, and then a preprocessing block and a distinctive network are designed for each ROI, which is considered as a single agent. Each agent is then trained separately, and a vehicle's type is determined with the collaboration of these agents and based on the blackboard classification system. Also, a new dataset (DeepCar 5.0) is compiled by using the data from the top 50 automakers. This dataset contains 40,185 images of the front views and the front three-quarters of vehicles in 480 different classes, and all the parts of dataset are labeled manually. The proposed technique is able to achieve the accuracies of 92.14 and 96.72% in the automated and the manual scenarios, respectively; and contrary to the current methods, it can perform flawless classification even when just a portion of a vehicle's front view image is available for processing. The dataset and parts of the code are available at: https://github.com/DeepCar/DeepCar5.0.
引用
收藏
页码:541 / 553
页数:13
相关论文
共 65 条
[1]   Consensus in multi-agent systems: a review [J].
Amirkhani, Abdollah ;
Barshooi, Amir Hossein .
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (05) :3897-3935
[2]   Enhancing the Robustness of Visual Object Tracking via Style Transfer [J].
Amirkhani, Abdollah ;
Barshooi, Amir Hossein ;
Ebrahimi, Amir .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (01) :981-997
[3]  
[Anonymous], 2002, Introduction to MPEG-7: Multimedia Content Description Interface
[4]  
[Anonymous], 2018, P 2018 IEEE 5 INT C, DOI DOI 10.1109/ICSIMA.2018.8688775
[5]  
[Anonymous], 2021, INSURANCE INFORM I
[6]   Group-Sensitive Triplet Embedding for Vehicle Reidentification [J].
Bai, Yan ;
Lou, Yihang ;
Gao, Feng ;
Wang, Shiqi ;
Wu, Yuwei ;
Duan, Ling-Yu .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) :2385-2399
[7]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[8]   A Cascaded Part-Based System for Fine-Grained Vehicle Classification [J].
Biglari, Mohsen ;
Soleimani, Ali ;
Hassanpour, Hamid .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (01) :273-283
[9]   Part-based recognition of vehicle make and model [J].
Biglari, Mohsen ;
Soleimani, Ali ;
Hassanpour, Hamid .
IET IMAGE PROCESSING, 2017, 11 (07) :483-491
[10]   Video Face Manipulation Detection Through Ensemble of CNNs [J].
Bonettini, Nicolo ;
Cannas, Edoardo Daniele ;
Mandelli, Sara ;
Bondi, Luca ;
Bestagini, Paolo ;
Tubaro, Stefano .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :5012-5019