Mid-Level-Representation based Lexicon for Vehicle Make and Model Recognition

被引:13
作者
Fraz, Muhammad [1 ]
Edirisinghe, Eran A. [1 ]
Sarfraz, M. Saquib [2 ]
机构
[1] Univ Loughborough, Dept Comp Sci, Loughborough, Leics, England
[2] Karlsruhe Inst Technol, Inst Anthropomat, D-76021 Karlsruhe, Germany
来源
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2014年
关键词
Vehicle Make and Model Recognition; Dense Features; Mid-level representation; Fisher Vectors;
D O I
10.1109/ICPR.2014.76
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel framework for representation of images as a combination of multiple mid-level feature descriptor representation based group of visual words. The mid-level feature representation is computed on discriminative patches of the image to build a lexicon, the visual words of which are used to represent the shape within that image. The proposed image representation method has been applied to the application of vehicles make and model recognition. Each make, model class is represented as an over complete sub-lexicon of mid-level feature representation. The classification of vehicles is performed by comparing the visual words of probe image with the learned lexicon of training data using Euclidean distance. The proposed framework offers the advantage of accurate recognition in the presence of significant background clutter. The experiments have shown that the proposed representation successfully captures the fine-grained inter and intra-class discrimination to recognize the model and make of the vehicle without any strict requirement of precise region of interest segmentation. Another important contribution of the paper is a comprehensive dataset of cars depicting images collected in the wild.
引用
收藏
页码:393 / 398
页数:6
相关论文
共 28 条
[1]  
[Anonymous], 2013, BMVC
[2]  
[Anonymous], ECCV
[3]  
[Anonymous], 2011, BMVC
[4]  
[Anonymous], 2010, CVPR
[5]  
Chen L, 2013, ITAOI
[6]  
Clady X, 2008, LECT NOTES ARTIF INT, V5064, P228, DOI 10.1007/978-3-540-69939-2_22
[7]  
Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678
[8]  
Daya B., 2010, Proceedings of the 2010 International Conference on Computational Intelligence and Communication Networks (CICN 2010), P162, DOI 10.1109/CICN.2010.42
[9]  
Jakkola T, 1998, NIPS, P487
[10]  
Jegou H., 2010, CVPR