A Review of Codebook Models in Patch-Based Visual Object Recognition

被引:0
作者
Amirthalingam Ramanan
Mahesan Niranjan
机构
[1] University of Southampton,School of Electronics and Computer Science
[2] University of Jaffna,Department of Computer Science
来源
Journal of Signal Processing Systems | 2012年 / 68卷
关键词
Bag-of-features; Cluster analysis; Object recognition; Visual codebook; SIFT;
D O I
暂无
中图分类号
学科分类号
摘要
The codebook model-based approach, while ignoring any structural aspect in vision, nonetheless provides state-of-the-art performances on current datasets. The key role of a visual codebook is to provide a way to map the low-level features into a fixed-length vector in histogram space to which standard classifiers can be directly applied. The discriminative power of such a visual codebook determines the quality of the codebook model, whereas the size of the codebook controls the complexity of the model. Thus, the construction of a codebook is an important step which is usually done by cluster analysis. However, clustering is a process that retains regions of high density in a distribution and it follows that the resulting codebook need not have discriminant properties. This is also recognised as a computational bottleneck of such systems. In our recent work, we proposed a resource-allocating codebook, to constructing a discriminant codebook in a one-pass design procedure that slightly outperforms more traditional approaches at drastically reduced computing times. In this review we survey several approaches that have been proposed over the last decade with their use of feature detectors, descriptors, codebook construction schemes, choice of classifiers in recognising objects, and datasets that were used in evaluating the proposed methods.
引用
收藏
页码:333 / 352
页数:19
相关论文
共 55 条
  • [1] Agarwal S(2004)Learning to detect objects in images via a sparse, part-based representation IEEE Transactions on Pattern Analysis and Machince Intelligence 26 1475-1490
  • [2] Awan A(2008)SURF: Speeded up robust features Computer Vision and Image Understanding 110 346-359
  • [3] Roth D(2003)Latent Dirichlet allocation Journal of Machine Learning Research 3 993-1022
  • [4] Bay H(2001)Random forests Machine Learning 45 5-32
  • [5] Ess A(1986)A computational approach to edge detection IEEE Transactions on Pattern Analysis and Machine Intelligence 8 544-567
  • [6] Tuytelaars T(2002)Mean shift: A robust approach toward feature space analysis IEEE Transactions on Pattern Analysis and Machine Intelligence 24 603-619
  • [7] Gool LV(1999)Data clustering: A review ACM Computing Surveys 31 264-323
  • [8] Blei DM(2009)A comparison of SIFT, PCA-SIFT and SURF International Journal of Image Processing 3 143-152
  • [9] Ng AY(2001)Saliency, scale and image description International Journal of Computer Vision 45 83-105
  • [10] Jordan MI(2007)Automated insect identification through concatenated histograms of local appearance features Machine Vision and Applications 19 105-123