Sparse Spatial Coding: A Novel Approach to Visual Recognition

被引:10
作者
Oliveira, Gabriel Leivas [1 ]
Nascimento, Erickson R. [2 ]
Vieira, Antonio Wilson [3 ]
Montenegro Campos, Mario Fernando [2 ]
机构
[1] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
[2] Univ Fed Minas Gerais, Dept Comp Sci, BR-31270901 Belo Horizonte, MG, Brazil
[3] Univ Estadual Montes Claros, Dept Math & Comp Sci, BR-39440 Montes Claros, Brazil
关键词
Object recognition; image coding; learning (artificial intelligence); computer vision; vision and scene undertanding; sparse coding; IMAGE; REPRESENTATIONS; EFFICIENT;
D O I
10.1109/TIP.2014.2317988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Successful image-based object recognition techniques have been constructed founded on powerful techniques such as sparse representation, in lieu of the popular vector quantization approach. However, one serious drawback of sparse space-based methods is that local features that are quite similar can be quantized into quite distinct visual words. We address this problem with a novel approach for object recognition, called sparse spatial coding, which efficiently combines a sparse coding dictionary learning and spatial constraint coding stage. We performed experimental evaluation using the Caltech 101, Caltech 256, Corel 5000, and Corel 10000 data sets, which were specifically designed for object recognition evaluation. Our results show that our approach achieves high accuracy comparable with the best single feature method previously published on those databases. Our method outperformed, for the same bases, several multiple feature methods, and provided equivalent, and in few cases, slightly less accurate results than other techniques specifically designed to that end. Finally, we report state-of-the-art results for scene recognition on COsy Localization Dataset (COLD) and high performance results on the MIT-67 indoor scene recognition, thus demonstrating the generalization of our approach for such tasks.
引用
收藏
页码:2719 / 2731
页数:13
相关论文
共 49 条
  • [1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    Aharon, Michal
    Elad, Michael
    Bruckstein, Alfred
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) : 4311 - 4322
  • [2] [Anonymous], 2006, ADV NEURAL INF PROCE
  • [3] [Anonymous], 2011, P AISTATS
  • [4] [Anonymous], 2007, P CVPR
  • [5] [Anonymous], NEURAL INFORM PROCES
  • [6] [Anonymous], P VISAPP
  • [7] [Anonymous], 7 MN USA I MATH APPL
  • [8] [Anonymous], 2007, P 6 ACM INT C IM VID, DOI [DOI 10.1145/1282280.1282340, 10.1145/1282280.1282340]
  • [9] [Anonymous], P 21 INT C MACH LEAR
  • [10] [Anonymous], NEURAL INFORM PROCES