Sparse Spatial Coding: A Novel Approach to Visual Recognition

Cited by: 10

Authors:

Oliveira, Gabriel Leivas [1]

Nascimento, Erickson R. [2]

Vieira, Antonio Wilson [3]

Montenegro Campos, Mario Fernando [2]

Affiliations:

[1] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA

[2] Univ Fed Minas Gerais, Dept Comp Sci, BR-31270901 Belo Horizonte, MG, Brazil

[3] Univ Estadual Montes Claros, Dept Math & Comp Sci, BR-39440 Montes Claros, Brazil

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2014 / Vol. 23 / No. 06

Keywords:

Object recognition; image coding; learning (artificial intelligence); computer vision; vision and scene understanding; sparse coding; IMAGE; REPRESENTATIONS; EFFICIENT;

DOI:

10.1109/TIP.2014.2317988

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Successful image-based object recognition techniques have been built on powerful techniques such as sparse representation, in lieu of the popular vector quantization approach. However, one serious drawback of sparse space-based methods is that local features that are quite similar can be quantized into quite distinct visual words. We address this problem with a novel approach for object recognition, called sparse spatial coding, which efficiently combines a sparse coding dictionary learning stage and a spatial constraint coding stage. We performed experimental evaluation using the Caltech 101, Caltech 256, Corel 5000, and Corel 10000 data sets, which were specifically designed for object recognition evaluation. Our results show that our approach achieves high accuracy comparable with the best single-feature method previously published on those databases. Our method outperformed, for the same bases, several multiple-feature methods, and provided equivalent, and in a few cases slightly less accurate, results than other techniques specifically designed to that end. Finally, we report state-of-the-art results for scene recognition on the COsy Localization Dataset (COLD) and high-performance results on the MIT-67 indoor scene recognition data set, thus demonstrating the generalization of our approach to such tasks.


Pages: 2719-2731

Page count: 13
