Sparse Output Coding for Scalable Visual Recognition

被引：3

作者：

Zhao, Bin ^{[1
]}

Xing, Eric P. ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2016年 / 119卷 / 01期

关键词：

Scalable classification; Output coding; Probabilistic decoding; Object recognition; Scene recognition; IMAGE CLASSIFICATION; MULTICLASS;

D O I：

10.1007/s11263-015-0839-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many vision tasks require a multi-class classifier to discriminate multiple categories, on the order of hundreds or thousands. In this paper, we propose sparse output coding, a principled way for large-scale multi-class classification, by turning high-cardinality multi-class categorization into a bit-by-bit decoding problem. Specifically, sparse output coding is composed of two steps: efficient coding matrix learning with scalability to thousands of classes, and probabilistic decoding. Empirical results on object recognition and scene classification demonstrate the effectiveness of our proposed approach.

引用

页码：60 / 75

页数：16

共 62 条

[1] Reducing multiclass to binary: A unifying approach for margin classifiers [J].

Allwein, EL ;

Schapire, RE ;

Singer, Y .

JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141

[2]

[Anonymous], 2004, PROC INT C MACH LEAR

[3]

[Anonymous], 2011, Advances in Neural Information Processing Systems

[4]

[Anonymous], 2004, P 13 ACM INT C INF K

[5]

[Anonymous], MATH PROGR COMPUT

[6]

[Anonymous], 1997, ICML

[7]

[Anonymous], 1999, Tech. Rep.

[8]

[Anonymous], 2010, Advances in Neural Information Processing Systems

[9]

[Anonymous], 1997, ICML

[10]

[Anonymous], 2011, IJCAI

← 1 2 3 4 5 6 7 →