Sparse Output Coding for Scalable Visual Recognition

被引:3
作者
Zhao, Bin [1 ]
Xing, Eric P. [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
Scalable classification; Output coding; Probabilistic decoding; Object recognition; Scene recognition; IMAGE CLASSIFICATION; MULTICLASS;
D O I
10.1007/s11263-015-0839-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many vision tasks require a multi-class classifier to discriminate multiple categories, on the order of hundreds or thousands. In this paper, we propose sparse output coding, a principled way for large-scale multi-class classification, by turning high-cardinality multi-class categorization into a bit-by-bit decoding problem. Specifically, sparse output coding is composed of two steps: efficient coding matrix learning with scalability to thousands of classes, and probabilistic decoding. Empirical results on object recognition and scene classification demonstrate the effectiveness of our proposed approach.
引用
收藏
页码:60 / 75
页数:16
相关论文
共 62 条
[1]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[2]  
[Anonymous], 2004, PROC INT C MACH LEAR
[3]  
[Anonymous], 2011, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2004, P 13 ACM INT C INF K
[5]  
[Anonymous], MATH PROGR COMPUT
[6]  
[Anonymous], 1997, ICML
[7]  
[Anonymous], 1999, Tech. Rep.
[8]  
[Anonymous], 2010, Advances in Neural Information Processing Systems
[9]  
[Anonymous], 1997, ICML
[10]  
[Anonymous], 2011, IJCAI