An Ensemble of Deep Support Vector Machines for Image Categorization

被引:20
作者
Abdullah, Azizi [1 ]
Veltkamp, Remco C. [1 ]
Wiering, Marco A. [2 ]
机构
[1] Univ Utrecht, Dept Informat & Comp Sci, NL-3508TC Utrecht, Netherlands
[2] Univ Groningen, Dept Artificial Intelligence, NL-9700AB Groningen, Netherlands
来源
2009 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION | 2009年
关键词
Image categorization; support vector machines; ensemble methods; product rule; deep architectures;
D O I
10.1109/SoCPaR.2009.67
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the deep support vector machine (D-SVM) inspired by the increasing popularity of deep belief networks for image recognition. Our deep SVM trains an SVM in the standard way and then uses the kernel activations of support vectors as inputs for training another SVM at the next layer. In this way, instead of the normal linear combination of kernel activations, we can create non-linear combinations of kernel activations on prototype examples. Furthermore, we combine different descriptors in an ensemble of deep SVMs where the product rule is used for combining probability estimates of the different classifiers. We have performed experiments on 20 classes from the Caltech object database and 10 classes from the Corel dataset. The results show that our ensemble of deep SVMs significantly outperforms the naive approach that combines all descriptors directly in a very large single input vector for an SVM. Furthermore, our ensemble of D-SVMs achieves an accuracy of 95.2% on the Corel dataset with 10 classes, which is the best performance reported in literature until now.
引用
收藏
页码:301 / +
页数:2
相关论文
共 23 条
[11]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[12]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[13]   To recognize shapes, first learn to generate images [J].
Hinton, Geoffrey E. .
COMPUTATIONAL NEUROSCIENCE: THEORETICAL INSIGHTS INTO BRAIN FUNCTION, 2007, 165 :535-547
[14]   HIERARCHICAL MIXTURES OF EXPERTS AND THE EM ALGORITHM [J].
JORDAN, MI ;
JACOBS, RA .
NEURAL COMPUTATION, 1994, 6 (02) :181-214
[15]  
Lazebnik S., 2006, P IEEE COMP SOC C CO, V2, P2169, DOI 10.1109/CVPR.2006.68
[16]   Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories [J].
Li Fei-Fei ;
Fergus, Rob ;
Perona, Pietro .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2007, 106 (01) :59-70
[17]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[18]  
SCHOLKOPF B, 1995, P 1 INT C KNOWL DISC, P252
[19]  
Tax DavidM. J., 1997, Proc. Workshop on Statistical Pattern Recognition, P165
[20]  
Vapnik V., 1999, NATURE STAT LEARNING