An Ensemble of Deep Support Vector Machines for Image Categorization

被引:20
作者
Abdullah, Azizi [1 ]
Veltkamp, Remco C. [1 ]
Wiering, Marco A. [2 ]
机构
[1] Univ Utrecht, Dept Informat & Comp Sci, NL-3508TC Utrecht, Netherlands
[2] Univ Groningen, Dept Artificial Intelligence, NL-9700AB Groningen, Netherlands
来源
2009 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION | 2009年
关键词
Image categorization; support vector machines; ensemble methods; product rule; deep architectures;
D O I
10.1109/SoCPaR.2009.67
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the deep support vector machine (D-SVM) inspired by the increasing popularity of deep belief networks for image recognition. Our deep SVM trains an SVM in the standard way and then uses the kernel activations of support vectors as inputs for training another SVM at the next layer. In this way, instead of the normal linear combination of kernel activations, we can create non-linear combinations of kernel activations on prototype examples. Furthermore, we combine different descriptors in an ensemble of deep SVMs where the product rule is used for combining probability estimates of the different classifiers. We have performed experiments on 20 classes from the Caltech object database and 10 classes from the Corel dataset. The results show that our ensemble of deep SVMs significantly outperforms the naive approach that combines all descriptors directly in a very large single input vector for an SVM. Furthermore, our ensemble of D-SVMs achieves an accuracy of 95.2% on the Corel dataset with 10 classes, which is the best performance reported in literature until now.
引用
收藏
页码:301 / +
页数:2
相关论文
共 23 条
[1]   CIREC: Cluster correlogram image retrieval and categorization using MPEG-7 descriptors [J].
Abdullah, A. ;
Wiering, Marco A. .
2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN IMAGE AND SIGNAL PROCESSING, 2007, :431-437
[2]  
ABDULLAH A, 2009, INT JOINT C NEUR NET
[3]  
[Anonymous], 2007, ADV NEURAL INFORM PR
[4]  
[Anonymous], 2003, PRACTICAL GUIDE SUPP
[5]  
[Anonymous], 2007, LARGE SCALE KERNEL M
[6]  
Bosch A., 2008, INT J COMPUTER UNPUB
[7]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[8]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15