On sparsity of data representation in support vector machines

被引:0
作者
Ancona, N [1 ]
Maglietta, R [1 ]
Stella, E [1 ]
机构
[1] CNR, Ist Studi Sistemi Intelligenti Automaz, I-70126 Bari, Italy
来源
Proceedings of the Sixth IASTED International Conference on Signal and Image Processing | 2004年
关键词
method of frame; matching pursuit; data representation; classification; kernel methods; machine learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on the problem of how data representation influences the generalization error of kernel based learning machines like Support Vector Machines (SVM) for classification. Frame theory provides a well founded mathematical framework for representing data in many different ways. We analyze the effects of sparse and dense data representations on the generalization error of such learning machines measured by using leave-one-out error Given a finite number of training data. We show that, in the case of sparse data representation, the Generalization capacity of an SVM trained by using polynomial or Gaussian kernel functions is equal to the one of a linear SVM. We show that sparse data representations reduce the generalization error as long as the representation is not too sparse, as in the case of very large dictionaries. Very sparse representations increase drastically the Generalization error of kernel based methods. Dense data representations, on the contrary, reduce the Generalization error also in the case of very large dictionaries. We use two different schemes for representing data in overcomplete Haar and Gabor dictionaries, and measure SVM Generalization error on bench mark data set.
引用
收藏
页码:596 / 601
页数:6
相关论文
共 8 条
[1]  
[Anonymous], CBMS NSF REGIONAL C
[2]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[3]  
CHEN S, 1995, 479 STANF U DEP STAT
[4]  
KOLMOGOROV AN, 1970, INTRO ANAL
[5]   MATCHING PURSUITS WITH TIME-FREQUENCY DICTIONARIES [J].
MALLAT, SG ;
ZHANG, ZF .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3397-3415
[6]  
MANDRIOTA C, 2002, 022002 RIIESICNR
[7]  
NANCONA N, 2003, 022003 RIIESICNR
[8]  
Vapnik V., 1998, STAT LEARNING THEORY, V1, P2