Gaussian Kernel Width Optimization for Sparse Bayesian Learning

被引:32
作者
Mohsenzadeh, Yalda [1 ]
Sheikhzadeh, Hamid [1 ]
机构
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran 1415854546, Iran
关键词
Adaptive kernel learning (AKL); expectation maximization (EM); kernel width optimization; regression; relevance vector machine (RVM); sparse Bayesian learning; supervised kernel methods; MACHINE; POSE;
D O I
10.1109/TNNLS.2014.2321134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sparse kernel methods have been widely used in regression and classification applications. The performance and the sparsity of these methods are dependent on the appropriate choice of the corresponding kernel functions and their parameters. Typically, the kernel parameters are selected using a cross-validation approach. In this paper, a learning method that is an extension of the relevance vector machine (RVM) is presented. The proposed method can find the optimal values of the kernel parameters during the training procedure. This algorithm uses an expectation-maximization approach for updating kernel parameters as well as other model parameters; therefore, the speed of convergence and computational complexity of the proposed method are the same as the standard RVM. To control the convergence of this fully parameterized model, the optimization with respect to the kernel parameters is performed using a constraint on these parameters. The proposed method is compared with the typical RVM and other competing methods to analyze the performance. The experimental results on the commonly used synthetic data, as well as benchmark data sets, demonstrate the effectiveness of the proposed method in reducing the performance dependency on the initial choice of the kernel parameters.
引用
收藏
页码:709 / 719
页数:11
相关论文
共 28 条
[1]  
Agarwal A, 2004, PROC CVPR IEEE, P882
[2]  
Berger J.O., 1985, Statistical Decision Theory and Bayesian Analysis
[3]   Generalized multiscale radial basis function networks [J].
Billings, Stephen A. ;
Wei, Hua-Liang ;
Balikhin, Michael A. .
NEURAL NETWORKS, 2007, 20 (10) :1081-1094
[4]  
Bishop C. M., 2006, Journal of Electronic Imaging, V128
[5]   A Bayesian model for local smoothing in kernel density estimation [J].
Brewer, MJ .
STATISTICS AND COMPUTING, 2000, 10 (04) :299-309
[6]   Regression Based on Sparse Bayesian Learning and the Applications in Electric Systems [J].
Duan, Qing ;
Zhao, Jian-guo ;
Niu, Lin ;
Luo, Ke .
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2008, :106-110
[7]   PROBABILISTIC PREDICTION OF NEUROLOGICAL DISORDERS WITH A STATISTICAL ASSESSMENT OF NEUROIMAGING DATA MODALITIES [J].
Filippone, M. ;
Marquand, A. F. ;
Blain, C. R. V. ;
Williams, S. C. R. ;
Mourao-Miranda, J. ;
Girolami, M. .
ANNALS OF APPLIED STATISTICS, 2012, 6 (04) :1883-1905
[8]  
Girolami M., 2005, Proceedings of the 22nd International Conference on Machine Learning, P241
[9]   Kernel methods in machine learning [J].
Hofmann, Thomas ;
Schoelkopf, Bernhard ;
Smola, Alexander J. .
ANNALS OF STATISTICS, 2008, 36 (03) :1171-1220
[10]  
HOOKE R, 1961, J ACM, V8, P212, DOI 10.1145/321062.321069