A gradient-based forward greedy algorithm for sparse Gaussian process regression

被引:0
作者
Sun, Ping [1 ]
Yao, Xin [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, CERCIA, Edgbaston Pk Rd, Birmingham B15 2TT, W Midlands, England
来源
TRENDS IN NEURAL COMPUTATION | 2007年 / 35卷
关键词
Gaussian process regression; sparse approximation; sequential forward greedy algorithm; basis vector selection; basis vector construction; gradient-based optimisation; gradient boosting;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this chaper, we present a gradient-based forward greedy method for sparse approximation of Bayesian Gaussian Process Regression (GPR) model. Different from previous work, which is mostly based on various basis vector selection strategies, we propose to construct instead of select a new basis vector at each iterative step. This idea was motivated from the well-known gradient boosting approach. The resulting algorithm built on gradient-based optimisation packages incurs similar computational cost and memory requirements to other leading sparse GPR algorithms. Moreover, the proposed work is a general framework which can be extended to deal with other popular kernel machines, including Kernel Logistic Regression (KLR) and Support Vector Machines (SVMs). Numerical experiments on a wide range of datasets are presented to demonstrate the superiority of our algorithm in terms of generalisation performance.
引用
收藏
页码:241 / +
页数:4
相关论文
共 39 条
[11]   The kernel recursive least-squares algorithm [J].
Engel, Y ;
Mannor, S ;
Meir, R .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) :2275-2285
[12]   Efficient SVM training using low-rank kernel representations [J].
Fine, S ;
Scheinberg, K .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) :243-264
[13]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[14]  
Fung G., 2001, KDD-2001. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P77, DOI 10.1145/502512.502527
[15]  
Golub G. H., 1996, MATRIX COMPUTATIONS
[16]  
Gray A., 2000, ADV NEURAL INFORM PR, V13, P521
[17]  
Gray A. G., 2004, FAST KERNEL MATRIX V
[18]  
KEERTHI S, 2006, ADV NEURAL INFORM PR, V18
[19]  
MacKay D. J. C., 1998, Neural Networks and Machine Learning. Proceedings, P133
[20]  
Meir R, 2002, LECT NOTES ARTIF INT, V2600, P118