Bayesian kernel projections for classification of high dimensional data

被引:4
作者
Domijan, Katarina [1 ]
Wilson, Simon P. [2 ]
机构
[1] NUI Maynooth, Dept Math, Maynooth, Kildare, Ireland
[2] Trinity Coll Dublin, Sch Comp Sci, Dublin, Ireland
关键词
Bayesian inference; Multinomial logistic regression; Reproducing kernel Hilbert spaces; Kernel principal components analysis; Bayesian decision theory; SUPPORT VECTOR MACHINES; ADAPTIVE SPARSENESS; REGRESSION; VARIABLES; SELECTION; ARTICLE; BROWNE; CHOICE;
D O I
10.1007/s11222-009-9161-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A Bayesian multi-category kernel classification method is proposed. The algorithm performs the classification of the projections of the data to the principal axes of the feature space. The advantage of this approach is that the regression coefficients are identifiable and sparse, leading to large computational savings and improved classification performance. The degree of sparsity is regulated in a novel framework based on Bayesian decision theory. The Gibbs sampler is implemented to find the posterior distributions of the parameters, thus probability distributions of prediction can be obtained for new data points, which gives a more complete picture of classification. The algorithm is aimed at high dimensional data sets where the dimension of measurements exceeds the number of observations. The applications considered in this paper are microarray, image processing and near-infrared spectroscopy data.
引用
收藏
页码:203 / 216
页数:14
相关论文
共 50 条
  • [1] AIZERMAN MA, 1965, AUTOMAT REM CONTR+, V25, P821
  • [2] [Anonymous], 1996, Evaluation of Gaussian Processes and other Methods for Non-Linear Regression
  • [3] Bakir GH, 2004, ADV NEUR IN, V16, P449
  • [4] Bernardo J. M., 1994, Bayesian theory, DOI 10.1002/9780470316870
  • [5] Bishop C.M., 2000, P C UNC ART INT, P46
  • [6] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [7] Brown PJ, 1999, BIOMETRIKA, V86, P635
  • [8] Chakraborty S., 2007, Sankhya, V69, P514
  • [9] Using unlabelled data to update classification rules with applications in food authenticity studies
    Dean, N
    Murphy, TB
    Downey, G
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2006, 55 : 1 - 14
  • [10] Denison D, 2002, Bayesian methods for nonlinear classification and regression