Bagging linear sparse Bayesian learning models for variable selection in cancer diagnosis

被引:20
作者
Lu, Chuan [1 ]
Devos, Andy
Suykens, Johan A. K.
Arus, Carles
Van Huffel, Sabine
机构
[1] Univ Wales, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
[2] Katholieke Univ Leuven, Dept Elect Engn, SCD, SISTA,ESAT, B-3001 Heverlee, Belgium
[3] Univ Autonoma Barcelona, Dept Bioquim & Biol Mol, E-08193 Barcelona, Spain
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2007年 / 11卷 / 03期
关键词
bagging; kernel-based probabilistic classifiers; magnetic resonance spectroscopy (MRS); microarray; sparse Bayesian learning (SBL); variable selection (VS);
D O I
10.1109/TITB.2006.889702
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates variable selection (VS) and classification for biomedical datasets with a small sample size and a very high input dimension. The sequential sparse Bayesian learning methods with linear bases are used as the basic VS algorithm. Selected variables are fed to the kernel-based probabilistic classifiers: Bayesian least squares support vector machines (BayLSSVMs) and relevance vector machines (RVMs). We employ the bagging techniques for both VS and model building in order to improve the reliability of the selected variables and the predictive performance. This modeling strategy is applied to real-life medlical classification problems, including two binary cancer diagnosis problems based on microarray data and a brain tumor multiclass classification problem using spectra acquired via magnetic resonance spectroscopy. The work is experimentally compared to other VS methods. It is shown that the use of bagging can improve the reliability and stability of both VS and model prediction.
引用
收藏
页码:338 / 347
页数:10
相关论文
共 32 条
  • [1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [2] [Anonymous], 1998, ADV NEURAL INFORM PR
  • [3] [Anonymous], 2003, Advances in Learning Theory: Methods, Models and Applications
  • [4] [Anonymous], 2002, Least Squares Support Vector Machines
  • [5] Bishop CM., 1995, Neural networks for pattern recognition
  • [6] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
  • [7] Assignment of the 2.03 ppm resonance in in vivo 1H MRS of human brain tumour cystic fluid:: contribution of macromolecules
    Candiota, AP
    Majós, C
    Bassols, A
    Cabañas, ME
    Acebes, JJ
    Quintero, MR
    Arús, C
    [J]. MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE, 2004, 17 (01) : 36 - 46
  • [8] The use of multivariate MR imaging intensities versus metabolic data from MR spectroscopic imaging for brain tumour classification
    Devos, A
    Simonetti, AW
    van der Graaf, M
    Lukas, L
    Suykens, JAK
    Vanhamme, L
    Buydens, LMC
    Heerschap, A
    Van Huffel, S
    [J]. JOURNAL OF MAGNETIC RESONANCE, 2005, 173 (02) : 218 - 228
  • [9] Friedman J, 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
  • [10] Improving reliability of gene selection from microarray functional genomics data
    Fu, LM
    Youn, ES
    [J]. IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2003, 7 (03): : 191 - 196