Classification of Mass Spectrometric Data Based on Kernel Sliced Inverse Regression and Linear Discriminant Analysis

被引:0
作者
Cheng Zhong [1 ]
Zhu Ai-Shi [1 ]
Zhang Li-Qing [1 ]
机构
[1] Zhejiang Univ Sci & Technol, Dept Biol & Chem Engn, Hangzhou 310012, Zhejiang, Peoples R China
关键词
Sliced inverse regression; principal component analysis; kernel function; linear discriminate analysis; pattern classification; mass spectrometry;
D O I
暂无
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Extracting the most discriminatory features was important in mass spectrometry recognition tasks. In the case of a small number of mass spectrometric samples, the existed methods for extracting discriminatory features encountered various problems, such as the over-fitting, the multicollinearity within numerous predictor variables, nonlinear quantitative relationship between the structure and properties and etc. A novel classification method was constructed by combination of the kernel sliced inverse regression (KSIR) with linear discriminant analysis (LDA). The resulting discriminate model based on this proposed approach (KSIR-LDA) was divided into two steps: the first step was to perform the SIR algorithm in the reproducing kernel Hilbert space (RKHS) induced by applying the kernel trick to the original high dimensional mass spectrometric data for nonlinear dimension reduction and feature extraction, and the second step was that to build the LDA discriminate model in making use of the extracted feature variates. Finally, application to the soft drinks four-group classification problem was presented with a comparison to some other methods. The results show that it is an effective classification method in the structural risk minimization, non-linear characteristics, avoiding the over-fitting and strong generalization ability. At the same time, the KSIR-LDA discriminate model is more concise and can be used to observe the structure of the set of samples.
引用
收藏
页码:1657 / 1661
页数:5
相关论文
共 14 条
  • [1] Duda R. O., 2001, PATTERN CLASSIFICATI, P202
  • [2] DUDA RO, 1973, PATTERN CLASSIFICATI, P113
  • [3] PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL
    GELADI, P
    KOWALSKI, BR
    [J]. ANALYTICA CHIMICA ACTA, 1986, 185 : 1 - 17
  • [4] Jing Ling, 2004, Journal of China Agricultural University, V9, P79
  • [6] Li YZ, 2007, CHINESE J ANAL CHEM, V35, P1331
  • [7] Lin YP, 2007, CHINESE J ANAL CHEM, V35, P1535
  • [8] Nonlinear component analysis as a kernel eigenvalue problem
    Scholkopf, B
    Smola, A
    Muller, KR
    [J]. NEURAL COMPUTATION, 1998, 10 (05) : 1299 - 1319
  • [9] Tao SH, 2005, CHINESE J ANAL CHEM, V33, P50
  • [10] Theodoridis S, 2006, PATTERN RECOGNITION, 3RD EDITION, P1