Exploring nonlinear relationships in chemical data using kernel-based methods

被引:74
作者
Cao, Dong-Sheng [1 ]
Liang, Yi-Zeng [1 ]
Xu, Qing-Song [2 ]
Hu, Qian-Nan [3 ]
Zhang, Liang-Xiao [1 ]
Fu, Guang-Hui [2 ]
机构
[1] Cent S Univ, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Hunan, Peoples R China
[2] Cent S Univ, Sch Math Sci & Comp Technol, Changsha 410083, Hunan, Peoples R China
[3] Wuhan Univ, Coll Pharmacy, Syst Drug Design Lab, Wuhan 430071, Peoples R China
关键词
Kernel methods; Dual solution; Support vector machines (SVMs); Kernel principal component analysis (KPCA); Kernel partial least squares (KPLS); Kernel Fisher discriminant analysis (KFDA); PARTIAL LEAST-SQUARES; RADIAL BASIS FUNCTIONS; CROSS-VALIDATION; DISCRIMINANT-ANALYSIS; PATTERN-RECOGNITION; NEURAL-NETWORKS; PCA ALGORITHMS; WIDE DATA; REGRESSION; MODEL;
D O I
10.1016/j.chemolab.2011.02.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Kernel methods, in particular support vector machines, have been further extended into a new class of methods, which could effectively solve nonlinear problems in chemistry by using simple linear transformation. In fact, the kernel function used in kernel methods might be regarded as a general protocol to deal with nonlinear data in chemistry. In this paper, the basic idea and modularity of kernel methods, together with some simple examples, are discussed in detail to give an in-depth understanding for kernel methods. Three key ingredients of kernel methods, namely dual form, nonlinear mapping and kernel function, provide a consistent framework of kernel-based algorithms. The modularity of kernel methods allows linear algorithms to combine with any kernel function. Thus, some commonly used chemometric algorithms are easily extended to their kernel versions. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:106 / 115
页数:10
相关论文
共 56 条
[1]   A nonlinear partial least squares algorithm using quadratic fuzzy inference system [J].
Abdel-Rahman, Araby I. ;
Lim, Gino J. .
JOURNAL OF CHEMOMETRICS, 2009, 23 (9-10) :530-537
[2]   A comparison of nine PLS1 algorithms [J].
Andersson, Martin .
JOURNAL OF CHEMOMETRICS, 2009, 23 (9-10) :518-529
[3]   Generalized discriminant analysis using a kernel approach [J].
Baudat, G ;
Anouar, FE .
NEURAL COMPUTATION, 2000, 12 (10) :2385-2404
[4]  
Breiman L., 1984, Classification and regression trees, DOI DOI 10.1201/9781315139470
[5]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[6]   About kernel latent variable approaches and SVM [J].
Czekaj, T ;
Wu, W ;
Walczak, B .
JOURNAL OF CHEMOMETRICS, 2005, 19 (5-7) :341-354
[7]   RETRACTED: Double cross-validation and improved sensitivity of the rapid screen of mild traumatic brain injury (Retracted Article. See vol 29, pg 904, 2007) [J].
De Monte, VE ;
Geffen, GM ;
May, CR ;
McFarland, K .
JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 2004, 26 (05) :628-644
[8]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[9]   Improvements on cross-validation: The .632+ bootstrap method [J].
Efron, B ;
Tibshirani, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) :548-560
[10]  
Efron B., 1993, INTRO BOOTSTRAP, DOI 10.1007/978-1-4899-4541-9