Classification of the fragrance properties of chemical compounds based on support vector machine and linear discriminant analysis

被引:10
作者
Luan, F. [1 ]
Liu, H. T. [1 ]
Wen, Y. Y. [1 ]
Zhang, X. Y. [2 ]
机构
[1] Yantai Univ, Dept Appl Chem, Yantai 264005, Peoples R China
[2] Lanzhou Univ, Dept Chem, Lanzhou 730000, Peoples R China
关键词
classification; fragrance property; linear discriminant analysis; support vector machine;
D O I
10.1002/ffj.1876
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
Classification models of the fragrance properties of chemical compounds were performed using linear and non-linear models. The dataset was divided into three classes on the basis of their fragrances: apple, pineapple and rose. The three-class problem was first explored by a linear classifier approach, using linear discriminant analysis (LDA). A more accurate prediction model, the non-linear machine-learning technique, support vector machine (SVM), was subsequently investigated. Descriptors calculated from the molecular structures alone were used to represent the characteristics of compounds. The model containing four descriptors founded by SVM showed better predictive ability than LDA. The accuracy in the prediction for the three datasets was 96.6%, 80.0% and 100% for SVM, respectively. The results indicate that SVM can be used as a powerful modelling tool for QSAR studies and the selected descriptors can represent the fragrances of these chemical compounds. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:232 / 238
页数:7
相关论文
共 35 条
[1]  
[Anonymous], CODESSA TRAINING MAN
[2]  
[Anonymous], 1990, ISIS DRAW2 3
[3]  
[Anonymous], 1986, STAT ANAL
[4]  
[Anonymous], 2000, HYPERCHEM 6 01
[5]  
[Anonymous], 1997, NEURAL NETWORKS PATT
[6]   Oil-water partition coefficient of odorants: reliable index for smell but deceptive for odor rating! [J].
Brossard, C ;
Rousseau, F ;
Dumont, JP .
FOOD QUALITY AND PREFERENCE, 2002, 13 (02) :65-72
[7]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[8]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[9]   Enzyme family classification by support vector machines [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, YZ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (01) :66-76
[10]  
Chastrette M, 1997, SAR QSAR Environ Res, V6, P215, DOI 10.1080/10629369708033253