Support Vector Regression Based QSPR for the Prediction of Retention Time of Peptides in Reversed-Phase Liquid Chromatography

被引:15
作者
Golmohammadi, Hassan [1 ]
Dashtbozorgi, Zahra [2 ]
Heyden, Yvan Vander [3 ]
机构
[1] Islamic Azad Univ, Young Researchers & Elite Club, Shahr E Rey Branch, Tehran, Iran
[2] Islamic Azad Univ, Young Researchers & Elite Club, Cent Tehran Branch, Tehran, Iran
[3] Vrije Univ Brussel, Inst Pharmaceut, Dept Analyt Chem & Pharmaceut Technol, FABI, B-1090 Brussels, Belgium
关键词
Reversed-phase liquid chromatography; Peptides; Retention time; Quantitative structure-property relationship; Support vector machine; STRUCTURE-PROPERTY RELATIONSHIP; MULTIPLE LINEAR-REGRESSION; ARTIFICIAL NEURAL-NETWORKS; PROTEIN SEPARATION; ORGANIC-COMPOUNDS; FEATURE-SELECTION; MACHINE; MODEL; IDENTIFICATION; COEFFICIENTS;
D O I
10.1007/s10337-014-2819-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In the present study, Quantitative Structure-Property Relationship (QSPR) models were developed to investigate the retention times (t(R)) of various peptides in seven reversed-phase liquid chromatography systems using Partial Least Squares (PLS), Artificial Neural Network (ANN) and Support Vector Machine (SVM) techniques. Different types of molecular descriptors were calculated to represent the molecular structures of the various compounds studied. Important descriptors were selected by a Genetic Algorithm-Partial Least Square (GA-PLS) method. The four descriptors selected using GA-PLS were used as inputs for PLS, ANN and SVM to build models to predict the retention times. Our study reveals that the relation between the chemical properties and retention time is a nonlinear phenomenon and that the PLS method is not capable to properly model it. The results obtained demonstrate that, for all seven data sets, the t(R) values estimated by SVM were in good agreement with the experimental data, and the performances of the SVM models were comparable or superior to those of PLS and ANN.
引用
收藏
页码:7 / 19
页数:13
相关论文
共 57 条
[1]  
Alma O.G., 2012, ASIAN J MATH STAT, V5, P82, DOI [DOI 10.3923/AJMS.2012.82.92, 10.3923/ajms.2012.82.92]
[2]  
[Anonymous], MOPAC FOR WINDOWS
[3]  
[Anonymous], DRAGON SOFTWARE VERS
[4]  
[Anonymous], MATLAB 7 0
[5]  
[Anonymous], 2010, RECENT ADV QSAR STUD
[6]  
[Anonymous], HYP 4 WIND
[7]  
[Anonymous], ARTIFICIAL NEURAL NE
[8]  
Awad M., 2015, EFFICIENT LEARNING M, P67, DOI [10.1007/978-1-4302-5990-9_4, DOI 10.1007/978-1-4302-5990-9_4]
[9]   Prediction of peptide retention at different HPLC conditions from multiple linear regression models [J].
Baczek, T ;
Wiczling, P ;
Marszall, M ;
Vander Heyden, Y ;
Kaliszan, R .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (02) :555-563
[10]   Predictions of peptides' retention times in reversed-phase liquid chromatography as a new supportive tool to improve protein identification in proteomics [J].
Baczek, Tomasz ;
Kaliszan, Roman .
PROTEOMICS, 2009, 9 (04) :835-847