Prediction of protein structure class by coupling improved genetic algorithm and support vector machine

被引:0
作者
Z.-C. Li
X.-B. Zhou
Y.-R. Lin
X.-Y. Zou
机构
[1] Sun Yat-Sen University,School of Chemistry and Chemical Engineering
来源
Amino Acids | 2008年 / 35卷
关键词
Feature selection; Genetic algorithm; Protein structure class; Support vector machine;
D O I
暂无
中图分类号
学科分类号
摘要
Structural class characterizes the overall folding type of a protein or its domain. Most of the existing methods for determining the structural class of a protein are based on a group of features that only possesses a kind of discriminative information for the prediction of protein structure class. However, different types of discriminative information associated with primary sequence have been completely missed, which undoubtedly has reduced the success rate of prediction. We present a novel method for the prediction of protein structure class by coupling the improved genetic algorithm (GA) with the support vector machine (SVM). This improved GA was applied to the selection of an optimized feature subset and the optimization of SVM parameters. Jackknife tests on the working datasets indicated that the prediction accuracies for the different classes were in the range of 97.8–100% with an overall accuracy of 99.5%. The results indicate that the approach has a high potential to become a useful tool in bioinformatics.
引用
收藏
页码:581 / 590
页数:9
相关论文
共 401 条
[81]  
Chou KC(2005)Low-frequency Fourier spectrum for predicting membrane protein types Protein J 24 385-4225
[82]  
Elrod DW(2002)Using Fourier spectrum analysis and pseudo amino acid composition for prediction of membrane protein types Eur J Biochem 269 4219-371
[83]  
Chou KC(2003)Prediction of protein structural class by amino acid and polypeptide composition Comput Biol Chem 27 363-1182
[84]  
Elrod DW(1993)A chaotic approach to maintain the population diversity of genetic algorithm in network training Protein Sci 2 1170-260
[85]  
Chou KC(2006)Cross-validation of protein structural class prediction using statistical clustering and neural networks J Theor Biol 243 252-1615
[86]  
Elrod DW(2007)Pseudo amino acid composition and multi-class support vector machines approach for conotoxin superfamily classification Pattern Recognit Lett 28 1610-492
[87]  
Chou KC(2006)Using pseudo amino acid composition to predict protein subnuclear localization: approached with PSSM Protein Peptide Lett 13 489-402
[88]  
Maggiora GM(2003)Predicting protein structural class with AdaBoost learner J Protein Chem 22 395-265
[89]  
Chou KC(2007)Application of pseudo amino acid composition for predicting protein subcellular location: stochastic signal processing approach J Theor Biol 247 259-756
[90]  
Shen HB(2005)Prediction of membrane protein types from sequences and position-specific scoring matrices Biochem Biophys Res Comm 337 752-292