Prediction of protein structure class by coupling improved genetic algorithm and support vector machine

被引:0
作者
Z.-C. Li
X.-B. Zhou
Y.-R. Lin
X.-Y. Zou
机构
[1] Sun Yat-Sen University,School of Chemistry and Chemical Engineering
来源
Amino Acids | 2008年 / 35卷
关键词
Feature selection; Genetic algorithm; Protein structure class; Support vector machine;
D O I
暂无
中图分类号
学科分类号
摘要
Structural class characterizes the overall folding type of a protein or its domain. Most of the existing methods for determining the structural class of a protein are based on a group of features that only possesses a kind of discriminative information for the prediction of protein structure class. However, different types of discriminative information associated with primary sequence have been completely missed, which undoubtedly has reduced the success rate of prediction. We present a novel method for the prediction of protein structure class by coupling the improved genetic algorithm (GA) with the support vector machine (SVM). This improved GA was applied to the selection of an optimized feature subset and the optimization of SVM parameters. Jackknife tests on the working datasets indicated that the prediction accuracies for the different classes were in the range of 97.8–100% with an overall accuracy of 99.5%. The results indicate that the approach has a high potential to become a useful tool in bioinformatics.
引用
收藏
页码:581 / 590
页数:9
相关论文
共 401 条
  • [1] Aguero-Chapin G(2006)Novel 2D maps and coupling numbers for protein sequences. The first QSAR study of polygalacturonases; isolation and prediction of a novel sequence from FEBS Lett 580 723-730
  • [2] Gonzalez-Diaz H(1997) L Proteins 29 172-185
  • [3] Molina R(2007)Understanding the recognition of protein structureal classes by amino acid composition J Mol Graph Model 26 166-178
  • [4] Varona-Santos J(2005)Proteometric study of ghrelin receptor function variations upon mutations using amino acid sequence autocorrelation vectors and genetic algorithm-based least square support vector machines J Proteome Res 4 109-111
  • [5] Uriarte E(2005)Using functional domain composition to predict enzyme family classes J Proteome Res 4 967-971
  • [6] Gonzalez-Diaz Y(2006)Predicting enzyme subclass by functional domain composition and pseudo amino acid composition J Theor Bio 238 395-400
  • [7] Bahar I(2000)Predicting membrane protein type by functional domain composition and pseudo amino acid composition Biochimie 82 783-785
  • [8] Atilgan AR(2001)Prediction of protein structural classes by neural network BMC Bioinformatics 2 1-5
  • [9] Jernigan RL(2002)Support vector machines for predicting protein structural class Comput Chem 26 293-296
  • [10] Erman B(2004)Prediction of protein structural classes by support vector machines J Theor Boil 226 373-376