Predicting protein structural class based on multi-features fusion

被引:73
作者
Chen, Chao [1 ,2 ]
Chen, Li-Xuan [3 ]
Zou, Xiao-Yong [2 ]
Cai, Pei-Xiang [2 ]
机构
[1] Guangdong Pharmaceut Univ, Sch Tradit Chinese Med, Guangzhou 510006, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Chem & Chem Engn, Guangzhou 510275, Guangdong, Peoples R China
[3] Guangzhou Inst Standardizat, Guangzhou 510170, Guangdong, Peoples R China
关键词
protein structural classes; support vector machine; PROFEAT; fusion; prediction;
D O I
10.1016/j.jtbi.2008.03.009
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Structural class characterizes the overall folding type of a protein or its domain and the prediction of protein structural class has become both an important and a challenging topic in protein science. Moreover, the prediction itself can stimulate the development of novel predictors that may be straightforwardly applied to many other relational areas. In this paper, 10 frequently used sequence-derived structural and physicochemical features, which can be easily computed by the PROFEAT (Protein Features) web server, were taken as inputs of support vector machines to develop statistical learning models for predicting the protein structural class. More importantly, a strategy of merging different features, called best-first search, was developed. It was shown through the rigorous jackknife cross-validation test that the success rates by our method were significantly improved. We anticipate that the present method may also have important impacts on boosting the predictive accuracies for a series of other protein attributes, such as subcellular localization, membrane types, enzyme family and subfamily classes, among many others. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:388 / 392
页数:5
相关论文
共 71 条
  • [51] PseAAC: A flexible web server for generating various kinds of protein pseudo amino acid composition
    Shen, Hong-Bin
    Chou, Kuo-Chen
    [J]. ANALYTICAL BIOCHEMISTRY, 2008, 373 (02) : 386 - 388
  • [52] Fuzzy KNN for predicting membrane protein types from pseudo-amino acid composition
    Shen, Hong-Bin
    Yang, Jie
    Chou, Kuo-Chen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 240 (01) : 9 - 13
  • [53] Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition
    Shi, J.-Y.
    Zhang, S.-W.
    Pan, Q.
    Cheng, Y.-M.
    Xie, J.
    [J]. AMINO ACIDS, 2007, 33 (01) : 69 - 74
  • [54] Prediction of protein structural classes using support vector machines
    Sun, X. -D.
    Huang, R. -B.
    [J]. AMINO ACIDS, 2006, 30 (04) : 469 - 475
  • [55] Using stacked generalization to predict membrane protein types based on pseudo-amino acid composition
    Wang, Shuang-Quan
    Yang, Jie
    Chou, Kuo-Chen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 242 (04) : 941 - 946
  • [56] Using pseudo amino acid composition to predict protein structural classes: Approached with complexity measure factor
    Xiao, X
    Shao, SH
    Huang, ZD
    Chou, KC
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2006, 27 (04) : 478 - 482
  • [57] Xiao X, 2006, AMINO ACIDS, V30, P49, DOI 10.1007/s00726-005-0225-6
  • [58] An application of gene comparative image for predicting the effect on replication ratio by HBV virus gene missense mutation
    Xiao, X
    Shao, SH
    Ding, YS
    Huang, ZD
    Chen, XJ
    Chou, KC
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2005, 235 (04) : 555 - 565
  • [59] Improving the prediction of human microRNA target genes by using ensemble algorithm
    Yan, Xingqi
    Chao, Tengfei
    Tu, Kang
    Zhang, Yu
    Xie, Lu
    Gong, Yanhua
    Yuan, Jiangang
    Qiang, Boqin
    Peng, Xiaozhong
    [J]. FEBS LETTERS, 2007, 581 (08) : 1587 - 1593
  • [60] ZHANG CT, 1992, PROTEIN SCI, V1, P401