Comparison study on statistical features of predicted secondary structures for protein structural class prediction: From content to position

Cited by: 35
Authors
Dai, Qi [1]
Li, Yan [1]
Liu, Xiaoqing [2]
Yao, Yuhua [1]
Cao, Yunjie [1]
He, Pingan [3]
Affiliations
[1] Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
Source
BMC BIOINFORMATICS | 2013 / Vol. 14
Funding
National Natural Science Foundation of China;
Keywords
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINE; SEQUENCES; CLASSIFICATION; DATABASE; REPRESENTATION; SIMILARITY; HOMOLOGY; CATH; SCOP;
DOI
10.1186/1471-2105-14-152
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Many content-based statistical features of predicted secondary structural elements (CBF-PSSEs) have been proposed and have achieved promising results in protein structural class prediction, but until now the position distribution of the successive occurrences of an element in predicted secondary structure sequences has not been used. It is therefore necessary to extract appropriate position-based features of the secondary structural elements for the prediction task. Results: We proposed some position-based features of predicted secondary structural elements (PBF-PSSEs) and assessed their intrinsic ability relative to the available CBF-PSSEs. This assessment not only offers a systematic and quantitative experimental evaluation of these statistical features, but also naturally complements the available comparisons of the CBF-PSSEs. We also analyzed the performance of the CBF-PSSEs combined with the PBF-PSSE and further constructed a new combined feature set, PBF11CBF-PSSE. Based on these experiments, novel and valuable guidelines for the use of PBF-PSSEs and CBF-PSSEs were obtained. Conclusions: PBF-PSSEs and CBF-PSSEs have a compelling impact on protein structural class prediction. When combined with the PBF-PSSE, most of the CBF-PSSEs achieve substantially improved prediction accuracy, so the PBF-PSSEs and the CBF-PSSEs need to be used together in order to make significant and complementary contributions to protein structural class prediction. In addition, the performance of the proposed PBF-PSSE is extremely sensitive to the choice of the parameter k. In summary, our quantitative analysis verifies that exploring the position information of predicted secondary structural elements is a promising way to improve protein structural class prediction.
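The distinction the abstract draws between content-based and position-based features can be illustrated with a small sketch. This record does not give the paper's actual feature definitions, so everything below is an assumption made for illustration: the three-state alphabet H/E/C, the function names `content_features` and `position_features`, and in particular the reading of k as the largest gap between successive occurrences of an element that is counted. It should not be taken as the authors' method.

```python
from collections import Counter

# Assumed 3-state alphabet: H = helix, E = strand, C = coil.
ELEMENTS = "HEC"


def content_features(ss):
    """Content-based view: the fraction of each element in the string."""
    if not ss:
        raise ValueError("empty secondary structure string")
    counts = Counter(ss)
    return {e: counts[e] / len(ss) for e in ELEMENTS}


def position_features(ss, k):
    """Position-based view (hypothetical definition, not the paper's).

    For each element, build a histogram of the gaps 1..k between
    successive occurrences of that element, normalized by the total
    number of gaps. Gaps larger than k are not counted, so the result
    depends on the choice of k.
    """
    feats = {}
    for e in ELEMENTS:
        positions = [i for i, c in enumerate(ss) if c == e]
        gaps = [b - a for a, b in zip(positions, positions[1:])]
        hist = [0] * k
        for g in gaps:
            if g <= k:
                hist[g - 1] += 1
        total = len(gaps)
        feats[e] = [h / total if total else 0.0 for h in hist]
    return feats


if __name__ == "__main__":
    ss = "HHHCCEEC"  # toy predicted secondary structure string
    print(content_features(ss))
    print(position_features(ss, 3))
```

Two strings with the same composition, such as `HHHCCC` and `HCHCHC`, get identical content features but different gap histograms, which is the kind of complementary information the abstract attributes to combining the two feature families.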
Pages: 14