Comparison study on statistical features of predicted secondary structures for protein structural class prediction: From content to position

被引:35
作者
Dai, Qi [1 ]
Li, Yan [1 ]
Liu, Xiaoqing [2 ]
Yao, Yuhua [1 ]
Cao, Yunjie [1 ]
He, Pingan [3 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
来源
BMC BIOINFORMATICS | 2013年 / 14卷
基金
中国国家自然科学基金;
关键词
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINE; SEQUENCES; CLASSIFICATION; DATABASE; REPRESENTATION; SIMILARITY; HOMOLOGY; CATH; SCOP;
D O I
10.1186/1471-2105-14-152
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Many content-based statistical features of secondary structural elements (CBF-PSSEs) have been proposed and achieved promising results in protein structural class prediction, but until now position distribution of the successive occurrences of an element in predicted secondary structure sequences hasn't been used. It is necessary to extract some appropriate position-based features of the secondary structural elements for prediction task. Results: We proposed some position-based features of predicted secondary structural elements (PBF-PSSEs) and assessed their intrinsic ability relative to the available CBF-PSSEs, which not only offers a systematic and quantitative experimental assessment of these statistical features, but also naturally complements the available comparison of the CBF-PSSEs. We also analyzed the performance of the CBF-PSSEs combined with the PBF-PSSE and further constructed a new combined feature set, PBF11CBF-PSSE. Based on these experiments, novel valuable guidelines for the use of PBF-PSSEs and CBF-PSSEs were obtained. Conclusions: PBF-PSSEs and CBF-PSSEs have a compelling impact on protein structural class prediction. When combining with the PBF-PSSE, most of the CBF-PSSEs get a great improvement over the prediction accuracies, so the PBF-PSSEs and the CBF-PSSEs have to work closely so as to make significant and complementary contributions to protein structural class prediction. Besides, the proposed PBF-PSSE's performance is extremely sensitive to the choice of parameter k. In summary, our quantitative analysis verifies that exploring the position information of predicted secondary structural elements is a promising way to improve the abilities of protein structural class prediction.
引用
收藏
页数:14
相关论文
共 42 条
  • [1] Accurate prediction of protein structural classes using functional domains and predicted secondary structure sequences
    Adl, Amin Ahmadi
    Nowzari-Dalini, Abbas
    Xue, Bin
    Uversky, Vladimir N.
    Qian, Xiaoning
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2012, 29 (06) : 623 - 633
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] SCOP database in 2004: refinements integrate structure and sequence family data
    Andreeva, A
    Howorth, D
    Brenner, SE
    Hubbard, TJP
    Chothia, C
    Murzin, AG
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D226 - D229
  • [4] [Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
  • [5] Protein secondary structure prediction for a single-sequence using hidden semi-Markov models
    Aydin, Zafer
    Altunbasak, Yucel
    Borodovsky, Mark
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [6] A new representation for protein secondary structure prediction based on frequent patterns
    Birzele, Fabian
    Kramer, Stefan
    [J]. BIOINFORMATICS, 2006, 22 (21) : 2628 - 2634
  • [7] Prediction of protein structural classes by support vector machines
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. COMPUTERS & CHEMISTRY, 2002, 26 (03): : 293 - 296
  • [8] Prediction of protein structural class with Rough Sets
    Cao, YF
    Liu, S
    Zhang, LD
    Qin, J
    Wang, J
    Tang, KX
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [9] Using pseudo-amino acid composition and support vector machine to predict protein structural class
    Chen, Chao
    Tian, Yuan-Xin
    Zou, Xiao-Yong
    Cai, Pei-Xiang
    Mo, Jin-Yuan
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) : 444 - 448
  • [10] Prediction of protein structural class using novel evolutionary collocation-based sequence representation
    Chen, Ke
    Kurgan, Lukasz A.
    Ruan, Jishou
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (10) : 1596 - 1604