Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet

被引:27
作者
Chen, Ying-Li [1 ,2 ]
Li, Qian-Zhong [1 ]
Zhang, Li-Qing [1 ,2 ,3 ]
机构
[1] Inner Mongolia Univ, Sch Phys Sci & Technol, Lab Theoret Biophys, Hohhot, Peoples R China
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
[3] Virginia Tech, Program Genet Bioinformat & Computat Biol, Blacksburg, VA USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Plasmodium falciparum; Mitochondrial proteins; Increment of diversity; Reduced amino acid alphabet; Hydropathy distribution; SUPPORT VECTOR MACHINE; SUBCELLULAR LOCATION; LOCALIZATION; RECOGNITION; SEQUENCE;
D O I
10.1007/s00726-010-0825-7
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Due to the complexity of Plasmodium falciparum (PF) genome, predicting mitochondrial proteins of PF is more difficult than other species. In this study, using the n-peptide composition of reduced amino acid alphabet (RAAA) obtained from structural alphabet named Protein Blocks as feature parameter, the increment of diversity (ID) is firstly developed to predict mitochondrial proteins. By choosing the 1-peptide compositions on the N-terminal regions with 20 residues as the only input vector, the prediction performance achieves 86.86% accuracy with 0.69 Mathew's correlation coefficient (MCC) by the jackknife test. Moreover, by combining with the hydropathy distribution along protein sequence and several reduced amino acid alphabets, we achieved maximum MCC 0.82 with accuracy 92% in the jackknife test by using the developed ID model. When evaluating on an independent dataset our method performs better than existing methods. The results indicate that the ID is a simple and efficient prediction method for mitochondrial proteins of malaria parasite.
引用
收藏
页码:1309 / 1316
页数:8
相关论文
共 50 条
  • [1] Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet
    Ying-Li Chen
    Qian-Zhong Li
    Li-Qing Zhang
    Amino Acids, 2012, 42 : 1309 - 1316
  • [2] Using reduced amino acid composition to predict defensin family and subfamily: Integrating similarity measure and structural alphabet
    Zuo, Yong-Chun
    Li, Qian-Zhong
    PEPTIDES, 2009, 30 (10) : 1788 - 1793
  • [3] Using pseudo-amino acid composition and support vector machine to predict protein structural class
    Chen, Chao
    Tian, Yuan-Xin
    Zou, Xiao-Yong
    Cai, Pei-Xiang
    Mo, Jin-Yuan
    JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) : 444 - 448
  • [4] Using K-minimum increment of diversity to predict secretory proteins of malaria parasite based on groupings of amino acids
    Zuo, Yong-Chun
    Li, Qian-Zhong
    AMINO ACIDS, 2010, 38 (03) : 859 - 867
  • [5] Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile
    Verma, Ruchi
    Varshney, Grish C.
    Raghava, G. P. S.
    AMINO ACIDS, 2010, 39 (01) : 101 - 110
  • [6] Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition
    Chen, Ying-Li
    Li, Qian-Zhong
    JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) : 377 - 381
  • [7] Using K-minimum increment of diversity to predict secretory proteins of malaria parasite based on groupings of amino acids
    Yong-Chun Zuo
    Qian-Zhong Li
    Amino Acids, 2010, 38 : 859 - 867
  • [8] Prediction of pattern recognition receptor family using pseudo-amino acid composition
    Gao, Qing-Bin
    Zhao, Hongyu
    Ye, Xiaofei
    He, Jia
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2012, 417 (01) : 73 - 77
  • [9] Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile
    Ruchi Verma
    Grish C. Varshney
    G. P. S. Raghava
    Amino Acids, 2010, 39 : 101 - 110
  • [10] Using stacked generalization to predict membrane protein types based on pseudo-amino acid composition
    Wang, Shuang-Quan
    Yang, Jie
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2006, 242 (04) : 941 - 946