Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet

被引:27
作者
Chen, Ying-Li [1 ,2 ]
Li, Qian-Zhong [1 ]
Zhang, Li-Qing [1 ,2 ,3 ]
机构
[1] Inner Mongolia Univ, Sch Phys Sci & Technol, Lab Theoret Biophys, Hohhot, Peoples R China
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
[3] Virginia Tech, Program Genet Bioinformat & Computat Biol, Blacksburg, VA USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Plasmodium falciparum; Mitochondrial proteins; Increment of diversity; Reduced amino acid alphabet; Hydropathy distribution; SUPPORT VECTOR MACHINE; SUBCELLULAR LOCATION; LOCALIZATION; RECOGNITION; SEQUENCE;
D O I
10.1007/s00726-010-0825-7
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Due to the complexity of Plasmodium falciparum (PF) genome, predicting mitochondrial proteins of PF is more difficult than other species. In this study, using the n-peptide composition of reduced amino acid alphabet (RAAA) obtained from structural alphabet named Protein Blocks as feature parameter, the increment of diversity (ID) is firstly developed to predict mitochondrial proteins. By choosing the 1-peptide compositions on the N-terminal regions with 20 residues as the only input vector, the prediction performance achieves 86.86% accuracy with 0.69 Mathew's correlation coefficient (MCC) by the jackknife test. Moreover, by combining with the hydropathy distribution along protein sequence and several reduced amino acid alphabets, we achieved maximum MCC 0.82 with accuracy 92% in the jackknife test by using the developed ID model. When evaluating on an independent dataset our method performs better than existing methods. The results indicate that the ID is a simple and efficient prediction method for mitochondrial proteins of malaria parasite.
引用
收藏
页码:1309 / 1316
页数:8
相关论文
共 54 条
[1]   Properties and prediction of mitochondrial transit peptides from Plasmodium falciparum [J].
Bender, A ;
van Dooren, GG ;
Ralph, SA ;
McFadden, GI ;
Schneider, G .
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2003, 132 (02) :59-66
[2]   Feature-based prediction of non-classical and leaderless protein secretion [J].
Bendtsen, JD ;
Jensen, LJ ;
Blom, N ;
von Heijne, G ;
Brunak, S .
PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (04) :349-356
[3]   Predicting membrane protein type by functional domain composition and pseudo-amino acid composition [J].
Cai, YD ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 238 (02) :395-400
[4]   Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition [J].
Chen, Ying-Li ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) :377-381
[5]   Prediction of the subcellular location of apoptosis proteins [J].
Chen, Ying-Li ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 245 (04) :775-783
[6]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[7]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[8]   Prediction of protein cellular attributes using pseudo-amino acid composition [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :246-255
[9]   Euk-mPLoc: A fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites [J].
Chou, Kuo-Chen ;
Shen, Hong-Bin .
JOURNAL OF PROTEOME RESEARCH, 2007, 6 (05) :1728-1734
[10]   Large-scale predictions of gram-negative bacterial protein subcellular locations [J].
Chou, Kuo-Chen ;
Shen, Hong-Bin .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (12) :3420-3428