Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile

被引:0
作者
Ruchi Verma
Grish C. Varshney
G. P. S. Raghava
机构
[1] Institute of Microbial Technology,Bioinformatics Centre
[2] Institute of Microbial Technology,Cell biology and Immunology
来源
Amino Acids | 2010年 / 39卷
关键词
 ; Mitochondria; Support vector machine; Position specific scoring matrix; Online server;
D O I
暂无
中图分类号
学科分类号
摘要
The rate of human death due to malaria is increasing day-by-day. Thus the malaria causing parasite Plasmodium falciparum (PF) remains the cause of concern. With the wealth of data now available, it is imperative to understand protein localization in order to gain deeper insight into their functional roles. In this manuscript, an attempt has been made to develop prediction method for the localization of mitochondrial proteins. In this study, we describe a method for predicting mitochondrial proteins of malaria parasite using machine-learning technique. All models were trained and tested on 175 proteins (40 mitochondrial and 135 non-mitochondrial proteins) and evaluated using five-fold cross validation. We developed a Support Vector Machine (SVM) model for predicting mitochondrial proteins of P. falciparum, using amino acids and dipeptides composition and achieved maximum MCC 0.38 and 0.51, respectively. In this study, split amino acid composition (SAAC) is used where composition of N-termini, C-termini, and rest of protein is computed separately. The performance of SVM model improved significantly from MCC 0.38 to 0.73 when SAAC instead of simple amino acid composition was used as input. In addition, SVM model has been developed using composition of PSSM profile with MCC 0.75 and accuracy 91.38%. We achieved maximum MCC 0.81 with accuracy 92% using a hybrid model, which combines PSSM profile and SAAC. When evaluated on an independent dataset our method performs better than existing methods. A web server PFMpred has been developed for predicting mitochondrial proteins of malaria parasites (http://www.imtech.res.in/raghava/pfmpred/).
引用
收藏
页码:101 / 110
页数:9
相关论文
共 136 条
[1]  
Ashburner M(2000)Gene ontology: tool for the unification of biology. The Gene Ontology Consortium Nat Genet 25 25-29
[2]  
Ball CA(2003)Properties and prediction of mitochondrial transit peptides from Mol Biochem Parasitol 132 59-66
[3]  
Blake JA(2004)  Nucleic Acids Res 32 W414-W419
[4]  
Botstein D(2002)ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST Comput Chem 26 293-296
[5]  
Butler H(2005)Prediction of protein structural classes by support vector machines Peptides 24 159-161
[6]  
Cherry JM(2008)Support vector machines for prediction of protein signal sequences and their cleavage sites J Theor Biol 253 388-392
[7]  
Davis AP(2006)Predicting protein structural class based on multi-features fusion J Proteome Res 5 1888-1897
[8]  
Dolinski K(2006)Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers Biochem Biophys Res Commun 347 150-157
[9]  
Dwight SS(2007)Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization Biochem Biophys Res Commun 360 339-345
[10]  
Eppig JT(2007)MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM J Cell Biochem 100 665-678