Recognition of Mitochondrial Proteins in Plasmodium Based on the Tripeptide Composition

被引:7
作者
Bian, Haodong [1 ]
Guo, Maozu [2 ,3 ]
Wang, Juan [1 ,4 ]
机构
[1] Inner Mongolia Univ, Sch Comp Sci, Hohhot, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
[3] Beijing Key Lab Intelligent Proc Bldg Big Data, Beijing, Peoples R China
[4] Stage Key Labs Reprod Regulat & Breeding Grasslan, Hohhot, Peoples R China
基金
中国国家自然科学基金;
关键词
malaria; Plasmodium; mitochondrion; tripeptide composition; support vector machine; SUPPORT VECTOR MACHINE; AMINO-ACID-COMPOSITION; SVM-BASED METHOD; SUBCELLULAR-LOCALIZATION; PREDICTION METHOD; CLASSIFIER; IDENTIFICATION; GENERATION;
D O I
10.3389/fcell.2020.578901
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Mitochondria play essential roles in eukaryotic cells, especially in Plasmodium cells. They have several unusual evolutionary and functional features that are incredibly vital for disease diagnosis and drug design. Thus, predicting mitochondrial proteins of Plasmodium has become a worthwhile work. However, existing computational methods can only predict mitochondrial proteins ofPlasmodium falciparum(P. falciparumfor short), and these methods have low accuracy. It is highly desirable to design a classifier with high accuracy for predicting mitochondrial proteins for all Plasmodium species, not onlyP. falciparum. We proposed a novel method, named as PM-OTC, for predicting mitochondrial proteins in Plasmodium. PM-OTC uses the Support Vector Machine (SVM) as the classifier and the selected tripeptide composition as the features. We adopted the 5-fold cross-validation method to train and test PM-OTC. Results demonstrate that PM-OTC achieves an accuracy of 94.91%, and performances of PM-OTC are superior to other methods.
引用
收藏
页数:7
相关论文
共 39 条
[1]   Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition [J].
Ahmad, Khurshid ;
Waris, Muhammad ;
Hayat, Maqsood .
JOURNAL OF MEMBRANE BIOLOGY, 2016, 249 (03) :293-304
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Improving support vector machine classifiers by modifying kernel functions [J].
Amari, S ;
Wu, S .
NEURAL NETWORKS, 1999, 12 (06) :783-789
[4]   A new method for non-parametric multivariate analysis of variance [J].
Anderson, MJ .
AUSTRAL ECOLOGY, 2001, 26 (01) :32-46
[5]   Properties and prediction of mitochondrial transit peptides from Plasmodium falciparum [J].
Bender, A ;
van Dooren, GG ;
Ralph, SA ;
McFadden, GI ;
Schneider, G .
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2003, 132 (02) :59-66
[6]   ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W414-W419
[7]   GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W383-W389
[8]   Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet [J].
Chen, Ying-Li ;
Li, Qian-Zhong ;
Zhang, Li-Qing .
AMINO ACIDS, 2012, 42 (04) :1309-1316
[9]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[10]   Multi-class protein fold recognition using support vector machines and neural networks [J].
Ding, CHQ ;
Dubchak, I .
BIOINFORMATICS, 2001, 17 (04) :349-358