Identification of Mitochondrial Proteins of Malaria Parasite Adding the New Parameter

被引:1
作者
Feng Yonge [1 ]
Xie Weixia [1 ]
机构
[1] Inner Mongolia Agr Univ, Coll Sci, Hohhot 010018, Peoples R China
关键词
Mitochondrial proteins of malaria parasite; protein secondary structure; amino acid composition; dipeptide composition; analysis of variance; support vector machine; SUPPORT VECTOR MACHINE; PREDICTING SUBCELLULAR-LOCALIZATION; FEATURE-SELECTION; EVOLUTIONARY INFORMATION; MEMBRANE-PROTEINS; IDENTIFY; SITES;
D O I
10.2174/1570178615666180608100348
中图分类号
O62 [有机化学];
学科分类号
070303 ; 081704 ;
摘要
Malaria has been one of the serious infectious diseases caused by Plasmodium falciparum (P. falciparum). Mitochondrial proteins of P. falciparum are regarded as effective drug targets against malaria. Thus, it is necessary to accurately identify mitochondria' proteins of malaria parasite. Many algorithms have been proposed for the prediction of mitochondria' proteins of malaria parasite and yielded the better results. However, the parameters used by these methods were primarily based on amino acid sequences. In this study, we added a novel parameter for predicting mitochondria' proteins of malaria parasite based on protein secondary structure. Firstly, we extracted three feature parameters, namely, three kinds of protein secondary structures compositions (3PSS), 20 amino acid compositions (20AAC) and 400 dipeptide compositions (400DC), and used the analysis of variance (ANOVA) to screen 400 dipeptides. Secondly, we adopted these features to predict mitochondrial proteins of malaria parasite by using support vector machine (SVM). Finally, we found that 1) adding the feature of protein secondary structure (3PSS) can indeed improve the prediction accuracy. This result demonstrated that the parameter of protein secondary structure is a valid feature in the prediction of mitochondrial proteins of malaria parasite; 2) feature combination can improve the prediction's results; feature selection can reduce the dimension and simplify the calculation. We achieved the sensitivity (Sn) of 98.16%, the specificity (Sp) of 97.64% and overall accuracy (Acc) of 97.88% with 0.957 of Mathew's correlation coefficient (MCC) by using 3PSS+ 20AAC+ 34DC as a feature in 15-fold cross-validation. This result is compared with that of the similar work in the same dataset, showing the superiority of our work.
引用
收藏
页码:258 / 262
页数:5
相关论文
共 52 条
[1]   Properties and prediction of mitochondrial transit peptides from Plasmodium falciparum [J].
Bender, A ;
van Dooren, GG ;
Ralph, SA ;
McFadden, GI ;
Schneider, G .
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2003, 132 (02) :59-66
[2]   Scalable web services for the PSIPRED Protein Analysis Workbench [J].
Buchan, Daniel W. A. ;
Minneci, Federico ;
Nugent, Tim C. O. ;
Bryson, Kevin ;
Jones, David T. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W349-W357
[3]   SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines [J].
Cao, Renzhi ;
Wang, Zheng ;
Wang, Yiheng ;
Cheng, Jianlin .
BMC BIOINFORMATICS, 2014, 15
[4]   PFRES: protein fold classification by using evolutionary information and predicted secondary structure [J].
Chen, Ke ;
Kurgan, Lukasz .
BIOINFORMATICS, 2007, 23 (21) :2843-2850
[5]   iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties [J].
Chen, Wei ;
Yang, Hui ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao .
BIOINFORMATICS, 2017, 33 (22) :3518-3523
[6]   Identification of Bacterial Cell Wall Lyases via Pseudo Amino Acid Composition [J].
Chen, Xin-Xin ;
Tang, Hua ;
Li, Wen-Chao ;
Wu, Hao ;
Chen, Wei ;
Ding, Hui ;
Lin, Hao .
BIOMED RESEARCH INTERNATIONAL, 2016, 2016
[7]   Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet [J].
Chen, Ying-Li ;
Li, Qian-Zhong ;
Zhang, Li-Qing .
AMINO ACIDS, 2012, 42 (04) :1309-1316
[8]   Some remarks on protein attribute prediction and pseudo amino acid composition [J].
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2011, 273 (01) :236-247
[9]   Recent Advances in Conotoxin Classification by Using Machine Learning Methods [J].
Dao, Fu-Ying ;
Yang, Hui ;
Su, Zhen-Dong ;
Yang, Wuritu ;
Wu, Yun ;
Ding, Hui ;
Chen, Wei ;
Tang, Hua ;
Lin, Hao .
MOLECULES, 2017, 22 (07)
[10]   Identification of mycobacterial membrane proteins and their types using over-represented tripeptide compositions [J].
Ding, Chen ;
Yuan, Lu-Feng ;
Guo, Shou-Hui ;
Lin, Hao ;
Chen, Wei .
JOURNAL OF PROTEOMICS, 2012, 77 :321-328