Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile

Cited by: 47
Authors
Verma, Ruchi [1]
Varshney, Grish C.
Raghava, G. P. S. [1]
Affiliations
[1] Inst Microbial Technol, Bioinformat Ctr, Chandigarh, India
Keywords
Plasmodium falciparum; Mitochondria; Support vector machine; Position specific scoring matrix; Online server; Fusing functional domain; Subcellular localization; Web server; Evolutionary information; Gene ontology; Location; PLOC; Identification; Classifier
DOI
10.1007/s00726-009-0381-1
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The rate of human death due to malaria is increasing day by day, so the malaria-causing parasite Plasmodium falciparum (PF) remains a cause for concern. With the wealth of data now available, it is imperative to understand protein localization in order to gain deeper insight into the functional roles of its proteins. In this study, we describe a machine-learning method for predicting mitochondrial proteins of the malaria parasite. All models were trained and tested on 175 proteins (40 mitochondrial and 135 non-mitochondrial) and evaluated using five-fold cross-validation. We developed Support Vector Machine (SVM) models for predicting mitochondrial proteins of P. falciparum using amino acid composition and dipeptide composition, achieving maximum Matthews correlation coefficients (MCC) of 0.38 and 0.51, respectively. We also used split amino acid composition (SAAC), in which the composition of the N-terminus, the C-terminus, and the rest of the protein is computed separately. The performance of the SVM model improved significantly, from MCC 0.38 to 0.73, when SAAC was used as input instead of simple amino acid composition. In addition, an SVM model built on the composition of the PSSM profile achieved MCC 0.75 and accuracy 91.38%. A hybrid model that combines the PSSM profile and SAAC achieved the maximum MCC of 0.81 with accuracy 92%. When evaluated on an independent dataset, our method performs better than existing methods. A web server, PFMpred, has been developed for predicting mitochondrial proteins of malaria parasites (http://www.imtech.res.in/raghava/pfmpred).
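The two quantities the abstract relies on, split amino acid composition and MCC, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names and the terminal window lengths (`n_term=25`, `c_term=25`) are assumptions chosen for illustration, since the abstract does not state how many residues define each terminus.

```python
from math import sqrt

# The 20 standard amino acids, in a fixed order for the feature vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(segment):
    """Percent composition of the 20 standard amino acids in one segment."""
    counts = [segment.count(aa) for aa in AMINO_ACIDS]
    total = sum(counts)
    if total == 0:
        # Empty segment (or only non-standard residues): all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [100.0 * c / total for c in counts]


def split_aac(sequence, n_term=25, c_term=25):
    """Split amino acid composition as a 60-dimensional feature vector.

    The window lengths are hypothetical defaults, not values from the paper.
    The N-terminal, middle, and C-terminal segments never overlap; for short
    sequences the middle (and possibly the C-terminal) segment is empty.
    """
    seq = sequence.upper()
    c_start = max(n_term, len(seq) - c_term)
    n_part = seq[:n_term]
    middle = seq[n_term:c_start]
    c_part = seq[c_start:]
    return composition(n_part) + composition(middle) + composition(c_part)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Conventional fallback when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom
```

Under these assumptions, `split_aac(protein_sequence)` yields 60 values (20 per segment) that could be passed to any SVM implementation, and `mcc` scores the resulting predictions. Computing the three segments separately lets a classifier see terminal signals that whole-sequence composition would average away, which is consistent with the improvement the abstract reports for SAAC.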
Pages: 101-110
Number of pages: 10