Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile

Cited by: 47
Authors
Verma, Ruchi [1]
Varshney, Grish C.
Raghava, G. P. S. [1]
Affiliations
[1] Inst Microbial Technol, Bioinformat Ctr, Chandigarh, India
Keywords
Plasmodium falciparum; Mitochondria; Support vector machine; Position specific scoring matrix; Online server; SUPPORT VECTOR MACHINE; FUSING FUNCTIONAL DOMAIN; SUBCELLULAR-LOCALIZATION; WEB-SERVER; EVOLUTIONARY INFORMATION; GENE ONTOLOGY; LOCATION; PLOC; IDENTIFICATION; CLASSIFIER;
DOI
10.1007/s00726-009-0381-1
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
The rate of human death due to malaria continues to increase, so the malaria-causing parasite Plasmodium falciparum (PF) remains a cause for concern. With the wealth of data now available, it is imperative to understand protein localization in order to gain deeper insight into the functional roles of proteins. In this study, we describe a machine-learning method for predicting mitochondrial proteins of the malaria parasite. All models were trained and tested on 175 proteins (40 mitochondrial and 135 non-mitochondrial) and evaluated using five-fold cross-validation. We developed Support Vector Machine (SVM) models for predicting mitochondrial proteins of P. falciparum using amino acid composition and dipeptide composition, which achieved maximum Matthews correlation coefficient (MCC) values of 0.38 and 0.51, respectively. We also used split amino acid composition (SAAC), in which the compositions of the N-terminus, the C-terminus, and the rest of the protein are computed separately. The performance of the SVM model improved significantly, from MCC 0.38 to 0.73, when SAAC was used as input instead of simple amino acid composition. In addition, an SVM model developed using the composition of the PSSM profile achieved MCC 0.75 and accuracy 91.38%. A hybrid model that combines the PSSM profile and SAAC achieved the maximum MCC of 0.81 with accuracy 92%. When evaluated on an independent dataset, our method performs better than existing methods. A web server, PFMpred, has been developed for predicting mitochondrial proteins of malaria parasites ( http://www.imtech.res.in/raghava/pfmpred ).
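As an illustration of the split amino acid composition and the MCC measure mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. The function names, the terminus length of 25 residues, and the choice to report compositions as percentages are all assumptions made for this example; the abstract does not state them.

```python
from collections import Counter
from math import sqrt

# The 20 standard amino acids in a fixed order (defines the feature order).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(segment):
    """Return the percentage of each standard amino acid in a segment.

    Non-standard residues (e.g. X) are counted in the segment length
    but contribute no feature of their own.
    """
    if not segment:
        return [0.0] * len(AMINO_ACIDS)
    counts = Counter(segment)
    return [100.0 * counts[aa] / len(segment) for aa in AMINO_ACIDS]


def split_composition(sequence, terminus_length=25):
    """Hypothetical SAAC helper: 60 features from three parts of a protein.

    terminus_length=25 is an assumed value, not taken from the abstract.
    Feature order: N-terminus, C-terminus, rest of the protein.
    """
    seq = sequence.upper()
    if len(seq) < 2 * terminus_length:
        raise ValueError("sequence is shorter than the two terminal segments")
    n_term = seq[:terminus_length]
    c_term = seq[-terminus_length:]
    rest = seq[terminus_length:-terminus_length]
    return composition(n_term) + composition(c_term) + composition(rest)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # common convention when a row or column sum is zero
    return (tp * tn - fp * fn) / denom
```

The 60-value vector returned by `split_composition` could then be passed to any SVM implementation. Keeping the three segments separate preserves signals that are concentrated at the termini (such as targeting sequences), which a whole-protein composition would average away.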
Pages: 101-110
Page count: 10
Related papers
50 in total
  • [31] Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition
    Ahmad, Khurshid
    Waris, Muhammad
    Hayat, Maqsood
    JOURNAL OF MEMBRANE BIOLOGY, 2016, 249 (03) : 293 - 304
  • [32] Improving secretory proteins prediction in Mycobacterium tuberculosis using the unbiased dipeptide composition with support vector machine
    Ahmed, Saeed
    Kabir, Muhammad
    Arif, Muhammad
    Ali, Zakir
    Ali, Farman
    Swati, Zar Nawab Khan
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 21 (03) : 212 - 229
  • [33] Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition
    Takeyuki Tamura
    Tatsuya Akutsu
    BMC Bioinformatics, 8
  • [34] Prediction of Subcellular Localization of Apoptosis Protein Using Chou’s Pseudo Amino Acid Composition
    Hao Lin
    Hao Wang
    Hui Ding
    Ying-Li Chen
    Qian-Zhong Li
    Acta Biotheoretica, 2009, 57 : 321 - 330
  • [35] Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine
    Chen, Chao
    Chen, Lixuan
    Zou, Xiaoyong
    Cai, Peixiang
    PROTEIN AND PEPTIDE LETTERS, 2009, 16 (01) : 27 - 31
  • [36] Using Pseudo Amino Acid Composition to Predict Protein Attributes Via Cellular Automata and Other Approaches
    Xiao, Xuan
    Chou, Kuo-Chen
    CURRENT BIOINFORMATICS, 2011, 6 (02) : 251 - 260
  • [37] Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic Archaeon Pyrococcus furiosus
    Fujshima, Kosuke
    Komasa, Mizuki
    Kitamura, Sayaka
    Suzuki, Haruo
    Tomita, Masaru
    Kanai, Akio
    DNA RESEARCH, 2007, 14 (03) : 91 - 102
  • [38] Using K-minimum increment of diversity to predict secretory proteins of malaria parasite based on groupings of amino acids
    Zuo, Yong-Chun
    Li, Qian-Zhong
    AMINO ACIDS, 2010, 38 (03) : 859 - 867
  • [39] Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses
    Esmaeili, Maryam
    Mohabatkar, Hassan
    Mohsenzadeh, Sasan
    JOURNAL OF THEORETICAL BIOLOGY, 2010, 263 (02) : 203 - 209
  • [40] Prediction of GABAA receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine
    Mohabatkar, Hassan
    Beigi, Majid Mohammad
    Esmaeili, Abolghasem
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 281 (01) : 18 - 23