Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile

Cited by: 47
Authors
Verma, Ruchi [1]
Varshney, Grish C.
Raghava, G. P. S. [1]
Affiliations
[1] Inst Microbial Technol, Bioinformat Ctr, Chandigarh, India
Keywords
Plasmodium falciparum; Mitochondria; Support vector machine; Position specific scoring matrix; Online server; SUPPORT VECTOR MACHINE; FUSING FUNCTIONAL DOMAIN; SUBCELLULAR-LOCALIZATION; WEB-SERVER; EVOLUTIONARY INFORMATION; GENE ONTOLOGY; LOCATION; PLOC; IDENTIFICATION; CLASSIFIER;
DOI
10.1007/s00726-009-0381-1
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The rate of human death due to malaria is increasing day by day, and the malaria-causing parasite Plasmodium falciparum (PF) remains a cause of concern. With the wealth of data now available, it is imperative to understand the localization of its proteins in order to gain deeper insight into their functional roles. In this study, we describe a machine-learning method for predicting mitochondrial proteins of the malaria parasite. All models were trained and tested on 175 proteins (40 mitochondrial and 135 non-mitochondrial) and evaluated using five-fold cross-validation. We developed Support Vector Machine (SVM) models for predicting mitochondrial proteins of P. falciparum using amino acid composition and dipeptide composition, which achieved maximum MCC values of 0.38 and 0.51, respectively. We also used split amino acid composition (SAAC), in which the composition of the N-terminus, the C-terminus, and the rest of the protein is computed separately. The performance of the SVM model improved significantly, from MCC 0.38 to 0.73, when SAAC was used as input instead of simple amino acid composition. In addition, an SVM model developed using the composition of the PSSM profile achieved MCC 0.75 and accuracy 91.38%. A hybrid model that combines the PSSM profile and SAAC achieved the maximum MCC of 0.81 with accuracy 92%. When evaluated on an independent dataset, our method performs better than existing methods. A web server, PFMpred, has been developed for predicting mitochondrial proteins of malaria parasites ( http://www.imtech.res.in/raghava/pfmpred ).
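The split amino acid composition described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the 25-residue terminal lengths, and the handling of short sequences are assumptions made for the example, since the abstract does not specify them.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so feature positions are stable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(segment):
    """Return the fraction of each standard amino acid in a segment.

    Non-standard characters are ignored; an empty segment yields all zeros.
    """
    counts = Counter(segment)
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[a] / total for a in AMINO_ACIDS]


def split_composition(sequence, n_len=25, c_len=25):
    """Compute a 60-dimensional SAAC-style vector: N-terminus, rest, C-terminus.

    n_len and c_len are hypothetical defaults, not values taken from the abstract.
    For sequences shorter than n_len + c_len the terminal segments overlap and
    the middle segment is empty (an assumption of this sketch).
    """
    seq = sequence.upper()
    n_part = seq[:n_len]
    c_start = max(0, len(seq) - c_len)
    c_part = seq[c_start:]
    middle = seq[n_len:c_start]  # empty when the two terminal segments overlap
    return composition(n_part) + composition(middle) + composition(c_part)
```

A vector of this kind (or its concatenation with PSSM-derived features, for a hybrid model) would then be passed to an SVM classifier; that step is not shown here.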
Pages: 101-110
Number of pages: 10
Related papers
50 in total
  • [21] Prediction of endoplasmic reticulum resident proteins using fragmented amino acid composition and support vector machine
    Kumar, Ravindra
    Kumari, Bandana
    Kumar, Manish
    PEERJ, 2017, 5
  • [22] Prediction subcellular localization of Gram-negative bacterial proteins by support vector machine using wavelet denoising and Chou's pseudo amino acid composition
    Yu, Bin
    Li, Shan
    Chen, Cheng
    Xu, Jiameng
    Qiu, Wenying
    Wu, Xue
    Chen, Ruixin
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 167 : 102 - 112
  • [23] PECM: Prediction of extracellular matrix proteins using the concept of Chou's pseudo amino acid composition
    Zhang, Jian
    Sun, Pingping
    Zhao, Xiaowei
    Ma, Zhiqiang
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 363 : 412 - 418
  • [24] Prediction of pupylation sites using the composition of k-spaced amino acid pairs
    Tung, Chun-Wei
    JOURNAL OF THEORETICAL BIOLOGY, 2013, 336 : 11 - 17
  • [25] Prediction of RNA binding sites in a protein using SVM and PSSM profile
    Kumar, Manish
    Gromiha, A. Michael
    Raghava, G. P. S.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) : 189 - 194
  • [26] Prediction of pattern recognition receptor family using pseudo-amino acid composition
    Gao, Qing-Bin
    Zhao, Hongyu
    Ye, Xiaofei
    He, Jia
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2012, 417 (01) : 73 - 77
  • [27] Mito-GSAAC: mitochondria prediction using genetic ensemble classifier and split amino acid composition
    Afridi, Tariq Habib
    Khan, Asifullah
    Lee, Yeon Soo
    AMINO ACIDS, 2012, 42 (04) : 1443 - 1454
  • [28] A protein fold classifier formed by fusing different modes of pseudo amino acid composition via PSSM
    Kavousi, Kaveh
    Moshiri, Behzad
    Sadeghi, Mehdi
    Araabi, Babak N.
    Moosavi-Movahedi, Ali Akbar
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (01) : 1 - 9
  • [29] Prediction of citrullination sites by incorporating k-spaced amino acid pairs into Chou's general pseudo amino acid composition
    Ju, Zhe
    Wang, Shi-Yun
    GENE, 2018, 664 : 78 - 83
  • [30] The Prediction of Succinylation Site in Protein by Analyzing Amino Acid Composition
    Van-Minh Bui
    Van-Nui Nguyen
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 633 - 642