Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information

Cited by: 44
Authors
Chauhan, Jagat S. [1 ]
Mishra, Nitish K. [1 ]
Raghava, Gajendra P. S. [1 ]
Affiliations
[1] Inst Microbial Technol IMTECH, Bioinformat Ctr, Chandigarh 160036, India
Source
BMC BIOINFORMATICS | 2010 / Vol. 11
Keywords
SUPPORT VECTOR MACHINES; AMINO-ACID-COMPOSITION; DNA-BINDING PROTEINS; ATP-BINDING; GUANINE; SITES; DISCRIMINATION; IDENTIFICATION; RECEPTOR; ADENINE;
DOI
10.1186/1471-2105-11-301
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Guanosine triphosphate (GTP)-binding proteins play an important role in the regulation of G-proteins. Predicting the GTP-interacting residues in a protein is therefore one of the major challenges in computational biology. In this study, an attempt has been made to develop a computational method for predicting GTP-interacting residues in a protein with high accuracy (Acc), precision (Prec) and recall (Rc).
Results: All models developed in this study were trained and tested on a non-redundant (40% similarity) dataset using five-fold cross-validation. First, we developed neural network based models using single sequence and PSSM profile, which achieved a maximum Matthews Correlation Coefficient (MCC) of 0.24 (Acc 61.30%) and 0.39 (Acc 68.88%), respectively. Second, we developed support vector machine (SVM) based models using single sequence and PSSM profile, which achieved a maximum MCC of 0.37 (Prec 0.73, Rc 0.57, Acc 67.98%) and 0.55 (Prec 0.80, Rc 0.73, Acc 77.17%), respectively. This work also introduces, for the first time, the concept of predicting GTP-interacting dipeptides (two consecutive GTP-interacting residues) and tripeptides (three consecutive GTP-interacting residues). An SVM based model for predicting GTP-interacting dipeptides using PSSM profile achieved an MCC of 0.64 with precision 0.87, recall 0.74 and accuracy 81.37%. Similarly, an SVM based model for predicting GTP-interacting tripeptides using PSSM profile achieved an MCC of 0.70 with precision 0.93, recall 0.73 and accuracy 83.98%.
Conclusion: These results show that the PSSM based method performs better than the single sequence based method. The prediction models based on dipeptides or tripeptides are more accurate than the traditional model based on single residues. A web server, "GTPBinder" (http://www.imtech.res.in/raghava/gtpbinder/), based on the above models has been developed for predicting GTP-interacting residues in a protein.
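The abstract reports accuracy, precision, recall and MCC, and defines a dipeptide or tripeptide as two or three consecutive GTP-interacting residues. The following is a minimal sketch of how such quantities are conventionally computed, not the authors' implementation. The function names `binary_metrics` and `k_peptide_starts`, the confusion-matrix counts and the example label vector are all hypothetical and chosen for illustration only; none of these numbers come from the paper.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Matthews Correlation Coefficient; defined as 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "mcc": mcc}


def k_peptide_starts(labels, k):
    """Start indices of every run of k consecutive interacting residues.

    labels is a per-residue list of 0/1 flags (1 = GTP-interacting).
    k=2 gives dipeptides and k=3 gives tripeptides, following the
    definition in the abstract; how the paper encodes them as model
    inputs is not stated here, so this only locates the positives.
    """
    return [i for i in range(len(labels) - k + 1) if all(labels[i:i + k])]


# Illustrative counts only (assumed, not results from the paper).
example = binary_metrics(tp=40, fp=10, tn=40, fn=10)

# Illustrative per-residue labels (assumed).
residue_labels = [0, 1, 1, 1, 0, 1, 1, 0]
dipeptides = k_peptide_starts(residue_labels, 2)
tripeptides = k_peptide_starts(residue_labels, 3)
```

MCC is worth reporting alongside accuracy because interacting residues are usually a small minority of a protein, and accuracy alone can look high for a model that rarely predicts the positive class.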
Pages: 9
Related papers
32 in total
  • [1] Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
    Ahmad, S
    Gromiha, MM
    Sarai, A
    [J]. BIOINFORMATICS, 2004, 20 (04) : 477 - 486
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] Electrostatic potential of nucleotide-free protein is sufficient for discrimination between adenine and guanine-specific binding sites
    Basu, G
    Sivanesan, D
    Kawabata, T
    Go, N
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 342 (03) : 1053 - 1066
  • [4] SuperSite: dictionary of metabolite and drug binding sites in proteins
    Bauer, Raphael Andre
    Guenther, Stefan
    Jansen, Dominic
    Heeger, Carolin
    Thaben, Paul Florian
    Preissner, Robert
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D195 - D200
  • [5] Pcleavage: an SVM based method for prediction of constitutive proteasome and immunoproteasome cleavage sites in antigenic sequences
    Bhasin, M
    Raghava, GPS
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : W202 - W207
  • [6] Support vector machines for predicting the specificity of GalNAc-transferase
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. PEPTIDES, 2002, 23 (01) : 205 - 208
  • [7] Prediction of protein structural classes by support vector machines
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. COMPUTERS & CHEMISTRY, 2002, 26 (03) : 293 - 296
  • [8] Identification of ATP binding residues of a protein from its primary sequence
    Chauhan, Jagat S.
    Mishra, Nitish K.
    Raghava, Gajendra P. S.
    [J]. BMC BIOINFORMATICS, 2009, 10 : 434
  • [9] Coupling interaction between thromboxane A2 receptor and alpha-13 subunit of guanine nucleotide-binding protein
    Chou, KC
    [J]. JOURNAL OF PROTEOME RESEARCH, 2005, 4 (05) : 1681 - 1686
  • [10] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349