Sequence-based predictor of ATP-binding residues using random forest and mRMR-IFS feature selection

被引:15
作者
Ma, Xin [1 ]
Sun, Xiao [2 ]
机构
[1] Nanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R China
[2] Southeast Univ, State Key Lab Bioelect, Nanjing 210096, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Position-specific scoring matrix; Physicochemical property; Binding propensity; MUSCLE-CONTRACTION; PROTEIN; SITES; EXPRESSION; GENERATION; RELEVANCE; ACCURACY; DATABASE; ENZYMES; SERVER;
D O I
10.1016/j.jtbi.2014.06.037
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We develop a computational and statistical approach (ATPBR) for predicting ATP-binding residues in proteins from amino acid sequences by using random forests with a novel hybrid feature. The hybrid feature incorporates a new feature called PSSMPP, the predicted secondary structure and orthogonal binary vectors. The mRMR-IFS feature selection method is utilized to construct the best prediction model. At last, ATPBR achieves significantly improved performance over existing methods, with 87.53% accuracy and a Matthew's correlation coefficient of 0.554. In addition, our further analysis demonstrates that PSSMPP distinguishes more effectively between ATP-binding and non-binding residues. Besides, the optimal features selected by the mRMR-IFS method improve the prediction performance and may provide useful insights for revealing the mechanisms of ATP and proteins interactions. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:59 / 66
页数:8
相关论文
共 40 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [3] Correlations between the activities of 19 standard anticancer agents, antioxidative enzyme activities and the expression of ATP-binding cassette transporters:: comparison with the National Cancer Institute data
    Bracht, Karin
    Liebeke, Manuel
    Ritter, Christoph A.
    Gruenert, Renate
    Bednarski, Patrick J.
    [J]. ANTI-CANCER DRUGS, 2007, 18 (04) : 389 - 404
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] ATP Binding Cassette Transporter A1 (ABCA1) Associated Proteins: Potential Drug Targets in the Metabolic Syndrome and Atherosclerotic Disease?
    Buechler, Christa
    Bauer, Sabrina
    [J]. CURRENT PHARMACEUTICAL BIOTECHNOLOGY, 2012, 13 (02) : 319 - 330
  • [6] Mechanical processes in biochemistry
    Bustamante, C
    Chemla, YR
    Forde, NR
    Izhaky, D
    [J]. ANNUAL REVIEW OF BIOCHEMISTRY, 2004, 73 : 705 - 748
  • [7] Identification of ATP binding residues of a protein from its primary sequence
    Chauhan, Jagat S.
    Mishra, Nitish K.
    Raghava, Gajendra P. S.
    [J]. BMC BIOINFORMATICS, 2009, 10 : 434
  • [8] ATPsite: sequence-based prediction of ATP-binding residues
    Chen, Ke
    Mizianty, Marcin J.
    Kurgan, Lukasz
    [J]. PROTEOME SCIENCE, 2011, 9
  • [9] Investigation of Atomic Level Patterns in Protein-Small Ligand Interactions
    Chen, Ke
    Kurgan, Lukasz
    [J]. PLOS ONE, 2009, 4 (02):
  • [10] Mechanical effects of muscle contraction increase intravascular ATP draining quiescent and active skeletal muscle in humans
    Crecelius, Anne R.
    Kirby, Brett S.
    Richards, Jennifer C.
    Dinenno, Frank A.
    [J]. JOURNAL OF APPLIED PHYSIOLOGY, 2013, 114 (08) : 1085 - 1093