Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment

被引:684
作者
Yang, Jianyi [1 ]
Roy, Ambrish [1 ]
Zhang, Yang [1 ,2 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
关键词
I-TASSER; PREDICTION; RESIDUES; ALGORITHM; DATABASE; SEARCH; SERVER;
D O I
10.1093/bioinformatics/btt447
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Identification of protein-ligand binding sites is critical to protein function annotation and drug discovery. However, there is no method that could generate optimal binding site prediction for different protein types. Combination of complementary predictions is probably the most reliable solution to the problem. Results: We develop two new methods, one based on binding-specific substructure comparison (TM-SITE) and another on sequence profile alignment (S-SITE), for complementary binding site predictions. The methods are tested on a set of 500 non-redundant proteins harboring 814 natural, drug-like and metal ion molecules. Starting from low-resolution protein structure predictions, the methods successfully recognize >51% of binding residues with average Matthews correlation coefficient (MCC) significantly higher (with P-value >10(-9) in student t-test) than other state-of-the-art methods, including COFACTOR, FINDSITE and ConCavity. When combining TM-SITE and S-SITE with other structure-based programs, a consensus approach (COACH) can increase MCC by 15% over the best individual predictions. COACH was examined in the recent community-wide COMEO experiment and consistently ranked as the best method in last 22 individual datasets with the Area Under the Curve score 22.5% higher than the second best method. These data demonstrate a new robust approach to protein-ligand binding site recognition, which is ready for genome-wide structure-based function annotations.
引用
收藏
页码:2588 / 2595
页数:8
相关论文
共 31 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Pocketome via comprehensive identification and classification of ligand binding envelopes
    An, JH
    Totrov, M
    Abagyan, R
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (06) : 752 - 761
  • [3] [Anonymous], 2006, P ACMSIGKDD INT C KN
  • [4] [Anonymous], STRUCTURE BASED DRUG
  • [5] A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation
    Brylinski, Michal
    Skolnick, Jeffrey
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (01) : 129 - 134
  • [6] Predicting functionally important residues from sequence conservation
    Capra, John A.
    Singh, Mona
    [J]. BIOINFORMATICS, 2007, 23 (15) : 1875 - 1882
  • [7] Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure
    Capra, John A.
    Laskowski, Roman A.
    Thornton, Janet M.
    Singh, Mona
    Funkhouser, Thomas A.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (12)
  • [8] Prediction of protein functional residues from sequence by probability density estimation
    Fischer, J. D.
    Mayer, C. E.
    Soeding, J.
    [J]. BIOINFORMATICS, 2008, 24 (05) : 613 - 620
  • [9] 3D-Jury: a simple approach to improve protein structure predictions
    Ginalski, K
    Elofsson, A
    Fischer, D
    Rychlewski, L
    [J]. BIOINFORMATICS, 2003, 19 (08) : 1015 - 1018
  • [10] APPLICATION OF THE 3-DIMENSIONAL STRUCTURES OF PROTEIN TARGET MOLECULES IN STRUCTURE-BASED DRUG DESIGN
    GREER, J
    ERICKSON, JW
    BALDWIN, JJ
    VARNEY, MD
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1994, 37 (08) : 1035 - 1054