Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure

Cited by: 306
Authors
Capra, John A. [1, 2]
Laskowski, Roman A. [3]
Thornton, Janet M. [3]
Singh, Mona [1, 2]
Funkhouser, Thomas A. [1]
Affiliations
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08544 USA
[3] Wellcome Trust Genome Campus, European Bioinformat Inst, Cambridge, England
Funding
US National Science Foundation; UK Biotechnology and Biological Sciences Research Council
Keywords
FUNCTIONALLY IMPORTANT RESIDUES; 3-DIMENSIONAL STRUCTURE; CATALYTIC RESIDUES; ALGORITHM; CAVITIES; POCKETS; FAMILY; IDENTIFICATION; INFORMATION; SPECIFICITY;
DOI
10.1371/journal.pcbi.1000585
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Identifying a protein's functional sites is an important step towards characterizing its molecular function. Numerous structure- and sequence-based methods have been developed for this problem. Here we introduce ConCavity, a small molecule binding site prediction algorithm that integrates evolutionary sequence conservation estimates with structure-based methods for identifying protein surface cavities. In large-scale testing on a diverse set of single- and multi-chain protein structures, we show that ConCavity substantially outperforms existing methods for identifying both 3D ligand binding pockets and individual ligand binding residues. As part of our testing, we perform one of the first direct comparisons of conservation-based and structure-based methods. We find that the two approaches provide largely complementary information, which can be combined to improve upon either approach alone. We also demonstrate that ConCavity has state-of-the-art performance in predicting catalytic sites and drug binding pockets. Overall, the algorithms and analysis presented here significantly improve our ability to identify ligand binding sites and further advance our understanding of the relationship between evolutionary sequence conservation and structural and functional attributes of proteins. Data, source code, and prediction visualizations are available on the ConCavity web site (http://compbio.cs.princeton.edu/concavity/).
Pages: 18
Related papers
  • [1] Protein 3D Structure Computed from Evolutionary Sequence Variation
    Marks, Debora S.
    Colwell, Lucy J.
    Sheridan, Robert
    Hopf, Thomas A.
    Pagnani, Andrea
    Zecchina, Riccardo
    Sander, Chris
    PLOS ONE, 2011, 6 (12):
  • [2] Improving detection of protein-ligand binding sites with 3D segmentation
    Stepniewska-Dziubinska, Marta M.
    Zielenkiewicz, Piotr
    Siedlecki, Pawel
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [3] Predicting Protein Ligand Binding Sites with Structure Alignment Method on Hadoop
    Liu, Guangzhong
    Liu, Min
    Chen, Daozheng
    Chen, Lei
    Zhu, Jiali
    Zhou, Bo
    Gao, Jun
    CURRENT PROTEOMICS, 2016, 13 (02) : 113 - 121
  • [4] Evolutionary conservation of sequence motifs at sites of protein modification
    Li, Shuang
    Dohlman, Henrik G.
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2023, 299 (05)
  • [5] From Sequence Data to Protein 3D Structure Using Evolutionary Couplings
    Fieldhouse, Robert
    Hayat, Sikander
    Sheridan, Robert
    Marks, Debora
    Sander, Chris
    PROTEIN SCIENCE, 2015, 24 : 101 - 101
  • [6] Machine learning approaches for predicting protein-ligand binding sites from sequence data
    Vural, Orhun
    Jololian, Leon
    FRONTIERS IN BIOINFORMATICS, 2025, 5
  • [7] Investigation of the Importance of Protein 3D Structure for Assessing Conservation of Lysine Acetylation Sites in Protein Homologs
    Jew, Kristen M.
    Le, Van Thi Bich
    Amaral, Kiana
    Ta, Allysa
    Nguyen May, Nina M.
    Law, Melissa
    Adelstein, Nicole
    Kuhn, Misty L.
    FRONTIERS IN MICROBIOLOGY, 2022, 12
  • [8] A new protein-ligand binding sites prediction method based on the integration of protein sequence conservation information
    Dai, Tianli
    Liu, Qi
    Gao, Jun
    Cao, Zhiwei
    Zhu, Ruixin
    BMC BIOINFORMATICS, 2011, 12 : S9