CryptoSite: Expanding the Druggable Proteome by Characterization and Prediction of Cryptic Binding Sites

被引:142
作者
Cimermancic, Peter [1 ,2 ]
Weinkam, Patrick [1 ]
Rettenmaier, T. Justin [3 ,4 ,5 ]
Bichmann, Leon [1 ]
Keedy, Daniel A. [1 ]
Woldeyes, Rahel A. [1 ,3 ]
Schneidman-Duhovny, Dina [1 ]
Demerdash, Omar N. [6 ]
Mitchell, Julie C. [7 ,8 ]
Wells, James A. [4 ,5 ,9 ]
Fraser, James S. [1 ]
Sali, Andrej [1 ,4 ,5 ]
机构
[1] Univ Calif San Francisco, Dept Bioengn & Therapeut Sci, San Francisco, CA 94158 USA
[2] Univ Calif San Francisco, Grad Grp Biol & Med Informat, San Francisco, CA 94158 USA
[3] Univ Calif San Francisco, Grad Grp Chem & Chem Biol, San Francisco, CA 94158 USA
[4] Univ Calif San Francisco, Pharmaceut Chem, San Francisco, CA 94158 USA
[5] Univ Calif San Francisco, Calif Inst Quantitat Biosci, San Francisco, CA 94158 USA
[6] Univ Calif Berkeley, Dept Chem & Biomol Engn, Berkeley, CA 94720 USA
[7] Univ Wisconsin, Dept Biochem, Madison, WI 53706 USA
[8] Univ Wisconsin, Dept Math, Madison, WI 53706 USA
[9] Univ Calif San Francisco, Cellular & Mol Pharmacol, San Francisco, CA 94158 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
cryptic binding sites; protein dynamics; undruggable proteins; machine learning; FRAGMENT-BASED IDENTIFICATION; ALLOSTERIC INHIBITION; LIGAND-BINDING; HOT-SPOTS; DYNAMICS SIMULATIONS; PROTEINS; FLEXIBILITY; DISCOVERY; INTERFACE; DENSITY;
D O I
10.1016/j.jmb.2016.01.029
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Many proteins have small-molecule binding pockets that are not easily detectable in the ligand-free structures. These cryptic sites require a conformational change to become apparent; a cryptic site can therefore be defined as a site that forms a pocket in a holo structure, but not in the apo structure. Because many proteins appear to lack druggable pockets, understanding and accurately identifying cryptic sites could expand the set of drug targets. Previously, cryptic sites were identified experimentally by fragment-based ligand discovery and computationally by long molecular dynamics simulations and fragment docking. Here, we begin by constructing a set of structurally defined apo-holo pairs with cryptic sites. Next, we comprehensively characterize the cryptic sites in terms of their sequence, structure, and dynamics attributes. We find that cryptic sites tend to be as conserved in evolution as traditional binding pockets but are less hydrophobic and more flexible. Relying on this characterization, we use machine learning to predict cryptic sites with relatively high accuracy (for our benchmark, the true positive and false positive rates are 73% and 29%, respectively). We then predict cryptic sites in the entire structurally characterized human proteome (11,201 structures, covering 23% of all residues in the proteome). CryptoSite increases the size of the potentially "druggable" human proteome from similar to 40% to similar to 78% of disease-associated proteins. Finally, to demonstrate the utility of our approach in practice, we experimentally validate a cryptic site in protein tyrosine phosphatase 1B using a covalent ligand and NMR spectroscopy. The CryptoSite Web server is available at http://salilab.org/cryptosite. Published by Elsevier Ltd.
引用
收藏
页码:709 / 719
页数:11
相关论文
共 66 条
  • [1] Small-molecule inhibitors of protein-protein interactions: Progressing towards the dream
    Arkin, MR
    Wells, JA
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (04) : 301 - 317
  • [2] Druggability Assessment of Allosteric Proteins by Dynamics Simulations in the Presence of Probe Molecules
    Bakan, Ahmet
    Nevins, Neysa
    Lakdawala, Ami S.
    Bahar, Ivet
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2012, 8 (07) : 2435 - 2447
  • [3] Functional flexibility of human cyclin-dependent kinase-2 and its evolutionary conservation
    Bartova, Iveta
    Koca, Jaroslav
    Otyepka, Michal
    [J]. PROTEIN SCIENCE, 2008, 17 (01) : 22 - 33
  • [4] Binding MOAD, a high-quality protein-ligand database
    Benson, Mark L.
    Smith, Richard D.
    Khazanov, Nickolay A.
    Dimcheff, Brandon
    Beaver, John
    Dresslar, Peter
    Nerothin, Jason
    Carlson, Heather A.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D674 - D678
  • [5] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [6] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [7] Discovery of multiple hidden allosteric sites by combining Markov state models and experiments
    Bowman, Gregory R.
    Bolin, Eric R.
    Hart, Kathryn M.
    Maguire, Brendan C.
    Marqusee, Susan
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (09) : 2734 - 2739
  • [8] Equilibrium fluctuations of a single folded protein reveal a multitude of potential cryptic allosteric sites
    Bowman, Gregory R.
    Geissler, Phillip L.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (29) : 11681 - 11686
  • [9] Fragment-based identification of druggable 'hot spots' of proteins using Fourier domain correlation techniques
    Brenke, Ryan
    Kozakov, Dima
    Chuang, Gwo-Yu
    Beglov, Dmitri
    Hall, David
    Landon, Melissa R.
    Mattos, Carla
    Vajda, Sandor
    [J]. BIOINFORMATICS, 2009, 25 (05) : 621 - 627
  • [10] Effects of conformational dynamics on predicted protein druggability
    Brown, Scott P.
    Hajduk, Philip J.
    [J]. CHEMMEDCHEM, 2006, 1 (01) : 70 - +