Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data

被引:13
作者
Li, Ethan H. [1 ]
Spaman, Laura E. [1 ]
Tejero, Roberto [1 ]
Huang, Yuanpeng Janet [1 ]
Ramelot, Theresa A. [1 ]
Fraga, Keith J. [1 ]
Prestegard, James H. [2 ]
Kennedy, Michael A. [3 ]
Montelione, Gaetano T. [1 ]
机构
[1] Rensselaer Polytech Inst, Ctr Biotechnol & Interdisciplinary Sci, Dept Chem & Chem Biol, Troy, NY 12180 USA
[2] Univ Georgia, Complex Carbohydrate Res Ctr, Athens, GA 30602 USA
[3] Miami Univ, Dept Chem & Biochem, Oxford, OH 45056 USA
基金
美国国家卫生研究院;
关键词
AlphaFold2; Artificial intelligence; Deep learning; Protein NMR; Model validation; NMR data; STRUCTURE VALIDATION; X-RAY; SEQUENCE; PREDICTIONS; REFINEMENT; ALGORITHMS; PRECISION; SOFTWARE; GEOMETRY; FOLD;
D O I
10.1016/j.jmr.2023.107481
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crys-tallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open-source protein NMR data sets for such "blind" targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15N-1H residual dipolar coupling data. For these nine small (70- 108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.& COPY; 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:11
相关论文
共 67 条
[11]   Sampling alternative conformational states of transporters and receptors with AlphaFold2 [J].
del Alamo, Diego ;
Sala, Davide ;
Mchaourab, Hassane S. ;
Meiler, Jens .
ELIFE, 2022, 11
[12]  
Delano W. L., 2002, PYMOL MOL GRAPHICS S
[13]   The accuracy of protein structures in solution determined by AlphaFold and NMR [J].
Fowler, Nicholas J. ;
Williamson, Mike P. .
STRUCTURE, 2022, 30 (07) :925-+
[14]   Inhibitor Bound Dengue NS2B-NS3pro Reveals Multiple Dynamic Binding Modes [J].
Gibbs, Alan C. ;
Steele, Ruth ;
Liu, Gaohua ;
Tounge, Brett A. ;
Montelione, Gaetano T. .
BIOCHEMISTRY, 2018, 57 (10) :1591-1602
[15]   Multi-state modeling of G-protein coupled receptors at experimental accuracy [J].
Heo, Lim ;
Feig, Michael .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2022, :1873-1885
[16]   Protein NMR recall, precision, and F-measure scores (RPF scores):: Structure quality assessment measures based on information retrieval statistics [J].
Huang, YJ ;
Powers, R ;
Montelione, GT .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2005, 127 (06) :1665-1674
[17]   Assessment of prediction methods for protein structures determined by NMR in CASP14: Impact of AlphaFold2 [J].
Huang, Yuanpeng Janet ;
Zhang, Ning ;
Bersch, Beate ;
Fidelis, Krzysztof ;
Inouye, Masayori ;
Ishida, Yojiro ;
Kryshtafovych, Andriy ;
Kobayashi, Naohiro ;
Kuroda, Yutaka ;
Liu, Gaohua ;
LiWang, Andy ;
Swapna, G. V. T. ;
Wu, Nan ;
Yamazaki, Toshio ;
Montelione, Gaetano T. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) :1959-1976
[18]   RPF: a quality assessment tool for protein NMR structures [J].
Huang, Yuanpeng Janet ;
Rosato, Antonio ;
Singh, Gautam ;
Montelione, Gaetano T. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (W1) :W542-W546
[19]   CCNet: Criss-Cross Attention for Semantic Segmentation [J].
Huang, Zilong ;
Wang, Xinggang ;
Huang, Lichao ;
Huang, Chang ;
Wei, Yunchao ;
Liu, Wenyu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :603-612
[20]   THE SOLUTION STRUCTURE OF EGLIN-C BASED ON MEASUREMENTS OF MANY NOES AND COUPLING-CONSTANTS AND ITS COMPARISON WITH X-RAY STRUCTURES [J].
HYBERTS, SG ;
GOLDBERG, MS ;
HAVEL, TF ;
WAGNER, G .
PROTEIN SCIENCE, 1992, 1 (06) :736-751