AlphaFold2 fails to predict protein fold switching

Cited by: 85
Authors
Chakravarty, Devlina [1]
Porter, Lauren L. [1,2]
Affiliations
[1] Natl Ctr Biotechnol Informat, NIH, Natl Lib Med, Bethesda, MD 20894 USA
[2] NHLBI, NIH, Bethesda, MD USA
Funding
National Institutes of Health (USA)
Keywords
AlphaFold2; fold-switching; protein-folding; structural heterogeneity; evolution
DOI
10.1002/pro.4353
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
AlphaFold2 has revolutionized protein structure prediction by leveraging sequence information to rapidly model protein folds with atomic-level accuracy. Nevertheless, previous work has shown that these predictions tend to be inaccurate for structurally heterogeneous proteins. To systematically assess factors that contribute to this inaccuracy, we tested AlphaFold2's performance on 98 fold-switching proteins, which assume at least two distinct-yet-stable secondary and tertiary structures. Topological similarities were quantified between five predicted and two experimentally determined structures of each fold-switching protein. Overall, 94% of AlphaFold2 predictions captured one experimentally determined conformation but not the other. Despite these biased results, AlphaFold2's estimated confidences were moderate-to-high for 74% of fold-switching residues, a result that contrasts with overall low confidences for intrinsically disordered proteins, which are also structurally heterogeneous. To investigate factors contributing to this disparity, we quantified sequence variation within the multiple sequence alignments used to generate AlphaFold2's predictions of fold-switching and intrinsically disordered proteins. Unlike intrinsically disordered regions, whose sequence alignments show low conservation, fold-switching regions had conservation rates statistically similar to canonical single-fold proteins. Furthermore, intrinsically disordered regions had systematically lower prediction confidences than either fold-switching or single-fold proteins, regardless of sequence conservation. AlphaFold2's high prediction confidences for fold switchers indicate that it uses sophisticated pattern recognition to search for one most probable conformer rather than protein biophysics to model a protein's structural ensemble. Thus, it is not surprising that its predictions often fail for proteins whose properties are not fully apparent from solved protein structures.
Our results emphasize the need to look at protein structure as an ensemble and suggest that systematic examination of fold-switching sequences may reveal propensities for multiple stable secondary and tertiary structures.
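
The comparison described in the abstract (five predicted models scored against two experimentally determined conformations) can be illustrated with a minimal sketch. It assumes that topological similarity is expressed as a TM-score and that a score of at least 0.5 counts as capturing a fold; the abstract states neither the metric nor the cutoff, so both are assumptions. The function names and the example scores are hypothetical and are not data from the paper.

```python
from collections import Counter

# Assumed cutoff: TM-score >= 0.5 is a commonly used indicator that two
# structures share the same fold. The abstract does not state the cutoff used.
ASSUMED_TM_THRESHOLD = 0.5


def classify_prediction(tm_fold1, tm_fold2, threshold=ASSUMED_TM_THRESHOLD):
    """Label one predicted model by which experimental conformation it matches."""
    hit1 = tm_fold1 >= threshold
    hit2 = tm_fold2 >= threshold
    if hit1 and hit2:
        return "both"
    if hit1:
        return "fold1"
    if hit2:
        return "fold2"
    return "neither"


def summarize(tm_pairs, threshold=ASSUMED_TM_THRESHOLD):
    """Count labels over all predicted models of one protein."""
    return Counter(classify_prediction(a, b, threshold) for a, b in tm_pairs)


# Made-up TM-scores for five predicted models of a single hypothetical protein:
# (score against conformation 1, score against conformation 2).
example_scores = [
    (0.82, 0.31),
    (0.79, 0.28),
    (0.85, 0.33),
    (0.44, 0.41),
    (0.80, 0.30),
]

summary = summarize(example_scores)
for label, count in sorted(summary.items()):
    print(f"{label}: {count}")
```

In this hypothetical case four of the five models match only the first conformation and one matches neither, which is the kind of one-sided outcome the abstract reports for most fold-switching proteins.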
Pages: 11