Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2

Cited by: 70
Authors
Alderson, T. Reid [1, 2]
Pritisanac, Iva [3, 4, 5]
Kolaric, Desika [5]
Moses, Alan M. [3]
Forman-Kay, Julie D. [1, 4]
Affiliations
[1] Univ Toronto, Dept Biochem, Toronto, ON M5S 1A8, Canada
[2] Univ Toronto, Dept Mol Genet, Toronto, ON M5S 1A8, Canada
[3] Univ Toronto, Dept Cell & Syst Biol, Toronto, ON M5S 3G5, Canada
[4] Hosp Sick Children, Mol Med Program, Toronto, ON M5G 0A4, Canada
[5] Med Univ Graz, Dept Mol Biol & Biochem, Gottfried Schatz Res Ctr Cell Signaling Metab & A, A-8010 Graz, Austria
Funding
Canada Foundation for Innovation; National Institutes of Health (USA); Canadian Institutes of Health Research
Keywords
AlphaFold2; intrinsically disordered proteins; structural biology; conditional folding; NMR spectroscopy; LIQUID PHASE-SEPARATION; SECONDARY STRUCTURE; ANGLE DISTRIBUTIONS; WEB SERVER; PROTEIN; BINDING; PREDICTION; SYNUCLEIN; HOMEODOMAIN; MUTATIONS;
DOI
10.1073/pnas.2304302120
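Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract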
The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs.
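The abstract reports a precision at a fixed false positive rate. As a rough illustration of what such a figure means, the sketch below picks the most permissive score cutoff whose false positive rate stays within a limit and reports the precision at that cutoff. The function name `precision_at_fpr`, the per-region scores, and the toy numbers are all hypothetical; this is not the authors' pipeline, and it assumes each IDR has already been reduced to a single confidence score (for example, a mean pLDDT).

```python
def precision_at_fpr(pos_scores, neg_scores, max_fpr=0.10):
    """Return (threshold, precision, recall, fpr) for the lowest cutoff with FPR <= max_fpr.

    pos_scores: scores of regions known to conditionally fold (hypothetical input).
    neg_scores: scores of regions assumed not to conditionally fold (hypothetical input).
    A region is called "conditionally folding" when its score >= threshold.
    Returns None if no cutoff satisfies the FPR limit with at least one call.
    """
    # Every observed score is a candidate cutoff; scan from most to least permissive.
    for threshold in sorted(set(pos_scores) | set(neg_scores)):
        tp = sum(s >= threshold for s in pos_scores)
        fp = sum(s >= threshold for s in neg_scores)
        fpr = fp / len(neg_scores)
        if fpr <= max_fpr and tp + fp > 0:
            precision = tp / (tp + fp)
            recall = tp / len(pos_scores)
            return threshold, precision, recall, fpr
    return None


# Toy data only; these numbers are invented for illustration.
known_folders = [90, 85, 80, 75, 40]
other_idrs = [30, 35, 40, 45, 50, 55, 60, 65, 70, 88]
result = precision_at_fpr(known_folders, other_idrs, max_fpr=0.10)
print(result)
```

Note that precision, unlike the false positive rate, depends on the ratio of positives to negatives in the evaluation set, so a value computed this way only carries over to a proteome with a similar class balance.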
Pages: 12