Conformational ensembles of the human intrinsically disordered proteome

被引:100
作者
Tesei, Giulio [1 ]
Trolle, Anna Ida [1 ]
Jonsson, Nicolas [1 ]
Betz, Johannes [1 ]
Knudsen, Frederik E. [1 ]
Pesce, Francesco [1 ]
Johansson, Kristoffer E. [1 ]
Lindorff-Larsen, Kresten [1 ]
机构
[1] Univ Copenhagen, Linderstrom Lang Ctr Prot Sci, Dept Biol, Struct Biol & NMR Lab, Copenhagen, Denmark
关键词
PHASE-SEPARATION; CHARGE; PROTEINS; PROVIDE;
D O I
10.1038/s41586-023-07004-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Intrinsically disordered proteins and regions (collectively, IDRs) are pervasive across proteomes in all kingdoms of life, help to shape biological functions and are involved in numerous diseases. IDRs populate a diverse set of transiently formed structures and defy conventional sequence-structure-function relationships1. Developments in protein science have made it possible to predict the three-dimensional structures of folded proteins at the proteome scale2. By contrast, there is a lack of knowledge about the conformational properties of IDRs, partly because the sequences of disordered proteins are poorly conserved and also because only a few of these proteins have been characterized experimentally. The inability to predict structural properties of IDRs across the proteome has limited our understanding of the functional roles of IDRs and how evolution shapes them. As a supplement to previous structural studies of individual IDRs3, we developed an efficient molecular model to generate conformational ensembles of IDRs and thereby to predict their conformational properties from sequences4,5. Here we use this model to simulate nearly all of the IDRs in the human proteome. Examining conformational ensembles of 28,058 IDRs, we show how chain compaction is correlated with cellular function and localization. We provide insights into how sequence features relate to chain compaction and, using a machine-learning model trained on our simulation data, show the conservation of conformational properties across orthologues. Our results recapitulate observations from previous studies of individual protein systems and exemplify how to link-at the proteome scale-conformational ensembles with cellular function and localization, amino acid sequence, evolutionary conservation and disease variants. Our freely available database of conformational properties will encourage further experimental investigation and enable the generation of hypotheses about the biological roles and evolution of IDRs. A computational model generates conformational ensembles of 28,058 intrinsically disordered proteins and regions (IDRs) in the human proteome and sheds light on the relationship between sequence, conformational properties and functions of IDRs.
引用
收藏
页码:897 / 904
页数:29
相关论文
共 99 条
[1]  
Ahmed Samrein B M, 2017, J Mol Signal, V12, P2, DOI 10.5334/1750-2187-12-2
[2]   A structural biology community assessment of AlphaFold2 applications [J].
Akdel, Mehmet ;
Pires, Douglas E., V ;
Porta Pardo, Eduard ;
Janes, Jurgen ;
Zalevsky, Arthur O. ;
Meszaros, Balint ;
Bryant, Patrick ;
Good, Lydia L. ;
Laskowski, Roman A. ;
Pozzati, Gabriele ;
Shenoy, Aditi ;
Zhu, Wensi ;
Kundrotas, Petras ;
Serra, Victoria Ruiz ;
Rodrigues, Carlos H. M. ;
Dunham, Alistair S. ;
Burke, David ;
Borkakoti, Neera ;
Velankar, Sameer ;
Frost, Adam ;
Basquin, Jerome ;
Lindorff-Larsen, Kresten ;
Bateman, Alex ;
Kajava, Andrey, V ;
Valencia, Alfonso ;
Ovchinnikov, Sergey ;
Durairaj, Janani ;
Ascher, David B. ;
Thornton, Janet M. ;
Davey, Norman E. ;
Stein, Amelie ;
Elofsson, Arne ;
Croll, Tristan, I ;
Beltrao, Pedro .
NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2022, 29 (11) :1056-+
[3]   Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 [J].
Alderson, Reid ;
Pritisanac, Iva ;
Kolaric, Desika ;
Moses, Alan M. ;
Forman-Kay, Julie D. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (44)
[4]   The Gene Ontology knowledgebase in 2023 [J].
Aleksander, Suzi A. ;
Balhoff, James ;
Carbon, Seth ;
Cherry, J. Michael ;
Drabkin, Harold J. ;
Ebert, Dustin ;
Feuermann, Marc ;
Gaudet, Pascale ;
Harris, Nomi L. ;
Hill, David P. ;
Lee, Raymond ;
Mi, Huaiyu ;
Moxon, Sierra ;
Mungall, Christopher J. ;
Muruganugan, Anushya ;
Mushayahama, Tremayne ;
Sternberg, Paul W. ;
Thomas, Paul D. ;
Van Auken, Kimberly ;
Ramsey, Jolene ;
Siegele, Deborah A. ;
Chisholm, Rex L. ;
Fey, Petra ;
Aspromonte, Maria Cristina ;
Nugnes, Maria Victoria ;
Quaglia, Federica ;
Tosatto, Silvio ;
Giglio, Michelle ;
Nadendla, Suvarna ;
Antonazzo, Giulia ;
Attrill, Helen ;
dos Santos, Gil ;
Marygold, Steven ;
Strelets, Victor ;
Tabone, Christopher J. ;
Thurmond, Jim ;
Zhou, Pinglei ;
Ahmed, Saadullah H. ;
Asanitthong, Praoparn ;
Luna Buitrago, Diana ;
Erdol, Meltem N. ;
Gage, Matthew C. ;
Ali Kadhum, Mohamed ;
Li, Kan Yan Chloe ;
Long, Miao ;
Michalak, Aleksandra ;
Pesala, Angeline ;
Pritazahra, Armalya ;
Saverimuttu, Shirin C. C. ;
Su, Renzhi .
GENETICS, 2023, 224 (01)
[5]   OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more [J].
Altenhoff, Adrian M. ;
Train, Clement-Marie ;
Gilbert, Kimberly J. ;
Mediratta, Ishita ;
de Farias, Tarcisio Mendes ;
Moi, David ;
Nevers, Yannis ;
Radoykova, Hale-Seda ;
Rossier, Victor ;
Vesztrocy, Alex Warwick ;
Glover, Natasha M. ;
Dessimoz, Christophe .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D373-D379
[6]   HOOMD-blue: A Python']Python package for high-performance molecular dynamics and hard particle Monte Carlo simulations [J].
Anderson, Joshua A. ;
Glaser, Jens ;
Glotzer, Sharon C. .
COMPUTATIONAL MATERIALS SCIENCE, 2020, 173
[7]   UNIVERSAL FEATURES OF POLYMER SHAPES [J].
ARONOVITZ, JA ;
NELSON, DR .
JOURNAL DE PHYSIQUE, 1986, 47 (09) :1445-1456
[8]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[9]   Mutations in the KIF21B kinesin gene cause neurodevelopmental disorders through imbalanced canonical motor activity [J].
Asselin, Laure ;
Alvarez, Jose Rivera ;
Heide, Solveig ;
Bonnet, Camille S. ;
Tilly, Peggy ;
Vitet, Helene ;
Weber, Chantal ;
Bacino, Carlos A. ;
Baranano, Kristin ;
Chassevent, Anna ;
Dameron, Amy ;
Faivre, Laurence ;
Hanchard, Neil A. ;
Mahida, Sonal ;
McWalter, Kirsty ;
Mignot, Cyril ;
Nava, Caroline ;
Rastetter, Agnes ;
Streff, Haley ;
Thauvin-Robinet, Christel ;
Weiss, Marjan M. ;
Zapata, Gladys ;
Zwijnenburg, Petra J. G. ;
Saudou, Frederic ;
Depienne, Christel ;
Golzio, Christelle ;
Heron, Delphine ;
Godin, Juliette D. .
NATURE COMMUNICATIONS, 2020, 11 (01)
[10]   Genetic variation associated with condensate dysregulation in disease [J].
Banani, Salman F. ;
Afeyan, Lena K. ;
Hawken, Susana W. ;
Henninger, Jonathan E. ;
Dall'Agnese, Alessandra ;
Clark, Victoria E. ;
Platt, Jesse M. ;
Oksuz, Ozgur ;
Hannett, Nancy M. ;
Sagi, Ido ;
Lee, Tong Ihn ;
Young, Richard A. .
DEVELOPMENTAL CELL, 2022, 57 (14) :1776-+