DisProt in 2024: improving function annotation of intrinsically disordered proteins

被引:52
作者
Aspromonte, Maria Cristina [1 ]
Nugnes, Maria Victoria [1 ]
Quaglia, Federica [1 ,2 ]
Bouharoua, Adel [1 ]
Tosatto, Silvio C. E. [1 ]
Piovesan, Damiano [1 ]
机构
[1] Univ Padua, Dept Biomed Sci, Padua, Italy
[2] Natl Res Council CNR IBIOM, Inst Biomembranes Bioenerget & Mol Biotechnol, Bari, Italy
基金
欧盟地平线“2020”;
关键词
DATABASE; TOOL;
D O I
10.1093/nar/gkad928
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DisProt (URL: https://disprot.org) is the gold standard database for intrinsically disordered proteins and regions, providing valuable information about their functions. The latest version of DisProt brings significant advancements, including a broader representation of functions and an enhanced curation process. These improvements aim to increase both the quality of annotations and their coverage at the sequence level. Higher coverage has been achieved by adopting additional evidence codes. Quality of annotations has been improved by systematically applying Minimum Information About Disorder Experiments (MIADE) principles and reporting all the details of the experimental setup that could potentially influence the structural state of a protein. The DisProt database now includes new thematic datasets and has expanded the adoption of Gene Ontology terms, resulting in an extensive functional repertoire which is automatically propagated to UniProtKB. Finally, we show that DisProt's curated annotations strongly correlate with disorder predictions inferred from AlphaFold2 pLDDT (predicted Local Distance Difference Test) confidence scores. This comparison highlights the utility of DisProt in explaining apparent uncertainty of certain well-defined predicted structures, which often correspond to folding-upon-binding fragments. Overall, DisProt serves as a comprehensive resource, combining experimental evidence of disorder information to enhance our understanding of intrinsically disordered proteins and their functional implications. Graphical Abstract
引用
收藏
页码:D434 / D441
页数:8
相关论文
共 31 条
[1]   The Gene Ontology knowledgebase in 2023 [J].
Aleksander, Suzi A. ;
Balhoff, James ;
Carbon, Seth ;
Cherry, J. Michael ;
Drabkin, Harold J. ;
Ebert, Dustin ;
Feuermann, Marc ;
Gaudet, Pascale ;
Harris, Nomi L. ;
Hill, David P. ;
Lee, Raymond ;
Mi, Huaiyu ;
Moxon, Sierra ;
Mungall, Christopher J. ;
Muruganugan, Anushya ;
Mushayahama, Tremayne ;
Sternberg, Paul W. ;
Thomas, Paul D. ;
Van Auken, Kimberly ;
Ramsey, Jolene ;
Siegele, Deborah A. ;
Chisholm, Rex L. ;
Fey, Petra ;
Aspromonte, Maria Cristina ;
Nugnes, Maria Victoria ;
Quaglia, Federica ;
Tosatto, Silvio ;
Giglio, Michelle ;
Nadendla, Suvarna ;
Antonazzo, Giulia ;
Attrill, Helen ;
dos Santos, Gil ;
Marygold, Steven ;
Strelets, Victor ;
Tabone, Christopher J. ;
Thurmond, Jim ;
Zhou, Pinglei ;
Ahmed, Saadullah H. ;
Asanitthong, Praoparn ;
Luna Buitrago, Diana ;
Erdol, Meltem N. ;
Gage, Matthew C. ;
Ali Kadhum, Mohamed ;
Li, Kan Yan Chloe ;
Long, Miao ;
Michalak, Aleksandra ;
Pesala, Angeline ;
Pritazahra, Armalya ;
Saverimuttu, Shirin C. C. ;
Su, Renzhi .
GENETICS, 2023, 224 (01)
[2]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkh121, 10.1093/nar/gkp985, 10.1093/nar/gkr1065]
[3]   The InterPro protein families and domains database: 20 years on [J].
Blum, Matthias ;
Chang, Hsin-Yu ;
Chuguransky, Sara ;
Grego, Tiago ;
Kandasaamy, Swaathi ;
Mitchell, Alex ;
Nuka, Gift ;
Paysan-Lafosse, Typhaine ;
Qureshi, Matloob ;
Raj, Shriya ;
Richardson, Lorna ;
Salazar, Gustavo A. ;
Williams, Lowri ;
Bork, Peer ;
Bridge, Alan ;
Gough, Julian ;
Haft, Daniel H. ;
Letunic, Ivica ;
Marchler-Bauer, Aron ;
Mi, Huaiyu ;
Natale, Darren A. ;
Necci, Marco ;
Orengo, Christine A. ;
Pandurangan, Arun P. ;
Rivoire, Catherine ;
Sigrist, Christian J. A. ;
Sillitoe, Ian ;
Thanki, Narmada ;
Thomas, Paul D. ;
Tosatto, Silvio C. E. ;
Wu, Cathy H. ;
Bateman, Alex ;
Finn, Robert D. .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D344-D354
[4]   Sampling alternative conformational states of transporters and receptors with AlphaFold2 [J].
del Alamo, Diego ;
Sala, Davide ;
Mchaourab, Hassane S. ;
Meiler, Jens .
ELIFE, 2022, 11
[5]   Critical assessment of protein intrinsic disorder prediction (CAID) - Results of round 2 [J].
Del Conte, Alessio ;
Mehdiabadi, Mahta ;
Bouhraoua, Adel ;
Mozon, Alexander M. ;
Tosatto, Silvio C. E. ;
Piovesan, Damiano .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2023, 91 (12) :1925-1934
[6]   CAID prediction portal: a comprehensive service for predicting intrinsic disorder and binding regions in proteins [J].
Del Conte, Alessio ;
Bouhraoua, Adel ;
Mehdiabadi, Mahta ;
Clementel, Damiano ;
Monzon, Alexander Miguel ;
CAID Predictors, Damiano ;
Tosatto, Silvio C. E. ;
Piovesan, Damiano .
NUCLEIC ACIDS RESEARCH, 2023, 51 (W1) :W62-W69
[7]  
Fersht A., 2009, Structure and Function of Intrinsically Disordered Proteins
[8]   One Step Closer to the Understanding of the Relationship IDR-LCR-Structure [J].
Goncalves-Kulik, Mariane ;
Schmid, Friederike ;
Andrade-Navarro, Miguel A. .
GENES, 2023, 14 (09)
[9]   Low Complexity Induces Structure in Protein Regions Predicted as Intrinsically Disordered [J].
Goncalves-Kulik, Mariane ;
Mier, Pablo ;
Kastano, Kristina ;
Cortes, Juan ;
Bernado, Pau ;
Schmid, Friederike ;
Andrade-Navarro, Miguel A. .
BIOMOLECULES, 2022, 12 (08)
[10]   APICURON: a database to credit and acknowledge the work of biocurators [J].
Hatos, Andras ;
Quaglia, Federica ;
Piovesan, Damiano ;
Tosatto, Silvio C. E. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2021,