UniProt: the Universal Protein Knowledgebase in 2025

Cited by: 191
Authors
Bateman, Alex [1]
Martin, Maria-Jesus [1]
Orchard, Sandra [1]
Magrane, Michele [1]
Adesina, Aduragbemi [1]
Ahmad, Shadab [1]
Bowler-Barnett, Emily H. [1]
Bye-A-Jee, Hema [1]
Carpentier, David [1]
Denny, Paul [1]
Fan, Jun [1]
Garmiri, Penelope [1]
Gonzales, Leonardo Jose da Costa [1]
Hussein, Abdulrahman [1]
Ignatchenko, Alexandr [1]
Insana, Giuseppe [1]
Ishtiaq, Rizwan [1]
Joshi, Vishal [1]
Jyothi, Dushyanth [1]
Kandasaamy, Swaathi [1]
Lock, Antonia [1]
Luciani, Aurelien [1]
Luo, Jie [1]
Lussi, Yvonne [1]
Marin, Juan Sebastian Martinez [1]
Raposo, Pedro [1]
Rice, Daniel L. [1]
Santos, Rafael [1]
Speretta, Elena [1]
Stephenson, James [1]
Totoo, Prabhat [1]
Tyagi, Nidhi [1]
Urakova, Nadya [1]
Vasudev, Preethi [1]
Warner, Kate [1]
Wijerathne, Supun [1]
Yu, Conny Wing-Heng [1]
Zaru, Rossana [1]
Bridge, Alan J. [3]
Aimo, Lucila [3]
Argoud-Puy, Ghislaine [3]
Auchincloss, Andrea H. [3]
Axelsen, Kristian B. [3]
Bansal, Parit [3]
Baratin, Delphine [3]
Batista Neto, Teresa M. [3]
Blatter, Marie-Claude [3]
Bolleman, Jerven T. [3]
Boutet, Emmanuel [3]
Breuza, Lionel [3]
Affiliations
[1] European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Genome Campus, Hinxton CB10 1SD, England
[2] Ctr Med Univ Geneva, SIB Swiss Inst Bioinformat, 1 Rue Michel Servet, CH-1211 Geneva 4, Switzerland
[3] SIB Swiss Inst Bioinformat, Geneva, Switzerland
[4] Prot Informat Resource, Washington, DC USA
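Funding
US National Institutes of Health (NIH); UK Biotechnology and Biological Sciences Research Council (BBSRC); EU Horizon 2020; US National Science Foundation (NSF)
Keywords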
ANNOTATION; RESOURCE; DATABASE; UPDATE;
DOI
10.1093/nar/gkae1010
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication, we describe ongoing changes to our production pipeline to limit the sequences available in UniProtKB to high-quality, non-redundant reference proteomes. We continue to manually curate the scientific literature to add the latest functional data and use machine learning techniques. We also encourage community curation to ensure key publications are not missed. We provide an update on the automatic annotation methods used by UniProtKB to predict information for unreviewed entries describing unstudied proteins. Finally, updates to the UniProt website are described, including a new tab linking protein to genomic information. In recognition of its value to the scientific community, the UniProt database has been awarded Global Core Biodata Resource status.
Pages: D609-D617
Number of pages: 9