RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

Cited by: 39
Authors
Paladin, Lisanna [1]
Bevilacqua, Martina [1]
Errigo, Sara [1]
Piovesan, Damiano [1]
Micetic, Ivan [1]
Necci, Marco [1]
Monzon, Alexander Miguel [1]
Laura Fabre, Maria [2]
Luis Lopez, Jose [2]
Nilsson, Juliet F. [2]
Rios, Javier [3]
Lorenzano Menna, Pablo [3]
Cabrera, Maia [3]
Gonzalez Buitron, Martin [3]
Kulik, Mariane Goncalves [4]
Fernandez-Alberti, Sebastian [3]
Fornasari, Maria Silvina [3]
Parisi, Gustavo [3]
Lagares, Antonio [2]
Hirsh, Layla [5]
Andrade-Navarro, Miguel A. [4]
Kajava, Andrey V. [6]
Tosatto, Silvio C. E. [1]
Affiliations
[1] Univ Padua, Dept Biomed Sci, Via Ugo Bassi 58-B, I-35121 Padua, Italy
[2] La Plata Natl Univ, Dept Biol Sci, IBBM CONICET, 49 Y 115, RA-1900 La Plata, Argentina
[3] Natl Univ Quilmes, Dept Sci & Technol, Roque Saenz Pena 352, Bernal, Buenos Aires, Argentina
[4] Johannes Gutenberg Univ Mainz, Fac Biol, Inst Organism & Mol Evolut, Hans Dieter Husch Weg 15, D-55128 Mainz, Germany
[5] Pontifical Catholic Univ Peru, Fac Sci & Engn, Dept Engn, Av Univ 1801 San Miguel, Lima 32, Peru
[6] Univ Montpellier, Ctr Rech Biol Cellulaire Montpellier, CNRS, UMR 5237, Montpellier, France
Keywords
IDENTIFICATION; VISUALIZATION
DOI
10.1093/nar/gkaa1097
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increases the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and introduces an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification, which combines top levels based solely on structural similarity (Class > Topology > Fold) with two new levels (Clan > Family) that require sequence similarity and describe repeat motifs in collaboration with Pfam. Data growth has been addressed with improved mechanisms for browsing the classification hierarchy. A new UniProt-centric view unifies the increasingly frequent annotation of structures from identical or similar sequences. This update of RepeatsDB aligns with our commitment to develop a resource that extracts, organizes and distributes specialized information on tandem repeat protein structures.
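The five-level hierarchy described in the abstract can be pictured as an ordered path, with the first three levels assigned on structure alone and the last two additionally requiring sequence similarity. The following is a minimal Python sketch of that idea only. The `Classification` class, its method names, and the placeholder labels are assumptions made for illustration; they are not the RepeatsDB data model or API.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Level names in the order given in the abstract.
STRUCTURAL_LEVELS = ("Class", "Topology", "Fold")  # structural similarity only
SEQUENCE_LEVELS = ("Clan", "Family")               # also require sequence similarity
LEVELS = STRUCTURAL_LEVELS + SEQUENCE_LEVELS


@dataclass(frozen=True)
class Classification:
    """Hypothetical path through the hierarchy; deeper levels may be unassigned."""
    labels: Tuple[str, ...]

    def __post_init__(self) -> None:
        # At least a Class label, at most one label per level.
        if not 1 <= len(self.labels) <= len(LEVELS):
            raise ValueError("expected between 1 and 5 labels")

    def level(self, name: str) -> Optional[str]:
        """Return the label at the named level, or None if unassigned."""
        i = LEVELS.index(name)
        return self.labels[i] if i < len(self.labels) else None

    def deepest_level(self) -> str:
        """Name of the most specific level that has a label."""
        return LEVELS[len(self.labels) - 1]

    def uses_sequence_similarity(self) -> bool:
        """True if the path extends into the Clan/Family levels."""
        return len(self.labels) > len(STRUCTURAL_LEVELS)

    def shares_prefix(self, other: "Classification", upto: str) -> bool:
        """True if both paths carry identical labels down to level `upto`."""
        n = LEVELS.index(upto) + 1
        return (len(self.labels) >= n and len(other.labels) >= n
                and self.labels[:n] == other.labels[:n])


# Placeholder labels, invented for the example.
structure_only = Classification(("class-A", "topology-A1", "fold-A1a"))
with_family = Classification(
    ("class-A", "topology-A1", "fold-A1a", "clan-X", "family-X1"))
```

In this sketch two entries can share the same Fold while only one of them is resolved down to a Family, which mirrors the split between structure-based and sequence-based levels.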
Pages: D452-D457
Page count: 6