RepeatsDB: a database of tandem repeat protein structures

Cited by: 51
Authors
Di Domenico, Tomas [1]
Potenza, Emilio [1]
Walsh, Ian [1]
Parra, R. Gonzalo [2]
Giollo, Manuel [1, 3]
Minervini, Giovanni [1]
Piovesan, Damiano [1]
Ihsan, Awais [1, 4]
Ferrari, Carlo [3]
Kajava, Andrey V. [5, 6]
Tosatto, Silvio C. E. [1]
Affiliations
[1] Univ Padua, Dept Biomed Sci, I-35131 Padua, Italy
[2] Univ Buenos Aires, Dept Biol Chem, Buenos Aires, DF, Argentina
[3] Univ Padua, Dept Informat Engn, I-35121 Padua, Italy
[4] COMSATS Inst Informat Technol, Dept Biosci, Sahiwal, Pakistan
[5] CNRS, Ctr Rech Biochim Macromol, F-34293 Montpellier 5, France
[6] Inst Biol Computat, F-34293 Montpellier 5, France
Keywords
LEUCINE-RICH REPEAT; SEQUENCES; RECOGNITION; PERIODICITY; VALIDATION; EVOLUTION
DOI
10.1093/nar/gkt1175
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
RepeatsDB (http://repeatsdb.bio.unipd.it/) is a database of annotated tandem repeat protein structures. Tandem repeats pose a difficult problem for the analysis of protein structures, as the underlying sequence can be highly degenerate. Several repeat types have been studied over the years, but their annotation was done on a case-by-case basis, thus making large-scale analysis difficult. We developed RepeatsDB to fill this gap. Using state-of-the-art repeat detection methods and manual curation, we systematically annotated the Protein Data Bank, predicting 10 745 repeat structures. In all, 2797 structures were classified according to a recently proposed classification schema, which was expanded to accommodate new findings. In addition, detailed annotations were performed on a subset of 321 proteins. These annotations feature information on start and end positions for the repeat regions and units. RepeatsDB is an ongoing effort to systematically classify and annotate structural protein repeats in a consistent way. It provides users with the possibility to access and download high-quality datasets either interactively or programmatically through web services.
Pages: D352-D357
Number of pages: 6