Revealing aperiodic aspects of solenoid proteins from sequence information

被引:2
作者
Hrabe, Thomas [1 ]
Jaroszewski, Lukasz [1 ]
Godzik, Adam [1 ]
机构
[1] Sanford Burnham Prebys Med Discovery Inst, Dept Bioinformat & Syst Biol, La Jolla, CA 92037 USA
关键词
LEUCINE-RICH-REPEAT; PEAK DETECTION; RECOGNITION; PERIODICITY; ALIGNMENT; FEATURES;
D O I
10.1093/bioinformatics/btw319
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Repeat proteins, which contain multiple repeats of short sequence motifs, form a large but seldom-studied group of proteins. Methods focusing on the analysis of 3D structures of such proteins identified many subtle effects in length distribution of individual motifs that are important for their functions. However, similar analysis was yet not applied to the vast majority of repeat proteins with unknown 3D structures, mostly because of the extreme diversity of the underlying motifs and the resulting difficulty to detect those. Results: We developed FAIT, a sequence-based algorithm for the precise assignment of individual repeats in repeat proteins and introduced a framework to classify and compare aperiodicity patterns for large protein families. FAIT extracts repeat positions by post-processing FFAS alignment matrices with image processing methods. On examples of proteins with Leucine Rich Repeat (LRR) domains and other solenoids like proteins, we show that the automated analysis with FAIT correctly identifies exact lengths of individual repeats based entirely on sequence information.
引用
收藏
页码:2776 / 2782
页数:7
相关论文
共 30 条
  • [11] Hrabe T., 2011, ENCY LIFE SCI, DOI [10.1002/9780470015902.a0023175, DOI 10.1002/9780470015902.A0023175]
  • [12] PDBFlex: exploring flexibility in protein structures
    Hrabe, Thomas
    Li, Zhanwen
    Sedova, Mayya
    Rotkiewicz, Piotr
    Jaroszewski, Lukasz
    Godzik, Adam
    [J]. NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) : D423 - D428
  • [13] ConSole: using modularity of Contact maps to locate Solenoid domains in protein structures
    Hrabe, Thomas
    Godzik, Adam
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [14] Jacobson ML, 2001, P ANN INT IEEE EMBS, V23, P2194, DOI 10.1109/IEMBS.2001.1017206
  • [15] FFAS server: novel features and applications
    Jaroszewski, Lukasz
    Li, Zhanwen
    Cai, Xiao-hui
    Weber, Christoph
    Godzik, Adam
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : W38 - W44
  • [16] Tandem repeats in proteins: From sequence to structure
    Kajava, Andrey V.
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2012, 179 (03) : 279 - 288
  • [17] Structural diversity of leucine-rich repeat proteins
    Kajava, AV
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1998, 277 (03) : 519 - 527
  • [18] The leucine-rich repeat as a protein recognition motif
    Kobe, B
    Kajava, AV
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2001, 11 (06) : 725 - 732
  • [19] Ankyrin repeat: A unique motif mediating protein-protein interactions
    Li, Junan
    Mahajan, Anjali
    Tsai, Ming-Daw
    [J]. BIOCHEMISTRY, 2006, 45 (51) : 15168 - 15178
  • [20] Understanding and identifying amino acid repeats
    Luo, Hong
    Nijveen, Harm
    [J]. BRIEFINGS IN BIOINFORMATICS, 2014, 15 (04) : 582 - 591