Functional insights from the distribution and role of homopeptide repeat-containing proteins

Cited by: 161
Authors
Faux, NG
Bottomley, SP
Lesk, AM
Irving, JA
Morrison, JR
de la Banda, MC
Whisstock, JC
Affiliations
[1] Monash Univ, Victorian Bioinformat Consortium, Melbourne, Vic 3800, Australia
[2] Monash Univ, Prot Crystallog Unit, Dept Biochem & Mol Biol, Melbourne, Vic 3800, Australia
[3] Monash Univ, ARC, Ctr Struct & Funct Microbial Genom, Melbourne, Vic 3800, Australia
[4] Monash Univ, Sch Comp Sci & Software Engn, Melbourne, Vic 3800, Australia
[5] Monash Univ, Monash Inst Reprod & Dev, Clayton, Vic 3168, Australia
[6] Penn State Univ, Dept Biochem & Mol Biol, University Pk, PA 16802 USA
DOI
10.1101/gr.3096505
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
Expansion of "low complex" repeats of amino acids such as glutamine (poly-Q) is associated with protein misfolding and the development of degenerative diseases such as Huntington's disease. The mechanism by which such regions promote misfolding remains controversial, the function of many repeat-containing proteins (RCPs) remains obscure, and the role (if any) of repeat regions remains to be determined. Here, a Web-accessible database of RCPs is presented. The distribution and evolution of RCPs that contain homopeptide repeat tracts are considered, and the existence of functional patterns is investigated. Generally, it is found that while polyamino acid repeats are extremely rare in prokaryotes, several eukaryote putative homologs of prokaryote RCPs (involved in important housekeeping processes) retain the repetitive region, suggesting an ancient origin for certain repeats. Within eukarya, the most common uninterrupted amino acid repeats are glutamine, asparagine, and alanine. Interestingly, while poly-Q repeats are found in vertebrates and nonvertebrates, poly-N repeats are only common in more primitive nonvertebrate organisms, such as insects and nematodes. We have assigned function to eukaryote RCPs using Online Mendelian Inheritance in Man (OMIM), the Human Protein Reference Database (HPRD), FlyBase, and Wormpep. Prokaryote RCPs were annotated using BLASTp searches and Gene Ontology. These data reveal that the majority of RCPs are involved in processes that require the assembly of large, multiprotein complexes, such as transcription and signaling.
Pages: 537-551
Page count: 15
Related papers
59 in total
[1]   Histone chaperones and nucleosome assembly [J].
Akey, CW ;
Luger, K .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2003, 13 (01) :6-14
[2]   Amino acid reiterations in yeast are overrepresented in particular classes of proteins and show evidence of a slippage-like mutational process [J].
Albà, MM ;
Santibàñez-Koref, MF ;
Hancock, JM .
JOURNAL OF MOLECULAR EVOLUTION, 1999, 49 (06) :789-797
[3]   Comparative analysis of amino acid repeats in rodents and humans [J].
Albà, MM ;
Guigó, R .
GENOME RESEARCH, 2004, 14 (04) :549-554
[4]   Detecting cryptically simple protein sequences using the SIMPLE algorithm [J].
Albà, MM ;
Laskowski, RA ;
Hancock, JM .
BIOINFORMATICS, 2002, 18 (05) :672-678
[5]   ALSCRIPT - A TOOL TO FORMAT MULTIPLE SEQUENCE ALIGNMENTS [J].
BARTON, GJ .
PROTEIN ENGINEERING, 1993, 6 (01) :37-40
[6]   Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI 10.1093/nar/gkh121
[7]   Intranuclear neuronal inclusions in Huntington's disease and dentatorubral and pallidoluysian atrophy: Correlation between the density of inclusions and IT15 CAG triplet repeat length [J].
Becher, MW ;
Kotzuk, JA ;
Sharp, AH ;
Davies, SW ;
Bates, GP ;
Price, DL ;
Ross, CA .
NEUROBIOLOGY OF DISEASE, 1998, 4 (06) :387-397
[8]   Alanine tracts: the expanding story of human illness and trinucleotide repeats [J].
Brown, LY ;
Brown, SA .
TRENDS IN GENETICS, 2004, 20 (01) :51-58
[9]   Signal transduction: hanging on a scaffold [J].
Burack, WR ;
Shaw, AS .
CURRENT OPINION IN CELL BIOLOGY, 2000, 12 (02) :211-216
[10]   CALNAN BJ, 1991, SCIENCE, V252, P1167