A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes

Cited by: 750
Authors
Haft, DH [1]
Selengut, J [1]
Mongodin, EF [1]
Nelson, KE [1]
Affiliations
[1] Inst Gen Res, Rockville, MD USA
DOI
10.1371/journal.pcbi.0010060
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. Repeats of 21-37 bp typically show weak dyad symmetry and are separated by regularly sized, nonrepetitive spacer sequences. Four CRISPR-associated (Cas) protein families, designated Cas1 to Cas4, are strictly associated with CRISPR elements and always occur near a repeat cluster. Some spacers originate from mobile genetic elements and are thought to confer "immunity" against the elements that harbor these sequences. In the present study, we have systematically investigated uncharacterized proteins encoded in the vicinity of these CRISPRs and found many additional protein families that are strictly associated with CRISPR loci across multiple prokaryotic species. Multiple sequence alignments and hidden Markov models have been built for 45 Cas protein families. These models identify family members with high sensitivity and selectivity and classify key regulators of development, DevR and DevS, in Myxococcus xanthus as Cas proteins. These identifications show that CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a repeat cluster or filling the region between two repeat clusters. Distinctive subsets of the collection of Cas proteins recur in phylogenetically distant species and correlate with characteristic repeat periodicity. The analyses presented here support initial proposals of mobility of these units, along with the likelihood that loci of different subtypes interact with one another as well as with host cell defensive, replicative, and regulatory systems. It is evident from this analysis that CRISPR/cas loci are larger, more complex, and more heterogeneous than previously appreciated.
Pages: 474-483
Page count: 10