Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA

Cited by: 90
Authors
DeSantis, TZ [1]
Dubosarskiy, I [1]
Murray, SR [1]
Andersen, GL [1]
Affiliations
[1] Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA
DOI
10.1093/bioinformatics/btg200
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Motivation: Prokaryotic organisms have been identified using sequence variation in the 16S rRNA gene. These variations guide the design of DNA probes for detecting taxonomic groups or specific organisms. The long-term goal of our project is to create probe arrays capable of identifying 16S rDNA sequences in unknown samples. This required the authentication, categorization and alignment of the >75 000 publicly available '16S' sequences. Preferably, the entire process should be administered computationally so that the aligned collection can periodically absorb 16S rDNA sequences from the public records. A complete multiple sequence alignment would provide a foundation for computational probe selection and facilitate microbial taxonomy and phylogeny.

Results: Here we report the alignment and similarity clustering of 62 662 16S rDNA sequences and an approach for designing effective probes for each cluster. A novel alignment compression algorithm, NAST (Nearest Alignment Space Termination), was designed to produce the uniform multiple sequence alignment referred to as the prokMSA. From the prokMSA, 9020 Operational Taxonomic Units (OTUs) were found based on transitive sequence similarities. With the prokMSA clustered into OTUs, an automated approach to probe design was straightforward. As a test case, multiple probes were computationally picked for each of the 27 OTUs identified within the Staphylococcus Group. The probes were incorporated into a customized microarray and categorized Staphylococcus aureus and Bacillus anthracis into their correct OTUs. Although a successful probe-picking strategy is outlined, the main focus of creating the prokMSA was to provide a comprehensive, categorized, updateable 16S rDNA collection useful as a foundation for any probe selection algorithm.
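
Grouping by "transitive sequence similarities" can be illustrated with a small sketch. The abstract does not state the similarity measure or the cutoff, so the column-wise identity function, the 0.9 threshold and the names `identity` and `cluster_transitive` below are assumptions chosen for illustration, not the authors' implementation. The sketch only shows the transitive idea: if A is similar to B and B is similar to C, all three land in one group even when A and C fall below the cutoff.

```python
from itertools import combinations


def identity(a: str, b: str) -> float:
    """Fraction of identical columns between two aligned sequences.

    Assumption: columns where both sequences have a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    return matches / compared if compared else 0.0


def cluster_transitive(aligned: dict[str, str], threshold: float) -> list[set[str]]:
    """Group sequences so that similarity is transitive (single linkage)."""
    parent = {name: name for name in aligned}

    def find(x: str) -> str:
        # Follow parent links to the group representative, halving the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the groups of every pair that meets the similarity threshold.
    for a, b in combinations(aligned, 2):
        if identity(aligned[a], aligned[b]) >= threshold:
            parent[find(a)] = find(b)

    groups: dict[str, set[str]] = {}
    for name in aligned:
        groups.setdefault(find(name), set()).add(name)
    return list(groups.values())


# Toy aligned sequences (hypothetical data, not from the prokMSA).
toy = {
    "A": "ACGTACGTAC",
    "B": "ACGTACGTAA",  # 90% identical to A
    "C": "ACGTACGAAA",  # 90% identical to B, only 80% identical to A
    "D": "TTTTTTTTTT",
}
clusters = cluster_transitive(toy, threshold=0.9)
```

In this toy input, A and C end up in the same group through B, while D stays alone. The all-pairs comparison is quadratic in the number of sequences, so a collection the size of the prokMSA would need a faster candidate search than this sketch uses.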
Pages: 1461-1468
Page count: 8