A High-Throughput DNA Sequence Aligner for Microbial Ecology Studies

被引:247
|
作者
Schloss, Patrick D. [1 ,2 ]
机构
[1] Univ Massachusetts, Dept Microbiol, Amherst, MA 01003 USA
[2] Univ Michigan, Dept Microbiol & Immunol, Ann Arbor, MI 48109 USA
来源
PLOS ONE | 2009年 / 4卷 / 12期
基金
美国国家科学基金会;
关键词
ESTIMATING SPECIES RICHNESS; GUT MICROBIOTA; DIVERSITY; ALIGNMENT; PROGRAMS; DATABASE; ARB; BIOSPHERE; SEARCH;
D O I
10.1371/journal.pone.0008230
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As the scope of microbial surveys expands with the parallel growth in sequencing capacity, a significant bottleneck in data analysis is the ability to generate a biologically meaningful multiple sequence alignment. The most commonly used aligners have varying alignment quality and speed, tend to depend on a specific reference alignment, or lack a complete description of the underlying algorithm. The purpose of this study was to create and validate an aligner with the goal of quickly generating a high quality alignment and having the flexibility to use any reference alignment. Using the simple nearest alignment space termination algorithm, the resulting aligner operates in linear time, requires a small memory footprint, and generates a high quality alignment. In addition, the alignments generated for variable regions were of as high a quality as the alignment of full-length sequences. As implemented, the method was able to align 18 full-length 16S rRNA gene sequences and 58 V2 region sequences per second to the 50,000-column SILVA reference alignment. Most importantly, the resulting alignments were of a quality equal to SILVA-generated alignments. The aligner described in this study will enable scientists to rapidly generate robust multiple sequences alignments that are implicitly based upon the predicted secondary structure of the 16S rRNA molecule. Furthermore, because the implementation is not connected to a specific database it is easy to generalize the method to reference alignments for any DNA sequence.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] SOME INTRIGUING HIGH-THROUGHPUT DNA SEQUENCE VARIANTS PREDICTION OVER PROTEIN FUNCTIONALITY
    Kheirkhah, Atabak
    Daud, Salwani Mohd
    Salleh, Noor Azurati Ahmad
    Sam, Suriani Mohd
    Abas, Hafiza
    Shariff, Sya Azmeela
    Yusof, Yusnaidi Md
    JURNAL TEKNOLOGI-SCIENCES & ENGINEERING, 2016, 78 (6-4): : 1 - 6
  • [42] A method for high-throughput production of sequence-verified DNA libraries and strain collections
    Smith, Justin D.
    Schlecht, Ulrich
    Xu, Weihong
    Suresh, Sundari
    Horecka, Joe
    Proctor, Michael J.
    Aiyar, Raeka S.
    Bennett, Richard A. O.
    Chu, Angela
    Li, Yong Fuga
    Roy, Kevin
    Davis, Ronald W.
    Steinmetz, Lars M.
    Hyman, Richard W.
    Levy, Sasha F.
    St Onge, Robert P.
    MOLECULAR SYSTEMS BIOLOGY, 2017, 13 (02)
  • [43] K-means Clustering Based Compression Algorithm for the High-throughput DNA Sequence
    Tan, Li
    Sun, Jifeng
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 952 - 955
  • [44] High-Throughput Analysis of T-DNA Location and Structure Using Sequence Capture
    Inagaki, Soichi
    Henry, Isabelle M.
    Lieberman, Meric C.
    Comai, Luca
    PLOS ONE, 2015, 10 (10):
  • [45] Antisense for high-throughput genomic studies
    Hackett, PB
    Essner, JJ
    GENETIC ENGINEERING NEWS, 2003, 23 (05): : 34 - +
  • [46] High-throughput single-molecule studies of protein-DNA interactions
    Robison, Aaron D.
    Finkelstein, Ilya J.
    FEBS LETTERS, 2014, 588 (19) : 3539 - 3546
  • [47] Fixed DNA Molecule Arrays for High-Throughput Single DNA-Protein Interaction Studies
    Tutkus, Marijonas
    Rakickas, Tomas
    Kopustas, Aurimas
    Ivanovaite, Sarune
    Venckus, Oskaras
    Navikas, Vytautas
    Zaremba, Mindaugas
    Manakova, Elena
    Valiokas, Ramunas
    LANGMUIR, 2019, 35 (17) : 5921 - 5930
  • [48] Spotsizer: High-throughput quantitative analysis of microbial growth
    Bischof, Leanne
    Prevorovsky, Martin
    Rallis, Charalampos
    Jeffares, Daniel C.
    Arzhaeva, Yulia
    Bahler, Jurg
    BIOTECHNIQUES, 2016, 61 (04) : 191 - 201
  • [49] High-throughput metal susceptibility testing of microbial biofilms
    Harrison, JJ
    Turner, RJ
    Ceri, H
    BMC MICROBIOLOGY, 2005, 5 (1)
  • [50] High-throughput metal susceptibility testing of microbial biofilms
    Joe J Harrison
    Raymond J Turner
    Howard Ceri
    BMC Microbiology, 5