A High-Throughput DNA Sequence Aligner for Microbial Ecology Studies

被引:247
|
作者
Schloss, Patrick D. [1 ,2 ]
机构
[1] Univ Massachusetts, Dept Microbiol, Amherst, MA 01003 USA
[2] Univ Michigan, Dept Microbiol & Immunol, Ann Arbor, MI 48109 USA
来源
PLOS ONE | 2009年 / 4卷 / 12期
基金
美国国家科学基金会;
关键词
ESTIMATING SPECIES RICHNESS; GUT MICROBIOTA; DIVERSITY; ALIGNMENT; PROGRAMS; DATABASE; ARB; BIOSPHERE; SEARCH;
D O I
10.1371/journal.pone.0008230
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As the scope of microbial surveys expands with the parallel growth in sequencing capacity, a significant bottleneck in data analysis is the ability to generate a biologically meaningful multiple sequence alignment. The most commonly used aligners have varying alignment quality and speed, tend to depend on a specific reference alignment, or lack a complete description of the underlying algorithm. The purpose of this study was to create and validate an aligner with the goal of quickly generating a high quality alignment and having the flexibility to use any reference alignment. Using the simple nearest alignment space termination algorithm, the resulting aligner operates in linear time, requires a small memory footprint, and generates a high quality alignment. In addition, the alignments generated for variable regions were of as high a quality as the alignment of full-length sequences. As implemented, the method was able to align 18 full-length 16S rRNA gene sequences and 58 V2 region sequences per second to the 50,000-column SILVA reference alignment. Most importantly, the resulting alignments were of a quality equal to SILVA-generated alignments. The aligner described in this study will enable scientists to rapidly generate robust multiple sequences alignments that are implicitly based upon the predicted secondary structure of the 16S rRNA molecule. Furthermore, because the implementation is not connected to a specific database it is easy to generalize the method to reference alignments for any DNA sequence.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] High-throughput screening for efficient microbial biotechnology
    Sarnaik, Aditya
    Liu, Arren
    Nielsen, David
    Varman, Arul M.
    CURRENT OPINION IN BIOTECHNOLOGY, 2020, 64 : 141 - 150
  • [22] HIGH-THROUGHPUT CHARACTERIZATION AND COMPARISON OF MICROBIAL COMMUNITIES
    Halwachs, Bettina
    Hoeftberger, Johann
    Stocker, Gernot
    Snajder, Rene
    Gorkiewicz, Gregor
    Thallinger, Gerhard G.
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2013, 58
  • [23] DNA sequencing in high-throughput neuroanatomy
    Kebschull, Justus M.
    JOURNAL OF CHEMICAL NEUROANATOMY, 2019, 100
  • [24] High-throughput DNA extraction solutions
    Corbett, Geoff
    GENETIC ENGINEERING NEWS, 2006, 26 (20): : 24 - 25
  • [25] HIGH-THROUGHPUT DNA PREPARATION SYSTEM
    GARNER, HR
    ARMSTRONG, B
    KRAMARSKY, DA
    GENETIC ANALYSIS-BIOMOLECULAR ENGINEERING, 1992, 9 (5-6): : 134 - 139
  • [26] Uniform acquisition for high-throughput DNA
    El-Difrawy, S
    Lam, R
    Ehrlich, DJ
    PROCEEDINGS OF THE IEEE 30TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2004, : 112 - 113
  • [27] Plasmid DNA the high-throughput way
    Wolf, PG
    SCIENTIST, 2001, 15 (02): : 21 - +
  • [28] High-throughput mapping of regulatory DNA
    Rajagopal, Nisha
    Srinivasan, Sharanya
    Kooshesh, Kameron
    Guo, Yuchun
    Edwards, Matthew D.
    Banerjee, Budhaditya
    Syed, Tahin
    Emons, Bart J. M.
    Gifford, David K.
    Sherwood, Richard I.
    NATURE BIOTECHNOLOGY, 2016, 34 (02) : 167 - +
  • [29] High-throughput mapping of regulatory DNA
    Nisha Rajagopal
    Sharanya Srinivasan
    Kameron Kooshesh
    Yuchun Guo
    Matthew D Edwards
    Budhaditya Banerjee
    Tahin Syed
    Bart J M Emons
    David K Gifford
    Richard I Sherwood
    Nature Biotechnology, 2016, 34 : 167 - 174
  • [30] High-Throughput Automation of DNA Extraction
    Saul, David
    Price, Nick
    GENETIC ENGINEERING & BIOTECHNOLOGY NEWS, 2009, 29 (01): : 23 - 23