Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies

被引:6324
作者
Yoon, Seok-Hwan [1 ]
Ha, Sung-Min [1 ]
Kwon, Soonjae [1 ]
Lim, Jeongmin [1 ]
Kim, Yeseul [1 ]
Seo, Hyungseok [1 ]
Chun, Jongsik [1 ]
机构
[1] Seoul Natl Univ, Dept ChunLab Inc, Seoul, South Korea
关键词
16S rRNA gene; genome; average nucleotide identity; identification; database; PROKARYOTES; SYSTEMATICS; IDENTITY; EZTAXON; MARKER; TOOLS;
D O I
10.1099/ijsem.0.001755
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
The recent advent of DNA sequencing technologies facilitates the use of genome sequencing data that provide means for more informative and precise classification and identification of members of the Bacteria and Archaea. Because the current species definition is based on the comparison of genome sequences between type and other strains in a given species, building a genome database with correct taxonomic information is of paramount need to enhance our efforts in exploring prokaryotic diversity and discovering novel species as well as for routine identifications. Here we introduce an integrated database, called EzBioCloud, that holds the taxonomic hierarchy of the Bacteria and Archaea, which is represented by qualitycontrolled 16S rRNA gene and genome sequences. Whole-genome assemblies in the NCBI Assembly Database were screened for low quality and subjected to a composite identification bioinformatics pipeline that employs gene-based searches followed by the calculation of average nucleotide identity. As a result, the database is made of 61 700 species/phylotypes, including 13 132 with validly published names, and 62 362 whole-genome assemblies that were identified taxonomically at the genus, species and subspecies levels. Genomic properties, such as genome size and DNA G+C content, and the occurrence in human microbiome data were calculated for each genus or higher taxa. This united database of taxonomy, 16S rRNA gene and genome sequences, with accompanying bioinformatics tools, should accelerate genome-based classification and identification of members of the Bacteria and Archaea. The database and related search tools are available at www.ezbiocloud.net/.
引用
收藏
页码:1613 / 1617
页数:5
相关论文
共 22 条
[1]   EzTaxon: a web-based tool for the identification of prokaryotes based on 16S ribosomal RNA gene sequences [J].
Chun, Jongsik ;
Lee, Jae-Hak ;
Jung, Yoonyoung ;
Kim, Myungjin ;
Kim, Seil ;
Kim, Byung Kwon ;
Lim, Young-Woon .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2007, 57 :2259-2261
[2]   Integrating genomics into the taxonomy and systematics of the Bacteria and Archaea [J].
Chun, Jongsik ;
Rainey, Fred A. .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2014, 64 :316-324
[3]   Ribosomal Database Project: data and tools for high throughput rRNA analysis [J].
Cole, James R. ;
Wang, Qiong ;
Fish, Jordan A. ;
Chai, Benli ;
McGarrell, Donna M. ;
Sun, Yanni ;
Brown, C. Titus ;
Porras-Alfaro, Andrea ;
Kuske, Cheryl R. ;
Tiedje, James M. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D633-D642
[4]   Search and clustering orders of magnitude faster than BLAST [J].
Edgar, Robert C. .
BIOINFORMATICS, 2010, 26 (19) :2460-2461
[5]   HOW CLOSE IS CLOSE - 16S RIBOSOMAL-RNA SEQUENCE IDENTITY MAY NOT BE SUFFICIENT TO GUARANTEE SPECIES IDENTITY [J].
FOX, GE ;
WISOTZKEY, JD ;
JURTSHUK, P .
INTERNATIONAL JOURNAL OF SYSTEMATIC BACTERIOLOGY, 1992, 42 (01) :166-170
[6]   EzEditor: a versatile sequence alignment editor for both rRNA- and protein-coding genes [J].
Jeon, Yoon-Seong ;
Lee, Kihyun ;
Park, Sang-Cheol ;
Kim, Bong-Soo ;
Cho, Yong-Joon ;
Ha, Sung-Min ;
Chun, Jongsik .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2014, 64 :689-691
[7]   Large-scale evaluation of experimentally determined DNA G plus C contents with whole genome sequences of prokaryotes [J].
Kim, Mincheol ;
Park, Sang-Cheol ;
Baek, Inwoo ;
Chun, Jongsik .
SYSTEMATIC AND APPLIED MICROBIOLOGY, 2015, 38 (02) :79-83
[8]   Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes [J].
Kim, Mincheol ;
Oh, Hyun-Seok ;
Park, Sang-Cheol ;
Chun, Jongsik .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2014, 64 :346-351
[9]   Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species [J].
Kim, Ok-Sun ;
Cho, Yong-Joon ;
Lee, Kihyun ;
Yoon, Seok-Hwan ;
Kim, Mincheol ;
Na, Hyunsoo ;
Park, Sang-Cheol ;
Jeon, Yoon Seong ;
Lee, Jae-Hak ;
Yi, Hana ;
Won, Sungho ;
Chun, Jongsik .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2012, 62 :716-721
[10]   Phylogenetic analysis of the genus Kribbella based on the gyrB gene: proposal of a gyrB-sequence threshold for species delineation in the genus Kribbella [J].
Kirby, Bronwyn M. ;
Everest, Gareth J. ;
Meyers, Paul R. .
ANTONIE VAN LEEUWENHOEK INTERNATIONAL JOURNAL OF GENERAL AND MOLECULAR MICROBIOLOGY, 2010, 97 (02) :131-142