Estimating the Number of Species in Metagenomes by Clustering Next-Generation Read Sequences

被引:0
|
作者
Seok, Ho-Sik [1 ]
Hong, Woonyoung [1 ]
Kim, Jaebum [2 ]
机构
[1] Konkuk Univ, Dept Anim Biotechnol, Seoul, South Korea
[2] Konkuk Univ, UBITA Ctr, Dept Anim Biotechnol, Biotechnol Res, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Metagenome; clustering; Expectation-Maximization; sampling;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fast and cheap next-generation sequencing (NGS) technologies with the ability to sequence uncultured microbes present us unprecedented opportunities to distill meaningful information from millions of short read sequences of metagenomes. Contrary to the case of a single species genome, NGS read sequences from metagenomes are extremely complex and heterogeneous because metagenomes are a collection of genetic materials from very large number of microbes with varying abundance levels. In this paper we present a method to estimate the number of species in metagenomes sequences through the efficient clustering of metagenomic NGS read sequences. We believe that our method will contribute to the better understanding of a microbial community in metagenomes.
引用
收藏
页码:52 / +
页数:2
相关论文
共 50 条
  • [1] Estimating the composition of species in metagenomes by clustering of next-generation read sequences
    Seok, Ho-Sik
    Hong, Woonyoung
    Kim, Jaebum
    METHODS, 2014, 69 (03) : 213 - 219
  • [2] Estimating sequence similarity from read sets for clustering next-generation sequencing data
    Petr Ryšavý
    Filip Železný
    Data Mining and Knowledge Discovery, 2019, 33 : 1 - 23
  • [3] Estimating sequence similarity from read sets for clustering next-generation sequencing data
    Rysavy, Petr
    Zelezny, Filip
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (01) : 1 - 23
  • [4] SEED: efficient clustering of next-generation sequences
    Bao, Ergude
    Jiang, Tao
    Kaloshian, Isgouhi
    Girke, Thomas
    BIOINFORMATICS, 2011, 27 (18) : 2502 - 2509
  • [5] Software for pre-processing Illumina next-generation sequencing short read sequences
    Chen, Chuming
    Khaleel, Sari S.
    Huang, Hongzhan
    Wu, Cathy H.
    SOURCE CODE FOR BIOLOGY AND MEDICINE, 2014, 9 (01):
  • [6] ART: a next-generation sequencing read simulator
    Huang, Weichun
    Li, Leping
    Myers, Jason R.
    Marth, Gabor T.
    BIOINFORMATICS, 2012, 28 (04) : 593 - 594
  • [7] Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data
    Petr Novák
    Pavel Neumann
    Jiří Macas
    BMC Bioinformatics, 11
  • [8] Metagenome assembly through clustering of next-generation sequencing data using protein sequences
    Sim, Mikang
    Kim, Jaebum
    JOURNAL OF MICROBIOLOGICAL METHODS, 2015, 109 : 180 - 187
  • [9] A clustering method for next-generation sequences of bacterial genomes through multiomics data mapping
    Ho-Sik Seok
    Mikang Sim
    Daehwan Lee
    Jaebum Kim
    Genes & Genomics, 2014, 36 : 191 - 196
  • [10] Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data
    Novak, Petr
    Neumann, Pavel
    Macas, Jiri
    BMC BIOINFORMATICS, 2010, 11