Applications of de Bruijn graphs in microbiome research

Cited by: 3
Authors
Dufault-Thompson, Keith [1]
Jiang, Xiaofang [1, 2]
Affiliations
[1] NLM, Intramural Res Program, NIH, Bethesda, MD USA
[2] NLM, Intramural Res Program, NIH, Bldg 38A,Room 6N607,8600 Rockville Pike, Bethesda, MD 20894 USA
Source
IMETA | 2022 / Vol. 1 / Issue 01
Keywords
de Bruijn graphs; microbiome; Omics; PAN-GENOME ANALYSIS; ALGORITHM; CHALLENGES; EFFICIENT;
DOI
10.1002/imt2.4
CLC classification number
Q93 [Microbiology];
Discipline classification code
071005; 100705;
Abstract
High-throughput sequencing has become an increasingly central component of microbiome research. The development of de Bruijn graph-based methods for assembling high-throughput sequencing data has been an important part of the broader adoption of sequencing in biological studies. Recent advances in the construction and representation of de Bruijn graphs have led to new approaches that use the de Bruijn graph data structure to aid different biological analyses. One application of these methods is in alternative approaches to the assembly of sequencing data, such as gene-targeted assembly, where only gene sequences are assembled out of larger metagenomes, and differential assembly, where sequences that are differentially present between two samples are assembled. de Bruijn graphs have also been applied in comparative genomics, where they can represent large sets of genomes or metagenomes and where structural features in the graphs can be used to identify variants, indels, and homologous regions in sequences. These de Bruijn graph-based representations of sequencing data have even begun to be applied to whole sequencing databases for large-scale searches and experiment discovery. de Bruijn graphs have played a central role in how high-throughput sequencing data is worked with, and the rapid development of new tools that rely on these data structures suggests that they will continue to play an important role in biology in the future.

The ability to efficiently assemble high-throughput sequencing data using de Bruijn graph-based assembly methods has been an important factor in the adoption of sequencing as a central component of microbiome research. Recent methods have applied the de Bruijn graph data structure as a component of analytical tools as well, opening up new routes of analysis in comparative genomics and metagenomics. de Bruijn graphs will likely continue to have a prominent role in how microbiome sequencing data is assembled and analyzed.

de Bruijn graph-based sequence assembly approaches have been an essential part of the broad application of sequencing methods, especially in microbiome research. de Bruijn graphs can be used to efficiently represent sequencing data in a format that is highly scalable and can be extended and modified to address different research questions. de Bruijn graph-based analysis methods have been developed for comparative genomics, the identification of genetic variants, and large-scale searching of unassembled sequencing data. The de Bruijn graph data structure will continue to be a central component of sequence assembly and analysis approaches in the future.
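
As a rough illustration of the data structure the abstract refers to, the sketch below builds a toy de Bruijn graph from a few reads and lists the k-mers found in one sample but not another, which is the basic idea behind differential assembly. It is not taken from the paper or from any tool it reviews. The function names (`kmers`, `build_de_bruijn`, `differential_kmers`) and the example reads are hypothetical, and the sketch assumes error-free reads and ignores reverse complements and k-mer abundance.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every substring of length k in seq, left to right."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_de_bruijn(reads, k):
    """Build a toy de Bruijn graph (hypothetical helper).

    Nodes are (k-1)-mers; each k-mer contributes one edge from its
    prefix (first k-1 bases) to its suffix (last k-1 bases).
    """
    graph = defaultdict(set)
    for read in reads:
        for kmer in kmers(read, k):
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def differential_kmers(reads_a, reads_b, k):
    """Return the k-mers present in sample A but absent from sample B."""
    set_a = {km for read in reads_a for km in kmers(read, k)}
    set_b = {km for read in reads_b for km in kmers(read, k)}
    return set_a - set_b


# Made-up reads for two samples (assumption: no sequencing errors).
sample_a = ["ACGTAC", "GTACGG"]
sample_b = ["ACGTAC"]

graph = build_de_bruijn(sample_a, k=4)
only_in_a = differential_kmers(sample_a, sample_b, k=4)
```

Here the node `ACG` has two outgoing edges (to `CGT` and `CGG`), the kind of branch that graph-based methods inspect when looking for variants. Real tools typically rely on compact or probabilistic k-mer representations rather than Python sets.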
Pages: 10