Applications of de Bruijn graphs in microbiome research

Times cited: 3
Authors
Dufault-Thompson, Keith [1]
Jiang, Xiaofang [1, 2]
Affiliations
[1] NLM, Intramural Res Program, NIH, Bethesda, MD USA
[2] NLM, Intramural Res Program, NIH, Bldg 38A,Room 6N607,8600 Rockville Pike, Bethesda, MD 20894 USA
Source
IMETA | 2022 / Volume 1 / Issue 01
Keywords
de Bruijn graphs; microbiome; Omics; PAN-GENOME ANALYSIS; ALGORITHM; CHALLENGES; EFFICIENT;
DOI
10.1002/imt2.4
Chinese Library Classification number
Q93 [Microbiology];
Discipline classification code
071005 ; 100705 ;
Abstract
High-throughput sequencing has become an increasingly central component of microbiome research. The development of de Bruijn graph-based methods for assembling high-throughput sequencing data has been an important part of the broader adoption of sequencing in biological studies. Recent advances in the construction and representation of de Bruijn graphs have led to new approaches that use the de Bruijn graph data structure to aid in different biological analyses. One application of these methods has been in alternative approaches to the assembly of sequencing data, such as gene-targeted assembly, where only gene sequences are assembled out of larger metagenomes, and differential assembly, where sequences that are differentially present between two samples are assembled. de Bruijn graphs have also been applied in comparative genomics, where they can represent large sets of genomes or metagenomes and where structural features in the graphs can be used to identify variants, indels, and homologous regions in sequences. These de Bruijn graph-based representations of sequencing data have even begun to be applied to whole sequencing databases for large-scale searches and experiment discovery. de Bruijn graphs have played a central role in how high-throughput sequencing data is handled, and the rapid development of new tools that rely on these data structures suggests that they will continue to play an important role in biology in the future.

The ability to efficiently assemble high-throughput sequencing data using de Bruijn graph-based assembly methods has been an important factor in the adoption of sequencing as a central component of microbiome research. Recent methods have applied the de Bruijn graph data structure as a component of analytical tools as well, opening up new routes of analysis in comparative genomics and metagenomics. de Bruijn graphs will likely continue to have a prominent role in how microbiome sequencing data is assembled and analyzed.

de Bruijn graph-based sequence assembly approaches have been an essential part of the broad application of sequencing methods, especially in microbiome research. de Bruijn graphs can be used to efficiently represent sequencing data in a format that is highly scalable and can be extended and modified to address different research questions. de Bruijn graph-based analysis methods have been developed for comparative genomics, the identification of genetic variants, and large-scale searching of unassembled sequencing data. The de Bruijn graph data structure will continue to be a central component of sequence assembly and analysis approaches in the future.
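To make the data structure named in the abstract concrete, the following is a minimal Python sketch, not code from the article. It assumes the common node-centric formulation in which each k-mer in the reads becomes an edge between its (k-1)-mer prefix and suffix. The function names `build_de_bruijn`, `kmer_set`, and `differential_kmers` are hypothetical, and the sketch ignores reverse complements, sequencing errors, and k-mer abundance, all of which real assemblers handle. The last function only illustrates the idea behind differential assembly by collecting k-mers present in one sample and absent from another; it is not a full assembler.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Build a node-centric de Bruijn graph from reads.

    Nodes are (k-1)-mers; every k-mer found in a read adds an edge
    from its (k-1)-mer prefix to its (k-1)-mer suffix.
    """
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return dict(graph)


def kmer_set(reads, k):
    """Return the set of all k-mers occurring in the reads."""
    return {read[i:i + k] for read in reads for i in range(len(read) - k + 1)}


def differential_kmers(sample_a, sample_b, k):
    """Return k-mers present in sample_a but absent from sample_b.

    Illustrative assumption: a differential assembly would then build
    a graph from only these k-mers.
    """
    return kmer_set(sample_a, k) - kmer_set(sample_b, k)


if __name__ == "__main__":
    reads = ["ACGTC", "CGTCA"]
    for node, successors in sorted(build_de_bruijn(reads, k=3).items()):
        print(node, "->", sorted(successors))
    print(sorted(differential_kmers(["ACGTC"], ["CGTCA"], k=3)))
```

Overlapping reads collapse onto shared nodes and edges, which is why the graph grows with the number of distinct k-mers and not with the number of reads.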
Pages: 10
Related papers
63 in total
  • [1] Metagenome SNP calling via read-colored de Bruijn graphs
    Alipanahi, Bahar
    Muggli, Martin D.
    Jundi, Musa
    Noyes, Noelle R.
    Boucher, Christina
    [J]. BIOINFORMATICS, 2020, 36 (22-23) : 5275 - 5281
  • [2] Almodaresi F, 2017, bioRxiv, DOI 10.1101/138016
  • [3] [Anonymous], 1995, Genome Science and Technology, DOI 10.1089/gst.1995.1.9
  • [4] Graphical pan-genome analysis with compressed suffix trees and the Burrows-Wheeler transform
    Baier, Uwe
    Beller, Timo
    Ohlebusch, Enno
    [J]. BIOINFORMATICS, 2016, 32 (04) : 497 - 504
  • [5] SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
    Bankevich, Anton
    Nurk, Sergey
    Antipov, Dmitry
    Gurevich, Alexey A.
    Dvorkin, Mikhail
    Kulikov, Alexander S.
    Lesin, Valery M.
    Nikolenko, Sergey I.
    Pham, Son
    Prjibelski, Andrey D.
    Pyshkin, Alexey V.
    Sirotkin, Alexander V.
    Vyahhi, Nikolay
    Tesler, Glenn
    Alekseyev, Max A.
    Pevzner, Pavel A.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) : 455 - 477
  • [6] Bowe Alexander, 2012, Algorithms in Bioinformatics. Proceedings of the 12th International Workshop, WABI 2012, P225, DOI 10.1007/978-3-642-33122-0_18
  • [7] Simplitigs as an efficient and scalable representation of de Bruijn graphs
    Brinda, Karel
    Baym, Michael
    Kucherov, Gregory
    [J]. GENOME BIOLOGY, 2021, 22 (01)
  • [8] ALLPATHS: De novo assembly of whole-genome shotgun microreads
    Butler, Jonathan
    MacCallum, Iain
    Kleber, Michael
    Shlyakhter, Ilya A.
    Belmonte, Matthew K.
    Lander, Eric S.
    Nusbaum, Chad
    Jaffe, David B.
    [J]. GENOME RESEARCH, 2008, 18 (05) : 810 - 820
  • [9] Short read fragment assembly of bacterial genomes
    Chaisson, Mark J.
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2008, 18 (02) : 324 - 330
  • [10] How to apply de Bruijn graphs to genome assembly
    Compeau, Phillip E. C.
    Pevzner, Pavel A.
    Tesler, Glenn
    [J]. NATURE BIOTECHNOLOGY, 2011, 29 (11) : 987 - 991