A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics

被引:8
|
作者
Eisenhofer, Raphael [1 ]
Nesme, Joseph [2 ]
Santos-Bay, Luisa [1 ]
Koziol, Adam [1 ]
Sorensen, Soren Johannes [2 ]
Alberdi, Antton [1 ]
Aizpurua, Ostaizka [1 ]
机构
[1] Univ Copenhagen, Globe Inst, Ctr Evolutionary Hologen, Copenhagen, Denmark
[2] Univ Copenhagen, Dept Biol, Sect Microbiol, Copenhagen, Denmark
基金
新加坡国家研究基金会;
关键词
microbiology; metagenomics; long read; mice; gut microbiome; microbiome; MICROBIAL GENOMES; BACTERIA;
D O I
10.1128/spectrum.03590-23
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Shotgun metagenomics enables the reconstruction of complex microbial communities at a high level of detail. Such an approach can be conducted using both short-read and long-read sequencing data, as well as a combination of both. To assess the pros and cons of these different approaches, we used 22 fecal DNA extracts collected weekly for 11 weeks from two respective lab mice to study seven performance metrics over four combinations of sequencing depth and technology: (i) 20 Gbp of Illumina short-read data, (ii) 40 Gbp of short-read data, (iii) 20 Gbp of PacBio HiFi long-read data, and (iv) 40 Gbp of hybrid (20 Gbp of short-read +20 Gbp of long-read) data. No strategy was best for all metrics; instead, each one excelled across different metrics. The long-read approach yielded the best assembly statistics, with the highest N50 and lowest number of contigs. The 40 Gbp short-read approach yielded the highest number of refined bins. Finally, the hybrid approach yielded the longest assemblies and the highest mapping rate to the bacterial genomes. Our results suggest that while long-read sequencing significantly improves the quality of reconstructed bacterial genomes, it is more expensive and requires deeper sequencing than short-read approaches to recover a comparable amount of reconstructed genomes. The most optimal strategy is study-specific and depends on how researchers assess the trade-off between the quantity and quality of recovered genomes.IMPORTANCEMice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mice gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments. Mice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mice gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Polypolish: Short-read polishing of long-read bacterial genome assemblies
    Wick, Ryan R.
    Holt, Kathryn E.
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (01)
  • [2] Direct Comparative Analysis of a Pharmacogenomics Panel with PacBio Hifi® Long-Read and Illumina Short-Read Sequencing
    Barthelemy, David
    Belmonte, Elodie
    Di Pilla, Laurie
    Bardel, Claire
    Duport, Eve
    Gautier, Veronique
    Payen, Lea
    JOURNAL OF PERSONALIZED MEDICINE, 2023, 13 (12):
  • [3] Pacbio HiFi long-read genomes offer better exomes by unlocking retinal disease variants missed by short-read sequencing
    Karakaya, Kadin
    Kroell-Hermi, Ariane
    Hiersche, Milan
    Decker, Christian
    Liakopoulos, Sandra
    Preising, Markus
    Rohrschneider, Klaus
    Betz, Christian
    Bolz, Hanno
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1328 - 1329
  • [4] Startups use short-read data to expand long-read sequencing market
    Eisenstein, Michael
    NATURE BIOTECHNOLOGY, 2015, 33 (05) : 433 - 435
  • [5] Startups use short-read data to expand long-read sequencing market
    Michael Eisenstein
    Nature Biotechnology, 2015, 33 : 433 - 435
  • [6] Filling the gap of short-read next generation sequencing in PGD by long-read approach
    Ho, D. N. Y.
    Au, C. H.
    Lau, J.
    Wong, E. Y. L.
    Rocha, K. A.
    Xue, L.
    Shum, T. W.
    Law, Y. C.
    Ng, Y. Y.
    Lok, I. H.
    Tang, O. S.
    Lam, S. T. S.
    Chan, T. L.
    Ma, E. S. K.
    HUMAN REPRODUCTION, 2018, 33 : 419 - 420
  • [7] Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies
    Zhao, Xuefang
    Collins, Ryan L.
    Lee, Wan-Ping
    Weber, Alexandra M.
    Jun, Yukyung
    Zhu, Qihui
    Weisburd, Ben
    Huang, Yongqing
    Audano, Peter A.
    Wang, Harold
    Walker, Mark
    Lowther, Chelsea
    Fu, Jack
    Consortium, Human Genome Structural Variation
    Gerstein, Mark B.
    Devine, Scott E.
    Marschall, Tobias
    Korbel, Jan O.
    Eichler, Evan E.
    Chaisson, Mark J. P.
    Lee, Charles
    Mills, Ryan E.
    Brand, Harrison
    Talkowski, Michael E.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2021, 108 (05) : 919 - 928
  • [8] PolyAtailor: measuring poly(A) tail length from short-read and long-read sequencing data
    Liu, Mengfei
    Hao, Linlin
    Yang, Sien
    Wu, Xiaohui
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)
  • [9] Characterization of Fecal Microbiota with Clinical Specimen Using Long-Read and Short-Read Sequencing Platform
    Wei, Po-Li
    Hung, Ching-Sheng
    Kao, Yi-Wei
    Lin, Ying-Chin
    Lee, Cheng-Yang
    Chang, Tzu-Hao
    Shia, Ben-Chang
    Lin, Jung-Chun
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (19) : 1 - 12
  • [10] Tandem repeat genotyping using massively parallel second generation sequencing: comparison of short-read and long-read technologies
    Radvanszky, Jan
    Lojova, Ingrid
    Kucharik, Marcel
    Balaz, Andrej
    Kvapilova, Katerina
    Kvapil, Petr
    Brzon, Ondrej
    Kasny, Martin
    Duranova, Terezia
    Forgacova, Natalia
    Hrnciar, Matej
    Holesova, Zuzana
    Martis, Jozef
    Sitarcik, Jozef
    Budis, Jaroslav
    Szemes, Tomas
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1784 - 1785