A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics

被引:8
|
作者
Eisenhofer, Raphael [1 ]
Nesme, Joseph [2 ]
Santos-Bay, Luisa [1 ]
Koziol, Adam [1 ]
Sorensen, Soren Johannes [2 ]
Alberdi, Antton [1 ]
Aizpurua, Ostaizka [1 ]
机构
[1] Univ Copenhagen, Globe Inst, Ctr Evolutionary Hologen, Copenhagen, Denmark
[2] Univ Copenhagen, Dept Biol, Sect Microbiol, Copenhagen, Denmark
基金
新加坡国家研究基金会;
关键词
microbiology; metagenomics; long read; mice; gut microbiome; microbiome; MICROBIAL GENOMES; BACTERIA;
D O I
10.1128/spectrum.03590-23
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Shotgun metagenomics enables the reconstruction of complex microbial communities at a high level of detail. Such an approach can be conducted using both short-read and long-read sequencing data, as well as a combination of both. To assess the pros and cons of these different approaches, we used 22 fecal DNA extracts collected weekly for 11 weeks from two respective lab mice to study seven performance metrics over four combinations of sequencing depth and technology: (i) 20 Gbp of Illumina short-read data, (ii) 40 Gbp of short-read data, (iii) 20 Gbp of PacBio HiFi long-read data, and (iv) 40 Gbp of hybrid (20 Gbp of short-read +20 Gbp of long-read) data. No strategy was best for all metrics; instead, each one excelled across different metrics. The long-read approach yielded the best assembly statistics, with the highest N50 and lowest number of contigs. The 40 Gbp short-read approach yielded the highest number of refined bins. Finally, the hybrid approach yielded the longest assemblies and the highest mapping rate to the bacterial genomes. Our results suggest that while long-read sequencing significantly improves the quality of reconstructed bacterial genomes, it is more expensive and requires deeper sequencing than short-read approaches to recover a comparable amount of reconstructed genomes. The most optimal strategy is study-specific and depends on how researchers assess the trade-off between the quantity and quality of recovered genomes.IMPORTANCEMice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mice gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments. Mice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mice gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Lineage-resolved complete metagenomics with long-read sequencing for rumen microbial characterization
    Bickhart, D. M.
    McClure, J. C.
    Shin, S. B.
    Smith, T. P. L.
    JOURNAL OF DAIRY SCIENCE, 2022, 105 : 17 - 17
  • [22] Benchmarking short-read metagenomics tools for removing host contamination
    Gao, Yunyun
    Luo, Hao
    Lyu, Hujie
    Yang, Haifei
    Yousuf, Salsabeel
    Huang, Shi
    Liu, Yong-Xin
    GIGASCIENCE, 2025, 14
  • [23] Democratizing long-read genome assembly
    Kirsche, Melanie
    Schatz, Michael C.
    CELL SYSTEMS, 2021, 12 (10) : 945 - 947
  • [24] MetaCC allows scalable and integrative analyses of both long-read and short-read metagenomic Hi-C data
    Du, Yuxuan
    Sun, Fengzhu
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [25] Short-read and long-read full-length transcriptome of mouse neural stem cells across neurodevelopmental stages
    Chaoqiong Ding
    Xiang Yan
    Mengying Xu
    Ran Zhou
    Yuancun Zhao
    Dan Zhang
    Zongyao Huang
    Zhenzhong Pan
    Peng Xiao
    Huifang Li
    Lu Chen
    Yuan Wang
    Scientific Data, 9
  • [26] Short-read and long-read RNA sequencing of mouse hematopoietic stem cells at bulk and single-cell levels
    Zheng, Xiuran
    Zhang, Dan
    Xu, Mengying
    Zeng, Wanqin
    Zhou, Ran
    Zhang, Yiming
    Tang, Chao
    Chen, Li
    Chen, Lu
    Lin, Jing-Wen
    SCIENTIFIC DATA, 2021, 8 (01)
  • [27] Short-read and long-read full-length transcriptome of mouse neural stem cells across neurodevelopmental stages
    Ding, Chaoqiong
    Yan, Xiang
    Xu, Mengying
    Zhou, Ran
    Zhao, Yuancun
    Zhang, Dan
    Huang, Zongyao
    Pan, Zhenzhong
    Xiao, Peng
    Li, Huifang
    Chen, Lu
    Wang, Yuan
    SCIENTIFIC DATA, 2022, 9 (01)
  • [28] Assessment of read depth requirements for gene and isoform discovery: a comparative study of long-read and short-read RNA sequencing data in human heart
    Gonzaludo, Nina
    Bruand, Jocelyne
    Klegarth, Amy
    Underwood, Jason
    Tseng, Elizabeth
    Aldinger, Kimberly A.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1778 - 1779
  • [29] Assessment of read depth requirements for gene and isoform discovery: a comparative study of long-read and short-read RNA sequencing data in human heart
    Gonzaludo, Nina
    Bruand, Jocelyne
    Klegarth, Amy
    Underwood, Jason
    Tseng, Elizabeth
    Aldinger, Kimberly A.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1778 - 1779
  • [30] Comparison of long-read methods for sequencing and assembly of a plant genome
    Murigneux, Valentine
    Rai, Subash Kumar
    Furtado, Agnelo
    Bruxner, Timothy J. C.
    Tian, Wei
    Harliwong, Ivon
    Wei, Hanmin
    Yang, Bicheng
    Ye, Qianyu
    Anderson, Ellis
    Mao, Qing
    Drmanac, Radoje
    Wang, Ou
    Peters, Brock A.
    Xu, Mengyang
    Wu, Pei
    Topp, Bruce
    Coin, Lachlan J. M.
    Henry, Robert J.
    GIGASCIENCE, 2020, 9 (12):