Unexpected absence of ribosomal protein genes from metagenome-assembled genomes

被引:0
作者
Kazumori Mise
Wataru Iwasaki
机构
[1] The University of Tokyo. Bunkyo-ku,Department of Biological Sciences, Graduate School of Science
[2] National Institute of Advanced Industrial Science and Technology,Department of Integrated Biosciences, Graduate School of Frontier Sciences
[3] Sapporo,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences
[4] The University of Tokyo,Atmosphere and Ocean Research Institute
[5] Kashiwa,Institute for Quantitative Biosciences
[6] The University of Tokyo,Collaborative Research Institute for Innovative Microbiology
[7] Kashiwa,undefined
[8] The University of Tokyo,undefined
[9] Kashiwa,undefined
[10] The University of Tokyo,undefined
[11] Bunkyo,undefined
[12] The University of Tokyo,undefined
[13] Bunkyo,undefined
来源
ISME Communications | / 2卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Metagenome-assembled genomes (MAGs) have revealed the hidden diversity and functions of uncultivated microbes, but their reconstruction from metagenomes remains a computationally difficult task. Repetitive or exogenous sequences, such as ribosomal RNA and horizontally transferred genes, are frequently absent from MAGs because of misassembly and binning errors. Here, we report that ribosomal protein genes are also often absent from MAGs, although they are neither repetitive nor exogenous. Comprehensive analyses of more than 190,000 MAGs revealed that these genes could be missing in more than 20–40% of near-complete (i.e., with completeness of 90% or higher) MAGs. While some uncultivated environmental microbes intrinsically lack some ribosomal protein genes, we found that this unexpected absence is largely due to special evolutionary patterns of codon usage bias in ribosomal protein genes and algorithmic characteristics of metagenomic binning, which is dependent on tetranucleotide frequencies of contigs. This problem reflects the microbial life-history strategy. Fast-growing microbes tend to have this difficulty, likely because of strong evolutionary pressures on ribosomal protein genes toward the efficient assembly of ribosomes. Our observations caution those who study genomics and phylogeny of uncultivated microbes, the diversity and evolution of microbial genes in the central dogma, and bioinformatics in metagenomics.
引用
收藏
相关论文
共 312 条
[1]  
Hug LA(2016)A new view of the tree of life Nat Microbiol 1 16048-701
[2]  
Baker BJ(2015)Genomic expansion of domain archaea highlights roles for organisms from new phyla in anaerobic carbon cycling Curr Biol 25 690-11
[3]  
Anantharaman K(2015)Unusual biology across a group comprising more than 15% of domain Bacteria Nature 523 208-9
[4]  
Brown CT(2014)A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes Nat Commun 5 555-9
[5]  
Probst AJ(2015)Complete nitrification by a single microorganism Nature 528 504-504
[6]  
Castelle CJ(2015)Complete nitrification by Nitrospira bacteria Nature 528 499-11
[7]  
Castelle CJ(2022)The OceanDNA MAG catalog contains over 50,000 prokaryotic genomes originated from various marine environments Sci Data 9 3-14
[8]  
Wrighton KC(2019)A new genomic blueprint of the human gut microbiota Nature 568 e7359-34
[9]  
Thomas BC(2016)MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices Methods 102 6301-33
[10]  
Hug LA(2014)MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm Microbiome 2 824-6