Chlomito: a novel tool for precise elimination of organelle genome contamination from nuclear genome assembly

被引:0
作者
Song, Wei [1 ]
Li, Chong [1 ]
Lu, Yanming [1 ]
Shen, Dawei [2 ]
Jia, Yunxiao [1 ]
Huo, Yixin [1 ]
Piao, Weilan [1 ,3 ]
Jin, Hua [1 ,3 ,4 ]
机构
[1] Beijing Inst Technol, Aerosp Ctr Hosp, Lab Genet & Disorders, Sch Life Sci,Key Lab Mol Med & Biotherapy, Beijing, Peoples R China
[2] Beijing Inst Technol, Res Inst Sci & Technol, Beijing, Peoples R China
[3] Beijing Inst Technol, Adv Technol Res Inst, Jinan, Peoples R China
[4] Aerosp Ctr Hosp, Dept Pathol, Beijing, Peoples R China
来源
FRONTIERS IN PLANT SCIENCE | 2024年 / 15卷
基金
中国国家自然科学基金;
关键词
mitochondrial genome; chloroplast genome; chromosome-level assembly; organelle identification; horizontal gene transfer; MITOCHONDRIAL GENOMES; GENE-TRANSFER; ALIGNMENT; SUITE;
D O I
10.3389/fpls.2024.1430443
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Introduction Accurate reference genomes are fundamental to understanding biological evolution, biodiversity, hereditary phenomena and diseases. However, many assembled nuclear chromosomes are often contaminated by organelle genomes, which will mislead bioinformatic analysis, and genomic and transcriptomic data interpretation.Methods To address this issue, we developed a tool named Chlomito, aiming at precise identification and elimination of organelle genome contamination from nuclear genome assembly. Compared to conventional approaches, Chlomito utilized new metrics, alignment length coverage ratio (ALCR) and sequencing depth ratio (SDR), thereby effectively distinguishing true organelle genome sequences from those transferred into nuclear genomes via horizontal gene transfer (HGT).Results The accuracy of Chlomito was tested using sequencing data from Plum, Mango and Arabidopsis. The results confirmed that Chlomito can accurately detect contigs originating from the organelle genomes, and the identified contigs covered most regions of the organelle reference genomes, demonstrating efficiency and precision of Chlomito. Considering user convenience, we further packaged this method into a Docker image, simplified the data processing workflow.Discussion Overall, Chlomito provides an efficient, accurate and convenient method for identifying and removing contigs derived from organelle genomes in genomic assembly data, contributing to the improvement of genome assembly quality.
引用
收藏
页数:14
相关论文
共 65 条
  • [1] MitoFinder: Efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics
    Allio, Remi
    Schomaker-Bastos, Alex
    Romiguier, Jonathan
    Prosdocimi, Francisco
    Nabholz, Benoit
    Delsuc, Frederic
    [J]. MOLECULAR ECOLOGY RESOURCES, 2020, 20 (04) : 892 - 905
  • [2] Chromosome-level genome assembly of the Asian aspen Populus davidiana Dode
    Bae, Eun-Kyung
    Kang, Min-Jeong
    Lee, Seung-Jae
    Park, Eung-Jun
    Kim, Ki-Tae
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [3] A chromosomal-scale genome assembly of modern cultivated hybrid sugarcane provides insights into origination and evolution
    Bao, Yixue
    Zhang, Qing
    Huang, Jiangfeng
    Zhang, Shengcheng
    Yao, Wei
    Yu, Zehuai
    Deng, Zuhu
    Yu, Jiaxin
    Kong, Weilong
    Yu, Xikai
    Lu, Shan
    Wang, Yibin
    Li, Ru
    Song, Yuhan
    Zou, Chengwu
    Xu, Yuzhi
    Liu, Zongling
    Yu, Fan
    Song, Jiaming
    Huang, Youzong
    Zhang, Jisen
    Wang, Haifeng
    Chen, Baoshan
    Zhang, Xingtan
    Zhang, Muqing
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [4] Mitochondrial DNA, chloroplast DNA and the origins of development in eukaryotic organisms
    Bendich, Arnold J.
    [J]. BIOLOGY DIRECT, 2010, 5
  • [5] Odintifier - A computational method for identifying insertions of organellar origin from modern and ancient high-throughput sequencing data based on haplotype phasing
    Castruita, Jose Alfredo Samaniego
    Mendoza, Marie Lisandra Zepeda
    Barnett, Ross
    Wales, Nathan
    Gilbert, M. Thomas P.
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [6] Chlorella vulgaris genome assembly and annotation reveals the molecular basis for metabolic acclimation to high light conditions
    Cecchin, Michela
    Marcolungo, Luca
    Rossato, Marzia
    Girolomoni, Laura
    Cosentino, Emanuela
    Cuine, Stephan
    Li-Beisson, Yonghua
    Delledonne, Massimo
    Ballottari, Matteo
    [J]. PLANT JOURNAL, 2019, 100 (06) : 1289 - 1305
  • [7] Comparative analysis of nuclear, chloroplast, and mitochondrial genomes of watermelon and melon provides evidence of gene transfer
    Cui, Haonan
    Ding, Zhuo
    Zhu, Qianglong
    Wu, Yue
    Qiu, Boyan
    Gao, Peng
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [8] Genomic Analysis Based on Chromosome-Level Genome Assembly Reveals an Expansion of Terpene Biosynthesis of Azadirachta indica
    Du, Yuhui
    Song, Wei
    Yin, Zhiqiu
    Wu, Shengbo
    Liu, Jiaheng
    Wang, Ning
    Jin, Hua
    Qiao, Jianjun
    Huo, Yi-Xin
    [J]. FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [9] Accelerated Profile HMM Searches
    Eddy, Sean R.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)
  • [10] Complete Sequence of a 641-kb Insertion of Mitochondrial DNA in the Arabidopsis thaliana Nuclear Genome
    Fields, Peter D.
    Waneka, Gus
    Naish, Matthew
    Schatz, Michael C.
    Henderson, Ian R.
    Sloan, Daniel B.
    [J]. GENOME BIOLOGY AND EVOLUTION, 2022, 14 (05):