Chlomito: a novel tool for precise elimination of organelle genome contamination from nuclear genome assembly

被引:0
作者
Song, Wei [1 ]
Li, Chong [1 ]
Lu, Yanming [1 ]
Shen, Dawei [2 ]
Jia, Yunxiao [1 ]
Huo, Yixin [1 ]
Piao, Weilan [1 ,3 ]
Jin, Hua [1 ,3 ,4 ]
机构
[1] Beijing Inst Technol, Aerosp Ctr Hosp, Lab Genet & Disorders, Sch Life Sci,Key Lab Mol Med & Biotherapy, Beijing, Peoples R China
[2] Beijing Inst Technol, Res Inst Sci & Technol, Beijing, Peoples R China
[3] Beijing Inst Technol, Adv Technol Res Inst, Jinan, Peoples R China
[4] Aerosp Ctr Hosp, Dept Pathol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
mitochondrial genome; chloroplast genome; chromosome-level assembly; organelle identification; horizontal gene transfer; MITOCHONDRIAL GENOMES; GENE-TRANSFER; ALIGNMENT; SUITE;
D O I
10.3389/fpls.2024.1430443
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Introduction Accurate reference genomes are fundamental to understanding biological evolution, biodiversity, hereditary phenomena and diseases. However, many assembled nuclear chromosomes are often contaminated by organelle genomes, which will mislead bioinformatic analysis, and genomic and transcriptomic data interpretation.Methods To address this issue, we developed a tool named Chlomito, aiming at precise identification and elimination of organelle genome contamination from nuclear genome assembly. Compared to conventional approaches, Chlomito utilized new metrics, alignment length coverage ratio (ALCR) and sequencing depth ratio (SDR), thereby effectively distinguishing true organelle genome sequences from those transferred into nuclear genomes via horizontal gene transfer (HGT).Results The accuracy of Chlomito was tested using sequencing data from Plum, Mango and Arabidopsis. The results confirmed that Chlomito can accurately detect contigs originating from the organelle genomes, and the identified contigs covered most regions of the organelle reference genomes, demonstrating efficiency and precision of Chlomito. Considering user convenience, we further packaged this method into a Docker image, simplified the data processing workflow.Discussion Overall, Chlomito provides an efficient, accurate and convenient method for identifying and removing contigs derived from organelle genomes in genomic assembly data, contributing to the improvement of genome assembly quality.
引用
收藏
页数:14
相关论文
共 65 条
[1]   MitoFinder: Efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics [J].
Allio, Remi ;
Schomaker-Bastos, Alex ;
Romiguier, Jonathan ;
Prosdocimi, Francisco ;
Nabholz, Benoit ;
Delsuc, Frederic .
MOLECULAR ECOLOGY RESOURCES, 2020, 20 (04) :892-905
[2]   Chromosome-level genome assembly of the Asian aspen Populus davidiana Dode [J].
Bae, Eun-Kyung ;
Kang, Min-Jeong ;
Lee, Seung-Jae ;
Park, Eung-Jun ;
Kim, Ki-Tae .
SCIENTIFIC DATA, 2023, 10 (01)
[3]   A chromosomal-scale genome assembly of modern cultivated hybrid sugarcane provides insights into origination and evolution [J].
Bao, Yixue ;
Zhang, Qing ;
Huang, Jiangfeng ;
Zhang, Shengcheng ;
Yao, Wei ;
Yu, Zehuai ;
Deng, Zuhu ;
Yu, Jiaxin ;
Kong, Weilong ;
Yu, Xikai ;
Lu, Shan ;
Wang, Yibin ;
Li, Ru ;
Song, Yuhan ;
Zou, Chengwu ;
Xu, Yuzhi ;
Liu, Zongling ;
Yu, Fan ;
Song, Jiaming ;
Huang, Youzong ;
Zhang, Jisen ;
Wang, Haifeng ;
Chen, Baoshan ;
Zhang, Xingtan ;
Zhang, Muqing .
NATURE COMMUNICATIONS, 2024, 15 (01)
[4]   Mitochondrial DNA, chloroplast DNA and the origins of development in eukaryotic organisms [J].
Bendich, Arnold J. .
BIOLOGY DIRECT, 2010, 5
[5]   Odintifier - A computational method for identifying insertions of organellar origin from modern and ancient high-throughput sequencing data based on haplotype phasing [J].
Castruita, Jose Alfredo Samaniego ;
Mendoza, Marie Lisandra Zepeda ;
Barnett, Ross ;
Wales, Nathan ;
Gilbert, M. Thomas P. .
BMC BIOINFORMATICS, 2015, 16
[6]   Chlorella vulgaris genome assembly and annotation reveals the molecular basis for metabolic acclimation to high light conditions [J].
Cecchin, Michela ;
Marcolungo, Luca ;
Rossato, Marzia ;
Girolomoni, Laura ;
Cosentino, Emanuela ;
Cuine, Stephan ;
Li-Beisson, Yonghua ;
Delledonne, Massimo ;
Ballottari, Matteo .
PLANT JOURNAL, 2019, 100 (06) :1289-1305
[7]   Comparative analysis of nuclear, chloroplast, and mitochondrial genomes of watermelon and melon provides evidence of gene transfer [J].
Cui, Haonan ;
Ding, Zhuo ;
Zhu, Qianglong ;
Wu, Yue ;
Qiu, Boyan ;
Gao, Peng .
SCIENTIFIC REPORTS, 2021, 11 (01)
[8]   Genomic Analysis Based on Chromosome-Level Genome Assembly Reveals an Expansion of Terpene Biosynthesis of Azadirachta indica [J].
Du, Yuhui ;
Song, Wei ;
Yin, Zhiqiu ;
Wu, Shengbo ;
Liu, Jiaheng ;
Wang, Ning ;
Jin, Hua ;
Qiao, Jianjun ;
Huo, Yi-Xin .
FRONTIERS IN PLANT SCIENCE, 2022, 13
[9]   Accelerated Profile HMM Searches [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)
[10]   Complete Sequence of a 641-kb Insertion of Mitochondrial DNA in the Arabidopsis thaliana Nuclear Genome [J].
Fields, Peter D. ;
Waneka, Gus ;
Naish, Matthew ;
Schatz, Michael C. ;
Henderson, Ian R. ;
Sloan, Daniel B. .
GENOME BIOLOGY AND EVOLUTION, 2022, 14 (05)