Maximal viral information recovery from sequence data using VirMAP

被引:43
作者
Ajami, Nadim J. [1 ,2 ]
Wong, Matthew C. [1 ,2 ]
Ross, Matthew C. [1 ,2 ]
Lloyd, Richard E. [2 ]
Petrosino, Joseph F. [1 ,2 ]
机构
[1] Baylor Coll Med, Alkek Ctr Metagen & Microbiome Res, Houston, TX 77030 USA
[2] Baylor Coll Med, Dept Mol Virol & Microbiol, Houston, TX 77030 USA
关键词
D O I
10.1038/s41467-018-05658-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate classification of the human virome is critical to a full understanding of the role viruses play in health and disease. This implies the need for sensitive, specific, and practical pipelines that return precise outputs while still enabling case-specific post hoc analysis. Viral taxonomic characterization from metagenomic data suffers from high background noise and signal crosstalk that confounds current methods. Here we develop VirMAP that overcomes these limitations using techniques that merge nucleotide and protein information to tax-onomically classify viral reconstructions independent of genome coverage or read overlap. We validate VirMAP using published data sets and viral mock communities containing RNA and DNA viruses and bacteriophages. VirMAP offers opportunities to enhance metagenomic studies seeking to define virome-host interactions, improve biosurveillance capabilities, and strengthen molecular epidemiology reporting.
引用
收藏
页数:9
相关论文
共 22 条
[21]  
Li D., Liu C.M., Luo R., Sadakane K., Lam T.W., MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph, Bioinformatics, 31, pp. 1674-1676, (2015)
[22]  
Li D., Et al., MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices, Methods, 102, pp. 3-11, (2016)