A network-based integrated framework for predicting virus-prokaryote interactions

Cited by: 80
Authors
Wang, Weili [1]
Ren, Jie [1,5]
Tang, Kujin [1]
Dart, Emily [2]
Ignacio-Espinoza, Julio Cesar [3]
Fuhrman, Jed A. [3]
Braun, Jonathan [4]
Sun, Fengzhu [1]
Ahlgren, Nathan A. [2]
Affiliations
[1] Univ Southern Calif, Quantitat & Computat Biol Program, Los Angeles, CA 90089 USA
[2] Clark Univ, Biol Dept, Worcester, MA 01610 USA
[3] Univ Southern Calif, Dept Biol Sci, Los Angeles, CA 90089 USA
[4] Cedars Sinai Med Ctr, Inflammatory Bowel & Immunobiol Res Inst, Los Angeles, CA 90048 USA
[5] Google Inc, Mountain View, CA USA
Funding
National Institutes of Health (USA); National Science Foundation (USA);
Keywords
BACTERIA; HOST; VIROME; BACTERIOPHAGES; CLASSIFICATION; DIVERSITY;
DOI
10.1093/nargab/lqaa044
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007; 090102;
Abstract
Metagenomic sequencing has greatly enhanced the discovery of viral genomic sequences; however, it remains challenging to identify the host(s) of these new viruses. We developed VirHostMatcher-Net, a flexible, network-based, Markov random field framework for predicting virus-prokaryote interactions using multiple, integrated features: CRISPR sequences and alignment-free similarity measures (s2* and WIsH). Evaluation of this method on a benchmark set of 1462 known virus-prokaryote pairs yielded host prediction accuracy of 59% and 86% at the genus and phylum levels, representing 16-27% and 6-10% improvement, respectively, over previous single-feature prediction approaches. We applied our host prediction tool to crAssphage, a human gut phage, and two metagenomic virus datasets: marine viruses and viral contigs recovered from globally distributed, diverse habitats. Host predictions were frequently consistent with those of previous studies, but more importantly, this new tool made many more confident predictions than previous tools, up to nearly 3-fold more (n > 27 000), greatly expanding the diversity of known virus-host interactions.
Pages: 19