Computational approaches to predict bacteriophage-host relationships

被引:299
作者
Edwards, Robert A. [1 ,2 ,3 ]
McNair, Katelyn [1 ]
Faust, Karoline [4 ,5 ,6 ]
Raes, Jeroen [4 ,5 ,6 ]
Dutilh, Bas E. [2 ,7 ,8 ]
机构
[1] San Diego State Univ, Dept Comp Sci, 5500 Campanile Dr, San Diego, CA 92182 USA
[2] Univ Fed Rio de Janeiro, Inst Biol, Dept Marine Biol, BR-21941902 Rio De Janeiro, Brazil
[3] Argonne Natl Lab, Div Math & Comp Sci, 9700 S Cass Ave, Argonne, IL 60439 USA
[4] Katholieke Univ Leuven, Rega Inst, Dept Microbiol & Immunol, Herestr 49, B-3000 Leuven, Belgium
[5] VIB, Ctr Biol Dis, Herestr 49, B-3000 Leuven, Belgium
[6] Vrije Univ Brussel, Microbiol Lab, Pleinlaan 2, B-1050 Brussels, Belgium
[7] Univ Utrecht, Theoret Biol & Bioinformat, Padualaan 8, NL-3584 CH Utrecht, Netherlands
[8] Radboud Univ Nijmegen, Med Ctr, Ctr Mol & Biomol Informat, Radboud Inst Mol Life Sci, Geert Grooteplein 28, NL-6525 GA Nijmegen, Netherlands
基金
美国国家科学基金会;
关键词
phages; viruses of microbes; metagenomics; co-occurrence; CRISPR; oligonucleotide usage; SUPERINFECTION EXCLUSION; P22; PROPHAGE; SALMONELLA-TYPHIMURIUM; LEVEL DECONVOLUTION; ACQUIRED-RESISTANCE; VIRAL COMMUNITIES; BACTERIAL GENOMES; GENETIC ELEMENTS; PHAGE; DNA;
D O I
10.1093/femsre/fuv048
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Metagenomics has changed the face of virus discovery by enabling the accurate identification of viral genome sequences without requiring isolation of the viruses. As a result, metagenomic virus discovery leaves the first and most fundamental question about any novel virus unanswered: What host does the virus infect? The diversity of the global virosphere and the volumes of data obtained in metagenomic sequencing projects demand computational tools for virus-host prediction. We focus on bacteriophages (phages, viruses that infect bacteria), the most abundant and diverse group of viruses found in environmental metagenomes. By analyzing 820 phages with annotated hosts, we review and assess the predictive power of in silico phage-host signals. Sequence homology approaches are the most effective at identifying known phage-host pairs. Compositional and abundance-based methods contain significant signal for phage-host classification, providing opportunities for analyzing the unknowns in viral metagenomes. Together, these computational approaches further our knowledge of the interactions between phages and their hosts. Importantly, we find that all reviewed signals significantly link phages to their hosts, illustrating how current knowledge and insights about the interaction mechanisms and ecology of coevolving phages and bacteria can be exploited to predict phage-host relationships, with potential relevance for medical and industrial applications.New viruses infecting bacteria are increasingly being discovered in many environments through sequence-based explorations. To understand their role in microbial ecosystems, computational tools are indispensable to prioritize and guide experimental efforts. This review assesses and discusses a range of bioinformatic approaches to predict bacteriophage-host relationships when all that is known is their genome sequence.New viruses infecting bacteria are increasingly being discovered in many environments through sequence-based explorations. To understand their role in microbial ecosystems, computational tools are indispensable to prioritize and guide experimental efforts. This review assesses and discusses a range of bioinformatic approaches to predict bacteriophage-host relationships when all that is known is their genome sequence.
引用
收藏
页码:258 / 272
页数:15
相关论文
共 120 条
  • [1] PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies
    Akhter, Sajia
    Aziz, Ramy K.
    Edwards, Robert A.
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (16) : e126
  • [2] Single-cell and population level viral infection dynamics revealed by phageFISH, a method to visualize intracellular and free viruses
    Allers, Elke
    Moraru, Cristina
    Duhaime, Melissa B.
    Beneze, Erica
    Solonenko, Natalie
    Barrero-Canosa, Jimena
    Amann, Rudolf
    Sullivan, Matthew B.
    [J]. ENVIRONMENTAL MICROBIOLOGY, 2013, 15 (08) : 2306 - 2318
  • [3] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [4] Using CRISPRs as a metagenomic tool to identify microbial hosts of a diffuse flow hydrothermal vent viral assemblage
    Anderson, Rika E.
    Brazelton, William J.
    Baross, John A.
    [J]. FEMS MICROBIOLOGY ECOLOGY, 2011, 77 (01) : 120 - 133
  • [5] Virus population dynamics and acquired virus resistance in natural microbial communities
    Andersson, Anders F.
    Banfield, Jillian F.
    [J]. SCIENCE, 2008, 320 (5879) : 1047 - 1050
  • [6] [Anonymous], 1996, SCI PSYCH S
  • [7] Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
    Aziz, Ramy K.
    Dwivedi, Bhakti
    Akhter, Sajia
    Breitbart, Mya
    Edwards, Robert A.
    [J]. FRONTIERS IN MICROBIOLOGY, 2015, 6
  • [8] CRISPR provides acquired resistance against viruses in prokaryotes
    Barrangou, Rodolphe
    Fremaux, Christophe
    Deveau, Helene
    Richards, Melissa
    Boyaval, Patrick
    Moineau, Sylvain
    Romero, Dennis A.
    Horvath, Philippe
    [J]. SCIENCE, 2007, 315 (5819) : 1709 - 1712
  • [9] Coevolutionary diversification creates nested-modular structure in phage-bacteria interaction networks
    Beckett, Stephen J.
    Williams, Hywel T. P.
    [J]. INTERFACE FOCUS, 2013, 3 (06)
  • [10] Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products
    Beitel, Christopher W.
    Froenicke, Lutz
    Lang, Jenna M.
    Korf, Ian F.
    Michelmore, Richard W.
    Eisen, Jonathan A.
    Darling, Aaron E.
    [J]. PEERJ, 2014, 2