Epidemic Reconstruction in a Phylogenetics Framework: Transmission Trees as Partitions of the Node Set

被引:68
作者
Hall, Matthew [1 ,2 ]
Woolhouse, Mark [1 ,2 ]
Rambaut, Andrew [1 ,2 ,3 ]
机构
[1] Univ Edinburgh, Inst Evolutionary Biol, Edinburgh, Midlothian, Scotland
[2] Univ Edinburgh, Ctr Immun Infect & Evolut, Edinburgh, Midlothian, Scotland
[3] NIH, Fogarty Int Ctr, Bldg 10, Bethesda, MD 20892 USA
基金
欧洲研究理事会;
关键词
RELAXED PHYLOGENETICS; DNA-SEQUENCES; GENETIC DATA; INFERENCE; MODELS; NETHERLANDS; DISTANCES; OUTBREAKS; POULTRY; SKYLINE;
D O I
10.1371/journal.pcbi.1004613
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The use of genetic data to reconstruct the transmission tree of infectious disease epidemics and outbreaks has been the subject of an increasing number of studies, but previous approaches have usually either made assumptions that are not fully compatible with phylogenetic inference, or, where they have based inference on a phylogeny, have employed a procedure that requires this tree to be fixed. At the same time, the coalescent-based models of the pathogen population that are employed in the methods usually used for time-resolved phylogeny reconstruction are a considerable simplification of epidemic process, as they assume that pathogen lineages mix freely. Here, we contribute a new method that is simultaneously a phylogeny reconstruction method for isolates taken from an epidemic, and a procedure for transmission tree reconstruction. We observe that, if one or more samples is taken from each host in an epidemic or outbreak and these are used to build a phylogeny, a transmission tree is equivalent to a partition of the set of nodes of this phylogeny, such that each partition element is a set of nodes that is connected in the full tree and contains all the tips corresponding to samples taken from one and only one host. We then implement a Monte Carlo Markov Chain (MCMC) procedure for simultaneous sampling from the spaces of both trees, utilising a newly-designed set of phylogenetic tree proposals that also respect node partitions. We calculate the posterior probability of these partitioned trees based on a model that acknowledges the population structure of an epidemic by employing an individual-based disease transmission model and a coalescent process taking place within each host. We demonstrate our method, first using simulated data, and then with sequences taken from the H7N7 avian influenza outbreak that occurred in the Netherlands in 2003. We show that it is superior to established coalescent methods for reconstructing the topology and node heights of the phylogeny and performs well for transmission tree reconstruction when the phylogeny is well-resolved by the genetic data, but caution that this will often not be the case in practice and that existing genetic and epidemiological data should be used to configure such analyses whenever possible. This method is available for use by the research community as part of BEAST, one of the most widely-used packages for reconstruction of dated phylogenies.
引用
收藏
页数:36
相关论文
共 54 条
  • [1] Modelling the spread of infectious salmon anaemia among salmon farms based on seaway distances between farms and genetic relationships between infectious salmon anaemia virus isolates
    Aldrin, M.
    Lyngstad, T. M.
    Kristoffersen, A. B.
    Storvik, B.
    Borgan, O.
    Jansen, P. A.
    [J]. JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2011, 8 (62) : 1346 - 1356
  • [2] Evolutionary Analysis of Inter-Farm Transmission Dynamics in a Highly Pathogenic Avian Influenza Epidemic
    Bataille, Arnaud
    van der Meer, Frank
    Stegeman, Arjan
    Koch, Guus
    [J]. PLOS PATHOGENS, 2011, 7 (06)
  • [3] πBUSS: a parallel BEAST/BEAGLE utility for sequence simulation under complex evolutionary scenarios
    Bielejec, Filip
    Lemey, Philippe
    Carvalho, Luiz Max
    Baele, Guy
    Rambaut, Andrew
    Suchard, Marc A.
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [4] Risk maps for the spread of highly pathogenic avian influenza in poultry
    Boender, Gert Jan
    Hagenaars, Thomas J.
    Bouma, Annemarie
    Nodelijk, Gonnie
    Elbers, Armin R. W.
    de Jong, Mart C. M.
    van Boven, Michiel
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (04) : 704 - 712
  • [5] A global initiative on sharing avian flu data
    Bogner, Peter
    Capua, Ilaria
    Cox, Nancy J.
    Lipman, David J.
    [J]. NATURE, 2006, 442 (7106) : 981 - 981
  • [6] Estimating the day of highly pathogenic avian influenza (H7N7) virus introduction into a poultry flock based on mortality data
    Bos, Marian E. H.
    Van Boven, Michiel
    Nielen, Mirjam
    Bouma, Annemarie
    Elbers, Armin R. W.
    Nodelijk, Gonnie
    Koch, Guus
    Stegeman, Arjan
    De Jong, Mart C. M.
    [J]. VETERINARY RESEARCH, 2007, 38 (03) : 493 - 504
  • [7] Sequential Bottlenecks Drive Viral Evolution in Early Acute Hepatitis C Virus Infection
    Bull, Rowena A.
    Luciani, Fabio
    McElroy, Kerensa
    Gaudieri, Silvana
    Pham, Son T.
    Chopra, Abha
    Cameron, Barbara
    Maher, Lisa
    Dore, Gregory J.
    White, Peter A.
    Lloyd, Andrew R.
    [J]. PLOS PATHOGENS, 2011, 7 (09)
  • [8] Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virus
    Cottam, Eleanor M.
    Thebaud, Gael
    Wadsworth, Jemma
    Gloster, John
    Mansley, Leonard
    Paton, David J.
    King, Donald P.
    Haydon, Daniel T.
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 275 (1637) : 887 - 895
  • [9] Deardon R, 2010, STAT SINICA, V20, P239
  • [10] Bayesian Inference of Infectious Disease Transmission from Whole-Genome Sequence Data
    Didelot, Xavier
    Gardy, Jennifer
    Colijn, Caroline
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (07) : 1869 - 1879