Fast and Accurate Maximum-Likelihood Estimation of Multi-Type Birth-Death Epidemiological Models from Phylogenetic Trees

被引:1
作者
Zhukova, Anna [1 ,2 ]
Hecht, Frederic [3 ]
Maday, Yvon [3 ,4 ]
Gascuel, Olivier [1 ,5 ,6 ]
机构
[1] Univ Paris, Inst Pasteur, Unite Bioinformat Evolut, 28 Rue Docteur Roux, F-75015 Paris, France
[2] Univ Paris, Inst Pasteur, Bioinformat & Biostatist Hub, 28 Rue Docteur Roux, F-75015 Paris, France
[3] Univ Paris Cite, Sorbonne Univ, CNRS, Lab Jacques Louis Lions LJLL, 4 Pl Jussieu, F-75005 Paris, France
[4] Inst Univ France, 1 Rue Descartes, F-75231 Paris 05, France
[5] Museum Natl Hist Nat, Inst Systemat Evolut Biodiversite ISYEB, URM 7205, SU,EPHE,CNRS, 57 Rue Cuvier,CP 50, F-75005 Paris, France
[6] UA, 57 Rue Cuvier,CP 50, F-75005 Paris, France
基金
欧洲研究理事会;
关键词
Birth-death model; Ebola; epidemiology; mathematical modelling; ordinary differential equations; phylodynamics; DYNAMICS; SPECIATION; FRAMEWORK; INFERENCE; SPREAD;
D O I
10.1093/sysbio/syad059
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multi-type birth-death (MTBD) models are phylodynamic analogies of compartmental models in classical epidemiology. They serve to infer such epidemiological parameters as the average number of secondary infections R-e and the infectious time from a phylogenetic tree (a genealogy of pathogen sequences). The representatives of this model family focus on various aspects of pathogen epidemics. For instance, the birth-death exposed-infectious (BDEI) model describes the transmission of pathogens featuring an incubation period (when there is a delay between the moment of infection and becoming infectious, as for Ebola and SARS-CoV-2), and permits its estimation along with other parameters. With constantly growing sequencing data, MTBD models should be extremely useful for unravelling information on pathogen epidemics. However, existing implementations of these models in a phylodynamic framework have not yet caught up with the sequencing speed. Computing time and numerical instability issues limit their applicability to medium data sets (<= 500 samples), while the accuracy of estimations should increase with more data. We propose a new highly parallelizable formulation of ordinary differential equations for MTBD models. We also extend them to forests to represent situations when a (sub-)epidemic started from several cases (e.g., multiple introductions to a country). We implemented it for the BDEI model in a maximum likelihood framework using a combination of numerical analysis methods for efficient equation resolution. Our implementation estimates epidemiological parameter values and their confidence intervals in two minutes on a phylogenetic tree of 10,000 samples. Comparison to the existing implementations on simulated data shows that it is not only much faster but also more accurate. An application of our tool to the 2014 Ebola epidemic in Sierra-Leone is also convincing, with very fast calculation and precise estimates. As MTBD models are closely related to Cladogenetic State Speciation and Extinction (ClaSSE)-like models, our findings could also be easily transferred to the macroevolution domain.
引用
收藏
页码:1387 / 1402
页数:16
相关论文
共 2 条
  • [1] Detection of HIV transmission clusters from phylogenetic trees using a multi-state birth - death model
    Barido-Sottani, Joelle
    Vaughan, Timothy G.
    Stadler, Tanja
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2018, 15 (146)
  • [2] NestedBD: Bayesian inference of phylogenetic trees from single-cell copy number profiles under a birth-death model
    Liu, Yushu
    Edrisi, Mohammadamin
    Yan, Zhi
    Ogilvie, Huw A.
    Nakhleh, Luay
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2024, 19 (01)