Quantifying and Mitigating the Effect of Preferential Sampling on Phylodynamic Inference

被引:34
|
作者
Karcher, Michael D. [1 ]
Palacios, Julia A. [2 ,3 ,4 ]
Bedford, Trevor [5 ]
Suchard, Marc A. [6 ,7 ,8 ]
Minin, Vladimir N. [1 ,9 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Harvard Univ, Dept Organism & Evolutionary Biol, Cambridge, MA 02138 USA
[3] Brown Univ, Dept Ecol & Evolutionary Biol, Providence, RI 02912 USA
[4] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[5] Fred Hutchinson Canc Res Ctr, Vaccine & Infect Dis Div, 1124 Columbia St, Seattle, WA 98104 USA
[6] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
[7] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, Los Angeles, CA 90095 USA
[8] Univ Calif Los Angeles, Fielding Sch Publ Hlth, Dept Biostat, Los Angeles, CA USA
[9] Univ Washington, Dept Biol, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
POPULATION-DYNAMICS; HISTORY; SKYLINE; GROWTH;
D O I
10.1371/journal.pcbi.1004789
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phylodynamics seeks to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. One way to accomplish this task formulates an observed sequence data likelihood exploiting a coalescent model for the sampled individuals' genealogy and then integrating over all possible genealogies via Monte Carlo or, less efficiently, by conditioning on one genealogy estimated from the sequence data. However, when analyzing sequences sampled serially through time, current methods implicitly assume either that sampling times are fixed deterministically by the data collection protocol or that their distribution does not depend on the size of the population. Through simulation, we first show that, when sampling times do probabilistically depend on effective population size, estimation methods may be systematically biased. To correct for this deficiency, we propose a new model that explicitly accounts for preferential sampling by modeling the sampling times as an inhomogeneous Poisson process dependent on effective population size. We demonstrate that in the presence of preferential sampling our new model not only reduces bias, but also improves estimation precision. Finally, we compare the performance of the currently used phylodynamic methods with our proposed model through clinically-relevant, seasonal human influenza examples.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Model design for nonparametric phylodynamic inference and applications to pathogen surveillance
    Didelot, Xavier
    Franceschi, Vinicius
    Frost, Simon D. W.
    Dennis, Ann
    Volz, Erik M.
    VIRUS EVOLUTION, 2023, 9 (01)
  • [22] Variational Phylodynamic Inference Using Pandemic-scale Data
    Ki, Caleb
    Terhorst, Jonathan
    MOLECULAR BIOLOGY AND EVOLUTION, 2022, 39 (08)
  • [23] Field validation of phylodynamic analytical methods for inference on epidemiological processes in wildlife
    Pacioni, Carlo
    Vaughan, Timothy G.
    Strive, Tanja
    Campbell, Susan
    Ramsey, David S. L.
    TRANSBOUNDARY AND EMERGING DISEASES, 2022, 69 (03) : 1020 - 1029
  • [24] Epidemiological inference from pathogen genomes: A review of phylodynamic models and applications
    Featherstone, Leo A.
    Zhang, Joshua M.
    Vaughan, Timothy G.
    Duchene, Sebastian
    VIRUS EVOLUTION, 2022, 8 (01)
  • [25] The Impacts of Low Diversity Sequence Data on Phylodynamic Inference during an Emerging Epidemic
    Lam, Anthony
    Duchene, Sebastian
    VIRUSES-BASEL, 2021, 13 (01):
  • [26] Identifying, quantifying, and mitigating extraneous contact effect in dynamic characterization of flexible devices
    Wang, Luming
    Chen, Jianglong
    Wu, Song
    Xiao, Fei
    Xu, Bo
    Deng, Qingyang
    Liang, Yachun
    Zhang, Zejuan
    Zhang, Pengcheng
    Yang, Rui
    Xia, Juan
    Wang, Zenghui
    APPLIED PHYSICS REVIEWS, 2023, 10 (03)
  • [27] SUT: Quantifying and mitigating URL typosquatting
    Banerjee, Anirban
    Rahman, Md Sazzadur
    Faloutsos, Michalis
    COMPUTER NETWORKS, 2011, 55 (13) : 3001 - 3014
  • [28] Quantifying and Mitigating the Costs of FPGA Virtualization
    Yazdanshenas, Sadegh
    Betz, Vaughn
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [29] Methods for preferential sampling in geostatistics
    Dinsdale, Daniel
    Salibian-Barrera, Matias
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2019, 68 (01) : 181 - 198
  • [30] Phylodynamic Inference and Model Assessment with Approximate Bayesian Computation: Influenza as a Case Study
    Ratmann, Oliver
    Donker, Ge
    Meijer, Adam
    Fraser, Christophe
    Koelle, Katia
    PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (12)