Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics

被引:17
|
作者
Fourment, Mathieu [1 ]
Darling, Aaron E. [1 ]
机构
[1] Univ Technol Sydney, Ithree Inst, Sydney, NSW, Australia
来源
PEERJ | 2019年 / 7卷
关键词
Variational Bayes; Stan; Phylogenetics; molecular clock; Bayesian inference; MODEL; PROPOSALS; SEQUENCES;
D O I
10.7717/peerj.8272
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advances in statistical machine learning techniques have led to the creation of probabilistic programming frameworks. These frameworks enable probabilistic models to be rapidly prototyped and fit to data using scalable approximation methods such as variational inference. In this work, we explore the use of the Stan language for probabilistic programming in application to phylogenetic models. We show that many commonly used phylogenetic models including the general time reversible substitution model, rate heterogeneity among sites, and a range of coalescent models can be implemented using a probabilistic programming language. The posterior probability distributions obtained via the black box variational inference engine in Stan were compared to those obtained with reference implementations of Markov chain Monte Carlo (MCMC) for phylogenetic inference. We find that black box variational inference in Stan is less accurate than MCMC methods for phylogenetic models, but requires far less compute time. Finally, we evaluate a custom implementation of mean-field variational inference on the Jukes-Cantor substitution model and show that a specialized implementation of variational inference can be two orders of magnitude faster and more accurate than a general purpose probabilistic implementation.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Probabilistic Programming with Programmable Variational Inference
    Becker, Mccoy R.
    Lew, Alexander K.
    Wang, Xiaoyan
    Ghavami, Matin
    Huot, Mathieu
    Rinard, Martin C.
    Mansinghka, Vikash K.
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2024, 8 (PLDI):
  • [2] Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model
    Dang, Tung
    Kishino, Hirohisa
    MOLECULAR BIOLOGY AND EVOLUTION, 2019, 36 (04) : 825 - 833
  • [3] Variational Bayesian inference for the probabilistic model of power load
    Dong, Zijian
    Wang, Yunpeng
    Zhao, Jing
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2014, 8 (11) : 1860 - 1868
  • [4] Stan: A Probabilistic Programming Language for Bayesian Inference and Optimization
    Gelman, Andrew
    Lee, Daniel
    Guo, Jiqiang
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2015, 40 (05) : 530 - 543
  • [5] Variational Supertrees for Bayesian Phylogenetics
    Karcher, Michael D.
    Zhang, Cheng
    Matsen IV, Frederic A.
    BULLETIN OF MATHEMATICAL BIOLOGY, 2024, 86 (09)
  • [6] Probabilistic deconvolution for electrochemical impedance through variational Bayesian inference
    Boskoski, Pavle
    Znidaric, Luka
    Gradisar, Ziga
    Subotic, Vanja
    JOURNAL OF POWER SOURCES, 2024, 622
  • [7] Sampling for Inference in Probabilistic Models with Fast Bayesian Quadrature
    Gunter, Tom
    Osborne, Michael A.
    Garnett, Roman
    Hennig, Philipp
    Roberts, Stephen J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [8] SPPL: Probabilistic Programming with Fast Exact Symbolic Inference
    Saad, Feras A.
    Rinard, Martin C.
    Mansinghka, Vikash K.
    PROCEEDINGS OF THE 42ND ACM SIGPLAN INTERNATIONAL CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '21), 2021, : 804 - 819
  • [9] Probabilistic robotic logic programming with hybrid Boolean and Bayesian inference
    Post, Mark A.
    ROBOTICA, 2024, 42 (01) : 40 - 71
  • [10] PROBABILISTIC FILTER AND SMOOTHER FOR VARIATIONAL INFERENCE OF BAYESIAN LINEAR DYNAMICAL SYSTEMS
    Neri, Julian
    Badeau, Roland
    Depalle, Philippe
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5885 - 5889