EFFICIENT BAYESIAN INFERENCE OF GENERAL GAUSSIAN MODELS ON LARGE PHYLOGENETIC TREES

被引:10
作者
Bastide, Paul [1 ]
Ho, Lam Si Tung [2 ]
Baele, Guy [3 ]
Lemey, Philippe [3 ]
Suchard, Marc A. [4 ,5 ,6 ]
机构
[1] Univ Montpellier, CNRS, IMAG, Montpellier, France
[2] Dalhousie Univ, Dept Math & Stat, Halifax, NS, Canada
[3] Katholieke Univ Leuven, Dept Microbiol Immunol & Transplantat, Rega Inst, Leuven, Belgium
[4] Calif State Univ Los Angeles, Dept Biostat, Los Angeles, CA 90032 USA
[5] Calif State Univ Los Angeles, Dept Biomath, Los Angeles, CA 90032 USA
[6] Calif State Univ Los Angeles, Dept Human Genet, Los Angeles, CA 90032 USA
基金
欧洲研究理事会; 英国惠康基金; 美国国家卫生研究院; 加拿大自然科学与工程研究理事会; 欧盟地平线“2020”;
关键词
Statistical phylogenetics; phylodynamics; Ornstein-Uhlenbeck process; Bayesian inference; Hamiltonian Monte Carlo; model selection; BEAST; heritability; HIV; Musteloidea; total evidence; TRAIT EVOLUTION; LIKELIHOOD-ESTIMATION; ADAPTIVE RADIATION; VIRAL LOAD; R PACKAGE; RATES; PATTERNS; FOSSILS; HERITABILITY; SPECIATION;
D O I
10.1214/20-AOAS1419
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Phylogenetic comparative methods correct for shared evolutionary history among a set of nonindependent organisms by modeling sample traits as arising from a diffusion process along the branches of a possibly unknown history. To incorporate such uncertainty, we present a scalable Bayesian inference framework under a general Gaussian trait evolution model that exploits Hamiltonian Monte Carlo (HMC). HMC enables efficient sampling of the constrained model parameters and takes advantage of the tree structure for fast likelihood and gradient computations, yielding algorithmic complexity linear in the number of observations. This approach encompasses a wide family of stochastic processes, including the general Ornstein-Uhlenbeck (OU) process, with possible missing data and measurement errors. We implement inference tools for a biologically relevant subset of all these models into the BEAST phylogenetic software package and develop model comparison through marginal likelihood estimation. We apply our approach to study the morphological evolution in the superfamily of Musteloidea (including weasels and allies) as well as the heritability of HIV virulence. This second problem furnishes a new measure of evolutionary heritability that demonstrates its utility through a targeted simulation study.
引用
收藏
页码:971 / 997
页数:27
相关论文
共 116 条
[1]  
Abeles Peter., 2016, Efficient Java Matrix Library
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]   The complex step approximation to the Frechet derivative of a matrix function [J].
Al-Mohy, Awad H. ;
Higham, Nicholas J. .
NUMERICAL ALGORITHMS, 2010, 53 (01) :133-148
[4]   Phylogenetic Approach Reveals That Virus Genotype Largely Determines HIV Set-Point Viral Load [J].
Alizon, Samuel ;
von Wyl, Viktor ;
Stadler, Tanja ;
Kouyos, Roger D. ;
Yerly, Sabine ;
Hirschel, Bernard ;
Boeni, Juerg ;
Shah, Cyril ;
Klimkait, Thomas ;
Furrer, Hansjakob ;
Rauch, Andri ;
Vernazza, Pietro L. ;
Bernasconi, Enos ;
Battegay, Manuel ;
Buergisser, Philippe ;
Telenti, Amalio ;
Guenthard, Huldrych F. ;
Bonhoeffer, Sebastian .
PLOS PATHOGENS, 2010, 6 (09)
[5]   Bayesian Estimation of Species Divergence Times Using Correlated Quantitative Characters [J].
Alvarez-Carretero, Sandra ;
Goswami, Anjali ;
Yang, Ziheng ;
Dos Reis, Mario .
SYSTEMATIC BIOLOGY, 2019, 68 (06) :967-986
[6]  
[Anonymous], 2004, Inferring phylogenies
[7]  
[Anonymous], 2006, Journal de la societe francaise de statistique
[8]   Understanding the effect of competition during evolutionary radiations: an integrated model of phenotypic and species diversification [J].
Aristide, Leandro ;
Morlon, Helene .
ECOLOGY LETTERS, 2019, 22 (12) :2006-2017
[9]   Brain shape convergence in the adaptive radiation of New World monkeys [J].
Aristide, Leandro ;
dos Reis, Sergio Furtado ;
Machado, Alessandra C. ;
Lima, Inaya ;
Lopes, Ricardo T. ;
Ivan Perez, S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (08) :2158-2163
[10]   Emerging Concepts of Data Integration in Pathogen Phylodynamics [J].
Baele, Guy ;
Suchard, Marc A. ;
Rambaut, Andrew ;
Lemey, Philippe .
SYSTEMATIC BIOLOGY, 2017, 66 (01) :E47-E65