Uniformization for sampling realizations of Markov processes: applications to Bayesian implementations of codon substitution models

被引:34
作者
Rodrigue, Nicolas [1 ]
Philippe, Herve [1 ]
Lartillot, Nicolas [2 ]
机构
[1] Univ Montreal, Dept Biochim, Canadian Inst Adv Res, Montreal, PQ H3C 3J7, Canada
[2] Univ Montpellier 2, CNRS, URM 5506, Lab Informat Robot & Microelect Montpellier, Montpellier, France
关键词
D O I
10.1093/bioinformatics/btm532
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Mapping character state changes over phylogenetic trees is central to the study of evolution. However, current probabilistic methods for generating such mappings are ill-suited to certain types of evolutionary models, in particular, the widely used models of codon substitution. Results: We describe a general method, based on a uniformization technique, which can be utilized to generate realizations of a Markovian substitution process conditional on an alignment of character states and a given tree topology. The method is applicable under a wide range of evolutionary models, and to illustrate its usefulness in practice, we embed it within a data augmentation-based Markov chain Monte Carlo sampler, for approximating posterior distributions under previously proposed codon substitution models. The sampler is found to be more efficient than the conventional pruning-based sampler with the decorrelation times between draws from the posterior reduced by a factor of 20 or more.
引用
收藏
页码:56 / 62
页数:7
相关论文
共 33 条
[1]   SIMMAP: Stochastic character mapping of discrete traits on phylogenies [J].
Bollback, JP .
BMC BIOINFORMATICS, 2006, 7 (1)
[2]   Detecting coevolving amino acid sites using Bayesian mutational mapping [J].
Dimmic, MW ;
Hubisz, MJ ;
Bustamante, CD ;
Nielsen, R .
BIOINFORMATICS, 2005, 21 :I126-I135
[3]   A combined empirical and mechanistic codon model [J].
Doron-Faigenboim, Adi ;
Pupko, Tal .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (02) :388-397
[4]   An exact Gibbs sampler for the Markov-modulated Poisson process [J].
Fearnhead, Paul ;
Sherlock, Chris .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2006, 68 :767-784
[5]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[6]   Parameterization and Bayesian modeling [J].
Gelman, A .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (466) :537-545
[7]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[8]   THE RANDOMIZATION TECHNIQUE AS A MODELING TOOL AND SOLUTION PROCEDURE FOR TRANSIENT MARKOV-PROCESSES [J].
GROSS, D ;
MILLER, DR .
OPERATIONS RESEARCH, 1984, 32 (02) :343-361
[9]   Modeling the site-specific variation of selection patterns along lineages [J].
Guindon, S ;
Rodrigo, AG ;
Dyer, KA ;
Huelsenbeck, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (35) :12957-12962
[10]  
HOBOLTH A, 2008, IN PRESS J COMPUT GR