A Random Effects Branch-Site Model for Detecting Episodic Diversifying Selection

被引:323
作者
Pond, Sergei L. Kosakovsky [1 ]
Murrell, Ben [2 ,3 ]
Fourment, Mathieu [4 ]
Frost, Simon D. W. [5 ]
Delport, Wayne [4 ]
Scheffler, Konrad [2 ]
机构
[1] Univ Calif San Diego, Dept Med, San Diego, CA 92103 USA
[2] Univ Stellenbosch, Div Comp Sci, Dept Math Sci, ZA-7600 Stellenbosch, South Africa
[3] MRC, Biomed Informat Res Div, Tygerberg, South Africa
[4] Univ Calif San Diego, Dept Pathol, San Diego, CA 92103 USA
[5] Univ Cambridge, Dept Vet Med, Cambridge, England
基金
美国国家卫生研究院;
关键词
episodic selection; random effects model; evolutionary model; branch-site model; AMINO-ACID SITES; CODON-SUBSTITUTION MODELS; LIKELIHOOD RATIO TESTS; POSITIVE SELECTION; ADAPTIVE EVOLUTION; NUCLEOTIDE SUBSTITUTION; DNA-SEQUENCES; T-CELL; RATES;
D O I
10.1093/molbev/msr125
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Adaptive evolution frequently occurs in episodic bursts, localized to a few sites in a gene, and to a small number of lineages in a phylogenetic tree. A popular class of "branch-site" evolutionary models provides a statistical framework to search for evidence of such episodic selection. For computational tractability, current branch-site models unrealistically assume that all branches in the tree can be partitioned a priori into two rigid classes-"foreground" branches that are allowed to undergo diversifying selective bursts and "background" branches that are negatively selected or neutral. We demonstrate that this assumption leads to unacceptably high rates of false positives or false negatives when the evolutionary process along background branches strongly deviates from modeling assumptions. To address this problem, we extend Felsenstein's pruning algorithm to allow efficient likelihood computations for models in which variation over branches (and not just sites) is described in the random effects likelihood framework. This enables us to model the process at every branch-site combination as a mixture of three Markov substitution models-our model treats the selective class of every branch at a particular site as an unobserved state that is chosen independently of that at any other branch. When benchmarked on a previously published set of simulated sequences, our method consistently matched or outperformed existing branch-site tests in terms of power and error rates. Using three empirical data sets, previously analyzed for episodic selection, we discuss how modeling assumptions can influence inference in practical situations.
引用
收藏
页码:3033 / 3043
页数:11
相关论文
共 39 条
  • [1] Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites
    Anisimova, Maria
    Yang, Ziheng
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (05) : 1219 - 1228
  • [2] Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models
    Anisimova, Maria
    Kosiol, Carolin
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2009, 26 (02) : 255 - 271
  • [3] CodonTest: Modeling Amino Acid Substitution Preferences in Coding Sequences
    Delport, Wayne
    Scheffler, Konrad
    Botha, Gordon
    Gravenor, Mike B.
    Muse, Spencer V.
    Pond, Sergei L. Kosakovsky
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (08)
  • [4] Frequent Toggling between Alternative Amino Acids Is Driven by Selection in HIV-1
    Delport, Wayne
    Scheffler, Konrad
    Seoighe, Cathal
    [J]. PLOS PATHOGENS, 2008, 4 (12)
  • [5] Models of coding sequence evolution
    Delport, Wayne
    Scheffler, Konrad
    Seoighe, Cathal
    [J]. BRIEFINGS IN BIOINFORMATICS, 2009, 10 (01) : 97 - 109
  • [6] A hidden Markov Model approach to variation among sites in rate of evolution
    Felsenstein, J
    Churchill, GA
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (01) : 93 - 104
  • [7] EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH
    FELSENSTEIN, J
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) : 368 - 376
  • [8] GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
  • [9] Modeling the site-specific variation of selection patterns along lineages
    Guindon, S
    Rodrigo, AG
    Dyer, KA
    Huelsenbeck, JP
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (35) : 12957 - 12962
  • [10] DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA
    HASEGAWA, M
    KISHINO, H
    YANO, TA
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) : 160 - 174