Inference of Functional Divergence Among Proteins When the Evolutionary Process is Non-stationary

Cited by: 0
Authors
Rachael A. Bay
Joseph P. Bielawski
Affiliations
[1] Dalhousie University, Department of Biology
[2] Dalhousie University, Department of Mathematics and Statistics
[3] Stanford University, Department of Biology, Hopkins Marine Station
Source
Journal of Molecular Evolution | 2013 / Vol. 76
Keywords
Power; Accuracy; Functional divergence; Positive selection; Non-stationary evolution
DOI
Not available
Abstract
Functional shifts during protein evolution are expected to yield shifts in substitution rate, and statistical methods can test for this at both codon and amino acid levels. Although methods based on models of sequence evolution serve as powerful tools for studying evolutionary processes, violating underlying assumptions can lead to false biological conclusions. It is not unusual for functional shifts to be accompanied by changes in other aspects of the evolutionary process, such as codon or amino acid frequencies. However, models used to test for functional divergence assume these frequencies remain constant over time. We employed simulation to investigate the impact of non-stationary evolution on functional divergence inference. We investigated three likelihood ratio tests based on codon models and found varying degrees of sensitivity. Joint effects of shifts in frequencies and selection pressures can be large, leading to false signals for positive selection. Amino acid-based tests (FunDi and Bivar) were also compromised when several aspects of the substitution process were not adequately modeled. We applied the same tests to a core genome “scan” for functional divergence between light-adapted ecotypes of the cyanobacterium Prochlorococcus, and carried out gene-specific simulations for ten genes. Results of those simulations illustrated how the inference of functional divergence at the genomic level can be seriously impacted by model misspecification. Although computationally costly, simulations motivated by data in hand are warranted when several aspects of the substitution process are either misspecified or not included in the models upon which the statistical tests were built.
Pages: 205-215
Number of pages: 10