Multinucleotide mutations cause false inferences of lineage-specific positive selection

被引:91
作者
Venkat, Aarti [1 ]
Hahn, Matthew W. [2 ,3 ]
Thornton, Joseph W. [1 ,4 ]
机构
[1] Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
[2] Indiana Univ, Dept Biol, Bloomington, IN USA
[3] Indiana Univ, Dept Comp Sci, Bloomington, IN 47405 USA
[4] Univ Chicago, Dept Ecol & Evolut, 940 E 57Th St, Chicago, IL 60637 USA
关键词
NUCLEOTIDE SUBSTITUTION MUTATIONS; EPISODIC DIVERSIFYING SELECTION; FIDELITY DNA-SYNTHESIS; BRANCH-SITE TEST; ADAPTIVE EVOLUTION; LIKELIHOOD METHOD; MODEL; GENOMES; DROSOPHILA; HUMANS;
D O I
10.1038/s41559-018-0584-5
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Phylogenetic tests of adaptive evolution, such as the widely used branch-site test (BST), assume that nucleotide substitutions occur singly and independently. Recent research has shown that errors at adjacent sites often occur during DNA replication, and the resulting multinucleotide mutations (MNMs) are overwhelmingly likely to be non-synonymous. To evaluate whether the BST misinterprets sequence patterns produced by MNMs as false support for positive selection, we analysed two genomescale datasetsone from mammals and one from flies. We found that codons with multiple differences account for virtually all the support for lineage-specific positive selection in the BST. Simulations under conditions derived from these alignments but without positive selection show that realistic rates of MNMs cause a strong and systematic bias towards false inferences of selection. This bias is sufficient under empirically derived conditions to produce false positive inferences as often as the BST infers positive selection from the empirical data. Although some genes with BST-positive results may have evolved adaptively, the test cannot distinguish sequence patterns produced by authentic positive selection from those caused by neutral fixation of MNMs. Many published inferences of adaptive evolution using this technique may therefore be artefacts of model violation caused by unincorporated neutral mutational processes. We introduce a model that incorporates MNMs and may help to ameliorate this bias.
引用
收藏
页码:1280 / 1288
页数:9
相关论文
共 60 条
[21]   The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC [J].
Gharib, Walid H. ;
Robinson-Rechavi, Marc .
MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (07) :1675-1686
[22]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[23]   Adaptive evolution of young gene duplicates in mammals [J].
Han, Mira V. ;
Demuth, Jeffery P. ;
McGrath, Casey L. ;
Casola, Claudio ;
Hahn, Matthew W. .
GENOME RESEARCH, 2009, 19 (05) :859-867
[24]   Error-prone polymerase activity causes multinucleotide mutations in humans [J].
Harris, Kelley ;
Nielsen, Rasmus .
GENOME RESEARCH, 2014, 24 (09) :1445-1454
[25]   Variation in the mutation rate across mammalian genomes [J].
Hodgkinson, Alan ;
Eyre-Walker, Adam .
NATURE REVIEWS GENETICS, 2011, 12 (11) :756-766
[26]   Patterns of Positive Selection in Six Mammalian Genomes [J].
Kosiol, Carolin ;
Vinar, Tomas ;
da Fonseca, Rute R. ;
Hubisz, Melissa J. ;
Bustamante, Carlos D. ;
Nielsen, Rasmus ;
Siepel, Adam .
PLOS GENETICS, 2008, 4 (08)
[27]   An empirical codon model for protein sequence evolution [J].
Kosiol, Carolin ;
Holmes, Ian ;
Goldman, Nick .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (07) :1464-1479
[28]   Evolution of protein-coding genes in Drosophila [J].
Larracuente, Amanda M. ;
Sackton, Timothy B. ;
Greenberg, Anthony J. ;
Wong, Alex ;
Singh, Nadia D. ;
Sturgill, David ;
Zhang, Yu ;
Oliver, Brian ;
Clark, Andrew G. .
TRENDS IN GENETICS, 2008, 24 (03) :114-123
[29]   DNA polymerases and human disease [J].
Loeb, Lawrence A. ;
Monnat, Raymond J., Jr. .
NATURE REVIEWS GENETICS, 2008, 9 (08) :594-604
[30]  
Lopez P, 2002, MOL BIOL EVOL, V19, P1