Context dependence, ancestral misidentification, and spurious signatures of natural selection

被引:112
作者
Hernandez, Ryan D. [1 ]
Williamson, Scott H. [1 ]
Bustamante, Carlos D. [1 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
关键词
context-dependence; single nucleotide polymorphism; site-frequency spectrum; ancestral misidentification;
D O I
10.1093/molbev/msm108
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Population genetic analyses often use polymorphism data from one species, and orthologous genomic sequences from closely related outgroup species. These outgroup sequences are frequently used to identify ancestral alleles at segregating sites and to compare the patterns of polymorphism and divergence. Inherent in such studies is the assumption of parsimony, which posits that the ancestral state of each single nucleotide polymorphism (SNP) is the allele that matches the orthologous site in the outgroup sequence, and that all nucleotide substitutions between species have been observed. This study tests the effect of violating the parsimony assumption when mutation rates vary across sites and over time. Using a context-dependent mutation model that accounts for elevated mutation rates at CpG dinucleotides, increased propensity for transitional versus transversional mutations, as well as other directional and contextual mutation biases estimated along the human lineage, we show (using both simulations and a theoretical model) that enough unobserved substitutions could have occurred since the divergence of human and chimpanzee to cause many statistical tests to spuriously reject neutrality. Moreover, using both the chimpanzee and rhesus macaque genomes to parsimoniously identify ancestral states causes a large fraction of the data to be removed while not completely alleviating problem. By constructing a novel model of the context-dependent mutation process, we can correct polymorphism data for the effect of ancestral misidentification using a single outgroup.
引用
收藏
页码:1792 / 1800
页数:9
相关论文
共 45 条
  • [1] Akashi H, 1997, GENETICS, V146, P295
  • [2] Akashi H, 1999, GENETICS, V151, P221
  • [3] Adaptive evolution of non-coding DNA in Drosophila
    Andolfatto, P
    [J]. NATURE, 2005, 437 (7062) : 1149 - 1152
  • [4] [Anonymous], 1998, COMMUN STAT STOCH MO, DOI [10.1080/15326349808807471, DOI 10.1080/15326349808807471]
  • [5] Baudry E, 2003, GENETICS, V165, P1619
  • [6] THE INFLUENCE OF NEAREST NEIGHBORS ON THE RATE AND PATTERN OF SPONTANEOUS POINT MUTATIONS
    BLAKE, RD
    HESS, ST
    NICHOLSONTUELL, J
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1992, 34 (03) : 189 - 200
  • [7] Natural selection on protein-coding genes in the human genome
    Bustamante, CD
    Fledel-Alon, A
    Williamson, S
    Nielsen, R
    Hubisz, MT
    Glanowski, S
    Tanenbaum, DM
    White, TJ
    Sninsky, JJ
    Hernandez, RD
    Civello, D
    Adams, MD
    Cargill, M
    Clark, AG
    [J]. NATURE, 2005, 437 (7062) : 1153 - 1157
  • [8] Bustamante CD, 2001, GENETICS, V159, P1779
  • [9] Cutler DJ, 2000, GENETICS, V154, P1403
  • [10] Ewens W.J., 2004, MATH POPULATION GENE, DOI DOI 10.1007/978-0-387-21822-9