Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples

被引:121
作者
Kim, Bernard Y. [1 ]
Huber, Christian D. [1 ]
Lohmueller, Kirk E. [1 ,2 ,3 ]
机构
[1] Univ Calif Los Angeles, David Geffen Sch Med, Dept Ecol & Evolutionary Biol, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, David Geffen Sch Med, Interdept Program Bioinformat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院;
关键词
deleterious mutations; diffusion theory; population genetics; site frequency spectrum; DELETERIOUS GENETIC-VARIATION; FISHERS GEOMETRICAL MODEL; POPULATION-GROWTH; BACKGROUND SELECTION; SUBSTITUTION RATE; FITNESS; EVOLUTION; ARCHITECTURE; RECOMBINATION; INTERFERENCE;
D O I
10.1534/genetics.116.197145
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The distribution of fitness effects (DFE) has considerable importance in population genetics. To date, estimates of the DFE come from studies using a small number of individuals. Thus, estimates of the proportion of moderately to strongly deleterious new mutations may be unreliable because such variants are unlikely to be segregating in the data. Additionally, the true functional form of the DFE is unknown, and estimates of the DFE differ significantly between studies. Here we present a flexible and computationally tractable method, called Fit partial derivative a partial derivative i, to estimate the DFE of new mutations using the site frequency spectrum from a large number of individuals. We apply our approach to the frequency spectrum of 1300 Europeans from the Exome Sequencing Project ESP6400 data set, 1298 Danes from the LuCamp data set, and 432 Europeans from the 1000 Genomes Project to estimate the DFE of deleterious nonsynonymous mutations. We infer significantly fewer (0.38-0.84 fold) strongly deleterious mutations with selection coefficient |s| > 0.01 and more (1.24-1.43 fold) weakly deleterious mutations with selection coefficient |s| < 0.001 compared to previous estimates. Furthermore, a DFE that is a mixture distribution of a point mass at neutrality plus a gamma distribution fits better than a gamma distribution in two of the three data sets. Our results suggest that nearly neutral forces play a larger role in human evolution than previously thought.
引用
收藏
页码:345 / 361
页数:17
相关论文
共 77 条
[11]   Adaptation, Plasticity, and Extinction in a Changing Environment: Towards a Predictive Theory [J].
Chevin, Luis-Miguel ;
Lande, Russell ;
Mace, Georgina M. .
PLOS BIOLOGY, 2010, 8 (04)
[12]   Y genetic data support the Neolithic demic diffusion model [J].
Chikhi, L ;
Nichols, RA ;
Barbujani, G ;
Beaumont, MA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (17) :11008-11013
[13]   Background Selection as Baseline for Nucleotide Variation across the Drosophila Genome [J].
Comeron, Josep M. .
PLOS GENETICS, 2014, 10 (06)
[14]   No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans [J].
Do, Ron ;
Balick, Daniel ;
Li, Heng ;
Adzhubei, Ivan ;
Sunyaev, Shamil ;
Reich, David .
NATURE GENETICS, 2015, 47 (02) :126-131
[15]   The consequences of not accounting for background selection in demographic inference [J].
Ewing, Gregory B. ;
Jensen, Jeffrey D. .
MOLECULAR ECOLOGY, 2016, 25 (01) :135-141
[16]   The distribution of fitness effects of new mutations [J].
Eyre-Walker, Adam ;
Keightley, Peter D. .
NATURE REVIEWS GENETICS, 2007, 8 (08) :610-618
[17]   The distribution of fitness effects of new deleterious amino acid mutations in humans [J].
Eyre-Walker, Adam ;
Woolfit, Megan ;
Phelps, Ted .
GENETICS, 2006, 173 (02) :891-900
[18]   Genetic architecture of a complex trait and its implications for fitness and genome-wide association studies [J].
Eyre-Walker, Adam .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 :1752-1756
[19]   High-resolution mapping of protein sequence-function relationships [J].
Fowler, Douglas M. ;
Araya, Carlos L. ;
Fleishman, Sarel J. ;
Kellogg, Elizabeth H. ;
Stephany, Jason J. ;
Baker, David ;
Fields, Stanley .
NATURE METHODS, 2010, 7 (09) :741-U108
[20]   Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants [J].
Fu, Wenqing ;
O'Connor, Timothy D. ;
Jun, Goo ;
Kang, Hyun Min ;
Abecasis, Goncalo ;
Leal, Suzanne M. ;
Gabriel, Stacey ;
Altshuler, David ;
Shendure, Jay ;
Nickerson, Deborah A. ;
Bamshad, Michael J. ;
Akey, Joshua M. .
NATURE, 2013, 493 (7431) :216-220