Dating genomic variants and shared ancestry in population-scale sequencing data

被引:102
作者
Albers, Patrick K. [1 ]
McVean, Gil [1 ]
机构
[1] Univ Oxford, Li Ka Shing Ctr Hlth Informat & Discovery, Big Data Inst, Oxford, England
基金
英国惠康基金;
关键词
CONDITIONAL SAMPLING DISTRIBUTION; RECENT POSITIVE SELECTION; LACTASE-PERSISTENCE; HISTORY; GENE; COALESCENT; EVOLUTION; DISEASE; AFRICA; AGE;
D O I
10.1371/journal.pbio.3000586
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The origin and fate of new mutations within species is the fundamental process underlying evolution. However, while much attention has been focused on characterizing the presence, frequency, and phenotypic impact of genetic variation, the evolutionary histories of most variants are largely unexplored. We have developed a nonparametric approach for estimating the date of origin of genetic variants in large-scale sequencing data sets. The accuracy and robustness of the approach is demonstrated through simulation. Using data from two publicly available human genomic diversity resources, we estimated the age of more than 45 million single-nucleotide polymorphisms (SNPs) in the human genome and release the Atlas of Variant Age as a public online database. We characterize the relationship between variant age and frequency in different geographical regions and demonstrate the value of age information in interpreting variants of functional and selective importance. Finally, we use allele age estimates to power a rapid approach for inferring the ancestry shared between individual genomes and to quantify genealogical relationships at different points in the past, as well as to describe and explore the evolutionary history of modern human populations.
引用
收藏
页数:26
相关论文
共 81 条
[1]   New insights into the generation and role of de novo mutations in health and disease [J].
Acuna-Hidalgo, Rocio ;
Veltman, Joris A. ;
Hoischen, Alexander .
GENOME BIOLOGY, 2016, 17
[2]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[3]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[4]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[5]  
[Anonymous], 2003, Jt Comm Perspect, V23, P8
[6]   A Fine-Scale Chimpanzee Genetic Map from Population Sequencing [J].
Auton, Adam ;
Fledel-Alon, Adi ;
Pfeifer, Susanne ;
Venn, Oliver ;
Segurel, Laure ;
Street, Teresa ;
Leffler, Ellen M. ;
Bowden, Rory ;
Aneas, Ivy ;
Broxholme, John ;
Humburg, Peter ;
Iqbal, Zamin ;
Lunter, Gerton ;
Maller, Julian ;
Hernandez, Ryan D. ;
Melton, Cord ;
Venkat, Aarti ;
Nobrega, Marcelo A. ;
Bontrop, Ronald ;
Myers, Simon ;
Donnelly, Peter ;
Przeworski, Molly ;
McVean, Gil .
SCIENCE, 2012, 336 (6078) :193-198
[7]   Evaluating the quality of the 1000 genomes project data [J].
Belsare, Saurabh ;
Levy-Sakin, Michal ;
Mostovoy, Yulia ;
Durinck, Steffen ;
Chaudhuri, Subhra ;
Xiao, Ming ;
Peterson, Andrew S. ;
Kwok, Pui-Yan ;
Seshagiri, Somasekar ;
Wall, Jeffrey D. .
BMC GENOMICS, 2019, 20 (01)
[8]   A Population Genetic Signal of Polygenic Adaptation [J].
Berg, Jeremy J. ;
Coop, Graham .
PLOS GENETICS, 2014, 10 (08)
[9]   Genetic signatures of strong recent positive selection at the lactase gene [J].
Bersaglieri, T ;
Sabeti, PC ;
Patterson, N ;
Vanderploeg, T ;
Schaffner, SF ;
Drake, JA ;
Rhodes, M ;
Reich, DE ;
Hirschhorn, JN .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 74 (06) :1111-1120
[10]   Absence of the lactase-persistence-associated allele in early Neolithic Europeans [J].
Burger, J. ;
Kirchner, M. ;
Bramanti, B. ;
Haak, W. ;
Thomas, M. G. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (10) :3736-3741