Human ancestry correlates with language and reveals that race is not an objective genomic classifier

被引:46
作者
Baker, Jennifer L. [1 ]
Rotimi, Charles N. [1 ]
Shriner, Daniel [1 ]
机构
[1] NHGRI, Ctr Res Genom & Global Hlth, Bldg 12A,Room 4047,12 South Dr, Bethesda, MD 20892 USA
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
美国国家卫生研究院;
关键词
POPULATION GENETIC-STRUCTURE; WIDE PATTERNS; DIVERSITY; ADMIXTURE; STRATIFICATION; EIGENANALYSIS; COMPONENTS; DISPERSAL; INFERENCE; AFRICANS;
D O I
10.1038/s41598-017-01837-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic and archaeological studies have established a sub-Saharan African origin for anatomically modern humans with subsequent migrations out of Africa. Using the largest multi-locus data set known to date, we investigated genetic differentiation of early modern humans, human admixture and migration events, and relationships among ancestries and language groups. We compiled publicly available genome-wide genotype data on 5,966 individuals from 282 global samples, representing 30 primary language families. The best evidence supports 21 ancestries that delineate genetic structure of present-day human populations. Independent of self-identified ethno-linguistic labels, the vast majority (97.3%) of individuals have mixed ancestry, with evidence of multiple ancestries in 96.8% of samples and on all continents. The data indicate that continents, ethno-linguistic groups, races, ethnicities, and individuals all show substantial ancestral heterogeneity. We estimated correlation coefficients ranging from 0.522 to 0.962 between ancestries and language families or branches. Ancestry data support the grouping of Kwadi-Khoe, Kx'a, and Tuu languages, support the exclusion of Omotic languages from the Afroasiatic language family, and do not support the proposed Dene-Yeniseian language family as a genetically valid grouping. Ancestry data yield insight into a deeper past than linguistic data can, while linguistic data provide clarity to ancestry data.
引用
收藏
页数:10
相关论文
共 74 条
  • [1] Fast model-based estimation of ancestry in unrelated individuals
    Alexander, David H.
    Novembre, John
    Lange, Kenneth
    [J]. GENOME RESEARCH, 2009, 19 (09) : 1655 - 1664
  • [2] A global reference for human genetic variation
    Altshuler, David M.
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Donnelly, Peter
    Eichler, Evan E.
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Green, Eric D.
    Hurles, Matthew E.
    Knoppers, Bartha M.
    Korbel, Jan O.
    Lander, Eric S.
    Lee, Charles
    Lehrach, Hans
    Mardis, Elaine R.
    Marth, Gabor T.
    McVean, Gil A.
    Nickerson, Deborah A.
    Wang, Jun
    Wilson, Richard K.
    Boerwinkle, Eric
    Doddapaneni, Harsha
    Han, Yi
    Korchina, Viktoriya
    Kovar, Christie
    Lee, Sandra
    Muzny, Donna
    Reid, Jeffrey G.
    Zhu, Yiming
    Chang, Yuqi
    Feng, Qiang
    Fang, Xiaodong
    Guo, Xiaosen
    Jian, Min
    Jiang, Hui
    Jin, Xin
    Lan, Tianming
    Li, Guoqing
    Li, Jingxiang
    Li, Yingrui
    Liu, Shengmao
    Liu, Xiao
    Lu, Yao
    Ma, Xuedi
    Tang, Meifang
    Wang, Bo
    [J]. NATURE, 2015, 526 (7571) : 68 - +
  • [3] Integrating common and rare genetic variation in diverse human populations
    Altshuler, David M.
    Gibbs, Richard A.
    Peltonen, Leena
    Dermitzakis, Emmanouil
    Schaffner, Stephen F.
    Yu, Fuli
    Bonnen, Penelope E.
    de Bakker, Paul I. W.
    Deloukas, Panos
    Gabriel, Stacey B.
    Gwilliam, Rhian
    Hunt, Sarah
    Inouye, Michael
    Jia, Xiaoming
    Palotie, Aarno
    Parkin, Melissa
    Whittaker, Pamela
    Chang, Kyle
    Hawes, Alicia
    Lewis, Lora R.
    Ren, Yanru
    Wheeler, David
    Muzny, Donna Marie
    Barnes, Chris
    Darvishi, Katayoon
    Hurles, Matthew
    Korn, Joshua M.
    Kristiansson, Kati
    Lee, Charles
    McCarroll, Steven A.
    Nemesh, James
    Keinan, Alon
    Montgomery, Stephen B.
    Pollack, Samuela
    Price, Alkes L.
    Soranzo, Nicole
    Gonzaga-Jauregui, Claudia
    Anttila, Verneri
    Brodeur, Wendy
    Daly, Mark J.
    Leslie, Stephen
    McVean, Gil
    Moutsianas, Loukas
    Nguyen, Huy
    Zhang, Qingrun
    Ghori, Mohammed J. R.
    McGinnis, Ralph
    McLaren, William
    Takeuchi, Fumihiko
    Grossman, Sharon R.
    [J]. NATURE, 2010, 467 (7311) : 52 - 58
  • [4] [Anonymous], 2009, Ethnologue: languages of the world
  • [5] The genome-wide structure of the Jewish people
    Behar, Doron M.
    Yunusbayev, Bayazit
    Metspalu, Mait
    Metspalu, Ene
    Rosset, Saharon
    Parik, Jueri
    Rootsi, Siiri
    Chaubey, Gyaneshwer
    Kutuev, Ildus
    Yudkovsky, Guennady
    Khusnutdinova, Elza K.
    Balanovsky, Oleg
    Semino, Ornella
    Pereira, Luisa
    Comas, David
    Gurwitz, David
    Bonne-Tamir, Batsheva
    Parfitt, Tudor
    Hammer, Michael F.
    Skorecki, Karl
    Villems, Richard
    [J]. NATURE, 2010, 466 (7303) : 238 - U112
  • [6] Bergsland K., 1959, J SOC FINNOOUGRIENNE, V61, P1
  • [7] Gene flow from North Africa contributes to differential human genetic diversity in southern Europe
    Botigue, Laura R.
    Henn, Brenna M.
    Gravel, Simon
    Maples, Brian K.
    Gignoux, Christopher R.
    Corona, Erik
    Atzmon, Gil
    Burns, Edward
    Ostrer, Harry
    Flores, Carlos
    Bertranpetit, Jaume
    Comas, David
    Bustamante, Carlos D.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (29) : 11791 - 11796
  • [8] Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete subpopulations
    Bryc, Katarzyna
    Bryc, Wlodek
    Silverstein, Jack W.
    [J]. THEORETICAL POPULATION BIOLOGY, 2013, 89 : 34 - 43
  • [9] Genome-wide patterns of population structure and admixture in West Africans and African Americans
    Bryc, Katarzyna
    Auton, Adam
    Nelson, Matthew R.
    Oksenberg, Jorge R.
    Hauser, Stephen L.
    Williams, Scott
    Froment, Alain
    Bodo, Jean-Marie
    Wambebe, Charles
    Tishkoff, Sarah A.
    Bustamante, Carlos D.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (02) : 786 - 791
  • [10] COEVOLUTION OF GENES AND LANGUAGES REVISITED
    CAVALLISFORZA, LL
    MINCH, E
    MOUNTAIN, JL
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (12) : 5620 - 5624