Estimation of genetic admixture proportions via haplotypes

Authors
Ko, Seyoon [1, 2, 3]
Sobel, Eric M. [1, 4]
Zhou, Hua [1, 2]
Lange, Kenneth [1, 4, 5]
Affiliations
[1] Univ Calif Los Angeles, Dept Computat Med, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[5] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
Source
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2024, Vol. 23
Funding
National Science Foundation (USA);
Keywords
Admixture; Ancestry informative marker; Sparse clustering; OpenMendel; POPULATION-STRUCTURE; INFERENCE; ASSOCIATION; ANCESTRY; MODELS;
DOI
10.1016/j.csbj.2024.11.043
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Estimation of ancestral admixture is essential for creating personal genealogies, studying human history, and conducting genome-wide association studies (GWAS). Three primary methods exist for estimating admixture coefficients. The frequentist approach directly maximizes the binomial log-likelihood. The Bayesian approach adds a reasonable prior and samples the posterior distribution. Finally, the nonparametric approach decomposes the genotype matrix algebraically. Each approach scales successfully to datasets with a million individuals and a million single nucleotide polymorphisms (SNPs). Despite their variety, all current approaches assume independence between SNPs. Achieving independence requires linkage disequilibrium (LD) filtering before analysis. Unfortunately, this tactic loses valuable information and usually retains many SNPs that are still in LD. The present paper explores the option of explicitly incorporating haplotypes in ancestry estimation. Our program, HaploADMIXTURE, operates on adjacent SNP pairs and jointly estimates their haplotype frequencies along with admixture coefficients. This more complex strategy takes advantage of the rich information available in haplotypes and ultimately yields better admixture estimates and better clustering of real populations in curated datasets.
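To make the frequentist approach mentioned in the abstract concrete, the following is a minimal Python sketch of the binomial log-likelihood under the standard SNP-independence admixture model, in which the reference-allele frequency for individual i at SNP j is the mixture sum over populations k of Q[i][k] * P[k][j]. It is not the haplotype-pair model of HaploADMIXTURE, and it is not taken from the authors' software. The function name `admixture_loglik`, the argument layout, and the toy data are assumptions made purely for illustration; the constant binomial coefficient is dropped because it does not depend on Q or P.

```python
import math

def admixture_loglik(G, Q, P, eps=1e-12):
    """Binomial log-likelihood of the SNP-independence admixture model.

    Hypothetical helper for illustration only (not the authors' code).
    G[i][j]: reference-allele count (0, 1, or 2) for individual i at SNP j,
             or None if the genotype is missing.
    Q[i][k]: admixture proportion of individual i from population k.
    P[k][j]: reference-allele frequency of SNP j in population k.
    """
    n_pops = len(P)
    total = 0.0
    for i, g_row in enumerate(G):
        for j, g in enumerate(g_row):
            if g is None:  # skip missing genotypes
                continue
            # Mixture allele frequency for individual i at SNP j.
            f = sum(Q[i][k] * P[k][j] for k in range(n_pops))
            # Clamp away from 0 and 1 so the logarithms stay finite.
            f = min(max(f, eps), 1.0 - eps)
            total += g * math.log(f) + (2 - g) * math.log(1.0 - f)
    return total

# Toy data (assumed): 3 individuals, 2 SNPs, 2 ancestral populations.
G = [[2, 0], [0, 2], [1, 1]]
P = [[0.9, 0.1], [0.1, 0.9]]
Q_good = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
Q_swapped = [[0.0, 1.0], [1.0, 0.0], [0.5, 0.5]]

ll_good = admixture_loglik(G, Q_good, P)
ll_swapped = admixture_loglik(G, Q_swapped, P)
print(ll_good > ll_swapped)
```

In this toy setup, the admixture proportions that match the genotypes score a higher log-likelihood than the swapped assignment. A frequentist fit would search over Q and P for the values that maximize this quantity; the optimization itself is omitted here.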
Pages: 4384-4395
Number of pages: 12