Biogeographical Ancestry Inference from Genotype: A Comparison of Ancestral Informative SNPs and Genome-wide SNPs

被引:0
作者
Qu, Yue [1 ]
Tran, Dat [1 ]
Martinez-Marroquin, Elisa [1 ]
机构
[1] Univ Canberra, Fac Sci & Technol, Canberra, ACT, Australia
来源
2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI) | 2020年
关键词
Biogeographical ancestry (BGA); Genome-wide analysis; Hidden Markov Model (HMM); Support Vector Machine (SVM); Convolutional Neural network (CNN); GENETIC ANCESTRY; POPULATION-STRUCTURE; POLYMORPHISM; PANEL; IDENTIFICATION; PREDICTION; DIVERSITY; ADMIXTURE; MARKERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The biogeographical ancestry (BGA) information can provide supporting information in epidemiology and leading intelligence in forensics. Several sets of ancestral informative markers (AIM) have been proposed to facilitate the BGA inference. A small set of markers can improve efficiency though, it has limitations in their ability of balancing different populations and differentiating sub-populations. Genome-wide SNPs provide much more comprehensive information of an individual's ancestral information. In this paper, we study the problem of BGA inference under the abundance of genome-wide high density data. We studied 1043 individuals from 7 continental populations of the Human Genonte Diversity Panel at 32212 gelatine-wide autosomal single nucleotide polymorphism (SNP) loci. We detected the population structure and compared the BGA inference accuracy using three widely used genetic sequence analysis algorithms through AIMs and genome-wide SNPs. Our results show that genome-wide SNPs reveal population structure with dearer clusterness and provide more accurate BGA inference, confirming the rich information carried by genome-wide SNPs. The findings help to give a clearer picture of candidate ancestral population groups of an individual, and potentially help the BGA inference in a fine population scale.
引用
收藏
页码:64 / 70
页数:7
相关论文
共 53 条
  • [1] [Anonymous], 1994, History and Geography of Human Genes
  • [2] Cann HM, 2002, SCIENCE, V296, P261
  • [3] Linkage equilibrium and genetic algorithms
    Chakraborty, UK
    Muhlenbein, H
    [J]. PROCEEDINGS OF 1997 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '97), 1997, : 25 - 29
  • [4] Prediction of biogeographical ancestry from genotype: a comparison of classifiers
    Cheung, Elaine Y. Y.
    Gahan, Michelle Elizabeth
    McNevin, Dennis
    [J]. INTERNATIONAL JOURNAL OF LEGAL MEDICINE, 2017, 131 (04) : 901 - 912
  • [5] COCKERHAM CC, 1993, EVOLUTION, V47, P855, DOI [10.2307/2410189, 10.1111/j.1558-5646.1993.tb01239.x]
  • [6] A DNA polymorphism discovery resource for research on human genetic variation
    Collins, FS
    Brooks, LD
    Chakravarti, A
    [J]. GENOME RESEARCH, 1998, 8 (12) : 1229 - 1231
  • [7] Using Hidden Markov Modeling for Biogeographical Ancestry Analysis
    Currie, Melvin R.
    [J]. JOURNAL OF HUMANISTIC MATHEMATICS, 2019, 9 (02):
  • [8] Dausset F. J., HGDP CEPH HUMAN GENO
  • [9] The roles of nearest neighbor methods in imputing missing data in forest inventory and monitoring databases
    Eskelson, Bianca N. I.
    Temesgen, Hailemariam
    Lemay, Valerie
    Barrett, Tara M.
    Crookston, Nicholas L.
    Hudak, Andrew T.
    [J]. SCANDINAVIAN JOURNAL OF FOREST RESEARCH, 2009, 24 (03) : 235 - 246
  • [10] Falush D, 2003, GENETICS, V164, P1567