Genotype imputation for Han Chinese population using Haplotype Reference Consortium as reference

Cited by: 13
Authors
Lin, Yuan [1]
Liu, Lu [2, 3, 4]
Yang, Sen [2, 3, 4]
Li, Yun [5]
Lin, Dongxin [6]
Zhang, Xuejun [2, 3, 4]
Yin, Xianyong [2, 3, 4, 7]
Affiliations
[1] Peking Univ, Sch Life Sci, Ctr Bioinformat, Beijing 100871, Peoples R China
[2] Anhui Med Univ, Affiliated Hosp 1, Inst Dermatol, 81 Meishan Rd, Hefei 230032, Anhui, Peoples R China
[3] Anhui Med Univ, Affiliated Hosp 1, Dept Dermatol, 81 Meishan Rd, Hefei 230032, Anhui, Peoples R China
[4] Anhui Med Univ, Minist Educ, Key Lab Dermatol, Hefei 230032, Anhui, Peoples R China
[5] Univ N Carolina, Sch Med, Dept Genet, Chapel Hill, NC 27519 USA
[6] Chinese Acad Med Sci, Canc Inst & Hosp, Dept Etiol & Carcinogenesis, Beijing 100021, Peoples R China
[7] Univ Michigan, Dept Biostat, Ctr Stat Genet, Ann Arbor, MI 48109 USA
Funding
National Natural Science Foundation of China;
Keywords
GENOME-WIDE ASSOCIATION; GENETIC ARCHITECTURE; AFRICAN-AMERICANS; VARIANTS; SEQUENCE; METAANALYSIS; DISEASE; LOCI;
DOI
10.1007/s00439-018-1894-z
Chinese Library Classification number
Q3 [Genetics];
Subject classification codes
071007; 090102;
Abstract
Genotype imputation is now routinely performed in genomic analysis. Reference panel size, that is, the number of haplotypes in the reference panel, has been well established as a major driver of imputation accuracy. For that reason, huge efforts have been made worldwide to provide large reference panels, with the Haplotype Reference Consortium (HRC) currently being the largest available in the public domain. The imputation performance of HRC, whose samples are predominantly European, has mainly been evaluated in Europeans. We conducted whole-genome genotype imputation on two independent genome-wide genotyping datasets, one with 1000 European samples and the other with 1000 Han Chinese samples. We compared the results obtained using HRC with those obtained using the Phase III 1000 Genomes Project (1000G) reference panel. For the European dataset, using HRC improved imputation quality, especially for rare variants with minor allele frequency (MAF) < 0.1%. However, 1000G demonstrated better performance in the Han Chinese dataset, in both imputation quality and number of well-imputed variants. We validated the performance of the 1000G reference panel in a second, independent cohort of Han Chinese (N = 2402). Our study showcases the limitations of HRC for Han Chinese populations, strongly suggesting the necessity of building population-specific reference panels.
Pages: 431 - 436
Number of pages: 6
Related papers
50 in total
  • [31] Genotype imputation performance of three reference panels using African ancestry individuals
    Vergara, Candelaria
    Parker, Margaret M.
    Franco, Liliana
    Cho, Michael H.
    Valencia-Duarte, Ana V.
    Beaty, Terri H.
    Duggal, Priya
    HUMAN GENETICS, 2018, 137 (04) : 281 - 292
  • [32] Characterizing the inaccurate quality metric in genotype imputation using the TOPMed reference panel
    Shi, Mingyang
    Koido, Masaru
    Kamatani, Yoichiro
    Tanikawa, Chizu
    Matsuda, Koichi
    Terao, Chikashi
    Lathrop, Mark
    Munter, Markus
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 680 - 681
  • [33] Impact of reference population relatedness on imputation quality
    Petty, Lauren E.
    Tucker, Lindsay S.
    Highland, Heather M.
    Ramesh, Naveen
    Karhade, Mandar
    Hanis, Craig L.
    Below, Jennifer E.
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 574 - 575
  • [34] The Effect of Thalassemia on Erythrocyte Reference Intervals in a Representative Han Chinese Adult Population
    Xu, Jian-Hua
    Hao, Xiao-Ke
    Mu, Run-Qing
    Pan, Bai-Shen
    Zhang, Jie
    Peng, Ming-Ting
    Wang, Lan-Lan
    Huang, Xian-Zhang
    Ma, Yue-Yun
    Zhao, Min
    Guo, Wei
    Qiao, Rui
    Chen, Wen-Xiang
    Jiang, Hong
    Shang, Hong
    CLINICAL LABORATORY, 2015, 61 (3-4) : 405 - 414
  • [35] A diverse ancestrally-matched reference panel increases genotype imputation accuracy in a underrepresented population
    John Mauleekoonphairoj
    Sissades Tongsima
    Apichai Khongphatthanayothin
    Sean J. Jurgens
    Dominic S. Zimmerman
    Boosamas Sutjaporn
    Pharawee Wandee
    Connie R. Bezzina
    Koonlawee Nademanee
    Yong Poovorawan
    Scientific Reports, 13 (1)
  • [36] Effect of composition and size of the reference population in genotype imputation efficiency of INDUSCHIP in HF Crossbred cattle
    Saha, Sujit
    Nayee, Nilesh
    Shah, Heena
    Gajjar, Swapnil
    Kishore, G.
    Gupta, R. O.
    Trivedi, K. R.
    INDIAN JOURNAL OF DAIRY SCIENCE, 2020, 73 (03) : 250 - 255
  • [37] Haplotype diversity in mitochondrial genome in a Chinese Han population
    Ma, Ke
    Li, Hui
    Cao, Yu
    Zhao, Xuejun
    Liu, Wenbin
    Zhao, Xueying
    JOURNAL OF HUMAN GENETICS, 2016, 61 (10) : 903 - 906
  • [38] A diverse ancestrally-matched reference panel increases genotype imputation accuracy in a underrepresented population
    Mauleekoonphairoj, John
    Tongsima, Sissades
    Khongphatthanayothin, Apichai
    Jurgens, Sean J.
    Zimmerman, Dominic S.
    Sutjaporn, Boosamas
    Wandee, Pharawee
    Bezzina, Connie R.
    Nademanee, Koonlawee
    Poovorawan, Yong
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [39] Haplotype diversity in mitochondrial genome in a Chinese Han population
    Ke Ma
    Hui Li
    Yu Cao
    Xuejun Zhao
    Wenbin Liu
    Xueying Zhao
    Journal of Human Genetics, 2016, 61 : 903 - 906
  • [40] Accurate haplotype imputation with individualized ancestry-adjusted reference panels
    Song, Qing
    Xu, Wei
    Li, Wenzhi
    He, Shaohua
    Liu, Jiankang
    Wang, Guangming
    Ma, Li
    GENOMICS, 2018, 110 (05) : 329 - 335