Analyzing the Korean reference genome with meta-imputation increased the imputation accuracy and spectrum of rare variants in the Korean population

被引:6
|
作者
Hwang, Mi Yeong [1 ,2 ]
Choi, Nak-Hyeon [1 ]
Won, Hong Hee [2 ]
Kim, Bong-Jo [1 ]
Kim, Young Jin [1 ]
机构
[1] Natl Inst Hlth, Dept Precis Med, Div Genome Sci, Cheongju, South Korea
[2] Sungkyunkwan Univ, Samsung Med Ctr, Samsung Adv Inst Hlth Sci & Technol SAIHST, Dept Digital Hlth, Seoul, South Korea
关键词
whole-genome sequencing (WGS); variant; genotype imputation; meta-imputation; Korean reference genome;
D O I
10.3389/fgene.2022.1008646
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genotype imputation is essential for enhancing the power of association-mapping and discovering rare and indels that are missed by most genotyping arrays. Imputation analysis can be more accurate with a population-specific reference panel or a multi-ethnic reference panel with numerous samples. The National Institute of Health, Republic of Korea, initiated the Korean Reference Genome (KRG) project to identify variants in whole-genome sequences of & SIM;20,000 Korean participants. In the pilot phase, we analyzed the data from 1,490 participants. The genetic characteristics and imputation performance of the KRG were compared with those of the 1,000 Genomes Project Phase 3, GenomeAsia 100K Project, ChinaMAP, NARD, and TOPMed reference panels. For comparison analysis, genotype panels were artificially generated using whole-genome sequencing data from combinations of four different ancestries (Korean, Japanese, Chinese, and European) and two population-specific optimized microarrays (Korea Biobank Array and UK Biobank Array). The KRG reference panel performed best for the Korean population (R (2) = 0.78-0.84, percentage of well-imputed is 91.9% for allele frequency > 5%), although the other reference panels comprised a larger number of samples with genetically different background. By comparing multiple reference panels and multi-ethnic genotype panels, optimal imputation was obtained using reference panels from genetically related populations and a population-optimized microarray. Indeed, the reference panels of KRG and TOPMed showed the best performance when applied to the genotype panels of KBA (R (2) = 0.84) and UKB (R (2) = 0.87), respectively. Using a meta-imputation approach to merge imputation results from different reference panels increased the imputation accuracy for rare variants (& SIM;7%) and provided additional well-imputed variants (& SIM;20%) with comparable imputation accuracy to that of the KRG. Our results demonstrate the importance of using a population-specific reference panel and meta-imputation to assess a substantial number of accurately imputed rare variants.
引用
收藏
页数:13
相关论文
共 11 条
  • [1] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Martin Ladouceur
    Celia M.T.Greenwood
    J.Brent Richards
    Journal of Genetics and Genomics, 2012, (10) : 545 - 550
  • [2] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Martin Ladouceur
    Celia MTGreenwood
    JBrent Richards
    遗传学报, 2012, 39 (10) : 545 - 550
  • [3] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Zheng, Hou-Feng
    Ladouceur, Martin
    Greenwood, Celia M. T.
    Richards, J. Brent
    JOURNAL OF GENETICS AND GENOMICS, 2012, 39 (10) : 545 - 550
  • [4] Improved Imputation Accuracy of Rare and Low-Frequency Genetic Variants Using Population-Specific High-Coverage Whole-Genome Sequencing Data Based Imputation Reference Panel
    Mitt, Mario
    Kals, Mart
    Parn, Kalle
    Gabriel, Stacey B.
    Lander, Eric S.
    Palotie, Aarno
    Ripatti, Samuli
    Morris, Andrew P.
    Metspalu, Andres
    Esko, Tonu
    Magi, Reedik
    Palta, Priit
    HUMAN HEREDITY, 2016, 81 (04) : 235 - 235
  • [5] Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel
    Mario Mitt
    Mart Kals
    Kalle Pärn
    Stacey B Gabriel
    Eric S Lander
    Aarno Palotie
    Samuli Ripatti
    Andrew P Morris
    Andres Metspalu
    Tõnu Esko
    Reedik Mägi
    Priit Palta
    European Journal of Human Genetics, 2017, 25 : 869 - 876
  • [6] Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel
    Mitt, Mario
    Kals, Mart
    Parn, Kalle
    Gabriel, Stacey B.
    Lander, Eric S.
    Palotie, Aarno
    Ripatti, Samuli
    Morris, Andrew P.
    Metspalu, Andres
    Esko, Tonu
    Magi, Reedik
    Palta, Priit
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 (07) : 869 - 876
  • [7] A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle
    Troy N. Rowan
    Jesse L. Hoff
    Tamar E. Crum
    Jeremy F. Taylor
    Robert D. Schnabel
    Jared E. Decker
    Genetics Selection Evolution, 51
  • [8] A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle
    Rowan, Troy N.
    Hoff, Jesse L.
    Crum, Tamar E.
    Taylor, Jeremy F.
    Schnabel, Robert D.
    Decker, Jared E.
    GENETICS SELECTION EVOLUTION, 2019, 51 (01)
  • [9] NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants
    Seong-Keun Yoo
    Chang-Uk Kim
    Hie Lim Kim
    Sungjae Kim
    Jong-Yeon Shin
    Namcheol Kim
    Joshua Sung Woo Yang
    Kwok-Wai Lo
    Belong Cho
    Fumihiko Matsuda
    Stephan C. Schuster
    Changhoon Kim
    Jong-Il Kim
    Jeong-Sun Seo
    Genome Medicine, 11
  • [10] NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants
    Yoo, Seong-Keun
    Kim, Chang-Uk
    Kim, Hie Lim
    Kim, Sungjae
    Shin, Jong-Yeon
    Kim, Namcheol
    Yang, Joshua Sung Woo
    Lo, Kwok-Wai
    Cho, Belong
    Matsuda, Fumihiko
    Schuster, Stephan C.
    Kim, Changhoon
    Kim, Jong-Il
    Seo, Jeong-Sun
    GENOME MEDICINE, 2019, 11 (01)