Analyzing the Korean reference genome with meta-imputation increased the imputation accuracy and spectrum of rare variants in the Korean population

Cited by: 6
Authors
Hwang, Mi Yeong [1, 2]
Choi, Nak-Hyeon [1]
Won, Hong Hee [2]
Kim, Bong-Jo [1]
Kim, Young Jin [1]
Affiliations
[1] Natl Inst Hlth, Dept Precis Med, Div Genome Sci, Cheongju, South Korea
[2] Sungkyunkwan Univ, Samsung Med Ctr, Samsung Adv Inst Hlth Sci & Technol SAIHST, Dept Digital Hlth, Seoul, South Korea
Keywords
whole-genome sequencing (WGS); variant; genotype imputation; meta-imputation; Korean reference genome;
DOI
10.3389/fgene.2022.1008646
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Genotype imputation is essential for enhancing the power of association mapping and for discovering rare variants and indels that are missed by most genotyping arrays. Imputation can be more accurate with a population-specific reference panel or a multi-ethnic reference panel with numerous samples. The National Institute of Health, Republic of Korea, initiated the Korean Reference Genome (KRG) project to identify variants in whole-genome sequences of ~20,000 Korean participants. In the pilot phase, we analyzed the data from 1,490 participants. The genetic characteristics and imputation performance of the KRG were compared with those of the 1000 Genomes Project Phase 3, GenomeAsia 100K Project, ChinaMAP, NARD, and TOPMed reference panels. For the comparison, genotype panels were artificially generated using whole-genome sequencing data from combinations of four different ancestries (Korean, Japanese, Chinese, and European) and two population-specific optimized microarrays (Korea Biobank Array and UK Biobank Array). The KRG reference panel performed best for the Korean population (R² = 0.78-0.84; 91.9% of variants with allele frequency > 5% were well imputed), although the other reference panels comprised larger numbers of samples with genetically different backgrounds. Comparing multiple reference panels and multi-ethnic genotype panels showed that optimal imputation was obtained using reference panels from genetically related populations and a population-optimized microarray. Indeed, the KRG and TOPMed reference panels showed the best performance when applied to the genotype panels of KBA (R² = 0.84) and UKB (R² = 0.87), respectively. Using a meta-imputation approach to merge imputation results from different reference panels increased the imputation accuracy for rare variants (~7%) and provided additional well-imputed variants (~20%) with imputation accuracy comparable to that of the KRG. Our results demonstrate the importance of using a population-specific reference panel and meta-imputation to assess a substantial number of accurately imputed rare variants.
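To illustrate the accuracy figures quoted in the abstract, the following is a minimal Python sketch of how an imputation R² is commonly computed: as the squared Pearson correlation between true genotypes (0/1/2) and imputed dosages. It is not the authors' pipeline. The function names, the 0.8 "well-imputed" threshold, and the fixed-weight averaging in `combine_dosages` are all assumptions made for illustration; real meta-imputation tools estimate the weights from the data rather than taking them as fixed inputs.

```python
def imputation_r2(true_genotypes, imputed_dosages):
    """Squared Pearson correlation between true genotypes and imputed dosages."""
    n = len(true_genotypes)
    if n == 0 or n != len(imputed_dosages):
        raise ValueError("inputs must be non-empty and of equal length")
    mean_t = sum(true_genotypes) / n
    mean_d = sum(imputed_dosages) / n
    cov = sum((t - mean_t) * (d - mean_d)
              for t, d in zip(true_genotypes, imputed_dosages))
    var_t = sum((t - mean_t) ** 2 for t in true_genotypes)
    var_d = sum((d - mean_d) ** 2 for d in imputed_dosages)
    if var_t == 0 or var_d == 0:
        # A monomorphic variant carries no information; report 0 by convention.
        return 0.0
    return (cov * cov) / (var_t * var_d)


def combine_dosages(dosages_by_panel, weights):
    """Fixed-weight average of per-panel dosages.

    Hypothetical stand-in for meta-imputation: the weights here are supplied
    by the caller, which is an assumption of this sketch.
    """
    total = sum(weights)
    n_samples = len(dosages_by_panel[0])
    return [
        sum(w * panel[i] for w, panel in zip(weights, dosages_by_panel)) / total
        for i in range(n_samples)
    ]


def fraction_well_imputed(r2_values, threshold=0.8):
    """Share of variants whose R² meets the threshold (0.8 is an assumed cutoff)."""
    if not r2_values:
        return 0.0
    return sum(r2 >= threshold for r2 in r2_values) / len(r2_values)


if __name__ == "__main__":
    truth = [0, 1, 2, 1, 0, 2]
    panel_a = [0.1, 0.9, 1.8, 1.2, 0.0, 1.9]   # made-up dosages, one reference panel
    panel_b = [0.3, 1.1, 1.6, 0.8, 0.2, 2.0]   # made-up dosages, another panel
    merged = combine_dosages([panel_a, panel_b], weights=[0.7, 0.3])
    print(imputation_r2(truth, panel_a), imputation_r2(truth, merged))
```

The dosages in the usage block are invented purely to exercise the functions and say nothing about the panels discussed in the abstract.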
Pages: 13