Improving accuracy of rare variant imputation with a two-step imputation approach

被引:0
|
作者
Eskil Kreiner-Møller
Carolina Medina-Gomez
André G Uitterlinden
Fernando Rivadeneira
Karol Estrada
机构
[1] Erasmus University Medical Center,Department of Internal Medicine
[2] Genetic Laboratory of Internal Medicin,Department of Medicine
[3] COPSAC,undefined
[4] Faculty of Health Sciences,undefined
[5] University of Copenhagen,undefined
[6] Copenhagen Prospective Studies on Asthma in Childhood,undefined
[7] The Danish Pediatric Asthma Center,undefined
[8] Copenhagen University Hospital,undefined
[9] Ledreborg Alle 34,undefined
[10] Gentofte,undefined
[11] Denmark,undefined
[12] Analytic and Translational Genetics Unit,undefined
[13] Massachusetts General Hospital and Harvard Medical School,undefined
[14] Program in Medical and Population Genetics,undefined
[15] Broad Institute,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Genotype imputation has been the pillar of the success of genome-wide association studies (GWAS) for identifying common variants associated with common diseases. However, most GWAS have been run using only 60 HapMap samples as reference for imputation, meaning less frequent and rare variants not being comprehensively scrutinized. Next-generation arrays ensuring sufficient coverage together with new reference panels, as the 1000 Genomes panel, are emerging to facilitate imputation of low frequent single-nucleotide polymorphisms (minor allele frequency (MAF) <5%). In this study, we present a two-step imputation approach improving the quality of the 1000 Genomes imputation by genotyping only a subset of samples to create a local reference population on a dense array with many low-frequency markers. In this approach, the study sample, genotyped with a first generation array, is imputed first to the local reference sample genotyped on a dense array and hereafter to the 1000 Genomes reference panel. We show that mean imputation quality, measured by the r2 using this approach, increases by 28% for variants with a MAF between 1 and 5% as compared with direct imputation to 1000 Genomes reference. Similarly, the concordance rate between calls of imputed and true genotypes was found to be significantly higher for heterozygotes (P<1e-15) and rare homozygote calls (P<1e-15) in this low frequency range. The two-step approach in our setting improves imputation quality compared with traditional direct imputation noteworthy in the low-frequency spectrum and is a cost-effective strategy in large epidemiological studies.
引用
收藏
页码:395 / 400
页数:5
相关论文
共 50 条
  • [1] Improving accuracy of rare variant imputation with a two-step imputation approach
    Kreiner-Moller, Eskil
    Medina-Gomez, Carolina
    Uitterlinden, Andre G.
    Rivadeneira, Fernando
    Estrada, Karol
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2015, 23 (03) : 395 - 400
  • [2] A Two-Step Approach for Improving Sentiment Classification Accuracy
    Azam, Muhammad
    Ahmed, Tanvir
    Ahmad, Rehan
    Rehman, Ateeq Ur
    Sabah, Fahad
    Asif, Rao Muhammad
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 30 (03): : 853 - 867
  • [3] Choice Set Imputation Two-Step Weighted Stratified and Hazard-Based Approach
    Langerudi, Mehran Fasihozaman
    Javanmardi, Mahmoud
    Mohammadian, Abolfazl
    Sriraj, P. S.
    TRANSPORTATION RESEARCH RECORD, 2014, (2429) : 79 - 89
  • [4] Mechanism-aware imputation: a two-step approach in handling missing values in metabolomics
    Jonathan P. Dekermanjian
    Elin Shaddox
    Debmalya Nandy
    Debashis Ghosh
    Katerina Kechris
    BMC Bioinformatics, 23
  • [5] Mechanism-aware imputation: a two-step approach in handling missing values in metabolomics
    Dekermanjian, Jonathan P.
    Shaddox, Elin
    Nandy, Debmalya
    Ghosh, Debashis
    Kechris, Katerina
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [6] A Two-Step Semiparametric Method to Accommodate Sampling Weights in Multiple Imputation
    Zhou, Hanzhi
    Elliott, Michael R.
    Raghunathan, Trviellore E.
    BIOMETRICS, 2016, 72 (01) : 242 - 252
  • [7] PreCimp: Pre-collapsing imputation approach increases imputation accuracy of rare variants in terms of collapsed variables
    Kim, Young Jin
    Lee, Juyoung
    Kim, Bong-Jo
    Park, Taesung
    GENETIC EPIDEMIOLOGY, 2017, 41 (01) : 41 - 50
  • [8] Latent Interaction Effect in the CLPM Model: A Two-Step Multiple Imputation Analysis
    Tseng, Ming-Chi
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2025, 32 (01) : 26 - 35
  • [9] On Improving Imputation Accuracy of LTE Spectrum Measurements Data
    Chaudhry, Aizaz
    Li, Wei
    Basri, Amir
    Patenaude, Francois
    2018 WIRELESS TELECOMMUNICATIONS SYMPOSIUM (WTS), 2018,
  • [10] Improving Imputation Accuracy in Ordinal Data Using Classification
    Alam, Shafiq
    Dobbie, Gillian
    Sun, XiaoBin
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2016), 2017, 557 : 45 - 56