A Likelihood-Based Approach for Missing Genotype Data

Cited by: 1
Authors
D'Angelo, Gina M. [1]
Kamboh, M. Ilyas [3, 4]
Feingold, Eleanor [2]
Affiliations
[1] Washington Univ, Div Biostat, Sch Med, St Louis, MO 63110 USA
[2] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA
[3] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Human Genet, Pittsburgh, PA 15261 USA
[4] Univ Pittsburgh, Alzheimers Dis Res Ctr, Sch Med, Pittsburgh, PA 15261 USA
Keywords
Missing data; SNPs; Association studies; Logistic regression; Likelihood-based methods; PARAMETRIC REGRESSION-MODELS; GENOME-WIDE ASSOCIATION; LATENT VARIABLE MODELS; MAXIMUM-LIKELIHOOD; MULTIPLE IMPUTATION; COVARIATE DATA; POLYTOMOUS DATA; POLYMORPHISMS; INFERENCE; EQUATION;
DOI
10.1159/000273732
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Missing genotype data in a candidate gene association study can make it difficult to model the effects of multiple genetic variants simultaneously. In particular, when regression models are used to model phenotype as a function of SNP genotypes in several different genes, the most common approach is a complete case analysis, in which only individuals with no missing genotypes are included. But this can lead to substantial reduction in sample size and thus potential bias and loss in efficiency. A number of other methods for handling missing data are applicable, but have rarely been used in this context. The purpose of this paper is to describe how several standard methods for handling missing data can be applied or adapted to this problem, and to compare their performance using a simulation study. We demonstrate these techniques using an Alzheimer's disease association study. We show that the expectation-maximization algorithm and multiple imputation with a bootstrapped expectation-maximization sampling algorithm have the best properties of all the estimators studied. Copyright (C) 2010 S. Karger AG, Basel
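The sample-size loss from a complete case analysis can be illustrated with a short calculation. The sketch below is not taken from the paper: it assumes every genotype is missing independently with the same probability, and the 5% missing rate and SNP counts are hypothetical values chosen only for illustration. Under that assumption, the expected fraction of individuals retained is (1 - missing rate) raised to the number of SNPs.

```python
def complete_case_fraction(missing_rate: float, n_snps: int) -> float:
    """Expected fraction of individuals with no missing genotype at any SNP.

    Assumption (not from the paper): each genotype is missing independently
    with the same probability `missing_rate`.
    """
    if not 0.0 <= missing_rate <= 1.0:
        raise ValueError("missing_rate must be between 0 and 1")
    if n_snps < 0:
        raise ValueError("n_snps must be non-negative")
    return (1.0 - missing_rate) ** n_snps


if __name__ == "__main__":
    # Hypothetical per-SNP missing rate of 5%; SNP counts are illustrative.
    for n_snps in (1, 5, 10, 20):
        fraction = complete_case_fraction(0.05, n_snps)
        print(f"{n_snps:2d} SNPs: {fraction:.1%} of individuals retained")
```

Even a modest per-SNP missing rate compounds quickly as more SNPs enter the regression model, which is the motivation for methods that use the partially observed individuals.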
Pages: 171-183
Number of pages: 13