Bayesian analysis of censored response data in family-based genetic association studies

被引:4
作者
Del Greco M, Fabiola [1 ,2 ]
Pattaro, Cristian [1 ,2 ]
Minelli, Cosetta [3 ]
Thompson, John R. [4 ]
机构
[1] European Acad Bolzano Bozen EURAC, Ctr Biomed, Bolzano, Italy
[2] Univ Lubeck, Lubeck, Germany
[3] Imperial Coll, Populat Hlth & Occupat Dis, Natl Heart & Lung Inst, London, England
[4] Univ Leicester, Dept Hlth Sci, Leicester, Leics, England
关键词
Bayesian methods; Genetic association studies; Left-censored data; Multiple imputation; Tobit model; MULTIPLE IMPUTATION; MISSING DATA; QUANTITATIVE TRAIT; COARSE DATA; MODELS; HERITABILITY; IGNORABILITY; STRATEGIES; COMPONENTS; REGRESSION;
D O I
10.1002/bimj.201400107
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Biomarkers are subject to censoring whenever some measurements are not quantifiable given a laboratory detection limit. Methods for handling censoring have received less attention in genetic epidemiology, and censored data are still often replaced with a fixed value. We compared different strategies for handling a left-censored continuous biomarker in a family-based study, where the biomarker is tested for association with a genetic variant, S, adjusting for a covariate, X. Allowing different correlations between X and S, we compared simple substitution of censored observations with the detection limit followed by a linear mixed effect model (LMM), Bayesian model with noninformative priors, Tobit model with robust standard errors, the multiple imputation (MI) with and without S in the imputation followed by a LMM. Our comparison was based on real and simulated data in which 20% and 40% censoring were artificially induced. The complete data were also analyzed with a LMM. In the MICROS study, the Bayesian model gave results closer to those obtained with the complete data. In the simulations, simple substitution was always the most biased method, the Tobit approach gave the least biased estimates at all censoring levels and correlation values, the Bayesian model and both MI approaches gave slightly biased estimates but smaller root mean square errors. On the basis of these results the Bayesian approach is highly recommended for candidate gene studies; however, the computationally simpler Tobit and the MI without S are both good options for genome-wide studies.
引用
收藏
页码:1039 / 1053
页数:15
相关论文
共 44 条
[1]  
[Anonymous], 2021, Bayesian data analysis
[2]   Practical and statistical issues in missing data for longitudinal patient-reported outcomes [J].
Bell, Melanie L. ;
Fairclough, Diane L. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2014, 23 (05) :440-459
[3]  
Box G E, 2011, BAYESIAN INFERENCE S
[4]  
Casella G., 2002, STAT INFERENCE
[5]   A comparison of inclusive and restrictive strategies in modern missing data procedures [J].
Collins, LM ;
Schafer, JL ;
Kam, CM .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :330-351
[6]  
DelsGreco M. F., 2011, HUM MOL GENET, V20, P1660
[7]   Semiparametric variance-component models for linkage and association analyses of censored trait data [J].
Diao, G. ;
Lin, D. Y. .
GENETIC EPIDEMIOLOGY, 2006, 30 (07) :570-581
[8]   Mapping quantitative trait loci with censored observations [J].
Diao, GQ ;
Lin, DY ;
Zou, F .
GENETICS, 2004, 168 (03) :1689-1698
[9]  
Dipak K. D., 2000, GEN LINEAR MODELS BA
[10]  
El Ghouch A, 2009, STAT SINICA, V19, P1621