Multiple imputation for longitudinal data using Bayesian lasso imputation model

被引：4

作者：

Yamaguchi, Yusuke ^{[1
]}

Yoshida, Satoshi ^{[1
]}

Misumi, Toshihiro ^{[2
]}

Maruo, Kazushi ^{[3
]}

机构：

[1] Astellas Pharma Inc, Data Sci, Dev, Tokyo, Japan

[2] Yokohama City Univ, Sch Med, Dept Biostat, Yokohama, Kanagawa, Japan

[3] Univ Tsukuba, Fac Med, Dept Biostat, Tsukuba, Ibaraki, Japan

来源：

STATISTICS IN MEDICINE | 2022年 / 41卷 / 06期

关键词：

Bayesian lasso; longitudinal clinical study; missing data; multiple imputation; MISSING DATA; AUXILIARY VARIABLES; RANDOM FOREST; SELECTION; BIAS; MICE;

D O I：

10.1002/sim.9315

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Multiple imputation is a promising approach to handle missing data and is widely used in analysis of longitudinal clinical studies. A key consideration in the implementation of multiple imputation is to obtain accurate imputed values by specifying an imputation model that incorporates auxiliary variables potentially associated with missing variables. The use of informative auxiliary variables is known to be beneficial to make the missing at random assumption more plausible and help to reduce uncertainty of the imputations; however, it is not straightforward to pre-specify them in many cases. We propose a data-driven specification of the imputation model using Bayesian lasso in the context of longitudinal clinical study, and develop a built-in function of the Bayesian lasso imputation model which is performed within the framework of multiple imputation using chained equations. A simulation study suggested that the Bayesian lasso imputation model worked well in a variety of longitudinal study settings, providing unbiased treatment effect estimates with well-controlled type I error rates and coverage probabilities of the confidence interval; in contrast, ignorance of the informative auxiliary variables led to serious bias and inflation of type I error rate. Moreover, the Bayesian lasso imputation model offered higher statistical powers compared with conventional imputation methods. In our simulation study, the gains in statistical power were remarkable when the sample size was small relative to the number of auxiliary variables. An illustration through a real example also suggested that the Bayesian lasso imputation model could give smaller standard errors of the treatment effect estimate.

引用

页码：1042 / 1058

页数：17

共 45 条

[1] Logistic Bayesian LASSO for Identifying Association with Rare Haplotypes and Application to Age-Related Macular Degeneration
Biswas, Swati
Lin, Shili
[J]. BIOMETRICS, 2012, 68 (02) : 587 - 597
[2] Multiple Imputation for Missing Data via Sequential Regression Trees
Burgette, Lane F.
Reiter, Jerome P.
[J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 172 (09) : 1070 - 1076
[3] MULTIPLE IMPUTATION FOR NONRESPONSE IN SURVEYS - RUBIN,DB
CAMPION, WM
[J]. JOURNAL OF MARKETING RESEARCH, 1989, 26 (04) : 485 - 486
[4] A comparison of inclusive and restrictive strategies in modern missing data procedures
Collins, LM
Schafer, JL
Kam, CM
[J]. PSYCHOLOGICAL METHODS, 2001, 6 (04) : 330 - 351
[5] Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data
Deng, Yi
Chang, Changgee
Ido, Moges Seyoum
Long, Qi
[J]. SCIENTIFIC REPORTS, 2016, 6
[6] Recursive partitioning for missing data imputation in the presence of interaction effects
Doove, L. L.
Van Buuren, S.
Dusseldorp, E.
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 72 : 92 - 104
[7] Gramacy R. B., 2019, MONOMVN ESTIMATION M
[8] Model uncertainty and variable selection in Bayesian lasso regression
Hans, Chris
[J]. STATISTICS AND COMPUTING, 2010, 20 (02) : 221 - 229
[9] Bayesian lasso regression
Hans, Chris
[J]. BIOMETRIKA, 2009, 96 (04) : 835 - 845
[10] Auxiliary variables in multiple imputation in regression with missing X: a warning against including too many in small sample research
Hardt, Jochen
Herke, Max
Leonhart, Rainer
[J]. BMC MEDICAL RESEARCH METHODOLOGY, 2012, 12

← 1 2 3 4 5 →