Multiple imputation for longitudinal data using Bayesian lasso imputation model

Cited by: 4
Authors
Yamaguchi, Yusuke [1]
Yoshida, Satoshi [1]
Misumi, Toshihiro [2]
Maruo, Kazushi [3]
Affiliations
[1] Astellas Pharma Inc, Data Sci, Dev, Tokyo, Japan
[2] Yokohama City Univ, Sch Med, Dept Biostat, Yokohama, Kanagawa, Japan
[3] Univ Tsukuba, Fac Med, Dept Biostat, Tsukuba, Ibaraki, Japan
Keywords
Bayesian lasso; longitudinal clinical study; missing data; multiple imputation; auxiliary variables; random forest; selection; bias; MICE
DOI
10.1002/sim.9315
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Multiple imputation is a promising approach to handling missing data and is widely used in the analysis of longitudinal clinical studies. A key consideration in implementing multiple imputation is to obtain accurate imputed values by specifying an imputation model that incorporates auxiliary variables potentially associated with the missing variables. The use of informative auxiliary variables is known to make the missing at random assumption more plausible and to help reduce the uncertainty of the imputations; however, in many cases it is not straightforward to pre-specify them. We propose a data-driven specification of the imputation model using Bayesian lasso in the context of longitudinal clinical studies, and develop a built-in function for the Bayesian lasso imputation model that runs within the framework of multiple imputation using chained equations. A simulation study suggested that the Bayesian lasso imputation model worked well in a variety of longitudinal study settings, providing unbiased treatment effect estimates with well-controlled type I error rates and coverage probabilities of the confidence interval; in contrast, ignoring the informative auxiliary variables led to serious bias and inflation of the type I error rate. Moreover, the Bayesian lasso imputation model offered higher statistical power than conventional imputation methods. In our simulation study, the gains in statistical power were remarkable when the sample size was small relative to the number of auxiliary variables. An illustration with a real example also suggested that the Bayesian lasso imputation model could give smaller standard errors of the treatment effect estimate.
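The idea in the abstract, imputing a partially observed outcome from many candidate auxiliary variables under a shrinkage prior, can be illustrated with a small sketch. This is not the authors' function and does not reproduce their chained-equations implementation. It assumes a single continuous outcome `y` with missing entries, a fully observed auxiliary matrix `X`, a fixed penalty `lam`, and the standard Bayesian lasso Gibbs sampler in Park and Casella's formulation. The function name `bayesian_lasso_impute` and all parameter names are hypothetical, and the intercept is fixed at the observed mean as a simplification.

```python
import numpy as np

def bayesian_lasso_impute(X, y, lam=1.0, n_iter=200, seed=0):
    """Fill NaN entries of y with one posterior predictive draw (illustrative only)."""
    rng = np.random.default_rng(seed)
    obs = ~np.isnan(y)
    Xo, yo = X[obs], y[obs]
    # Centre the observed data; the intercept is fixed at the observed mean (simplification).
    x_mean, y_mean = Xo.mean(axis=0), yo.mean()
    Xc, yc = Xo - x_mean, yo - y_mean
    n, p = Xc.shape
    XtX, Xty = Xc.T @ Xc, Xc.T @ yc
    beta, sigma2, inv_tau2 = np.zeros(p), yc.var(), np.ones(p)
    for _ in range(n_iter):
        # beta | rest ~ N(A^{-1} X'y, sigma2 * A^{-1}), with A = X'X + diag(1 / tau_j^2)
        A_inv = np.linalg.inv(XtX + np.diag(inv_tau2))
        A_inv = (A_inv + A_inv.T) / 2  # enforce symmetry against round-off
        beta = rng.multivariate_normal(A_inv @ Xty, sigma2 * A_inv)
        # sigma2 | rest ~ Inverse-Gamma(shape, rate)
        resid = yc - Xc @ beta
        shape = (n - 1 + p) / 2
        rate = (resid @ resid + beta @ (inv_tau2 * beta)) / 2
        sigma2 = rate / rng.gamma(shape)
        # 1 / tau_j^2 | rest ~ Inverse-Gaussian(mean, shape = lam^2)
        mean = np.sqrt(lam**2 * sigma2 / np.maximum(beta**2, 1e-12))
        inv_tau2 = rng.wald(mean, lam**2)
    # Draw the missing values from the predictive distribution at the final Gibbs state.
    mis = ~obs
    y_imp = y.copy()
    mu = y_mean + (X[mis] - x_mean) @ beta
    y_imp[mis] = mu + rng.normal(0.0, np.sqrt(sigma2), mis.sum())
    return y_imp

# Toy data (made up for illustration): 10 auxiliary variables, only the first is informative.
rng = np.random.default_rng(1)
X = rng.normal(size=(60, 10))
y = 2.0 * X[:, 0] + rng.normal(size=60)
y[:15] = np.nan
y_completed = bayesian_lasso_impute(X, y)
```

Calling the function several times with different `seed` values would give several completed datasets, which is the multiple imputation step; in a chained-equations setting a draw like this would be repeated for each incomplete variable in turn.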
Pages: 1042-1058
Page count: 17
Related papers
50 in total
  • [41] Bayesian nonparametric multiple imputation of partially observed data with ignorable nonresponse
    Paddock, SM
    BIOMETRIKA, 2002, 89 (03) : 529 - 538
  • [42] A Bayesian multiple imputation approach to bivariate functional data with missing components
    Jang, Jeong Hoon
    Manatunga, Amita K.
    Chang, Changgee
    Long, Qi
    STATISTICS IN MEDICINE, 2021, 40 (22) : 4772 - 4793
  • [43] An application of LASSO and multiple imputation techniques to income dynamics with cross-sectional data
    Lucchetti, Leonardo
    Corral, Paul
    Ham, Andres
    Garriga, Santiago
    REVIEW OF INCOME AND WEALTH, 2025, 71 (01)
  • [44] Correspondence Analysis with Incomplete Paired Data using Bayesian Imputation
    de Tibeiro, Jules J. S.
    Murdoch, Duncan J.
    BAYESIAN ANALYSIS, 2010, 5 (03) : 519 - 532
  • [45] Multivariate imputation of qualitative missing data using Bayesian networks
    Romero, V
    Salmerón, A
    SOFT METHODOLOGY AND RANDOM INFORMATION SYSTEMS, 2004 : 605 - 612
  • [46] Applying Stochastic Process Model to Imputation of Censored Longitudinal Data
    Zhbannikov, Ilya
    Arbeev, Konstantin
    Yashin, Anatoliy
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018 : 457 - 464
  • [47] A Note on Bayesian Inference After Multiple Imputation
    Zhou, Xiang
    Reiter, Jerome P.
    AMERICAN STATISTICIAN, 2010, 64 (02) : 159 - 163
  • [48] BAMITA: Bayesian multiple imputation for tensor arrays
    Jiang, Ziren
    Li, Gen
    Lock, Eric F.
    BIOSTATISTICS, 2024, 26 (01)
  • [49] Using a multiple imputation technique to merge data sets
    Brown, JG
    APPLIED ECONOMICS LETTERS, 2002, 9 (05) : 311 - 314
  • [50] Multiple Imputation for Missing Data Using Genetic Programming
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015 : 583 - 590