Multiple imputation for longitudinal data using Bayesian lasso imputation model

被引:4
作者
Yamaguchi, Yusuke [1 ]
Yoshida, Satoshi [1 ]
Misumi, Toshihiro [2 ]
Maruo, Kazushi [3 ]
机构
[1] Astellas Pharma Inc, Data Sci, Dev, Tokyo, Japan
[2] Yokohama City Univ, Sch Med, Dept Biostat, Yokohama, Kanagawa, Japan
[3] Univ Tsukuba, Fac Med, Dept Biostat, Tsukuba, Ibaraki, Japan
关键词
Bayesian lasso; longitudinal clinical study; missing data; multiple imputation; MISSING DATA; AUXILIARY VARIABLES; RANDOM FOREST; SELECTION; BIAS; MICE;
D O I
10.1002/sim.9315
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multiple imputation is a promising approach to handle missing data and is widely used in analysis of longitudinal clinical studies. A key consideration in the implementation of multiple imputation is to obtain accurate imputed values by specifying an imputation model that incorporates auxiliary variables potentially associated with missing variables. The use of informative auxiliary variables is known to be beneficial to make the missing at random assumption more plausible and help to reduce uncertainty of the imputations; however, it is not straightforward to pre-specify them in many cases. We propose a data-driven specification of the imputation model using Bayesian lasso in the context of longitudinal clinical study, and develop a built-in function of the Bayesian lasso imputation model which is performed within the framework of multiple imputation using chained equations. A simulation study suggested that the Bayesian lasso imputation model worked well in a variety of longitudinal study settings, providing unbiased treatment effect estimates with well-controlled type I error rates and coverage probabilities of the confidence interval; in contrast, ignorance of the informative auxiliary variables led to serious bias and inflation of type I error rate. Moreover, the Bayesian lasso imputation model offered higher statistical powers compared with conventional imputation methods. In our simulation study, the gains in statistical power were remarkable when the sample size was small relative to the number of auxiliary variables. An illustration through a real example also suggested that the Bayesian lasso imputation model could give smaller standard errors of the treatment effect estimate.
引用
收藏
页码:1042 / 1058
页数:17
相关论文
共 45 条
  • [11] International Council for Harmonisation, ADD EST SENS AN CLIN
  • [12] Bayesian adaptive Lasso
    Leng, Chenlei
    Minh-Ngoc Tran
    Nott, David
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2014, 66 (02) : 221 - 244
  • [13] Little R., 2002, Statistical analysis with missing data, V2nd ed., DOI DOI 10.1002/9781119013563
  • [14] The Prevention and Treatment of Missing Data in Clinical Trials
    Little, Roderick J.
    D'Agostino, Ralph
    Cohen, Michael L.
    Dickersin, Kay
    Emerson, Scott S.
    Farrar, John T.
    Frangakis, Constantine
    Hogan, Joseph W.
    Molenberghs, Geert
    Murphy, Susan A.
    Neaton, James D.
    Rotnitzky, Andrea
    Scharfstein, Daniel
    Shih, Weichung J.
    Siegel, Jay P.
    Stern, Hal
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2012, 367 (14) : 1355 - 1360
  • [15] The proportion of missing data should not be used to guide decisions on multiple imputation
    Madley-Dowd, Paul
    Hughes, Rachael
    Tilling, Kate
    Heron, Jon
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2019, 110 : 63 - 73
  • [16] Mallinckrodt C H, 2001, J Biopharm Stat, V11, P9, DOI 10.1081/BIP-100104194
  • [17] Mallinckrodt C.H., 2013, Preventing and treating missing data in longitudinal clinical trials: a practical guide
  • [18] MULTIPLE-IMPUTATION INFERENCES WITH UNCONGENIAL SOURCES OF INPUT
    MENG, XL
    [J]. STATISTICAL SCIENCE, 1994, 9 (04) : 538 - 558
  • [19] Statistical Issues and Recommendations for Clinical Trials Conducted During the COVID-19 Pandemic
    Meyer, R. Daniel
    Ratitch, Bohdana
    Wolbers, Marcel
    Marchenko, Olga
    Quan, Hui
    Li, Daniel
    Fletcher, Christine
    Li, Xin
    Wright, David
    Shentu, Yue
    Englert, Stefan
    Shen, Wei
    Dey, Jyotirmoy
    Liu, Thomas
    Zhou, Ming
    Bohidar, Norman
    Zhao, Peng-Liang
    Hale, Michael
    [J]. STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2020, 12 (04): : 399 - 411
  • [20] Molenberghs G., 2007, MISSING DATA CLIN ST, DOI [10.1002/9780470510445, DOI 10.1002/9780470510445]