BAMITA: Bayesian multiple imputation for tensor arrays

被引:0
|
作者
Jiang, Ziren [1 ]
Li, Gen [2 ]
Lock, Eric F. [1 ]
机构
[1] Univ Minnesota, Sch Publ Hlth, Div Biostat & Hlth Data Sci, 2221 Univ Ave SE, Minneapolis, MN 55414 USA
[2] Univ Michigan, Sch Publ Hlth, Dept Biostat, 1415 Washington Hts,M4210, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院;
关键词
Bayesian inference; microbiome data; missing data; multiple imputation; multiway data; DECOMPOSITION; REGRESSION;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data increasingly take the form of a multi-way array, or tensor, in several biomedical domains. Such tensors are often incompletely observed. For example, we are motivated by longitudinal microbiome studies in which several timepoints are missing for several subjects. There is a growing literature on missing data imputation for tensors. However, existing methods give a point estimate for missing values without capturing uncertainty. We propose a multiple imputation approach for tensors in a flexible Bayesian framework, that yields realistic simulated values for missing entries and can propagate uncertainty through subsequent analyses. Our model uses efficient and widely applicable conjugate priors for a CANDECOMP/PARAFAC (CP) factorization, with a separable residual covariance structure. This approach is shown to perform well with respect to both imputation accuracy and uncertainty calibration, for scenarios in which either single entries or entire fibers of the tensor are missing. For two microbiome applications, it is shown to accurately capture uncertainty in the full microbiome profile at missing timepoints and used to infer trends in species diversity for the population. Documented R code to perform our multiple imputation approach is available at https://github.com/lockEF/MultiwayImputation.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] BAYESIAN IMPUTATION FOR MISSING DATA
    Nads, Azman A.
    Polestico, Daisy Lou L.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 79 : 83 - 104
  • [22] Tensor time series imputation through tensor factor modelling
    Cen, Zetai
    Lam, Clifford
    JOURNAL OF ECONOMETRICS, 2025, 249
  • [23] A hybrid Bayesian network and tensor factorization approach for missing value imputation to improve breast cancer recurrence prediction
    Vazifehdan, Mahin
    Moattar, Mohammad Hossein
    Jalali, Mehrdad
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2019, 31 (02) : 175 - 184
  • [24] A Bayesian design for phase II clinical trials with delayed responses based on multiple imputation
    Cai, Chunyan
    Liu, Suyu
    Yuan, Ying
    STATISTICS IN MEDICINE, 2014, 33 (23) : 4017 - 4028
  • [25] A Bayesian hierarchical model for multiple imputation of urban spatio-temporal groundwater levels
    Manago, Kimberly F.
    Hogue, Terri S.
    Porter, Aaron
    Hering, Amanda S.
    STATISTICS & PROBABILITY LETTERS, 2019, 144 : 44 - 51
  • [26] IMPROVING BAYESIAN MIXTURE MODELS FOR MULTIPLE IMPUTATION OF MISSING DATA USING FOCUSED CLUSTERING
    Wei, Lan
    Reiter, Jerome P.
    REVSTAT-STATISTICAL JOURNAL, 2018, 16 (02) : 213 - 230
  • [27] Multiple imputation of longitudinal categorical data through bayesian mixture latent Markov models
    Vidotto, Davide
    Vermunt, Jeroen K.
    Van Deun, Katrijn
    JOURNAL OF APPLIED STATISTICS, 2020, 47 (10) : 1720 - 1738
  • [28] Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates
    Ryo Kato
    Takahiro Hoshino
    Annals of the Institute of Statistical Mathematics, 2020, 72 : 803 - 825
  • [29] The multiple adaptations of multiple imputation
    Reiter, Jerome P.
    Raghunathan, Trivellore E.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (480) : 1462 - 1471
  • [30] A note on approximate Bayesian bootstrap imputation
    Kim, JK
    BIOMETRIKA, 2002, 89 (02) : 470 - 477