BAMITA: Bayesian multiple imputation for tensor arrays

被引:0
|
作者
Jiang, Ziren [1 ]
Li, Gen [2 ]
Lock, Eric F. [1 ]
机构
[1] Univ Minnesota, Sch Publ Hlth, Div Biostat & Hlth Data Sci, 2221 Univ Ave SE, Minneapolis, MN 55414 USA
[2] Univ Michigan, Sch Publ Hlth, Dept Biostat, 1415 Washington Hts,M4210, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院;
关键词
Bayesian inference; microbiome data; missing data; multiple imputation; multiway data; DECOMPOSITION; REGRESSION;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data increasingly take the form of a multi-way array, or tensor, in several biomedical domains. Such tensors are often incompletely observed. For example, we are motivated by longitudinal microbiome studies in which several timepoints are missing for several subjects. There is a growing literature on missing data imputation for tensors. However, existing methods give a point estimate for missing values without capturing uncertainty. We propose a multiple imputation approach for tensors in a flexible Bayesian framework, that yields realistic simulated values for missing entries and can propagate uncertainty through subsequent analyses. Our model uses efficient and widely applicable conjugate priors for a CANDECOMP/PARAFAC (CP) factorization, with a separable residual covariance structure. This approach is shown to perform well with respect to both imputation accuracy and uncertainty calibration, for scenarios in which either single entries or entire fibers of the tensor are missing. For two microbiome applications, it is shown to accurately capture uncertainty in the full microbiome profile at missing timepoints and used to infer trends in species diversity for the population. Documented R code to perform our multiple imputation approach is available at https://github.com/lockEF/MultiwayImputation.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] A Bayesian tensor decomposition approach for spatiotemporal traffic data imputation
    Chen, Xinyu
    He, Zhaocheng
    Sun, Lijun
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 98 : 73 - 84
  • [2] Multiple imputation for longitudinal data using Bayesian lasso imputation model
    Yamaguchi, Yusuke
    Yoshida, Satoshi
    Misumi, Toshihiro
    Maruo, Kazushi
    STATISTICS IN MEDICINE, 2022, 41 (06) : 1042 - 1058
  • [3] Noise correction using Bayesian multiple imputation
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Seiffert, Chris
    Zhao, Lili
    IRI 2006: PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2006, : 478 - +
  • [4] A Note on Bayesian Inference After Multiple Imputation
    Zhou, Xiang
    Reiter, Jerome P.
    AMERICAN STATISTICIAN, 2010, 64 (02): : 159 - 163
  • [5] Bayesian Multiscale Multiple Imputation With Implications for Data Confidentiality
    Holan, Scott H.
    Toth, Daniell
    Ferreira, Marco A. R.
    Karr, Alan F.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) : 564 - 577
  • [6] Bayesian multiple imputation for assay data subject to measurement error
    Guo Y.
    Little R.J.
    Journal of Statistical Theory and Practice, 2013, 7 (2) : 219 - 232
  • [7] Multiple Imputation for Longitudinal Data Under a Bayesian Multilevel Model
    Demirtas, Hakan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (16-17) : 2812 - 2828
  • [8] Bayesian Latent Class Models for the Multiple Imputation of Categorical Data
    Vidotto, Davide
    Vermunt, Jeroen K.
    Van Deun, Katrijn
    METHODOLOGY-EUROPEAN JOURNAL OF RESEARCH METHODS FOR THE BEHAVIORAL AND SOCIAL SCIENCES, 2018, 14 (02) : 56 - 68
  • [9] Incremental Bayesian matrix/tensor learning for structural monitoring data imputation and response forecasting
    Ren, Pu
    Chen, Xinyu
    Sun, Lijun
    Sun, Hao
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 158
  • [10] Missing traffic data imputation and pattern discovery with a Bayesian augmented tensor factorization model
    Chen, Xinyu
    He, Zhaocheng
    Chen, Yixian
    Lu, Yuhuan
    Wang, Jiawei
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 104 : 66 - 77