A Bayesian multiple imputation approach to bivariate functional data with missing components

被引:3
作者
Jang, Jeong Hoon [1 ]
Manatunga, Amita K. [2 ]
Chang, Changgee [3 ]
Long, Qi [3 ]
机构
[1] Indiana Univ Sch Med, Dept Biostat & Hlth Data Sci, Indianapolis, IN 46202 USA
[2] Emory Univ, Rollins Sch Publ Hlth, Dept Biostat & Bioinformat, 1518 Clifton Rd NE, Atlanta, GA 30322 USA
[3] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
关键词
Bayesian latent factor model; bivariate functional data; curves; missing data; multiple imputation;
D O I
10.1002/sim.9093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Existing missing data methods for functional data mainly focus on reconstructing missing measurements along a single function-a univariate functional data setting. Motivated by a renal study, we focus on a bivariate functional data setting, where each sampling unit is a collection of two distinct component functions, one of which may be missing. Specifically, we propose a Bayesian multiple imputation approach based on a bivariate functional latent factor model that exploits the joint changing patterns of the component functions to allow accurate and stable imputation of one component given the other. We further extend the framework to address multilevel bivariate functional data with missing components by modeling and exploiting inter-component and intra-subject correlations. We develop a Gibbs sampling algorithm that simultaneously generates multiple imputations of missing component functions and posterior samples of model parameters. For multilevel bivariate functional data, a partially collapsed Gibbs sampler is implemented to improve computational efficiency. Our simulation study demonstrates that our methods outperform other competing methods for imputing missing components of bivariate functional data under various designs and missingness rates. The motivating renal study aims to investigate the distribution and pharmacokinetic properties of baseline and post-furosemide renogram curves that provide further insights into the underlying mechanism of renal obstruction, with post-furosemide renogram curves missing for some subjects. We apply the proposed methods to impute missing post-furosemide renogram curves and obtain more refined insights.
引用
收藏
页码:4772 / 4793
页数:22
相关论文
共 32 条
[1]   Sparse Bayesian infinite factor models [J].
Bhattacharya, A. ;
Dunson, D. B. .
BIOMETRIKA, 2011, 98 (02) :291-306
[2]  
BUUREN S, 2011, J STAT SOFTW, V45, P1, DOI DOI 10.18637/JSS.V045.I03
[3]   High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics [J].
Carvalho, Carlos M. ;
Chang, Jeffrey ;
Lucas, Joseph E. ;
Nevins, Joseph R. ;
Wang, Quanli ;
West, Mike .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) :1438-1456
[4]   MULTIVARIATE FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS: A NORMALIZATION APPROACH [J].
Chiou, Jeng-Min ;
Chen, Yu-Ting ;
Yang, Ya-Fang .
STATISTICA SINICA, 2014, 24 (04) :1571-1596
[5]   A functional data approach to missing value imputation and outlier detection for traffic flow data [J].
Chiou, Jeng-Min ;
Zhang, Yi-Chen ;
Chen, Wan-Hui ;
Chang, Chiung-Wen .
TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2014, 2 (02) :106-129
[6]   Approximating fragmented functional data by segments of Markov chains [J].
Delaigle, A. ;
Hall, P. .
BIOMETRIKA, 2016, 103 (04) :779-799
[7]   Classification Using Censored Functional Data [J].
Delaigle, Aurore ;
Hall, Peter .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (504) :1269-1283
[8]  
Gelman A., 1992, STAT SCI, V7, P457, DOI [DOI 10.1214/SS/1177011136, 10.1214/ss/1177011136]
[9]   STOCHASTIC RELAXATION, GIBBS DISTRIBUTIONS, AND THE BAYESIAN RESTORATION OF IMAGES [J].
GEMAN, S ;
GEMAN, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (06) :721-741
[10]   Measuring the pricing error of the arbitrage pricing theory [J].
Geweke, J ;
Zhou, GF .
REVIEW OF FINANCIAL STUDIES, 1996, 9 (02) :557-587