Deep imputation of missing values in time series health data: A review with benchmarking

Cited by: 13
Authors
Kazijevs, Maksims [1]
Samad, Manar D. [1]
Affiliations
[1] Tennessee State Univ, Dept Comp Sci, Nashville, TN 37209 USA
Funding
National Institutes of Health (USA);
Keywords
Time series; Multivariate data; Longitudinal imputation; Cross-sectional imputation; Missing value imputation; Deep neural network; Electronic health records; Sensor data;
DOI
10.1016/j.jbi.2023.104440
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
The imputation of missing values in multivariate time series (MTS) data is critical in ensuring data quality and producing reliable data-driven predictive models. Apart from many statistical approaches, a few recent studies have proposed state-of-the-art deep learning methods to impute missing values in MTS data. However, the evaluation of these deep methods is limited to one or two data sets, low missing rates, and completely random missing value types. This survey performs six data-centric experiments to benchmark state-of-the-art deep imputation methods on five time series health data sets. Our extensive analysis reveals that no single imputation method outperforms the others on all five data sets. The imputation performance depends on data types, individual variable statistics, missing value rates, and types. Deep learning methods that jointly perform cross-sectional (across variables) and longitudinal (across time) imputations of missing values in time series data yield statistically better data quality than traditional imputation methods. Although computationally expensive, deep learning methods are practical given the current availability of high-performance computing resources, especially when data quality and sample size are of paramount importance in healthcare informatics. Our findings highlight the importance of data-centric selection of imputation methods to optimize data-driven predictive models.
Pages: 18
Related papers
50 in total
  • [21] Trend Tests in Time Series with Missing Values: a Case Study with Imputation
    Rosario Ramos, M.
    Cordeiro, Clara
    11TH INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2013, PTS 1 AND 2 (ICNAAM 2013), 2013, 1558 : 1909 - 1912
  • [22] A Review of Missing Values Handling Methods on Time-Series Data
    Pratama, Irfan
    Permanasari, Adhistya Erna
    Ardiyanto, Igi
    Indrayani, Rini
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2016,
  • [23] Combining Convolution and Transformer for Missing Time Series Data Imputation
    Wang, Yi-Fan
    Bu, Shuai-Yu
    Yan, Jing-Hua
    Hou, Zhi-Wen
    Bu, Ling-Bin
    Meng, Fan-Xu
    Journal of Network Intelligence, 2023, 8 (03): : 823 - 838
  • [24] Comparison of Missing Data Imputation Methods in Time Series Forecasting
    Ahn, Hyun
    Sun, Kyunghee
    Kim, Kwanghoon Pio
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (01): : 767 - 779
  • [25] Imputation of Missing Values in Time Series Using an Adaptive-Learned Median-Filled Deep Autoencoder
    Pan, Zhuofu
    Wang, Yalin
    Wang, Kai
    Chen, Hongtian
    Yang, Chunhua
    Gui, Weihua
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 695 - 706
  • [26] Handling missing values in healthcare data: A systematic review of deep learning-based imputation techniques
    Liu, Mingxuan
    Li, Siqi
    Yuan, Han
    Ong, Marcus Eng Hock
    Ning, Yilin
    Xie, Feng
    Saffari, Seyed Ehsan
    Shang, Yuqing
    Volovici, Victor
    Chakraborty, Bibhas
    Liu, Nan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 142
  • [27] Visually and Statistically Guided Imputation of Missing Values in Univariate Seasonal Time Series
    Boegl, M.
    Filzmoser, P.
    Gschwandtner, T.
    Miksch, S.
    Aigner, W.
    Rind, A.
    Lammarsch, T.
    2015 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY, 2015, : 189 - 190
  • [28] Imputation of missing values in environmental time series by D-vine copulas
    Chapon, Antoine
    Ouarda, Taha B. M. J.
    Hamdi, Yasser
    WEATHER AND CLIMATE EXTREMES, 2023, 41
  • [29] Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series
    Khayati, Mourad
    Lerner, Alberto
    Tymchenko, Zakhar
    Cudre-Mauroux, Philippe
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (05): : 768 - 782
  • [30] TS-Pothole: automated imputation of missing values in univariate time series
    Sanwouo, Brell
    Quinton, Clément
    Rouvoy, Romain
    Neural Computing and Applications, 2024, 36 (36) : 22923 - 22955