Estimating Missing Data in Temporal Data Streams Using Multi-Directional Recurrent Neural Networks

被引:153
|
作者
Yoon, Jinsung [1 ]
Zame, William R. [2 ]
van der Schaar, Mihaela [3 ,4 ]
机构
[1] Univ Calif Los Angeles, Dept Elect & Comp Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Econ & Math, Los Angeles, CA USA
[3] Univ Oxford, Dept Engn Sci, Oxford, England
[4] Alan Turing Inst, London, England
基金
美国国家科学基金会;
关键词
Missing data; temporal data streams; imputation; recurrent neural nets; MULTIPLE-IMPUTATION;
D O I
10.1109/TBME.2018.2874712
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Missing data is a ubiquitous problem. It is especially challenging in medical settings because many streams of measurements are collected at different-and often irregular-times. Accurate estimation of the missing measurements is critical for many reasons, including diagnosis, prognosis, and treatment. Existing methods address this estimation problem by interpolating within data streams or imputing across data streams (both of which ignore important information) or ignoring the temporal aspect of the data and imposing strong assumptions about the nature of the data-generating process and/or the pattern of missing data (both of which are especially problematic for medical data). We propose a new approach, based on a novel deep learning architecture that we call a Multi-directional Recurrent Neural Network that interpolates within data streams and imputes across data streams. We demonstrate the power of our approach by applying it to five real-world medical datasets. We show that it provides dramatically improved estimation of missing measurements in comparison to 11 state-of-the-art benchmarks (including Spline and Cubic Interpolations, MICE, MissForest, matrix completion, and several RNN methods); typical improvements in Root Mean Squared Error are between 35%-50%. Additional experiments based on the same five datasets demonstrate that the improvements provided by our method are extremely robust.
引用
收藏
页码:1477 / 1490
页数:14
相关论文
共 50 条
  • [1] Estimating missing data in data streams
    Jiang, Nan
    Gruenwald, Le
    ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 981 - +
  • [2] Recurrent neural networks for missing or asynchronous data
    Bengio, Y
    Gingras, F
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 395 - 401
  • [3] Speech enhancement with missing data techniques using recurrent neural networks
    Parveen, S
    Green, P
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 733 - 736
  • [4] Multi-directional beam steering using diffractive neural networks
    Idehenre, I. U.
    Mills, M. S.
    OPTICS EXPRESS, 2020, 28 (18): : 25915 - 25934
  • [5] A Multi-directional Approach for Missing Value Estimation in Multivariate Time Series Clinical Data
    Xiao Xu
    Xiaoshuang Liu
    Yanni Kang
    Xian Xu
    Junmei Wang
    Yuyao Sun
    Quanhe Chen
    Xiaoyu Jia
    Xinyue Ma
    Xiaoyan Meng
    Xiang Li
    Guotong Xie
    Journal of Healthcare Informatics Research, 2020, 4 : 365 - 382
  • [6] A Multi-directional Approach for Missing Value Estimation in Multivariate Time Series Clinical Data
    Xu, Xiao
    Liu, Xiaoshuang
    Kang, Yanni
    Xu, Xian
    Wang, Junmei
    Sun, Yuyao
    Chen, Quanhe
    Jia, Xiaoyu
    Ma, Xinyue
    Meng, Xiaoyan
    Li, Xiang
    Xie, Guotong
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2020, 4 (04) : 365 - 382
  • [7] Estimating missing data of wind speeds using neural network
    Siripitayananon, P
    Chen, HC
    Jin, KR
    IEEE SOUTHEASTCON 2002: PROCEEDINGS, 2002, : 343 - 348
  • [8] Multi-directional Geodesic Neural Networks via Equivariant Convolution
    Poulenard, Adrien
    Ovsjanikov, Maks
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (06):
  • [9] Speech recognition with missing data using recurrent neural nets
    Parveen, S
    Green, PD
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1189 - 1195
  • [10] Multi-directional Geodesic Neural Networks via Equivariant Convolution
    Poulenard, Adrien
    Ovsjanikov, Maks
    SIGGRAPH ASIA'18: SIGGRAPH ASIA 2018 TECHNICAL PAPERS, 2018,