SAITS: Self-attention-based imputation for time series

被引:164
作者
Du, Wenjie [1 ]
Cote, David [2 ]
Liu, Yan [1 ]
机构
[1] Concordia Univ, Montreal, PQ H3G 1M8, Canada
[2] Ciena Corp, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Time series; Missing values; Imputation model; Self-attention; Neural network; MULTIPLE IMPUTATION;
D O I
10.1016/j.eswa.2023.119619
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing data in time series is a pervasive problem that puts obstacles in the way of advanced analysis. A popular solution is imputation, where the fundamental challenge is to determine what values should be filled in. This paper proposes SAITS, a novel method based on the self-attention mechanism for missing value imputation in multivariate time series. Trained by a joint-optimization approach, SAITS learns missing values from a weighted combination of two diagonally-masked self-attention (DMSA) blocks. DMSA explicitly captures both the temporal dependencies and feature correlations between time steps, which improves imputation accuracy and training speed. Meanwhile, the weighted-combination design enables SAITS to dynamically assign weights to the learned representations from two DMSA blocks according to the attention map and the missingness information. Extensive experiments quantitatively and qualitatively demonstrate that SAITS outperforms the state-of-the-art methods on the time-series imputation task efficiently and reveal SAITS' potential to improve the learning performance of pattern recognition models on incomplete time-series data from the real world.
引用
收藏
页数:15
相关论文
共 54 条
[1]  
Acuña E, 2004, ST CLASS DAT ANAL, P639
[2]  
[Anonymous], 1984, Time Series Analysis of Irregularly Observed Data
[3]  
[Anonymous], 1987, Statistical analysis with missing data
[4]  
Ashman M, 2020, Arxiv, DOI arXiv:2010.10177
[5]   Multiple imputation by chained equations: what is it and how does it work? [J].
Azur, Melissa J. ;
Stuart, Elizabeth A. ;
Frangakis, Constantine ;
Leaf, Philip J. .
INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2011, 20 (01) :40-49
[6]  
Ba JL, 2016, arXiv
[7]  
Bansal P, 2021, Arxiv, DOI arXiv:2103.01600
[8]   Imputation techniques for the reconstruction of missing interconnected data from higher Educational Institutions® [J].
Bruni, Renato ;
Daraio, Cinzia ;
Aureli, Davide .
KNOWLEDGE-BASED SYSTEMS, 2021, 212
[9]   Combining multiple imputation and meta-analysis with individual participant data [J].
Burgess, Stephen ;
White, Ian R. ;
Resche-Rigon, Matthieu ;
Wood, Angela M. .
STATISTICS IN MEDICINE, 2013, 32 (26) :4499-4514
[10]   Estimation of the limit of detection in semiconductor gas sensors through linearized calibration models [J].
Burgues, Javier ;
Manuel Jimenez-Soto, Juan ;
Marco, Santiago .
ANALYTICA CHIMICA ACTA, 2018, 1013 :13-25