Neural additive time-series models: Explainable deep learning for multivariate time-series prediction

被引:8
作者
Jo, Wonkeun [1 ]
Kim, Dongil [2 ]
机构
[1] Chungnam Natl Univ, Dept Comp Sci & Engn, 99 Daehak Ro, Daejeon 34134, South Korea
[2] Ewha Womans Univ, Dept Data Sci, 52 Ewhayeodae Gil, Seoul 03760, South Korea
基金
新加坡国家研究基金会;
关键词
Deep learning; Explainable artificial intelligence; Multivariate time series prediction; Neural additive models; Parameter sharing; ATTENTION; NETWORKS;
D O I
10.1016/j.eswa.2023.120307
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks are one of the most important methods in machine learning. The advantages of neural networks are their excellent prediction performance and versatility using deep architecture and generalized input-output forms. However, as neural networks are black-box models, they lack explanatory power for their predictions. In this study, we propose a new neural network architecture that includes the interpretability of predictions for multivariate time-series (MTS) data by employing a generalized additive method. In addition, we examine parameter sharing networks to decrease the model's complexity, along with hard-shared networks. We conducted experiments to demonstrate that the interpretable neural architecture can quantify the contributions of each input value to the prediction of each MTS by every time step and variable. Experimental results involving a toy example and four real-world datasets demonstrate that the performance of the proposed method in predicting MTS data is comparable to that of state-of-the-art neural networks, while providing reasonable importance for each input value.
引用
收藏
页数:19
相关论文
共 41 条
[1]  
Agarwal R, 2021, ADV NEURAL INFORM PR
[2]   Explaining anomalies detected by autoencoders using Shapley Additive Explanations [J].
Antwarg, Liat ;
Miller, Ronnie Mindlin ;
Shapira, Bracha ;
Rokach, Lior .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
[3]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[4]   TimeSHAP: Explaining Recurrent Models through Sequence Perturbations [J].
Bento, Joao ;
Saleiro, Pedro ;
Cruz, Andre F. ;
Figueiredo, Mario A. T. ;
Bizarro, Pedro .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :2565-2573
[5]  
Binkowski M, 2018, PR MACH LEARN RES, V80
[6]   Realtime nowcasting with a Bayesian mixed frequency model with stochastic volatility [J].
Carriero, Andrea ;
Clark, Todd E. ;
Marcellino, Massimiliano .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2015, 178 (04) :837-862
[7]  
[Clark Kevin ELECTRA ELECTRA], 2020, arXiv, DOI [DOI 10.48550/arXiv.2003.10555, DOI 10.48550/ARXIV.2003.10555, 10.48550/arXiv.2003.10555]
[8]  
Crawshaw Michael., MULTITASK LEARNING D
[9]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]   Real-time forecasting of time series in financial markets using sequentially trained dual-LSTMs [J].
Gajamannage, Kelum ;
Park, Yonggi ;
Jayathilake, Dilhani I. .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223