Traffic Prediction With Missing Data: A Multi-Task Learning Approach

Cited by: 25
Authors
Wang, Ao [1]
Ye, Yongchao [1]
Song, Xiaozhuang [1]
Zhang, Shiyao [2]
Yu, James J. Q. [1]
Affiliations
[1] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Guangdong Prov Key Lab Brain Inspired Intelligent, Shenzhen 518055, Peoples R China
[2] Southern Univ Sci & Technol, Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Peoples R China
Keywords
Task analysis; Predictive models; Training; Multitasking; Feature extraction; Deep learning; Data mining; Traffic speed prediction; missing data; spatio-temporal modeling; deep learning; multi-task learning; MODEL;
DOI
10.1109/TITS.2022.3233890
Chinese Library Classification
TU [Building Science]
Subject classification code
0813
Abstract
Traffic speed prediction based on real-world traffic data is a classical problem in intelligent transportation systems (ITS). Most existing traffic speed prediction models assume that traffic data are complete or contain only rare missing values. However, data collected in real-world scenarios are often incomplete due to various human and natural factors. Although this problem can be addressed by first estimating the missing values with an imputation model and then applying a prediction model, the imputation step can destroy critical latent features and cause errors to accumulate. To tackle this problem, we propose a graph-based spatio-temporal autoencoder with an encoder-decoder structure for spatio-temporal traffic speed prediction with missing values. Specifically, we regard imputation and prediction as two parallel tasks and train them sequentially, which eliminates the negative impact of imputation on the raw data used for prediction and accelerates model training. Furthermore, we use graph convolutional layers with a self-adaptive adjacency matrix to model spatial dependencies and gated recurrent units for temporal learning. To evaluate the proposed model, we conduct comprehensive case studies on two real-world traffic datasets with two different missing patterns and a wide, practical range of missing rates from 20% to 80%. Experimental results demonstrate that the model consistently outperforms state-of-the-art methods for traffic prediction with missing values and achieves steady performance across the investigated missing scenarios and prediction horizons.
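The abstract gives no formulas, so the following is only an illustrative sketch of two ideas it mentions: a self-adaptive adjacency matrix and a loss that ignores missing entries. The softmax(ReLU(E1 E2^T)) form of the adjacency is borrowed from other graph-based traffic models and is an assumption here, not the paper's confirmed definition. The names `adaptive_adjacency` and `masked_mae`, the embedding size, and the 40% missing rate are all hypothetical.

```python
import numpy as np


def softmax_rows(x):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    shifted = x - x.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def adaptive_adjacency(e_src, e_dst):
    """Assumed form: softmax(ReLU(E1 @ E2^T)) over learnable node embeddings."""
    return softmax_rows(np.maximum(e_src @ e_dst.T, 0.0))


def masked_mae(pred, target, mask):
    """Mean absolute error computed only where mask marks an observed value."""
    weights = mask.astype(float)
    return float((np.abs(pred - target) * weights).sum() / max(weights.sum(), 1.0))


rng = np.random.default_rng(0)
n_nodes, emb_dim = 5, 3

# Node embeddings would be trained in a real model; here they are random.
e_src = rng.normal(size=(n_nodes, emb_dim))
e_dst = rng.normal(size=(n_nodes, emb_dim))
adj = adaptive_adjacency(e_src, e_dst)

# Synthetic speeds with roughly 40% of the readings hidden (hypothetical rate).
speeds = rng.uniform(20.0, 70.0, size=(n_nodes, 1))
observed_mask = rng.random((n_nodes, 1)) > 0.4
observed = np.where(observed_mask, speeds, 0.0)

# One graph propagation step: each node mixes the observed values of the others.
propagated = adj @ observed
loss = masked_mae(propagated, speeds, observed_mask)
```

Each row of `adj` sums to one, so the propagation step is a weighted average over nodes. In a full model this step would sit inside learned graph convolutional and recurrent layers rather than being applied directly to the raw readings.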
Pages: 4189-4202
Number of pages: 14