Imputation of Missing Traffic Flow Data Using Denoising Autoencoders

被引:8
作者
Jiang, Boyuan [1 ,2 ]
Siddiqi, Muhammad Danial [2 ]
Asadi, Reza [2 ]
Regan, Amelia [1 ,2 ]
机构
[1] Univ Calif Irvine, Inst Transportat Studies ITS, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci ICS, Irvine, CA 92697 USA
来源
12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS | 2021年 / 184卷
关键词
Transportation data analysis; Spatio-temporal problem; Denoising autoencoder; Missing data imputation;
D O I
10.1016/j.procs.2021.03.122
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In transportation engineering, spatio-temporal data including traffic flow, speed, and occupancy are collected from different kinds of sensors and used by transportation engineers for analysis. However, the missing data influence the analysis and prediction results significantly. In this paper, Denoising Autoencoders are used to impute the missing traffic flow data. In our earlier research, we focused on a more general situation and used three kinds of Denoising Autoencoders: "Vanilla", CNN, and Bi-LSTM, to impute the data with a general missing rate of 30%. The Autoencoder models are used to train on data with a high missing rate of about 80% in this paper. We demonstrate that even under extreme loss conditions, and Autoencoder models are very robust. By observing the hyper-parameter tuning process, the changing prediction accuracy is shown and in most cases, the three models maintain the original accuracy even under the worst situations. Moreover, the error patterns and trends concerning different sensor stations and different hours on weekdays and weekends are also visualized and analyzed. Finally, based on these results, we separate the data into weekdays and weekends, train and test the model respectively, and improve the accuracy of the imputation result significantly. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 9 条
[1]  
Asadi R, 2019, P INT C ART INT ICAI, P206
[2]  
Asadi Reza, 2020, DEEP LEARNING MODELS
[3]   Spatio-Temporal Data Mining: A Survey of Problems and Methods [J].
Atluri, Gowtham ;
Karpatne, Anuj ;
Kumar, Vipin .
ACM COMPUTING SURVEYS, 2018, 51 (04)
[4]  
Costa Adriana Fonseca, 2018, INT S INT DAT AN INT S INT DAT AN
[5]   An efficient realization of deep learning for traffic data imputation [J].
Duan, Yanjie ;
Lv, Yisheng ;
Liu, Yu-Liang ;
Wang, Fei-Yue .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 72 :168-181
[6]  
Duan YJ, 2014, 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), P912, DOI 10.1109/ITSC.2014.6957805
[7]  
Gondara Lovedeep, 2018, PAC AS C KNOWL DISC PAC AS C KNOWL DISC
[8]  
Zhang JY, 2019, IEEE INT C BIOINFORM, P760, DOI 10.1109/BIBM47256.2019.8982996
[9]   Innovative method for traffic data imputation based on convolutional neural network [J].
Zhuang, Yifan ;
Ke, Ruimin ;
Wang, Yinhai .
IET INTELLIGENT TRANSPORT SYSTEMS, 2019, 13 (04) :605-613