A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks

被引:168
作者
Reza, Selim [1 ]
Ferreira, Marta Campos [1 ]
Machado, J. J. M. [2 ]
Tavares, Joao Manuel R. S. [1 ,2 ]
机构
[1] Univ Porto, Fac Engn, Rua Dr Roberto Frias,S-N, P-4200465 Porto, Portugal
[2] Univ Porto, Fac Engn, Dept Engn Mecan, Rua Dr Roberto Frias,S-N, P-4200465 Porto, Portugal
关键词
Intelligent transportation system; Time-series forecasting; Deep learning; Long short-term memory; Gated recurrent unit; PeMS; PREDICTION;
D O I
10.1016/j.eswa.2022.117275
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traffic flow forecasting is an essential component of an intelligent transportation system to mitigate congestion. Recurrent neural networks, particularly gated recurrent units and long short-term memory, have been the stateof-the-art traffic flow forecasting models for the last few years. However, a more sophisticated and resilient model is necessary to effectively acquire long-range correlations in the time-series data sequence under analysis. The dominant performance of transformers by overcoming the drawbacks of recurrent neural networks in natural language processing might tackle this need and lead to successful time-series forecasting. This article presents a multi-head attention based transformer model for traffic flow forecasting with a comparative analysis between a gated recurrent unit and a long-short term memory-based model on PeMS dataset in this context. The model uses 5 heads with 5 identical layers of encoder and decoder and relies on Square Subsequent Masking techniques. The results demonstrate the promising performance of the transform-based model in predicting long-term traffic flow patterns effectively after feeding it with substantial amount of data. It also demonstrates its worthiness by increasing the mean squared errors and mean absolute percentage errors by (1.25 - 47.8)% and (32.4 - 83.8)%, respectively, concerning the current baselines.
引用
收藏
页数:11
相关论文
共 37 条
[1]   Prediction of Road Traffic Flow Based on Deep Recurrent Neural Networks [J].
Bartlett, Zoe ;
Han, Liangxiu ;
Trung Thanh Nguyen ;
Johnson, Princy .
2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, :102-109
[2]  
Basak D., 2007, Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, V11, P203, DOI [10.1007/978-1-4302-5990-9_4, DOI 10.1007/978-1-4302-5990-9_4]
[3]  
Baum Christopher, 2018, Kpss: Stata module to compute kwiatkowskiphillips-schmidt-shin test for stationarity
[4]  
Chen C., 2002, Freeway performance measurement system (PeMS)
[5]  
Cholakov R., 2021, ARXIV PREPRINT ARXIV
[6]  
Chung JY, 2015, PR MACH LEARN RES, V37, P2067
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Dey R, 2017, MIDWEST SYMP CIRCUIT, P1597, DOI 10.1109/MWSCAS.2017.8053243
[9]   Random radial basis function kernel-based support vector machine [J].
Ding, Xiaojian ;
Liu, Jian ;
Yang, Fan ;
Cao, Jie .
JOURNAL OF THE FRANKLIN INSTITUTE, 2021, 358 (18) :10121-10140
[10]  
Elhenawy M, 2017, Advances in Science Technology and Engineering Systems Journal, V2, P678, DOI [10.25046/aj020387, 10.25046/aj020387, DOI 10.25046/AJ020387]