A Novel Confined Attention Mechanism Driven Bi-GRU Model for Traffic Flow Prediction

被引:16
作者
Chauhan, Nisha Singh [1 ]
Kumar, Neetesh [1 ]
Eskandarian, Azim [2 ]
机构
[1] Indian Inst Technol Roorkee, Roorkee 247667, Uttar Pradesh, India
[2] Virginia Tech, Dept Mech Engn, Autonomous Syst & Intelligent Machines ASIM Lab, Blacksburg, VA 24061 USA
关键词
Traffic flow prediction; deep learning; hybrid models; Bi-GRU; attention; external features;
D O I
10.1109/TITS.2024.3375890
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traffic congestion is a pressing issue worldwide, and machine learning (ML) methods are increasingly being used in Intelligent Transportation Systems (ITS) to address this problem. Deep hybrid models, in particular, have emerged as an efficient solution for traffic flow prediction. Among these models, Long Short-Term Memory (LSTM) and Bidirectional LSTM (Bi-LSTM) have been widely used to capture the temporal and periodic features of traffic data. The advancements in RNNs provide an opportunity to enhance the performance of existing models. Therefore, this work proposes a BiGRU-BiGRU model with two modules to extract temporal and periodic features from traffic data. Recurrent Neural Networks (RNNs) have proven to perform well using attention mechanism. However, there is a need for attention mechanism that strictly focuses on traffic dynamics and nearby data from the prediction points. Thus, a novel confined attention mechanism is proposed and incorporated into the first module to improve the model's performance by focusing only on the recent relevant information in the traffic flow sequence. Furthermore, the external features are integrated to improve the model's prediction performance. The proposed model is evaluated on the publicly available real-world dataset and compared with several baseline state-of-the-art methods. As an outcome, the model offers a reduction in the average value of RMSE, MAE, and MAPE for all the prediction horizons, that is ranged from 5.1% - 20.4%, 7.3% - 27.3%, and 6.1% - 56.6%, respectively.
引用
收藏
页码:9181 / 9191
页数:11
相关论文
共 43 条
[1]  
Afaq S, 2020, International journal of scientific and technology research, V9, P485
[2]  
Ahmed A. R., 1979, Analysis of Freeway Traffic Time-SeriesData By Using Box-Jenkins Techniques, V722
[3]   Leveraging Spatio-Temporal Patterns for Predicting Citywide Traffic Crowd Flows Using Deep Hybrid Neural Networks [J].
Ali, Ahmad ;
Zhu, Yanmin ;
Chen, Qiuxia ;
Yu, Jiadi ;
Cai, Haibin .
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, :125-132
[4]  
Bahdanau D., 2016, P ICLR, P1
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6]  
Chauhan N.S., 2022, P INT C NEUR INF PRO, P505
[7]  
Chen CY, 2011, IEEE INT VEH SYM, P607, DOI 10.1109/IVS.2011.5940418
[8]   Short-Term Traffic Flow Prediction Method for Urban Road Sections Based on SpaceTime Analysis and GRU [J].
Dai, Guowen ;
Ma, Changxi ;
Xu, Xuecai .
IEEE ACCESS, 2019, 7 :143025-143035
[9]  
Dua D., 2017, UCI MACHINE LEARNING
[10]   Improved Deep Hybrid Networks for Urban Traffic Flow Prediction Using Trajectory Data [J].
Duan, Zongtao ;
Yang, Yun ;
Zhang, Kai ;
Ni, Yuanyuan ;
Bajgain, Saurab .
IEEE ACCESS, 2018, 6 :31820-31827