An efficient spatial-temporal transformer with temporal aggregation and spatial memory for traffic forecasting

被引:5
作者
Liu, Aoyu [1 ]
Zhang, Yaying [1 ,2 ]
机构
[1] Tongji Univ, Serv Comp, Key Lab Embedded Syst, Minist Educ, Shanghai, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Traffic forecasting; Transformer; Memory network; Data mining; REGRESSION; PREDICTION;
D O I
10.1016/j.eswa.2024.123884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traffic forecasting technology has widespread applications in various domains, such as urban traffic planning and intelligent transportation systems. Traffic forecasting encounters challenges in effectively capturing the intricate spatial-temporal correlations in traffic data. While the latest methods have achieved satisfactory performance, they still suffer from two limitations: (i) Most methods overlook the memory of valuable traffic patterns at each traffic node, thus making it struggle to reveal dynamic spatial-temporal correlations using their inherent periodicity and trend characteristics from a broader perspective. (ii) As the research progresses, recently proposed models become increasingly complex and massive. To address these issues, we propose a Spatial-Temporal Aggregation Memory Transformer (STAMT) for traffic forecasting. Specifically, we propose a memory bank to enhance vanilla spatial attention and cache the traffic patterns of historical input. By querying these traffic patterns, which contain rich spatial-temporal semantic information, the model can optimize prediction performance by extracting trends and regularities across various periods. To reduce the computational costs, we introduce a temporal module to capture temporal correlations while reducing temporal dimension information. In addition, we leverage the random feature map and matrix multiplication associativity property, which reduce the quadratic complexity of spatial attention to linearity with regard to the number of nodes. Ultimately, our theoretical analysis concludes that a single-layer spatial attention network is sufficient to capture spatial-temporal correlations deeply without stacking. Extensive experiments on nine real-world datasets demonstrate that STAMT outperforms state-of-the-art baselines in regular, long-range, and large-scale traffic forecasting tasks while significantly reducing the computational costs. Codes are available at https://github.com/LiuAoyu1998/STAMT.
引用
收藏
页数:15
相关论文
共 55 条
[31]   Transformer-enhanced periodic temporal convolution network for long short-term traffic flow forecasting [J].
Ren, Qianqian ;
Li, Yang ;
Liu, Yong .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
[32]   Spatial-Temporal Identity: A Simple yet Effective Baseline for Multivariate Time Series Forecasting [J].
Shao, Zezhi ;
Zhang, Zhao ;
Wang, Fei ;
Wei, Wei ;
Xu, Yongjun .
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, :4454-4458
[33]  
Shao Zezhi, 2022, Decoupled dynamic spatial-temporal graph neural network for traffic forecasting, V15, P2733
[34]   DAGCRN: Graph convolutional recurrent network for traffic forecasting with dynamic adjacency matrix [J].
Shi, Zheng ;
Zhang, Yingjun ;
Wang, Jingping ;
Qin, Jiahu ;
Liu, Xiaoqian ;
Yin, Hui ;
Huang, Hua .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
[35]  
Song C, 2020, AAAI CONF ARTIF INTE, V34, P914
[36]   A Survey on Modern Deep Neural Network for Traffic Prediction: Trends, Methods and Challenges [J].
Tedjopurnomo, David Alexander ;
Bao, Zhifeng ;
Zheng, Baihua ;
Choudhury, Farhana ;
Qin, A. K. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (04) :1544-1561
[37]  
Velickovic P., 2018, PROC INT C LEARN REP, P1
[38]   Deep Learning for Spatio-Temporal Data Mining: A Survey [J].
Wang, Senzhang ;
Cao, Jiannong ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) :3681-3700
[39]  
Wang ZN, 2022, AAAI CONF ARTIF INTE, P4228
[40]   Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results [J].
Williams, BM ;
Hoel, LA .
JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (06) :664-672