STVANet: A spatio-temporal visual attention framework with large kernel attention mechanism for citywide traffic dynamics prediction

被引:2
作者
Yang, Hongtai [1 ,4 ]
Jiang, Junbo [1 ]
Zhao, Zhan [2 ]
Pan, Renbin [3 ]
Tao, Siyu [1 ]
机构
[1] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Natl United Engn Lab Integrated & Intelligent Tran, Sch Transportat & Logist,Inst Syst Sci & Engn, Chengdu 611756, Peoples R China
[2] Univ Hong Kong, Dept Urban Planning & Design, Hong Kong, Peoples R China
[3] Southwestern Univ Finance & Econ, Sch Business Adm, Chengdu 611130, Peoples R China
[4] Guangdong Hong Kong Macau Joint Lab Smart Cities, Hong Kong, Peoples R China
关键词
Traffic information; 2D ConvNets; Spatio-temporal data; Squeeze -and -Excitation mechanism; Deep learning; FLOW; NETWORKS; MODELS;
D O I
10.1016/j.eswa.2024.124466
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enhancing the efficiency and safety of the Intelligent Transportation System requires effective modeling and prediction of citywide traffic dynamics. Most studies employ convolutional neural networks (CNNs) with a 3D convolutional structure or spatio-temporal models with self-attention mechanisms to capture the spatio-temporal information of traffic distribution. Although 3D CNNs excel at capturing local contextual information, they are computationally complex due to the large number of parameters and cannot capture long-range dependence. By contrast, although self-attention mechanisms originally designed to address challenges in natural language processing can capture long-range dependence, their application to 2D image structures requires breaking down the inherent 2D context into a 1D sequence, increasing the computational complexity and neglecting the adaptability between local contextual information and channels. Accordingly, we propose a spatio-temporal visual attention neural network (STVANet), a novel spatio-temporal visual attention 2D CNN, which integrates a unique visual attention module with a large kernel attention (LKA) mechanism, a squeeze-andexcitation (SE) mechanism and a feedforward component to capture long-range dependence and channel information in urban traffic data while preserving the 2D image structure. LKA-based spatio-temporal attention networks extract spatial and temporal features from weekly, daily, and recent hourly periods, and aggregate them with weighted consideration of external features to make predictions. Evaluation of real-world datasets demonstrates STVANet's superiority over baseline models, showcasing its potential in citywide traffic prediction.
引用
收藏
页数:16
相关论文
共 55 条
[1]   Spatial-Temporal Complex Graph Convolution Network for Traffic Flow Prediction [J].
Bao, Yinxin ;
Huang, Jiashuang ;
Shen, Qinqin ;
Cao, Yang ;
Ding, Weiping ;
Shi, Zhenquan ;
Shi, Quan .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
[2]   A survey of results on mobile phone datasets analysis [J].
Blondel, Vincent D. ;
Decuyper, Adeline ;
Krings, Gautier .
EPJ DATA SCIENCE, 2015, 4 (01) :1-55
[3]   Traffic transformer: Capturing the continuity and periodicity of time series for traffic forecasting [J].
Cai, Ling ;
Janowicz, Krzysztof ;
Mai, Gengchen ;
Yan, Bo ;
Zhu, Rui .
TRANSACTIONS IN GIS, 2020, 24 (03) :736-755
[4]   Exploiting Spatio-Temporal Correlations with Multiple 3D Convolutional Neural Networks for Citywide Vehicle Flow Prediction [J].
Chen, Cen ;
Li, Kenli ;
Teo, Sin G. ;
Chen, Guizi ;
Zou, Xiaofeng ;
Yang, Xulei ;
Vijay, Ramaseshan C. ;
Feng, Jiashi ;
Zeng, Zeng .
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, :893-898
[5]   Multiple local 3D CNNs for region-based prediction in smart cities [J].
Chen, Yibi ;
Zou, Xiaofeng ;
Li, Kenli ;
Li, Keqin ;
Yang, Xulei ;
Chen, Cen .
INFORMATION SCIENCES, 2021, 542 :476-491
[6]   Spatio-temporal autocorrelation of road network data [J].
Cheng, Tao ;
Haworth, James ;
Wang, Jiaqiu .
JOURNAL OF GEOGRAPHICAL SYSTEMS, 2012, 14 (04) :389-413
[7]   Inferring fine-grained transport modes from mobile phone cellular signaling data [J].
Chin, Kimberley ;
Huang, Haosheng ;
Horn, Christopher ;
Kasanicky, Ivan ;
Weibel, Robert .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2019, 77
[8]  
Dosovitskiy A, 2020, ARXIV
[9]   Listening to the City, Attentively: A Spatio-Temporal Attention-Boosted Autoencoder for the Short-Term Flow Prediction Problem [J].
Fiorini, Stefano ;
Ciavotta, Michele ;
Maurino, Andrea .
ALGORITHMS, 2022, 15 (10)
[10]   3D-CLoST: A CNN-LSTM Approach for Mobility Dynamics Prediction in Smart Cities [J].
Fiorini, Stefano ;
Pilotti, Giorgio ;
Ciavotta, Michele ;
Maurino, Andrea .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :3180-3189