STVANet: A spatio-temporal visual attention framework with large kernel attention mechanism for citywide traffic dynamics prediction

被引：2

作者：

Yang, Hongtai ^{[1
,4
]}

Jiang, Junbo ^{[1
]}

Zhao, Zhan ^{[2
]}

Pan, Renbin ^{[3
]}

Tao, Siyu ^{[1
]}

机构：

[1] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Natl United Engn Lab Integrated & Intelligent Tran, Sch Transportat & Logist,Inst Syst Sci & Engn, Chengdu 611756, Peoples R China

[2] Univ Hong Kong, Dept Urban Planning & Design, Hong Kong, Peoples R China

[3] Southwestern Univ Finance & Econ, Sch Business Adm, Chengdu 611130, Peoples R China

[4] Guangdong Hong Kong Macau Joint Lab Smart Cities, Hong Kong, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 254卷

关键词：

Traffic information; 2D ConvNets; Spatio-temporal data; Squeeze -and -Excitation mechanism; Deep learning; FLOW; NETWORKS; MODELS;

D O I：

10.1016/j.eswa.2024.124466

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Enhancing the efficiency and safety of the Intelligent Transportation System requires effective modeling and prediction of citywide traffic dynamics. Most studies employ convolutional neural networks (CNNs) with a 3D convolutional structure or spatio-temporal models with self-attention mechanisms to capture the spatio-temporal information of traffic distribution. Although 3D CNNs excel at capturing local contextual information, they are computationally complex due to the large number of parameters and cannot capture long-range dependence. By contrast, although self-attention mechanisms originally designed to address challenges in natural language processing can capture long-range dependence, their application to 2D image structures requires breaking down the inherent 2D context into a 1D sequence, increasing the computational complexity and neglecting the adaptability between local contextual information and channels. Accordingly, we propose a spatio-temporal visual attention neural network (STVANet), a novel spatio-temporal visual attention 2D CNN, which integrates a unique visual attention module with a large kernel attention (LKA) mechanism, a squeeze-andexcitation (SE) mechanism and a feedforward component to capture long-range dependence and channel information in urban traffic data while preserving the 2D image structure. LKA-based spatio-temporal attention networks extract spatial and temporal features from weekly, daily, and recent hourly periods, and aggregate them with weighted consideration of external features to make predictions. Evaluation of real-world datasets demonstrates STVANet's superiority over baseline models, showcasing its potential in citywide traffic prediction.

引用

页数：16

共 55 条

[1] Spatial-Temporal Complex Graph Convolution Network for Traffic Flow Prediction [J].

Bao, Yinxin ;

Huang, Jiashuang ;

Shen, Qinqin ;

Cao, Yang ;

Ding, Weiping ;

Shi, Zhenquan ;

Shi, Quan .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121

[2] A survey of results on mobile phone datasets analysis [J].

Blondel, Vincent D. ;

Decuyper, Adeline ;

Krings, Gautier .

EPJ DATA SCIENCE, 2015, 4 (01) :1-55

[3] Traffic transformer: Capturing the continuity and periodicity of time series for traffic forecasting [J].

Cai, Ling ;

Janowicz, Krzysztof ;

Mai, Gengchen ;

Yan, Bo ;

Zhu, Rui .

TRANSACTIONS IN GIS, 2020, 24 (03) :736-755

[4] Exploiting Spatio-Temporal Correlations with Multiple 3D Convolutional Neural Networks for Citywide Vehicle Flow Prediction [J].

Chen, Cen ;

Li, Kenli ;

Teo, Sin G. ;

Chen, Guizi ;

Zou, Xiaofeng ;

Yang, Xulei ;

Vijay, Ramaseshan C. ;

Feng, Jiashi ;

Zeng, Zeng .

2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, :893-898

[5] Multiple local 3D CNNs for region-based prediction in smart cities [J].

Chen, Yibi ;

Zou, Xiaofeng ;

Li, Kenli ;

Li, Keqin ;

Yang, Xulei ;

Chen, Cen .

INFORMATION SCIENCES, 2021, 542 :476-491

[6] Spatio-temporal autocorrelation of road network data [J].

Cheng, Tao ;

Haworth, James ;

Wang, Jiaqiu .

JOURNAL OF GEOGRAPHICAL SYSTEMS, 2012, 14 (04) :389-413

[7] Inferring fine-grained transport modes from mobile phone cellular signaling data [J].

Chin, Kimberley ;

Huang, Haosheng ;

Horn, Christopher ;

Kasanicky, Ivan ;

Weibel, Robert .

COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2019, 77

[8]

Dosovitskiy A, 2020, ARXIV

[9] Listening to the City, Attentively: A Spatio-Temporal Attention-Boosted Autoencoder for the Short-Term Flow Prediction Problem [J].

Fiorini, Stefano ;

Ciavotta, Michele ;

Maurino, Andrea .

ALGORITHMS, 2022, 15 (10)

[10] 3D-CLoST: A CNN-LSTM Approach for Mobility Dynamics Prediction in Smart Cities [J].

Fiorini, Stefano ;

Pilotti, Giorgio ;

Ciavotta, Michele ;

Maurino, Andrea .

2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :3180-3189

← 1 2 3 4 5 6 →