Spatio-Temporal Transformer Network for Weather Forecasting

被引:1
作者
Ji, Junzhong [1 ]
He, Jing [1 ]
Lei, Minglong [1 ]
Wang, Muhua [2 ]
Tang, Wei [2 ]
机构
[1] Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] China Meteorol Adm, Publ Meteorol Serv Ctr, Beijing 100081, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Transformers; Meteorology; Forecasting; Weather forecasting; Feature extraction; Predictive models; Task analysis; spatio-temporal neural networks; meteorological graphs; transformers;
D O I
10.1109/TBDATA.2024.3378061
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spatio-temporal neural networks have been successfully applied to weather forecasting tasks recently. The key notion is to learn spatio-temporal features concurrently from spatial and temporal dependencies. Existing methods are mainly based on local smoothness assumptions where the features are learned by accumulating information in local spatio-temporal regions. However, the weather conditions in a certain spatio-temporal region are usually influenced by global meteorological changes and long-range historical weather conditions. Therefore, these methods that ignore the large-scale spatio-temporal effects can hardly learn effective features. In this paper, we propose a novel spatio-temporal Transformer network in weather forecasting to address the above challenges. The main idea is to leverage the Transformer architecture to carefully capture the multi-scale spatial and long-range temporal information in weather data. First, we propose to combine the global and local position encodings based on absolute geographic locations and relative geodesic distances and insert them into the spatial Transformer to extract the multi-scale spatial information in meteorological graphs. Then, we further capture the long-range temporal dependencies by a temporal Transformer where the attention mechanism is used to improve the representation ability and scalability of the models. Extensive experiments over real weather datasets demonstrate the effectiveness of our framework.
引用
收藏
页码:372 / 387
页数:16
相关论文
共 53 条
  • [1] Adams DK, 1997, B AM METEOROL SOC, V78, P2197, DOI 10.1175/1520-0477(1997)078<2197:TNAM>2.0.CO
  • [2] 2
  • [3] Review of the use of Numerical Weather Prediction (NWP) Models for wind energy assessment
    Al-Yahyai, Sultan
    Charabi, Yassine
    Gastli, Adel
    [J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2010, 14 (09) : 3192 - 3198
  • [4] IGAGCN: Information geometry and attention-based spatiotemporal graph convolutional networks for traffic flow prediction
    An, Jiyao
    Guo, Liang
    Liu, Wei
    Fu, Zhiqiang
    Ren, Ping
    Liu, Xinzhi
    Li, Tao
    [J]. NEURAL NETWORKS, 2021, 143 : 355 - 367
  • [5] [Anonymous], 2017, ARXIV170701926
  • [6] Bai L, 2020, ADV NEUR IN, V33
  • [7] Chen DX, 2022, PR MACH LEARN RES
  • [8] Hybrid machine intelligent SVR variants for wind forecasting and ramp events
    Dhiman, Harsh S.
    Deb, Dipankar
    Guerrero, Josep M.
    [J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2019, 108 : 369 - 379
  • [9] Gao Z., 2022, Advances in Neural Information Processing Systems, V35, P25390
  • [10] Short-term instead of long-term rainfall time series in rainwater harvesting simulation in houses: An assessment using Bayesian Network
    Geraldi, Matheus Soares
    Ghisi, Enedir
    [J]. RESOURCES CONSERVATION AND RECYCLING, 2019, 144 : 1 - 12