SWAN: A multihead autoregressive attention model for solar wind speed forecasting

被引:6
作者
Cobos-Maestre, Mario [1 ]
Flores-Soriano, Manuel [2 ]
Barrero, David F. [1 ]
机构
[1] Univ Alcala, Dept Automat, Alcala De Henares 28805, Madrid, Spain
[2] Univ Alcala, Dept Fis & Matemat, Space Weather Res Grp, Alcala De Henares 28801, Madrid, Spain
关键词
Self-attention; Time series regression; Space weather; Solar wind speed forecasting; SPACE PHYSICS;
D O I
10.1016/j.eswa.2024.124128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High speed streams of solar wind can impact Earth's magnetic environment. Current state of the art geomagnetic effect forecasting methods can predict perturbations up to two hours in advance by using solar wind measurements made in the L1 Lagrange point. By forecasting these solar wind parameters, it should be possible to extend the time range for which said methods can be used. We present Solar Wind Attention Network (SWAN), an attention-based autoregressive model for the forecasting of daily average bulk speed values. We apply a novel automatic design method to create its architecture by developing a pipeline of hyperparameter optimization techniques as a proof of concept. Usage of this pipeline leads to a robust, reliable architecture which we empirically verify the convergence of in a variety of problem configurations regarding solar wind speed forecasting. An improvement in terms of root mean squared error (RMSE) is found when compared to WindNet and convolutional neural networks (CNNs) expanding it, even when SWAN does not incorporate images of the solar disk into the input. SWAN's RMSE of 68.79 +/- 1.05 km s -1 with a lead time of three days compares to RMSE values in the order of 80.28 +/- 3.05, 76.30 +/- 1.87 and 71.93 +/- 0.40 km s -1 . From these results, we discuss the possibility of incorporating both autoregressive and image information as input to enhance predictive capability of newly developed models. Furthermore, we analyze the resulting model's behavior to characterize its main error sources, and find its performance to be at its best in scenarios not involving newly formed coronal holes or coronal mass ejections. It is found that this behavior matches well with a priori expectations derived from the underlying physics. Finally, we briefly discuss the need for an in-depth study of different evaluation metrics to provide a more thorough understanding of modeling results on the field.
引用
收藏
页数:13
相关论文
共 44 条
[1]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[3]   Attention-Based Machine Vision Models and Techniques for Solar Wind Speed Forecasting Using Solar EUV Images [J].
Brown, Edward J. E. ;
Svoboda, Filip ;
Meredith, Nigel P. ;
Lane, Nicholas ;
Horne, Richard B. .
SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2022, 20 (03)
[4]   Forecasting High-Speed Solar Wind Streams Based on Solar Extreme Ultraviolet Images [J].
Bu, X. ;
Luo, B. ;
Shen, C. ;
Liu, S. ;
Gong, J. ;
Cao, Y. ;
Wang, H. .
SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2019, 17 (07) :1040-1058
[5]   Deep Neural Networks With Convolutional and LSTM Layers for SYM-H and ASY-H Forecasting [J].
Collado-Villaverde, Armando ;
Munoz, Pablo ;
Cid, Consuelo .
SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2021, 19 (06)
[6]   Mean Absolute Percentage Error for regression models [J].
de Myttenaere, Arnaud ;
Golden, Boris ;
Le Grand, Benedicte ;
Rossi, Fabrice .
NEUROCOMPUTING, 2016, 192 :38-48
[7]   EIT: Extreme-ultraviolet imaging telescope for the SOHO mission [J].
Delaboudiniere, JP ;
Artzner, GE ;
Brunaud, J ;
Gabriel, AH ;
Hochedez, JF ;
Millier, F ;
Song, XY ;
Au, B ;
Dere, KP ;
Howard, RA ;
Kreplin, R ;
Michels, DJ ;
Moses, JD ;
Defise, JM ;
Jamar, C ;
Rochus, P ;
Chauvineau, JP ;
Marioge, JP ;
Catura, RC ;
Lemen, JR ;
Shing, L ;
Stern, RA ;
Gurman, JB ;
Neupert, WM ;
Maucherat, A ;
Clette, F ;
Cugnon, P ;
VanDessel, EL .
SOLAR PHYSICS, 1995, 162 (1-2) :291-312
[8]  
Fleck B., 1995, Coronal Magnetic Energy Releases. Proceedings of the CESRA Workshop, P233
[9]   Multiple-Hour-Ahead Forecast of the Dst Index Using a Combination of Long Short-Term memory Neural Network and Gaussian Process [J].
Gruet, M. A. ;
Chandorkar, M. ;
Sicard, A. ;
Camporeale, E. .
SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2018, 16 (11) :1882-1896
[10]   NEURAL-NET FORECASTING FOR GEOMAGNETIC-ACTIVITY [J].
HERNANDEZ, JV ;
TAJIMA, T ;
HORTON, W .
GEOPHYSICAL RESEARCH LETTERS, 1993, 20 (23) :2707-2710