Predicting Lake Erie wave heights and periods using XGBoost and LSTM

被引:67
作者
Hu, Haoguo [1 ]
van der Westhuysen, Andre J. [2 ]
Chu, Philip [3 ]
Fujisaki-Manome, Ayumi [1 ]
机构
[1] Univ Michigan, Cooperat Inst Great Lakes Res, Ann Arbor, MI 48109 USA
[2] NOAA, IMSG, Natl Ctr Environm Predict, Environm Modeling Ctr, College Pk, MD 20740 USA
[3] NOAA, Great Lakes Environm Res Lab, 2205 Commonwealth Blvd, Ann Arbor, MI 48105 USA
关键词
NEURAL-NETWORKS; MODEL; IMPLEMENTATION;
D O I
10.1016/j.ocemod.2021.101832
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
Waves in large lakes put coastal communities and vessels under threat, and accurate wave predictions are needed for early warnings. While physics-based numerical wave models such as WAVEWATCH III (WW3) are useful to provide spatial information to supplement in situ observations, they require intensive computational resources. An attractive alternative is machine learning (ML) methods, which can potentially improve the performance of numerical wave models, while only requiring a small fraction of the computational cost. In this study, we applied novel ML methods based on XGBoost and a Long Short-Term Memory (LSTM) recurrent neural network for predicting wave height and period under the near-idealized wave growth conditions of Lake Erie. Data sets of significant wave height (H), peak wave period (T-p) and surface wind from two offshore buoys from 1994 to 2017 were processed for model training and testing. We trained and validated the ML models with the data sets from 1994 to 2015, and then used the trained models to predict significant wave height and peak period for 2016 and 2017. The XGBoost model yielded the best overall performance, with Mean Absolute Percentage Error (MAPE) values of 15.6%-22.9% in.. and 8.3%-13.4% in T-p. The LSTM model yielded MAPE values of 23.4%-30.8% in H and 9.1%-13.6% in T-p. An unstructured grid WW3 applied to Lake Erie yielded MAPE values of 15.3%-21.0% in H and 12.5%-19.3% in T-p. However, WW3 underestimated H and T-p during strong wind events, with relative biases of -11.76% to -14.15% in H and -15.59% to -19.68% in T-p. XGBoost and LSTM improve on these predictions with relative biases of -2.56% to -10.61% in H and -8.08% to -10.13% in T-p. An ensemble mean of these three models yielded lower scatter scores than the members, with MAPE values of 13.3%-17.3% in H and 8.0%-13.0% in T-p, although it did not improve the bias. The ML models ran significantly faster than WW3: For this 2-year run on the same computing environment, WW3 needed 24 h with 60 CPUs, whereas the trained LSTM needed 0.24 s on 1 CPU, and the trained XGBoost needed only 0.03 s on 1 CPU.
引用
收藏
页数:23
相关论文
共 52 条
[1]  
Abhigna P, 2017, 2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), P1856, DOI 10.1109/ICCSP.2017.8286717
[2]   On-line wave prediction [J].
Agrawal, JD ;
Deo, MC .
MARINE STRUCTURES, 2002, 15 (01) :57-74
[3]   The Operational Implementation of a Great Lakes Wave Forecasting System at NOAA/NCEP [J].
Alves, Jose-Henrique G. M. ;
Chawla, Arun ;
Tolman, Hendrik L. ;
Schwab, David ;
Lang, Gregory ;
Mann, Greg .
WEATHER AND FORECASTING, 2014, 29 (06) :1473-1497
[4]  
[Anonymous], 2004, P 29 INT C COAST ENG
[5]   Semiempirical Dissipation Source Functions for Ocean Waves. Part I: Definition, Calibration, and Validation [J].
Ardhuin, Fabrice ;
Rogers, Erick ;
Babanin, Alexander V. ;
Filipot, Jean-Francois ;
Magne, Rudy ;
Roland, Aaron ;
van der Westhuysen, Andre ;
Queffeulou, Pierre ;
Lefevre, Jean-Michel ;
Aouf, Lotfi ;
Collard, Fabrice .
JOURNAL OF PHYSICAL OCEANOGRAPHY, 2010, 40 (09) :1917-1941
[6]  
Battjes J.A., 1978, P 16 INT C COAST ENG, P569, DOI [DOI 10.1061/9780872621909.034, 10.1061/9780872621909.034, DOI 10.9753/ICCE.V16.%P]
[7]   Explicit Precipitation-Type Diagnosis from a Model Using a Mixed-Phase Bulk Cloud-Precipitation Microphysics Parameterization [J].
Benjamin, Stanley G. ;
Brown, John M. ;
Smirnova, Tatiana G. .
WEATHER AND FORECASTING, 2016, 31 (02) :609-619
[8]   A North American Hourly Assimilation and Model Forecast Cycle: The Rapid Refresh [J].
Benjamin, Stanley G. ;
Weygandt, Stephen S. ;
Brown, John M. ;
Hu, Ming ;
Alexander, Curtis R. ;
Smirnova, Tatiana G. ;
Olson, Joseph B. ;
James, Eric P. ;
Dowell, David C. ;
Grell, Georg A. ;
Lin, Haidao ;
Peckham, Steven E. ;
Smith, Tracy Lorraine ;
Moninger, William R. ;
Kenyon, Jaymes S. ;
Manikin, Geoffrey S. .
MONTHLY WEATHER REVIEW, 2016, 144 (04) :1669-1694
[9]   A third-generation wave model for coastal regions - 1. Model description and validation [J].
Booij, N ;
Ris, RC ;
Holthuijsen, LH .
JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 1999, 104 (C4) :7649-7666
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32