Comparative Analysis of Rainfall Prediction Models Using Machine Learning in Islands with Complex Orography: Tenerife Island

被引:17
作者
Aguasca-Colomo, Ricardo [1 ]
Castellanos-Nieves, Dagoberto [2 ]
Mendez, Maximo [1 ]
机构
[1] ULPGC, Inst Univ SIANI, Las Palmas De Gc 35017, Spain
[2] ULL, Dept Ingn Informat & Sistemas, San Cristobal De La Lagu 38200, Spain
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 22期
关键词
classification algorithms; data processing; machine learning; computational methods; predictive models; rainfall forecasting; extreme gradient boosting (XGBoost); random forest (rf); WEATHER; CLASSIFICATION; CLIMATE;
D O I
10.3390/app9224931
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We present a comparative study between predictive monthly rainfall models for islands of complex orography using machine learning techniques. The models have been developed for the island of Tenerife (Canary Islands). Weather forecasting is influenced both by the local geographic characteristics as well as by the time horizon comprised. Accuracy of mid-term rainfall prediction on islands with complex orography is generally low when carried out with atmospheric models. Predictive models based on algorithms such as Random Forest or Extreme Gradient Boosting among others were analyzed. The predictors used in the models include weather predictors measured in two main meteorological stations, reanalysis predictors from the National Oceanic and Atmospheric Administration, and the global predictor North Atlantic Oscillation, all of them obtained over a period of time of more than four decades. When comparing the proposed models, we evaluated accuracy, kappa and interpretability of the model obtained, as well as the relevance of the predictors used. The results show that global predictors such as the North Atlantic Oscillation Index (NAO) have a very low influence, while the local Geopotential Height (GPH) predictor is relatively more important. Machine learning prediction models are a relevant proposition for predicting medium-term precipitation in similar geographical regions.
引用
收藏
页数:17
相关论文
共 65 条
[1]   A new approach for simulating and forecasting the rainfall-runoff process within the next two months [J].
Alizadeh, Mohamad Javad ;
Kavianpour, Mohamad Reza ;
Kisi, Ozgur ;
Nourani, Vahid .
JOURNAL OF HYDROLOGY, 2017, 548 :588-597
[2]  
Azevedo A., 2008, IADIS EUR C DAT MIN, P182
[3]   Data mining numerical model output for single-station cloud-ceiling forecast algorithms [J].
Bankert, Richard L. ;
Hadjimichael, Michael .
WEATHER AND FORECASTING, 2007, 22 (05) :1123-1131
[4]   Data Mining and Integration for Predicting Significant Meteorological Phenomena [J].
Bartok, Juraj ;
Habala, Ondrej ;
Bednar, Peter ;
Gazak, Martin ;
Hluchy, Ladislav .
ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01) :37-46
[5]  
Basak J, 2004, J MACH LEARN RES, V5, P239
[6]  
Bischl B, 2016, J MACH LEARN RES, V17
[7]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Semi-supervised graph-based hyperspectral image classification [J].
Camps-Valls, Gustavo ;
Bandos, Tatyana V. ;
Zhou, Dengyong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (10) :3044-3054
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)