Precipitation prediction in several Chinese regions using machine learning methods

被引:1
作者
Wang, Yuyao [1 ]
Pei, Lijun [1 ]
Wang, Jiachen [2 ]
机构
[1] Zhengzhou Univ, Sch Math & Stat, Zhengzhou 450001, Henan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Rainfall prediction; Machine learning; Linear regression; Random forest; Support vector regression; Bayesian ridge regression; SUPPORT VECTOR MACHINE; CLASSIFICATION;
D O I
10.1007/s40435-023-01250-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The severity of global climate change is exemplified by the significant increase in extreme precipitation events, leading to an urgent need for accurate rainfall prediction models to mitigate flood disasters that adversely affect economic and social development. With the rapid progress of machine learning in the big data era, novel solutions to regression problems are being proposed. In this paper, we try to construct and evaluate different rainfall prediction models based on specific humidity, relative humidity, horizontal and vertical water vapor flux, and lifting index as variables, using four classic machine learning algorithms: linear regression, random forest regression, support vector regression, and Bayesian ridge regression. The grid search method is employed for hyperparameter tuning, significantly improving the models' prediction accuracy and generalization ability. Evaluation of the predictive performance of the models on nine typical regions in China, including Zhengzhou, Beijing, and Chengdu, demonstrates that the random forest regression model has the highest predictive accuracy, with an average fitting degree of 0.8 or above, followed by support vector regression and Bayesian ridge regression models. Conversely, the linear regression model may have the poorest predictive performance. Therefore, the random forest regression model is recommended for future precipitation prediction, providing a valuable solution to various regression problems. The appropriate selection of variables for prediction and grid search for hyperparameter tuning are possibly the highlights of this paper.
引用
收藏
页码:1180 / 1196
页数:17
相关论文
共 43 条
[1]   Response surface analysis, clustering, and random forest regression of pressure in suddenly expanded high-speed aerodynamic flows [J].
Afzal, Asif ;
Aabid, Abdul ;
Khan, Ambareen ;
Khan, Sher Afghan ;
Rajak, Upendra ;
Verma, Tikendra Nath ;
Kumar, Rahul .
AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 107
[2]   Multi-model ensemble predictions of precipitation and temperature using machine learning algorithms [J].
Ahmed, Kamal ;
Sachindra, D. A. ;
Shahid, Shamsuddin ;
Iqbal, Zafar ;
Nawaz, Nadeem ;
Khan, Najeebullah .
ATMOSPHERIC RESEARCH, 2020, 236
[3]   A comparative study of several machine learning based non-linear regression methods in estimating solar radiation: Case studies of the USA and Turkey regions [J].
Alizamir, Meysam ;
Kim, Sungwon ;
Kisi, Ozgur ;
Zounemat-Kermani, Mohammad .
ENERGY, 2020, 197
[4]  
[Anonymous], 2008, The people of the Yangtze River, V9, P2931, DOI [10.16232/j.carolcarrollnki.10014179.2008.19.001, DOI 10.16232/J.CAROLCARROLLNKI.10014179.2008.19.001]
[5]  
Arora S, 2022, PR MACH LEARN RES
[6]   Isotopic measurements in water vapor, precipitation, and seawater during EUREC4A [J].
Bailey, Adriana ;
Aemisegger, Franziska ;
Villiger, Leonie ;
Los, Sebastian A. ;
Reverdin, Gilles ;
Melendez, Estefania Quinones ;
Acquistapace, Claudia ;
Baranowski, Dariusz B. ;
Bock, Tobias ;
Bony, Sandrine ;
Bordsdorff, Tobias ;
Coffman, Derek ;
de Szoeke, Simon P. ;
Diekmann, Christopher J. ;
Duetsch, Marina ;
Ertl, Benjamin ;
Galewsky, Joseph ;
Henze, Dean ;
Makuch, Przemyslaw ;
Noone, David ;
Quinn, Patricia K. ;
Roesch, Michael ;
Schneider, Andreas ;
Schneider, Matthias ;
Speich, Sabrina ;
Stevens, Bjorn ;
Thompson, Elizabeth J. .
EARTH SYSTEM SCIENCE DATA, 2023, 15 (01) :465-495
[7]  
Battey HS, 2022, Arxiv, DOI arXiv:2106.12001
[8]   Grid search in hyperparameter optimization of machine learning models for prediction of HIV/AIDS test results [J].
Belete D.M. ;
Huchaiah M.D. .
International Journal of Computers and Applications, 2022, 44 (09) :875-886
[9]  
Belete DM., 2022, Int J Comput Appl, V44
[10]   A machine learning model for drought tracking and forecasting using remote precipitation data and a standardized precipitation index from arid regions [J].
Bouaziz, Moncef ;
Medhioub, Emna ;
Csaplovisc, Elmar .
JOURNAL OF ARID ENVIRONMENTS, 2021, 189