Modeling and forecasting rainfall patterns in India: a time series analysis with XGBoost algorithm

被引:0
作者
Pradeep Mishra
Abdullah Mohammad Ghazi Al Khatib
Shikha Yadav
Soumik Ray
Achal Lama
Binita Kumari
Divya Sharma
Ramesh Yadav
机构
[1] Jawaharlal Nehru Krishi Vishwavidyalaya,College of Agriculture
[2] Damascus University,Department of Banking and Insurance, Faculty of Economics
[3] University of Delhi,Department of Geography, Miranda House
[4] Centurion University of Technology and Management,Department of Agricultural Economics
[5] ICAR,Department of Fisheries
[6] Indian Agricultural Statistics Research Institute,undefined
[7] Rashtriya Kisan (P.G.) College,undefined
[8] Central Institute of Coastal Engineering for Fisheries,undefined
[9] Ministry of Fisheries,undefined
[10] Animal Husbandry and Dairying,undefined
[11] Government of India,undefined
[12] Ministry of Micro,undefined
[13] Small and Medium Enterprises,undefined
来源
Environmental Earth Sciences | 2024年 / 83卷
关键词
Time series; ARIMA models; State space models; Machine learning; XGBoost; Rainfall; Forecasting; Water resource management; Agriculture; Hydroelectric power generation; Climate change; Environmental management;
D O I
暂无
中图分类号
学科分类号
摘要
This study utilizes time series analysis and machine learning techniques to model and forecast rainfall patterns across different seasons in India. The statistical models, i.e., autoregressive integrated moving average (ARIMA) and state space model and machine learning models, i.e., Support Vector Machine, Artificial Neural Network and Random Forest Model were developed and their performance was compared against XGBoost, an advanced machine learning algorithm, using training and testing datasets. The results demonstrate the superior accuracy of XGBoost compared to the statistical models in capturing complex nonlinear rainfall patterns. While ARIMA models tend to overfit the training data, state space models prove more robust to outliers in the testing set. Diagnostic checks show the models adequately capture the time series properties. The analysis indicates essential unchanging rainfall patterns in India for 2023–2027, with implications for water resource management and climate-sensitive sectors like agriculture and power generation. Overall, the study highlights the efficacy of modern machine learning approaches like XGBoost for forecasting complex meteorological time series. The framework presented enables rigorous validation and selection of optimal techniques. Further applications of such sophisticated data analysis can significantly enhance planning and research on the Indian monsoons amidst climate change challenges.
引用
收藏
相关论文
共 150 条
[1]  
Al Khatib AMG(2021)Modeling and forecasting of egg production in India using time series models Eurasian J Vet Sci 37 265-273
[2]  
Yonar H(2020)Streamflow forecasting by modeling the rainfall–streamflow relationship using artificial neural networks Model Earth Syst Environ 6 1645-1656
[3]  
Abotaleb M(2022)Rainfall modeling and prediction using neural networks: a case study of Maharashtra Disaster Adv 15 39-43
[4]  
Mishra P(2014)Prediction of rainfall using data mining technique over Assam Indian J Comput Sci Eng 5 85-90
[5]  
Yonar A(2023)Forecasting actual evapotranspiration without climatic data based on stacked integration of DNN and meta-heuristic models across China from 1958 to 2021 J Environ Manage 345 621-636
[6]  
Karakaya K(2008)Tourism in the Canary Islands: forecasting using several seasonal time series models J Forecast 27 1453-1469
[7]  
Badr A(2008)Trends in the rainfall pattern over India Intern J Climatol 28 53-58
[8]  
Dhaka V(2020)ARIMA and State-Space models for sugarcane ( J Appl Nat Sci 12 1-17
[9]  
Ali S(2020)) yield forecasting in Northern agro-climatic zone of Haryana Theoret Appl Climatol 141 1-13
[10]  
Shahbaz M(2011)Adaptive EEMD-ANN hybrid model for Indian summer monsoon rainfall forecasting J Geophys Res Atmos 116 39-46