Yellow Fever Vaccine Demand Forecasting With ARIMA, SARIMA, Linear Regression, and XGBoost

被引：0

作者：

Sen, N. ^{[1
]}

Temur, L. O. ^{[2
]}

Atilla, D. C. ^{[3
]}

机构：

[1] Altinbas Univ, Inst Grad Studies, Elect & Comp Engn Dept, TR-34217 Istanbul, Turkiye

[2] Altinbas Univ, Inst Grad Studies, Data Analyt Dept, TR-34217 Istanbul, Turkiye

[3] Altinbas Univ, Inst Grad Studies, Data Analyt Dept, TR-34217 Istanbul, Turkiye

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Vaccines; Predictive models; Biological system modeling; Time series analysis; Demand forecasting; Machine learning; Linear regression; Data models; COVID-19; Accuracy; Yellow fever; machine learning; ensemble technique; ARIMA; SARIMA; linear regression; XGBoost; SELECTION;

D O I：

10.1109/ACCESS.2024.3517652

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The demand for vaccines is significantly increasing in various countries due to heightened population mobility and the prevalence of epidemics. This study employed machine learning methods to predict optimal vaccine stock levels, aiming to prevent both shortages and oversupply, and to compare the effectiveness of these predictions. The data utilized in the prediction models were sourced from the General Directorate of Border and Coastal Health. This study analyzed a 21-year retrospective dataset collected between 2003 and 2023, which contains monthly vaccination coverage data. Four different methods commonly used in the literature were applied to estimate annual vaccine demand. Among these, the most widely utilized method was the Autoregressive Integrated Moving Average (ARIMA). Additionally, Seasonal Autoregressive Integrated Moving Average (SARIMA), Linear Regression, and XGBoost models are employed. Certain events, such as the COVID-19 pandemic, disrupt patterns within the dataset. In pruning tests, variations in data frequency within the raw dataset are analyzed. The models are evaluated using Root Mean Square Error (RMSE) and Mean Absolute Error (MAE). The entire dataset is then transformed to achieve stationarity. The models are re-evaluated after removing seasonality and white noise. Cross-validation is applied to the models that yield the most accurate predictions. The forecast results obtained from the optimized model serve as input for the Value at Risk (VaR) model. Actual, projected, and average vaccination numbers are presented with 95% and 99% confidence intervals (critical stock range) based on SARIMA, Linear Regresion and XGBoost estimates. Due to the vaccine forecast range balance, XGBoost's outputs are input into the Value at Risk (VaR) model and the cost risk related to the safe vaccine stock that may arise in the coming days is evaluated. Throughout the study, the conditions under which models can continue to learn effectively, as well as the rationale for selecting these models, can be monitored.

引用

页码：197557 / 197576

页数：20

共 7 条

[1] Evaluation of a multiple linear regression model and SARIMA model in forecasting heat demand for district heating system
Fang, Tingting
Lahdelma, Risto
APPLIED ENERGY, 2016, 179 : 544 - 552
[2] A comparative study of SIR Model, Linear Regression, Logistic Function and ARIMA Model for forecasting COVID-19 cases
Abolmaali, Saina
Shirzaei, Samira
AIMS PUBLIC HEALTH, 2021, 8 (04): : 598 - 613
[3] Electricity Demand Prediction using Data Driven Forecasting Scheme: ARIMA and SARIMA for Real-Time Load Data of Assam
Goswami, Kakoli
Kandali, Aditya Bihar
2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 570 - 574
[4] Forecasting ethanol demand in India to meet future blending targets: A comparison of ARIMA and various regression models
Dey, Bishal
Roy, Bidesh
Datta, Subir
Ustun, Taha Selim
ENERGY REPORTS, 2023, 9 : 411 - 418
[5] Forecasting ethanol demand in India to meet future blending targets: A comparison of ARIMA and various regression models
Dey, Bishal
Roy, Bidesh
Datta, Subir
Ustun, Taha Selim
ENERGY REPORTS, 2023, 9 : 411 - 418
[6] Forecasting the incidence of dengue fever in Malaysia: A comparative analysis of seasonal ARIMA, dynamic harmonic regression, and neural network models
Mustaffa, Nurakmal Ahmad
Zahari, Siti Mariam
Farhana, Nor Alia
Nasir, Noryanti
Azil, Aishah Hani
INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2024, 11 (01): : 20 - 31
[7] Short term electricity demand forecasting using partially linear additive quantile regression with an application to the unit commitment problem
Lebotsa, Moshoko Emily
Sigauke, Caston
Bere, Alphonce
Fildes, Robert
Boylan, John E.
APPLIED ENERGY, 2018, 222 : 104 - 118

← 1 →