Yellow Fever Vaccine Demand Forecasting With ARIMA, SARIMA, Linear Regression, and XGBoost

被引:0
|
作者
Sen, N. [1 ]
Temur, L. O. [2 ]
Atilla, D. C. [3 ]
机构
[1] Altinbas Univ, Inst Grad Studies, Elect & Comp Engn Dept, TR-34217 Istanbul, Turkiye
[2] Altinbas Univ, Inst Grad Studies, Data Analyt Dept, TR-34217 Istanbul, Turkiye
[3] Altinbas Univ, Inst Grad Studies, Data Analyt Dept, TR-34217 Istanbul, Turkiye
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Vaccines; Predictive models; Biological system modeling; Time series analysis; Demand forecasting; Machine learning; Linear regression; Data models; COVID-19; Accuracy; Yellow fever; machine learning; ensemble technique; ARIMA; SARIMA; linear regression; XGBoost; SELECTION;
D O I
10.1109/ACCESS.2024.3517652
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The demand for vaccines is significantly increasing in various countries due to heightened population mobility and the prevalence of epidemics. This study employed machine learning methods to predict optimal vaccine stock levels, aiming to prevent both shortages and oversupply, and to compare the effectiveness of these predictions. The data utilized in the prediction models were sourced from the General Directorate of Border and Coastal Health. This study analyzed a 21-year retrospective dataset collected between 2003 and 2023, which contains monthly vaccination coverage data. Four different methods commonly used in the literature were applied to estimate annual vaccine demand. Among these, the most widely utilized method was the Autoregressive Integrated Moving Average (ARIMA). Additionally, Seasonal Autoregressive Integrated Moving Average (SARIMA), Linear Regression, and XGBoost models are employed. Certain events, such as the COVID-19 pandemic, disrupt patterns within the dataset. In pruning tests, variations in data frequency within the raw dataset are analyzed. The models are evaluated using Root Mean Square Error (RMSE) and Mean Absolute Error (MAE). The entire dataset is then transformed to achieve stationarity. The models are re-evaluated after removing seasonality and white noise. Cross-validation is applied to the models that yield the most accurate predictions. The forecast results obtained from the optimized model serve as input for the Value at Risk (VaR) model. Actual, projected, and average vaccination numbers are presented with 95% and 99% confidence intervals (critical stock range) based on SARIMA, Linear Regresion and XGBoost estimates. Due to the vaccine forecast range balance, XGBoost's outputs are input into the Value at Risk (VaR) model and the cost risk related to the safe vaccine stock that may arise in the coming days is evaluated. Throughout the study, the conditions under which models can continue to learn effectively, as well as the rationale for selecting these models, can be monitored.
引用
收藏
页码:197557 / 197576
页数:20
相关论文
共 7 条
  • [1] Evaluation of a multiple linear regression model and SARIMA model in forecasting heat demand for district heating system
    Fang, Tingting
    Lahdelma, Risto
    APPLIED ENERGY, 2016, 179 : 544 - 552
  • [2] A comparative study of SIR Model, Linear Regression, Logistic Function and ARIMA Model for forecasting COVID-19 cases
    Abolmaali, Saina
    Shirzaei, Samira
    AIMS PUBLIC HEALTH, 2021, 8 (04): : 598 - 613
  • [3] Electricity Demand Prediction using Data Driven Forecasting Scheme: ARIMA and SARIMA for Real-Time Load Data of Assam
    Goswami, Kakoli
    Kandali, Aditya Bihar
    2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 570 - 574
  • [4] Forecasting ethanol demand in India to meet future blending targets: A comparison of ARIMA and various regression models
    Dey, Bishal
    Roy, Bidesh
    Datta, Subir
    Ustun, Taha Selim
    ENERGY REPORTS, 2023, 9 : 411 - 418
  • [5] Forecasting ethanol demand in India to meet future blending targets: A comparison of ARIMA and various regression models
    Dey, Bishal
    Roy, Bidesh
    Datta, Subir
    Ustun, Taha Selim
    ENERGY REPORTS, 2023, 9 : 411 - 418
  • [6] Forecasting the incidence of dengue fever in Malaysia: A comparative analysis of seasonal ARIMA, dynamic harmonic regression, and neural network models
    Mustaffa, Nurakmal Ahmad
    Zahari, Siti Mariam
    Farhana, Nor Alia
    Nasir, Noryanti
    Azil, Aishah Hani
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2024, 11 (01): : 20 - 31
  • [7] Short term electricity demand forecasting using partially linear additive quantile regression with an application to the unit commitment problem
    Lebotsa, Moshoko Emily
    Sigauke, Caston
    Bere, Alphonce
    Fildes, Robert
    Boylan, John E.
    APPLIED ENERGY, 2018, 222 : 104 - 118