Building a Lucy hybrid model for grocery sales forecasting based on time series

Cited by: 0
Authors
Duy Thanh Tran
Jun-Ho Huh
Jae-Hwan Kim
Affiliations
[1] University of Economics and Law, Faculty of Information Systems
[2] Vietnam National University Ho Chi Minh City, Department of Data Informatics
[3] (National) Korea Maritime and Ocean University, Department of Data Science
[4] (National) Korea Maritime and Ocean University
Source
The Journal of Supercomputing | 2023 / Vol. 79
Keywords
Lucy hybrid; Hybrid model; Forecast; Grocery sales; Machine learning; Time series; Trend
DOI
Not available
Abstract
Time series data are now used in many fields, such as economics, medicine, biology, science, society, nature, and the environment, and most typically in weather forecasting. Time series analysis is a set of methods, formulas, and models that help us analyze time series data, extract potentially valuable information, capture historical fluctuations, and support forecasts of the future value of the research object. Many time series models and methods have recently been researched and improved for trend analysis and forecasting. Techniques for processing time series data include linear regression with the two features unique to time series, lags and time steps; trend, to model long-term changes with moving averages and a time dummy; seasonality, to create indicators, with Fourier features to capture periodic change; and time series as features, to predict the future from the past with a lag embedding. In this article, we build a new hybrid model called Lucy Hybrid that provides the full set of steps in the machine learning process, including data pre-processing, model training, and model evaluation with Mean Square Error (MSE), Root-Mean-Square Error (RMSE), and Mean Absolute Error (MAE) to compare models and select the best one. The model also provides functions for saving and loading models so that researchers can reuse them and save training time. Lucy Hybrid also supports trend and forecast functions for time series data. We experiment with a large dataset of more than 3,000,000 records from a large Ecuador-based grocery retailer, using Linear Regression, Elastic Net, Lasso, Ridge, Extra Trees Regressor, Random Forest Regressor, K-Neighbors Regressor, MLP Regressor, and XGB Regressor to create 20 Lucy Hybrid sample models, and we publish the full source code for researchers to use and extend the model.
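
The feature types and error metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' Lucy Hybrid code: the function names (`time_dummy`, `lag_feature`, `fourier_features`, `fit_line`) and the toy `sales` numbers are assumptions made up for illustration, and only the Python standard library is used. It builds a time-step feature, a lag feature, and Fourier features, fits a straight-line trend on the time dummy by ordinary least squares, and scores the fit with MSE, RMSE, and MAE.

```python
import math

def time_dummy(n):
    """Time-step feature: 0, 1, ..., n-1."""
    return list(range(n))

def lag_feature(series, k=1):
    """Series shifted by k steps; the first k entries have no past value."""
    return [None] * k + list(series[:len(series) - k])

def fourier_features(n, period, order):
    """sin/cos pairs for harmonics 1..order of the given period."""
    rows = []
    for t in range(n):
        row = []
        for h in range(1, order + 1):
            angle = 2 * math.pi * h * t / period
            row += [math.sin(angle), math.cos(angle)]
        rows.append(row)
    return rows

def fit_line(x, y):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

def mse(y_true, y_pred):
    """Mean of squared errors."""
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Square root of the MSE, in the units of the target."""
    return math.sqrt(mse(y_true, y_pred))

def mae(y_true, y_pred):
    """Mean of absolute errors."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)

# Toy series (made-up numbers): a linear trend plus a period-4 wiggle.
sales = [10 + 2 * t + [0, 1, 0, -1][t % 4] for t in range(12)]
steps = time_dummy(len(sales))
slope, intercept = fit_line(steps, sales)
trend = [slope * s + intercept for s in steps]

print("slope:", round(slope, 3), "intercept:", round(intercept, 3))
print("MSE:", round(mse(sales, trend), 3),
      "RMSE:", round(rmse(sales, trend), 3),
      "MAE:", round(mae(sales, trend), 3))
print("lag-1:", lag_feature(sales, 1)[:4])
print("Fourier row at t=1:", [round(v, 3) for v in fourier_features(12, 4, 1)[1]])
```

In a fuller pipeline the lag and Fourier columns would be passed to a regressor alongside the time dummy; rows whose lag value is `None` would be dropped or filled before training.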
Pages: 4048 - 4083
Number of pages: 35