Building a Lucy hybrid model for grocery sales forecasting based on time series

Cited by: 0
Authors
Duy Thanh Tran
Jun-Ho Huh
Jae-Hwan Kim
Affiliations
[1] University of Economics and Law, Faculty of Information Systems
[2] Vietnam National University Ho Chi Minh City, Department of Data Informatics
[3] (National) Korea Maritime and Ocean University, Department of Data Science
[4] (National) Korea Maritime and Ocean University
Source
The Journal of Supercomputing | 2023 / Vol. 79
Keywords
Lucy hybrid; Hybrid model; Forecast; Grocery sales; Machine learning; Time series; Trend
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Time series data are now used in many fields, including economics, medicine, biology, the natural and social sciences, environmental studies, and, most typically, weather forecasting. Time series analysis provides the formulas and models needed to analyze such data, extract potentially valuable information, capture historical fluctuations, and support forecasts of the future values of the object under study. Many models and methods for time series analysis have been researched and improved in recent years for trend analysis and forecasting. Techniques for processing time series data include linear regression with two features unique to time series (lags and time steps); trend modeling of long-term changes with moving averages and a time dummy; seasonal indicators to model seasonality; Fourier features to capture periodic change; and the use of the series itself as a feature, predicting the future from the past with a lag embedding. In this article, we build a new hybrid model called Lucy Hybrid that covers every step of the machine learning process: data pre-processing, model training, and model evaluation with Mean Squared Error (MSE), Root-Mean-Square Error (RMSE), and Mean Absolute Error (MAE), so that models can be compared and the best one selected. The model also provides functions for saving and loading trained models, so that researchers can reuse them and save training time. Lucy Hybrid also supports trend analysis and forecasting for time series data. We experiment with a dataset of more than 3,000,000 records from a large Ecuador-based grocery retailer, using Linear Regression, Elastic Net, Lasso, Ridge, Extra Trees Regressor, Random Forest Regressor, K-Neighbors Regressor, MLP Regressor, and XGB Regressor to create 20 sample Lucy Hybrid models, and we publish the full source code so that researchers can use and extend the model.
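The abstract does not say how Lucy Hybrid combines its component models, so the following is only a minimal sketch of one common two-stage reading of "hybrid": a linear trend fitted on a time dummy, followed by a second model fitted on the residuals using a lag-1 feature, with both scored by MSE, RMSE, and MAE. The function names (`fit_line`, `fit_hybrid`, `predict_in_sample`) and the synthetic `sales` series are hypothetical and are not taken from the paper or its published source code.

```python
import math

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x, in closed form."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx if sxx else 0.0
    return my - b * mx, b

def mse(y_true, y_pred):
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    return math.sqrt(mse(y_true, y_pred))

def mae(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def fit_hybrid(sales):
    """Stage 1: trend on a time dummy. Stage 2: lag-1 regression on residuals."""
    steps = list(range(len(sales)))            # time dummy 0, 1, 2, ...
    a, b = fit_line(steps, sales)
    resid = [y - (a + b * t) for t, y in zip(steps, sales)]
    c, d = fit_line(resid[:-1], resid[1:])     # residual_t ~ residual_{t-1}
    return {"a": a, "b": b, "c": c, "d": d}

def predict_in_sample(model, sales):
    """One-step-ahead fitted values for t = 1..n-1 (t = 0 has no lag)."""
    preds = []
    for t in range(1, len(sales)):
        trend_t = model["a"] + model["b"] * t
        prev_resid = sales[t - 1] - (model["a"] + model["b"] * (t - 1))
        preds.append(trend_t + model["c"] + model["d"] * prev_resid)
    return preds

# Synthetic data (an assumption for illustration): upward trend plus an
# alternating +5 / -5 pattern that the trend alone cannot capture.
sales = [100 + 2 * t + (5 if t % 2 == 0 else -5) for t in range(20)]

model = fit_hybrid(sales)
actual = sales[1:]
preds = predict_in_sample(model, sales)
trend_preds = [model["a"] + model["b"] * t for t in range(1, len(sales))]

for name, p in (("trend only", trend_preds), ("hybrid", preds)):
    print(name, mse(actual, p), rmse(actual, p), mae(actual, p))
```

Comparing the two printed rows shows the kind of model comparison the abstract describes: the same three error metrics are computed for each candidate and the lower-error model is preferred. A real experiment would evaluate on held-out data rather than in-sample fits.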
Pages: 4048 - 4083
Page count: 35
Related papers
50 in total
  • [31] A Hybrid Model of ARIMA and ANN with Discrete Wavelet Transform for Time Series Forecasting
    Pannakkong, Warut
    Huynh, Van-Nam
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2017), 2017, 10571 : 159 - 169
  • [32] A Differentiated DBN Model Based on CRBM for Time Series Forecasting
    Wang, Siyuan
    Liu, Yong
    Zhang, Xu
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1926 - 1931
  • [33] A hybrid time-series forecasting model using extreme learning machines
    Pan, F.
    Zhang, H.
    Xia, M.
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 933 - 936
  • [34] HYBRID MODEL OF MARIMA AND ANNBP FOR FORECASTING SALES TAX IN EGYPT
    Abdelaal, Medhat M. A.
    Moustafa, Moustafa Galal
    Aglan, Dalia Mohamed Samy
    ADVANCES AND APPLICATIONS IN STATISTICS, 2013, 37 (01) : 37 - 71
  • [35] The Building of Market Potential Forecasting Model in Emerging Countries Based on Time Series-Taking Singapore as an Example
    Ye Chunyan
    Yin Chao
    STRATEGY IN EMERGING MARKETS: MANAGEMENT, FINANCE AND SUSTAINABLE DEVELOPMENT, 2014, : 352 - 356
  • [36] A hybrid VMD-BiGRU model for rubber futures time series forecasting
    Zhu, Qing
    Zhang, Fan
    Liu, Shan
    Wu, Yiqiong
    Wang, Lin
    APPLIED SOFT COMPUTING, 2019, 84
  • [37] Theta Autoregressive Neural Network: A Hybrid Time Series Model for Pandemic Forecasting
    Bhattacharyya, Arinjita
    Chattopadhyay, Swarup
    Pattnaik, Monalisha
    Chakraborty, Tanujit
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [38] Hybrid Fuzzy Time Series Methods Applied to Solar Radiation Forecasting
    Olcan, Ceyda
    Mahmudov, Elimhan
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2016, 27 (01) : 89 - 116
  • [39] Time series forecasting of domestic shipping market: comparison of SARIMAX, ANN-based models and SARIMAX-ANN hybrid model
    Fiskin, Cemile Solak
    Turgut, Ozgu
    Westgaard, Sjur
    Cerit, A. Guldem
    INTERNATIONAL JOURNAL OF SHIPPING AND TRANSPORT LOGISTICS, 2022, 14 (03) : 193 - 221
  • [40] Hybrid Method for Short-Term Time Series Forecasting Based on EEMD
    Yang, Yujun
    Yang, Yimei
    IEEE ACCESS, 2020, 8 : 61915 - 61928