Building a Lucy hybrid model for grocery sales forecasting based on time series

Cited by: 0
Authors
Duy Thanh Tran
Jun-Ho Huh
Jae-Hwan Kim
Affiliations
[1] University of Economics and Law, Faculty of Information Systems
[2] Vietnam National University Ho Chi Minh City, Department of Data Informatics
[3] (National) Korea Maritime and Ocean University, Department of Data Science
[4] (National) Korea Maritime and Ocean University
Source
The Journal of Supercomputing | 2023 / Vol. 79
Keywords
Lucy hybrid; Hybrid model; Forecast; Grocery sales; Machine learning; Time series; Trend
DOI
Not available
Abstract
Time series data are now used in many fields, including economics, medicine, biology, science, society, nature, and the environment, with weather forecasting as a typical example. Time series analysis comprises the methodological formulas and models that help us analyze such data, extract potentially valuable information, capture historical fluctuations, and support forecasts of the future value of the object under study. Many models and methods of time series analysis have been researched and improved for trend analysis and forecasting. Techniques for processing time series data include linear regression with the two features unique to time series, lags and time steps; trend, which models long-term changes with moving averages and a time dummy; seasonality, which creates indicators; Fourier features, which capture periodic change; and time series as features, which predict the future from the past with a lag embedding. In this article, we build a new hybrid model called Lucy Hybrid that covers the full machine learning process: data pre-processing, model training, and model evaluation with Mean Square Error (MSE), Root-Mean-Square Error (RMSE), and Mean Absolute Error (MAE) to compare models and select the best one. The model also provides functions for saving and loading models, so that researchers can reuse them and save training time. Lucy Hybrid also supports trend and forecast functions for time series data. We experiment with a large dataset of more than 3,000,000 records from a large Ecuador-based grocery retailer, using Linear Regression, Elastic Net, Lasso, Ridge, Extra Trees Regressor, Random Forest Regressor, K-Neighbors Regressor, MLP Regressor, and XGB Regressor to create 20 Lucy Hybrid sample models, and we publish the full source code for researchers to use and extend.
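The general idea the abstract describes (a trend component fitted on a time-step feature, a second component fitted on lag features, and evaluation with MSE, RMSE, and MAE) can be illustrated with a minimal sketch. This is not the authors' Lucy Hybrid code: the function names `fit_line`, `metrics`, and `hybrid_forecast`, the two-stage residual design, the use of a single lag, and the synthetic sales series are all assumptions made for illustration, and both stages use plain one-feature least squares instead of the regressors listed in the abstract.

```python
import math

def fit_line(x, y):
    """Ordinary least squares for y ~ a + b*x with one feature; returns (a, b)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx if sxx else 0.0
    return my - b * mx, b

def metrics(y_true, y_pred):
    """Mean Square Error, Root-Mean-Square Error, and Mean Absolute Error."""
    errs = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(e * e for e in errs) / len(errs)
    mae = sum(abs(e) for e in errs) / len(errs)
    return {"MSE": mse, "RMSE": math.sqrt(mse), "MAE": mae}

def hybrid_forecast(sales, horizon):
    """Two-stage sketch (an assumed design, not the paper's).

    Stage 1 fits a linear trend on the time step.
    Stage 2 fits the trend residuals on their own lag-1 value.
    Returns (trend_only_forecast, hybrid_forecast) for the next `horizon` steps.
    """
    steps = list(range(len(sales)))
    a, b = fit_line(steps, sales)                      # time-step feature
    resid = [s - (a + b * t) for s, t in zip(sales, steps)]
    c, d = fit_line(resid[:-1], resid[1:])             # lag-1 feature
    trend_only, hybrid = [], []
    last = resid[-1]
    for h in range(horizon):
        t = len(sales) + h
        last = c + d * last                            # recursive residual forecast
        trend_only.append(a + b * t)
        hybrid.append(a + b * t + last)
    return trend_only, hybrid

# Synthetic series (hypothetical data): linear growth plus an alternating pattern.
series = [100 + 2 * t + 5 * (-1) ** t for t in range(40)]
train, test = series[:30], series[30:]

trend_pred, hybrid_pred = hybrid_forecast(train, horizon=len(test))
trend_scores = metrics(test, trend_pred)
hybrid_scores = metrics(test, hybrid_pred)
print("trend only:", trend_scores)
print("hybrid:    ", hybrid_scores)
```

Comparing the two score dictionaries shows how the three error measures can be used to choose between candidate models, which is the role the abstract assigns to them.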
Pages: 4048-4083
Page count: 35
Related papers
50 in total
  • [41] Hybrid Approach for Time Series Forecasting Based on ANFIS and Fuzzy Cognitive Maps
    Averkin, Alexey N.
    Yarushev, Sergey
    PROCEEDINGS OF 2017 XX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2017, : 379 - 381
  • [42] LSTM Based Time Series Forecasting of Noisy Signals
    Getu, Beza Negash
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, ACIIDS 2024, 2024, 2145 : 133 - 146
  • [43] A Hybrid Model for Monthly Precipitation Time Series Forecasting Based on Variational Mode Decomposition with Extreme Learning Machine
    Li, Guohui
    Ma, Xiao
    Yang, Hong
    INFORMATION, 2018, 9 (07)
  • [44] Traffic flow forecasting based on a hybrid model
    Wang, Chao
    Ye, Zhirui
    JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 20 (05) : 428 - 437
  • [45] A moving-average filter based hybrid ARIMA-ANN model for forecasting time series data
    Babu, C. Narendra
    Reddy, B. Eswara
    APPLIED SOFT COMPUTING, 2014, 23 : 27 - 38
  • [46] A Hybrid Neural Network Model for Sales Forecasting Based on ARIMA and Search Popularity of Article Titles
    Omar, Hani
    Van Hai Hoang
    Liu, Duen-Ren
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [47] Forecasting hepatitis epidemic situation by applying the time series model
    Department of Epidemiology and Health Statistics, School of Public Health, Hebei United University, No. 57, Jianshe South Road, Tangshan, Hebei 063000, China
    Unknown
    Unknown
    Int. J. Simul. Process Model., 1-2 (42-49): : 42 - 49
  • [48] Wind Power Prediction Based on a Hybrid Granular Chaotic Time Series Model
    Wang, Yanyang
    Xiong, Wei
    E, Shiping
    Liu, Qingguo
    Yang, Nan
    Fu, Ping
    Gong, Kang
    Huang, Yu
    FRONTIERS IN ENERGY RESEARCH, 2022, 9
  • [49] A hybrid support vector regression for time series forecasting
    Xiang, Ling
    Zhu, Yongli
    Tang, Gui-ji
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 4, PROCEEDINGS, 2009, : 161 - 165
  • [50] CNN-N-BEATS: Novel Hybrid Model for Time-Series Forecasting
    Aiwansedo, Konstandinos
    Bosche, Jerome
    Badreddine, Wafa
    Kermia, M. H.
    Djadane, Oussama
    DEEP LEARNING THEORY AND APPLICATIONS, PT I, DELTA 2024, 2024, 2171 : 38 - 57