Achieving Sales Forecasting with Higher Accuracy and Efficiency: A New Model Based on Modified Transformer

被引:5
作者
Li, Qianying [1 ]
Yu, Mingyang [1 ]
机构
[1] Shanghai Jiao Tong Univ, Antai Coll Econ & Management, Shanghai 200030, Peoples R China
关键词
sales forecasting; higher accuracy and efficiency; model; multi-head attention mechanisms; Transformer; deep learning; TIME-SERIES; NEURAL-NETWORKS;
D O I
10.3390/jtaer18040100
中图分类号
F [经济];
学科分类号
02 ;
摘要
With the exponential expansion of e-commerce, an immense volume of historical sales data has been generated and amassed. This influx of data has created an opportunity for more accurate sales forecasting. While various sales forecasting methods and models have been applied in practice, existing ones often struggle to fully harness sales data and manage significant fluctuations. As a result, they frequently fail to make accurate predictions, falling short of meeting enterprise needs. Therefore, it is imperative to explore new models to enhance the accuracy and efficiency of sales forecasting. In this paper, we introduce a model tailored for sales forecasting based on a Transformer with encoder-decoder architecture and multi-head attention mechanisms. We have made specific modifications to the standard Transformer model, such as removing the Softmax layer in the last layer and adapting input embedding, position encoding, and feedforward network components to align with the unique characteristics of sales forecast data and the specific requirements of sales forecasting. The multi-head attention mechanism in our proposed model can directly compute the dot product results in a single step, addressing long-term time-dependent computation challenges while maintaining lower time complexity and greater interpretability. This enhancement significantly contributes to improving the model's accuracy and efficiency. Furthermore, we provide a comprehensive formula representation of the model for the first time, facilitating better understanding and implementation. We conducted experiments using sales datasets that incorporate various factors influencing sales forecasts, such as seasons, holidays, and promotions. The results demonstrate that our proposed model significantly outperforms seven selected benchmark methods, reducing RMSLE, RMSWLE, NWRMSLE, and RMALE by approximately 48.2%, 48.5%, 45.2, and 63.0%, respectively. Additionally, ablation experiments on the multi-head attention and the number of encoder-decoders validate the rationality of our chosen model parameters.
引用
收藏
页码:1990 / 2006
页数:17
相关论文
共 50 条
[41]   Tourism demand forecasting: a deep learning model based on spatial-temporal transformer [J].
Chen, Jiaying ;
Li, Cheng ;
Huang, Liyao ;
Zheng, Weimin .
TOURISM REVIEW, 2025, 80 (03) :648-663
[42]   A Hybrid Neural Network Model for Sales Forecasting Based on ARIMA and Search Popularity of Article Titles [J].
Omar, Hani ;
Van Hai Hoang ;
Liu, Duen-Ren .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
[43]   Investigating market efficiency through a forecasting model based on differential equations [J].
de Resende, Charlene C. ;
Pereira, Adriano C. M. ;
Cardoso, Rodrigo T. N. ;
de Magalhaes, A. R. Bosco .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 474 :199-212
[44]   Forecasting New Energy Vehicle Sales in China Based on a Novel Grey Lotka-Volterra Model and Assessing Its Environmental Impact [J].
Qian, Wuyong ;
Zou, Tingting ;
Wang, Yuhong ;
Ji, Chunyi ;
Ran, Minghao .
JOURNAL OF GREY SYSTEM, 2023, 35 (03) :82-99
[45]   Enhancing accuracy in point-interval load forecasting: A new strategy based on data augmentation, customized deep learning, and weighted linear error correction [J].
Liu, Weican ;
Tian, Zhirui ;
Qiu, Yuyan .
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
[46]   Flare Set-Prediction Transformer: A Transformer-Based Set-Prediction Model for Detailed Solar Flare Forecasting [J].
Qiao, Liang ;
Qin, Gang .
UNIVERSE, 2025, 11 (06)
[47]   Multi-step tap-water quality forecasting in South Korea with transformer-based deep learning model [J].
Cai, Danqi ;
Chen, Kunwei ;
Lin, Zhizhe ;
Zhou, Jinglin ;
Mo, Xinyue ;
Zhou, Teng .
URBAN WATER JOURNAL, 2024, 21 (09) :1109-1120
[48]   A Hybrid Model Based on Improved Transformer and Graph Convolutional Network for COVID-19 Forecasting [J].
Li, Yulan ;
Ma, Kun .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (19)
[49]   New Graph-Based and Transformer Deep Learning Models for River Dissolved Oxygen Forecasting [J].
Rocha, Paulo Alexandre Costa ;
Santos, Victor Oliveira ;
The, Jesse Van Griensven ;
Gharabaghi, Bahram .
ENVIRONMENTS, 2023, 10 (12)
[50]   A Sales Forecasting Model for New-Released and Short-Term Product: A Case Study of Mobile Phones [J].
Hwang, Seongbeom ;
Yoon, Goonhu ;
Baek, Eunjung ;
Jeon, Byoung-Ki .
ELECTRONICS, 2023, 12 (15)