TADA: Trend Alignment with Dual-Attention Multi-Task Recurrent Neural Networks for Sales Prediction

Cited by: 52
Authors
Chen, Tong [1]
Yin, Hongzhi [1]
Chen, Hongxu [1]
Wu, Lin [1, 2]
Wang, Hao [3]
Zhou, Xiaofang [1]
Li, Xue [1, 4]
Affiliations
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
[3] Alibaba AI Labs, Hangzhou, Zhejiang, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Nanjing, Jiangsu, Peoples R China
Source
2018 IEEE International Conference on Data Mining (ICDM) | 2018
Keywords
Sales Prediction; Time Series Data; Deep Neural Network
DOI
10.1109/ICDM.2018.00020
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
As a common strategy in sales-supply chains, sales volume prediction gives companies valuable information for balancing supply and demand. In practice, sales prediction is formulated as a time series prediction problem that aims to predict the future sales volume of different products from observations of various influential factors (e.g., brand, season, discount) and the corresponding historical sales records. However, as contemporary commercial markets develop, the dynamic interactions between influential factors with different semantic meanings become more subtle, making it challenging to fully capture the dependencies among these variables. Moreover, although finding similar trends in the history improves the accuracy of upcoming sales predictions, existing methods are poorly suited to sales prediction because the trends in sales time series are more irregular and complex. Hence, we draw on the encoder-decoder recurrent neural network (RNN) structure and propose a novel framework named TADA, which carries out trend alignment with dual-attention, multi-task RNNs for sales prediction. In TADA, we divide the influential factors into internal features and external features, which are jointly modelled by a multi-task RNN encoder. In the decoding stage, TADA uses two attention mechanisms to compensate for the unknown future states of the influential factors and to adaptively align the upcoming trend with relevant historical trends, ensuring precise sales prediction. Experimental results on two real-world datasets show that TADA outperforms other state-of-the-art competitors in sales prediction tasks.
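The attention step mentioned in the abstract can be illustrated with a minimal sketch. This is generic dot-product soft attention over encoder hidden states, not TADA's actual formulation, which the abstract does not specify. The names `attend`, `encoder_states`, and `query`, and the random inputs, are assumptions made purely for illustration.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def attend(query, encoder_states):
    """Generic soft attention (an assumption, not the paper's exact mechanism).

    query:          (d,)   current decoder state
    encoder_states: (T, d) one hidden state per historical time step
    Returns the attention weights (T,) and the context vector (d,).
    """
    scores = encoder_states @ query      # similarity of each past step to the query
    weights = softmax(scores)            # non-negative, sums to 1
    context = weights @ encoder_states   # weighted average of the past states
    return weights, context


# Hypothetical toy data: 6 historical time steps, hidden size 4.
rng = np.random.default_rng(0)
encoder_states = rng.normal(size=(6, 4))
query = rng.normal(size=4)

weights, context = attend(query, encoder_states)
```

Here `weights` indicates which historical steps the decoder focuses on, and `context` is the summary it would consume when producing the next prediction.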
Pages: 49-58
Page count: 10