Dynamic pricing of hotel rooms based on reinforcement learning with unknown demand distribution

被引:0
作者
Zhu H. [1 ]
Zhang M. [1 ]
Tang J. [1 ]
机构
[1] School of Management Science and Engineering, Dongbei University of Finance and Economics, Dalian
来源
Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice | 2023年 / 43卷 / 02期
基金
中国国家自然科学基金;
关键词
dynamic pricing; reinforcement learning; revenue management; SARSA(λ) algorithm;
D O I
10.12011/SETP2022-1705
中图分类号
学科分类号
摘要
Traditional hotel dynamic pricing research always considers improving demand forecasting methods or considers that the demand environment is known, while the demand distribution in real life is usually unknown. In this paper, we established a multi-period dynamic pricing model for hotel rooms based on Markov decision process with unknown demand distribution, and used the reinforcement learning method to propose improved algorithms based on SARSA(λ) to solve the dynamic pricing model of rooms. In order to improve the solving ability and convergence speed of the algorithm, we proposed the ε-SARSA(λ) algorithm based on the improved ε-greedy strategy and the ISA-SARSA(λ) algorithm based on the improved simulated annealing strategy. Through numerical experiments, the revenue optimization results of the four algorithms, SARSA(λ), ε-SARSA(λ), SA-SARSA(λ) and ISA-SARSA(λ), were compared. The study results verify the effectiveness of improved algorithms and show that the ISA-SARSA(λ) algorithm has the best solution performance. © 2023 Systems Engineering Society of China. All rights reserved.
引用
收藏
页码:509 / 523
页数:14
相关论文
共 37 条
[11]  
Yang J, Xia Y., A nonatomic-game approach to dynamic pricing under competition[J], Production and Operations Management, 22, 1, pp. 88-103, (2013)
[12]  
Zu C S, Hotel revenue management[M], pp. 58-62, (2016)
[13]  
Petricek M, Chalupa S, Melas D., Model of price optimization as a part of hotel revenue management — Stochastic approach, Mathematics, 9, (2021)
[14]  
Ren W J, Li X., Tourism demand analysis based on Internet big data: The case of Huairou, Beijing[J], Systems Engineering — Theory & Practice, 38, 2, pp. 437-443, (2018)
[15]  
Ladany S P., Optimal market segmentation of hotel rooms — The non-linear case[J], Omega, 24, 1, pp. 29-36, (1996)
[16]  
Bandalouski A M, Egorova N G, Kovalyov M Y, Et al., Dynamic pricing with demand disaggregation for hotel revenue management[J], Journal of Heuristics, 27, 5, pp. 869-885, (2021)
[17]  
Bernal A., Pricing in network revenue management systems with reusable resources, (2020)
[18]  
Weatherford L R, Kimes S E., A comparison of forecasting methods for hotel revenue management[J], International Journal of Forecasting, 19, 3, pp. 401-415, (2003)
[19]  
Lin S J, Chen J Y, Liao Z X, Et al., A EMD-BP integrated model to forecast tourist number and applied to Jiuzhaigou[J], Journal of Intelligent & Fuzzy Systems, 34, 2, pp. 1045-1052, (2018)
[20]  
Li X X, Lu B F, Zeng P Z, Et al., Tourism prediction using web search data based on CLSI-EMD-BP[J], Systems Engineering — Theory & Practice, 37, 1, pp. 106-118, (2017)