Dynamic pricing of hotel rooms based on reinforcement learning with unknown demand distribution

被引:0
作者
Zhu H. [1 ]
Zhang M. [1 ]
Tang J. [1 ]
机构
[1] School of Management Science and Engineering, Dongbei University of Finance and Economics, Dalian
来源
Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice | 2023年 / 43卷 / 02期
基金
中国国家自然科学基金;
关键词
dynamic pricing; reinforcement learning; revenue management; SARSA(λ) algorithm;
D O I
10.12011/SETP2022-1705
中图分类号
学科分类号
摘要
Traditional hotel dynamic pricing research always considers improving demand forecasting methods or considers that the demand environment is known, while the demand distribution in real life is usually unknown. In this paper, we established a multi-period dynamic pricing model for hotel rooms based on Markov decision process with unknown demand distribution, and used the reinforcement learning method to propose improved algorithms based on SARSA(λ) to solve the dynamic pricing model of rooms. In order to improve the solving ability and convergence speed of the algorithm, we proposed the ε-SARSA(λ) algorithm based on the improved ε-greedy strategy and the ISA-SARSA(λ) algorithm based on the improved simulated annealing strategy. Through numerical experiments, the revenue optimization results of the four algorithms, SARSA(λ), ε-SARSA(λ), SA-SARSA(λ) and ISA-SARSA(λ), were compared. The study results verify the effectiveness of improved algorithms and show that the ISA-SARSA(λ) algorithm has the best solution performance. © 2023 Systems Engineering Society of China. All rights reserved.
引用
收藏
页码:509 / 523
页数:14
相关论文
共 37 条
[1]  
Zhu H., Research on sharing economy in the context of O2O[J], Systems Engineering — Theory & Practice, 41, 2, pp. 411-420, (2021)
[2]  
Zhou J H, Kuang X, Chen Z M, Et al., A demand forecasting model for air passenger traffic in China: Based on stochastic frontier analysis model and model averaging[J], Systems Engineering — Theory & Practice, 40, 11, pp. 2861-2871, (2020)
[3]  
Kong X W, Tang X Z, Wang Z M., A survey of explainable artificial intelligence decision[J], Systems Engineering — Theory & Practice, 41, 2, pp. 524-536, (2021)
[4]  
Wang X, Wang F., A review of research on dynamic pricing strategies based on reinforcement learning[J], Computer Applications and Software, 36, 12, pp. 1-6, (2019)
[5]  
Littlewood K., Special issue papers: Forecasting and control of passenger bookings[J], Journal of Revenue and Pricing Management, 4, 2, pp. 111-123, (2005)
[6]  
Donaghy K, Mcmahon U, Mcdowell D., Yield management: An overview[J], International Journal of Hospitality Management, 14, 2, pp. 139-150, (1995)
[7]  
Chen X., Hotel revenue management: Research overview and prospects[J], Journal of Management Sciences in China, 6, 6, pp. 72-78, (2003)
[8]  
Chen W H, Sun Y H, Hua Z S., Joint pricing of multiple types of rooms in hotel under customer choices[J], Journal of Management Sciences in China, 16, 7, pp. 23-33, (2013)
[9]  
Vives A, Jacob M., Dynamic pricing for online hotel demand: The case of resort hotels in Majorca[J], Journal of Vacation Marketing, 26, 2, pp. 268-283, (2020)
[10]  
Li H, Zhang Y L, Zhong W J., Dynamic pricing strategies in the presence of strategic consumer behavior risks[J], Journal of Management Sciences in China, 15, 10, pp. 11-25, (2012)