Reinforcement learning for dynamic pricing and capacity allocation in monetized customer wait-skipping services

被引:0
作者
Garcia, Christopher [1 ]
机构
[1] Univ Mary Washington, Coll Business, 1301 Coll Ave, Fredericksburg, VA 22401 USA
关键词
Dynamic pricing; revenue management; waiting lines; reinforcement learning; proximal policy optimization; REVENUE MANAGEMENT; TIME; ALGORITHM; STRATEGY; GREATER; MODEL;
D O I
10.1080/2573234X.2024.2424542
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider how to facilitate a dynamically-priced premium service option that enables customer parties to shorten their wait in a queue. Offering such a service requires that some of a business's capacity be reserved continuously and kept ready for premium customers. In tandem with capacity reservation, pricing must be coordinated. Hence, a joint dynamic pricing and capacity allocation problem lies at the heart of this service. We propose a conceptual solution architecture and employ Proximal Policy Optimization (PPO) for dynamic pricing and capacity allocation to maximize total revenue. Simulation experiments over multiple scenarios compared PPO against a human-engineered policy and a baseline policy having no premium option. The human-engineered policy led to significantly greater revenues than the baseline policy in each scenario, illustrating the potential increase in revenues afforded by this concept. The PPO agent substantially improved upon the human-engineered policy advantage, with improvements ranging from 28% to 161%.
引用
收藏
页码:36 / 54
页数:19
相关论文
共 63 条
[21]   Dynamic pricing under competition using reinforcement learning [J].
Kastius, Alexander ;
Schlosser, Rainer .
JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2022, 21 (01) :50-63
[22]   Dynamic Pricing and Energy Consumption Scheduling With Reinforcement Learning [J].
Kim, Byung-Gook ;
Zhang, Yu ;
van der Schaar, Mihaela ;
Lee, Jang-Won .
IEEE TRANSACTIONS ON SMART GRID, 2016, 7 (05) :2187-2198
[23]  
Kimes S. E., 2000, Cornell Hotel and Restaurant Administration Quarterly, V41, P120, DOI 10.1177/001088040004100129
[24]   Online pricing of demand response based on long short-term memory and reinforcement learning [J].
Kong, Xiangyu ;
Kong, Deqian ;
Yao, Jingtao ;
Bai, Linquan ;
Xiao, Jie .
APPLIED ENERGY, 2020, 271
[25]   Reinforcement learning for pricing strategy optimization in the insurance industry [J].
Krasheninnikova, Elena ;
Garcia, Javier ;
Maestre, Roberto ;
Fernandez, Fernando .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 80 :8-19
[26]   Semi-Markov adaptive critic heuristics with application to airline revenue management [J].
Kulkarni K. ;
Gosavi A. ;
Murray S. ;
Grantham K. .
Journal of Control Theory and Applications, 2011, 9 (03) :421-430
[27]   A bounded actor-critic reinforcement learning algorithm applied to airline revenue management [J].
Lawhead, Ryan J. ;
Gosavi, Abhijit .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 82 :252-262
[29]   Dynamic pricing based electric vehicle charging station location strategy using reinforcement learning [J].
Li, Yanbin ;
Wang, Jiani ;
Wang, Weiye ;
Liu, Chang ;
Li, Yun .
ENERGY, 2023, 281
[30]   Wait Time-Based Pricing for Queues with Customer-Chosen Service Times [J].
Lin, Chen-An ;
Shang, Kevin ;
Sun, Peng .
MANAGEMENT SCIENCE, 2023, 69 (04) :2127-2146