Reinforcement learning for dynamic pricing and capacity allocation in monetized customer wait-skipping services

Cited by: 0
Author
Garcia, Christopher [1]
Affiliation
[1] Univ Mary Washington, Coll Business, 1301 Coll Ave, Fredericksburg, VA 22401 USA
Keywords
Dynamic pricing; revenue management; waiting lines; reinforcement learning; proximal policy optimization; REVENUE MANAGEMENT; TIME; ALGORITHM; STRATEGY; GREATER; MODEL;
DOI
10.1080/2573234X.2024.2424542
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider how to facilitate a dynamically priced premium service option that enables customer parties to shorten their wait in a queue. Offering such a service requires that some of a business's capacity be reserved continuously and kept ready for premium customers. In tandem with capacity reservation, pricing must be coordinated. Hence, a joint dynamic pricing and capacity allocation problem lies at the heart of this service. We propose a conceptual solution architecture and employ Proximal Policy Optimization (PPO) for dynamic pricing and capacity allocation to maximize total revenue. Simulation experiments over multiple scenarios compared PPO against a human-engineered policy and a baseline policy having no premium option. The human-engineered policy led to significantly greater revenues than the baseline policy in each scenario, illustrating the potential increase in revenues afforded by this concept. The PPO agent substantially improved upon the human-engineered policy's advantage, with improvements ranging from 28% to 161%.
Pages: 36-54
Number of pages: 19