Reinforcement learning for dynamic pricing and capacity allocation in monetized customer wait-skipping services

被引：0

作者：

Garcia, Christopher ^{[1
]}

机构：

[1] Univ Mary Washington, Coll Business, 1301 Coll Ave, Fredericksburg, VA 22401 USA

来源：

JOURNAL OF BUSINESS ANALYTICS | 2025年 / 8卷 / 01期

关键词：

Dynamic pricing; revenue management; waiting lines; reinforcement learning; proximal policy optimization; REVENUE MANAGEMENT; TIME; ALGORITHM; STRATEGY; GREATER; MODEL;

D O I：

10.1080/2573234X.2024.2424542

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider how to facilitate a dynamically-priced premium service option that enables customer parties to shorten their wait in a queue. Offering such a service requires that some of a business's capacity be reserved continuously and kept ready for premium customers. In tandem with capacity reservation, pricing must be coordinated. Hence, a joint dynamic pricing and capacity allocation problem lies at the heart of this service. We propose a conceptual solution architecture and employ Proximal Policy Optimization (PPO) for dynamic pricing and capacity allocation to maximize total revenue. Simulation experiments over multiple scenarios compared PPO against a human-engineered policy and a baseline policy having no premium option. The human-engineered policy led to significantly greater revenues than the baseline policy in each scenario, illustrating the potential increase in revenues afforded by this concept. The PPO agent substantially improved upon the human-engineered policy advantage, with improvements ranging from 28% to 161%.

引用

页码：36 / 54

页数：19

共 63 条

[11] Dynamic pricing for fast charging stations with deep reinforcement learning [J].