Reinforcement learning for dynamic pricing and capacity allocation in monetized customer wait-skipping services

被引:0
作者
Garcia, Christopher [1 ]
机构
[1] Univ Mary Washington, Coll Business, 1301 Coll Ave, Fredericksburg, VA 22401 USA
关键词
Dynamic pricing; revenue management; waiting lines; reinforcement learning; proximal policy optimization; REVENUE MANAGEMENT; TIME; ALGORITHM; STRATEGY; GREATER; MODEL;
D O I
10.1080/2573234X.2024.2424542
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider how to facilitate a dynamically-priced premium service option that enables customer parties to shorten their wait in a queue. Offering such a service requires that some of a business's capacity be reserved continuously and kept ready for premium customers. In tandem with capacity reservation, pricing must be coordinated. Hence, a joint dynamic pricing and capacity allocation problem lies at the heart of this service. We propose a conceptual solution architecture and employ Proximal Policy Optimization (PPO) for dynamic pricing and capacity allocation to maximize total revenue. Simulation experiments over multiple scenarios compared PPO against a human-engineered policy and a baseline policy having no premium option. The human-engineered policy led to significantly greater revenues than the baseline policy in each scenario, illustrating the potential increase in revenues afforded by this concept. The PPO agent substantially improved upon the human-engineered policy advantage, with improvements ranging from 28% to 161%.
引用
收藏
页码:36 / 54
页数:19
相关论文
共 63 条
  • [1] Dynamic Pricing for Differentiated PEV Charging Services Using Deep Reinforcement Learning
    Abdalrahman, Ahmed
    Zhuang, Weihua
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 1415 - 1427
  • [2] Optimal Price/Lead-Time Menus for Queues with Customer Choice: Segmentation, Pooling, and Strategic Delay
    Afeche, Philipp
    Pavlin, J. Michael
    [J]. MANAGEMENT SCIENCE, 2016, 62 (08) : 2412 - 2436
  • [3] Optuna: A Next-generation Hyperparameter Optimization Framework
    Akiba, Takuya
    Sano, Shotaro
    Yanase, Toshihiko
    Ohta, Takeru
    Koyama, Masanori
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2623 - 2631
  • [4] From concept drift to model degradation: An overview on performance-aware drift detectors
    Bayram, Firas
    Ahmed, Bestoun S.
    Kassler, Andreas
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [5] Bohn E, 2019, INT CONF UNMAN AIRCR, P523, DOI [10.1109/ICUAS.2019.8798254, 10.1109/icuas.2019.8798254]
  • [6] The efficacy of utility functions for multicriteria hospital case-mix planning
    Burdett, Robert L.
    Corry, Paul
    Yarlagadda, Prasad
    Cook, David
    Birgan, Sean
    [J]. INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2024, 31 (02) : 807 - 862
  • [7] Caneen J., 1998, J HOSPITALITY FINANC, V6, P109, DOI [10.1080/10913211.1998.10653714, DOI 10.1080/10913211.1998.10653714]
  • [8] A psychological perspective on service segmentation models: The significance of accounting for consumers' perceptions of waiting and service
    Carmon, Z
    Shanthikumar, JG
    Carmon, TF
    [J]. MANAGEMENT SCIENCE, 1995, 41 (11) : 1806 - 1815
  • [9] Spatial-temporal pricing for ride-sourcing platform with reinforcement learning
    Chen, Chuqiao
    Yao, Fugen
    Mo, Dong
    Zhu, Jiangtao
    Chen, Xiqun
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 130
  • [10] A reinforcement learning approach for hotel revenue management with evidence from field experiments
    Chen, Ji
    Xu, Yifan
    Yu, Peiwen
    Zhang, Jun
    [J]. JOURNAL OF OPERATIONS MANAGEMENT, 2023, 69 (07) : 1176 - 1201